llvm-project

Commit Graph

Author	SHA1	Message	Date
Aaron Puchert	c1a31ee65b	[PPCISelLowering] Avoid emitting calls to __multi3, __muloti4 After D108936, @llvm.smul.with.overflow.i64 was lowered to __multi3 instead of __mulodi4, which also doesn't exist on PowerPC 32-bit, not even with compiler-rt. Block it as well so that we get inline code. Because libgcc doesn't have __muloti4, we block that as well. Fixes #54460. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D122090	2022-03-20 20:59:30 +01:00
Kazu Hirata	bce1bf0ee2	[Transform] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)	2022-03-20 10:41:22 -07:00
Mark de Wever	3b2e605e33	[libc++][test][NFC] Remove libcpp-no-concepts. This is no longer needed. Reviewed By: #libc, philnik Differential Revision: https://reviews.llvm.org/D122099	2022-03-20 15:39:26 +01:00
Chen Zheng	973b02b6f1	[PowerPC][NFC] use right hardware loop intrinsics in test case	2022-03-20 10:00:57 -04:00
esmeyi	de20a3b677	[XCOFF] support XCOFFObjectWriter for fileHeader and sectionHeaders in 64-bit XCOFF. This is the first patch to enable the XCOFF64 object writer. Currently only fileHeader and sectionHeaders are supported. Reviewed By: jhenderson, DiggerLin Differential Revision: https://reviews.llvm.org/D120861	2022-03-20 09:31:29 -04:00
Michel Weber	9d6a6fbbbd	[MLIR][Presburger] remove redundant constraints in coalesce This patch improves the representation size of individual `IntegerRelation`s by calling the function `IntegerRelation::removeRedundantConstraints`. While this is only a slight optimization in the current version, it will be necessary for patches to come. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D121989	2022-03-20 13:00:12 +00:00
Luo, Yuanke	10bb623192	enable binop identity constant folds for add Differential Revision: https://reviews.llvm.org/D119654	2022-03-20 19:07:16 +08:00
Shengchen Kan	51e6059c12	[X86] Simplify function isDataInvariant by using X86MnemonicTables This is not a NFC change b/c we add more instructions like IMUL16/32/64r, MOV16ao16 and MOV16rr_REV etc to the list. But I think it's reasonable. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D122063	2022-03-20 18:38:58 +08:00
Florian Hahn	973183612e	[VPlan] Add test for VPExpandSCEVRecipe printing.	2022-03-20 10:11:40 +00:00
Simon Pilgrim	06fa67dc0a	[X86] Add test add with bit0 extraction and improve comments Based on feedback from D122084	2022-03-20 09:31:52 +00:00
Simon Pilgrim	1ae3c4e948	[X86] combineAddOrSubToADCOrSBB - split to more cleanly handle commuted variants. Split combineAddOrSubToADCOrSBB into wrapper (which handles ADDs with commuted args) and the real combine, which no longer has to account for commutation. I'm intending to extend combineAddOrSubToADCOrSBB to detect patterns other than just X86ISD::SETCC, so we need to detect all patterns without detecting them as part of a commutation swap.	2022-03-20 09:14:21 +00:00
Shengchen Kan	e58dadf3e2	[X86][NFC] Generate fields and getters for subtarget features Non-duplicated comments are moved from X86Subtarget.h to X86.td. This is a follow-up patch for D120906.	2022-03-20 15:27:21 +08:00
Alisamar Husain	8271220a99	[trace][intelpt] Instruction count in trace info Added a line to `thread trace dump info` results which shows total number of instructions executed until now. Differential Revision: https://reviews.llvm.org/D122076	2022-03-20 11:28:16 +05:30
Shengchen Kan	ae0ae91903	[X86][NFC] Remove unused variable UseAA	2022-03-20 13:21:25 +08:00
Shengchen Kan	c266776429	[X86][NFC] Remove unused feature UseAA	2022-03-20 13:14:13 +08:00
Shengchen Kan	076a9dc99a	[X86][NFC] Rename hasCMOV() to canUseCMOV(), hasLAHFSAHF() to canUseLAHFSAHF() To make them less like other feature functions. This is a follow-up patch for D121978.	2022-03-20 12:00:25 +08:00
Craig Topper	4eb59f0179	[SelectionDAG][RISCV] Make RegsForValue::getCopyToRegs explicitly zero_extend constants. ComputePHILiveOutRegInfo assumes that constant incoming values to Phis will be zero extended if they aren't a legal type. To guarantee that we should zero_extend rather than any_extend constants. This fixes a bug for RISCV where any_extend of constants can be treated as a sign_extend. Differential Revision: https://reviews.llvm.org/D122053	2022-03-19 18:43:14 -07:00
Craig Topper	268371cf7b	[RISCV] Add test case for miscompile caused by treating ANY_EXTEND of constants as SIGN_EXTEND. The code that inserts AssertZExt based on predecessor information assumes constants are zero extended for phi incoming values this allows AssertZExt to be created in blocks consuming a Phi. SelectionDAG::getNode treats any_extend of i32 constants as sext for RISCV. The code that creates phi incoming values in the predecessors creates an any_extend for the constants which then gets treated as a sext by getNode. This makes the AssertZExt incorrect and can cause zexts to be incorrectly removed. This bug was introduced by D105918 Differential Revision: https://reviews.llvm.org/D122052	2022-03-19 18:43:14 -07:00
Philip Reames	983ed87c61	[slp,tests] Consolidate two test files into one There are some slight changes to the test lines due to different cost threshold choices in the two command lines, but I don't believe these to be interesting the purpose of the tests.	2022-03-19 18:24:35 -07:00
Jacques Pienaar	374208ea15	[bazel][mlir] Add MLIR PDLL LSP server target Add targets for PDLL LSP server.	2022-03-19 18:08:36 -07:00
Philip Reames	89ab020d02	[tests, SLP] Add coverage for missing dependencies for stacksave intrinsics The existing scheduling doesn't account for the scheduling restrictions implied by inalloca allocas combined with stacksave/stackrestore. This adds coverage including one currently miscompiling case.	2022-03-19 18:05:09 -07:00
Jon Chesterfield	bcbd4cf1f2	Revert "[amdgpu][nfc] Pass function instead of module to allocateModuleLDSGlobal" Reconsidered, better to handle per-function state in the constructor as before. This reverts commit `98e474c1b3`.	2022-03-20 00:58:26 +00:00
Will Dietz	02db3cfe7d	mlir: set CMAKE_INCLUDE_CURRENT_DIR to fix out-of-tree builds This option tells CMake to add current source and binary directories to the include path for each directory[1]. Required include directories from build tree (for generated files) were previously added in `mlir_tablegen` but this was changed in `03078ec20b` . These are still needed, however, for out-of-tree builds that don't build as part of LLVM (via LLVM_ENABLE_PROJECTS). Building as part of LLVM works regardless, AFAICT, because LLVM sets this option and so the MLIR build inherits it. FWIW, various other (in-tree) LLVM projects set this as well. And of course this fixes the out-of-tree mlir-by-itself build scenario I'm using. [1] https://cmake.org/cmake/help/latest/variable/CMAKE_INCLUDE_CURRENT_DIR.html Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D122088	2022-03-19 18:22:09 -05:00
River Riddle	3807583b8f	[mlir:PDLL][NFC] Remove a dead comment about constant params These were removed, and the FIXME is no longer relevant.	2022-03-19 13:38:27 -07:00
Philip Reames	6253b77da9	[SLP] Respect control dependence within a block during scheduling This fixes an active miscompile visible in the test changes. The basic problem is that the scheduling dependency graph didn't have any edges for control dependence within a single basic block. The result is that we could (and in some rare cases did) perform reorderings within a block which could introduce new undefined behavior along paths which didn't previously contain any. Impact wise, we have two major cases where control is not guaranteed to reach a later instruction in the block: may throw calls, and calls containing infinite loops. * The former case was mostly covered by the memory dependencies, and to trigger require a function which can throw, but not write to memory. In theory, such a case is possible, but not likely in practice. * The later case is likely more of an issue in practice. After this code was first written, we changed the IR semantics to allow well defined infinite loops without satisifying mustprogress. Even for C/C++ - which do imply mustprogress - recent changes to how we treat atomics (e.g. an atomic read does not always imply a write) could expose this issue. I'm a bit shocked we don't seem to have a bug report which hit this in real code actually. Compile time wise, this results in a single extra scan of the scheduling window in the common case. Since we stop scanning at the next instruction which isn't guaranteed to execute, no matter what order we traverse instructions in, we scan the block once. The exception to this is that when we extend the scheduling window downwards, we invalidate all dependencies, and thus rescan. So the potentially expensive case is when we a call in a big schedule window which is frequently extended. We could optimize this case (by caching the last instruction not guaranteeed to transfer execution and scanning only the extended window) and starting there), but I decided to leave the complexity until it mattered. That same case is already degenerate with memory dependences which is more expensive than the control dependence scan. We could also consider combining the memory dependence and control dependence sets to reduce memory usage, but since it complicates the code slightly and makes debugging a bit harder, I went with the simplest scheme for now. This was noticed while trying to understand the failures reported against D118538, but is not otherwise related to that change.	2022-03-19 13:36:24 -07:00
Tal Kedar	129311ac0b	[libSupport] make CallBacksToRun static local In order to allow compiling with -Werror=global-constructors with c++20 and above. Discussion: https://discourse.llvm.org/t/llvm-lib-support-signals-cpp-fails-to-compile-due-to-werror-global-constructors/61070 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D122067	2022-03-19 20:31:04 +00:00
River Riddle	9595f3568a	[mlir:PDL] Remove the ConstantParams support from native Constraints/Rewrites This support has never really worked well, and is incredibly clunky to use (it effectively creates two argument APIs), and clunky to generate (it isn't clear how we should actually expose this from PDL frontends). Treating these as just attribute arguments is much much cleaner in every aspect of the stack. If we need to optimize lots of constant parameters, it would be better to investigate internal representation optimizations (e.g. batch attribute creation), that do not affect the user (we want a clean external API). Differential Revision: https://reviews.llvm.org/D121569	2022-03-19 13:28:24 -07:00
River Riddle	469c58944d	[mlir][PDLL] Add signature help to the PDLL language server This commit adds signature support to the language server, and initially supports providing help for: operation operands and results, and constraint/rewrite calls. Differential Revision: https://reviews.llvm.org/D121545	2022-03-19 13:28:24 -07:00
River Riddle	008de486f7	[mlir][PDLL] Add code completion to the PDLL language server This commit adds code completion support to the language server, and initially supports providing completions for: Member access, attributes/constraint/dialect/operation names, and pattern metadata. Differential Revision: https://reviews.llvm.org/D121544	2022-03-19 13:28:24 -07:00
River Riddle	8dd4272ca2	[mlir][PDLL] Add symbol support to the PDLL language server This adds support for documenting the top-level "symbols", e.g. patterns, constraints, rewrites, etc., within a PDLL file. Differential Revision: https://reviews.llvm.org/D121543	2022-03-19 13:28:23 -07:00
River Riddle	41ae211458	[mlir][PDLL] Add hover support to the PDLL language server This adds support for providing information when hovering over operation names, variables, patters, constraints, and rewrites. Differential Revision: https://reviews.llvm.org/D121542	2022-03-19 13:28:23 -07:00
River Riddle	52b34df9d6	[mlir][PDLL] Add an initial language server for PDLL This commits adds a basic language server for PDLL to enable providing language features in IDEs such as VSCode. This initial commit only adds support for tracking definitions, references, and diagnostics, but followup commits will build upon this to provide more significant behavior. In addition to the server, this commit also updates mlir-vscode to support the PDLL language and invoke the server. Differential Revision: https://reviews.llvm.org/D121541	2022-03-19 13:28:23 -07:00
Florian Hahn	1a820ff039	[LV] Remove unnecessary uses of Loop* (NFC). Update functions that previously took a loop pointer but only to get the pre-header. Instead, pass the block directly. This removes the requirement for the loop object to be created up-front.	2022-03-19 20:18:47 +00:00
Craig Topper	57b41af838	[X86] Rename FeatureCMPXCHG8B/FeatureCMPXCHG16B to FeatureCX8/CX16 to match CPUID. Rename hasCMPXCHG16B() to canUseCMPXCHG16B() to make it less like other feature functions. Add a similar canUseCMPXCHG8B() that aliases hasCX8() to keep similar naming. Differential Revision: https://reviews.llvm.org/D121978	2022-03-19 12:34:06 -07:00
Simon Pilgrim	b929db5968	[X86] Add some initial test coverage for PR35908 add/sub + bittest patterns	2022-03-19 19:20:19 +00:00
Johannes Doerfert	4166738c38	[OpenMP][FIX] Do not crash when kernels are debug wrapper functions With debug information enabled (-g) Clang will wrap the actual target region into a new function which is called from the "kernel". The problem is that the "kernel" is now basically a wrapper without all the things we expect. More importantly, if we end up asking for an AAKernelInfo for the "target region function" we might try to turn it into SPMD mode. That used to cause an assertion as that function doesn't have an appropriately named `_exec_mode` global. While the global is going away soon we still need to make sure to properly handle this case, e.g., perform optimizations reliably. Differential Revision: https://reviews.llvm.org/D122043	2022-03-19 14:15:55 -05:00
Itay Bookstein	d155c7da51	[docs] Fix a couple of typos Signed-off-by: Itay Bookstein <ibookstein@gmail.com>	2022-03-19 20:24:38 +02:00
Simon Pilgrim	34110a7320	[X86] combineAddOrSubToADCOrSBB - pull out repeated Y.getOperand(1) calls. NFC.	2022-03-19 17:56:11 +00:00
Nikolas Klauser	85e9b2687a	[libc++] Prepare string tests for constexpr These are the last™ changes to the tests for constexpr preparation. Reviewed By: Quuxplusone, #libc, Mordante Spies: Mordante, EricWF, libcxx-commits Differential Revision: https://reviews.llvm.org/D120951	2022-03-19 18:48:14 +01:00
Alisamar Husain	1bcc28b884	[docs] Fixed minor ordering issue Differential Revision: https://reviews.llvm.org/D122073	2022-03-19 22:23:42 +05:30
Simon Pilgrim	b90478d422	[X86] createShuffleMaskFromVSELECT - handle BLENDV constant masks as well as VSELECT constant masks Handle constant masks for both vselect nodes (mask != 0) and blendv nodes (mask < 0)	2022-03-19 16:51:07 +00:00
Philip Reames	bdbcca617a	[SLP,tests] Add coverage showing need for control dependencies during scheduling	2022-03-19 09:45:33 -07:00
Jon Chesterfield	98e474c1b3	[amdgpu][nfc] Pass function instead of module to allocateModuleLDSGlobal	2022-03-19 16:42:17 +00:00
Simon Pilgrim	a6c18bfbe3	[X86] combineSelect - don't constant fold BLENDV nodes like VSELECT If a X86ISD::BLENDV op appears before legalization (in this test case due to the icmp_slt x, 0) its constant mask was being treated as a vselect mask (mask != 0) instead of blendv (mask < 0) This just prevents constant folding entirely for non-VSELECT ops.	2022-03-19 16:31:19 +00:00
Simon Pilgrim	33d2c00814	[X86] Add test showing a bug where a BLENDV mask is being constant folded as VSELECT mask combineSelect doesn't expect X86ISD::BLENDV ops to appear before legalization and is treating the constant mask as a vselect mask (mask != 0) instead of blendv (mask < 0)	2022-03-19 16:31:19 +00:00
Florian Hahn	d5fbcf76fd	[VPlan] Improve pattern in vplan-printing.ll check line. The existing pattern only matched a single value, which breaks if the numbering slightly changes.	2022-03-19 16:03:25 +00:00
Simon Pilgrim	2dacd0d9c3	[X86] Update remaining AVX512 VBMI2 VL intrinsic tests to avoid adds As noticed in D119654, by adding the masked intrinsics results together we can end up with the selects being canonicalized away from the intrinsic - this isn't what we want to test here so replace with a insertvalue chain into a aggregate instead to retain all the results.	2022-03-19 15:41:25 +00:00
Simon Pilgrim	56ad791f46	[X86] LowerAndToBT - fold BT(NOT(X),Y) -> BT(X,Y) and flip the CondCode	2022-03-19 14:03:03 +00:00
Simon Pilgrim	c7ba5a9aff	[X86][SSE] Add initial support for extracting non-constant bool vector elements We can use MOVMSK+TEST/BT to extract individual bool elements even if the index isn't constant This relies on combineBitcastvxi1 so some AVX512 cases still aren't optimized as they avoid MOVMSK usage.	2022-03-19 13:31:05 +00:00
Simon Pilgrim	abb9cbb22e	[X86][SSE] Add tests for non-constant bool vector extractions We should be able to perform this with MOVMSK+TEST/BT instead of spilling to stack	2022-03-19 13:25:21 +00:00

1 2 3 4 5 ...

418524 Commits All Branches Search

418524 Commits

All Branches