llvm-project

Commit Graph

Author	SHA1	Message	Date
John Ericson	34fe6ddce1	Revert "[CMake] Avoid `LLVM_BINARY_DIR` when other more specific variable are better-suited" This reverts commit `ad8c34bc30`.	2022-08-25 11:13:46 -04:00
Eli Friedman	2c29268bfc	Exclude check-polly-unittests and check-polly-isl from check-all The unittests are already included in check-polly, so check-all was running them twice. Running them twice causes a race on the output files, which led to intermittent failures on the reverse-iteration buildbot.	2022-08-24 12:55:45 -07:00
John Ericson	ad8c34bc30	[CMake] Avoid `LLVM_BINARY_DIR` when other more specific variable are better-suited A simple sed doing these substitutions: - `${LLVM_BINARY_DIR}/(\$\{CMAKE_CFG_INTDIR}/)?lib(${LLVM_LIBDIR_SUFFIX})?\>` -> `${LLVM_LIBRARY_DIR}` - `${LLVM_BINARY_DIR}/(\$\{CMAKE_CFG_INTDIR}/)?bin\>` -> `${LLVM_TOOLS_BINARY_DIR}` where `\>` means "word boundary". The only manual modifications were reverting changes in - `compiler-rt/cmake/Modules/CompilerRTUtils.cmake - `runtimes/CMakeLists.txt` because these were "entry points" where we wanted to tread carefully not not introduce a "loop" which would end with an undefined variable being expanded to nothing. This hopefully increases readability overall, and also decreases the usages of `LLVM_LIBDIR_SUFFIX`, preparing us for D130586. Reviewed By: sebastian-ne Differential Revision: https://reviews.llvm.org/D132316	2022-08-24 10:14:05 -04:00
John Ericson	e941b031d3	Revert "[cmake] Use `CMAKE_INSTALL_LIBDIR` too" This reverts commit `f7a33090a9`. Unfortunately this causes a number of failures that didn't show up in my local build.	2022-08-18 22:46:32 -04:00
John Ericson	f7a33090a9	[cmake] Use `CMAKE_INSTALL_LIBDIR` too We held off on this before as `LLVM_LIBDIR_SUFFIX` conflicted with it. Now we return this. `LLVM_LIBDIR_SUFFIX` is kept as a deprecated way to set `CMAKE_INSTALL_LIBDIR`. The other `*_LIBDIR_SUFFIX` are just removed entirely. I imagine this is too potentially-breaking to make LLVM 15. That's fine. I have a more minimal version of this in the disto (NixOS) patches for LLVM 15 (like previous versions). This more expansive version I will test harder after the release is cut. Reviewed By: sebastian-ne, ldionne, #libc, #libc_abi Differential Revision: https://reviews.llvm.org/D130586	2022-08-18 15:33:35 -04:00
Vitaly Buka	3f5f2905c4	[test] Propagate HWASAN_OPTIONS	2022-08-17 18:59:49 -07:00
Roman Gareev	a5d981045d	[Polly] Remove the test case that depends on InstCombine and DeLICM.	2022-08-14 12:51:57 +03:00
Gabriel Ravier	ea540bc210	[polly] Fixed a number of typos. NFC I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Reviewed By: inclyc Differential Revision: https://reviews.llvm.org/D131167	2022-08-07 22:56:07 +08:00
Roman Gareev	e8c9eb49ea	[Polly] Suppress the LLVM-IR output for pattern matching tests, if there is no FileCheck-ing for it.	2022-08-07 14:56:26 +03:00
Roman Gareev	b02c7e2b63	[Polly] Generalize the pattern matching to the case of tensor contractions The pattern matching optimization of Polly detects and optimizes dense general matrix-matrix multiplication. The generated code is close to high performance implementations of matrix-matrix multiplications, which are contained in manually tuned libraries. The described pattern matching optimization is a particular case of tensor contraction optimization, which was introduced in [1]. This patch generalizes the pattern matching to the case of tensor contractions using the form of data dependencies and memory accesses produced by tensor contractions [1]. Optimization of tensor contractions will be added in the next patch. Following the ideas introduced in [2], it will logically represent tensor contraction operands as matrix multiplication operands and use an approach for optimization of matrix-matrix multiplications. [1] - Gareev R., Grosser T., Kruse M. High-Performance Generalized Tensor Operations: A Compiler-Oriented Approach // ACM Transactions on Architecture and Code Optimization (TACO). 2018. Vol. 15, no. 3. P. 34:1–34:27. DOI: 10.1145/3235029. [2] - Matthews D. High-Performance Tensor Contraction without BLAS // SIAM Journal on Scientific Computing. 2018. Vol. 40, no. 1. P. C 1—C 24. DOI: 110.1137/16m108968x. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D114336	2022-08-07 13:10:32 +03:00
Michael Kruse	fe0e5b3e43	[Polly] Insert !dbg metadata for emitted CallInsts. The IR Verifier requires that every call instruction to an inlineable function (among other things, its implementation must be visible in the translation unit) must also have !dbg metadata attached to it. When parallelizing, Polly emits calls to OpenMP runtime function out of thin air, or at least not directly derived from a bounded list of previous instruction. While we could search for instructions in the SCoP that has some debug info attached to it, there is no guarantee that we find any. Our solution is to generate a new DILocation that points to line 0 to represent optimized code. The OpenMP function implementation is usually not available in the user's translation unit, but can become visible in an LTO build. For the bug to appear, libomp must also be built with debug symbols. IMHO, the IR verifier rule is too strict. Runtime functions can also be inserted by other optimization passes, such as LoopIdiomRecognize. When inserting a call to e.g. memset, it uses the DebugLoc from a StoreInst from the unoptimized code. It is not required to have !dbg metadata attached either. Fixes #56692	2022-07-26 19:43:53 -05:00
Nikita Popov	2a721374ae	[IR] Don't use blockaddresses as callbr arguments Following some recent discussions, this changes the representation of callbrs in IR. The current blockaddress arguments are replaced with `!` label constraints that refer directly to callbr indirect destinations: ; Before: %res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo)) to label %asm.fallthrough [label %foo] ; After: %res = callbr i8* asm "", "=r,r,!i"(i8* %x) to label %asm.fallthrough [label %foo] The benefit of this is that we can easily update the successors of a callbr, without having to worry about also updating blockaddress references. This should allow us to remove some limitations: * Allow unrolling/peeling/rotation of callbr, or any other clone-based optimizations (https://github.com/llvm/llvm-project/issues/41834) * Allow duplicate successors (https://github.com/llvm/llvm-project/issues/45248) This is just the IR representation change though, I will follow up with patches to remove limtations in various transformation passes that are no longer needed. Differential Revision: https://reviews.llvm.org/D129288	2022-07-15 10:18:17 +02:00
Michael Kruse	6fa65f8a98	[Polly][MatMul] Abandon dependence analysis. The copy statements inserted by the matrix-multiplication optimization introduce new dependencies between the copy statements and other statements. As a result, the DependenceInfo must be recomputed. Not recomputing them caused IslAstInfo to deduce that some loops are parallel but cause race conditions when accessing the packed arrays. As a result, matrix-matrix multiplication currently cannot be parallelized. Also see discussion at https://reviews.llvm.org/D125202	2022-06-29 17:20:05 -05:00
Nikita Popov	41d5033eb1	[IR] Enable opaque pointers by default This enabled opaque pointers by default in LLVM. The effect of this is twofold: * If IR that contains neither explicit ptr nor %T* types is passed to tools, we will now use opaque pointer mode, unless -opaque-pointers=0 has been explicitly passed. * Users of LLVM as a library will now default to opaque pointers. It is possible to opt-out by calling setOpaquePointers(false) on LLVMContext. A cmake option to toggle this default will not be provided. Frontends or other tools that want to (temporarily) keep using typed pointers should disable opaque pointers via LLVMContext. Differential Revision: https://reviews.llvm.org/D126689	2022-06-02 09:40:56 +02:00
Yang Keao	02f640672e	[Polly] Migrate -polly-mse to the new pass manager. This patch implements the `MaximalStaticExpansion` and its printer in NPM. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D125870	2022-06-01 13:37:58 -05:00
Nikita Popov	03aceab08b	[ValueTracking] Enable -branch-on-poison-as-ub by default Now that SimpleLoopUnswitch and other transforms no longer introduce branch on poison, enable the -branch-on-poison-as-ub option by default. The practical impact of this is mostly better flag preservation in SCEV, and some freeze instructions no longer being necessary. Differential Revision: https://reviews.llvm.org/D125299	2022-06-01 10:46:06 +02:00
Michael Kruse	cc871cf6b5	[Polly][Test] Fix race condition while printing dot files. The tests dot-scops.ll and dot-scops-npm.ll both wrote to the same file scops.func.dot. If they are executed in parallel they will race for the file. Fix by renaming func to func_npm in dot-scops-npm.ll so this test writes dot scops.func_npm.dot. Long-term, we will probably pass a file name (prefix) to the printer pass such that we can use the guaranteed-unique LIT %t placeholder in tests.	2022-05-26 15:58:53 -05:00
Ivan Kosarev	8894c05b0d	[FileCheck] GetCheckTypeAbbreviation() to handle the misspelled case. Also fix directives not covered by D125604.	2022-05-26 12:20:15 +01:00
Ivan Kosarev	ad1d60c3be	[FileCheck] Catch missspelled directives. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D125604	2022-05-26 11:37:19 +01:00
Vitaly Buka	d33c36235d	[lit] Fix setup of sanitizer environment Not all options were propageted into tests. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D122869	2022-05-19 19:24:16 -07:00
Michael Kruse	e61baceedb	[polly] Load NPM pass plugin for NPM test. This fixes the polly-*-plugin buildbots.	2022-05-09 16:10:01 -05:00
Michael Kruse	6b3b87376b	[polly] migrate -polly-show to the new pass manager Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D123678	2022-05-09 14:04:29 -05:00
Michael Kruse	809ca66eac	[Polly] Fix test after D119669.	2022-05-01 13:32:42 -05:00
Arthur Eubanks	caf6af2ed7	[polly] Remove last instances of -analyze As mentioned in D120782, the loop block order can be different depending on if LoopInfo is incrementally updated or freshly computed. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D122195	2022-03-24 09:47:43 -07:00
Michael Kruse	12ac339e9e	[polly] Fix NPM unittests after D121566.	2022-03-18 14:25:44 -05:00
Wael Yehia	c80198b3d3	Reland "Load pass plugins during option processing, so that plugin options are registered and live." Fix Polly failures. Reviewed By: mehdi_amini, Meinersbur Differential Revision: https://reviews.llvm.org/D121566	2022-03-18 03:27:53 +00:00
Sam McCall	75acad41bc	Use lit_config.substitute instead of foo % lit_config.params everywhere This mechanically applies the same changes from D121427 everywhere. Differential Revision: https://reviews.llvm.org/D121746	2022-03-16 09:57:41 +01:00
Michael Kruse	5c02808131	[polly] Introduce -polly-print-* passes to replace -analyze. The `opt -analyze` option only works with the legacy pass manager and might be removed in the future, as explained in llvm.org/PR53733. This patch introduced -polly-print-* passes that print what the pass would print with the `-analyze` option and replaces all uses of `-analyze` in the regression tests. There are two exceptions: `CodeGen\single_loop_param_less_equal.ll` and `CodeGen\loop_with_condition_nested.ll` use `-analyze on the `-loops` pass which is not part of Polly. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120782	2022-03-14 10:27:15 -05:00
Petr Hosek	0c0f6cfb7b	[CMake] Rename TARGET_TRIPLE to LLVM_TARGET_TRIPLE This clarifies that this is an LLVM specific variable and avoids potential conflicts with other projects. Differential Revision: https://reviews.llvm.org/D119918	2022-03-11 15:43:01 -08:00
Arthur Eubanks	30f1cef86b	Revert "[polly] Fix regression test after D110620." This reverts commit `2aa624a94f`. D110620 was reverted.	2022-03-04 20:37:15 -08:00
Michael Kruse	d7851685a3	[polly] Remove trailing whitespace from tests. NFC.	2022-02-22 15:41:13 -06:00
Michael Kruse	2aa624a94f	[polly] Fix regression test after D110620.	2022-02-17 09:47:19 -06:00
Roman Lebedev	4d0c0e6cc2	[SCEV] `createNodeForSelectOrPHIInstWithICmpInstCond()`: generalize eq handling The current logic was: https://alive2.llvm.org/ce/z/j8muXk but in reality the offset to the Y in the 'true' hand does not need to exist: https://alive2.llvm.org/ce/z/MNQ7DZ https://alive2.llvm.org/ce/z/S2pMQD To catch that, instead of computing the Y's in both hands and checking their equality, compute Y and C, and check that C is 0 or 1.	2022-02-11 21:58:19 +03:00
Florian Hahn	782c0dd1a1	[IRBuilder] Migrate and-folding to value-based FoldAnd. Similar to the migration of or-folding to FoldOr, there are a few cases where the fold in IRBuilder::CreateAnd triggered directly. Those have been updated. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D117431	2022-01-20 10:22:21 +00:00
Michael Kruse	937b00ab2c	[Polly][SchedOpt] Account for prevectorization of multiple statements. A prevectorized loop may contain multiple statements, in which case isl_schedule_node_band_sink will sink the vector band to multiple leaves. Instead of statically assuming a specific tree structure after sinking, add a SIMD marker to all inner bands. Fixes llvm.org/PR52637	2021-12-23 14:06:41 -06:00
Michael Kruse	19db33c06e	[Polly] Remove support for code generated by gfortran+DragonEgg. DragonEgg is not maintained anymore, hence there is no need for this functionality. Fixes llvm.org/PR52173	2021-10-14 14:12:06 -05:00
Michael Kruse	203c7fab73	[Polly] Fix test case fixing the colon. Commit `573531fb1f` fixed the colon at the end of a CHECK line (was a semicolon by mistake). With the check enabled, it turned out that it was failing. Check for the correct content. Also add the missing colon to the next CHECK line.	2021-10-08 22:46:55 -05:00
Qiu Chaofan	573531fb1f	Fix typo of colon to semicolon in lit tests	2021-10-09 10:03:50 +08:00
Michael Kruse	64489255be	[Polly] Add greedy fusion algorithm. When the option -polly-loopfusion-greedy is set, the ScheduleOptimizer tries to aggressively fuse any band it can and does not violate any dependences. As part if the implementation, the functionalty for copying a band into an new schedule was extracted out of the ScheduleTreeRewriter.	2021-10-08 20:33:30 -05:00
Michael Kruse	cb879d00d8	[Polly] Completely remove -polly-opt-fusion. This was missing from `07e7cb9433`. The switch did nothing since then.	2021-10-08 02:10:34 -05:00
Philip Reames	d02db32644	[SCEV] Use full logic when infering flags on add and gep This is a followon to D109845. With that landed, we will have fixed all known instances of pr51817, and can thus start inferring flags more aggressively with greatly reduced risk of miscompiles. This patch simply applies the same inference logic used in that patch to our other major flag inference path. We can still do much better here (on both paths), but this is our first step. Differential Revision: https://reviews.llvm.org/D111003	2021-10-03 15:32:15 -07:00
Philip Reames	2ca8a3f213	[SCEV] Stop blindly propagating flags from inbound geps to SCEV nodes This fixes a violation of the wrap flag rules introduced in `c4048d8f`. This was also noted in the (very old) PR23527. The issue being fixed is that we assume the inbound flag on any GEP assumes that all users of any gep (or add) which happens to map to that SCEV would also be UB if the (other) gep overflowed. That's simply not true. In terms of the test diffs, I don't see anything seriously problematic. The lost flags are expected (given the semantic restriction on when its legal to tag the SCEV), and there are several cases where the previously inferred flags are unsound per the new semantics. The only common trend I noticed when looking at the deltas is that by not considering branch on poison as immediate UB in ValueTracking, we do miss a few cases we could reclaim. We may be able to claw some of these back with the follow ideas mentioned in PR51817. It's worth noting that most of the changes are analysis result only changes. The two transform changes are pretty minimal. In one case, we miss the opportunity to infer a nuw (correctly). In the other, we fail to fold an exit and produce a loop invariant form instead. This one is probably over-reduced as the program appears to be undefined in practice, and neither before or after exploits that. Differential Revision: https://reviews.llvm.org/D109789	2021-10-01 16:30:44 -07:00
Roman Gareev	113fa82c3c	[Polly] Check the properties of accesses to operands of a matrix-matrix multiplication The following code modifies elements of the array D. for (i = 0; i < _PB_NI; i++) for (j = 0; j < _PB_NJ; j++) { for (k = 0; k < _PB_NK; k++) { double Mul = A[i][k] * B[k][j]; D[i][j][k] += Mul; C[i][j] += Mul; } } Nevertheless, the code is recognised as a matrix-matrix multiplication, since the second and third dimensions of D are accessed with non-zero strides. This fixes the typo, which was made during the translation to C++ bindings (https://reviews.llvm.org/D35845). Reviewed By: Michael Kruse <llvm@meinersbur.de> Differential Revision: https://reviews.llvm.org/D110491	2021-09-28 22:58:57 +05:00
Michael Kruse	027c036663	[Polly] Reject regions entered by an indirectbr/callbr. SplitBlockPredecessors is unable to insert an additional BasicBlock between an indirectbr/callbr terminator and the successor blocks. This is needed by Polly to normalize the control flow before emitting its optimzed code. This patches rejects regions entered by an indirectbr/callbr to not fail later at code generation. This fixes llvm.org/PR51964 Recommit with "REQUIRES: asserts" in test that uses statistics.	2021-09-27 18:49:11 -05:00
Haowei Wu	283ed7de32	Revert "[Polly] Reject reject regions entered by an indirectbr/callbr." This reverts commit `91f46bb77e` which causes test failures when assertions are off.	2021-09-27 16:05:33 -07:00
Michael Kruse	91f46bb77e	[Polly] Reject reject regions entered by an indirectbr/callbr. SplitBlockPredecessors is unable to insert an additional BasicBlock between an indirectbr/callbr terminator and the successor blocks. This is needed by Polly to normalize the control flow before emitting its optimzed code. This patches rejects regions entered by an indirectbr/callbr to not fail later at code generation. This fixes llvm.org/PR51964	2021-09-26 21:21:50 -05:00
Michael Kruse	9820dd970c	[Polly] Support for InlineAsm. Inline assembly was not handled at all and treated like a llvm::Value. In particular, it tried to create a pointer it which is not allowed. Fix by handling like a llvm::Constant such that it is just reused when required, instead of trying to marshall it in memory. Fixes llvm.org/PR51960	2021-09-26 03:26:43 -05:00
Michael Kruse	d5c87162db	[Polly] Use VirtualUse to determine references. VirtualUse ensures consistency over different source of values with Polly. In particular, this enables its use of instructions moved between Statement. Before the patch, the code wrongly assumed that the BB's instructions are also the ScopStmt's instructions. Reference are determined for OpenMP outlining and GPGPU kernel extraction. GPGPU CodeGen had some problems. For one, it generated GPU kernel parameters for constants. Second, it emitted GPU-side invariant loads which have already been loaded by the host. This has been partially fixed, it still generates a store for the invariant load result, but using the value that the host has already written. WARNING: I did not test the generated PollyACC code on an actual GPU. The improved consistency will be made use of in the next patch.	2021-09-26 03:26:43 -05:00
Michael Kruse	1cea25eec9	[Polly] Remove isConstCall. The function was intended to catch OpenMP functions such as get_thread_id(). If matched, the call would be considered synthesizable. There were a few problems with this: * get_thread_id() is not 'const' in the sense of have the gcc manual defines it: "do not examine any values except their arguments". get_thread_id() reads OpenCL runtime libreary global state. What was inteded was probably 'speculable'. * isConstCall was implemented using mayReadOrWriteMemory(). 'const' is stricter than that, mayReadOrWriteMemory is e.g. true for malloc(), since it may only read/write addresses that are considered inaccessible fro the application. However, malloc is certainly not speculable. * Values that are isConstCall were not handled consistently throughout Polly. In particular, it was not considered for referenced values (OpenMP outlining and PollyACC). Fix by removing special handling for isConstCall entirely.	2021-09-26 03:26:43 -05:00
Michael Kruse	a5d47b3fa0	[Polly] Fix wrong redirect in test case.	2021-09-24 14:53:00 -05:00

1 2 3 4 5 ...

1526 Commits