llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	7cb809983d	[tests] Force invariant load hoisting for test cases that need it -- III llvm-svn: 278673	2016-08-15 15:56:24 +00:00
Johannes Doerfert	8031238017	[GSoC] Add PolyhedralInfo pass - new interface to polly analysis Adding a new pass PolyhedralInfo. This pass will be the interface to Polly. Initially, we will provide the following interface: - #IsParallel(Loop *L) - return a bool depending on whether the loop is parallel or not for the given program order. Patch by Utpal Bora <cs14mtech11017@iith.ac.in> Differential Revision: https://reviews.llvm.org/D21486 llvm-svn: 276637	2016-07-25 12:48:45 +00:00
Tobias Grosser	07b2095234	Update isl to isl-0.17.1-57-g1879898 With this update the isl AST generation extracts disjunctive constraints early on. As a result, code that previously resulted in two branches with (close-to) identical code within them: if (P <= -1) { for (int c0 = 0; c0 < N; c0 += 1) Stmt_store(c0); } else if (P >= 1) for (int c0 = 0; c0 < N; c0 += 1) Stmt_store(c0); results now in only a single branch body: if (P <= -1 \|\| P >= 1) for (int c0 = 0; c0 < N; c0 += 1) Stmt_store(c0); This resolves http://llvm.org/PR27559 Besides the above change, this isl update brings better simplification of sets/maps containing existentially quantified dimensions and fixes a bug in isl's coalescing. llvm-svn: 272500	2016-06-12 04:30:40 +00:00
Tobias Grosser	8620679eb5	Expand test cases affected by next commit As these test cases will be changed in a subsequent commit, we expand and tighten them to make the subsequent changes to them more obvious. As part of this we add more context to some test cases and add CHECK-NEXT lines to ensure no intermediate lines are missed by accident. llvm-svn: 272499	2016-06-12 04:29:57 +00:00
Tobias Grosser	1022ca5646	Codegen: Enable the detection of min/max expressions Min/max expressions are easier to read and can in some cases also result in more concise IR that is generated as the min/max --- when lowered to a cmp+select pattern -- commonly has a simpler condition then the ternary condition isl would normally generate. llvm-svn: 268855	2016-05-07 08:03:44 +00:00
Johannes Doerfert	615e0b85f8	Record wrapping assumptions early Utilizing the record option for assumptions we can simplify the wrapping assumption generation a lot. Additionally, we can now report locations together with wrapping assumptions, though they might not be accurate yet. llvm-svn: 266069	2016-04-12 13:28:39 +00:00
Tobias Grosser	37034db826	Update to isl-0.16.1-145-g243bf7c Just an import to keep track with the latest version of isl. We are not looking for specific features. llvm-svn: 264452	2016-03-25 19:38:18 +00:00
Tobias Grosser	25e8ebe29d	Drop explicit -polly-delinearize parameter Delinearization is now enabled by default and does not need to explicitly need to be enabled in our tests. llvm-svn: 264154	2016-03-23 13:21:02 +00:00
Johannes Doerfert	066dbf3f8e	Track assumptions and restrictions separatly In order to speed up compile time and to avoid random timeouts we now separately track assumptions and restrictions. In this context assumptions describe parameter valuations we need and restrictions describe parameter valuations we do not allow. During AST generation we create a runtime check for both, whereas the one for the restrictions is negated before a conjunction is build. Except the In-Bounds assumptions we currently only track restrictions. Differential Revision: http://reviews.llvm.org/D17247 llvm-svn: 262328	2016-03-01 13:06:28 +00:00
Johannes Doerfert	9dd42ee7c1	Try to build alias checks even when non-affine accesses are allowed From now on we bail only if a non-trivial alias group contains a non-affine access, not when we discover aliasing and non-affine accesses are allowed. llvm-svn: 261863	2016-02-25 14:06:11 +00:00
Tobias Grosser	020fa09a3c	Remove -polly-code-generator=isl from many test cases This is the default since a long time. Setting it again does not add value in any of these test cases. llvm-svn: 253800	2015-11-21 23:05:48 +00:00
Tobias Grosser	b39c96aa19	ScopInfo: Ensure unique names for parameter names coming from load instructions In case the original parameter instruction does not have a name, but it comes from a load instruction where the base pointer has a name we used the name of the load instruction to give some more intuition of where the parameter came from. To ensure this works also through GEPs which may have complex offsets, we originally just dropped the offsets and _only_ used the base pointer name. As this can result in multiple parameters to get the same name, we now prefix the parameter ID to ensure parameter names are unique. This will make it easier to understand debug output. This change does not affect correctness, as parameter IDs (even of the same name) can always be distinguished through the SCEV pointer stored inside them. llvm-svn: 253330	2015-11-17 11:54:51 +00:00
Johannes Doerfert	f85ad0411f	[FIX] Carefully simplify assumptions in the presence of error blocks If a SCoP contains error blocks we cannot use the domain constraints to simplify the assumptions as the domain is already influenced by the assumptions we took. Before this patch we did that and some assumptions became self-fulfilling as they were implied by the domain constraints. llvm-svn: 252424	2015-11-08 20:16:39 +00:00
Tobias Grosser	f4ee371e60	tests: Drop -polly-detect-unprofitable and -polly-no-early-exit These flags are now always passed to all tests and need to be disabled if not needed. Disabling these flags, rather than passing them to almost all tests, significantly simplfies our RUN: lines. llvm-svn: 249422	2015-10-06 15:36:44 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00
Johannes Doerfert	b68cffb5df	Allow general loops with one latch As we do not rely on ScalarEvolution any more we do not need to get the backedge taken count. Additionally, our domain generation handles everything that is affine and has one latch and our ScopDetection will over-approximate everything else. This change will therefor allow loops with: - one latch - exiting conditions that are affine Additionally, it will not check for structured control flow anymore. Hence, loops and conditionals are not necessarily single entry single exit regions any more. Differential Version: http://reviews.llvm.org/D12758 llvm-svn: 247289	2015-09-10 15:27:46 +00:00
Chandler Carruth	66ef16b289	[PM] Update Polly for the new AA infrastructure landed in r247167. llvm-svn: 247198	2015-09-09 22:13:56 +00:00
Tobias Grosser	ed21a1fc7e	Do not detect Scops with only one loop. If a region does not have more than one loop, we do not identify it as a Scop in ScopDetection. The main optimizations Polly is currently performing (tiling, preparation for outer-loop vectorization and loop fusion) are unlikely to have a positive impact on individual loops. In some cases, Polly's run-time alias checks or conditional hoisting may still have a positive impact, but those are mostly enabling transformations which LLVM already performs for individual loops. As we do not focus on individual loops, we leave them untouched to not introduce compile time regressions and execution time noise. This results in good compile time reduction (oourafft: -73.99%, smg2000: -56.25%). Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12268 llvm-svn: 246161	2015-08-27 16:55:18 +00:00
Johannes Doerfert	43788c5783	Check for feasible runtime check context early Instead of generating code for an empty assumed context we bail out early. As the number of assumptions we generate increases this becomes more and more important. Additionally, this change will allow us to hide internal contexts that are only used in runtime checks e.g., a boundary context with constraints not suited for simplifications. llvm-svn: 245540	2015-08-20 05:58:56 +00:00
Johannes Doerfert	338b42c329	Removed redundant alias checks generated during run time. As specified in PR23888, run-time alias check generation is expensive in terms of compile-time. This reduces the compile time by computing minimal/maximal access only once for each base pointer Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 243024	2015-07-23 17:04:54 +00:00
Tobias Grosser	808cd69a92	Use schedule trees to represent execution order of statements Instead of flat schedules, we now use so-called schedule trees to represent the execution order of the statements in a SCoP. Schedule trees make it a lot easier to analyze, understand and modify properties of a schedule, as specific nodes in the tree can be choosen and possibly replaced. This patch does not yet fully move our DependenceInfo pass to schedule trees, as some additional performance analysis is needed here. (In general schedule trees should be faster in compile-time, as the more structured representation is generally easier to analyze and work with). We also can not yet perform the reduction analysis on schedule trees. For more information regarding schedule trees, please see Section 6 of https://lirias.kuleuven.be/handle/123456789/497238 llvm-svn: 242130	2015-07-14 09:33:13 +00:00
Tobias Grosser	173ecab705	Remove target triples from test cases I just learned that target triples prevent test cases to be run on other architectures. Polly test cases are until now sufficiently target independent to not require any target triples. Hence, we drop them. llvm-svn: 235384	2015-04-21 14:28:02 +00:00
Tobias Grosser	619190d5a7	Delinearization of expressions that contain array size parameters This allows us to delinerize code such as: A[][n] for (i for (j A[i][n-j-1] = ... which would previously have been delinearize to an access A[i+1][-j-1]. To recover the correct access we apply the piecewise expression: { A[i][j] -> A[i-1][i+N]: i < 0; A[i][j] -> A[i][i]: i >= 0} This approach generalizes to higher dimensions. llvm-svn: 233566	2015-03-30 17:22:28 +00:00
David Blaikie	c94eca0546	Update Polly tests to handle explicitly typed load changes in LLVM. llvm-svn: 230796	2015-02-27 21:22:50 +00:00
David Blaikie	bad3ff207f	Update Polly tests to handle explicitly typed gep changes in LLVM llvm-svn: 230784	2015-02-27 19:20:19 +00:00
Tobias Grosser	f72bdbfbb1	Use isl_ast_expr_call to create run-time checks isl recently introduced a new interface to create run-time checks from constraint sets. Use this interface to simplify our run-time check generation. llvm-svn: 230640	2015-02-26 15:21:10 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
Tobias Grosser	1fa7b972c0	Update to isl 99d53692ba This commit imports the latest isl version into lib/External/isl. The changes relavant for Polly are: 1) Schedule trees [1] have been introduced as a more structured way to describe schedules. Polly does not yet use them, but we may switch to them in the near future. 2) Another set of coalescing changes [2] simplifies some data dependences and removes a couple of code generation artifacts. We now understand that the following sets can be merged: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i0 >= 0 and i1 <= 1023 - i0 and i1 >= 1 Stmt_S1[i0, 0] -> Stmt_S2[i0] : i0 <= 1023 and i0 >= 1} into: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i1 <= 1023 - i0 and i1 >= 0 and i1 >= 1 - i0 and i0 >= 0 } Changes of this kind reduce unnecessary specialization during code generation. - for (int c3 = 0; c3 <= 1023; c3 += 1) { - if (c3 % 2 == 0) { - Stmt_for_body3(c1, c3); - } else - Stmt_for_body3(c1, c3); - } + for (int c3 = 0; c3 <= 1023; c3 += 1) + Stmt_for_body3(c1, c3); [1] http://impact.gforge.inria.fr/impact2014/papers/impact2014-verdoolaege.pdf [2] http://impact.gforge.inria.fr/impact2015/papers/impact2015-verdoolaege.pdf llvm-svn: 229423	2015-02-16 19:33:40 +00:00
Johannes Doerfert	7ceb040213	Add early exits for SCoPs we did not optimize This allows us to skip ast and code generation if we did not optimize a SCoP and will not generate parallel or alias annotations. The initial heuristic to exit is simple but allows improvements later on. All failing test cases have been modified to disable early exit, thus to keep their coverage. Differential Revision: http://reviews.llvm.org/D7254 llvm-svn: 228851	2015-02-11 17:25:09 +00:00
Johannes Doerfert	9282076ece	[NFC] Drop the "scattering" tuple name llvm-svn: 227801	2015-02-02 13:45:54 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value accross all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Duncan P. N. Exon Smith	bd62edb20d	Run upgrade script from PR21532 to match LLVM changes Update tests for LLVM assembly format change in r224257 using the script attached to PR21532. I'm hoping this unsticks the bot [1]. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25432 llvm-svn: 224269	2014-12-15 20:28:50 +00:00
Tobias Grosser	13e222ca55	Update to the latest version of isl Isl now specifically marks modulo operations that are compared against zero. They can be implemented with the C/LLVM remainder operation. We also update a couple of test cases where the output of isl has slightly changed. llvm-svn: 223607	2014-12-07 16:04:29 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only every executed with this set of parameter values. Example: void foo(float A[][20], long n, long m { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improvide heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Johannes Doerfert	13771738d3	[RTC] Split alias groups according to read only base addresses If there are multiple read only base addresses in an alias group we can split it into multiple alias groups each with only one read only access. This way we might reduce the number of comparisons significantly as it grows linear in the number of alias groups but exponential in their size. Differential Revision: http://reviews.llvm.org/D5435 llvm-svn: 218757	2014-10-01 12:40:46 +00:00
Johannes Doerfert	b9fb5a2cc6	[RTC] Runtime Alias Checks for the ISL backend (missing tests) Test files missing in r218046. llvm-svn: 218047	2014-09-18 11:20:36 +00:00
Johannes Doerfert	b7e4083599	Updated to isl 2c19ecd444095d6f560349018f68993bc0e03691 Changed test cases and fixed warnings. llvm-svn: 218043	2014-09-18 11:13:35 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IstAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Johannes Doerfert	8e95dc657e	[Fix] OpenMP parallel loop detection for the isl backend There was a bug in the IslAst which caused that no more outermost parallel loops were detected/checked after a parallel outermost loop of depth 1. + Test case attached llvm-svn: 217452	2014-09-09 17:03:54 +00:00
Tobias Grosser	f57d63f906	Do allow negative offsets in the outermost array dimension There is no needed for neither 1-dimensional nor higher dimensional arrays to require positive offsets in the outermost array dimension. We originally introduced this assumption with the support for delinearizing multi-dimensional arrays. llvm-svn: 214665	2014-08-03 21:07:30 +00:00
Johannes Doerfert	b41344a88f	[Fix] Annotate the IslAst with broken reductions (Missing files) + test cases of r214489. llvm-svn: 214491	2014-08-01 08:20:26 +00:00
Tobias Grosser	924e9e0226	IslAst: Enhance parallelism detection test Add more check lines to ensure we do not accidentally generate nested openmp parallel annotations. llvm-svn: 214200	2014-07-29 19:22:46 +00:00
Johannes Doerfert	457f73eaee	Annotate reduction parallel loops in the IslAst textual output + Introduced dependency type TYPE_TC_RED to represent the transitive closure (& the reverse) of reduction dependences. These are used when we check for reduction parallel loops. + Test cases including loop reversals and modulo schedules which compute reductions in a alternated order. llvm-svn: 213019	2014-07-15 00:00:35 +00:00
Tobias Grosser	5e6813d184	Derive run-time conditions for delinearization As our delinearization works optimistically, we need in some cases run-time checks that verify our optimistic assumptions. A simple example is the following code: void foo(long n, long m, long o, double A[n][m][o]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 150; j++) for (long k = 0; k < 200; k++) A[i][j][k] = 1.0; } After clang linearized the access to A and we delinearized it again to A[i][j][k] we need to ensure that we do not access the delinearized array out of bounds (this information is not available in LLVM-IR). Hence, we need to verify the following constraints at run-time: CHECK: Assumed Context: CHECK: [o, m] -> { : m >= 150 and o >= 200 } llvm-svn: 212198	2014-07-02 17:47:48 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00

1 2

54 Commits