llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	7527e3f59c	Do not use the POLLY vector code generator if only strip-mining is requested This fixes http://llvm.org/PR23127 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234113	2015-04-05 06:53:21 +00:00
Tobias Grosser	fe4bb1c81b	[tests] Use -polly-vectorizer=polly directly instead of defining a lit variable %vector-opt. llvm-svn: 234112	2015-04-05 06:53:11 +00:00
Tobias Grosser	4f6bceface	Do not scale tile loops We now generate tile loops as: for (int c1 = 0; c1 <= 47; c1 += 1) for (int c2 = 0; c2 <= 47; c2 += 1) for (int c3 = 0; c3 <= 31; c3 += 1) for (int c4 = 0; c4 <= 31; c4 += 4) #pragma simd for (int c5 = c4; c5 <= c4 + 3; c5 += 1) Stmt_for_body3(32 * c1 + c3, 32 * c2 + c5); instead of for (int c1 = 0; c1 <= 1535; c1 += 32) for (int c2 = 0; c2 <= 1535; c2 += 32) for (int c3 = 0; c3 <= 31; c3 += 1) for (int c4 = 0; c4 <= 31; c4 += 4) #pragma simd for (int c5 = c4; c5 <= c4 + 3; c5 += 1) Stmt_for_body3(c1 + c3, c2 + c5); Run-time performance-wise this makes little difference, but this gives a large reduction in compile time (10-30% on 17 LNT benchmarks). Apparently the isl AST generator is not yet very efficient in generating the latter. llvm-svn: 233675	2015-03-31 07:52:36 +00:00
Tobias Grosser	d654eeb862	Drop some CLooG leftovers llvm-svn: 233572	2015-03-30 17:56:50 +00:00
Tobias Grosser	619190d5a7	Delinearization of expressions that contain array size parameters This allows us to delinearize code such as: A[][n] for (i for (j A[i][n-j-1] = ... which would previously have been delinearized to an access A[i+1][-j-1]. To recover the correct access we apply the piecewise expression: { A[i][j] -> A[i-1][i+N]: i < 0; A[i][j] -> A[i][i]: i >= 0} This approach generalizes to higher dimensions. llvm-svn: 233566	2015-03-30 17:22:28 +00:00
Tobias Grosser	aa660a9957	Gist-simplify access relations in the context of domain constraints This simplifies already one test case and is needed for upcoming improvements to our delinearization. llvm-svn: 233507	2015-03-30 00:07:50 +00:00
Johannes Doerfert	be40996cfe	Strip constant factors from SCoP parameters This will strip the constant factor of a parameter before we add it to the SCoP. As a result the access functions are simplified, e.g., for the attached test case. llvm-svn: 233501	2015-03-29 20:45:09 +00:00
Tobias Grosser	715007216c	Bail out if too many alias run-time-check comparisons would be needed This fixes a crash observed in ffmpeg. llvm-svn: 233480	2015-03-28 15:11:14 +00:00
Tobias Grosser	6794238c70	Code generate parameters and run-time checks after branching new code region When creating parameters the SCEVexpander may introduce new induction variables, that possibly create scalar dependences in the original scop, before we code generate the scop. The resulting scalar dependences may then inhibit correct code generation of the scop. To prevent this, we first version the code without a run-time check and only then introduce new parameters and the run-time condition. The if-condition then guards the original scop from being modified by the SCEVexpander. This change causes some test case changes as the run-time conditions are now introduced in the split basic block rather than in the entry basic block. This fixes http://llvm.org/PR22069 Test case reduced by: Karthik Senthil llvm-svn: 233477	2015-03-28 09:34:40 +00:00
Tobias Grosser	17778eb826	Drop redundant run line in check llvm-svn: 233476	2015-03-28 09:34:34 +00:00
Tobias Grosser	2873645c51	Drop -polly-vectorizer-unroll-only option This option was earlier used for experiments with the vectorizer, but to my knowledge is not really used anymore. If anybody needs this, we can always reintroduce this feature. llvm-svn: 232934	2015-03-23 07:00:36 +00:00
Tobias Grosser	bbb4cec2e8	Use schedule trees to perform post-scheduling transformations Replacing the old band_tree based code with code that is based on the new schedule tree [1] interface makes applying complex schedule transformations a lot more straightforward. We now do not need to reason about the meaning of flat schedules, but can use a more straightforward tree structure. We do not yet exploit this a lot in the current code, but hopefully we will be able to do so soon. This change also allows us to drop some code, as isl now provides some higher level interfaces to apply loop transformations such as tiling. This change causes some small test case changes as isl uses a slightly different way to perform loop tiling, but no significant functional changes are intended. [1] http://impact.gforge.inria.fr/impact2014/papers/impact2014-verdoolaege.pdf llvm-svn: 232911	2015-03-22 12:06:39 +00:00
Tobias Grosser	9715b7c592	Add forgotten 'FileCheck' to tiling test cases These test cases did not verify the CHECK lines at all. We add the FileCheck and also fix some broken CHECK lines. Being here, we extend the checks to cover the whole loop structure. llvm-svn: 232710	2015-03-19 07:39:34 +00:00
Duncan P. N. Exon Smith	0353f279f1	Fix debug info now that the verifier is on `i32 0` isn't a valid type, and `!{i32 0}` isn't an empty array. Needed because of r232505. llvm-svn: 232514	2015-03-17 18:23:38 +00:00
David Blaikie	4a54fae8cb	Test case updates for explicit type parameter to the gep operator llvm-svn: 232186	2015-03-13 18:21:20 +00:00
Tobias Grosser	f2716ea7d5	Add -polly-vectorizer=stripmine By strip-mining outer loops to the innermost level we can enable LLVM's loop vectorizer to vectorize outer loops. llvm-svn: 232100	2015-03-12 20:48:07 +00:00
Tobias Grosser	bb4126470a	Drop option to prepare code for the BB vectorizer The BB vectorizer is deprecated and there is no point in generating code for it any more. This option was introduced when there was not yet any loop vectorizer in sight. Now that the loop vectorizer has matured, Polly should target it. llvm-svn: 232099	2015-03-12 20:47:58 +00:00
Tobias Grosser	6e084ccda3	Shorten user report message slightly llvm-svn: 231633	2015-03-09 06:59:16 +00:00
Tobias Grosser	f3c17e65d1	Drop meaningless test case This test case was supposed to test the range analysis but it became just another delinearization test case after enabling delinearization. Suggested-by: Johannes Doerfert llvm-svn: 231599	2015-03-08 16:12:47 +00:00
Johannes Doerfert	1e03f5d10d	Small change to create_ll.sh [NFC] llvm-svn: 231596	2015-03-08 15:36:27 +00:00
Tobias Grosser	bf7193ae61	Update test cases to work independently of delinearization default llvm-svn: 231594	2015-03-08 15:21:15 +00:00
Johannes Doerfert	6a4d81c1f6	Add end user report message for unprofitable regions [NFC] llvm-svn: 231593	2015-03-08 15:11:50 +00:00
Tobias Grosser	90078c5580	Add sign-extension during codegen of index expressions When code generating array index expressions the types of the different components of the index expressions may not always match. We extend the type of the index expression (if possible) and assert otherwise. llvm-svn: 231592	2015-03-08 15:08:32 +00:00
Tobias Grosser	6e4d597e86	Add delinearization test-case that timed out earlier llvm-svn: 231589	2015-03-08 12:07:02 +00:00
Johannes Doerfert	f6557f98a2	Rename the Dependences pass to DependenceInfo [NFC] We rename the Dependences pass to DependenceInfo as a first step to a caching pass policy. The new DependenceInfo pass will later provide "Dependences" for a SCoP. To keep consistency the test folder is renamed too. llvm-svn: 231308	2015-03-04 22:43:40 +00:00
David Blaikie	23f94dfdf4	Update Polly tests for the great metadata schema change llvm-svn: 231089	2015-03-03 18:17:26 +00:00
Johannes Doerfert	d239aac2ee	Do not model scalar accesses in non-affine subregions If a scalar was defined and used only in a non-affine subregion we do not need to model the accesses. However, if the scalar was defined inside the region and escapes the region we have to model the access. The same is true if the scalar was defined outside and used inside the region. llvm-svn: 230960	2015-03-02 14:06:01 +00:00
Johannes Doerfert	6982fa4bb0	[Fix] Two tests that broke during the last changes llvm-svn: 230800	2015-02-27 21:58:26 +00:00
David Blaikie	47d6783913	Fix test I missed This was & is failing at ToT, but now it's failing for the original reason, not because the IR can't be parsed. llvm-svn: 230797	2015-02-27 21:31:00 +00:00
David Blaikie	c94eca0546	Update Polly tests to handle explicitly typed load changes in LLVM. llvm-svn: 230796	2015-02-27 21:22:50 +00:00
David Blaikie	d7b6aa3251	Update one test I missed when updating for the opaque pointer gep changes to LLVM. llvm-svn: 230792	2015-02-27 20:43:19 +00:00
David Blaikie	bad3ff207f	Update Polly tests to handle explicitly typed gep changes in LLVM llvm-svn: 230784	2015-02-27 19:20:19 +00:00
Johannes Doerfert	514f6efa2b	[FIX] Teach RegionGenerator to respect and update dominance When we generate code for a whole region we have to respect dominance and update it too. The first is achieved with multiple "BBMap"s. Each copied block in the region gets its own map. It is initialized only with values mapped in the immediate dominator block, if this block is in the region and was therefore already copied. This way no values defined in a block that doesn't dominate the current one will be used. To update dominance information we check if the immediate dominator of the original block we want to copy is in the region. If so we set the immediate dominator of the current block to the copy of the immediate dominator of the original block. llvm-svn: 230774	2015-02-27 18:29:04 +00:00
Tobias Grosser	f72bdbfbb1	Use isl_ast_expr_call to create run-time checks isl recently introduced a new interface to create run-time checks from constraint sets. Use this interface to simplify our run-time check generation. llvm-svn: 230640	2015-02-26 15:21:10 +00:00
Tobias Grosser	e395da7986	Update isl to 0980603 'isl_tab_pip.c: parallel_constraints: drop useless assignment' This update contains: - Fixes of minor issues detected by clang's scan_build - More schedule tree infrastructure additions This update slightly changes the output of our dependence analysis, but these changes are purely syntactic. llvm-svn: 230528	2015-02-25 19:34:52 +00:00
Johannes Doerfert	275a1756ad	Allow non-affine control flow -- Code Generation This is the code generation for region statements that are created when non-affine control flow was present in the input. A new generator, similar to the block or vector generator, for regions is used to traverse and copy the region statement and to adjust the control flow inside the new region in the end. llvm-svn: 230340	2015-02-24 16:16:32 +00:00
Johannes Doerfert	ff9d1980a7	Allow non-affine control flow -- SCoP Modeling This allows us to model non-affine regions in the SCoP representation. SCoP statements can now describe either basic blocks or non-affine regions. In the latter case all accesses in the region are accumulated for the statement and write accesses, except in the entry, have to be marked as may-write. Differential Revision: http://reviews.llvm.org/D7846 llvm-svn: 230329	2015-02-24 12:00:50 +00:00
Johannes Doerfert	e70449400f	Add ScalarEvolution bounds to non-affine access functions llvm-svn: 230328	2015-02-24 11:58:30 +00:00
Johannes Doerfert	ba65c1672a	Allow non-affine control flow -- SCoP Detection With this patch we allow the SCoP detection to detect regions as SCoPs which have non-affine control flow inside. All non-affine regions are tracked and later accessible to the ScopInfo. As there is no real difference, non-affine branches as well as floating point branches are covered (and both called non-affine control flow). However, the detection is restricted to overapproximate only loop free regions. llvm-svn: 230325	2015-02-24 11:45:21 +00:00
Johannes Doerfert	f9e3462b69	[FIX] 2 broken tests llvm-svn: 230231	2015-02-23 16:34:20 +00:00
Johannes Doerfert	4f8ac3d123	Use ScalarEvolution to create tight bounds on the parameters llvm-svn: 230230	2015-02-23 16:15:51 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
Tobias Grosser	1fa7b972c0	Update to isl 99d53692ba This commit imports the latest isl version into lib/External/isl. The changes relevant for Polly are: 1) Schedule trees [1] have been introduced as a more structured way to describe schedules. Polly does not yet use them, but we may switch to them in the near future. 2) Another set of coalescing changes [2] simplifies some data dependences and removes a couple of code generation artifacts. We now understand that the following sets can be merged: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i0 >= 0 and i1 <= 1023 - i0 and i1 >= 1 Stmt_S1[i0, 0] -> Stmt_S2[i0] : i0 <= 1023 and i0 >= 1} into: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i1 <= 1023 - i0 and i1 >= 0 and i1 >= 1 - i0 and i0 >= 0 } Changes of this kind reduce unnecessary specialization during code generation. - for (int c3 = 0; c3 <= 1023; c3 += 1) { - if (c3 % 2 == 0) { - Stmt_for_body3(c1, c3); - } else - Stmt_for_body3(c1, c3); - } + for (int c3 = 0; c3 <= 1023; c3 += 1) + Stmt_for_body3(c1, c3); [1] http://impact.gforge.inria.fr/impact2014/papers/impact2014-verdoolaege.pdf [2] http://impact.gforge.inria.fr/impact2015/papers/impact2015-verdoolaege.pdf llvm-svn: 229423	2015-02-16 19:33:40 +00:00
Johannes Doerfert	57ef179695	[FIX] Remove XFAIL again llvm-svn: 228868	2015-02-11 19:28:39 +00:00
Johannes Doerfert	c47edb51c6	[FIX] Correctly handle scalar dependences of branch instructions llvm-svn: 228866	2015-02-11 19:12:19 +00:00
Johannes Doerfert	d594aeb248	[FIX] Fix test case that was affected by the early exit patch llvm-svn: 228865	2015-02-11 19:11:57 +00:00
Tobias Grosser	a906ee754d	Drop an assert and XFAIL two test cases This gets the buildbot green to avoid further emails. Johannes will fix this later in the evening. llvm-svn: 228862	2015-02-11 18:46:33 +00:00
Johannes Doerfert	7ceb040213	Add early exits for SCoPs we did not optimize This allows us to skip ast and code generation if we did not optimize a SCoP and will not generate parallel or alias annotations. The initial heuristic to exit is simple but allows improvements later on. All failing test cases have been modified to disable early exit, thus to keep their coverage. Differential Revision: http://reviews.llvm.org/D7254 llvm-svn: 228851	2015-02-11 17:25:09 +00:00
Johannes Doerfert	1f87f485b1	Model scalar writes with uses outside the SCoP These writes are important as they will force the scheduling and code generation of an otherwise trivial statement and also impose an order of execution needed to guarantee the correct final value for a scalar in a loop. Added test case modeled after ClamAV/clamscan. llvm-svn: 228847	2015-02-11 17:02:52 +00:00
Johannes Doerfert	b9d18887d7	Allow signed division in access functions llvm-svn: 228833	2015-02-11 14:54:50 +00:00
Johannes Doerfert	97235c691a	[FIX] Special case for branch users of scalar values llvm-svn: 228832	2015-02-11 14:52:52 +00:00
Johannes Doerfert	4a60b173a7	Do not run independent blocks when we model all scalar dependences llvm-svn: 228441	2015-02-06 21:26:45 +00:00
Johannes Doerfert	76e37fe005	[Fix] Broken test case llvm-svn: 228439	2015-02-06 21:20:14 +00:00
Johannes Doerfert	0ff23ec544	Model PHI nodes without demoting them This allows us to model PHI nodes in the polyhedral description without demoting them. The modeling however will result in the same accesses as the demotion would have introduced. Differential Revision: http://reviews.llvm.org/D7415 llvm-svn: 228433	2015-02-06 20:13:15 +00:00
Tobias Grosser	eb29c68df2	Add test case for r227805 llvm-svn: 227970	2015-02-03 15:11:02 +00:00
Johannes Doerfert	a57746b871	[NFC] Fix typo llvm-svn: 227955	2015-02-03 08:55:01 +00:00
Johannes Doerfert	535ee97853	[FIX] Updated test case (fixed names -> regular expressions) llvm-svn: 227807	2015-02-02 16:13:36 +00:00
Johannes Doerfert	8cd22d4947	[FIX] Check non-deterministic isl output llvm-svn: 227802	2015-02-02 14:07:02 +00:00
Johannes Doerfert	9282076ece	[NFC] Drop the "scattering" tuple name llvm-svn: 227801	2015-02-02 13:45:54 +00:00
Johannes Doerfert	3a3799e43a	[FIX] Activated a pointer test and removed obsolete comment llvm-svn: 227524	2015-01-30 00:36:13 +00:00
Johannes Doerfert	cf0e05a58f	[FIX] Correct two C snippets in test cases llvm-svn: 227407	2015-01-29 00:50:46 +00:00
Johannes Doerfert	ef61def9d5	[FIX] Handle pointer-pointer comparisons This should fix a problem introduced by r225464. llvm-svn: 227404	2015-01-29 00:41:33 +00:00
Johannes Doerfert	07e8a406d6	[FIX] Independent blocks with intrinsics handling Also an old option was removed from some new test cases llvm-svn: 227057	2015-01-25 19:09:49 +00:00
Johannes Doerfert	3f500fa2f6	Support for math/misc intrinsics The support is currently limited as we only allow them in the input but do not emit them in the transformed SCoP due to the possible semantic changes. Differential Revision: http://reviews.llvm.org/D5225 llvm-svn: 227054	2015-01-25 18:07:30 +00:00
Chandler Carruth	78ae1c92ca	[multilib] Teach Polly's CMake to use the libdir suffix variable. This lets 'ninja check-polly' pass for me with a lib64 build of LLVM. I've not updated the standalone side as I don't use it and don't have an easy way to test any changes I've made there. I mostly wanted to be able to actually run Polly's tests when I update its use of LLVM's APIs during my refactorings on the (very unlikely) off chance that I make a change which compiles but does the wrong thing. llvm-svn: 226420	2015-01-19 01:03:05 +00:00
Tobias Grosser	be30c2c56e	Adjust to the new explicit debug metadata This fixes the outfall of r226048 llvm-svn: 226134	2015-01-15 07:02:12 +00:00
Tobias Grosser	c642e95402	Use types of matching size when generating multi-dimensional address expressions This change ensures that the values that represent the array size of a multi-dimensional access are correctly sign-extended when used to compute a memory address used in the run-time alias check. To make the test case more readable, we name the instructions that we generate. llvm-svn: 225818	2015-01-13 19:37:59 +00:00
David Peixotto	dc0a11c21f	Fix maxLoopDepth computation in ScopInfo The max loop depth was incorrectly computed for scops that contain a block from a loop but do not contain the entire loop. We need to check that the full loop is contained in the region when computing the max loop depth. These scops occur when a region containing an inner loop is expanded to include some blocks from the outer loop, but it cannot be fully expanded to contain the outer loop because the region containing the outer loop is invalid. Differential Revision: http://reviews.llvm.org/D6913 llvm-svn: 225812	2015-01-13 18:31:55 +00:00
Tobias Grosser	0a092763e7	Adjust test for the new 'distinct' metadata nodes 'distinct' was introduced in 225474. We now adjust the test cases to match for the additional 'distinct' marker. llvm-svn: 225512	2015-01-09 08:10:36 +00:00
Tobias Grosser	bfbc3690bb	Add experimental support for unsigned expressions This support is still incomplete and consequently hidden behind a switch that needs to be enabled. One problem is ATM that we incorrectly interpret very large unsigned values as negative values even if used in an unsigned comparison. llvm-svn: 225480	2015-01-09 00:01:33 +00:00
Tobias Grosser	55bc4c0767	Add support for pointer types in expressions llvm-svn: 225464	2015-01-08 19:26:53 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value across all statements do not carry any information but, due to the increased dimensionality of the schedule, cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Andreas Simbuerger	cd8500e500	(diagnostics) fix typo in test... llvm-svn: 224591	2014-12-19 17:22:46 +00:00
Duncan P. N. Exon Smith	39e21f9c27	Hand-modify a testcase (still PR21532) Bot was still tripping [1] on a testcase the upgrade script didn't handle in 224269. This is still fallout from r224257. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25435 llvm-svn: 224280	2014-12-15 21:43:20 +00:00
Duncan P. N. Exon Smith	bd62edb20d	Run upgrade script from PR21532 to match LLVM changes Update tests for LLVM assembly format change in r224257 using the script attached to PR21532. I'm hoping this unsticks the bot [1]. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25432 llvm-svn: 224269	2014-12-15 20:28:50 +00:00
Tobias Grosser	13e222ca55	Update to the latest version of isl Isl now specifically marks modulo operations that are compared against zero. They can be implemented with the C/LLVM remainder operation. We also update a couple of test cases where the output of isl has slightly changed. llvm-svn: 223607	2014-12-07 16:04:29 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as a prerequisite. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop support for the non-SCEV-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Hongbin Zheng	c5447f4c3b	Do not incorrectly set the inverted flag. In TempScopInfo::buildCondition we extract the conditions to guard the BB in addition to loop bounds. This means we should only consider the conditions in the paths (in CFG) that do not contain cycles (loops). At the same time, we set the invert flag if the FalseBB of the current branch dominates our target BB to indicate that we reach the target BB with an inverted condition from the current branch. In this case, the path from the FalseBB contains a cycle if the FalseBB is the target of a backedge. The conditions implied by such a path should not be considered. We can identify such a case by checking if the TrueBB also dominates our target BB, which means we can also reach our target BB from the TrueBB, without going through the backedge. llvm-svn: 222907	2014-11-28 03:26:06 +00:00
Tobias Grosser	154d9469f4	Add PreHeader always to OuterLoop This fixes a bug introduced in r217525. llvm-svn: 222766	2014-11-25 17:09:21 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only ever executed with this set of parameter values. Example: void foo(float A[][20], long n, long m) { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is larger than 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	bab3568105	Modify test cases to work with SCEV based code generation This patch includes tests where we actually need to adjust the CHECK lines for SCEV based code generation. Besides these adjustments we add explicit calls to -polly-codegen-scev=[true\|false] and make sure we test both cases. llvm-svn: 222112	2014-11-16 22:43:21 +00:00
Tobias Grosser	95cd1c718e	Make usage of scev based code generation explicit in tests This is in preparation of using SCEV based codegen by default in polly llvm-svn: 222111	2014-11-16 21:43:28 +00:00
Tobias Grosser	2f8732e7c6	Independent blocks: SE->forget() scalars translated to arrays This prevents SCEVs from referencing values that are no longer valid and as a consequence solves a bug where such values reintroduced during ast generation caused the independent blocks pass to fail validation. http://llvm.org/PR21204 llvm-svn: 222103	2014-11-16 20:33:58 +00:00
Tobias Grosser	b05b038b81	Switch default code generation backend to isl The isl based backend has been tested for a long time and with the recently committed OpenMP support the last missing piece of functionality was ported from the CLooG backend. The isl based backend gives us interesting new functionality: - Run-time alias checks (enabled by default) Optimize scops that contain possibly aliasing pointers. This feature has largely increased the number of loop nests we consider for optimization. Thanks Johannes! - Delinearization (not yet enabled by default) Model accesses to multi-dimensional arrays precisely. This will allow us to understand kernels with multi-dimensional VLAs written in Julia, boost::ublas, coremark or C99. Thanks Sebastian! - Generation of higher quality code Sven and I spent a long time to optimize the quality of the generated code. A major focus were expressions as they result from modulos/divisions or piecewise affine expressions (a ? b : c). - Full/Partial tile separation, polyhedral unrolling The isl code generation provides functionality to generate specialized code for core and cleanup loops and to specialize code using polyhedral context information while unrolling statements. (not yet exploited in Polly) - Modifiable access functions We can now use standard isl functionality to remap memory accesses to new data locations. A standard use case is the use of shared memory, where accesses to a larger region in global memory need to be mapped to a smaller shared memory region using a modulo mapping. (not yet exploited in Polly) The cloog based code generation is still available for comparison, but is scheduled for removal. llvm-svn: 222101	2014-11-16 17:02:11 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improved heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	d1c12e65cd	Remove one incomplete test case accidentally committed llvm-svn: 222089	2014-11-15 21:34:34 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due to a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot of loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
David Peixotto	a4817871d2	Safely generate new loop metadata node Polly was accidentally modifying a debug info metadata node when attempting to generate a new unique metadata node for the loop id. The problem was that we had dwarf metadata that referred to a metadata node with a null value, like this: !6 = ... some dwarf metadata referring to !7 ... !7 = {null} When we attempt to generate a new metadata node, we reserve the first space for self-referential node by setting the first argument to null and then mutating the node later to refer to itself. However, because the nodes are uniqued based on pointer values, when we get the new metadata node it actually referred to an existing node (!7 in the example). When we went to modify the metadata to point to itself, we were accidentally mutating the dwarf metadata. We ended up in this situation: !6 = ... some dwarf metadata referring to !7 ... !7 = {!7} and this causes an assert when generating the debug info. The fix is simple, we just need to use a unique value when getting a new metadata node. The MDNode::getTemporary() provides exactly the API we need (and it is used in clang to generate the unique nodes). Differential Revision: http://reviews.llvm.org/D6174 llvm-svn: 221550	2014-11-07 21:44:18 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduce a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
David Peixotto	8da2b93d9f	Change the RegionSet type to a SetVector This patch changes the RegionSet type used in ScopDetection from a std::set to a llvm::SetVector. The reason for the change is to ensure deterministic output when printing the result of the analysis. We had a Windows buildbot failure for the modified test because the output was coming in a different order. Only one test case needed to be modified for this change. We could use CHECK-DAG directives instead of CHECK in the analysis test cases because the actual order of scops does not matter, but I think that change should be done in a separate patch that modifies all the applicable tests. I simply modified the test to reflect the expected deterministic output. Differential Revision: http://reviews.llvm.org/D5897 llvm-svn: 220423	2014-10-22 20:39:07 +00:00
Johannes Doerfert	9b5786960d	Relax the condition on the jsop accesses regarding the alignment. We restricted the new access functions to be a subset of the old one because we want to keep the alignment, however if the alignment is "not special", thus the default for the type, we can allow any access. Differential Revision: http://reviews.llvm.org/D5680 llvm-svn: 219503	2014-10-10 15:14:29 +00:00
Johannes Doerfert	341a15a64b	Use the new access function (if present) to compute the access stride. Differential Revision: http://reviews.llvm.org/D5661 llvm-svn: 219499	2014-10-10 14:28:46 +00:00
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the piecewise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two things: 1) Handle the case where the condition of a select is not an i1 type, as in C. 2) Try to simplify the piecewise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constraints. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	f1ee2622be	[Fix] Dead statements should not confuse the RTC generation This fixes http://llvm.org/bugs/show_bug.cgi?id=21166 . Differential Revision: http://reviews.llvm.org/D5623 llvm-svn: 219131	2014-10-06 17:43:00 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class stores information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Duncan P. N. Exon Smith	52fd68980c	DI: LLVM schema change: fold constants into string Update debug info testcases for the LLVM metadata schema change in r219010 to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219019	2014-10-03 21:08:48 +00:00

436 Commits