llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	f2716ea7d5	Add -polly-vectorizer=stripmine By strip-mining outer loops to the innermost level we can enable LLVM's loop vectorizer to vectorize outer loops. llvm-svn: 232100	2015-03-12 20:48:07 +00:00
Tobias Grosser	bb4126470a	Drop option to prepare code for the BB vectorizer The BB vectorizer is deprecated and there is no point in generating code for it any more. This option was introduced when there was not yet any loop vectorizer in sight. Now being matured, Polly should target the loop vectorizer. llvm-svn: 232099	2015-03-12 20:47:58 +00:00
Johannes Doerfert	6a4d81c1f6	Add end user report message for unprofitable regions [NFC] llvm-svn: 231593	2015-03-08 15:11:50 +00:00
Tobias Grosser	1fa434992b	Fix leftover Dependences.cpp -> DependenceInfo.cpp llvm-svn: 231355	2015-03-05 06:52:42 +00:00
Johannes Doerfert	7e6424ba5a	Create a dependence struct to hold dependence information for a SCoP. The new Dependences struct in the DependenceInfo holds all information that was formerly part of the DependenceInfo. It also provides the same interface for the user to access this information. This is another step to a more general ScopPass interface that does allow multiple SCoPs to be "in flight". llvm-svn: 231327	2015-03-05 00:43:48 +00:00
Johannes Doerfert	6745822fd1	Add missing forward declaration [NFC] llvm-svn: 231326	2015-03-05 00:40:07 +00:00
Johannes Doerfert	f6557f98a2	Rename the Dependences pass to DependenceInfo [NFC] We rename the Dependences pass to DependenceInfo as a first step to a caching pass policy. The new DependenceInfo pass will later provide "Dependences" for a SCoP. To keep consistency the test folder is renamed too. llvm-svn: 231308	2015-03-04 22:43:40 +00:00
Tobias Grosser	b021a4faad	Add support for conditional 'and' and 'or' expressions No test cases unfortunately as we do not yet generate isl_ast_op_and_then or isl_ast_op_or_else. Those will be added in a later commit. llvm-svn: 231268	2015-03-04 18:14:59 +00:00
Johannes Doerfert	d239aac2ee	Do not model scalar accesses in non-affine subregions If a scalar was defined and used only in a non-affine subregion we do not need to model the accesses. However, if the scalar was defined inside the region and escapes the region we have to model the access. The same is true if the scalar was defined outside and used inside the region. llvm-svn: 230960	2015-03-02 14:06:01 +00:00
Johannes Doerfert	2495cfe01d	[Refactor] Simplify ScopPass interface llvm-svn: 230899	2015-03-01 18:43:50 +00:00
Johannes Doerfert	909a3bf21d	[Refactor] Use virtual and override appropriately + Add override for overwritten methods. + Remove virtual for methods we do not want to be overwritten. llvm-svn: 230898	2015-03-01 18:42:08 +00:00
Johannes Doerfert	3fe584d64f	[Refactor] Add a Scop & as argument to printScop This is the first step in the interface simplification. llvm-svn: 230897	2015-03-01 18:40:25 +00:00
Johannes Doerfert	7b1b724c89	Update obsolete comment llvm-svn: 230857	2015-02-28 17:10:06 +00:00
Johannes Doerfert	514f6efa2b	[FIX] Teach RegionGenerator to respect and update dominance When we generate code for a whole region we have to respect dominance and update it too. The first is achieved with multiple "BBMap"s. Each copied block in the region gets its own map. It is initialized only with values mapped in the immediate dominator block, if this block is in the region and was therefor already copied. This way no values defined in a block that doesn't dominate the current one will be used. To update dominance information we check if the immediate dominator of the original block we want to copy is in the region. If so we set the immediate dominator of the current block to the copy of the immediate dominator of the original block. llvm-svn: 230774	2015-02-27 18:29:04 +00:00
Johannes Doerfert	275a1756ad	Allow non-affine control flow -- Code Generation This is the code generation for region statements that are created when non-affine control flow was present in the input. A new generator, similar to the block or vector generator, for regions is used to traverse and copy the region statement and to adjust the control flow inside the new region in the end. llvm-svn: 230340	2015-02-24 16:16:32 +00:00
Johannes Doerfert	6cad9c4746	[FIX] Some comments llvm-svn: 230335	2015-02-24 16:00:29 +00:00
Johannes Doerfert	ff9d1980a7	Allow non-affine control flow -- SCoP Modeling This allows us to model non-affine regions in the SCoP representation. SCoP statements can now describe either basic blocks or non-affine regions. In the latter case all accesses in the region are accumulated for the statement and write accesses, except in the entry, have to be marked as may-write. Differential Revision: http://reviews.llvm.org/D7846 llvm-svn: 230329	2015-02-24 12:00:50 +00:00
Johannes Doerfert	e70449400f	Add ScalarEvolution bounds to non-affine access functions llvm-svn: 230328	2015-02-24 11:58:30 +00:00
Johannes Doerfert	ba65c1672a	Allow non-affine control flow -- SCoP Detection With this patch we allow the SCoP detection to detect regions as SCoPs which have non-affine control flow inside. All non-affine regions are tracked and later accessible to the ScopInfo. As there is no real difference, non-affine branches as well as floating point branches are covered (and both called non-affine control flow). However, the detection is restricted to overapproximate only loop free regions. llvm-svn: 230325	2015-02-24 11:45:21 +00:00
Johannes Doerfert	b4f08eb671	[REFACTOR] Replace Pass* from BlockGen by the DomTree llvm-svn: 230220	2015-02-23 13:51:35 +00:00
Johannes Doerfert	3f1c285294	[REFACTOR] Simplify the SCoP detection interface a bit llvm-svn: 229879	2015-02-19 18:11:50 +00:00
Johannes Doerfert	3a7e812c66	[NFC] Generalize getIslCompatibleName interface. llvm-svn: 229877	2015-02-19 18:09:39 +00:00
Tobias Grosser	c56dcd52be	Add missing comments to member variables Reported-by: Johannes Doerfert llvm-svn: 229854	2015-02-19 14:28:36 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
David Blaikie	c4d7bc3fcc	Update Polly for the removal of LLVM_DELETED_FUNCTION now that '= delete' works on all supported compilers (MSVC2012 compat has been dropped) llvm-svn: 229344	2015-02-15 23:40:18 +00:00
Johannes Doerfert	48d75034de	Add getSize() to the SCoP class. llvm-svn: 229254	2015-02-14 12:13:17 +00:00
Chandler Carruth	d01918fa13	[PM] Convert Polly over to directly use the legacy pass manager namespace and header rather than the top-level header and using declarations. These helpers impede modular builds and are going away. Migrating away from them will also be necessary to start mixing in any usage of the new pass manager. llvm-svn: 229091	2015-02-13 09:51:50 +00:00
Johannes Doerfert	7ceb040213	Add early exits for SCoPs we did not optimize This allows us to skip ast and code generation if we did not optimize a SCoP and will not generate parallel or alias annotations. The initial heuristic to exit is simple but allows improvements later on. All failing test cases have been modified to disable early exit, thus to keep their coverage. Differential Revision: http://reviews.llvm.org/D7254 llvm-svn: 228851	2015-02-11 17:25:09 +00:00
Johannes Doerfert	be9c91173f	[Refactor] Use only one BlockGenerator for a SCoP This change has two main purposes: 1) We do not use a static interface to hide an object we create and destroy for every basic block we copy. 2) We allow the BlockGenerator to store information between calls to the copyBB method. This will ease scalar/phi code generation later on. While a lot of method signatures were changed this should not cause any real behaviour change. Differential Revision: http://reviews.llvm.org/D7467 llvm-svn: 228443	2015-02-06 21:39:31 +00:00
Johannes Doerfert	0ff23ec544	Model PHI nodes without demoting them This allows us to model PHI nodes in the polyhedral description without demoting them. The modeling however will result in the same accesses as the demotion would have introduced. Differential Revision: http://reviews.llvm.org/D7415 llvm-svn: 228433	2015-02-06 20:13:15 +00:00
Johannes Doerfert	4f33706b53	[NFC] Remove some unnecessary local objects llvm-svn: 227844	2015-02-02 19:41:30 +00:00
Tobias Grosser	c897af3ffc	Correct a typo in a comment llvm-svn: 227569	2015-01-30 12:33:43 +00:00
Johannes Doerfert	9e3a5db000	[FIX] Debug build + instrinsic handling The ignored intrinsics needed to be ignored in three other places as well. Tests and lnt pass now. llvm-svn: 227092	2015-01-26 15:55:54 +00:00
Tobias Grosser	80f6f11330	Make registerPollyPasses public This function is needed for the integration of Polly into Julia. llvm-svn: 225295	2015-01-06 20:40:33 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value accross all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Tobias Grosser	11e3873516	Dead code elimination: Update dependences after eliminating code Without updating dependences we may lose implicit transitive dependences for which all explicit dependences have gone through the statement iterations we have just eliminated. No test case. We should probably implement a -verify-dependences option. This fixes llvm.org/PR21227 llvm-svn: 224459	2014-12-17 21:13:55 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as prerequisity. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	7432a64dcb	Drop unused enum value llvm-svn: 222980	2014-11-30 15:57:07 +00:00
Tobias Grosser	71badac9d6	Remove Polly's IndVarSimplify pass Polly had a copy of this pass to create the canonical induction variables necessary for the non-scev-based code generation. As we now always use SCEV based code generation, canonical induction variables are not needed any more. llvm-svn: 222979	2014-11-30 14:33:41 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop the support the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only every executed with this set of parameter values. Example: void foo(float A[][20], long n, long m { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by the default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
Johannes Doerfert	80ef110cca	[Refactor][NFC] Generalize the creation of ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D6031 llvm-svn: 221512	2014-11-07 08:31:31 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
Tobias Grosser	d213a8b810	BlockGenerator: inline lookupAvailableValue into getValue [NFC] There was no good reason why this code was split accross two functions. In subsequent changes we will change the order in which values are looked up. Doing so would make the split into two functions even more arbitrary. We also slightly improve the documentation. llvm-svn: 221388	2014-11-05 19:46:04 +00:00
Johannes Doerfert	5ad8a6a588	Remove the LoopBounds from the TempScop class. We will use ScalarEvolution in the ScopInfo.cpp to get the loop trip count, not cache it in the TempScop object. Differential Revision: http://reviews.llvm.org/D6070 llvm-svn: 221035	2014-11-01 01:14:56 +00:00
Johannes Doerfert	e3da05ac32	Remove the MaxLoopDepth attribute from the TempScop class Now MaxLoopDepth only lives in Scops not in TempScops anymore. This is the first part of a series of changes to make TempScops obsolete. Differential Revision: http://reviews.llvm.org/D6069 llvm-svn: 221026	2014-11-01 00:12:13 +00:00
Johannes Doerfert	75bd66e51d	[Refactor][NFC] Remove unused argument. llvm-svn: 221016	2014-10-31 23:16:02 +00:00
Johannes Doerfert	7c494217f3	[Refactor][NFC] Map basic blocks to SCoP statements. This will simplify the construction of domains and the modeling of PHI's. llvm-svn: 221015	2014-10-31 23:13:39 +00:00

1 2 3 4 5 ...

333 Commits