llvm-project

Commit Graph

Author	SHA1	Message	Date
David Peixotto	dc0a11c21f	Fix maxLoopDepth computation in ScopInfo The max loop depth was incorrectly computed for scops that contain a block from a loop but do not contain the entire loop. We need to check that the full loop is contained in the region when computing the max loop depth. These scops occur when a region containing an inner loop is expanded to include some blocks from the outer loop, but it cannot be fully expanded to contain the outer loop because the region containing the outer loop is invalid. Differential Revision: http://reviews.llvm.org/D6913 llvm-svn: 225812	2015-01-13 18:31:55 +00:00
Tobias Grosser	0a092763e7	Adjust test for the new 'distinct' metadata nodes 'distinct' was introduced in 225474. We now adjust the test cases to match for the additional 'distinct' marker. llvm-svn: 225512	2015-01-09 08:10:36 +00:00
Tobias Grosser	bfbc3690bb	Add experimental support for unsigned expressions This support is still incomplete and consequently hidden behind a switch that needs to be enabled. One problem is ATM that we incorrectly interpret very large unsigned values as negative values even if used in an unsigned comparision. llvm-svn: 225480	2015-01-09 00:01:33 +00:00
Tobias Grosser	55bc4c0767	Add support for pointer types in expressions llvm-svn: 225464	2015-01-08 19:26:53 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value accross all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Andreas Simbuerger	cd8500e500	(diagnostics) fix typo in test... llvm-svn: 224591	2014-12-19 17:22:46 +00:00
Duncan P. N. Exon Smith	39e21f9c27	Hand-modify a testcase (still PR21532) Bot was still tripping [1] on a testcase the upgrade script didn't handle in 224269. This is still fallout from r224257. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25435 llvm-svn: 224280	2014-12-15 21:43:20 +00:00
Duncan P. N. Exon Smith	bd62edb20d	Run upgrade script from PR21532 to match LLVM changes Update tests for LLVM assembly format change in r224257 using the script attached to PR21532. I'm hoping this unsticks the bot [1]. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25432 llvm-svn: 224269	2014-12-15 20:28:50 +00:00
Tobias Grosser	13e222ca55	Update to the latest version of isl Isl now specifically marks modulo operations that are compared against zero. They can be implemented with the C/LLVM remainder operation. We also update a couple of test cases where the output of isl has slightly changed. llvm-svn: 223607	2014-12-07 16:04:29 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as prerequisity. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop the support the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Hongbin Zheng	c5447f4c3b	Do not incorrectly set the inverted flag. In TempScopInfo::buildCondition we extract the conditions to guard the BB in addition of loop bounds. This means we should only consider the conditions in the paths (in CFG) that do not contain cycles (loops). At the same time, we set the invert flag if the FalseBB of the current branch dominates our target BB to indicate that we reach the target BB with an inverted condition from the current branch. In this case, the path from the FalseBB contains a cycle if the FalseBB is the target of a backedge. The conditions implied by such a path should not be consider. We can identify such a case by checking if the TrueBB also dominates our target BB, which means we can also reach our target BB from the TrueBB, without going through the backedge. llvm-svn: 222907	2014-11-28 03:26:06 +00:00
Tobias Grosser	154d9469f4	Add PreHeader always to OuterLoop This fixes a bug introduce in r217525. llvm-svn: 222766	2014-11-25 17:09:21 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only every executed with this set of parameter values. Example: void foo(float A[][20], long n, long m { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	bab3568105	Modify test cases to work with SCEV based code generation This patch includes tests where we actually need to adjust the CHECK lines for SCEV based code generation. Besides these adjustments we add explicit calls to -polly-codegen-scev=[true\|false] and make sure we test both cases. llvm-svn: 222112	2014-11-16 22:43:21 +00:00
Tobias Grosser	95cd1c718e	Make usage of scev based code generation explicit in tests This is in preparation of using SCEV based codegen by default in polly llvm-svn: 222111	2014-11-16 21:43:28 +00:00
Tobias Grosser	2f8732e7c6	Independent blocks: SE->forget() scalars translated to arrays This prevents SCEVs to reference values not valid any more and as a consequence solves a bug where such values reintroduced during ast generation caused the independent blocks pass to fail validation. http://llvm.org/PR21204 llvm-svn: 222103	2014-11-16 20:33:58 +00:00
Tobias Grosser	b05b038b81	Switch default code generation backend to isl The isl based backend has been tested since a long time and with the recently commited OpenMP support the last missing piece of functionality was ported from the CLooG backend. The isl based backend gives us interesting new functionality: - Run-time alias checks (enabled by default) Optimize scops that contain possibly aliasing pointers. This feature has largely increased the number of loop nests we consider for optimization. Thanks Johannes! - Delinearization (not yet enabled by default) Model accesses to multi-dimensional arrays precisely. This will allow us to understand kernels with multi-dimensional VLAs written in Julia, boost::ublas, coremark or C99. Thanks Sebastian! - Generation of higher quality code Sven and me spent a long time to optimize the quality of the generated code. A major focus were expressions as they result from modulos/divisions or piecewise affine expressions (a ? b : c). - Full/Partial tile separation, polyhedral unrolling The isl code generation provides functionality to generate specialized code for core and cleanup loops and to specialize code using polyhedral context information while unrolling statements. (not yet exploited in Polly) - Modifieable access functions We can now use standard isl functionality to remap memory accesses to new data locations. A standard use case is the use of shared memory, where accesses to a larger region in global memory need to be mapped to a smaller shared memory region using a modulo mapping. (not yet exploited in Polly) The cloog based code generation is still available for comparision, but is scheduled for removal. llvm-svn: 222101	2014-11-16 17:02:11 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improvide heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	d1c12e65cd	Remove one incomplete test case accidentally committed llvm-svn: 222089	2014-11-15 21:34:34 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by the default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
David Peixotto	a4817871d2	Safely generate new loop metadata node Polly was accidently modifying a debug info metadata node when attempting to generate a new unique metadata node for the loop id. The problem was that we had dwarf metadata that referred to a metadata node with a null value, like this: !6 = ... some dwarf metadata referring to !7 ... !7 = {null} When we attempt to generate a new metadata node, we reserve the first space for self-referential node by setting the first argument to null and then mutating the node later to refer to itself. However, because the nodes are uniqued based on pointer values, when we get the new metadata node it actually referred to an existing node (!7 in the example). When we went to modify the metadata to point to itself, we were accidently mutating the dwarf metatdata. We ended up in this situation: !6 = ... some dwarf metadata referring to !7 ... !7 = {!7} and this causes an assert when generating the debug info. The fix is simple, we just need to use a unique value when getting a new metadata node. The MDNode::getTemporary() provides exactly the API we need (and it is used in clang to generate the unique nodes). Differential Revision: http://reviews.llvm.org/D6174 llvm-svn: 221550	2014-11-07 21:44:18 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
David Peixotto	8da2b93d9f	Change the RegionSet type to a SetVector This patch changes the RegionSet type used in ScopDetection from a std::set to a llvm::SetVector. The reason for the change is to ensure deterministic output when printing the result of the analysis. We had a windows buildbot failure for the modified test because the output was coming in a different order. Only one test case needed to be modified for this change. We could use CHECK-DAG directives instead of CHECK in the analysis test cases because the actual order of scops does not matter, but I think that change should be done in a separate patch that modifies all the appliciable tests. I simply modified the test to reflect the expected deterministic output. Differential Revision: http://reviews.llvm.org/D5897 llvm-svn: 220423	2014-10-22 20:39:07 +00:00
Johannes Doerfert	9b5786960d	Relax the condition on the jsop accesses regarding the alignment. We restricted the new access functions to be a subset of the old one because we want to keep the alignment, however if the alignment is "not special", thus the default for the type, we can allow any access. Differential Revision: http://reviews.llvm.org/D5680 llvm-svn: 219503	2014-10-10 15:14:29 +00:00
Johannes Doerfert	341a15a64b	Use the new access function (if present) to compute the access stride. Differential Revision: http://reviews.llvm.org/D5661 llvm-svn: 219499	2014-10-10 14:28:46 +00:00
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the pieceweise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two thing: 1) Handle the case the condition of a select is not a i1 type like C. 2) Try to simplify the pieceweise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constrains. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	f1ee2622be	[Fix] Dead statements should not confuse the RTC generation This fixes http://llvm.org/bugs/show_bug.cgi?id=21166 . Differential Revision: http://reviews.llvm.org/D5623 llvm-svn: 219131	2014-10-06 17:43:00 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class allows to store information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Duncan P. N. Exon Smith	52fd68980c	DI: LLVM schema change: fold constants into string Update debug info testcases for the LLVM metadata schema change in r219010 to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219019	2014-10-03 21:08:48 +00:00
Johannes Doerfert	a441783544	[Fix] Accidently changed the type of a libgomp argument in r219003. Only subsequent patches introduced tests for the signature in the generated IR, thus the tests were wrong too and are adjusted now. llvm-svn: 219017	2014-10-03 20:40:24 +00:00
Johannes Doerfert	1356ac75d1	Put the parallel context alloca into the function entry block. We use lifetime markers to limit the actual life range (similar to clang). Differential Revision: http://reviews.llvm.org/D5582 llvm-svn: 219005	2014-10-03 19:12:05 +00:00
Johannes Doerfert	990cd4c2e2	Add option to limit the maximal number of parallel threads. Differential Revision: http://reviews.llvm.org/D5581 llvm-svn: 219004	2014-10-03 19:11:10 +00:00
Johannes Doerfert	12b355a2ce	[Refactor] Generalize parallel code generation + Generalized function names and comments + Removed OpenMP (omp) from the names and comments + Use common names (non OpenMP specific) for runtime library call creation methodes + Commented the parallel code generator and all its member functions + Refactored some values and methodes Differential Revision: http://reviews.llvm.org/D4990 llvm-svn: 219003	2014-10-03 19:10:13 +00:00
Johannes Doerfert	87901453d9	Align copied load/store instructions as the original. This also forbids the json importer to access other memory locations than the original instruction as we to reuse the alignment of the original load/store. Differential Revision: http://reviews.llvm.org/D5560 llvm-svn: 218883	2014-10-02 16:22:19 +00:00
Johannes Doerfert	ecdf263c07	Allow to annotate alias scopes in the new SCoP. The command line flag -polly-annotate-alias-scopes controls whether or not Polly annotates alias scopes in the new SCoP (default ON). This can improve later optimizations as the new SCoP is basically an alias free environment for them. llvm-svn: 218877	2014-10-02 15:31:24 +00:00
Adrian Prantl	e6579cd9a6	Update testcase to new intrinsic format llvm-svn: 218806	2014-10-01 20:40:12 +00:00
Johannes Doerfert	c7b719fc03	Annotate LLVM-IR for all parallel loops This change allows to annotate all parallel loops with loop id metadata. Furthermore, it will annotate memory instructions with llvm.mem.parallel_loop_access metadata for all surrounding parallel loops. This is especially usefull if an external paralleliser is used. This also removes the PollyLoopInfo class and comments the LoopAnnotator. A test case for multiple parallel loops is attached. llvm-svn: 218793	2014-10-01 20:10:44 +00:00
Johannes Doerfert	eeab05a084	[RTC] Use the domain to split alias groups. We use a parametric abstraction of the domain to split alias groups if accesses cannot be executed under the same parameter evaluation. The two test cases check that we can remove alias groups if the pointers which might alias are never accessed under the same parameter evaluation and that the minimal/maximal accesses are not global but with regards to the parameter evaluation. Differential Revision: http://reviews.llvm.org/D5436 llvm-svn: 218758	2014-10-01 12:42:37 +00:00
Johannes Doerfert	13771738d3	[RTC] Split alias groups according to read only base addresses If there are multiple read only base addresses in an alias group we can split it into multiple alias groups each with only one read only access. This way we might reduce the number of comparisons significantly as it grows linear in the number of alias groups but exponential in their size. Differential Revision: http://reviews.llvm.org/D5435 llvm-svn: 218757	2014-10-01 12:40:46 +00:00
Tobias Grosser	f8a678d2fd	Build domtree of new loops correctly This fixes a bug introduced in r217525. llvm-svn: 218581	2014-09-28 22:40:36 +00:00
Johannes Doerfert	9143d67aba	[RTC] Bail if too many parameters are involved in a RTC access. If too many parameters are involved in accesses used to create RTCs we might end up with enormous compile times and RTC expressions. The reason is that the lexmin/lexmax is dependent on all these parameters and isl might need to create a case for every "ordering" of them (e.g., p0 <= p1 <= p2, p1 <= p0 <= p2, ...). The exact number of parameters allowed in accesses is defined by the command line option -polly-rtc-max-parameters=XXX and set by default to 8. Differential Revision: http://reviews.llvm.org/D5500 llvm-svn: 218566	2014-09-27 11:02:39 +00:00
Tobias Grosser	1eedb67fa6	We do not support alias checks for base pointers defined inside the SCoP The run-time alias check places code that involves the base pointer at the beginning of the SCoP. This breaks if the base pointer is defined inside the SCoP. Hence, we can only create a run-time alias check if we are sure the base pointer is not an instruction defined inside the scop. If it is we refuse to handle the SCoP. This commit should unbreak most of our current LNT failures. Differential Revision: http://reviews.llvm.org/D5483 llvm-svn: 218412	2014-09-24 21:04:29 +00:00
Johannes Doerfert	77bd5ae3d9	[Fix] Allow pointer types as access elements and compare them correctly This fixes two problems which are usualy caused together: 1) The elements of an isl AST access expression could be pointers not only integers, floats and vectores thereof. 2) The runtime alias checks need to compare pointers but if they are of a different type we need to cast them into a "max" type similar to the non pointer case. llvm-svn: 218113	2014-09-19 08:49:02 +00:00
Tobias Grosser	3ee7cdab53	Report possible aliasing deterministically This commit drops a call to std::sort, which sorted the base pointers that possibly alias according to the address at which their corresponding llvm::Value was allocated. There does not seem to be any good reason, why those pointers should be (re)sorted and this only makes the output indeterministic. llvm-svn: 218052	2014-09-18 14:45:43 +00:00
Johannes Doerfert	b9fb5a2cc6	[RTC] Runtime Alias Checks for the ISL backend (missing tests) Test files missing in r218046. llvm-svn: 218047	2014-09-18 11:20:36 +00:00
Johannes Doerfert	b164c795b7	[RTC] Runtime Alias Checks for the ISL backend This change will build all alias groups (minimal/maximal accesses to possible aliasing base pointers) we have to check before we can assume an alias free environment. It will also use these to create Runtime Alias Checks (RTC) in the ISL code generation backend, thus allow us to optimize SCoPs despite possibly aliasing pointers when this backend is used. This feature will be enabled for the isl code generator, e.g., --polly-code-generator=isl, but disabled for: - The cloog code generator (still the default). - The case delinearization is enabled. - The case non-affine accesses are allowed. llvm-svn: 218046	2014-09-18 11:17:17 +00:00
Johannes Doerfert	b7e4083599	Updated to isl 2c19ecd444095d6f560349018f68993bc0e03691 Changed test cases and fixed warnings. llvm-svn: 218043	2014-09-18 11:13:35 +00:00
Johannes Doerfert	0fe35dd088	[Fix] Rewire the Region after a unconditional entry edge is created We use SplitEdge to split a conditional entry edge of the SCoP region. However, SplitEdge can cause two different situations (depending on whether or not the edge is critical). This patch tests which one is present and deals with the former unhandled one. It also refactors and unifies the case we have to change the basic blocks of the SCoP to new ones (see replaceScopAndRegionEntry). llvm-svn: 217802	2014-09-15 18:34:45 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IstAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Tobias Grosser	230acc4445	Delinearize _all_ accesses to a multi-dimensional array Even though we previously correctly detected the multi-dimensional access pattern for accesses with a certain base address, we only delinearized non-affine accesses to this address. Affine accesses have not been touched and remained as single dimensional accesses. The result was an inconsistent description of accesses to the same array, with some being one dimensional and some being multi-dimensional. This patch ensures that all accesses are delinearized with the same dimensionality as soon as a single one of them has been detected as non-affine. While writing this patch, it became evident that the options -polly-allow-nonaffine and -polly-detect-keep-going have not been properly supported in case delinearization has been turned on. This patch adds relevant test coverage and addresses these issues as well. We also added some more documentation to the functions that are modified in this patch. This fixes llvm.org/PR20123 Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217728	2014-09-13 14:47:55 +00:00
Tobias Grosser	bcd4efffa7	Check that the elements of an array have the same size At the moment we assume that only elements of identical size are stored/loaded to a certain base pointer. This patch adds logic to the scop detection to verify this. Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217727	2014-09-13 14:47:40 +00:00
Tobias Grosser	3762bd34e7	Improve test coverage for non-affine access functions We now verify that such functions are correctly detected even in combination with delinearization. This change is added to ensure we have good test coverage for the subsequent delinearization fix. We also remove unnecessary instructions from the test case. llvm-svn: 217664	2014-09-12 09:07:56 +00:00
Tobias Grosser	0ef617dda0	Remove executable bit on test files Some test files had been marked executable by accident. llvm-svn: 217663	2014-09-12 09:07:50 +00:00
Johannes Doerfert	dd5c144246	Allow to generate a loop without the GuardBB This allows us to omit the GuardBB in front of created loops if we can show the loop trip count is at least one. It also simplifies the dominance relation inside the new created region. A GuardBB (even with a constant branch condition) might trigger false dominance errors during function verification. Differential Revision: http://reviews.llvm.org/D5297 llvm-svn: 217525	2014-09-10 17:33:32 +00:00
Johannes Doerfert	3826224428	[Refactor] Cleanup isl code generation Summary: + Refactor the runtime check (RTC) build function + Added helper function to create an PollyIRBuilder + Change the simplify region function to create not only unique entry and exit edges but also enfore that the entry edge is unconditional + Cleaned the IslCodeGeneration runOnScop function: - less post-creation changes of the created IR + Adjusted and added test cases Reviewers: grosser, sebpop, simbuerg, dpeixott Subscribers: llvm-commits, #polly Differential Revision: http://reviews.llvm.org/D5076 llvm-svn: 217508	2014-09-10 14:50:23 +00:00
David Peixotto	9690f3b596	Add -e to test generation script The -e flag exits the script with a non-zero code if any subcommand fails. This flag allows us to notice as early as possible if the test was not properly regenerated using a command like: $ create_ll.sh t.c && opt < t.ll -polly ... The above pattern is useful when iteratively developing a test case to guard against un-noticed syntax errors. Differential Revision: http://reviews.llvm.org/D5276 llvm-svn: 217463	2014-09-09 22:14:38 +00:00
Johannes Doerfert	8e95dc657e	[Fix] OpenMP parallel loop detection for the isl backend There was a bug in the IslAst which caused that no more outermost parallel loops were detected/checked after a parallel outermost loop of depth 1. + Test case attached llvm-svn: 217452	2014-09-09 17:03:54 +00:00
Tobias Grosser	e7e33ba13a	Always pipe in test files In Polly we used to have a mix of test cases, some that used 'opt %s' and others that used 'opt < %s'. We now change all to use 'opt < %s'. Piping in test files is preferable as it does prevent temporary files to be written to disk. This brings us in line with what is usus in LLVM. llvm-svn: 216816	2014-08-30 09:15:04 +00:00
Tobias Grosser	2faa569c0a	Replace %defaultOpts with explicit pass names This replaces the use of %defaultOpts = '-basicaa -polly-prepare' with the minimal set of passes necessary for a test to succeed. Of the test cases that previously used %defaultOpts 76 test cases require none of these passes, 42 need -basicaa and only 2 need -polly-prepare. Our change makes this requirement explicit. In Polly many test cases have been using a macro '%defaultOpts' which run a couple of preparing passes before the actual Polly test case. This macro was introduced very early in the development of Polly and originally contained a large set of canonicalization passes. However, as the need for additional canonicalization passes makes test cases harder to understand and also more fragile in terms of changes in such passes, we aim since a longer time to only include the minimal set of passes necessary. This patch removes the last leftovers from of %defaultOpts and brings our tests cases more in line to what is usus in LLVM itself. llvm-svn: 216815	2014-08-30 09:13:28 +00:00
Johannes Doerfert	9e7b17b0d4	Added arcanist linters and cleaned errors and warnings Arcanist (arc) will now always run linters before uploading any new commit to Phabricator. All errors/warnings (or their absence) will be shown in the web interface together with a explanation by the commiter (arcanist will ask the commiter if the build was not clean). The linters include: - clang-format - spelling check - permissions check (aka. chmod) - filename check - merge conflict marker check Note, that their scope is sometimes limited (see .arclint for details). This commit also fixes all errors and warnings these linters reported, namely: - spelling mistakes and typos - executable permissions for various text files Differential Revision: http://reviews.llvm.org/D4916 llvm-svn: 215871	2014-08-18 00:40:13 +00:00
Andreas Simbuerger	6bf77979e0	Diagnostic: Provide end-user message for non-affine loop bound errors llvm-svn: 215832	2014-08-17 10:09:15 +00:00
Andreas Simbuerger	d46b935267	Diagnostic: Provide end-user message for non-affine access function errors llvm-svn: 215831	2014-08-17 10:09:11 +00:00
Andreas Simbuerger	f29f625748	Diagnostic: Provide end-user message for aliasing errors This will spill out information about LLVM-internals. However, in cases where the name of the Value matches the name of the array in the source, we provide more useful information. In cases where we spill internals, the information still might help the user to pin down the correct arrays. The problem we face here is: The error is pinned to the debug location of one of the offending values out of the alias set instead of all of them. The more information we give the user about the set of aliasing pointers the better. llvm-svn: 215830	2014-08-17 10:09:07 +00:00
Tobias Grosser	2873594709	Revert "[Refactor] Cleanup runtime code generation" This reverts commit 215466 (and 215528, a trivial formatting fix). The intention of these commits is a good one, but unfortunately they broke our LNT buildbot: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly-codegen-isl Several of the cleanup changes that have been combined in this 'fixup' are trivial and could probably be committed as obvious changes without risking to break the build. The remaining changes are little and it should be easy to figure out what went wrong. llvm-svn: 215817	2014-08-16 09:09:15 +00:00
Tobias Grosser	f4daf34496	Revert "Added support for modulo expressions" This reverts commit 215684. The intention of the commit is great, but unfortunately it seems to be the cause of 14 LNT test suite failures: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly/builds/116 To make our buildbots and performance testers green until this issue is solved, we temporarily revert this commit. llvm-svn: 215816	2014-08-16 09:08:55 +00:00
Johannes Doerfert	5130c849aa	Added support for modulo expressions The support is limited to signed modulo access and condition expressions with a constant right hand side, e.g., A[i % 2] or A[i % 9]. Test cases are modified according to this new feature and new test cases are added. Differential Revision: http://reviews.llvm.org/D4843 llvm-svn: 215684	2014-08-15 01:14:11 +00:00
Johannes Doerfert	9744c4af16	[Refactor] Cleanup runtime code generation + Use regexp in two test case. + Refactor the runtime condition build function llvm-svn: 215466	2014-08-12 18:35:54 +00:00
Johannes Doerfert	fab63f7791	[Minor] Change the number of cut lines for new tests This should cut all metadata community clang produces. llvm-svn: 215422	2014-08-12 03:31:23 +00:00
Tobias Grosser	f57d63f906	Do allow negative offsets in the outermost array dimension There is no needed for neither 1-dimensional nor higher dimensional arrays to require positive offsets in the outermost array dimension. We originally introduced this assumption with the support for delinearizing multi-dimensional arrays. llvm-svn: 214665	2014-08-03 21:07:30 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	b5d1c322f2	Update the jscop tests and port them to isl codegen. The updated tests use a different context than the old ones did. Other than that only their path and the code generation we use changed. llvm-svn: 214657	2014-08-03 01:48:49 +00:00
Tobias Grosser	8c112d838c	Mark a GPGPU test case as XFAIL This area of code is currently not very much tested. It will hopefully be superseeded by Yabin's GSoC project. llvm-svn: 214633	2014-08-02 13:37:32 +00:00
Johannes Doerfert	b41344a88f	[Fix] Annotate the IslAst with broken reductions (Missing files) + test cases of r214489. llvm-svn: 214491	2014-08-01 08:20:26 +00:00
Johannes Doerfert	32868bf4c3	Change the printing of reduction types We use the C operator representation when applicable. + Update all the test cases accordingly. llvm-svn: 214486	2014-08-01 08:13:25 +00:00
Johannes Doerfert	0eefb0258f	[Refactor] Use nicer print callback function in IslAst llvm-svn: 214447	2014-07-31 21:33:49 +00:00
Rafael Espindola	08dfd8f25f	Update for llvm change. llvm-svn: 214358	2014-07-30 23:17:15 +00:00
Tobias Grosser	924e9e0226	IslAst: Enhance parallelism detection test Add more check lines to ensure we do not accidentally generate nested openmp parallel annotations. llvm-svn: 214200	2014-07-29 19:22:46 +00:00
Johannes Doerfert	af9b1e2d80	[Refactor] Remove containsLoop to find innermost loops Use the fact that if we visit a for node first in pre and next in post order we know we did not visit any children, thus we found an innermost loop. + Test case for an innermost loop with a conditional inside llvm-svn: 213870	2014-07-24 15:59:06 +00:00
Johannes Doerfert	f6583176ab	[Refactor] Unify IslAst print methods + Add const annotations to some member functions llvm-svn: 213779	2014-07-23 18:14:43 +00:00
Johannes Doerfert	43e1eadf26	[Refactor] Use attributes to mark function as invalid for polly + Test case annotated with the new attribute + Modified test case to check if subfunctions are annotated llvm-svn: 213093	2014-07-15 21:06:48 +00:00
Johannes Doerfert	457f73eaee	Annotate reduction parallel loops in the IslAst textual output + Introduced dependency type TYPE_TC_RED to represent the transitive closure (& the reverse) of reduction dependences. These are used when we check for reduction parallel loops. + Test cases including loop reversals and modulo schedules which compute reductions in a alternated order. llvm-svn: 213019	2014-07-15 00:00:35 +00:00
Tobias Grosser	c2920ff747	DeadCodeElimination: Fix liveout computation We move back to a simple approach where the liveout is the last must-write statement for a data-location plus all may-write statements. The previous approach did not work out. We would have to consider per-data-access dependences, instead of per-statement dependences to correct it. As this adds complexity and it seems we would not gain anything over the simpler approach that we implement in this commit, I moved us back to the old approach of computing the liveout, but enhanced it to also add may-write accesses. We also fix the test case and explain why we can not perform dead code elimination in this case. llvm-svn: 212925	2014-07-14 08:32:01 +00:00
Tobias Grosser	780ce0f8e3	DeadCodeElim: Compute correct liveout for non-affine accesses Thanks to Johannes Doerfert for narrowing down the bug. Reported-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 212796	2014-07-11 07:12:10 +00:00
Tobias Grosser	5e6813d184	Derive run-time conditions for delinearization As our delinearization works optimistically, we need in some cases run-time checks that verify our optimistic assumptions. A simple example is the following code: void foo(long n, long m, long o, double A[n][m][o]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 150; j++) for (long k = 0; k < 200; k++) A[i][j][k] = 1.0; } After clang linearized the access to A and we delinearized it again to A[i][j][k] we need to ensure that we do not access the delinearized array out of bounds (this information is not available in LLVM-IR). Hence, we need to verify the following constraints at run-time: CHECK: Assumed Context: CHECK: [o, m] -> { : m >= 150 and o >= 200 } llvm-svn: 212198	2014-07-02 17:47:48 +00:00
Johannes Doerfert	f618339a37	Introduce reduction types This change is particularly useful in the code generation as we need to know which binary operator/identity element we need to combine/initialize the privatization locations. + Print the reduction type for each memory access + Adjusted the test cases to comply with the new output format and to test for the right reduction type llvm-svn: 212126	2014-07-01 20:52:51 +00:00
Johannes Doerfert	9890a05287	[FIX] Don't consider reductions which are partially outside the SCoP + Test case llvm-svn: 212080	2014-07-01 00:32:29 +00:00
Johannes Doerfert	1a62c7a34a	[Fix] Deleted renamed test after r211957 llvm-svn: 211964	2014-06-27 21:48:42 +00:00
Johannes Doerfert	e58a012094	Allow multiple reductions per statement Iterate over all store memory accesses and check for valid binary reduction candidate loads by following the operands of the stored value. For each candidate pair we check if they have the same base address and there are no other accesses which may overlap with them. This ensures that no intermediate value can escape into other memory locations or is overwritten at some point. + 17 test cases for reduction detection and reduction dependency modeling llvm-svn: 211957	2014-06-27 20:31:28 +00:00
Andreas Simbuerger	b379edbb3e	Don't expand to invalid Scops with -polly-detect-keep-going Enabling -keep-going in ScopDetection causes expansion to an invalid Scop candidate. Region A <- Valid candidate \| Region B <- Invalid candidate If -keep-going is enabled, ScopDetection would expand A to A+B because the RejectLog is never checked for errors during expansion. With this patch only A becomes a valid Scop. llvm-svn: 211875	2014-06-27 06:21:14 +00:00
Johannes Doerfert	76dd493eff	[Fix] Broken tests after r211796. llvm-svn: 211797	2014-06-26 19:29:11 +00:00
Johannes Doerfert	f8ee915deb	Use wrapped reduction dependences This change will ease the transision to multiple reductions per statement as we can now distinguish the effects of multiple reductions in the same statement. + Wrapped reduction dependences are used to compute privatization dependences + Modified test cases to account for the change llvm-svn: 211795	2014-06-26 18:44:14 +00:00
Johannes Doerfert	ea23b1d561	Hybrid dependency analysis This dependency analysis will keep track of memory accesses if they might be part of a reduction. If not, the dependences are tracked on a statement level. The main reason to do this is to reduce the compile time while beeing able to distinguish the effects of reduction and non-reduction accesses. + Adjusted two test cases llvm-svn: 211794	2014-06-26 18:38:08 +00:00
Andreas Simbuerger	99d4ab2b84	Add diagnostic remark for ReportVariantBasePtr llvm-svn: 211777	2014-06-26 13:33:35 +00:00
Andreas Simbuerger	5569bf300d	Support the new DiagnosticRemarks Add support for generating optimization remarks after completing the detection of Scops. The goal is to provide end-users with useful hints about opportunities that help to increase the size of the detected Scops in their code. By default the remark is unspecified and the debug location is empty. Future patches have to expand on the messages generated. This patch brings a simple test case for ReportFuncCall to demonstrate the feature. Reports all missed opportunities to increase the size/number of valid Scops: clang <...> -Rpass-missed="polly-detect" <...> opt <...> -pass-remarks-missed="polly-detect" <...> Reports beginning and end of all valid Scops: clang <...> -Rpass="polly-detect" <...> opt <...> -pass-remarks="polly-detect" <...> Differential Revision: http://reviews.llvm.org/D4171 llvm-svn: 211769	2014-06-26 10:06:40 +00:00
Tobias Grosser	50a5e6dac0	test/ScopInfo: Remove %defaultOpts and list passes explicitly Due to bad habit we sometimes used a variable %defaultOpts that listed a set of passes commonly run to prepare for Polly. None of these test cases actually needs special preparation and only two of them need the 'basicaa' to be scheduled. Scheduling the required alias analysis explicitly makes the test cases clearer. llvm-svn: 211671	2014-06-25 06:38:18 +00:00
Tobias Grosser	08031390d5	Clean up XFAILed test cases We had a set of test cases that have been incomplete and XFAILED. This patch completes a couple of the interesting ones and removes the ones which seem redundant or not sufficiently reduced to be useful. llvm-svn: 211670	2014-06-25 06:31:19 +00:00
Johannes Doerfert	5e275bc83a	[Refactor] Create nicer test cases from C/C++ Insert a header into the new testcase containing a sample RUN line a FIXME and an XFAIL. Then insert the formated C code and finally the LLVM-IR without attributes, the module ID or the target triple. llvm-svn: 211612	2014-06-24 17:02:53 +00:00
Yabin Hu	cc91169fd7	Remove use of llvm.codegen intrinsic for GPGPU codegen We use llvm.codegen intrinsic to generate code for embedded LLVM-IR strings. The reason we introduce such a intrinsic is that previous clang/opt tools was NOT linked with various LLVM targets and their AsmParsers and AsmPrinters. Since clang/opt been linked with all the needed libraries, we no longer need the llvm.codegen intrinsic. llvm-svn: 211573	2014-06-24 08:11:36 +00:00
Johannes Doerfert	f1906138b4	Model statement wise reduction dependences + Collect reduction dependences + Introduced TYPE_RED in Dependences.h which can be used to obtain the reduction dependences + Used TYPE_RED to prevent parallelization while we do not have a privatizing code generation + Relax the dependences for non-parallel code generation + Add privatization dependences to ensure correctness + 12 Test cases to check for reduction and privatization dependences llvm-svn: 211369	2014-06-20 16:37:11 +00:00
Johannes Doerfert	da80386700	Missing reduction detection test cases llvm-svn: 211235	2014-06-18 23:08:14 +00:00
Tobias Grosser	f4fcbf4097	Test delinearization of 2D diagonal matrix llvm-svn: 210538	2014-06-10 14:48:17 +00:00
Tobias Grosser	be7eaddc69	Adjust another test case to not access out of bounds llvm-svn: 210208	2014-06-04 19:41:47 +00:00
Tobias Grosser	5416a0395f	Adjust multidim test cases to not access out-of-bound memory We do this currently only for test cases where we have integer offsets that clearly access array dimensions out-of-bound. -; for (long i = 0; i < n; i++) -; for (long j = 0; j < m; j++) -; for (long k = 0; k < o; k++) +; for (long i = 0; i < n - 3; i++) +; for (long j = 4; j < m; j++) +; for (long k = 0; k < o - 7; k++) ; A[i+3][j-4][k+7] = 1.0; This will be helpful if we later want to simplify the access functions under the assumption that they do not access memory out of bounds. llvm-svn: 210179	2014-06-04 11:47:54 +00:00
Sebastian Pop	422e33f363	record delinearization result and reuse it in polyhedral translation Without this patch, the testcase would fail on the delinearization of the second array: ; void foo(long n, long m, long o, double A[n][m][o]) { ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) { ; A[i+3][j-4][k+7] = 1.0; ; A[i][0][k] = 2.0; ; } ; } ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[3 + i0, -4 + i1, 7 + i2] }; ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; Here is the output of FileCheck on the testcase without this patch: ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; ^ <stdin>:26:2: note: possible intended match here [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[o0] }; ^ It is possible to find a good delinearization for A[i][0][k] only in the context of the delinearization of both array accesses. There are two ways to delinearize together all array subscripts touching the same base address: either duplicate the code from scop detection to first gather all array references and then run the delinearization; or as implemented in this patch, use the same delinearization info that we computed during scop detection. llvm-svn: 210117	2014-06-03 18:16:31 +00:00
Johannes Doerfert	c3958b214c	Added option for n-dimensional rectangular tiling + CL-option --polly-tile-sizes=<int,...,int> The i'th value is used as a tile size for dimension i, if there is no i'th value, the value of --polly-default-tile-size is used + CL-option --polly-default-tile-size=int Used if no tile size is given for a dimension i + 3 Simple testcases llvm-svn: 209753	2014-05-28 17:21:02 +00:00
Tobias Grosser	5f860fdfe9	Do not run GPGPU test cases without nvptx target Tag the GPGPU codegen test cases as unsupported if the nvptx target is not included in the current llvm build. Contributed-by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 208779	2014-05-14 14:18:14 +00:00
Sebastian Pop	c5c1055e3f	do not build llc and lli for polly test llvm-svn: 208619	2014-05-12 19:43:20 +00:00
Sebastian Pop	e8863b8f00	correct the delinearization failing case collect terms from affine and non affine memory accesses llvm-svn: 208616	2014-05-12 19:02:02 +00:00
Sebastian Pop	fcf68758b8	unxfail passing testcase llvm-svn: 208233	2014-05-07 18:01:32 +00:00
Tobias Grosser	f56af204b9	Add delinearization testcase for ivs that do not follow the loop order This is a test case that is currently failing, but that should start working with an upcoming version of our delinearization pass. llvm-svn: 207678	2014-04-30 17:49:22 +00:00
Tobias Grosser	841009a2cc	We missed two files in the last commit. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206901	2014-04-22 15:57:30 +00:00
Tobias Grosser	0d11dbabc4	Fixed missing cloog test with automake/configure build setup Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206900	2014-04-22 15:30:43 +00:00
Tobias Grosser	954939842f	Really fix the load case. Commit r206510 falsely advertised to fix the load cases, even though it only fixed the store case. This commit adds the same fix for the load case including the missing test coverage. llvm-svn: 206577	2014-04-18 09:46:35 +00:00
Tobias Grosser	50fd7010d8	Ensure a scalar pointer when issuing a vector load Even tough we may want to generate a vector load, the address from which to load still is a scalar. Make sure even if previous address computations may have been vectorized, that the addresses are also available as scalars. This fixes http://llvm.org/PR19469 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206510	2014-04-17 23:13:49 +00:00
Tobias Grosser	75b76729ab	Fix for vector codegen in OpenMP subfunctions Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206332	2014-04-15 22:30:06 +00:00
Tobias Grosser	364c136d08	Dependences: Do not fail in case a schedule eliminates all dependences The following example shows a non-parallel loop void f(int a[]) { int i; for (i = 0; i < 10; ++i) A[i] = A[i+5]; } which, in case we import a schedule that limits the iteration domain to 0 <= i < 5, becomes parallel. Previously we crashed in such cases, now we just recognize it as parallel. This fixes http://llvm.org/PR19435 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206318	2014-04-15 20:14:57 +00:00
Tobias Grosser	efc3013544	Codegeneration: Free memory correctly when using -polly-vectorizer=polly This fixes PR19421. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206156	2014-04-14 08:33:24 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Tobias Grosser	79baa21242	ScopInfo: Scalar accesses are zero dimensional llvm-svn: 205958	2014-04-10 08:38:02 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	64b95123ef	Delete trivial PHI nodes (aka stack slot sharing) During code preperation trivial PHI nodes (mainly introduced by lcssa) are deleted to decrease the number of introduced allocas (==> dependences). However simply replacing them by their only incoming value would cause the independent block pass to introduce new allocas. To prevent this we try to share stack slots during code preperarion, hence to reuse a already created alloca 'to demote' the trivial PHI node. This works if we know that the value stored in this alloca will be the incoming value of the trivial PHI at the end of the predecessor block of this trivial PHI. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 205320	2014-04-01 16:01:33 +00:00
Tobias Grosser	5fa36c0ff6	Updated test/create_ll.sh to work with old & new clang versions. We explicitly specifying all filenames instead of assuming some naming convention used by clang and opt. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 204726	2014-03-25 15:50:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Tobias Grosser	8111a0ae7d	autoconf: Fix module loading in tests llvm-svn: 203925	2014-03-14 13:27:26 +00:00
Sebastian Pop	7537be92f4	add -load polly.so only when not LINK_POLLY_INTO_TOOLS llvm-svn: 203888	2014-03-14 04:04:36 +00:00
Rafael Espindola	80f20133d4	Fix polly tests to not include aliases to declarations. llvm-svn: 203721	2014-03-12 21:48:42 +00:00
Sebastian Pop	1b57e8f028	add dependence of check-polly on llc to avoid an error when directly doing ninja check-polly after cmake 'Could not find llc in .../ninja/bin'. llvm-svn: 203696	2014-03-12 18:55:25 +00:00
Tobias Grosser	4ba60fe9eb	ScheduleOptimizer: Fix prevectorization. In case we are at the innermost band, we try to prepare for vectorization. This means, we look for the innermost parallel loop and strip mine this loop to the innermost level using a strip-mine factor corresponding to the number of vector iterations. For whatever reason, the code that implemented this feature was broken. We now added a comment, a test case and obviously also the right code. llvm-svn: 203544	2014-03-11 06:27:36 +00:00
Tobias Grosser	e655754d57	Update CLooG and some test cases This is necessary to avoid test failures in the CLooG test suite due to the recent isl update. We also need to update two polly test cases which rely on a certain order in the textual description that isl chooses for its sets and maps. Changes here are not often, but we should probably switch to a check that verifies such maps are semantically equivalent instead of represented identically. llvm-svn: 203476	2014-03-10 17:31:22 +00:00
Tobias Grosser	37c9b8e0f2	Emit llvm.loop metadata for parallel loops For now we only mark innermost loops for the loop vectorizer. We could later also mark not-innermost loops to enable the introduction of openmp parallelism. llvm-svn: 202854	2014-03-04 14:59:00 +00:00
Tobias Grosser	356faa8f09	Dead code elimination: Schedule another approximative step before actual DCE In 'obsequi' we have a scop in which the current dead code elimination works, but the generated code is way too complex. To avoid this trouble (and to not disable the DCE entirely) we add an additional approximative step before the actual dead code elimination. This should fix one of the two current nightly-test issues. Polly could be improved to handle 'obsequi' by teaching it to introduce only a single parameter for (%1 and zext %1) which halves the number of parameters and allows polly to derive a simpler representation for the set of live iterations. However, this needs some time to investigate. I will commit a test case as soon as we have a reduced one. llvm-svn: 202010	2014-02-24 08:52:20 +00:00
Tobias Grosser	472d3b7037	codegen: Update LoopInfo correctly Add the 'polly.start' basic block to the loop that surrounds the scop we just codegenerate. This fixes PR13441 llvm-svn: 202000	2014-02-24 00:50:49 +00:00
Tobias Grosser	38c36ea18e	Do not fail in case we do not have valid dependences In case we do not have valid dependences, we do not run dead code elimination or the schedule optimizer. This fixes an infinite loop in the dead code elimination (PR12110). llvm-svn: 201982	2014-02-23 15:15:44 +00:00
Tobias Grosser	88640d2b47	Use -polly-codegen-isl in isl-codegen test Reported-by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 201902	2014-02-21 23:08:54 +00:00
Tobias Grosser	817d51dd1b	DCE: Switch to hybrid precise-unprecise analysis Instead of giving a choice between a precise (but possibly very complex) analysis and an approximative analysis we now use a hybrid approach which uses N precise steps followed by one approximating step. The precision of the analysis can be changed by increasing N. With a default of 'N' = 2, we get fully precise results for our current test cases and should not run into performance problems for more complex test cases. We can adjust this value when we got more experience with this dead code elimination. llvm-svn: 201888	2014-02-21 20:51:46 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Tobias Grosser	37eb422f69	Add polyhedral dead code elimination. This pass eliminates loop iterations that compute results that are not used later on. This can help e.g. in D, where the default zero-initialization is often unnecessary if right after new values are assigned to an array. Contributed-by: Peter Conn <conn.peter@gmail.com> llvm-svn: 201817	2014-02-20 21:43:54 +00:00
Tobias Grosser	d6aafa7c2e	Do not track location of scalar dependences in ScopInfo We do not have a use for this information at the moment. If we need this at some point, the "instruction -> access" mapping needs to be enhanced as a single instruction could then possibly perform multiple accesses. This patch allows us to build the polyhedral information for scops with scalar dependences. llvm-svn: 201815	2014-02-20 21:29:09 +00:00
Tobias Grosser	a1689937ba	Check scops a second time before working on them In rare cases the modification of one scop can effect the validity of other scops, as code generation of an earlier scop may make the scalar evolution functions derived for later scops less precise. The example that triggered this patch was a scop that contained an 'or' expression as follows: %add13710 = or i32 %j.19, 1 --> {(1 + (4 * %l)),+,2}<nsw><%for.body81> Scev could only analyze the 'or' as it knew %j.19 is a multiple of 2. This information was not available after the first scop was code generated (or independent-blocks was run on it) and SCEV could not derive a precise SCEV expression any more. This means we could not any more code generate this SCoP. My current understanding is that there is always the risk that an earlier code generation change invalidates later scops. As the example we have seen here is difficult to avoid, we use this occasion to guard us against all such invalidations. This patch "solves" this issue by verifying right before we start working on a detected scop, if this scop is in fact still valid. This adds a certain overhead. However the verification we run is anyways very fast and secondly it is only run on detected scops. So the overhead should not be very large. As a later optimization we could detect scops only on demand, such that we need to run scop-detections always only a single time. This should fix the single last failure in the LLVM test-suite for the new scev-based code generation. llvm-svn: 201593	2014-02-18 18:49:49 +00:00
Tobias Grosser	933edd04af	IndependentBlocks: Do not assert for PHI nodes outside of scops There does not seem to be a reason that we can not support PHI nodes outside of the scop that reference values within the SCoP. Or at least, the attached test case seems to do the right thing. We remove the assert for now. llvm-svn: 200427	2014-01-29 23:08:10 +00:00
Tobias Grosser	28a70c543d	ScopDetect: Transitively remove all children after region expansion In rare cases, a region R which is itself not valid has an indirect child region that is valid. When R becomes part of a valid region by expansion of another region, then all children of R have to be erased from the set of valid regions. This patch ensures that indirect children are erased in addition to direct children. Contributed-by: Armin Groesslinger <armin.groesslinger@uni-passau.de> Tobias: I added a reduced test case and adjusted the logic of the patch to only recurse until the first child is found. llvm-svn: 200411	2014-01-29 19:05:30 +00:00
Tobias Grosser	458fb78cfa	Check if array base addresses are invariant Array base addresses need to be invariant in the region considered. The base address has to be computed outside the region, or, when it is computed inside, the value must not change with the iterations of the loops. For example, when a two-dimensional array is represented as a pointer to pointers the base address A[i] in an access A[i][j] changes with i; therefore, such regions have to be rejected. Contributed by: Armin Größlinger <armin.groesslinger@uni-passau.de> llvm-svn: 200314	2014-01-28 12:58:58 +00:00
Tobias Grosser	5b5daab9f1	Add more test cases to check loop invariance of the base pointer. llvm-svn: 200305	2014-01-28 10:29:17 +00:00
Tobias Grosser	24d7e669b3	Do not test polybench with 'make check-polly' Those test cases should be tested in the LLVM test suite. For Polly we should extract regression tests for the individual passes. llvm-svn: 200206	2014-01-27 10:37:33 +00:00
Tobias Grosser	54646f7fab	Remove other unneccessary uses of -O3 in the test suite The polly test suite is now -O3 clean. llvm-svn: 200205	2014-01-27 10:37:06 +00:00

1 2 3 4 5 ...

419 Commits