llvm-project

Author	SHA1	Message	Date
Tobias Grosser	a8cd15249a	Remove redundant semicolon clang-format complained about llvm-svn: 226402	2015-01-18 15:59:16 +00:00
Chandler Carruth	5ec3333d24	[PM] Update Polly for LLVM r226394 and r226396 which changed some of the block splitting interfaces to accept specific analyses rather than a pass. llvm-svn: 226398	2015-01-18 10:52:23 +00:00
Chandler Carruth	6adcf56b0f	[PM] Update Polly for LLVM r226385 which made LoopInfo actually derive from LoopInfoBase<...>, removing the need for the awkward getBase() dance. llvm-svn: 226387	2015-01-18 01:47:30 +00:00
Chandler Carruth	f557987b15	[PM] Update Polly following LLVM r226373 which refactors LoopInfo in preparation for the new pass manager. llvm-svn: 226374	2015-01-17 14:16:56 +00:00
Chandler Carruth	be742b745b	[PM] Track an LLVM API change by switching this code to directly create the wrapper pass for TLI which is now separate from the core class. llvm-svn: 226159	2015-01-15 10:43:18 +00:00
Chandler Carruth	d281ed5109	[PM] Update for LLVM r226078 which moved TargetLibraryInfo to the Analysis library. llvm-svn: 226080	2015-01-15 02:17:27 +00:00
Tobias Grosser	c642e95402	Use types of matching size when generating multi-dimensional address expressions This change ensures that the values that represent the array size of a multi-dimensional access are correctly sign-extended when used to compute a memory address used in the run-time alias check. To make the test case more readable, we name the instructions that we generate. llvm-svn: 225818	2015-01-13 19:37:59 +00:00
David Peixotto	dc0a11c21f	Fix maxLoopDepth computation in ScopInfo The max loop depth was incorrectly computed for scops that contain a block from a loop but do not contain the entire loop. We need to check that the full loop is contained in the region when computing the max loop depth. These scops occur when a region containing an inner loop is expanded to include some blocks from the outer loop, but it cannot be fully expanded to contain the outer loop because the region containing the outer loop is invalid. Differential Revision: http://reviews.llvm.org/D6913 llvm-svn: 225812	2015-01-13 18:31:55 +00:00
Tobias Grosser	2784b0803a	Check assert before the instruction that uses the assumption checked. I did not find a good test case, as the assert should generally hold in Polly. llvm-svn: 225582	2015-01-10 07:40:39 +00:00
Tobias Grosser	bfbc3690bb	Add experimental support for unsigned expressions This support is still incomplete and consequently hidden behind a switch that needs to be enabled. One problem is ATM that we incorrectly interpret very large unsigned values as negative values even if used in an unsigned comparison. llvm-svn: 225480	2015-01-09 00:01:33 +00:00
Tobias Grosser	55bc4c0767	Add support for pointer types in expressions llvm-svn: 225464	2015-01-08 19:26:53 +00:00
Tobias Grosser	021eaef044	Fix a crash in our error reporting AF = dyn_cast<SCEVAddRecExpr>(Pair.second) may be NULL for some SCEVs that we do not support. When reporting the error we still want to pass a pointer that is known to always be non-NULL. I do not yet have a test case for this, unfortunately. llvm-svn: 225461	2015-01-08 19:03:10 +00:00
Tobias Grosser	314587d743	Use explicit StringRef Type We previously used a Twine here, but as pointed out by David Blaikie and Mehdi Amini storing a temporary StringRef in a Twine is not a good idea, as the StringRef will be freed before the Twine is used leaving a Twine that points to uninitialized memory. We now make it explicit that we use a StringRef here. llvm-svn: 225342	2015-01-07 07:43:34 +00:00
Tobias Grosser	5dfcb1a7e0	Do not use a const Twine here This has caused segfaults when using Polly in the context of Julia, that are not reproducible on my machine in 'make check-polly'. llvm-svn: 225326	2015-01-07 00:30:01 +00:00
Tobias Grosser	80f6f11330	Make registerPollyPasses public This function is needed for the integration of Polly into Julia. llvm-svn: 225295	2015-01-06 20:40:33 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value across all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Andreas Simbuerger	6492b700c3	(diagnostics) Fix typo. Thanks Albert Cohen. llvm-svn: 224584	2014-12-19 16:49:39 +00:00
Tobias Grosser	11e3873516	Dead code elimination: Update dependences after eliminating code Without updating dependences we may lose implicit transitive dependences for which all explicit dependences have gone through the statement iterations we have just eliminated. No test case. We should probably implement a -verify-dependences option. This fixes llvm.org/PR21227 llvm-svn: 224459	2014-12-17 21:13:55 +00:00
Tobias Grosser	7b00d8f979	Do not run dead code elimination by default The dead code elimination is a pass that looks very promising, but needs some more compile-time tuning before enabling it by default seems sensible. llvm-svn: 223965	2014-12-10 21:12:23 +00:00
Tobias Grosser	bd8f3c1f61	Unbreak after LLVM's metadata split in r223802 llvm-svn: 223838	2014-12-09 22:02:16 +00:00
Tobias Grosser	05fdab9362	Simplify computation of reduction dependences This simplifies the construction of the input for the reduction dependence computation and at the same time removes an assumption that expects the schedule to be of 2D + 1 form (the odd dimensions giving textual order, the even dimensions the loop iterations). llvm-svn: 223621	2014-12-07 22:51:45 +00:00
Duncan P. N. Exon Smith	16173b7f6c	Stop relying on MDNode::get() returning self-references Update for LLVM API change, wherein self-references are not uniqued (and will never be returned by `MDNode::get()`). We already have `Id`; just return it. This should fix the failing buildbot: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25167 http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25168 llvm-svn: 223620	2014-12-07 21:12:10 +00:00
Tobias Grosser	13e222ca55	Update to the latest version of isl Isl now specifically marks modulo operations that are compared against zero. They can be implemented with the C/LLVM remainder operation. We also update a couple of test cases where the output of isl has slightly changed. llvm-svn: 223607	2014-12-07 16:04:29 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as prerequisite. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	71badac9d6	Remove Polly's IndVarSimplify pass Polly had a copy of this pass to create the canonical induction variables necessary for the non-scev-based code generation. As we now always use SCEV based code generation, canonical induction variables are not needed any more. llvm-svn: 222979	2014-11-30 14:33:41 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop support for the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Hongbin Zheng	c5447f4c3b	Do not incorrectly set the inverted flag. In TempScopInfo::buildCondition we extract the conditions to guard the BB in addition to loop bounds. This means we should only consider the conditions in the paths (in CFG) that do not contain cycles (loops). At the same time, we set the invert flag if the FalseBB of the current branch dominates our target BB to indicate that we reach the target BB with an inverted condition from the current branch. In this case, the path from the FalseBB contains a cycle if the FalseBB is the target of a backedge. The conditions implied by such a path should not be considered. We can identify such a case by checking if the TrueBB also dominates our target BB, which means we can also reach our target BB from the TrueBB, without going through the backedge. llvm-svn: 222907	2014-11-28 03:26:06 +00:00
Tobias Grosser	68c497fb22	autoconf: Move libraries later to ensure that all symbols are found This fixes llvm.org/PR19540 llvm-svn: 222874	2014-11-27 10:14:27 +00:00
Tobias Grosser	154d9469f4	Add PreHeader always to OuterLoop This fixes a bug introduced in r217525. llvm-svn: 222766	2014-11-25 17:09:21 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only ever executed with this set of parameter values. Example: void foo(float A[][20], long n, long m) { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	6a629c59ba	Use isl_schedule_get_ctx llvm-svn: 222560	2014-11-21 19:39:42 +00:00
Tobias Grosser	983997ff6a	Fix typo llvm-svn: 222559	2014-11-21 19:39:38 +00:00
Johannes Doerfert	9c14737b9b	Simplify the SCEVAffinator [NFC] llvm-svn: 222360	2014-11-19 15:36:59 +00:00
Tobias Grosser	422b30a017	Use new Small(Ptr)Set API This fixes the recent build failures. llvm-svn: 222358	2014-11-19 14:32:32 +00:00
Tobias Grosser	7031141009	Enable SCEV based code generation by default SCEV based code generation allows Polly to detect and generate code for loops that do not have an explicit induction variable, but only virtual induction variables given by SCEV. Being able to do so has two main benefits: - We can detect more scops by default - We require less canonicalization before Polly, which means we get closer to our goal of not touching the IR before analyzing its properties. Specifically, we do not need to run -polly-indvars to introduce explicit canonical induction variables. This switch became possible as both the isl code generation and -polly-parallel are LNT error free with SCEV based code generation and the isl ast generator. llvm-svn: 222113	2014-11-16 22:50:23 +00:00
Tobias Grosser	a4377d3eb8	Fix formatting llvm-svn: 222106	2014-11-16 21:03:32 +00:00
Tobias Grosser	2f8732e7c6	Independent blocks: SE->forget() scalars translated to arrays This prevents SCEVs to reference values not valid any more and as a consequence solves a bug where such values reintroduced during ast generation caused the independent blocks pass to fail validation. http://llvm.org/PR21204 llvm-svn: 222103	2014-11-16 20:33:58 +00:00
Tobias Grosser	4f124821e4	Remove an unnecessary ifdef Reported-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 222102	2014-11-16 17:16:30 +00:00
Tobias Grosser	b05b038b81	Switch default code generation backend to isl The isl based backend has been tested since a long time and with the recently committed OpenMP support the last missing piece of functionality was ported from the CLooG backend. The isl based backend gives us interesting new functionality: - Run-time alias checks (enabled by default) Optimize scops that contain possibly aliasing pointers. This feature has largely increased the number of loop nests we consider for optimization. Thanks Johannes! - Delinearization (not yet enabled by default) Model accesses to multi-dimensional arrays precisely. This will allow us to understand kernels with multi-dimensional VLAs written in Julia, boost::ublas, coremark or C99. Thanks Sebastian! - Generation of higher quality code Sven and me spent a long time to optimize the quality of the generated code. A major focus were expressions as they result from modulos/divisions or piecewise affine expressions (a ? b : c). - Full/Partial tile separation, polyhedral unrolling The isl code generation provides functionality to generate specialized code for core and cleanup loops and to specialize code using polyhedral context information while unrolling statements. (not yet exploited in Polly) - Modifiable access functions We can now use standard isl functionality to remap memory accesses to new data locations. A standard use case is the use of shared memory, where accesses to a larger region in global memory need to be mapped to a smaller shared memory region using a modulo mapping. (not yet exploited in Polly) The cloog based code generation is still available for comparison, but is scheduled for removal. llvm-svn: 222101	2014-11-16 17:02:11 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improved heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due to a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot of loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
Tobias Grosser	c98a8fccf7	Use nullptr instead of '0' for pointers llvm-svn: 221982	2014-11-14 11:12:31 +00:00
David Peixotto	a4817871d2	Safely generate new loop metadata node Polly was accidentally modifying a debug info metadata node when attempting to generate a new unique metadata node for the loop id. The problem was that we had dwarf metadata that referred to a metadata node with a null value, like this: !6 = ... some dwarf metadata referring to !7 ... !7 = {null} When we attempt to generate a new metadata node, we reserve the first space for self-referential node by setting the first argument to null and then mutating the node later to refer to itself. However, because the nodes are uniqued based on pointer values, when we get the new metadata node it actually referred to an existing node (!7 in the example). When we went to modify the metadata to point to itself, we were accidentally mutating the dwarf metadata. We ended up in this situation: !6 = ... some dwarf metadata referring to !7 ... !7 = {!7} and this causes an assert when generating the debug info. The fix is simple, we just need to use a unique value when getting a new metadata node. The MDNode::getTemporary() provides exactly the API we need (and it is used in clang to generate the unique nodes). Differential Revision: http://reviews.llvm.org/D6174 llvm-svn: 221550	2014-11-07 21:44:18 +00:00
Johannes Doerfert	80ef110cca	[Refactor][NFC] Generalize the creation of ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D6031 llvm-svn: 221512	2014-11-07 08:31:31 +00:00
Tobias Grosser	81641db93e	Fix formatting llvm-svn: 221483	2014-11-06 21:25:54 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduce a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	ec7d67ede0	Extract SCEV generation into subfunction This makes the code more readable and will be reused in subsequent OpenMP patches. llvm-svn: 221418	2014-11-06 00:27:01 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
Tobias Grosser	d213a8b810	BlockGenerator: inline lookupAvailableValue into getValue [NFC] There was no good reason why this code was split across two functions. In subsequent changes we will change the order in which values are looked up. Doing so would make the split into two functions even more arbitrary. We also slightly improve the documentation. llvm-svn: 221388	2014-11-05 19:46:04 +00:00
Tobias Grosser	477a1dce2b	Use argument type directly from fflush if available in translation unit When our RuntimeDebugBuilder calls fflush(NULL) to flush all output streams, it is important that the types we use in the call match the ones used in a declaration of fflush possibly already available in the translation unit. As we just pass on a NULL pointer, the type of the pointer value does not really matter. However, as LLVM complains in case of mismatched types, we make sure to create a NULL pointer of identical type. No test case, as RuntimeDebugBuilder is not permanently used in Polly. Calls to it are until now only used to add informative output during debugging sessions. llvm-svn: 221251	2014-11-04 09:18:24 +00:00

849 Commits