llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	6a629c59ba	Use isl_schedule_get_ctx llvm-svn: 222560	2014-11-21 19:39:42 +00:00
Tobias Grosser	983997ff6a	Fix typo llvm-svn: 222559	2014-11-21 19:39:38 +00:00
Johannes Doerfert	9c14737b9b	Simplify the SCEVAffinator [NFC] llvm-svn: 222360	2014-11-19 15:36:59 +00:00
Tobias Grosser	422b30a017	Use new Small(Ptr)Set API This fixes the recent build failures. llvm-svn: 222358	2014-11-19 14:32:32 +00:00
Tobias Grosser	7031141009	Enable SCEV based code generation by default SCEV based code generation allows Polly to detect and generate code for loops that do not have an explicit induction variable, but only virtual induction variables given by SCEV. Being able to do so has two main benefits: - We can detect more scops by default - We require less canonicalization before Polly, which means we get closer to our goal of not touching the IR before analyzing its properties. Specifically, we do not need to run -polly-indvars to introduce explicit canonical induction variables. This switch became possible as both the isl code generation and -polly-parallel are LNT error free with SCEV based code generation and the isl ast generator. llvm-svn: 222113	2014-11-16 22:50:23 +00:00
Tobias Grosser	a4377d3eb8	Fix formatting llvm-svn: 222106	2014-11-16 21:03:32 +00:00
Tobias Grosser	2f8732e7c6	Independent blocks: SE->forget() scalars translated to arrays This prevents SCEVs to reference values not valid any more and as a consequence solves a bug where such values reintroduced during ast generation caused the independent blocks pass to fail validation. http://llvm.org/PR21204 llvm-svn: 222103	2014-11-16 20:33:58 +00:00
Tobias Grosser	4f124821e4	Remove an unnecessary ifdef Reported-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 222102	2014-11-16 17:16:30 +00:00
Tobias Grosser	b05b038b81	Switch default code generation backend to isl The isl based backend has been tested since a long time and with the recently commited OpenMP support the last missing piece of functionality was ported from the CLooG backend. The isl based backend gives us interesting new functionality: - Run-time alias checks (enabled by default) Optimize scops that contain possibly aliasing pointers. This feature has largely increased the number of loop nests we consider for optimization. Thanks Johannes! - Delinearization (not yet enabled by default) Model accesses to multi-dimensional arrays precisely. This will allow us to understand kernels with multi-dimensional VLAs written in Julia, boost::ublas, coremark or C99. Thanks Sebastian! - Generation of higher quality code Sven and me spent a long time to optimize the quality of the generated code. A major focus were expressions as they result from modulos/divisions or piecewise affine expressions (a ? b : c). - Full/Partial tile separation, polyhedral unrolling The isl code generation provides functionality to generate specialized code for core and cleanup loops and to specialize code using polyhedral context information while unrolling statements. (not yet exploited in Polly) - Modifieable access functions We can now use standard isl functionality to remap memory accesses to new data locations. A standard use case is the use of shared memory, where accesses to a larger region in global memory need to be mapped to a smaller shared memory region using a modulo mapping. (not yet exploited in Polly) The cloog based code generation is still available for comparision, but is scheduled for removal. llvm-svn: 222101	2014-11-16 17:02:11 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improvide heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by the default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
Tobias Grosser	c98a8fccf7	Use nullptr instead of '0' for pointers llvm-svn: 221982	2014-11-14 11:12:31 +00:00
David Peixotto	a4817871d2	Safely generate new loop metadata node Polly was accidently modifying a debug info metadata node when attempting to generate a new unique metadata node for the loop id. The problem was that we had dwarf metadata that referred to a metadata node with a null value, like this: !6 = ... some dwarf metadata referring to !7 ... !7 = {null} When we attempt to generate a new metadata node, we reserve the first space for self-referential node by setting the first argument to null and then mutating the node later to refer to itself. However, because the nodes are uniqued based on pointer values, when we get the new metadata node it actually referred to an existing node (!7 in the example). When we went to modify the metadata to point to itself, we were accidently mutating the dwarf metatdata. We ended up in this situation: !6 = ... some dwarf metadata referring to !7 ... !7 = {!7} and this causes an assert when generating the debug info. The fix is simple, we just need to use a unique value when getting a new metadata node. The MDNode::getTemporary() provides exactly the API we need (and it is used in clang to generate the unique nodes). Differential Revision: http://reviews.llvm.org/D6174 llvm-svn: 221550	2014-11-07 21:44:18 +00:00
Johannes Doerfert	80ef110cca	[Refactor][NFC] Generalize the creation of ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D6031 llvm-svn: 221512	2014-11-07 08:31:31 +00:00
Tobias Grosser	81641db93e	Fix formatting llvm-svn: 221483	2014-11-06 21:25:54 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	ec7d67ede0	Extract SCEV generation into subfunction This makes the code more readable and will be reused in subsequent OpenMP patches. llvm-svn: 221418	2014-11-06 00:27:01 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
Tobias Grosser	d213a8b810	BlockGenerator: inline lookupAvailableValue into getValue [NFC] There was no good reason why this code was split accross two functions. In subsequent changes we will change the order in which values are looked up. Doing so would make the split into two functions even more arbitrary. We also slightly improve the documentation. llvm-svn: 221388	2014-11-05 19:46:04 +00:00
Tobias Grosser	477a1dce2b	Use argument type directly from fflush if available in translation unit When our RuntimeDebugBuilder calles fflush(NULL) to flush all output streams, it is important that the types we use in the call match the ones used in a declaration of fflush possible already available in the translation unit. As we just pass on a NULL pointer, the type of the pointer value does not really matter. However, as LLVM complains in case of mismatched types, we make sure to create a NULL pointer of identical type. No test case, as RuntimeDebugBuilder is not permanently used in Polly. Calls to it are until now only used to add informative output during debugging sessions. llvm-svn: 221251	2014-11-04 09:18:24 +00:00
Johannes Doerfert	21aa3dc014	[FIX] Move the statistic code to fix the build. llvm-svn: 221038	2014-11-01 01:30:11 +00:00
Johannes Doerfert	5ad8a6a588	Remove the LoopBounds from the TempScop class. We will use ScalarEvolution in the ScopInfo.cpp to get the loop trip count, not cache it in the TempScop object. Differential Revision: http://reviews.llvm.org/D6070 llvm-svn: 221035	2014-11-01 01:14:56 +00:00
Johannes Doerfert	e3da05ac32	Remove the MaxLoopDepth attribute from the TempScop class Now MaxLoopDepth only lives in Scops not in TempScops anymore. This is the first part of a series of changes to make TempScops obsolete. Differential Revision: http://reviews.llvm.org/D6069 llvm-svn: 221026	2014-11-01 00:12:13 +00:00
Johannes Doerfert	75bd66e51d	[Refactor][NFC] Remove unused argument. llvm-svn: 221016	2014-10-31 23:16:02 +00:00
Johannes Doerfert	7c494217f3	[Refactor][NFC] Map basic blocks to SCoP statements. This will simplify the construction of domains and the modeling of PHI's. llvm-svn: 221015	2014-10-31 23:13:39 +00:00
Tobias Grosser	ab23c9e724	Delete some unnecessary code Originally we have needed this code to map the isl_id of an array to its base pointer. However, as now the isl_id contains a reference to the array itself we obtain the base pointer from this isl_id and we do not need to add this information to the IDToValue map. llvm-svn: 220876	2014-10-29 22:09:54 +00:00
Tobias Grosser	a5605d34b9	Fix typo llvm-svn: 220870	2014-10-29 19:58:28 +00:00
Daniel Jasper	8a1dea0ced	Fix polly formatting after recent clang-format behavior change. llvm-svn: 220711	2014-10-27 19:45:31 +00:00
Tobias Grosser	1d5c6c71d1	Fix typo llvm-svn: 220446	2014-10-22 23:26:48 +00:00
Tobias Grosser	878aba4980	Remove incorrect value description for -enable-polly-aligned The description of the parameter value passed to -enable-polly-aligned did not make any sense at all, but was just a leftover coming from when this option was copied form -enable-polly-openmp. We just drop it as the option description gives sufficient information already. llvm-svn: 220445	2014-10-22 23:22:41 +00:00
Tobias Grosser	01aea5809f	Use stringFromIslObj instead of isl_..._dump to print to dbgs() This makes sure we consistently use dbgs() when printing debug output. Previously, the code just mixed calls to isl_*_dump() with printing to dbgs() and was relying for both methods to interact in predictable ways (same output stream, no unexpected reordering of outputs). llvm-svn: 220443	2014-10-22 23:16:28 +00:00
Tobias Grosser	f084edd0b4	Use braces in multi-statement DEBUG() code [NFC] By adding braces into the DEBUG statement we can make clang-format format code such as: DEBUG(stmt1(); stmt2()) as multi-line code: DEBUG({ stmt1(); stmt2(); }); This makes control-flow in debug statements easier to read. llvm-svn: 220441	2014-10-22 23:00:03 +00:00
David Peixotto	8da2b93d9f	Change the RegionSet type to a SetVector This patch changes the RegionSet type used in ScopDetection from a std::set to a llvm::SetVector. The reason for the change is to ensure deterministic output when printing the result of the analysis. We had a windows buildbot failure for the modified test because the output was coming in a different order. Only one test case needed to be modified for this change. We could use CHECK-DAG directives instead of CHECK in the analysis test cases because the actual order of scops does not matter, but I think that change should be done in a separate patch that modifies all the appliciable tests. I simply modified the test to reflect the expected deterministic output. Differential Revision: http://reviews.llvm.org/D5897 llvm-svn: 220423	2014-10-22 20:39:07 +00:00
Johannes Doerfert	a05214fb5d	[Refactor][NfC] ReportLevel should be used as a bool not an int llvm-svn: 219864	2014-10-15 23:24:28 +00:00
Johannes Doerfert	a99130f042	[Refactor][NfC] Simplify and clean the handling of (new) access relations This patch does not change the semantic on it's own. However, the dependence analysis as well as dce will now use the newest available access relation for each memory access, thus if at some point the json importer or any other pass will run before those two and set a new access relation the behaviour will be different. In general it is unclear if the dependence analysis and dce should be run on the old or new access functions anyway. If we need to access the original access function from the outside later, we can expose the getter again. Differential Revision: http://reviews.llvm.org/D5707 llvm-svn: 219612	2014-10-13 12:58:03 +00:00
Johannes Doerfert	9b5786960d	Relax the condition on the jsop accesses regarding the alignment. We restricted the new access functions to be a subset of the old one because we want to keep the alignment, however if the alignment is "not special", thus the default for the type, we can allow any access. Differential Revision: http://reviews.llvm.org/D5680 llvm-svn: 219503	2014-10-10 15:14:29 +00:00
Johannes Doerfert	341a15a64b	Use the new access function (if present) to compute the access stride. Differential Revision: http://reviews.llvm.org/D5661 llvm-svn: 219499	2014-10-10 14:28:46 +00:00
Johannes Doerfert	1947f863ef	[Refactor][NFC] Simplify (Vector)BlockGenerator This removes an unnecessary cast and a unnecessary local variable. Differential Revision: http://reviews.llvm.org/D5662 llvm-svn: 219338	2014-10-08 20:18:32 +00:00
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	4c7ce479ea	[Refactor] Rename MemoryAccess::Type as it clashes with llvm::Type llvm-svn: 219294	2014-10-08 10:11:33 +00:00
Johannes Doerfert	74f6869435	[Fix] Ignore forwarding alias sets in the alias set tracker. llvm-svn: 219275	2014-10-08 02:23:48 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the pieceweise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two thing: 1) Handle the case the condition of a select is not a i1 type like C. 2) Try to simplify the pieceweise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constrains. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	3cb6372762	[Refactor] Change the comment style to silence -Wcomment -Wcomment complained about a "multi-line comment" caused by the ascii art used in ScopHelper to describe the CFG. Differential Revision: http://reviews.llvm.org/D5618 llvm-svn: 219207	2014-10-07 14:34:13 +00:00
Johannes Doerfert	f1ee2622be	[Fix] Dead statements should not confuse the RTC generation This fixes http://llvm.org/bugs/show_bug.cgi?id=21166 . Differential Revision: http://reviews.llvm.org/D5623 llvm-svn: 219131	2014-10-06 17:43:00 +00:00
Benjamin Kramer	ae81abfa38	Try to unbreak the polly build. llvm-svn: 219080	2014-10-05 11:58:57 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class allows to store information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Johannes Doerfert	a441783544	[Fix] Accidently changed the type of a libgomp argument in r219003. Only subsequent patches introduced tests for the signature in the generated IR, thus the tests were wrong too and are adjusted now. llvm-svn: 219017	2014-10-03 20:40:24 +00:00
Johannes Doerfert	1356ac75d1	Put the parallel context alloca into the function entry block. We use lifetime markers to limit the actual life range (similar to clang). Differential Revision: http://reviews.llvm.org/D5582 llvm-svn: 219005	2014-10-03 19:12:05 +00:00
Johannes Doerfert	990cd4c2e2	Add option to limit the maximal number of parallel threads. Differential Revision: http://reviews.llvm.org/D5581 llvm-svn: 219004	2014-10-03 19:11:10 +00:00

1 2 3 4 5 ...

819 Commits