llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improvide heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	c98a8fccf7	Use nullptr instead of '0' for pointers llvm-svn: 221982	2014-11-14 11:12:31 +00:00
Tobias Grosser	81641db93e	Fix formatting llvm-svn: 221483	2014-11-06 21:25:54 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	01aea5809f	Use stringFromIslObj instead of isl_..._dump to print to dbgs() This makes sure we consistently use dbgs() when printing debug output. Previously, the code just mixed calls to isl_*_dump() with printing to dbgs() and was relying for both methods to interact in predictable ways (same output stream, no unexpected reordering of outputs). llvm-svn: 220443	2014-10-22 23:16:28 +00:00
Johannes Doerfert	b164c795b7	[RTC] Runtime Alias Checks for the ISL backend This change will build all alias groups (minimal/maximal accesses to possible aliasing base pointers) we have to check before we can assume an alias free environment. It will also use these to create Runtime Alias Checks (RTC) in the ISL code generation backend, thus allow us to optimize SCoPs despite possibly aliasing pointers when this backend is used. This feature will be enabled for the isl code generator, e.g., --polly-code-generator=isl, but disabled for: - The cloog code generator (still the default). - The case delinearization is enabled. - The case non-affine accesses are allowed. llvm-svn: 218046	2014-09-18 11:17:17 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IstAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Johannes Doerfert	8e95dc657e	[Fix] OpenMP parallel loop detection for the isl backend There was a bug in the IslAst which caused that no more outermost parallel loops were detected/checked after a parallel outermost loop of depth 1. + Test case attached llvm-svn: 217452	2014-09-09 17:03:54 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	dc6ad99aad	Annotate the IslAst with broken reductions + Split all reduction dependences and map them to the causing memory accesses. + Print the types & base addresses of broken reductions for each "reduction parallel" marked loop (OpenMP style). + 3 test cases to show how reductions are now represented in the isl ast. The mapping "(ast) loops -> broken reductions" is also needed to find the memory accesses we need to privatize in a loop. llvm-svn: 214489	2014-08-01 08:17:19 +00:00
Johannes Doerfert	ed67f8baf6	Change the semantics of is*Parallel The functions isParallel, isInnermostParallel and IsOutermostParallel in IslAstInfo will now return true even in the presence of broken reductions. To compensate for this change the negated result of isReductionParallel can be used. llvm-svn: 214488	2014-08-01 08:14:28 +00:00
Johannes Doerfert	99f6630c82	[Refactor] Remove unecessary check and function + Perform the parallelism check on the innermost loop only once. + Inline the markOpenmpParallel function. + Rename all IslAstUserPayload * into Payload to make it consistent. llvm-svn: 214448	2014-07-31 21:34:32 +00:00
Johannes Doerfert	0eefb0258f	[Refactor] Use nicer print callback function in IslAst llvm-svn: 214447	2014-07-31 21:33:49 +00:00
Johannes Doerfert	47b906c815	Assume no annotations when visiting new domain (IslAst) Whe we build the IslAst we visit for nodes (in pre and post order) as well as user/domain nodes. As these two sets are non overlapping we do not need to check if we annotated a node earlier when we visit it. llvm-svn: 214170	2014-07-29 08:59:56 +00:00
Johannes Doerfert	515f85770c	Replace the dependences parallelism check by the IslAst one llvm-svn: 214061	2014-07-28 03:46:28 +00:00
Johannes Doerfert	af9b1e2d80	[Refactor] Remove containsLoop to find innermost loops Use the fact that if we visit a for node first in pre and next in post order we know we did not visit any children, thus we found an innermost loop. + Test case for an innermost loop with a conditional inside llvm-svn: 213870	2014-07-24 15:59:06 +00:00
Johannes Doerfert	c4968e508b	[Refactor] IslAst and payload struct + Renamed context into build when it's the isl_ast_build + Use the IslAstInfo functions to extract the schedule of a node + Use the IslAstInfo functions to extract the build/context of a node + Move the payload struct into the IslAstInfo class + Use a constructor and destructor (also new and delete) to allocate/initialize the payload struct llvm-svn: 213792	2014-07-23 20:17:28 +00:00
Johannes Doerfert	f6583176ab	[Refactor] Unify IslAst print methods + Add const annotations to some member functions llvm-svn: 213779	2014-07-23 18:14:43 +00:00
Johannes Doerfert	6e81905ece	[Refactor] Move code out of the IslAst header Offer the static functions to extract information out of an IslAst for node as members of IslAstInfo not as top level entities. + Refactor common code + Add isParallel and isReductionParallel + Rename IslAstUser to IslAstUserPayload to make it clear this is just a (or the) payload struct. llvm-svn: 213272	2014-07-17 16:11:28 +00:00
Johannes Doerfert	3151463091	[Format] Clang format IslAst.cpp llvm-svn: 213026	2014-07-15 00:58:57 +00:00
Johannes Doerfert	457f73eaee	Annotate reduction parallel loops in the IslAst textual output + Introduced dependency type TYPE_TC_RED to represent the transitive closure (& the reverse) of reduction dependences. These are used when we check for reduction parallel loops. + Test cases including loop reversals and modulo schedules which compute reductions in a alternated order. llvm-svn: 213019	2014-07-15 00:00:35 +00:00
Tobias Grosser	5e6813d184	Derive run-time conditions for delinearization As our delinearization works optimistically, we need in some cases run-time checks that verify our optimistic assumptions. A simple example is the following code: void foo(long n, long m, long o, double A[n][m][o]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 150; j++) for (long k = 0; k < 200; k++) A[i][j][k] = 1.0; } After clang linearized the access to A and we delinearized it again to A[i][j][k] we need to ensure that we do not access the delinearized array out of bounds (this information is not available in LLVM-IR). Hence, we need to verify the following constraints at run-time: CHECK: Assumed Context: CHECK: [o, m] -> { : m >= 150 and o >= 200 } llvm-svn: 212198	2014-07-02 17:47:48 +00:00
Tobias Grosser	ce67a044e0	Use arguments of user statements to perform induction variable substitution To translate the old induction variables as they exist before Polly to new new induction variables introduced during AST code generation we need to generate code that computes the new values from the old ones. We can do this by just looking at the arguments isl generates in each scheduled statement. Example: // Old for i S(i) // New for c0 for c1 S(c0 + c1) To get the value of i, we need to compute 'c0 + c1'. This expression is readily available in the user statements generated by isl and just needs to be translated to LLVM-IR. This replaces an old confusing construct that constructed during ast generation an isl multi affine expression that described this relation and which was then again ast generated for each statement and argument when translating the isl ast to LLVM-IR. This approach was difficult to understand and the additional ast generation calls where entirely redundant as isl provides the relevant expressions as arguments of the generated user statements. llvm-svn: 212186	2014-07-02 16:26:47 +00:00
Tobias Grosser	bc4ef90802	Remove redundant code and use C++ range fors llvm-svn: 211980	2014-06-28 08:59:38 +00:00
Johannes Doerfert	1530adffdd	[Refactor] Make the used dependence types explicit llvm-svn: 211803	2014-06-26 20:24:17 +00:00
Johannes Doerfert	f1906138b4	Model statement wise reduction dependences + Collect reduction dependences + Introduced TYPE_RED in Dependences.h which can be used to obtain the reduction dependences + Used TYPE_RED to prevent parallelization while we do not have a privatizing code generation + Relax the dependences for non-parallel code generation + Add privatization dependences to ensure correctness + 12 Test cases to check for reduction and privatization dependences llvm-svn: 211369	2014-06-20 16:37:11 +00:00
Tobias Grosser	91f5b26268	Use range-based for loops llvm-svn: 210170	2014-06-04 08:06:40 +00:00
Chandler Carruth	95fef9446c	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, Polly edition. If you want to know more details about this, you can see the recent commits to Debug.h in LLVM. This is just the Polly segment of a cleanup I'm doing globally for this macro. llvm-svn: 206852	2014-04-22 03:30:19 +00:00
Tobias Grosser	5a56cbf496	[C++11] Use nullptr llvm-svn: 206361	2014-04-16 07:33:47 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	54ee0ba74d	IslCodegen: Support for run-time conditions llvm-svn: 194948	2013-11-17 03:18:25 +00:00
Tobias Grosser	d764fcbd5a	Update comments to address Sebastian's review llvm-svn: 193741	2013-10-31 11:50:52 +00:00
Tobias Grosser	e86109f508	ScopInfo: Add support for AssumedContext When constructing a scop sometimes the exact representation of a statement or condition would be very complex, but there is a common case which is a lot simpler, but which is only valid under certain assumptions. The assumed context records the assumptions taken during the construction of this scop and that need to be code generated as a run-time test. At the moment, we do not yet model any assumptions, but only added the AssumedContext as well as the isl-ast generation support. As a next step, this needs to be hooked up with the isl code generation. if (1) /* run-time condition / { / optimized code / } else { / original code */ } llvm-svn: 193652	2013-10-29 21:05:49 +00:00
Tobias Grosser	58032cb029	Integrate latest clang-format changes llvm-svn: 184655	2013-06-23 01:29:29 +00:00
Tobias Grosser	83628182f7	Sort includes llvm-svn: 181297	2013-05-07 08:11:54 +00:00
Tobias Grosser	637bd63123	Move polly options into separate option category Use the new cl::OptionCategory support to move the Polly options into a separate option category. The aim is to hide most options and show by default only the options a user needs to influence '-O3 -polly'. The available options probably need some care, but here is the current status: Polly Options: Configure the polly loop optimizer -enable-polly-openmp - Generate OpenMP parallel code -polly - Enable the polly optimizer (only at -O3) -polly-no-tiling - Disable tiling in the scheduler -polly-only-func=<function-name> - Only run on a single function -polly-report - Print information about the activities of Polly -polly-vectorizer - Select the vectorization strategy =none - No Vectorization =polly - Polly internal vectorizer =unroll-only - Only grouped unroll the vectorize candidate loops =bb - The Basic Block vectorizer driven by Polly llvm-svn: 181295	2013-05-07 07:31:10 +00:00
Tobias Grosser	e602a07662	Reformat with clang-format clang-format become way more stable. This time we mainly reformat function signatures. llvm-svn: 181294	2013-05-07 07:30:56 +00:00
Tobias Grosser	d7e58640a5	Update formatting to latest version of clang-format llvm-svn: 179160	2013-04-10 06:55:45 +00:00
Tobias Grosser	4d96c8d714	clang-format: Many more files After this commit, polly is clang-format clean. This can be tested with 'ninja polly-check-format'. Updates to clang-format may change this, but the differences will hopefully be both small and general improvements to the formatting. We currently have some not very nice formatting for a couple of items, DEBUG() stmts for example. I believe the benefit of being clang-format clean outweights the not perfect layout of this code. llvm-svn: 177796	2013-03-23 01:05:07 +00:00
Tobias Grosser	0fbbbf6aa7	Silence 'variable unused' warning in release mode llvm-svn: 177515	2013-03-20 13:03:25 +00:00
Tobias Grosser	7242ad9226	CodeGen: clang-format llvm-svn: 175872	2013-02-22 08:07:06 +00:00
Tobias Grosser	c14582f276	CodeGen: clang-format goodness The changed files are not yet clang-format clean, but we are getting close. llvm-svn: 174403	2013-02-05 18:01:29 +00:00
Sebastian Pop	ab9ea13f64	avoid initializing twice llvm-svn: 170854	2012-12-21 07:27:17 +00:00
Sebastian Pop	eb283143e2	do not access Info when it is NULL llvm-svn: 170853	2012-12-21 07:27:13 +00:00
Sebastian Pop	04c4ce32ae	isl: vector code generation based on ISL ast Original patch by Tobias Grosser, slightly modified by Sebastian Pop. llvm-svn: 170420	2012-12-18 07:46:13 +00:00
Sebastian Pop	a00a029115	change interface for isStride isStride now takes a partial schedule as input. Patch from Tobias Grosser <tobias@grosser.es>. llvm-svn: 170419	2012-12-18 07:46:06 +00:00
Sebastian Pop	e252c85545	isl: detect vector parallelism llvm-svn: 170138	2012-12-13 16:52:41 +00:00
Tobias Grosser	e36abf6d5d	isl: Detect openmp parallelism Based on code written by Riyadh Baghdadi. llvm-svn: 170102	2012-12-13 06:24:06 +00:00
Tobias Grosser	b2e572c6df	Update the recommended isl version Recent changes in isl: - Allow analysis of loops during code generation This simplifies the detection of parallel loops. - Simplify the way costumized ast printers are defined This enables us to highlight parallel / vector loops in our debug output. - Compile time improvements for codegen contexts that include parameters - Various bug fixes This update also gets us in sync for the isl 0.11 release. llvm-svn: 169100	2012-12-01 21:51:10 +00:00
Sebastian Pop	facd36e5c7	make IslAstInfo::printScop compatible with CloogInfo::printScop llvm-svn: 168714	2012-11-27 18:50:41 +00:00
Tobias Grosser	0c55514a43	autoconf/cmake: Always require isl code generation. This change ensures that isl is only detected if it includes code generation support. This allows us to remove a lot of conditional compilation and also avoids missing test cases in case the feature is not available. llvm-svn: 166403	2012-10-21 21:48:21 +00:00
Tobias Grosser	0934e70e6e	Add an ast pretty printer pass based on the isl code generator llvm-svn: 165036	2012-10-02 19:50:38 +00:00

1 2 3

102 Commits