llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	fb19dd694c	Create parallel code in a separate block This commit basically reverts r246427 but still solves the issue tackled by that commit. Instead of emitting initialization code in the beginning of the start block we now generate parallel code in its own block and thereby guarantee separation. This is necessary as we cannot generate code for hoisted loads prior to the start block but it still needs to be placed prior to everything else. llvm-svn: 248674	2015-09-26 20:57:59 +00:00
Johannes Doerfert	f2cc86edae	Simplify domain generation We now add loop carried information during the second traversal of the region instead of in a intermediate step in-between. This makes the generation simpler, removes code and should even be faster. llvm-svn: 248125	2015-09-20 16:15:32 +00:00
Johannes Doerfert	0c1123a831	[FIX] Repair test case that was unprofitable llvm-svn: 248124	2015-09-20 16:14:41 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00
Tobias Grosser	aaadc5302c	[test] Load Polly before using the polly flags llvm-svn: 247551	2015-09-14 11:49:05 +00:00
Johannes Doerfert	334f9e87c6	[FIX] XFAIL test that depends on pending LLVM commit llvm-svn: 247550	2015-09-14 11:45:34 +00:00
Johannes Doerfert	e114dc024e	[FIX] Handle error blocks in non-affine regions correctly llvm-svn: 247545	2015-09-14 11:15:58 +00:00
Johannes Doerfert	40fa56f59f	[FIX] Allow the whole SCoP to be a non-affine subregion llvm-svn: 247544	2015-09-14 11:15:07 +00:00
Johannes Doerfert	36255eecd8	Revert r247278 "Disable support for modulo expressions" This reverts commit 00c5b6ca8832439193036aadaaaee92a43236219. We can handle modulo expressions in the domain again. llvm-svn: 247542	2015-09-14 11:14:23 +00:00
Johannes Doerfert	ca1e38fa43	Propagate exit conditions as described in the PET paper At some point we build loop trip counts using this method. It was replaced by a simpler trick that works only for affine (e.g., not modulo) constraints and relies on the removal of unbounded parts. In order to allow modulo constrains again we go back to the former, more accurate method. llvm-svn: 247540	2015-09-14 11:12:52 +00:00
David Blaikie	0afc1e4ecc	Update polly for explicit type parameter to global alias change llvm-svn: 247382	2015-09-11 03:42:32 +00:00
Johannes Doerfert	b68cffb5df	Allow general loops with one latch As we do not rely on ScalarEvolution any more we do not need to get the backedge taken count. Additionally, our domain generation handles everything that is affine and has one latch and our ScopDetection will over-approximate everything else. This change will therefor allow loops with: - one latch - exiting conditions that are affine Additionally, it will not check for structured control flow anymore. Hence, loops and conditionals are not necessarily single entry single exit regions any more. Differential Version: http://reviews.llvm.org/D12758 llvm-svn: 247289	2015-09-10 15:27:46 +00:00
Michael Kruse	9cc1b9d31e	Clean-up unit tests Remove redundant flags and duplicate invocations of the same test. llvm-svn: 247285	2015-09-10 14:42:09 +00:00
Johannes Doerfert	5b9ff8b667	Replace ScalarEvolution based domain generation This patch replaces the last legacy part of the domain generation, namely the ScalarEvolution part that was used to obtain loop bounds. We now iterate over the loops in the region and propagate the back edge condition to the header blocks. Afterwards we propagate the new information once through the whole region. In this process we simply ignore unbounded parts of the domain and thereby assume the absence of infinite loops. + This patch already identified a couple of broken unit tests we had for years. + We allow more loops already and the step to multiple exit and multiple back edges is minimal. + It allows to model the overflow checks properly as we actually visit every block in the SCoP and know where which condition is evaluated. - It is currently not compatible with modulo constraints in the domain. Differential Revision: http://reviews.llvm.org/D12499 llvm-svn: 247279	2015-09-10 13:00:06 +00:00
Johannes Doerfert	171f07ed71	Disable support for modulo expressions The support for modulo expressions is not comlete and makes the new domain generation harder. As the currently broken domain generation needs to be replaced, we will first swap in the new, fixed domain generation and make it compatible with the modulo expressions later. llvm-svn: 247278	2015-09-10 12:56:46 +00:00
Chandler Carruth	66ef16b289	[PM] Update Polly for the new AA infrastructure landed in r247167. llvm-svn: 247198	2015-09-09 22:13:56 +00:00
Johannes Doerfert	7ca8dc2d2d	Disable support for pointer expressions The support for pointer expressions is broken as it can only handle some patterns in the IslExprBuilder. We should to treat pointers in expressions the same as integers at some point and revert this patch. llvm-svn: 247147	2015-09-09 14:19:04 +00:00
Johannes Doerfert	717b866798	Allow PHI nodes in the region exit block While we do not need to model PHI nodes in the region exit (as it is not part of the SCoP), we need to prepare for the case that the exit block is split in code generation to create a single exiting block. If this will happen, hence if the region did not have a single exiting block before, we will model the operands of the PHI nodes as escaping scalars in the SCoP. Differential Revision: http://reviews.llvm.org/D12051 llvm-svn: 247078	2015-09-08 21:44:27 +00:00
Tobias Grosser	02e6589bda	Move more compile-time bailouts into -polly-detect-unprofitable Instead of having two separate options -polly-detect-scops-in-functions-without-loops and -polly-detect-scops-in-regions-without-loops we now just use -polly-detect-unprofitable to force the detection of scops ignoring any compile time saving bailout heuristics. llvm-svn: 247057	2015-09-08 19:46:41 +00:00
Tobias Grosser	a89dc57b41	Do not use '.' in subfunction names Certain backends, e.g. NVPTX, do not support '.' in function names. Hence, we ensure all '.' are replaced by '_' when generating function names for subfunctions. For the current OpenMP code generation, this is not strictly necessary, but future uses cases (e.g. GPU offloading) need this issue to be fixed. llvm-svn: 246980	2015-09-08 06:22:17 +00:00
Tobias Grosser	12e650d682	Drop alias metadata in checks of RuntimeDebugBuilder test Our alias metadata is currently not emitted in a deterministic order. As it is not needed in this test, we just drop it for now (but keep in mind to fix this). llvm-svn: 246942	2015-09-06 08:59:50 +00:00
Tobias Grosser	86bc93a9b2	Add option -polly-codegen-add-debug-printing When this option is enabled, Polly will emit printf calls for each scalar load/and store which dump the scalar value loaded/stored at run time. This patch also refactors the RuntimeDebugBuilder to use variadic templates when generating CPU printfs. As result, it now becomes easier to print strings that consist of a set of arguments. Also, as a single printf call is emitted, it is more likely for such strings to be emitted atomically if executed multi-threaded. llvm-svn: 246941	2015-09-06 08:47:57 +00:00
Tobias Grosser	113a4a4cbb	Add forgotten .jscop file llvm-svn: 246925	2015-09-05 10:58:13 +00:00
Tobias Grosser	72b80672d9	OpenMP: Name the values passed to the subfunciton according to the original llvm::Values llvm-svn: 246924	2015-09-05 10:41:19 +00:00
Tobias Grosser	0d8874c0f6	OpenMP codegen: support generation of multi-dimensional access functions When computing the index expressions for new, multi-dimensional memory accesses these new index expressions may reference original llvm::Values that are not transfered into the OpenMP subfunction. Using GlobalMap we now replace references to such values with the rewritten values that have e.g. been passed to the OpenMP subfunction. llvm-svn: 246923	2015-09-05 10:32:56 +00:00
Tobias Grosser	6f73008506	Allow the import of multi-dimensional access functions Originally, we disallowed the import of multi-dimensional access functions due to our code generation not supporting the generation of new address expressions for multi-dimensional memory accesses. When building our run-time alias check infrastructure we added code generation support for multi-dimensional address calculations. Hence, we can now savely allow the import of new multi-dimensional access functions. llvm-svn: 246917	2015-09-05 07:46:47 +00:00
Tobias Grosser	d213d52d0e	Always use the branch instructions to model the PHI-node writes Before this commit we did this only for Arguments or Constants, but indeed an instruction may define a value a lot higher up in the dominance tree, but the actual write generally needs to happen right before branching to the PHI node. Otherwise, the writes of different branches into PHI nodes may get intermixed if they lay higher up in the dominance tree. llvm-svn: 246441	2015-08-31 13:45:54 +00:00
Tobias Grosser	9f3d55cf3d	Generate scalar initialization loads at the beginning of the start BB Our OpenMP code generation generated part of its launching code directly into the start basic block and without this change the scalar initialization was run _after_ the OpenMP threads have been launched. This resulted in uninitialized scalar values to be used. llvm-svn: 246427	2015-08-31 11:06:19 +00:00
Tobias Grosser	f93451802a	OpenMP-codegen: Correctly pass function arguments to subfunctions Before we only checked if certain instructions can be expanded by us. Now we check any value, including function arguments. llvm-svn: 246425	2015-08-31 09:05:43 +00:00
Tobias Grosser	d86bf4271c	Do not model scalar references to constant values llvm-svn: 246418	2015-08-31 06:37:25 +00:00
Johannes Doerfert	96425c2574	Traverse the SCoP to compute non-loop-carried domain conditions In order to compute domain conditions for conditionals we will now traverse the region in the ScopInfo once and build the domains for each block in the region. The SCoP statements can then use these constraints when they build their domain. The reason behind this change is twofold: 1) This removes a big chunk of preprocessing logic from the TempScopInfo, namely the Conditionals we used to build there. Additionally to moving this logic it is also simplified. Instead of walking the dominance tree up for each basic block in the region (as we did before), we now traverse the region only once in order to collect the domain conditions. 2) This is the first step towards the isl based domain creation. The second step will traverse the region similar to this step, however it will propagate back edge conditions. Once both are in place this conditional handling will allow multiple exit loops additional logic. Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12428 llvm-svn: 246398	2015-08-30 21:13:53 +00:00
Tobias Grosser	c0091a77f9	Store scalar dependences from outside the scop into alloca locations We already modeled read-only dependences to scalar values defined outside the scop as memory reads and also generated read accesses from the corresponding alloca instructions that have been used to pass these scalar values around during code generation. However, besides for PHI nodes that have already been handled, we failed to store the orignal read-only scalar values into these alloc. This commit extends the initialization of scalar values to all read-only scalar values used within the scop. llvm-svn: 246394	2015-08-30 19:19:34 +00:00
Tobias Grosser	e83a396b1d	Ignore debug intrinsics and do not model their potential scalar metadata reads Our code generation currently does not support scalar references to metadata values. Hence, it would crash if we try to model scalar dependences to metadata values. Fortunately, for one of the common uses, debug information, we can for now just ignore the relevant intrinsics and consequently the issue of how to model scalar dependences to metadata. llvm-svn: 246388	2015-08-30 16:57:20 +00:00
Tobias Grosser	51b65d9370	Drop alias tags from vector test case They are not really part of what is tested here. llvm-svn: 246382	2015-08-30 14:06:30 +00:00
Duncan P. N. Exon Smith	adbcf12029	DI: Fix testcases after LLVM r246327 I ran the script from r246327 and it touched all the right files; committing now to hopefully right the bots, but if my check-polly doesn't come back clean I'll keep looking. http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/33648 llvm-svn: 246341	2015-08-28 22:01:49 +00:00
Tobias Grosser	ed21a1fc7e	Do not detect Scops with only one loop. If a region does not have more than one loop, we do not identify it as a Scop in ScopDetection. The main optimizations Polly is currently performing (tiling, preparation for outer-loop vectorization and loop fusion) are unlikely to have a positive impact on individual loops. In some cases, Polly's run-time alias checks or conditional hoisting may still have a positive impact, but those are mostly enabling transformations which LLVM already performs for individual loops. As we do not focus on individual loops, we leave them untouched to not introduce compile time regressions and execution time noise. This results in good compile time reduction (oourafft: -73.99%, smg2000: -56.25%). Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12268 llvm-svn: 246161	2015-08-27 16:55:18 +00:00
Tobias Grosser	01c8f5f354	[Vectorizer] Detect strides in multi-dimensional arrays The original code was only correct for one-dimensional arrays, but derived incorrect strides for multi-dimensional arrays. llvm-svn: 245888	2015-08-24 22:20:46 +00:00
Tobias Grosser	39f9f30e8b	Only derive number of loop iterations for loops we can actually vectorize llvm-svn: 245870	2015-08-24 20:11:34 +00:00
Roman Gareev	c49724f008	Manually check a loop form Add manual check of a loop form and return non-negative number of iterations in case of trivially vectorizable loop. llvm-svn: 245680	2015-08-21 09:08:14 +00:00
Johannes Doerfert	5d5b30649a	Check feasibility for the runtime check context wrt. the domain. If nothing is executed we can bail out early. Otherwise we can use the constraints that ensure at least one statement is executed for simplification. llvm-svn: 245585	2015-08-20 18:06:30 +00:00
Johannes Doerfert	43788c5783	Check for feasible runtime check context early Instead of generating code for an empty assumed context we bail out early. As the number of assumptions we generate increases this becomes more and more important. Additionally, this change will allow us to hide internal contexts that are only used in runtime checks e.g., a boundary context with constraints not suited for simplifications. llvm-svn: 245540	2015-08-20 05:58:56 +00:00
Tobias Grosser	b0da42fb55	Generate alias metadata even in OpenMP mode To make alias scope metadata generation work in OpenMP mode we now provide the ScopAnnotator with information about the base pointer rewrite that happens when passing arrays into the OpenMP subfunction. llvm-svn: 245451	2015-08-19 16:04:35 +00:00
Michael Kruse	d2b0360197	Fix Codegen adding a second exit out of region executeScopConditionally would destroy a predecessor region if it the scop's entry was the region's exit block by forking it to polly.start and thus creating a secnd exit out of the region. This patch "shrinks" the predecessor region s.t. polly.split_new_and_old is not the region's exit anymore. llvm-svn: 245294	2015-08-18 13:14:42 +00:00
Johannes Doerfert	e69e1141d9	Introduce the ScopExpander as a SCEVExpander replacement The SCEVExpander cannot deal with all SCEVs Polly allows in all kinds of expressions. To this end we introduce a ScopExpander that handles the additional expressions separatly and falls back to the SCEVExpander for everything else. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12066 llvm-svn: 245288	2015-08-18 11:56:00 +00:00
Johannes Doerfert	e1fa6da356	[FIX] Create location if a needed value was not yet demoted This allows the code generation to continue working even if a needed value (that is reloaded anyway) was not yet demoted. Instead of failing it will now create the location for future demotion to memory and load from that location. The stores will use the same location and by construction execute before the load even if the textual order in the generated AST is otherwise. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12072 llvm-svn: 245203	2015-08-17 09:38:46 +00:00
Tobias Grosser	3278b7cd7c	Add 2nd test case for sdiv/srem instructions in a SCEV llvm-svn: 245186	2015-08-16 19:53:21 +00:00
Johannes Doerfert	eca5282dd0	[FIX] Add XFAIL to crashing test case llvm-svn: 245180	2015-08-16 14:54:16 +00:00
Johannes Doerfert	c594dc9ed0	Add a crashing test case for the scalar code generation This test case crashes the scalar code generation as we are not consistent with the usage of the assumed context. To be precise, we use the assumed context for the dependence analysis but not to restrict the domains of the statements. A step by step explanation of the problem is given in the test case. llvm-svn: 245176	2015-08-16 11:12:22 +00:00
Tobias Grosser	bccd1b0af0	Fix test case after recent LLVM changes llvm-svn: 244954	2015-08-13 21:08:15 +00:00
Tobias Grosser	7e584168ab	Manuallt simplify test case llvm-svn: 244907	2015-08-13 16:33:32 +00:00
Michael Kruse	2da3872a99	Add test case for SCEV synthesizing CodeGenerator currently tries to generate code for a parameter using values values that are computed later. llvm-svn: 244903	2015-08-13 15:53:53 +00:00
Tobias Grosser	0164b8ff70	Enable code generation of scalar dependences from function arguments This change extends the BlockGenerator to not only allow Instructions as base elements of scalar dependences, but any llvm::Value. This allows us to code-generate scalar dependences which reference function arguments, as they arise when moddeling read-only scalar dependences. llvm-svn: 244874	2015-08-13 08:07:39 +00:00
Tobias Grosser	a77cea49d1	Always model PHI nodes in scop (if not in same nonaffine subregion) Before we only modeled PHI nodes if at least one incoming basic block was itself part of the region, now we always model them except if all of their operands are part of a single non-affine subregion which we model as a black-box. This change only affects PHI nodes in the entry block, that have exactly one incoming edge. Before this change, we did not model them and as a result code generation would not know how to code generate them. With this change, code generation can code generate them like any other PHI node. This issue was exposed by r244606. Before this change simplifyRegion would have moved these PHI nodes out of the SCoP, so we would never have tried to code generate them. We could implement this behavior again, but changing the IR after the scop has been modeled and transformed always adds a risk of us invalidating earlier analysis results. It seems more save and overall also more consistent to just model and handle this one-entry-edge PHI nodes like any other PHI node in the scop. Solution proposed by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 244721	2015-08-12 07:48:54 +00:00
Michael Kruse	fba24b3775	Add another test case with trival PHI in entry BB This one was extracted from the test-suite's pifft and caused a miscompilation because a scalar was not written to its alloca address. llvm-svn: 244720	2015-08-12 07:34:55 +00:00
Michael Kruse	4f9caf2b28	Add test case for entry node with trivial PHI This is a break-down from the test-suite's oggenc where Polly currently crashes. llvm-svn: 244692	2015-08-11 23:09:19 +00:00
Michael Kruse	22370884c4	Revise the simplification of regions The previous code had several problems: For newly created BasicBlocks it did not (always) call RegionInfo::setRegionFor in order to update its analysis. At the moment RegionInfo does not verify its BBMap, but will in the future. This is fixed by determining the region new BBs belong to and set it accordingly. The new executeScopConditionally() requires accurate getRegionFor information. Which block is created by SplitEdge depends on the incoming and outgoing edges of the blocks it connects, which makes handling its output more difficult than it needs to be. Especially for finding which block has been created an to assign a region to it for the setRegionFor problem above. This patch uses an implementation for splitEdge that always creates a block between the predecessor and successor. simplifyRegion has also been simplified by using SplitBlockPredecessors instead of SplitEdge. Isolating the entries and exits have been refectored into individual functions. Previously simplifyRegion did more than just ensuring that there is only one entering and one exiting edge. It ensured that the entering block had no other outgoing edge which was necessary for executeScopConditionally(). Now the latter uses the alternative splitEdge implementation which can handle this situation so simplifyRegion really only needs to simplify the region. Also, executeScopConditionally assumed that there can be no PHI nodes in blocks with one incoming edge. This is wrong and LCSSA deliberately produces such edges. However, previous passes ensured that there can be no such PHIs in exit nodes, but which will no longer hold in the future. The new code that the property that it preserves the identity of region block (the property that the memory address of the BasicBlock containing the instructions remains the same; new blocks only contain PHI nodes and a terminator), especially the entry block. As a result, there is no need to update the reference to the BasicBlock of ScopStmt that contain its instructions because they have been moved to other basic blocks. Reviewers: grosser Part of Differential Revision: http://reviews.llvm.org/D11867 llvm-svn: 244606	2015-08-11 14:39:21 +00:00
Michael Kruse	874b5c2197	Correct non-existing past participle of split in filename llvm-svn: 244478	2015-08-10 18:37:34 +00:00
Duncan P. N. Exon Smith	20b50f2b2a	Update testcases after LLVM r243885 llvm-svn: 243887	2015-08-03 17:28:43 +00:00
Tobias Grosser	6213913244	Use the branch instruction to define the location of a PHI-node write We use the branch instruction as the location at which a PHI-node write takes place, instead of the PHI-node itself. This allows us to identify the basic-block in a region statement which is on the incoming edge of the PHI-node and for which the write access was originally introduced. As a result we can, during code generation, avoid generating PHI-node write accesses for basic blocks that do not preceed the PHI node without having to look at the IR again. This change fixes a bug which was introduced in r243420, when we started to explicitly model PHI-node reads and writes, but dropped some additional checks that where still necessary during code generation to not emit PHI-node writes for basic-blocks that are not on incoming edges of the original PHI node. Compared to the code before r243420 the new code does not need to inspect the IR any more and we also do not generate multiple redundant writes. llvm-svn: 243852	2015-08-02 16:17:41 +00:00
Tobias Grosser	45e7944bcf	Only use instructions as insert locations for SCEVExpander SCEVExpander, which we are using during code generation, only allows instructions as insert locations, but breaks in case BasicBlock->end() iterators are passed to it due to it trying to obtain the basic block in which code should be generated by calling Instruction->getParent(), which is not defined for ->end() iterators. This change adds an assert to Polly that ensures we only pass valid instructions to SCEVExpander and it fixes one case, where we used IRBuilder->SetInsertBlock() to set an ->end() insert location which was later passed to SCEVExpander. In general, Polly is always trying to build up the CFG first, before we actually insert instructions into the CFG sceleton. As a result, each basic block should already have at least one branch instruction before we start adding code. Hence, always requiring the IRBuilder insert location to be set to a real instruction should always be possible. Thanks Utpal Bora <cs14mtech11017@iith.ac.in> for his help with test case reduction. llvm-svn: 243830	2015-08-01 09:07:57 +00:00
Duncan P. N. Exon Smith	c51714a0c6	Fix polly tests after LLVM IR change in r243774 llvm-svn: 243801	2015-07-31 23:58:50 +00:00
Johannes Doerfert	338b42c329	Removed redundant alias checks generated during run time. As specified in PR23888, run-time alias check generation is expensive in terms of compile-time. This reduces the compile time by computing minimal/maximal access only once for each base pointer Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 243024	2015-07-23 17:04:54 +00:00
Tobias Grosser	808cd69a92	Use schedule trees to represent execution order of statements Instead of flat schedules, we now use so-called schedule trees to represent the execution order of the statements in a SCoP. Schedule trees make it a lot easier to analyze, understand and modify properties of a schedule, as specific nodes in the tree can be choosen and possibly replaced. This patch does not yet fully move our DependenceInfo pass to schedule trees, as some additional performance analysis is needed here. (In general schedule trees should be faster in compile-time, as the more structured representation is generally easier to analyze and work with). We also can not yet perform the reduction analysis on schedule trees. For more information regarding schedule trees, please see Section 6 of https://lirias.kuleuven.be/handle/123456789/497238 llvm-svn: 242130	2015-07-14 09:33:13 +00:00
Tobias Grosser	af4e809ca6	Remove code for scalar and PHI to array translation This removes old code that has been disabled since several weeks and was hidden behind the flags -disable-polly-intra-scop-scalar-to-array=false and -polly-model-phi-nodes=false. Earlier, Polly used to translate scalars and PHI nodes to single element arrays, as this avoided the need for their special handling in Polly. With Johannes' patches adding native support for such scalar references to Polly, this code is not needed any more. After this commit both -polly-prepare and -polly-independent are now mostly no-ops. Only a couple of simple transformations still remain, but they are scheduled for removal too. Thanks again to Johannes Doerfert for his nice work in making all this code obsolete. llvm-svn: 240766	2015-06-26 07:31:18 +00:00
Tobias Grosser	50165ffdee	Add support for srem instruction Remainder operations with constant divisor can be modeled as quasi-affine expression. This patch adds support for detecting and modeling them. We also add a test that ensures they are correctly code generated. This patch was extracted from a larger patch contributed by Johannes Doerfert in http://reviews.llvm.org/D5293 llvm-svn: 240518	2015-06-24 04:13:29 +00:00
Tobias Grosser	22adfb4373	Mark sdivs as 'exact' instead of lowering them ourselves LLVM's instcombine already translates power-of-two sdivs that are known to be exact to fast ashr instructions. Hence, there is no need to add this logic ourselves. Pointed-out-by: Johannes Doerfert llvm-svn: 239025	2015-06-04 07:45:09 +00:00
Tobias Grosser	5cf7860704	Ensure memory access mappings are defined for full domain We now verify that memory access functions imported via JSON are indeed defined for the full iteration domain. Before this change we accidentally imported memory mappings such as i -> i / 127, which only defined a mapped for values of i that are evenly divisible by 127, but which did not define any mapping for the remaining values, with the result that isl just generated an access expression that had undefined behavior for all the unmapped values. In the incorrect test cases, we now either use floor(i/127) or we use p/127 and provide the information that p is indeed a multiple of 127. llvm-svn: 239024	2015-06-04 07:44:35 +00:00
Tobias Grosser	244c8297cf	Lower signed-divisions without rounding to ashr instructions llvm-svn: 238929	2015-06-03 15:14:58 +00:00
Tobias Grosser	cb73f150d4	Translate power-of-two floor-division into ashr Power-of-two floor divisions can be translated into an arithmetic shift operation. This allows us to replace a complex lowering that requires division operations: %pexp.fdiv_q.0 = sub i64 %21, 128 %pexp.fdiv_q.1 = add i64 %pexp.fdiv_q.0, 1 %pexp.fdiv_q.2 = icmp slt i64 %21, 0 %pexp.fdiv_q.3 = select i1 %pexp.fdiv_q.2, i64 %pexp.fdiv_q.1, i64 %21 %pexp.fdiv_q.4 = sdiv i64 %pexp.fdiv_q.3, 128 with a simple ashr: %polly.fdiv_q.shr = ashr i64 %21, 7 llvm-svn: 238905	2015-06-03 06:31:30 +00:00
Tobias Grosser	cdb38e5625	Exploit non-negative numerators isl marks known non-negative numerators in modulo (and soon also division) operations. We now exploit this by generating unsigned operations. This is beneficial as unsigned operations with power-of-two denominators will be translated by isl to fast bitshift or bitwise and operations. llvm-svn: 238577	2015-05-29 17:08:19 +00:00
Tobias Grosser	268205939f	Make use of scalar/phi code generation explicit in the tests This ensures we pass all tests independently of how we set the options -disable-polly-intra-scop-scalar-to-array and -polly-model-phi-nodes. (At least if we enable both or disable both. Enabling them individually makes little sense, as they will hopefully disappear soon anyhow). llvm-svn: 238087	2015-05-23 03:34:35 +00:00
Johannes Doerfert	ecff11dcfb	Add scalar and phi code generation To reduce compile time and to allow more and better quality SCoPs in the long run we introduced scalar dependences and PHI-modeling. This patch will now allow us to generate code if one or both of those options are set. While the principle of demoting scalars as well as PHIs to memory in order to communicate their value stays the same, this allows to delay the demotion till the very end (the actual code generation). Consequently: - We __almost__ do not modify the code if we do not generate code for an optimized SCoP in the end. Thus, the early exit as well as the unprofitable option will now actually preven us from introducing regressions in case we will probably not get better code. - Polly can be used as a "pure" analyzer tool as long as the code generator is set to none. - The original SCoP is almost not touched when the optimized version is placed next to it. Runtime regressions if the runtime checks chooses the original are not to be expected and later optimizations do not need to revert the demotion for that part. - We will generate direct accesses to the demoted values, thus there are no "trivial GEPs" that select the first element of a scalar we demoted and treated as an array. Differential Revision: http://reviews.llvm.org/D7513 llvm-svn: 238070	2015-05-22 23:43:58 +00:00
Tobias Grosser	5db5d2da13	Use base-pointer address space when creating new access functions llvm-svn: 237785	2015-05-20 11:02:12 +00:00
Sunil Srivastava	19be68f088	Changed renaming of local symbols by inserting a dot before the numeric suffix. Modified two test cases to adjust to the above change in renaming. These two files were causing the buildbot failure in Polly, #30204 for example. Details in http://reviews.llvm.org/D9483 This checkin goes with r237150 and r237151 llvm-svn: 237203	2015-05-12 22:44:24 +00:00
Tobias Grosser	09d3069740	Rename IslCodeGeneration to CodeGeneration Besides class, function and file names, we also change the command line option from -polly-codegen-isl to just -polly-codegen. The isl postfix is a leftover from the times when we still had the CLooG based -polly-codegen. Today it is just redundant and we drop it. llvm-svn: 237099	2015-05-12 07:45:52 +00:00
Duncan P. N. Exon Smith	ddf3a0ef38	Update polly for LLVM rename of debug info metadata with DI* prefix Ran the same rename-md-di-prefix.sh script attached to PR23080 as in LLVM r236120 and CFE r236121. llvm-svn: 236127	2015-04-29 17:02:14 +00:00
Tobias Grosser	6325cd2fcd	Remove flag '-polly-annotate-alias-scopes' This option is enabled since a long time and there does not seem to be a situation in which we would not want to print alias scopes. Remove this option to reduce the set of command-line option combinations that may expose bugs. llvm-svn: 235861	2015-04-27 10:43:10 +00:00
Tobias Grosser	173ecab705	Remove target triples from test cases I just learned that target triples prevent test cases to be run on other architectures. Polly test cases are until now sufficiently target independent to not require any target triples. Hence, we drop them. llvm-svn: 235384	2015-04-21 14:28:02 +00:00
David Blaikie	556ffb7806	[opaque pointer types] Explicit non-pointer type for call expressions (migration for recent LLVM change to textual IR for calls) llvm-svn: 235146	2015-04-16 23:24:52 +00:00
Tobias Grosser	eb18649ead	Sign-extend in case of non-matching bitwidth This change ensures that we sign-extend integer types in case non-matching operands are encountered when generating a multi-dimensional access offset. This fixes http://llvm.org/PR23124 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234122	2015-04-05 17:36:42 +00:00
Tobias Grosser	7527e3f59c	Do not use the POLLY vector code generator if only strip-mining is requested This fixes http://llvm.org/PR23127 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234113	2015-04-05 06:53:21 +00:00
Tobias Grosser	fe4bb1c81b	[tests] Use -polly-vectorizer=polly directly instead of defining a lit variable %vector-opt. llvm-svn: 234112	2015-04-05 06:53:11 +00:00
Tobias Grosser	619190d5a7	Delinearization of expressions that contain array size parameters This allows us to delinerize code such as: A[][n] for (i for (j A[i][n-j-1] = ... which would previously have been delinearize to an access A[i+1][-j-1]. To recover the correct access we apply the piecewise expression: { A[i][j] -> A[i-1][i+N]: i < 0; A[i][j] -> A[i][i]: i >= 0} This approach generalizes to higher dimensions. llvm-svn: 233566	2015-03-30 17:22:28 +00:00
Tobias Grosser	6794238c70	Code generate parameters and run-time checks after branching new code region When creating parameters the SCEVexpander may introduce new induction variables, that possibly create scalar dependences in the original scop, before we code generate the scop. The resulting scalar dependences may then inhibit correct code generation of the scop. To prevent this, we first version the code without a run-time check and only then introduce new parameters and the run-time condition. The if-condition that guards the original scop from being modified by the SCEVexpander. This change causes some test case changes as the run-time conditions are now introduced in the split basic block rather than in the entry basic block. This fixes http://llvm.org/PR22069 Test case reduced by: Karthik Senthil llvm-svn: 233477	2015-03-28 09:34:40 +00:00
Tobias Grosser	17778eb826	Drop redundant run line in check llvm-svn: 233476	2015-03-28 09:34:34 +00:00
Tobias Grosser	2873645c51	Drop -polly-vectorizer-unroll-only option This options was earlier used for experiments with the vectorizer, but to my knowledge is not really used anymore. If anybody needs this, we can always reintroduce this feature. llvm-svn: 232934	2015-03-23 07:00:36 +00:00
David Blaikie	4a54fae8cb	Test case updates for explicit type parameter to the gep operator llvm-svn: 232186	2015-03-13 18:21:20 +00:00
Tobias Grosser	bb4126470a	Drop option to prepare code for the BB vectorizer The BB vectorizer is deprecated and there is no point in generating code for it any more. This option was introduced when there was not yet any loop vectorizer in sight. Now being matured, Polly should target the loop vectorizer. llvm-svn: 232099	2015-03-12 20:47:58 +00:00
Tobias Grosser	90078c5580	Add sign-extension during codegen of index expressions When code generating array index expressions the types of the different components of the index expressions may not always match. We extend the type of the index expression (if possible) and assert otherwise. llvm-svn: 231592	2015-03-08 15:08:32 +00:00
David Blaikie	23f94dfdf4	Update Polly tests for the great metadata schema change llvm-svn: 231089	2015-03-03 18:17:26 +00:00
David Blaikie	c94eca0546	Update Polly tests to handle explicitly typed load changes in LLVM. llvm-svn: 230796	2015-02-27 21:22:50 +00:00
David Blaikie	d7b6aa3251	Update one test I missed when updating for the opaque pointer gep changes to LLVM. llvm-svn: 230792	2015-02-27 20:43:19 +00:00
David Blaikie	bad3ff207f	Update Polly tests to handle explicitly typed gep changes in LLVM llvm-svn: 230784	2015-02-27 19:20:19 +00:00
Johannes Doerfert	514f6efa2b	[FIX] Teach RegionGenerator to respect and update dominance When we generate code for a whole region we have to respect dominance and update it too. The first is achieved with multiple "BBMap"s. Each copied block in the region gets its own map. It is initialized only with values mapped in the immediate dominator block, if this block is in the region and was therefor already copied. This way no values defined in a block that doesn't dominate the current one will be used. To update dominance information we check if the immediate dominator of the original block we want to copy is in the region. If so we set the immediate dominator of the current block to the copy of the immediate dominator of the original block. llvm-svn: 230774	2015-02-27 18:29:04 +00:00
Tobias Grosser	f72bdbfbb1	Use isl_ast_expr_call to create run-time checks isl recently introduced a new interface to create run-time checks from constraint sets. Use this interface to simplify our run-time check generation. llvm-svn: 230640	2015-02-26 15:21:10 +00:00
Johannes Doerfert	275a1756ad	Allow non-affine control flow -- Code Generation This is the code generation for region statements that are created when non-affine control flow was present in the input. A new generator, similar to the block or vector generator, for regions is used to traverse and copy the region statement and to adjust the control flow inside the new region in the end. llvm-svn: 230340	2015-02-24 16:16:32 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
Tobias Grosser	1fa7b972c0	Update to isl 99d53692ba This commit imports the latest isl version into lib/External/isl. The changes relavant for Polly are: 1) Schedule trees [1] have been introduced as a more structured way to describe schedules. Polly does not yet use them, but we may switch to them in the near future. 2) Another set of coalescing changes [2] simplifies some data dependences and removes a couple of code generation artifacts. We now understand that the following sets can be merged: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i0 >= 0 and i1 <= 1023 - i0 and i1 >= 1 Stmt_S1[i0, 0] -> Stmt_S2[i0] : i0 <= 1023 and i0 >= 1} into: { Stmt_S1[i0, i1] -> Stmt_S2[i0 + i1] : i1 <= 1023 - i0 and i1 >= 0 and i1 >= 1 - i0 and i0 >= 0 } Changes of this kind reduce unnecessary specialization during code generation. - for (int c3 = 0; c3 <= 1023; c3 += 1) { - if (c3 % 2 == 0) { - Stmt_for_body3(c1, c3); - } else - Stmt_for_body3(c1, c3); - } + for (int c3 = 0; c3 <= 1023; c3 += 1) + Stmt_for_body3(c1, c3); [1] http://impact.gforge.inria.fr/impact2014/papers/impact2014-verdoolaege.pdf [2] http://impact.gforge.inria.fr/impact2015/papers/impact2015-verdoolaege.pdf llvm-svn: 229423	2015-02-16 19:33:40 +00:00
Johannes Doerfert	d594aeb248	[FIX] Fix test case that was affected by the early exit patch llvm-svn: 228865	2015-02-11 19:11:57 +00:00
Tobias Grosser	a906ee754d	Drop an assert and XFAIL two test cases This gets the buildbot green to avoid further emails. Johannes will fix this later in the evening. llvm-svn: 228862	2015-02-11 18:46:33 +00:00
Johannes Doerfert	7ceb040213	Add early exits for SCoPs we did not optimize This allows us to skip ast and code generation if we did not optimize a SCoP and will not generate parallel or alias annotations. The initial heuristic to exit is simple but allows improvements later on. All failing test cases have been modified to disable early exit, thus to keep their coverage. Differential Revision: http://reviews.llvm.org/D7254 llvm-svn: 228851	2015-02-11 17:25:09 +00:00
Tobias Grosser	eb29c68df2	Add test case for r227805 llvm-svn: 227970	2015-02-03 15:11:02 +00:00
Johannes Doerfert	535ee97853	[FIX] Updated test case (fixed names -> regular expressions) llvm-svn: 227807	2015-02-02 16:13:36 +00:00
Johannes Doerfert	9282076ece	[NFC] Drop the "scattering" tuple name llvm-svn: 227801	2015-02-02 13:45:54 +00:00
Johannes Doerfert	3a3799e43a	[FIX] Activated a pointer test and removed obsolete comment llvm-svn: 227524	2015-01-30 00:36:13 +00:00
Johannes Doerfert	cf0e05a58f	[FIX] Correct two C snippets in test cases llvm-svn: 227407	2015-01-29 00:50:46 +00:00
Johannes Doerfert	ef61def9d5	[FIX] Handle pointer-pointer comparisons This should fix a problem introduced by r225464. llvm-svn: 227404	2015-01-29 00:41:33 +00:00
Johannes Doerfert	07e8a406d6	[FIX] Independent blocks with intrinsics handling Also an old option was removed from some new test cases llvm-svn: 227057	2015-01-25 19:09:49 +00:00
Johannes Doerfert	3f500fa2f6	Support for math/misc intrinsics The support is currently limited as we only allow them in the input but do not emit them in the transformed SCoP due to the possible semantic changes. Differential Revision: http://reviews.llvm.org/D5225 llvm-svn: 227054	2015-01-25 18:07:30 +00:00
Tobias Grosser	be30c2c56e	Adjust to the new explicit debug metadata This fixes the outfall of r226048 llvm-svn: 226134	2015-01-15 07:02:12 +00:00
Tobias Grosser	c642e95402	Use types of matching size when generating multi-dimensional address expressions This change ensures that the values that represent the array size of a multi-dimensional access are correctly sign-extended when used to compute a memory address used in the run-time alias check. To make the test case more readable, we name the instructions that we generate. llvm-svn: 225818	2015-01-13 19:37:59 +00:00
Tobias Grosser	0a092763e7	Adjust test for the new 'distinct' metadata nodes 'distinct' was introduced in 225474. We now adjust the test cases to match for the additional 'distinct' marker. llvm-svn: 225512	2015-01-09 08:10:36 +00:00
Tobias Grosser	55bc4c0767	Add support for pointer types in expressions llvm-svn: 225464	2015-01-08 19:26:53 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value accross all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Duncan P. N. Exon Smith	39e21f9c27	Hand-modify a testcase (still PR21532) Bot was still tripping [1] on a testcase the upgrade script didn't handle in 224269. This is still fallout from r224257. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25435 llvm-svn: 224280	2014-12-15 21:43:20 +00:00
Duncan P. N. Exon Smith	bd62edb20d	Run upgrade script from PR21532 to match LLVM changes Update tests for LLVM assembly format change in r224257 using the script attached to PR21532. I'm hoping this unsticks the bot [1]. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25432 llvm-svn: 224269	2014-12-15 20:28:50 +00:00
Tobias Grosser	13e222ca55	Update to the latest version of isl Isl now specifically marks modulo operations that are compared against zero. They can be implemented with the C/LLVM remainder operation. We also update a couple of test cases where the output of isl has slightly changed. llvm-svn: 223607	2014-12-07 16:04:29 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as prerequisity. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop the support the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Tobias Grosser	154d9469f4	Add PreHeader always to OuterLoop This fixes a bug introduce in r217525. llvm-svn: 222766	2014-11-25 17:09:21 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only every executed with this set of parameter values. Example: void foo(float A[][20], long n, long m { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	bab3568105	Modify test cases to work with SCEV based code generation This patch includes tests where we actually need to adjust the CHECK lines for SCEV based code generation. Besides these adjustments we add explicit calls to -polly-codegen-scev=[true\|false] and make sure we test both cases. llvm-svn: 222112	2014-11-16 22:43:21 +00:00
Tobias Grosser	95cd1c718e	Make usage of scev based code generation explicit in tests This is in preparation of using SCEV based codegen by default in polly llvm-svn: 222111	2014-11-16 21:43:28 +00:00
Tobias Grosser	2f8732e7c6	Independent blocks: SE->forget() scalars translated to arrays This prevents SCEVs to reference values not valid any more and as a consequence solves a bug where such values reintroduced during ast generation caused the independent blocks pass to fail validation. http://llvm.org/PR21204 llvm-svn: 222103	2014-11-16 20:33:58 +00:00
Tobias Grosser	b05b038b81	Switch default code generation backend to isl The isl based backend has been tested since a long time and with the recently commited OpenMP support the last missing piece of functionality was ported from the CLooG backend. The isl based backend gives us interesting new functionality: - Run-time alias checks (enabled by default) Optimize scops that contain possibly aliasing pointers. This feature has largely increased the number of loop nests we consider for optimization. Thanks Johannes! - Delinearization (not yet enabled by default) Model accesses to multi-dimensional arrays precisely. This will allow us to understand kernels with multi-dimensional VLAs written in Julia, boost::ublas, coremark or C99. Thanks Sebastian! - Generation of higher quality code Sven and me spent a long time to optimize the quality of the generated code. A major focus were expressions as they result from modulos/divisions or piecewise affine expressions (a ? b : c). - Full/Partial tile separation, polyhedral unrolling The isl code generation provides functionality to generate specialized code for core and cleanup loops and to specialize code using polyhedral context information while unrolling statements. (not yet exploited in Polly) - Modifieable access functions We can now use standard isl functionality to remap memory accesses to new data locations. A standard use case is the use of shared memory, where accesses to a larger region in global memory need to be mapped to a smaller shared memory region using a modulo mapping. (not yet exploited in Polly) The cloog based code generation is still available for comparision, but is scheduled for removal. llvm-svn: 222101	2014-11-16 17:02:11 +00:00
Tobias Grosser	bf34f1d2b2	Introduce minimalistic cost model for auto parallelization Instead of parallelizing every parallel outermost loop, we now use a very minimalistic cost model. Specifically, we assume innermost loops are not worth parallelising and all non-innermost loops are. When parallelizing all loops in LNT we got several slowdowns/timeouts due to us parallelizing innermost loops that are executed only a couple of times (number of iterations not known statically). With this basic heuristic enabled LNT does not show any more timeouts, while several interesting loops are still parallelized. There are many ways to obtain an improved heuristic. Constructing such an improvide heuristic from a position of minimal slow-down and zero code size increase seems to be the best, as it allows us to track progress on LNT. llvm-svn: 222096	2014-11-16 14:24:53 +00:00
Tobias Grosser	d1c12e65cd	Remove one incomplete test case accidentally committed llvm-svn: 222089	2014-11-15 21:34:34 +00:00
Tobias Grosser	e3c0558e35	Add OpenMP code generation to isl backend This backend supports besides the classical code generation the upcoming SCEV based code generation (which the existing CLooG backend does not support robustly). OpenMP code generation in the isl backend benefits from our run-time alias checks such that the set of loops that can possibly be parallelized is a lot larger. The code was tested on LNT. We do not regress on builds without -polly-parallel. When using -polly-parallel most tests work flawlessly, but a few issues still remain and will be addressed in follow up commits. SCEV/non-SCEV codegen: - Compile time failure in ldecod and TimberWolfMC due a problem in our run-time alias check generation triggered by pointers that escape through the OpenMP subfunction (OpenMP specific). - Several execution time failures. Due to the larger set of loops that we now parallelize (compared to the classical code generation), we currently run into some timeouts in tests with a lot loops that have a low trip count and are slowed down by parallelizing them. SCEV only: - One existing failure in lencod due to llvm.org/PR21204 (not OpenMP specific) OpenMP code generation is the last feature that was only available in the CLooG backend. With the isl backend being the only one supporting features such as run-time alias checks and delinearization, we will soon switch to use the isl ast generator by the default and subsequently remove our dependency on CLooG. http://reviews.llvm.org/D5517 llvm-svn: 222088	2014-11-15 21:32:53 +00:00
David Peixotto	a4817871d2	Safely generate new loop metadata node Polly was accidently modifying a debug info metadata node when attempting to generate a new unique metadata node for the loop id. The problem was that we had dwarf metadata that referred to a metadata node with a null value, like this: !6 = ... some dwarf metadata referring to !7 ... !7 = {null} When we attempt to generate a new metadata node, we reserve the first space for self-referential node by setting the first argument to null and then mutating the node later to refer to itself. However, because the nodes are uniqued based on pointer values, when we get the new metadata node it actually referred to an existing node (!7 in the example). When we went to modify the metadata to point to itself, we were accidently mutating the dwarf metatdata. We ended up in this situation: !6 = ... some dwarf metadata referring to !7 ... !7 = {!7} and this causes an assert when generating the debug info. The fix is simple, we just need to use a unique value when getting a new metadata node. The MDNode::getTemporary() provides exactly the API we need (and it is used in clang to generate the unique nodes). Differential Revision: http://reviews.llvm.org/D6174 llvm-svn: 221550	2014-11-07 21:44:18 +00:00
Tobias Grosser	8b5344fda2	Explicitly annotate loops we want to run thread-parallel We introduces a new flag -polly-parallel and use it to annotate the for-nodes in the isl ast that we want to execute thread parallel (e.g., using OpenMP). We previously already emmitted openmp annotations, but we did this for various kinds of parallel loops, including some which we can not run in parallel. With this patch we now have three annotations: 1) #pragma known-parallel [reduction] 2) #pragma omp for 3) #pragma simd meaning: 1) loop has no loop carried dependences 2) loop will be executed thread-parallel 3) loop can possibly be vectorized This patch introduces 1) and reduces the use of 2) to only the cases where we will actually generate thread parallel code. It is in preparation of openmp code generation in our isl backend. Legacy: - We also have a command line option -enable-polly-openmp. This option controls the OpenMP code generation in CLooG. It will become an alias of -polly-parallel after the CLooG code generation has been dropped. http://reviews.llvm.org/D6142 llvm-svn: 221479	2014-11-06 19:35:21 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
Johannes Doerfert	9b5786960d	Relax the condition on the jsop accesses regarding the alignment. We restricted the new access functions to be a subset of the old one because we want to keep the alignment, however if the alignment is "not special", thus the default for the type, we can allow any access. Differential Revision: http://reviews.llvm.org/D5680 llvm-svn: 219503	2014-10-10 15:14:29 +00:00
Johannes Doerfert	341a15a64b	Use the new access function (if present) to compute the access stride. Differential Revision: http://reviews.llvm.org/D5661 llvm-svn: 219499	2014-10-10 14:28:46 +00:00
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the pieceweise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two thing: 1) Handle the case the condition of a select is not a i1 type like C. 2) Try to simplify the pieceweise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constrains. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class allows to store information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Duncan P. N. Exon Smith	52fd68980c	DI: LLVM schema change: fold constants into string Update debug info testcases for the LLVM metadata schema change in r219010 to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219019	2014-10-03 21:08:48 +00:00
Johannes Doerfert	a441783544	[Fix] Accidently changed the type of a libgomp argument in r219003. Only subsequent patches introduced tests for the signature in the generated IR, thus the tests were wrong too and are adjusted now. llvm-svn: 219017	2014-10-03 20:40:24 +00:00
Johannes Doerfert	990cd4c2e2	Add option to limit the maximal number of parallel threads. Differential Revision: http://reviews.llvm.org/D5581 llvm-svn: 219004	2014-10-03 19:11:10 +00:00
Johannes Doerfert	87901453d9	Align copied load/store instructions as the original. This also forbids the json importer to access other memory locations than the original instruction as we to reuse the alignment of the original load/store. Differential Revision: http://reviews.llvm.org/D5560 llvm-svn: 218883	2014-10-02 16:22:19 +00:00
Johannes Doerfert	ecdf263c07	Allow to annotate alias scopes in the new SCoP. The command line flag -polly-annotate-alias-scopes controls whether or not Polly annotates alias scopes in the new SCoP (default ON). This can improve later optimizations as the new SCoP is basically an alias free environment for them. llvm-svn: 218877	2014-10-02 15:31:24 +00:00
Adrian Prantl	e6579cd9a6	Update testcase to new intrinsic format llvm-svn: 218806	2014-10-01 20:40:12 +00:00
Johannes Doerfert	c7b719fc03	Annotate LLVM-IR for all parallel loops This change allows to annotate all parallel loops with loop id metadata. Furthermore, it will annotate memory instructions with llvm.mem.parallel_loop_access metadata for all surrounding parallel loops. This is especially usefull if an external paralleliser is used. This also removes the PollyLoopInfo class and comments the LoopAnnotator. A test case for multiple parallel loops is attached. llvm-svn: 218793	2014-10-01 20:10:44 +00:00
Johannes Doerfert	13771738d3	[RTC] Split alias groups according to read only base addresses If there are multiple read only base addresses in an alias group we can split it into multiple alias groups each with only one read only access. This way we might reduce the number of comparisons significantly as it grows linear in the number of alias groups but exponential in their size. Differential Revision: http://reviews.llvm.org/D5435 llvm-svn: 218757	2014-10-01 12:40:46 +00:00
Tobias Grosser	f8a678d2fd	Build domtree of new loops correctly This fixes a bug introduced in r217525. llvm-svn: 218581	2014-09-28 22:40:36 +00:00
Johannes Doerfert	77bd5ae3d9	[Fix] Allow pointer types as access elements and compare them correctly This fixes two problems which are usualy caused together: 1) The elements of an isl AST access expression could be pointers not only integers, floats and vectores thereof. 2) The runtime alias checks need to compare pointers but if they are of a different type we need to cast them into a "max" type similar to the non pointer case. llvm-svn: 218113	2014-09-19 08:49:02 +00:00
Johannes Doerfert	b9fb5a2cc6	[RTC] Runtime Alias Checks for the ISL backend (missing tests) Test files missing in r218046. llvm-svn: 218047	2014-09-18 11:20:36 +00:00
Johannes Doerfert	b164c795b7	[RTC] Runtime Alias Checks for the ISL backend This change will build all alias groups (minimal/maximal accesses to possible aliasing base pointers) we have to check before we can assume an alias free environment. It will also use these to create Runtime Alias Checks (RTC) in the ISL code generation backend, thus allow us to optimize SCoPs despite possibly aliasing pointers when this backend is used. This feature will be enabled for the isl code generator, e.g., --polly-code-generator=isl, but disabled for: - The cloog code generator (still the default). - The case delinearization is enabled. - The case non-affine accesses are allowed. llvm-svn: 218046	2014-09-18 11:17:17 +00:00
Johannes Doerfert	b7e4083599	Updated to isl 2c19ecd444095d6f560349018f68993bc0e03691 Changed test cases and fixed warnings. llvm-svn: 218043	2014-09-18 11:13:35 +00:00
Johannes Doerfert	0fe35dd088	[Fix] Rewire the Region after a unconditional entry edge is created We use SplitEdge to split a conditional entry edge of the SCoP region. However, SplitEdge can cause two different situations (depending on whether or not the edge is critical). This patch tests which one is present and deals with the former unhandled one. It also refactors and unifies the case we have to change the basic blocks of the SCoP to new ones (see replaceScopAndRegionEntry). llvm-svn: 217802	2014-09-15 18:34:45 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IstAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Tobias Grosser	0ef617dda0	Remove executable bit on test files Some test files had been marked executable by accident. llvm-svn: 217663	2014-09-12 09:07:50 +00:00
Johannes Doerfert	dd5c144246	Allow to generate a loop without the GuardBB This allows us to omit the GuardBB in front of created loops if we can show the loop trip count is at least one. It also simplifies the dominance relation inside the new created region. A GuardBB (even with a constant branch condition) might trigger false dominance errors during function verification. Differential Revision: http://reviews.llvm.org/D5297 llvm-svn: 217525	2014-09-10 17:33:32 +00:00
Johannes Doerfert	3826224428	[Refactor] Cleanup isl code generation Summary: + Refactor the runtime check (RTC) build function + Added helper function to create an PollyIRBuilder + Change the simplify region function to create not only unique entry and exit edges but also enfore that the entry edge is unconditional + Cleaned the IslCodeGeneration runOnScop function: - less post-creation changes of the created IR + Adjusted and added test cases Reviewers: grosser, sebpop, simbuerg, dpeixott Subscribers: llvm-commits, #polly Differential Revision: http://reviews.llvm.org/D5076 llvm-svn: 217508	2014-09-10 14:50:23 +00:00
Johannes Doerfert	8e95dc657e	[Fix] OpenMP parallel loop detection for the isl backend There was a bug in the IslAst which caused that no more outermost parallel loops were detected/checked after a parallel outermost loop of depth 1. + Test case attached llvm-svn: 217452	2014-09-09 17:03:54 +00:00
Tobias Grosser	e7e33ba13a	Always pipe in test files In Polly we used to have a mix of test cases, some that used 'opt %s' and others that used 'opt < %s'. We now change all to use 'opt < %s'. Piping in test files is preferable as it does prevent temporary files to be written to disk. This brings us in line with what is usus in LLVM. llvm-svn: 216816	2014-08-30 09:15:04 +00:00
Tobias Grosser	2faa569c0a	Replace %defaultOpts with explicit pass names This replaces the use of %defaultOpts = '-basicaa -polly-prepare' with the minimal set of passes necessary for a test to succeed. Of the test cases that previously used %defaultOpts 76 test cases require none of these passes, 42 need -basicaa and only 2 need -polly-prepare. Our change makes this requirement explicit. In Polly many test cases have been using a macro '%defaultOpts' which run a couple of preparing passes before the actual Polly test case. This macro was introduced very early in the development of Polly and originally contained a large set of canonicalization passes. However, as the need for additional canonicalization passes makes test cases harder to understand and also more fragile in terms of changes in such passes, we aim since a longer time to only include the minimal set of passes necessary. This patch removes the last leftovers from of %defaultOpts and brings our tests cases more in line to what is usus in LLVM itself. llvm-svn: 216815	2014-08-30 09:13:28 +00:00
Tobias Grosser	2873594709	Revert "[Refactor] Cleanup runtime code generation" This reverts commit 215466 (and 215528, a trivial formatting fix). The intention of these commits is a good one, but unfortunately they broke our LNT buildbot: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly-codegen-isl Several of the cleanup changes that have been combined in this 'fixup' are trivial and could probably be committed as obvious changes without risking to break the build. The remaining changes are little and it should be easy to figure out what went wrong. llvm-svn: 215817	2014-08-16 09:09:15 +00:00
Johannes Doerfert	9744c4af16	[Refactor] Cleanup runtime code generation + Use regexp in two test case. + Refactor the runtime condition build function llvm-svn: 215466	2014-08-12 18:35:54 +00:00
Tobias Grosser	f57d63f906	Do allow negative offsets in the outermost array dimension There is no needed for neither 1-dimensional nor higher dimensional arrays to require positive offsets in the outermost array dimension. We originally introduced this assumption with the support for delinearizing multi-dimensional arrays. llvm-svn: 214665	2014-08-03 21:07:30 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	b5d1c322f2	Update the jscop tests and port them to isl codegen. The updated tests use a different context than the old ones did. Other than that only their path and the code generation we use changed. llvm-svn: 214657	2014-08-03 01:48:49 +00:00
Johannes Doerfert	b41344a88f	[Fix] Annotate the IslAst with broken reductions (Missing files) + test cases of r214489. llvm-svn: 214491	2014-08-01 08:20:26 +00:00
Johannes Doerfert	0eefb0258f	[Refactor] Use nicer print callback function in IslAst llvm-svn: 214447	2014-07-31 21:33:49 +00:00
Rafael Espindola	08dfd8f25f	Update for llvm change. llvm-svn: 214358	2014-07-30 23:17:15 +00:00
Tobias Grosser	924e9e0226	IslAst: Enhance parallelism detection test Add more check lines to ensure we do not accidentally generate nested openmp parallel annotations. llvm-svn: 214200	2014-07-29 19:22:46 +00:00
Johannes Doerfert	af9b1e2d80	[Refactor] Remove containsLoop to find innermost loops Use the fact that if we visit a for node first in pre and next in post order we know we did not visit any children, thus we found an innermost loop. + Test case for an innermost loop with a conditional inside llvm-svn: 213870	2014-07-24 15:59:06 +00:00
Johannes Doerfert	f6583176ab	[Refactor] Unify IslAst print methods + Add const annotations to some member functions llvm-svn: 213779	2014-07-23 18:14:43 +00:00
Johannes Doerfert	457f73eaee	Annotate reduction parallel loops in the IslAst textual output + Introduced dependency type TYPE_TC_RED to represent the transitive closure (& the reverse) of reduction dependences. These are used when we check for reduction parallel loops. + Test cases including loop reversals and modulo schedules which compute reductions in a alternated order. llvm-svn: 213019	2014-07-15 00:00:35 +00:00
Tobias Grosser	5e6813d184	Derive run-time conditions for delinearization As our delinearization works optimistically, we need in some cases run-time checks that verify our optimistic assumptions. A simple example is the following code: void foo(long n, long m, long o, double A[n][m][o]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 150; j++) for (long k = 0; k < 200; k++) A[i][j][k] = 1.0; } After clang linearized the access to A and we delinearized it again to A[i][j][k] we need to ensure that we do not access the delinearized array out of bounds (this information is not available in LLVM-IR). Hence, we need to verify the following constraints at run-time: CHECK: Assumed Context: CHECK: [o, m] -> { : m >= 150 and o >= 200 } llvm-svn: 212198	2014-07-02 17:47:48 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Rafael Espindola	80f20133d4	Fix polly tests to not include aliases to declarations. llvm-svn: 203721	2014-03-12 21:48:42 +00:00
Tobias Grosser	37c9b8e0f2	Emit llvm.loop metadata for parallel loops For now we only mark innermost loops for the loop vectorizer. We could later also mark not-innermost loops to enable the introduction of openmp parallelism. llvm-svn: 202854	2014-03-04 14:59:00 +00:00
Tobias Grosser	472d3b7037	codegen: Update LoopInfo correctly Add the 'polly.start' basic block to the loop that surrounds the scop we just codegenerate. This fixes PR13441 llvm-svn: 202000	2014-02-24 00:50:49 +00:00
Tobias Grosser	88640d2b47	Use -polly-codegen-isl in isl-codegen test Reported-by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 201902	2014-02-21 23:08:54 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Tobias Grosser	54646f7fab	Remove other unneccessary uses of -O3 in the test suite The polly test suite is now -O3 clean. llvm-svn: 200205	2014-01-27 10:37:06 +00:00
Tobias Grosser	a7fea8386c	Do not run -O3 to canonicalize test case This is not only not necessary, but in case -03 changes this can actually cause arbitrarily failing test cases such as, e.g., a recent change by Chandler that caused -O3 to unroll the loop body, which made the loop we wanted to detect disappear and consequently this test case fail. llvm-svn: 200204	2014-01-27 10:23:12 +00:00
Tobias Grosser	54ee0ba74d	IslCodegen: Support for run-time conditions llvm-svn: 194948	2013-11-17 03:18:25 +00:00
Tobias Grosser	e86109f508	ScopInfo: Add support for AssumedContext When constructing a scop sometimes the exact representation of a statement or condition would be very complex, but there is a common case which is a lot simpler, but which is only valid under certain assumptions. The assumed context records the assumptions taken during the construction of this scop and that need to be code generated as a run-time test. At the moment, we do not yet model any assumptions, but only added the AssumedContext as well as the isl-ast generation support. As a next step, this needs to be hooked up with the isl code generation. if (1) /* run-time condition / { / optimized code / } else { / original code */ } llvm-svn: 193652	2013-10-29 21:05:49 +00:00
Sebastian Pop	20594a842c	use -polly-codegen-isl in tests under test/Isl llvm-svn: 192110	2013-10-07 16:43:04 +00:00
Tobias Grosser	3613fd7a35	ScopInfo: Correctly handle true/false conditions This is a modified version of the orignally contributed patch. Contributed-by: alexandre.isoard@gmail.com llvm-svn: 190237	2013-09-07 01:54:13 +00:00
Sebastian Pop	2c9ec2e651	scop detection: do not run scop detection on regions without loops otherwise, use -polly-detect-scops-in-regions-without-loops to also detect scops in regions without loops llvm-svn: 183113	2013-06-03 16:35:37 +00:00
Tobias Grosser	93324aef17	Test that independent block pass does not transform induction variables The original test case showed a problem with the independet blocks pass and we decided to XFAIL it for now. Unfortunately the failure is not detected if we build without asserts and the verification of the independent block pass is not run. This change tests now for the actual reason of the failure and should trigger even in a non asserts build. We did not yet solve the underlying bug, but this should at least make the test suite behavior consistent. llvm-svn: 183025	2013-05-31 17:44:38 +00:00
Sebastian Pop	8fe6d11b84	scop detection: only handle functions with loops to detect scops in functions with no loops, use -polly-detect-scops-in-functions-without-loops llvm-svn: 182941	2013-05-30 17:47:32 +00:00
Tobias Grosser	3081b0f5ec	Update LoopInfo correctly When the Polly code generation was written we did not correctly update the LoopInfo data, but still claimed that the loop information is correct. This does not only lead to missed optimizations, but it can also cause miscompilations in case passes such as LoopSimplify are run after Polly. Reported-by: Sergei Larin <slarin@codeaurora.org> llvm-svn: 181987	2013-05-16 06:40:24 +00:00
Tobias Grosser	5db6ffd76f	LoopGenerators: Construct loops such that they are already loop rotated BeforeBB \| v GuardBB / \ __ PreHeaderBB \ / \ / \| latch HeaderBB \| \ / \ / < \ / \ / ExitBB This does not only remove the need for an explicit loop rotate pass, but it also gives us the possibility to skip the construction of the guard condition in case the loop is known to be executed at least once. We do not yet exploit this, but by implementing this analysis in the isl code generator we should be able to remove more guards than the generic loop rotate pass can. Another point is that loop rotation can introduce additional PHI nodes, which may hide that a loop can be executed in parallel. This change avoids this complication and will make it easier to move the openmp code generation into a separate pass. llvm-svn: 181986	2013-05-16 06:40:06 +00:00
Tobias Grosser	e8df5bd92b	IndependentBlocks: We can only reconstruct PHI nodes that are within the ScoP In the classical (non -polly-codegen-scev) mode, we assume that we can always recreate PHI nodes during code generation. This is not true. We can only reconstruct them from the polyhedral information, in case the entire loop of the PHI node is part of the SCoP and consequently the PHI node was translated in the polyhedral description. llvm-svn: 179674	2013-04-17 07:20:36 +00:00
Tobias Grosser	b5f92892d1	Remove unneeded RegionSimplify pass. We now support regions with multiple entries and multiple exits natively. Regions are not needed to be simplified to single entry and single exit. We need to XFAIL two test cases as this change increases the scop coverage and uncoveres two failures in the independent blocks pass. The first failure will be fixed in a subsequent commit, the second one is in the non-default -polly-codegen-scev mode and still needs to be fixed. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179673	2013-04-17 07:20:30 +00:00
Tobias Grosser	36a01b0a28	tests: Fix 'instruction does not dominate all its uses' error The LLVM-IR of this test case was apperently incorrect. llvm-svn: 179672	2013-04-17 07:20:17 +00:00
Tobias Grosser	8edce4ee62	Support SCoPs with multiple entry edges. Regions that have multiple entry edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / for_region This for_region contains two entry edges 'then' -> 'for_region' and 'else' -> 'for_region'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively when the region is in -loop-simplify form, which means the entry block should not be a loop header. Contributed by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179586	2013-04-16 08:04:42 +00:00
Tobias Grosser	0ee50f6ee4	Support SCoPs with multiple exit edges Regions that have multiple exit edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / after Region: if -> after This regions contains the bbs 'if', 'then', 'else', but not 'after'. It has two exit edges 'then' -> 'after' and 'else' -> 'after'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179159	2013-04-10 06:55:31 +00:00
Sebastian Pop	9ca6612731	IndependentBlocks: translate out of SSA all uses escaping the region llvm-svn: 179019	2013-04-08 13:05:41 +00:00
Tobias Grosser	369430ffca	codegen: properly instantiate SCEVs to the place where they are used Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When code generating S using scev based code generation, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S' and that we consequently do not know how to code generate. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10} and it can be code generated without troubles. llvm-svn: 177777	2013-03-22 23:42:53 +00:00
Tobias Grosser	8ff029ccf1	Add failing test case llvm-svn: 177645	2013-03-21 16:14:55 +00:00
Sebastian Pop	97cb813c29	Correct function to decide if a SCEV can be ignored When doing SCEV based code generation, we ignore instructions calculating values that are fully defined by a SCEV expression. The values that are calculated by this instructions are recalculated on demand. This commit improves the check to verify if certain instructions can be ignored and recalculated on demand. llvm-svn: 177313	2013-03-18 20:21:13 +00:00
Bill Wendling	83e9312ece	Use attributes references on call/invoke instructions. llvm-svn: 175881	2013-02-22 09:29:15 +00:00
Tobias Grosser	c92c8f06ec	[isl-codegen]: Fix off by one in getNumberOfIterations We need to remove one dimension. Any is correct as long as it exists. We have choosen for whatever reason the dimension #dims - 2. This is incorrect if there is just one dimension. For CLooG this case did never happen. For isl however, the case can happen and causes undefined behavior including crashes. We choose now always the last dimension #dims - 1. We could have choosen dimension '0' but the last dimension is what we remove conceptionally in the algorithm, so it seems better to actually program it that way. While at it remove another piece of undefined behavior. llvm-svn: 174894	2013-02-11 17:52:36 +00:00
Sebastian Pop	04c4ce32ae	isl: vector code generation based on ISL ast Original patch by Tobias Grosser, slightly modified by Sebastian Pop. llvm-svn: 170420	2012-12-18 07:46:13 +00:00
Sebastian Pop	e252c85545	isl: detect vector parallelism llvm-svn: 170138	2012-12-13 16:52:41 +00:00
Tobias Grosser	e36abf6d5d	isl: Detect openmp parallelism Based on code written by Riyadh Baghdadi. llvm-svn: 170102	2012-12-13 06:24:06 +00:00
Sebastian Pop	a267d9b829	adapt cloog codegen testcases to isl llvm-svn: 169161	2012-12-03 21:34:09 +00:00
Sebastian Pop	b08a52898a	execute cloog specific testcases only with CLOOG_FOUND llvm-svn: 169159	2012-12-03 21:33:40 +00:00

... 3 4 5 6 7 ...

408 Commits