llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	f4ee371e60	tests: Drop -polly-detect-unprofitable and -polly-no-early-exit These flags are now always passed to all tests and need to be disabled if not needed. Disabling these flags, rather than passing them to almost all tests, significantly simplfies our RUN: lines. llvm-svn: 249422	2015-10-06 15:36:44 +00:00
Tobias Grosser	4fdcf7b813	test: By default disable Polly's compile-time profitability heuristics llvm-svn: 249420	2015-10-06 15:30:26 +00:00
Tobias Grosser	935f62cf0d	tests: Explicitly state if profitability tests should be used Polly's profitability heuristic saves compile time by skipping trivial scops or scops were we know no good optimization can be applied. For almost all our tests this heuristic makes little sense as we aim for minimal test cases when testing functionality. Hence, in almost all cases this heuristic is better be disabled. In preparation of disabling Polly's compile time heuristic by default in the test suite we first explicitly enable it in the couple of test cases that really use it (or run with/without heuristic side-by-side). llvm-svn: 249418	2015-10-06 15:19:35 +00:00
Tobias Grosser	1ac26d06fe	test: Disable profitability heuristics to unfail LICM test case This test case was XFAILed under the assumption Polly is unable to detect the scop. However, disabling Polly's profitability heuristics is sufficient to detect this scop. llvm-svn: 249414	2015-10-06 15:10:19 +00:00
Tobias Grosser	d76603fbe7	test: sdiv in loop bounds is supported since a while By disabling our scop-profitability heuristics this becomes also visible in some older test cases. llvm-svn: 249411	2015-10-06 14:59:31 +00:00
Tobias Grosser	b73c695aba	tests: Drop outdated and unused lit variable llvm-svn: 249401	2015-10-06 13:50:20 +00:00
Johannes Doerfert	f17a78ef63	Remove non-executed statements during SCoP simplifcation A statement with an empty domain complicates the invariant load hoisting and does not help any subsequent analysis or transformation. In fact it might introduce parameter dimensions or increase the schedule dimensionality. To this end, we remove statements with an empty domain early in the SCoP simplification. llvm-svn: 249276	2015-10-04 15:00:05 +00:00
Johannes Doerfert	634909c2c9	[FIX] Domain generation for non-affine loops llvm-svn: 249275	2015-10-04 14:57:41 +00:00
Johannes Doerfert	f61df69423	[FIX] Count affine loops correctly The "unprofitable" heuristic was broken and counted boxed loops even though we do not represent and optimize them. llvm-svn: 249274	2015-10-04 14:56:08 +00:00
Johannes Doerfert	757a32b5b3	[FIX] Approximate non-affine loops correctly Before isValidCFG() could hide the fact that a loop is non-affine by over-approximation. This is problematic if a subregion of the loop contains an exit/latch block and is over-approximated. Now we do not over-approximate in the isValidCFG function if we check loop control. If such control is non-affine the whole loop is over-approximated, not only a subregion. llvm-svn: 249273	2015-10-04 14:54:27 +00:00
Johannes Doerfert	3e7d171866	[FIX] Repair broken commit The last invariant load fix was based on a later patch not polly/master, thus needs to be adjusted. llvm-svn: 249145	2015-10-02 15:35:03 +00:00
Johannes Doerfert	8930f4846c	[FIX] Do not hoist from inside a non-affine subregion We have to skip accesses in non-affine subregions during hoisting as they might not be executed under the same condition as the entry of the non-affine subregion. llvm-svn: 249139	2015-10-02 14:51:00 +00:00
Michael Kruse	cac948ef46	Earlier creation of ScopStmt objects This moves the construction of ScopStmt to the beginning of the ScopInfo pass. The late creation was a result of the earlier separation of ScopInfo and TempScopInfo. This will avoid introducing more ScopStmt-like maps in future commits. The AccFuncMap will also be removed in some future commit. DomainMap might also be included into ScopStmt. The order in which ScopStmt are created changes and initially creates empty statements that are removed in a simplification. Differential Revision: http://reviews.llvm.org/D13341 llvm-svn: 249132	2015-10-02 13:53:07 +00:00
Johannes Doerfert	911951f4f8	Hand down referenced & globally mapped values to the subfunction If a value is globally mapped (IslNodeBuilder::ValueMap) and referenced in the code that will be put into a subfunction, we hand down the new value to the subfunction. This patch also removes code that handed down all invariant loads to the subfunction. Instead, only needed invariant loads are given to the subfunction. There are two possible reasons for an invariant load to be handed down: 1) The invariant load is used in a block that is placed in the subfunction but which is not the parent of the load. In this case, the scalar access that will read the loaded value, will cause its base pointer (the preloaded value) to be handed down to the subfunction. 2) The invariant load is defined and used in a block that is placed in the subfunction. With this patch we will hand down the preloaded value to the subfunction as the invariant load is globally mapped to that value. llvm-svn: 249126	2015-10-02 13:11:27 +00:00
Johannes Doerfert	f56738041e	Make the SCoP generation resistent wrt. error blocks When error blocks are not terminated by an unreachable they have successors that might only be reachable via error blocks. Additionally, branches in error blocks are not checked during SCoP detection, thus we might not be able to handle them. With this patch we do not try to model error block exit conditions. Anything that is only reachable via error blocks is ignored too, as it will not be executed in the optimized version of the SCoP anyway. llvm-svn: 249099	2015-10-01 23:48:18 +00:00
Johannes Doerfert	f80f3b0449	Allow user defined error functions The user can provide function names with -polly-error-functions=name1,name2,name3 that will be treated as error functions. Any call to them is assumed not to be executed. This feature is mainly for developers to play around with the new "error block" feature. llvm-svn: 249098	2015-10-01 23:45:51 +00:00
Johannes Doerfert	850d346302	[FIX] Parallel codegen for invariant loads Hand down all preloaded values to the parallel subfunction. llvm-svn: 249010	2015-10-01 13:40:36 +00:00
Tobias Grosser	aff56c8a78	Reapply "BlockGenerator: Generate synthesisable instructions only on-demand" Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instructions and works more reliably than first inserting them and then deleting them later on. This commit was reverted in r248860 due to a remaining miscompile, where we forgot to synthesis the operand values that were referenced from scalar writes. test/Isl/CodeGen/scalar-store-from-same-bb.ll tests that we do this now correctly. llvm-svn: 248900	2015-09-30 13:36:54 +00:00
Johannes Doerfert	ef19ead20e	[FIX] Use escape logic for invariant loads Before we unconditinoally forced all users outside the SCoP to use the preloaded value. However, if the SCoP is not executed due to the runtime checks, we need to use the original value because it might not be invariant in the first place. llvm-svn: 248881	2015-09-30 09:43:20 +00:00
Johannes Doerfert	c1db67e218	Identify and hoist definitively invariant loads As a first step in the direction of assumed invariant loads (loads that are not written in some context) we now detect and hoist definitively invariant loads. These invariant loads will be preloaded in the code generation and used in the optimized version of the SCoP. If the load is only conditionally executed the preloaded version will also only be executed under the same condition, hence we will never access memory that wouldn't have been accessed otherwise. This is also the most distinguishing feature to licm. As hoisting can make statements empty we will simplify the SCoP and remove empty statements that would otherwise cause artifacts in the code generation. Differential Revision: http://reviews.llvm.org/D13194 llvm-svn: 248861	2015-09-29 23:47:21 +00:00
Johannes Doerfert	f6343d74ef	Revert "BlockGenerator: Generate synthesisable instructions only on-demand" This reverts commit 07830c18d789ee72812d5b5b9b4f8ce72ebd4207. The commit broke at least one test in lnt, MultiSource/Benchmarks/Ptrdist/bc/number.c was miss compiled and the test produced a wrong result. One Polly test case that was added later was adjusted too. llvm-svn: 248860	2015-09-29 23:43:40 +00:00
Tobias Grosser	98b3ee50ff	Codegen: Support memory accesses with different types Every once in a while we see code that accesses memory with different types, e.g. to perform operations on a piece of memory using type 'float', but to copy data to this memory using type 'int'. Modeled in C, such codes look like: void foo(float A[], float B[]) { for (long i = 0; i < 100; i++) (int )(&A[i]) = (int )(&B[i]); for (long i = 0; i < 100; i++) A[i] += 10; } We already used the correct types during normal operations, but fall back to our detected type as soon as we import changed memory access functions. For these memory accesses we may generate invalid IR due to a mismatch between the element type of the array we detect and the actual type used in the memory access. To address this issue, we always cast the newly created address of a memory access back to the type of the memory access where the address will be used. llvm-svn: 248781	2015-09-29 06:44:38 +00:00
Tobias Grosser	95e59aaa54	OpenMP: Name addresses in subfunction structure While debugging, this makes it easier to understand due to which memory reference these stores have been introduced. llvm-svn: 248717	2015-09-28 16:46:38 +00:00
Tobias Grosser	28b9a14b07	BlockGenerator: Generate synthesisable instructions only on-demand Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instruction and works more reliably than first inserting them and then deleting them later on. Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> Differential Revision: http://reviews.llvm.org/D13208 llvm-svn: 248712	2015-09-28 13:47:50 +00:00
Johannes Doerfert	9a132f36c3	Allow switch instructions in SCoPs This patch allows switch instructions with affine conditions in the SCoP. Also switch instructions in non-affine subregions are allowed. Both did not require much changes to the code, though there was some refactoring needed to integrate them without code duplication. In the llvm-test suite the number of profitable SCoPs increased from 135 to 139 but more importantly we can handle more benchmarks and user inputs without preprocessing. Differential Revision: http://reviews.llvm.org/D13200 llvm-svn: 248701	2015-09-28 09:33:22 +00:00
Tobias Grosser	f223cdf17e	[tests] Add memory writes to make this scop not trivially empty llvm-svn: 248697	2015-09-28 07:37:06 +00:00
Johannes Doerfert	f32f5f2305	Remove obsolete check This check was needed at some point but seems not useful anymore. Only one adjustment in the domain generation was needed to cope with the cases this check prevented from happening before. llvm-svn: 248695	2015-09-28 01:30:37 +00:00
Tobias Grosser	0722a1e5d5	BlockGenerator: Be less agressive with deleting dead instructions We now only delete trivially dead instructions in the BB we copy (copyBB), but not in any other BB. Only for copyBB we know that there will _never_ be any future uses of instructions that have no use after copyBB has been generated. Other instructions in the AST that have been generated by IslNodeBuilder may look dead at the moment, but may possibly still be referenced by GlobalMaps. If we delete them now, later uses would break surprisingly. We do not have a test case that breaks due to us deleting too many instructions. This issue was found by inspection. llvm-svn: 248688	2015-09-27 19:50:16 +00:00
Tobias Grosser	0ff79e586d	BlockGenerator: Simplify code generated for region statements After having generated a new user statement a couple of inefficient or trivially dead instructions may remain. This commit runs instruction simplification over the newly generated blocks to ensure unneeded instructions are removed right away. This commit does adds simplification for non-affine subregions which was not yet part of 248681. llvm-svn: 248683	2015-09-27 11:35:00 +00:00
Tobias Grosser	412f9774f8	[CodeGen test] Replace undef values with some defined constants Otherwise, part of the computation will be just simplified away when we add instruction simplification support to the RegionGenerator. llvm-svn: 248682	2015-09-27 11:34:53 +00:00
Tobias Grosser	1b9d25a42d	BlockGenerator: Simplify code generated for scop statements After having generated a new user statement a couple of inefficient or trivially dead instructions may remain. This commit runs instruction simplification over the newly generated blocks to ensure unneeded instructions are removed right away. This commit does not yet add simplification for non-affine subregions. llvm-svn: 248681	2015-09-27 11:17:22 +00:00
Johannes Doerfert	fb19dd694c	Create parallel code in a separate block This commit basically reverts r246427 but still solves the issue tackled by that commit. Instead of emitting initialization code in the beginning of the start block we now generate parallel code in its own block and thereby guarantee separation. This is necessary as we cannot generate code for hoisted loads prior to the start block but it still needs to be placed prior to everything else. llvm-svn: 248674	2015-09-26 20:57:59 +00:00
Tobias Grosser	06c495c2b0	Add test case from llvm.org/PR17187 The new domain construction algorithm now correctly models this test case (and derives an empty run-time condition). Add this test case to ensure we do not regress. llvm-svn: 248669	2015-09-26 14:27:54 +00:00
Johannes Doerfert	12155a9ef4	Add test case from open bug The bug (15771) was fixed already with the new domain generation but the test case was not added till now. llvm-svn: 248668	2015-09-26 14:03:29 +00:00
Johannes Doerfert	c6987c18de	[FIX] Use the surrounding loop for non-affine SCoP regions When the whole SCoP is a non-affine region we need to use the surrounding loop in the construction of the schedule as that is the one that will be looked up after the schedule generation. This fixes bug 24947 llvm-svn: 248667	2015-09-26 13:41:43 +00:00
Tobias Grosser	bbda083c75	Add test case for delinearization through bitcasts This was forgotten in r247928 llvm-svn: 248663	2015-09-26 08:55:59 +00:00
Tobias Grosser	99c70dd8d1	Ensure memory accesses to the same array have identical dimensionality When recovering multi-dimensional memory accesses, it may happen that different accesses to the same base array are recovered with different dimensionality. This patch ensures that the dimensionalities are unified by adding zero valued dimensions to acesses with lower dimensionality. When starting to model fixed-size arrays as multi-dimensional in 247906, this has not been taken care of. llvm-svn: 248662	2015-09-26 08:55:54 +00:00
Tobias Grosser	8016f3a4f5	Add missing PHI to test case llvm-svn: 248563	2015-09-25 05:41:30 +00:00
Tobias Grosser	da95a4a7c7	Handle read-only scalars used in PHI-nodes correctly This change addresses three issues: - Read only scalars that enter a PHI node through an edge that comes from outside the scop are not modeled any more, as such PHI nodes will always be initialized to this initial value right before the SCoP is entered. - For PHI nodes that depend on a scalar value that is defined outside the scop, but where the scalar values is passed through an edge that itself comes from a BB that is part of the region, we introduce in this basic block a read of the out-of-scop value to ensure it's value is available to write it into the PHI alloc location. - Read only uses of scalars by PHI nodes are ignored in the general read only handling code, as they are taken care of by the general PHI node modeling code. llvm-svn: 248535	2015-09-24 20:59:59 +00:00
Michael Kruse	2d0ece960f	Remove Analysis Output of TempScopInfo After the merge of TempScopInfo into ScopInfo the analysis output remained because of the existing unit tests. These remains are removed and the units tests converted to match the equivalent output of ScopInfo's analysis output. The unit tests are also moved into the directory of ScopInfo tests. Differential Revision: http://reviews.llvm.org/D13116 llvm-svn: 248485	2015-09-24 11:41:21 +00:00
Tobias Grosser	b1c39429d9	Do not model delinearized and linearized access relation for a single access A missing return statement that previously did not have a visibly negative effect caused after some data-structure changes in r248024 multi-dimensional accesses to be modeled both multi-dimensional as well as linearized. This commit adds the missing return to avoid the incorrect double modeling as well as the compile time increases it caused. llvm-svn: 248171	2015-09-21 16:19:25 +00:00
Johannes Doerfert	6a72a2af13	Use <nsw> AddRecs in the affinator to avoid bounded assumptions If we encounter a <nsw> tagged AddRec for a loop we know the trip count of that loop has to be bounded or the semantics is undefined anyway. Hence, we only need to add unbounded assumptions if no such AddRec is known. llvm-svn: 248128	2015-09-20 16:59:23 +00:00
Johannes Doerfert	707a406078	Add bounded loop assumption So far we ignored the unbounded parts of the iteration domain, however we need to assume they do not occure at all to remain sound if they do. llvm-svn: 248126	2015-09-20 16:38:19 +00:00
Johannes Doerfert	f2cc86edae	Simplify domain generation We now add loop carried information during the second traversal of the region instead of in a intermediate step in-between. This makes the generation simpler, removes code and should even be faster. llvm-svn: 248125	2015-09-20 16:15:32 +00:00
Johannes Doerfert	0c1123a831	[FIX] Repair test case that was unprofitable llvm-svn: 248124	2015-09-20 16:14:41 +00:00
Johannes Doerfert	06c57b594c	Allow loops with multiple back edges In order to allow multiple back edges we: - compute the conditions under which each back edge is taken - build the union over all these conditions, thus the condition that any back edge is taken - apply the same logic to the union we applied to a single back edge llvm-svn: 248120	2015-09-20 15:00:20 +00:00
Johannes Doerfert	7175bdfbe4	Add loop trip count based heuristic for SCoP detection As we currently do not perform any optimizations that targets (or is even aware) small trip counts we will skip them when we count the loops in a region. llvm-svn: 248119	2015-09-20 14:56:54 +00:00
Michael Kruse	e2bccbbfb2	Merge IRAccess into MemoryAccess All MemoryAccess objects will be owned by ScopInfo::AccFuncMap which previously stored the IRAccess objects. Instead of creating new MemoryAccess objects, the already created ones are reused, but their order might be different now. Some fields of IRAccess and MemoryAccess had the same meaning and are merged. This is the last step of fusioning TempScopInfo.{h\|cpp} and ScopInfo.{h.cpp}. Some refactoring might still make sense. Differential Revision: http://reviews.llvm.org/D12843 llvm-svn: 248024	2015-09-18 19:59:43 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00

1 2 3 4 5 ...

573 Commits