llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	2fd89da90d	Remove non-debug printing of domain set Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D15094 llvm-svn: 254343	2015-11-30 22:59:41 +00:00
Tobias Grosser	3a6ac9f9b5	ScopInfo: Further simplify code Acc==MA implies Acc->getAccessInstruction() == MA->getAccessInstruction(). Suggested as post-commit review for 254305 by Michael Kruse. llvm-svn: 254327	2015-11-30 21:13:43 +00:00
Tobias Grosser	ef9ca5db16	ScopInfo: Replace while/iterator construct with std::remove_if The use of C++'s high-level iterator functionality instead of two while loops and explicit iterator handling improves readability of this code. Proposed-by: Michael Kruse <llvm@meinersbur.de> Differential Revision: http://reviews.llvm.org/D15068 llvm-svn: 254305	2015-11-30 17:20:40 +00:00
Michael Kruse	4c86a1d97b	IR cleanup after CodeGeneration Re-run canonicalization passes after Polly's code generation. The set of passes currently added here are nearly all the passes between --polly-position=early and --polly-position=before-vectorizer, i.e. all passes that would usually run after Polly. In order to run these only if Polly actually modified the code, we add a function attribute "polly-optimzed" to a function that contains generated code. The cleanup pass is skipped if the function does not have this attribute. There is no support by the (legacy) PassManager to run passes only under some conditions. One could have wrapped all transformation passes to run only when CodeGeneration changed the code, but the analyses would run anyway. This patch creates an independent pass manager. The disadvantages are that all analyses have to re-run even if preserved and it does not honor compiler switches like the PassManagerBuilder does. Differential Revision: http://reviews.llvm.org/D14333 llvm-svn: 254150	2015-11-26 12:36:25 +00:00
Michael Kruse	cba170e4d0	Introduce origin/kind for exit PHI node accesses Previously, accesses that originate from PHI nodes in the exit block were registered as SCALAR. In some context they are treated as scalars, but it makes a difference in others. We used to check whether the AccessInstruction is a terminator to differentiate the cases. This patch introduces an MemoryAccess origin EXIT_PHI and a ScopArrayInfo kind KIND_EXIT_PHI to make this case more explicit. No behavioural change intended. Differential Revision: http://reviews.llvm.org/D14688 llvm-svn: 254149	2015-11-26 12:26:06 +00:00
Tobias Grosser	2f8e43d677	ScopInfo: Add support for delinearizing fortran arrays gfortran (and fortran in general?) does not compute the address of an array element directly from the array sizes (e.g., %s0, %s1), but takes first the maximum of the sizes and 0 (e.g., max(0, %s0)) before multiplying the resulting value with the per-dimension array subscript expressions. To successfully delinearize index expressions as we see them in fortran, we first filter 'smax' expressions out of the SCEV expression, use them to guess array size parameters and only then continue with the existing delinearization. llvm-svn: 253995	2015-11-24 17:06:38 +00:00
Tobias Grosser	4927c8e606	ScopInfo: Add option to ignore integer wrapping llvm-svn: 253976	2015-11-24 12:50:02 +00:00
Tobias Grosser	d68ba42556	ScopInfo: Split hasAffineMemoryAccesses() into multiple functions [NFC] This makes the overall code more readable. llvm-svn: 253951	2015-11-24 05:00:36 +00:00
Tobias Grosser	5ef2bc316d	Use SmallVector instead of std::vector This was proposed as post-commit review comment for commit r253818. Suggested by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 253862	2015-11-23 10:18:23 +00:00
Tobias Grosser	9737c7b431	ScopInfo: Remove domains of error blocks (and blocks they dominate) early on Trying to build up access functions for any of these blocks is likely to fail, as error blocks may contain invalid/non-representable instructions, and blocks dominated by error blocks may reference such instructions, which wil also cause failures. As all of these blocks are anyhow assumed to not be executed, we can just remove them early on. This fixes http://llvm.org/PR25596 llvm-svn: 253818	2015-11-22 11:06:51 +00:00
Tobias Grosser	fb3fb0a309	isl: Update to isl-0.15-136-g4d5654a The most interesting change for Polly in this isl update is 4d5654af which in certain cases can speed up the construction of run-time checks from an isl set consisting of several disjuncts significantly. llvm-svn: 253794	2015-11-21 20:48:39 +00:00
Johannes Doerfert	a2a74f09fc	Do not enforce lcssa At some point we enforced lcssa for the loop surrounding the entry block. This is not only questionable as it does not check any other loop but also not needed any more. llvm-svn: 253789	2015-11-21 17:00:02 +00:00
Johannes Doerfert	dec27df588	[FIX] Get the correct loop that surrounds a region llvm-svn: 253788	2015-11-21 16:56:13 +00:00
Tobias Grosser	b39c96aa19	ScopInfo: Ensure unique names for parameter names coming from load instructions In case the original parameter instruction does not have a name, but it comes from a load instruction where the base pointer has a name we used the name of the load instruction to give some more intuition of where the parameter came from. To ensure this works also through GEPs which may have complex offsets, we originally just dropped the offsets and _only_ used the base pointer name. As this can result in multiple parameters to get the same name, we now prefix the parameter ID to ensure parameter names are unique. This will make it easier to understand debug output. This change does not affect correctness, as parameter IDs (even of the same name) can always be distinguished through the SCEV pointer stored inside them. llvm-svn: 253330	2015-11-17 11:54:51 +00:00
Tobias Grosser	f920fb19f1	ScopInfo: Reset compute operations before starting compute out region Without this change we may start to refuse scops in larger compilation units just because a lot of code has already been compiled earlier. Found by inspection. I do not yet have a good test case for this. llvm-svn: 253050	2015-11-13 16:56:13 +00:00
Johannes Doerfert	a4b77c079b	[FIX] Bail if access function is not divisible by element size. llvm-svn: 252942	2015-11-12 20:15:32 +00:00
Johannes Doerfert	55b3d8b831	Consistenly use getTypeAllocSize for size estimation. Only when we check for wrapping we want to use the store size, for all other cases we use the alloc size now. Suggested by: Tobias Grosser <tobias@grosser.es> llvm-svn: 252941	2015-11-12 20:15:08 +00:00
Tobias Grosser	2ac2338a03	ScopInfo: Add function to retrieve all memory accesses in a scop llvm-svn: 252894	2015-11-12 14:07:13 +00:00
Tobias Grosser	e5a3514e4a	ScopInfo: Use lambda functions to reduce code duplication. llvm-svn: 252893	2015-11-12 14:07:09 +00:00
Tobias Grosser	bc29e0b27c	RegionGenerator: Only introduce subregion.ivs for loops fully within a subregion IVs of loops for which the loop header is in the subregion, but not the entire loop may be incremented outside of the subregion and can consequently not be kept private to the subregion. Instead, they need to and are modeled as virtual loops in the iteration domains. As this is the case, generating new subregion induction variables for such loops is not needed and indeed wrong as they would hide the virtual induction variables modeled in the scop. This fixes a miscompile in MultiSource/Benchmarks/Ptrdist/bc and MultiSource/Benchmarks/nbench/. Thanks Michael and Johannes for their investiagations and helpful observations regarding this bug. llvm-svn: 252860	2015-11-12 07:34:09 +00:00
Johannes Doerfert	2af10e2eed	Use parameter constraints provided via llvm.assume If an llvm.assume dominates the SCoP entry block and the assumed condition can be expressed as an affine inequality we will now add it to the context. Differential Revision: http://reviews.llvm.org/D14413 llvm-svn: 252851	2015-11-12 03:25:01 +00:00
Johannes Doerfert	d84493e52e	Emit remarks for taken assumptions Differential Revision: http://reviews.llvm.org/D14412 llvm-svn: 252848	2015-11-12 02:33:38 +00:00
Johannes Doerfert	0cf4e0aa42	Emit remark about aliasing pointers llvm-svn: 252847	2015-11-12 02:32:51 +00:00
Johannes Doerfert	48fe86f1ff	Emit SCoP source location as remark during ScopInfo This removes a similar feature from ScopDetection, though with -polly-report that feature present twice anyway. llvm-svn: 252846	2015-11-12 02:32:32 +00:00
Johannes Doerfert	fdbf201fc9	[FIX] Do not generate code for parameters referencing dead values Check if a value that is referenced by a parameter is dead and do not generate code for the parameter in such a case. llvm-svn: 252813	2015-11-11 22:40:51 +00:00
Tobias Grosser	e19fca4525	ScopInfo: Bailing out means assigning isl_set_empty to the AssumedContext I got this the other way around in 252750. Thank you Johannes for noticing. llvm-svn: 252795	2015-11-11 20:21:39 +00:00
Tobias Grosser	910cf26811	ScopInfo: Do not try to model the memory accesses in an error block Error blocks may contain arbitrary instructions, among them some which we can not modeled correctly. As we do not generate ScopStmts for error blocks anyhow there is no point in trying to generate access functions for them. This fixes llvm.org/PR25494 llvm-svn: 252794	2015-11-11 20:15:49 +00:00
Johannes Doerfert	9d7899ef9c	[NFC] Simplify code llvm-svn: 252792	2015-11-11 20:01:31 +00:00
Tobias Grosser	316b5b2507	ScopInfo: Remove unused return value [NFC] llvm-svn: 252780	2015-11-11 19:28:14 +00:00
Tobias Grosser	a52b4da039	ScopInfo: Free set after compute out has been triggered In certain cases isl will not free the return values of operations for which a computeout has been triggered. Hence, make sure we free it explicitly. No test, as I did not manage to reduce one yet. llvm-svn: 252766	2015-11-11 17:59:53 +00:00
Tobias Grosser	4cd07b1188	ScopInfo: Bound compute time spent in boundary context construction For complex inputs our current approach of construction the boundary context may in rare cases become computationally so expensive that it is better to abort. This change adds a compute out check that bounds the compuations we spend on boundary context construction and bails out if this limit is reached. We can probably make our boundary construction algorithm more efficient, but this requires some more investigation and probably also some additional changes to isl. Until these have been added, we bound the compile time to ensure our buildbots are green. llvm-svn: 252758	2015-11-11 17:34:02 +00:00
Tobias Grosser	20a4c0c205	ScopInfo: Limit the number of disjuncts in assumed context In certain rare cases (mostly -polly-process-unprofitable on large sequences of conditions - often without any loop), we see some compile-time timeouts due to the construction of an overly complex assumption context. This change limits the number of disjuncts to 150 (adjustable), to prevent us from creating assumptions contexts that are too large for even the compilation to finish. The limit has been choosen as large as possible to make sure we do not unnecessarily drop test coverage. If such cases also appear in -polly-process-unprofitable=false mode we may need to think about this again, as the current limitations may still allow assumptions that are way to complex to be checked profitably at run-time. There is also certainly room for improvement regarding how (and how efficient) we construct an assumed context, but this requires some more thinking. This completes llvm.org/PR25458 llvm-svn: 252750	2015-11-11 16:22:36 +00:00
Tobias Grosser	e3d1f1c0b0	ScopDetection: Tighten the check for always executed 'error blocks' Basic blocks that are always executed can not be error blocks as their execution can not possibly be an unlikely event. In this commit we tighten the check if an error block to basic blcoks that do not dominate the exit condition, but that dominate all exiting blocks of the scop. llvm-svn: 252726	2015-11-11 13:25:13 +00:00
Tobias Grosser	b12b006c4b	ScopDetection: Do not allow blocks to reference operands in error blocks r252713 introduced a couple of regressions due to later basic blocks refering to instructions defined in error blocks which have not yet been modeled. This commit is currently just encoding limitations of our modeling and code generation backends to ensure correctness. In theory, we should be able to generate and optimize such regions, as everything that is dominated by an error region is assumed to not be executed anyhow. We currently just lack the code to make this happen in practice. llvm-svn: 252725	2015-11-11 12:44:18 +00:00
Tobias Grosser	b76cd3cc56	ScopInfo: Pass domain constraints through error blocks Previously, we just skipped error blocks during scop construction. With this change we make sure we can construct domains for error blocks such that these domains can be forwarded to subsequent basic blocks. This change ensures that basic blocks that post-dominate and are dominated by a basic block that branches to an error condition have the very same iteration domain as the branching basic block. Before, this change we would construct a domain that excludes all error conditions. Such domains could become _very_ complex and were undesirable to build. Another solution would have been to drop these constraints using a dominance/post-dominance check instead of modeling the error blocks. Such a solution could also work in case of unreachable statements or infinite loops in the scop. However, as we currently (to my believe incorrectly) model unreachable basic blocks in the post-dominance tree, such a solution is not yet feasible and requires first a change to LLVM's post-dominance tree construction. This commit addresses the most sever compile time issue reported in: http://llvm.org/PR25458 llvm-svn: 252713	2015-11-11 08:42:20 +00:00
Johannes Doerfert	dcfedf3505	[FIX] Cast pre-loaded values correctly or reload them with adjusted type. Especially for structs, the SAI object of a base pointer does not describe all the types that the user might expect when he loads from that base pointer. While we will still cast integers and pointers we will now reload the value with the correct type if floating point and non-floating point values are involved. However, there are now TODOs where we use bitcasts instead of a proper conversion or reloading. This fixes bug 25479. llvm-svn: 252706	2015-11-11 06:20:25 +00:00
Johannes Doerfert	fc4bfc465a	[FIX] Create empty invariant equivalence classes We now create all invariant equivalence classes for required invariant loads instead of creating them on-demand. This way we can check if a parameter references an invariant load that is actually not executed and was therefor not materialized. If that happens the parameter is not materialized either. This fixes bug 25469. llvm-svn: 252701	2015-11-11 04:30:07 +00:00
Tobias Grosser	6abc75af4c	ScopInfo: Introduce ArrayKind Since 252422 we do not only distinguish two ScopArrayInfo kinds, PHI nodes and others, but work with three kind of ScopArrayInfo objects. SCALAR, PHI and ARRAY objects. Instead of keeping two boolean flags isPHI and isScalar and wonder what an ScopArrayInfo object of kind (!isScalar && isPHI) is, we list now explicitly the three different possible types of memory objects. This change also allows us to remove the confusing nested pairs that have been used in ArrayInfoMapTy. llvm-svn: 252620	2015-11-10 17:31:31 +00:00
Tobias Grosser	b43cc62fe1	stringFromIslObj: Do not crash when printing 'null' objects No test case, as this code path is currently only used for debugging. llvm-svn: 252609	2015-11-10 15:09:44 +00:00
Tobias Grosser	262538435f	ScopInfo: Make getDimensionSize better reflect which dimensions carry sizes In polly the first dimensions of an array as well as all scalars do not carry any size information. This commit makes this explicit in the interface of getDimensionSize. Before this commit getDimensionSize(0) returned the size of the first dimension that carried a size. After this commit getDimensionSize(i) will either return the size of dimension 'i' or assert in case 'i' does not carry a size or does not exist at all. This very same behaviour was already present in getDimensionSizePw(). This commit also adds assertions that ensure getDimensionSizePw() is called appropriately. llvm-svn: 252607	2015-11-10 14:24:21 +00:00
Tobias Grosser	4ea2e07a60	ScopInfo: Make printing of ScopArrayInfo more similar to declarations in C Memory references are now printed as follows: Old New Scalars: i64 MemRef_val[] i64 MemRef_val; Arrays: i64 MemRef_A[][%m][%o][8] i64 MemRef_A[*][%m][%o]; We do not print any more information about the element size in the type. Such information has already been available in a comment after the scalar/array declaration. It was redundant and did not match well with what people were used from C. llvm-svn: 252602	2015-11-10 14:02:54 +00:00
Michael Kruse	c993739e0d	Fix non-affine generated entering node not being recognized as dominating Scalar reloads in the generated entering block were not recognized as dominating the subregions locks when there were multiple entering nodes. This resulted in values defined in there not being copied. As a fix, we unconditionally add the BBMap of the generated entering node to the generated entry. This fixes part of llvm.org/PR25439. This reverts 252449 and reapplies r252445. Its test was failing indeterministically due to r252375 which was reverted in r252522. llvm-svn: 252540	2015-11-09 23:33:40 +00:00
Michael Kruse	d6fb6f1b0c	Fix dominance when subregion exit is outside scop The dominance of the generated non-affine subregion block was based on the scop's merge block, therefore resulted in an invalid DominanceTree. It resulted in some values as assumed to be unusable in the actual generated exit block. We detect the case that the exit block has been moved and decide dominance using the BB at the original exit. If we create another exit node, that exit nodes is dominated by the one generated from where the original exit resides. This fixes llvm.org/PR25438 and part of llvm.org/PR25439. llvm-svn: 252526	2015-11-09 23:07:38 +00:00
Michael Kruse	ebffcbeefa	Revert r252375 "Fix non-affine region dominance of implicitely stored values" It introduced indeterminism as it was iterating over an address-indexed hashtable. The corresponding bug PR25438 will be fixed in a successive commit. llvm-svn: 252522	2015-11-09 22:37:29 +00:00
Johannes Doerfert	7a6e292d86	[FIX] Use same alloca for invariant loads and the scalar users llvm-svn: 252451	2015-11-09 06:28:45 +00:00
Johannes Doerfert	544b23a1ef	Revert "Fix non-affine generated entering node not being recognized as dominating" This reverts commit 9775824b265e574fc541e975d64d3e270243b59d due to a failing unit test. Please check and correct the unit test and commit again. llvm-svn: 252449	2015-11-09 06:04:05 +00:00
Michael Kruse	fd9c89e84b	Fix non-affine generated entering node not being recognized as dominating Scalar reloads in the generated entering block were not recognized as dominating the subregions locks when there were multiple entering nodes. This resulted in values defined in there not being copied. As a fix, we unconditionally add the BBMap of the generated entering node to the generated entry. This fixes part of llvm.org/PR25439. llvm-svn: 252445	2015-11-09 05:00:30 +00:00
Johannes Doerfert	188542fda9	[FIX] Initialize incoming scalar memory locations for PHIs llvm-svn: 252437	2015-11-09 00:21:21 +00:00
Johannes Doerfert	80c716df2b	[NFC] Remove unused variable. llvm-svn: 252436	2015-11-09 00:21:04 +00:00
Johannes Doerfert	f85ad0411f	[FIX] Carefully simplify assumptions in the presence of error blocks If a SCoP contains error blocks we cannot use the domain constraints to simplify the assumptions as the domain is already influenced by the assumptions we took. Before this patch we did that and some assumptions became self-fulfilling as they were implied by the domain constraints. llvm-svn: 252424	2015-11-08 20:16:39 +00:00
Johannes Doerfert	a768624f14	[FIX] Introduce different SAI objects for scalar and memory accesses Even if a scalar and memory access have the same base pointer, we cannot use one SAI object as the type but also the number of dimensions are wrong. For the attached test case this caused a crash in the invariant load hoisting, though it could cause various other problems too. This fixes bug 25428 and a execution time bug in MallocBench/cfrac. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 252422	2015-11-08 19:12:05 +00:00
Johannes Doerfert	3797707695	[FIX] Use unreachable to indicate dead code and repair dominance When we bail out early we make the partially build new code path practically dead, though it was not unreachable. To remove dominance problems we now make it not only dead but also prevent the control flow to join with the original code path, thus allow to use original values after the SCoP without any PHI nodes. This fixes bug 25447. llvm-svn: 252420	2015-11-08 17:57:41 +00:00
Johannes Doerfert	1dd6e37af4	Verify IR after we bail out The bail out in r252412 left the code generation without verifying the (so far) generated IR. This will change now and ensure we always run the verifier. Suggested-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 252419	2015-11-08 16:34:17 +00:00
Johannes Doerfert	c4898504ea	[FIX] Bail out if there is a dependence cycle between invariant loads While the program cannot cause a dependence cycle between invariant loads, additional constraints (e.g., to ensure finite loops) can introduce them. It is hard to detect them in the SCoP description, thus we will only check for them at code generation time. If such a recursion is detected we will bail out the code generation and place a "false" runtime check to guarantee the original code is used. This fixes bug 25443. llvm-svn: 252412	2015-11-07 19:46:04 +00:00
Johannes Doerfert	44483c5599	[FIX] Remove all invariant load occurences from own execution context llvm-svn: 252411	2015-11-07 19:45:27 +00:00
Michael Kruse	0651480b97	Fix non-affine region dominance of implicitely stored values After loop versioning, a dominance check of a non-affine subregion's exit node causes the dominance check to always fail on any block in the subregion if it shares the same exit block with the scop. The subregion's exit block has become polly_merge_new_and_old, which also receives the control flow of the generated code. This would cause that any value for implicit stores is assumed to be not from the scop. We check dominance with the generated exit node instead. This fixes llvm.org/PR25438 llvm-svn: 252375	2015-11-07 00:36:50 +00:00
Duncan P. N. Exon Smith	b8f58b53dd	polly/ADT: Remove implicit ilist iterator conversions, NFC Remove all the implicit ilist iterator conversions from polly, in preparation for making them illegal in ADT. There was one oddity I came across: at line 95 of lib/CodeGen/LoopGenerators.cpp, there was a post-increment `Builder.GetInsertPoint()++`. Since it was a no-op, I removed it, but I admit I wonder if it might be a bug (both before and after this change)? Perhaps it should be a pre-increment? llvm-svn: 252357	2015-11-06 22:56:54 +00:00
Michael Kruse	ddb6528ba6	Fix reuse of non-dominating synthesized value in subregion exit We were adding all generated values in non-affine subregions to be used for the subregions generated exit block. The thought was that only values that are dominating the original exit block can be used there. But it is possible for synthesizable values to be expanded in any block. If the same values is also used for implicit writes, it would try to reuse already synthesized values even if not dominating the exit block. The fix is to only add values to the list of values usable in the exit block only if it is dominating the exit block. This fixes llvm.org/PR25412. llvm-svn: 252301	2015-11-06 13:51:24 +00:00
Tobias Grosser	f1bfd75221	ScopInfo: Allocate globally unique memory access identifiers Before this commit memory reference identifiers have only been unique per basic block, but not per (non-affine) ScopStmt. This commit now uses the MemoryAccess base pointer to uniquely identify each Memory access. llvm-svn: 252200	2015-11-05 20:15:37 +00:00
Tobias Grosser	4bf262a0f2	RunTimeDebugBuilder: Allocate memory _after_ knowing how much is needed This fixes a memory corruption issue, where we accessed more memory than actually allocated. llvm-svn: 252197	2015-11-05 19:43:34 +00:00
Michael Kruse	27149cf32d	Use per-BB value maps for non-exit BBs For generating scalar writes of non-affine subregions, all except phi writes are generated in the exit block. The phi writes are generated in the incoming block for which we errornously used the same BBMap. This can conflict if a value for one block is synthesized, and then reused for another block which is not dominated by the first block. This is fixed by using block-specific BBMaps for phi writes. llvm-svn: 252172	2015-11-05 16:17:17 +00:00
Michael Kruse	f714d470d7	Fix escaping value to subregion entry node phi An incoming value from a block the is not inside the scop is an external use, even if the phi is inside the scop. A previous fix in r251208 did not apply if the phi is inside a non-affine subregion. We move the check for this phi case before the non-affine subregion check. llvm-svn: 252157	2015-11-05 13:18:43 +00:00
Johannes Doerfert	22892687f7	[FIX] Simplify and correct preloading of base pointer origin To simplify and correct the preloading of a base pointer origin, e.g., the base pointer for the current indirect invariant load, we now just check if there is an invariant access class that involves the base pointer of the current class. llvm-svn: 251962	2015-11-03 19:15:33 +00:00
Johannes Doerfert	eca9e890b9	Remove read-only statements from the SCoP We do not need to model read-only statements in the SCoP as they will not cause any side effects that are visible to the outside anyway. Removing them should safe us time and might even simplify the ASTs we generate. Differential Revision: http://reviews.llvm.org/D14272 llvm-svn: 251948	2015-11-03 16:54:49 +00:00
Johannes Doerfert	e071f6d637	[NFC] Name invariant load parameters after base pointer This just makes the debug output nices sometimes. llvm-svn: 251947	2015-11-03 16:49:59 +00:00
Johannes Doerfert	475d8e3f42	[FIX] Ensure base pointer origin was preloaded already If a base pointer of a preloaded value has a base pointer origin, thus it is an indirect invariant load, we have to make sure the base pointer origin is preloaded first. llvm-svn: 251946	2015-11-03 16:49:02 +00:00
Johannes Doerfert	d6fc0701ee	[FIX] Carefully rewrite parameters wrt. invariant equivalence classes ScalarEvolution doesn't allow the operands of an AddRec to be variant in the loop of the AddRec. When we rewrite parameter SCEVs it might seem like the new SCEV violates this property and ScalarEvolution will trigger an assertion. To avoid this we move the start part out of an AddRec when we rewrite it, thus avoid the operands to be possibly variant completely. llvm-svn: 251945	2015-11-03 16:47:58 +00:00
Johannes Doerfert	3181c2ef72	[FIX] Correctly update SAI base pointer If a base pointer load is preloaded, we have change the base pointer of the derived SAI. However, as the derived SAI relationship is is coarse grained, we need to check if we actually preloaded the base pointer or a different element of the base pointer SAI array. llvm-svn: 251881	2015-11-03 01:42:59 +00:00
Johannes Doerfert	dca2837b76	[FIX] Do not crash in the presence of infinite loops. llvm-svn: 251870	2015-11-03 00:28:07 +00:00
Johannes Doerfert	907456fe04	[FIX] Use appropriately sized types for big constants llvm-svn: 251869	2015-11-03 00:26:22 +00:00
Tobias Grosser	8286b83f97	ScopInfo: Bail out in case of mismatching array dimension sizes In some cases different memory accesses access the very same array using a different multi-dimensional array layout where the same dimensions have different sizes. Instead of asserting when encountering this issue, we gracefully bail out for this scop. This fixes llvm.org/PR25252 llvm-svn: 251791	2015-11-02 11:29:32 +00:00
Tobias Grosser	f423c1200f	Remove old and redundant options We remove -polly-detect-unprofitable and -polly-no-early-exit. Both have been superseeded by -polly-process-unprofitable and were only kept as aliases for our buildbots to continue to work. As all buildbots have been moved to the new options, we can now remove the old ones for good. llvm-svn: 251787	2015-11-02 10:13:32 +00:00
Tobias Grosser	3e9560200f	RegionGenerator: Clear local maps after statement construction These maps are only needed during the construction of a single region statement. Clearing them is important, as we otherwise get an assert in case some of the referenced values are erased before the RegionGenerator is deleted. llvm-svn: 251341	2015-10-26 20:41:53 +00:00
Tobias Grosser	bf45e74254	ScopDetect: Bail out for non-simple memory accesses Volatile or atomic memory accesses are currently not supported. Neither did we think about any special handling needed nor do we support the unknown instructions the alias set tracker turns them into sometimes. Before this patch, us not supporting unkown instructions in an alias set caused the following assertion failures: Assertion `AG.size() > 1 && "Alias groups should contain at least two accesses"' failed llvm-svn: 251234	2015-10-25 13:48:40 +00:00
Tobias Grosser	27e19a022e	Fix typo llvm-svn: 251231	2015-10-25 12:05:14 +00:00
Tobias Grosser	907090c37c	ScopDetection: Update DetectionContextMap accordingly When verifying if a scop is still valid we rerun all analysis, but did not update DetectionContextMap. This change ensures that information, e.g. about non-affine regions, is correctly updated llvm-svn: 251227	2015-10-25 10:55:35 +00:00
Tobias Grosser	51174cca30	ScopDetection: Do not crash if we find zero array size candidates for delinearization llvm-svn: 251226	2015-10-25 08:40:44 +00:00
Tobias Grosser	5528dcdf25	ScopDetection: Always refuse multi-dimensional memory accesses with 'undef' in the size expression. We previously only checked if the size expression is 'undef', but allowed size expressions of the form 'undef * undef' by accident. After this change we now require size expressions to be affine which implies no 'undef' appears anywhere in the expression. llvm-svn: 251225	2015-10-25 08:40:38 +00:00
Tobias Grosser	baffa091dd	ScopInfo: PHI-node uses in the EntryNode with an incoming BB that is not part of the Region are external. During code generation we split off the parts of the PHI nodes in the entry block, which have incoming blocks that are not part of the region. As these split-off PHI nodes then are external uses, we consequently also need to model these uses in ScopInfo. llvm-svn: 251208	2015-10-24 20:55:27 +00:00
Tobias Grosser	a3f6edaee1	BlockGenerator: Do not assert when finding model PHI nodes defined outside the scop Such PHI nodes can not only appear in the ExitBlock of the Scop, but indeed any scalar PHI node above the scop and used in the scop is modeled as scalar read access. llvm-svn: 251198	2015-10-24 19:01:09 +00:00
Tobias Grosser	27d742da59	BlockGenerator: Directly handle multi-exit PHI nodes This change adds code to directly code-generate multi-exit PHI nodes, instead of trying to reuse the EscapeMap infrastructure for this. Using escape maps adds a level of indirection that is hard to understand and - more importantly - breaks in certain cases. Specifically, the original code relied on simplifyRegion() to split the original PHI node in two PHI nodes, one merging the values coming from within the scop and a second that merges the first PHI node with the values that come from outside the scop. To generate code the first PHI node is then just handled like any other in-scop value that is used somewhere outside the scop. This fails for the case where all values from inside the scop are identical, as the first PHI node is in such cases automatically simplified and eliminated by LLVM right at construction. As a result, there is no instruction that can be pass to the EscapeMap handling, which means the references in the second PHI node are not updated and may still reference values from within the original scop that do not dominate it. Our new code iterates directly over all modeled ScopArrayInfo objects that represent multi-exit PHI nodes and generates code for them without relying on the EscapeMap infrastructure. Hence, it works also for the case where the first PHI node is eliminated. llvm-svn: 251191	2015-10-24 17:41:29 +00:00
Tobias Grosser	c73d8b0e18	ScopInfo: Drop unnecessary code This case has already been taken care of in r250622 and was then accidentally again committed in 250625. llvm-svn: 251156	2015-10-23 22:36:22 +00:00
Johannes Doerfert	654c3284f4	[FIX] Do not hoist nested variant base pointers This fixes bug 25249. llvm-svn: 250958	2015-10-21 22:14:57 +00:00
Tobias Grosser	ca7f5bb767	Full/partial tile separation for vectorization We isolate full tiles from partial tiles to be able to, for example, vectorize loops with parametric lower and/or upper bounds. If we use -polly-vectorizer=stripmine, we can see execution-time improvements: correlation from 1m7361s to 0m5720s (-67.05 %), covariance from 1m5561s to 0m5680s (-63.50 %), ary3 from 2m3201s to 1m2361s (-46.72 %), CrystalMk from 8m5565s to 7m4285s (-13.18 %). The current full/partial tile separation increases compile-time more than necessary. As a result, we see in compile time regressions, for example, for 3mm from 0m6320s to 0m9881s (56.34%). Some of this compile time increase is expected as we generate more IR and consequently more time is spent in the LLVM backends. However, a first investiagation has shown that a larger portion of compile time is unnecessarily spent inside Polly's parallelism detection and could be eliminated by propagating existing knowledge about vector loop parallelism. Before enabling -polly-vectorizer=stripmine by default, it is necessary to address this compile-time issue. Contributed-by: Roman Gareev <gareevroman@gmail.com> Reviewers: jdoerfert, grosser Subscribers: grosser, #polly Differential Revision: http://reviews.llvm.org/D13779 llvm-svn: 250809	2015-10-20 09:12:21 +00:00
Michael Kruse	dc12222287	Synthesize phi arguments in incoming block New values were always synthesized in the block of the instruction that needed them. This is incorrect for PHI node whose' value must be defined in the respective incoming block. This patch temporarily moves the builder's insert point to the incoming block while synthesizing phi node arguments. This fixes PR25241 (http://llvm.org/bugs/show_bug.cgi?id=25241) llvm-svn: 250693	2015-10-19 09:19:25 +00:00
Johannes Doerfert	9c28bfa72c	[FIX] Only constant integer branch conditions are always affine There are several different kinds of constants that could occur in a branch condition, however we can only handle the most interesting one namely constant integers. To this end we have to treat others as non-affine. This fixes bug 25244. llvm-svn: 250669	2015-10-18 22:56:42 +00:00
Johannes Doerfert	30c2265f98	[FIX] Normalize loops outside the SCoP during schedule generation We build the schedule based on a traversal of the region and accumulate information for each loop in it. The total schedule is associated with the loop surrounding the SCoP, though it can happen that there are blocks in the SCoP which are part of loops that are only partially in the SCoP. Instead of associating information with them (they are not part of the SCoP and consequently are not modeled) we have to associate the schedule information with the surrounding loop if any. This fixes bug 25240. llvm-svn: 250668	2015-10-18 21:17:11 +00:00
Johannes Doerfert	b864c2c3c9	[FIX] Do not try to hoist "empty" accesses Accesses that have a relative offset (in bytes) that is not divisible by the type size (in bytes) will be represented as empty in the SCoP description. This is on its own not good but it also crashed the invariant load hoisting. This patch will fix the latter problem while the former should be addressed too. This fixes bug 25236. llvm-svn: 250664	2015-10-18 19:50:18 +00:00
Johannes Doerfert	bc7cff4c18	[FIX] Do not hoist invariant pointers with non-loaded base ptr in SCoP If the base pointer of a load is invariant and defined in the SCoP but not loaded we cannot hoist the load as we would not hoist the base pointer definition. This fixes bug 25237. llvm-svn: 250663	2015-10-18 19:49:25 +00:00
Johannes Doerfert	af3e301a67	[FIX] Restructure invariant load equivalence classes Sorting is replaced by a demand driven code generation that will pre-load a value when it is needed or, if it was not needed before, at some point determined by the order of invariant accesses in the program. Only in very little cases this demand driven pre-loading will kick in, though it will prevent us from generating faulty code. An example where it is needed is shown in: test/ScopInfo/invariant_loads_complicated_dependences.ll Invariant loads that appear in parameters but are not on the top-level (e.g., the parameter is not a SCEVUnknown) will now be treated correctly. Differential Revision: http://reviews.llvm.org/D13831 llvm-svn: 250655	2015-10-18 12:39:19 +00:00
Johannes Doerfert	d8b6ad255f	[FIX] Cast preloaded values Preloaded values have to match the type of their counterpart in the original code and not the type of the base array. llvm-svn: 250654	2015-10-18 12:36:42 +00:00
Johannes Doerfert	01978cfa0c	Remove independent blocks pass Polly can now be used as a analysis only tool as long as the code generation is disabled. However, we do not have an alternative to the independent blocks pass in place yet, though in the relevant cases this does not seem to impact the performance much. Nevertheless, a virtual alternative that allows the same transformations without changing the input region will follow shortly. llvm-svn: 250652	2015-10-18 12:28:00 +00:00
Tobias Grosser	b8d27aab7d	Revert to original BlockGenerator::getOrCreateAlloca(MemoryAccess &Access) Expressing this in terms of BlockGenerator::getOrCreateAlloca(const ScopArrayInfo *Array) does not work as the MemoryAccess BasePtr is in case of invariant load hoisting different to the ScopArrayInfo BasePtr. Until this is investigated and fixed, we move back to code that just uses the baseptr of MemoryAccess. llvm-svn: 250637	2015-10-18 00:51:13 +00:00
Tobias Grosser	a4f0988df5	BlockGenerator: Add getOrCreateAlloca(const ScopArrayInfo *Array) This allows the caller to get the alloca locations of an array without the need to thank if Array is a PHI or a non-PHI Array. We directly make use of this in BlockGenerator::getOrCreateAlloca(MemoryAccess &Access). llvm-svn: 250628	2015-10-17 22:16:00 +00:00
Tobias Grosser	05d7fa79b6	Format comment properly While clang-format takes care that the line-length is not surpassed, the resulting comments sometimes look not optimal. We re-flow the text in the comment to avoid these ugly single-word lines. llvm-svn: 250626	2015-10-17 21:46:28 +00:00
Michael Kruse	225f0d1ee2	Load/Store scalar accesses before/after the statement itself Instead of generating implicit loads within basic blocks, put them before the instructions of the statment itself, including non-affine subregions. The region's entry node is dominating all blocks in the region and therefore the loaded value will be available there. Implicit writes in block-stmts were already stored back at the end of the block. Now, also generate the stores of non-affine subregions when leaving the statement, i.e. in the exiting block. This change is required for array-mapped implicits ("De-LICM") to ensure that there are no dependencies of demoted scalars within statments. Statement load all required values, operator on copied in registers, and then write back the changed value to the demoted memory. Lifetimes analysis within statements becomes unecessary. Differential Revision: http://reviews.llvm.org/D13487 llvm-svn: 250625	2015-10-17 21:36:00 +00:00
Michael Kruse	01cb379fed	Avoid unnecessay .s2a write access when used only in PHIs Accesses for exit node phis will be handled separately by buildPHIAccesses if there is more than one exiting edge, buildScalarDependences does not need to create additional SCALAR accesses. This is a corrected version of r250517, which was reverted in r250607. Differential Revision: http://reviews.llvm.org/D13848 llvm-svn: 250622	2015-10-17 21:07:08 +00:00
Tobias Grosser	ffa2446f28	BlockGenerator: Register outside users of scalars directly Instead of checking at code generation time for each ScopStmt if a scalar has external uses, we just iterate over the ScopArrayInfo descriptions we have and check each of these for possible external uses. Besides being somehow clearer, this approach has the benefit that we will always create valid LLVM-IR even in case we disable the code generation of ScopStmt bodies e.g. for testing purposes. llvm-svn: 250608	2015-10-17 08:54:13 +00:00
Tobias Grosser	3839b422e6	Revert "Avoid unnecessay .s2a write access when used only in PHIs" This reverts commit r250606 due to some bugs it introduced. After these bugs have been resolved, we will add it back to tree. llvm-svn: 250607	2015-10-17 08:54:05 +00:00
Tobias Grosser	26e59ee746	Drop unused parameter from handleOutsideUsers llvm-svn: 250606	2015-10-17 08:25:54 +00:00
Michael Kruse	aeceab770e	Avoid unnecessay .s2a write access when used only in PHIs PHI accesses will be handled separately by buildPHIAccesses, buildScalarDependences does not need to create additional accesses. llvm-svn: 250517	2015-10-16 15:14:40 +00:00
Michael Kruse	668af71b82	Do not add accesses for intra-ScopStmt scalar def-use chains When pulling a llvm::Value to be written as a PHI write, the former code did only check whether it is within the same basic block, but it could also be the same non-affine subregion. In that case some unecessary pair of MemoryAccesses would have been created. Two unit test were explicitely checking for the unecessary writes, including the comments that the writes are unecessary. llvm-svn: 250411	2015-10-15 14:45:48 +00:00
Tobias Grosser	0e3a6b13a4	Sort includes using 'clang-format -sort-includes' llvm-svn: 250392	2015-10-15 12:17:36 +00:00
Tobias Grosser	f30be2f370	RegisterPasses: Optionally run inliner before Polly This will allow us to optimize C++ template code with Polly. This support is mostly for debugging purpose and individual experiments. The ultimate goal is still to run Polly later in the pass manager when inlining already happened. llvm-svn: 250092	2015-10-12 20:03:44 +00:00
Tobias Grosser	d17183f20f	Use EP_ModuleOptimizerEarly to run early polly passes, instead of llvm::PassManagerBuilder::EP_EarlyAsPossible. This will allow us to run actual module passes in Polly's canonicalization sequence, but should otherwise have only little impact. llvm-svn: 250091	2015-10-12 20:03:41 +00:00
Tobias Grosser	e2c8275346	ScopInfo: Allow simple 'AddRec * Parameter' products in delinearization We also allow such products for cases where 'Parameter' is loaded within the scop, but where we can dynamically verify that the value of 'Parameter' remains unchanged during the execution of the scop. This change relies on Polly's new RequiredILS tracking infrastructure recently contributed by Johannes. llvm-svn: 250019	2015-10-12 08:02:30 +00:00
Tobias Grosser	1c55e2b7f3	Also add -polly-no-early-exit back until LNT is restarted llvm-svn: 249975	2015-10-11 13:49:27 +00:00
Tobias Grosser	3ecb71696b	Add back -polly-detect-unprofitable as alias of -polly-process-unprofitable This flag was still used in our LNT server. We leave it until it has been removed from LNT as well. llvm-svn: 249973	2015-10-11 13:39:17 +00:00
Johannes Doerfert	9b1f9c8b61	Allow eager evaluated binary && and \|\| conditions The domain generation can handle lazy && and \|\| by default but eager evaluated expressions were dismissed as non-affine. With this patch we will allow arbitrary combinations of and/or bit-operations in the conditions of branches. Differential Revision: http://reviews.llvm.org/D13624 llvm-svn: 249971	2015-10-11 13:21:03 +00:00
Johannes Doerfert	f363ed9804	[NFC] Move helper functions to ScopHelper Helper functions in the BlockGenerators.h/cpp introduce dependences from the frontend to the backend of Polly. As they are used in ScopDetection, ScopInfo, etc. we move them to the ScopHelper file. llvm-svn: 249919	2015-10-09 23:40:24 +00:00
David Blaikie	91e113d1dd	Remove some unused variables in -Asserts builds llvm-svn: 249866	2015-10-09 18:22:18 +00:00
Johannes Doerfert	697fdf891c	Consolidate invariant loads If a (assumed) invariant location is loaded multiple times we generated a parameter for each location. However, this caused compile time problems for several benchmarks (e.g., 445_gobmk in SPEC2006 and BT in the NAS benchmarks). Additionally, the code we generate is suboptimal as we preload the same location multiple times and perform the same checks on all the parameters that refere to the same value. With this patch we consolidate the invariant loads in three steps: 1) During SCoP initialization required invariant loads are put in equivalence classes based on their pointer operand. One representing load is used to generate a parameter for the whole class, thus we never generate multiple parameters for the same location. 2) During the SCoP simplification we remove invariant memory accesses that are in the same equivalence class. While doing so we build the union of all execution domains as it is only important that the location is at least accessed once. 3) During code generation we only preload one element of each equivalence class with the unified execution domain. All others are mapped to that preloaded value. Differential Revision: http://reviews.llvm.org/D13338 llvm-svn: 249853	2015-10-09 17:12:26 +00:00
Johannes Doerfert	f7e2967293	[FIX] Add missing projection for invariant load domains This was left out from the original patch proposed in http://reviews.llvm.org/D13195 even though it is needed to define an order invariant loads are hoisted. llvm-svn: 249680	2015-10-08 11:05:57 +00:00
Johannes Doerfert	c7ab83dfb7	Remove unused flag polly-allow-non-scev-backedge-taken-count Drop an unused flag polly-allow-non-scev-backedge-taken-count and also its occurrences from the tests. Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D13400 llvm-svn: 249675	2015-10-08 10:05:48 +00:00
Johannes Doerfert	c06b7d67cd	Expose the detection context to ScopDetection users ScopDetection users are interested in the detection context and access these via different get-methods. However, not all information was exposed though the number of maps to hold it was increasing steadily. With this change only the detection contexts the rejection log and the ValidRegions set are mapped. The former is needed, the second could be integrated in the first and the ValidRegions set is only needed for the deterministic order of the regions. llvm-svn: 249614	2015-10-07 20:46:06 +00:00
Johannes Doerfert	08d90a3cee	Treat conditionally executed non-pure calls as errors This replaces the support for user defined error functions by a heuristic that tries to determine if a call to a non-pure function should be considered "an error". If so the block is assumed not to be executed at runtime. While treating all non-pure function calls as errors will allow a lot more regions to be analyzed, it will also cause us to dismiss a lot again due to an infeasible runtime context. This patch tries to limit that effect. A non-pure function call is considered an error if it is executed only in conditionally with regards to a cheap but simple heuristic. llvm-svn: 249611	2015-10-07 20:32:43 +00:00
Johannes Doerfert	d8dd8630b2	[NFC] Make LoopInfo a member and simplify arguments llvm-svn: 249609	2015-10-07 20:31:36 +00:00
Johannes Doerfert	09e3697f44	Allow invariant loads in the SCoP description This patch allows invariant loads to be used in the SCoP description, e.g., as loop bounds, conditions or in memory access functions. First we collect "required invariant loads" during SCoP detection that would otherwise make an expression we care about non-affine. To this end a new level of abstraction was introduced before SCEVValidator::isAffineExpr() namely ScopDetection::isAffine() and ScopDetection::onlyValidRequiredInvariantLoads(). Here we can decide if we want a load inside the region to be optimistically assumed invariant or not. If we do, it will be marked as required and in the SCoP generation we bail if it is actually not invariant. If we don't it will be a non-affine expression as before. At the moment we optimistically assume all "hoistable" (namely non-loop-carried) loads to be invariant. This causes us to expand some SCoPs and dismiss them later but it also allows us to detect a lot we would dismiss directly if we would ask e.g., AliasAnalysis::canBasicBlockModify(). We also allow potential aliases between optimistically assumed invariant loads and other pointers as our runtime alias checks are sound in case the loads are actually invariant. Together with the invariant checks this combination allows to handle a lot more than LICM can. The code generation of the invariant loads had to be extended as we can now have dependences between parameters and invariant (hoisted) loads as well as the other way around, e.g., test/Isl/CodeGen/invariant_load_parameters_cyclic_dependence.ll First, it is important to note that we cannot have real cycles but only dependences from a hoisted load to a parameter and from another parameter to that hoisted load (and so on). To handle such cases we materialize llvm::Values for parameters that are referred by a hoisted load on demand and then materialize the remaining parameters. Second, there are new kinds of dependences between hoisted loads caused by the constraints on their execution. If a hoisted load is conditionally executed it might depend on the value of another hoisted load. To deal with such situations we sort them already in the ScopInfo such that they can be generated in the order they are listed in the Scop::InvariantAccesses list (see compareInvariantAccesses). The dependences between hoisted loads caused by indirect accesses are handled the same way as before. llvm-svn: 249607	2015-10-07 20:17:36 +00:00
Johannes Doerfert	521dd5842f	Move the ValueMapT declaration out of BlockGenerator Value maps are created and used in many places and it is not always possible to include CodeGen/Blockgenerators.h. To this end, ValueMapT now lives in the ScopHelper.h which does not have any dependences itself. This patch also replaces uses of different other value map types with the ValueMapT. llvm-svn: 249606	2015-10-07 20:15:56 +00:00
Tobias Grosser	6bab9f800b	IRBuilder: Use Map.lookup instead of Map.find [NFC] This simplifies the code. Suggested-by: Johannes Doerfert llvm-svn: 249545	2015-10-07 13:35:20 +00:00
Tobias Grosser	d79cb0378b	IRBuilder: Ensure we do not add empty map elements Do not use "Map[Key] == nullptr" to check if a Key is in the map, but use "Map.find(Key) == Map.end()". Map[Key] always adds Key into the map, a side-effect we do not want. Found by inspection. This is hard to test outside of a targetted unit test, which seems too much overhead for this individual issue. llvm-svn: 249544	2015-10-07 13:19:06 +00:00
Tobias Grosser	73725b8ba5	IRBuilder: Simplify code and reduce indention [NFC] llvm-svn: 249543	2015-10-07 13:19:03 +00:00
David Blaikie	ba2e4857b1	Fix configure checks when applied by the latest clang Clang's been taught to warn on more things here (unused values when calling pure functions and ignoring their result, for example). It might be better to figure out how to have cmake compile these tests without -Werror/without any warnings enabled. But this'll do for now & I don't know enough about cmake to fix it any other way, or to understand the tradeoffs in that space. llvm-svn: 249472	2015-10-06 21:45:06 +00:00
Tobias Grosser	3593739cb2	IslAst: Give some hints why code generation might have been skipped llvm-svn: 249427	2015-10-06 16:10:46 +00:00
Tobias Grosser	575aca8d43	Introduce -polly-process-unprofitable This single option replaces -polly-detect-unprofitable and -polly-no-early-exit and is supposed to be the only option that disables compile-time heuristics that aim to bail out early on scops that are believed to not benefit from Polly optimizations. Suggested-by: Johannes Doerfert llvm-svn: 249426	2015-10-06 16:10:29 +00:00
Johannes Doerfert	f17a78ef63	Remove non-executed statements during SCoP simplifcation A statement with an empty domain complicates the invariant load hoisting and does not help any subsequent analysis or transformation. In fact it might introduce parameter dimensions or increase the schedule dimensionality. To this end, we remove statements with an empty domain early in the SCoP simplification. llvm-svn: 249276	2015-10-04 15:00:05 +00:00
Johannes Doerfert	634909c2c9	[FIX] Domain generation for non-affine loops llvm-svn: 249275	2015-10-04 14:57:41 +00:00
Johannes Doerfert	f61df69423	[FIX] Count affine loops correctly The "unprofitable" heuristic was broken and counted boxed loops even though we do not represent and optimize them. llvm-svn: 249274	2015-10-04 14:56:08 +00:00
Johannes Doerfert	757a32b5b3	[FIX] Approximate non-affine loops correctly Before isValidCFG() could hide the fact that a loop is non-affine by over-approximation. This is problematic if a subregion of the loop contains an exit/latch block and is over-approximated. Now we do not over-approximate in the isValidCFG function if we check loop control. If such control is non-affine the whole loop is over-approximated, not only a subregion. llvm-svn: 249273	2015-10-04 14:54:27 +00:00
Johannes Doerfert	30ffb6fcb6	[FIX] Check loop latches for valid control too. This patch cannot be tested on its own as the isValidCFG currently hides the fact that control is actually non-affine with over-approximation. This will be corrected in the next patch and a test for non-affine latches will be added. llvm-svn: 249272	2015-10-04 14:53:18 +00:00
Johannes Doerfert	8dba07770f	[NFC] Remove unused classes llvm-svn: 249271	2015-10-04 14:52:43 +00:00
Tobias Grosser	d78616f98a	Make ScopAnnotator a function-local variable to ensure it is freed at each run When the ScopAnnotator was a class member variable some of the maps it contains have not been properly cleared. As a result we had dangling pointers to llvm::Value(s) which got detected by the AssertingVH we recently added. No test case as this issue is hard to reproduce reliably as subsequent optimizations need to delete some of the llvm::Values we still keep in our lists. llvm-svn: 249269	2015-10-04 11:19:13 +00:00
Tobias Grosser	2f1acac610	BlockGenerator: Use plain Value * instead of const Value * The use of const qualified Value pointers prevents the use of AssertingVH. We could probably think of adding const support to AssertingVH, but as const correctness seems to currently provide limited benefit in Polly, we do not do this yet. llvm-svn: 249266	2015-10-04 10:18:45 +00:00
Tobias Grosser	9646e3fe4b	BlockGenerators: Use auto to be less sensitive to type changes llvm-svn: 249265	2015-10-04 10:18:39 +00:00
Tobias Grosser	f4bb7a6a4d	Consolidate the different ValueMapTypes we are using There have been various places where llvm::DenseMap<const llvm::Value , llvm::Value > types have been defined, but all types have been expected to be identical. We make this more clear by consolidating the different types and use BlockGenerator::ValueMapT wherever there is a need for types to match BlockGenerator::ValueMapT. llvm-svn: 249264	2015-10-04 10:18:32 +00:00
Tobias Grosser	1d45c6dadd	IslExprBuilder: Use AssertingVH for IdToValueTy llvm-svn: 249239	2015-10-03 17:20:00 +00:00
Tobias Grosser	e9cb5a0983	BlockGenerator: Use AssertingVH in maps By using asserting value handles, we will get assertions when we forget to clear any of the Value maps instead of difficult to debug undefined behavior. llvm-svn: 249237	2015-10-03 17:19:49 +00:00
Michael Kruse	afe0670863	Bail-out early if all statements have been simplified away Treat the scop as invalid instead of creating dummy domains. llvm-svn: 249151	2015-10-02 16:33:27 +00:00
Johannes Doerfert	8930f4846c	[FIX] Do not hoist from inside a non-affine subregion We have to skip accesses in non-affine subregions during hoisting as they might not be executed under the same condition as the entry of the non-affine subregion. llvm-svn: 249139	2015-10-02 14:51:00 +00:00
Michael Kruse	cac948ef46	Earlier creation of ScopStmt objects This moves the construction of ScopStmt to the beginning of the ScopInfo pass. The late creation was a result of the earlier separation of ScopInfo and TempScopInfo. This will avoid introducing more ScopStmt-like maps in future commits. The AccFuncMap will also be removed in some future commit. DomainMap might also be included into ScopStmt. The order in which ScopStmt are created changes and initially creates empty statements that are removed in a simplification. Differential Revision: http://reviews.llvm.org/D13341 llvm-svn: 249132	2015-10-02 13:53:07 +00:00
Johannes Doerfert	911951f4f8	Hand down referenced & globally mapped values to the subfunction If a value is globally mapped (IslNodeBuilder::ValueMap) and referenced in the code that will be put into a subfunction, we hand down the new value to the subfunction. This patch also removes code that handed down all invariant loads to the subfunction. Instead, only needed invariant loads are given to the subfunction. There are two possible reasons for an invariant load to be handed down: 1) The invariant load is used in a block that is placed in the subfunction but which is not the parent of the load. In this case, the scalar access that will read the loaded value, will cause its base pointer (the preloaded value) to be handed down to the subfunction. 2) The invariant load is defined and used in a block that is placed in the subfunction. With this patch we will hand down the preloaded value to the subfunction as the invariant load is globally mapped to that value. llvm-svn: 249126	2015-10-02 13:11:27 +00:00
Johannes Doerfert	478a7de18b	[NFC] Make the ScopDetection analysis a member of the Scop class llvm-svn: 249125	2015-10-02 13:09:31 +00:00
Johannes Doerfert	f56738041e	Make the SCoP generation resistent wrt. error blocks When error blocks are not terminated by an unreachable they have successors that might only be reachable via error blocks. Additionally, branches in error blocks are not checked during SCoP detection, thus we might not be able to handle them. With this patch we do not try to model error block exit conditions. Anything that is only reachable via error blocks is ignored too, as it will not be executed in the optimized version of the SCoP anyway. llvm-svn: 249099	2015-10-01 23:48:18 +00:00
Johannes Doerfert	f80f3b0449	Allow user defined error functions The user can provide function names with -polly-error-functions=name1,name2,name3 that will be treated as error functions. Any call to them is assumed not to be executed. This feature is mainly for developers to play around with the new "error block" feature. llvm-svn: 249098	2015-10-01 23:45:51 +00:00
Johannes Doerfert	850d346302	[FIX] Parallel codegen for invariant loads Hand down all preloaded values to the parallel subfunction. llvm-svn: 249010	2015-10-01 13:40:36 +00:00
Johannes Doerfert	e46925f324	[FIX] Erase stall results during the SCoP detection With this patch we erase cached results for regions that are invalid as early as possible. If we do not (as before), it is possible that two expanded regions will have the same address and the tracked results for both are mixed. Currently this would "only" cause pessimism in later passes but that will change when we allow invariant loads in the SCoP. Additionally, it triggers non-deterministic results as we might dismiss a later region because of results cached for an earlier one. There is no test case as the problem occurs only non-deterministically. llvm-svn: 249000	2015-10-01 10:59:14 +00:00
Johannes Doerfert	59984322c3	[FIX] Handle identity mappings in the ScopExpander If the VMap in the ScopExpander contains identity mappings we now ignore the mapping. Reported-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 248946	2015-09-30 21:12:12 +00:00
Johannes Doerfert	bb9387444c	[FIX] Remove unknown variable llvm-svn: 248926	2015-09-30 17:31:31 +00:00
Johannes Doerfert	c0729a3216	Move remapping functionality in the ScopExpander Because we handle more than SCEV does it is not possible to rewrite an expression on the top-level using the SCEVParameterRewriter only. With this patch we will do the rewriting on demand only and also recursively, thus not only on the top-level. llvm-svn: 248916	2015-09-30 16:52:03 +00:00
Johannes Doerfert	6206d7a67c	[FIX] Clear all maps between runs llvm-svn: 248915	2015-09-30 16:51:05 +00:00
Tobias Grosser	aff56c8a78	Reapply "BlockGenerator: Generate synthesisable instructions only on-demand" Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instructions and works more reliably than first inserting them and then deleting them later on. This commit was reverted in r248860 due to a remaining miscompile, where we forgot to synthesis the operand values that were referenced from scalar writes. test/Isl/CodeGen/scalar-store-from-same-bb.ll tests that we do this now correctly. llvm-svn: 248900	2015-09-30 13:36:54 +00:00
Tobias Grosser	33cb9f9419	BlockGenerator: Extract value synthesis into its own function [NFC] This will allow us to reuse this code in a subsequent commit. llvm-svn: 248893	2015-09-30 11:56:19 +00:00
Johannes Doerfert	ebfd72493c	[NFC] Extract materialization of parameters llvm-svn: 248882	2015-09-30 09:52:08 +00:00
Johannes Doerfert	ef19ead20e	[FIX] Use escape logic for invariant loads Before we unconditinoally forced all users outside the SCoP to use the preloaded value. However, if the SCoP is not executed due to the runtime checks, we need to use the original value because it might not be invariant in the first place. llvm-svn: 248881	2015-09-30 09:43:20 +00:00
Michael Kruse	76e924d31b	Assign scop directly This makes ScopInfo's scop member available earlier to other methods which will make some planned changes simpler. No behavioral change intended llvm-svn: 248879	2015-09-30 09:16:07 +00:00
Johannes Doerfert	c1db67e218	Identify and hoist definitively invariant loads As a first step in the direction of assumed invariant loads (loads that are not written in some context) we now detect and hoist definitively invariant loads. These invariant loads will be preloaded in the code generation and used in the optimized version of the SCoP. If the load is only conditionally executed the preloaded version will also only be executed under the same condition, hence we will never access memory that wouldn't have been accessed otherwise. This is also the most distinguishing feature to licm. As hoisting can make statements empty we will simplify the SCoP and remove empty statements that would otherwise cause artifacts in the code generation. Differential Revision: http://reviews.llvm.org/D13194 llvm-svn: 248861	2015-09-29 23:47:21 +00:00
Johannes Doerfert	f6343d74ef	Revert "BlockGenerator: Generate synthesisable instructions only on-demand" This reverts commit 07830c18d789ee72812d5b5b9b4f8ce72ebd4207. The commit broke at least one test in lnt, MultiSource/Benchmarks/Ptrdist/bc/number.c was miss compiled and the test produced a wrong result. One Polly test case that was added later was adjusted too. llvm-svn: 248860	2015-09-29 23:43:40 +00:00
Tobias Grosser	98b3ee50ff	Codegen: Support memory accesses with different types Every once in a while we see code that accesses memory with different types, e.g. to perform operations on a piece of memory using type 'float', but to copy data to this memory using type 'int'. Modeled in C, such codes look like: void foo(float A[], float B[]) { for (long i = 0; i < 100; i++) (int )(&A[i]) = (int )(&B[i]); for (long i = 0; i < 100; i++) A[i] += 10; } We already used the correct types during normal operations, but fall back to our detected type as soon as we import changed memory access functions. For these memory accesses we may generate invalid IR due to a mismatch between the element type of the array we detect and the actual type used in the memory access. To address this issue, we always cast the newly created address of a memory access back to the type of the memory access where the address will be used. llvm-svn: 248781	2015-09-29 06:44:38 +00:00
Tobias Grosser	95e59aaa54	OpenMP: Name addresses in subfunction structure While debugging, this makes it easier to understand due to which memory reference these stores have been introduced. llvm-svn: 248717	2015-09-28 16:46:38 +00:00
Tobias Grosser	28b9a14b07	BlockGenerator: Generate synthesisable instructions only on-demand Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instruction and works more reliably than first inserting them and then deleting them later on. Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> Differential Revision: http://reviews.llvm.org/D13208 llvm-svn: 248712	2015-09-28 13:47:50 +00:00
Johannes Doerfert	58a7c75c86	[NFC] Add accidentally removed comment line llvm-svn: 248704	2015-09-28 09:48:53 +00:00
Johannes Doerfert	9a132f36c3	Allow switch instructions in SCoPs This patch allows switch instructions with affine conditions in the SCoP. Also switch instructions in non-affine subregions are allowed. Both did not require much changes to the code, though there was some refactoring needed to integrate them without code duplication. In the llvm-test suite the number of profitable SCoPs increased from 135 to 139 but more importantly we can handle more benchmarks and user inputs without preprocessing. Differential Revision: http://reviews.llvm.org/D13200 llvm-svn: 248701	2015-09-28 09:33:22 +00:00
Johannes Doerfert	f32f5f2305	Remove obsolete check This check was needed at some point but seems not useful anymore. Only one adjustment in the domain generation was needed to cope with the cases this check prevented from happening before. llvm-svn: 248695	2015-09-28 01:30:37 +00:00
Johannes Doerfert	91ad092bb2	[NFC] Remove unused SCoP diagnostic llvm-svn: 248694	2015-09-28 01:29:44 +00:00
Tobias Grosser	0722a1e5d5	BlockGenerator: Be less agressive with deleting dead instructions We now only delete trivially dead instructions in the BB we copy (copyBB), but not in any other BB. Only for copyBB we know that there will _never_ be any future uses of instructions that have no use after copyBB has been generated. Other instructions in the AST that have been generated by IslNodeBuilder may look dead at the moment, but may possibly still be referenced by GlobalMaps. If we delete them now, later uses would break surprisingly. We do not have a test case that breaks due to us deleting too many instructions. This issue was found by inspection. llvm-svn: 248688	2015-09-27 19:50:16 +00:00
Tobias Grosser	a43b6e935c	Drop unused variable llvm-svn: 248687	2015-09-27 17:54:50 +00:00
Johannes Doerfert	45be64464b	[NFC] Consistenly use commented and annotated ScopPass functions The changes affect methods that are part of the Pass interface and include: - Comments that describe the methods purpose. - A consistent use of the keywords override and virtual. Additionally, the printScop method is now optional and removed from SCoP passes that do not implement it. llvm-svn: 248685	2015-09-27 15:43:29 +00:00
Johannes Doerfert	0f37630849	[NFC] Use releaseMemory to release internal memory llvm-svn: 248684	2015-09-27 15:42:28 +00:00
Tobias Grosser	0ff79e586d	BlockGenerator: Simplify code generated for region statements After having generated a new user statement a couple of inefficient or trivially dead instructions may remain. This commit runs instruction simplification over the newly generated blocks to ensure unneeded instructions are removed right away. This commit does adds simplification for non-affine subregions which was not yet part of 248681. llvm-svn: 248683	2015-09-27 11:35:00 +00:00
Tobias Grosser	1b9d25a42d	BlockGenerator: Simplify code generated for scop statements After having generated a new user statement a couple of inefficient or trivially dead instructions may remain. This commit runs instruction simplification over the newly generated blocks to ensure unneeded instructions are removed right away. This commit does not yet add simplification for non-affine subregions. llvm-svn: 248681	2015-09-27 11:17:22 +00:00
Johannes Doerfert	fb19dd694c	Create parallel code in a separate block This commit basically reverts r246427 but still solves the issue tackled by that commit. Instead of emitting initialization code in the beginning of the start block we now generate parallel code in its own block and thereby guarantee separation. This is necessary as we cannot generate code for hoisted loads prior to the start block but it still needs to be placed prior to everything else. llvm-svn: 248674	2015-09-26 20:57:59 +00:00
Michael Kruse	caac2b6930	Fix typo llvm-svn: 248670	2015-09-26 15:51:44 +00:00
Johannes Doerfert	c6987c18de	[FIX] Use the surrounding loop for non-affine SCoP regions When the whole SCoP is a non-affine region we need to use the surrounding loop in the construction of the schedule as that is the one that will be looked up after the schedule generation. This fixes bug 24947 llvm-svn: 248667	2015-09-26 13:41:43 +00:00
Tobias Grosser	99c70dd8d1	Ensure memory accesses to the same array have identical dimensionality When recovering multi-dimensional memory accesses, it may happen that different accesses to the same base array are recovered with different dimensionality. This patch ensures that the dimensionalities are unified by adding zero valued dimensions to acesses with lower dimensionality. When starting to model fixed-size arrays as multi-dimensional in 247906, this has not been taken care of. llvm-svn: 248662	2015-09-26 08:55:54 +00:00
Michael Kruse	8d0b734e71	Let MemoryAccess remember its purpose There are three possible reasons to add a memory memory access: For explicit load and stores, for llvm::Value defs/uses, and to emulate PHI nodes (the latter two called implicit accesses). Previously MemoryAccess only stored IsPHI. Register accesses could be identified through the isScalar() method if it was no IsPHI. isScalar() determined the number of dimensions of the underlaying array, scalars represented by zero dimensions. For the work on de-LICM, implicit accesses can have more than zero dimensions, making the distinction of isScalars() useless, hence now stored explicitly in the MemoryAccess. Instead, we replace it by isImplicit() and avoid the term scalar for zero-dimensional arrays as it might be confused with llvm::Value which are also often referred to as scalars (or alternatively, as registers). No behavioral change intended, under the condition that it was impossible to create explicit accesses to zero-dimensional "arrays". llvm-svn: 248616	2015-09-25 21:21:00 +00:00
Michael Kruse	33d6c0bbc5	Use per-Purpose overloads for MemoryAccess creation This makes the intent of each created object clearer and allows to add more specific asserts. The bug fixed in r248535 has been discovered this way. No functional change intended; everything should behave as before. llvm-svn: 248603	2015-09-25 18:53:27 +00:00
Tobias Grosser	c90ed44b31	[cmake] terminate isl/stdint.h with a newline This avoids warnings of the form: stdint.h:1:20: warning: no newline at end of file [-Wnewline-eof] llvm-svn: 248570	2015-09-25 11:45:26 +00:00
Tobias Grosser	c2bb0cbe00	Sort includes using Chandler's sort_includes.py script llvm-svn: 248568	2015-09-25 09:49:19 +00:00
Tobias Grosser	da95a4a7c7	Handle read-only scalars used in PHI-nodes correctly This change addresses three issues: - Read only scalars that enter a PHI node through an edge that comes from outside the scop are not modeled any more, as such PHI nodes will always be initialized to this initial value right before the SCoP is entered. - For PHI nodes that depend on a scalar value that is defined outside the scop, but where the scalar values is passed through an edge that itself comes from a BB that is part of the region, we introduce in this basic block a read of the out-of-scop value to ensure it's value is available to write it into the PHI alloc location. - Read only uses of scalars by PHI nodes are ignored in the general read only handling code, as they are taken care of by the general PHI node modeling code. llvm-svn: 248535	2015-09-24 20:59:59 +00:00
Michael Kruse	26ed65e00d	Fix comparison signed/unsigned mismatch warning; NFC llvm-svn: 248520	2015-09-24 17:32:49 +00:00
Michael Kruse	b280ded108	Rename Polly_isl to PollyISL Library names in the LLVM framework usually do not contain underscores. llvm-svn: 248487	2015-09-24 12:38:49 +00:00
Michael Kruse	2d0ece960f	Remove Analysis Output of TempScopInfo After the merge of TempScopInfo into ScopInfo the analysis output remained because of the existing unit tests. These remains are removed and the units tests converted to match the equivalent output of ScopInfo's analysis output. The unit tests are also moved into the directory of ScopInfo tests. Differential Revision: http://reviews.llvm.org/D13116 llvm-svn: 248485	2015-09-24 11:41:21 +00:00
Michael Kruse	519b3cfd27	Compile ISL into its own library Refactor all ISL-related cmake build instructions into its own CMakeLists.txt and build as a separate library. This is useful to apply ISL-related build flags to ISL only and not to Polly's files. Also, it the separation of both projects becomes clearer. Proposed name of the library is Polly_isl. It is not "isl" to avoid mix-up with potentially installed libisl.{a\|so}. Tested configurations: - Windows with cmake 3.2 - Ubuntu with cmake 3.0.2 - Ubuntu with cmake 3.0.2 BUILD_SHARED_LIBS on - Ubuntu with cmake 2.8.12.2 (LLVM minimum version) - Ubuntu out-of-LLVM-tree Differential Revision: http://reviews.llvm.org/D12810 llvm-svn: 248484	2015-09-24 11:30:22 +00:00
Johannes Doerfert	e526de5a47	Make MIN_LOOP_TRIP_COUNT a static constant llvm-svn: 248192	2015-09-21 19:10:11 +00:00
Tobias Grosser	b1c39429d9	Do not model delinearized and linearized access relation for a single access A missing return statement that previously did not have a visibly negative effect caused after some data-structure changes in r248024 multi-dimensional accesses to be modeled both multi-dimensional as well as linearized. This commit adds the missing return to avoid the incorrect double modeling as well as the compile time increases it caused. llvm-svn: 248171	2015-09-21 16:19:25 +00:00
Johannes Doerfert	6a72a2af13	Use <nsw> AddRecs in the affinator to avoid bounded assumptions If we encounter a <nsw> tagged AddRec for a loop we know the trip count of that loop has to be bounded or the semantics is undefined anyway. Hence, we only need to add unbounded assumptions if no such AddRec is known. llvm-svn: 248128	2015-09-20 16:59:23 +00:00
Johannes Doerfert	707a406078	Add bounded loop assumption So far we ignored the unbounded parts of the iteration domain, however we need to assume they do not occure at all to remain sound if they do. llvm-svn: 248126	2015-09-20 16:38:19 +00:00
Johannes Doerfert	f2cc86edae	Simplify domain generation We now add loop carried information during the second traversal of the region instead of in a intermediate step in-between. This makes the generation simpler, removes code and should even be faster. llvm-svn: 248125	2015-09-20 16:15:32 +00:00
Johannes Doerfert	06c57b594c	Allow loops with multiple back edges In order to allow multiple back edges we: - compute the conditions under which each back edge is taken - build the union over all these conditions, thus the condition that any back edge is taken - apply the same logic to the union we applied to a single back edge llvm-svn: 248120	2015-09-20 15:00:20 +00:00
Johannes Doerfert	7175bdfbe4	Add loop trip count based heuristic for SCoP detection As we currently do not perform any optimizations that targets (or is even aware) small trip counts we will skip them when we count the loops in a region. llvm-svn: 248119	2015-09-20 14:56:54 +00:00
Johannes Doerfert	b276bde162	[NFC] Remove obsolete diagnostic for unstructured control flow llvm-svn: 248118	2015-09-20 14:55:50 +00:00
Michael Kruse	84f70acd68	Remove unused variable Dimension [NFC] llvm-svn: 248026	2015-09-18 20:03:32 +00:00
Michael Kruse	e2bccbbfb2	Merge IRAccess into MemoryAccess All MemoryAccess objects will be owned by ScopInfo::AccFuncMap which previously stored the IRAccess objects. Instead of creating new MemoryAccess objects, the already created ones are reused, but their order might be different now. Some fields of IRAccess and MemoryAccess had the same meaning and are merged. This is the last step of fusioning TempScopInfo.{h\|cpp} and ScopInfo.{h.cpp}. Some refactoring might still make sense. Differential Revision: http://reviews.llvm.org/D12843 llvm-svn: 248024	2015-09-18 19:59:43 +00:00
Tobias Grosser	b09455dee0	Store EscapeMap as Value* instead of AllocInst This currently does not change the behavior in Polly, but it allows us to later also overwrite the EscapeMap with our GlobalMap. llvm-svn: 247970	2015-09-18 06:01:11 +00:00
Tobias Grosser	6f36d9ab01	Delinearize multi-dimensional arrays through bitcasts In some cases instcombine introduces bitcasts that slightly obfuscate the multi-dimensionality of an array. This patch teaches our fixed-size delinearization how to look through bitcasts. llvm-svn: 247928	2015-09-17 20:16:21 +00:00
Tobias Grosser	0537f41de5	Do not use the assumed context in the dependence analysis any more This information is implicitly available through the multi-dimensionality of memory accesses. This reduces compile time for 3mm from 430ms to 400ms and should generally benefit compile time for cases where the assumed context is complex. llvm-svn: 247907	2015-09-17 17:28:19 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Tobias Grosser	faf8f6f62e	Extract function that derives the index expressions of a GEP instruction [NFC] We currently use this functionality to add run-time assumptions that check its in-bound property. llvm-svn: 247893	2015-09-17 15:47:52 +00:00
Tobias Grosser	e375d8058a	Add option to enable/disable reduction usage in dependence analysis llvm-svn: 247781	2015-09-16 09:50:17 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00
Johannes Doerfert	cef616fe2d	Use blocks instead of domains in SCEVAffinator Due to the new domain generation, the SCoP keeps track of the domain for all blocks, thus the SCEVAffinator can now work with blocks to avoid duplication of the domains. llvm-svn: 247731	2015-09-15 22:49:04 +00:00
Johannes Doerfert	b20f151d56	Coalesce the constructed domains early llvm-svn: 247728	2015-09-15 22:11:49 +00:00
Michael Kruse	2846877d88	Replace some SmallVector-typed parameters by ArrayRef ArrayRef avoids making implementation details such as the number of stack elements to be part of the function signature. llvm-svn: 247572	2015-09-14 15:45:33 +00:00
Tobias Grosser	0b13890042	Fix formatting llvm-svn: 247549	2015-09-14 11:38:06 +00:00
Johannes Doerfert	e114dc024e	[FIX] Handle error blocks in non-affine regions correctly llvm-svn: 247545	2015-09-14 11:15:58 +00:00
Johannes Doerfert	40fa56f59f	[FIX] Allow the whole SCoP to be a non-affine subregion llvm-svn: 247544	2015-09-14 11:15:07 +00:00
Johannes Doerfert	36255eecd8	Revert r247278 "Disable support for modulo expressions" This reverts commit 00c5b6ca8832439193036aadaaaee92a43236219. We can handle modulo expressions in the domain again. llvm-svn: 247542	2015-09-14 11:14:23 +00:00
Johannes Doerfert	ca1e38fa43	Propagate exit conditions as described in the PET paper At some point we build loop trip counts using this method. It was replaced by a simpler trick that works only for affine (e.g., not modulo) constraints and relies on the removal of unbounded parts. In order to allow modulo constrains again we go back to the former, more accurate method. llvm-svn: 247540	2015-09-14 11:12:52 +00:00
Michael Kruse	9d08009dff	Merge TempScop into Scop Summary: TempScop is basically a holder for AccFuncMap, the dictionary from BasicBlocks to IRAccess lists. We move the list into polly::Scop and remove the polly::TempScop class. There is one small change in behavior: If ScopInfo finds that its AssumedContext is impossible, it bails out by deleting the Scop object. The TempScop::print (invoked with opt -polly-scops -analyze) cannot print the AccFuncMap anymore as it would with a separate TempScop. Differential Revision: http://reviews.llvm.org/D12803 llvm-svn: 247480	2015-09-11 21:41:48 +00:00
Sylvestre Ledru	3e94031632	Update autoconf too: Analysis/TempScopInfo.cpp has been removed llvm-svn: 247419	2015-09-11 15:05:29 +00:00
Michael Kruse	ef3cf01d1c	Add Polly header files to IDE projects llvm-svn: 247398	2015-09-11 09:01:55 +00:00
Johannes Doerfert	90db75ed24	Runtime error check elimination Hoist runtime checks in the loop nest if they guard an "error" like event. Such events are recognized as blocks with an unreachable terminator or a call to the ubsan function that deals with out of bound accesses. Other "error" events can be added easily. We will ignore these blocks when we detect/model/optmize and code generate SCoPs but we will make sure that they would not have been executed using the assumption framework. llvm-svn: 247310	2015-09-10 17:51:27 +00:00
Johannes Doerfert	f4fa9879fb	[FIX] Do not assume only one loop can be left at a time llvm-svn: 247291	2015-09-10 15:53:59 +00:00
Johannes Doerfert	b68cffb5df	Allow general loops with one latch As we do not rely on ScalarEvolution any more we do not need to get the backedge taken count. Additionally, our domain generation handles everything that is affine and has one latch and our ScopDetection will over-approximate everything else. This change will therefor allow loops with: - one latch - exiting conditions that are affine Additionally, it will not check for structured control flow anymore. Hence, loops and conditionals are not necessarily single entry single exit regions any more. Differential Version: http://reviews.llvm.org/D12758 llvm-svn: 247289	2015-09-10 15:27:46 +00:00
Michael Kruse	d868b5d509	Merge TempScopInfo into ScopInfo The TempScopInfo (-polly-analyze-ir) pass is removed and its work taken over by ScopInfo (-polly-scops). Several tests depend on -polly-analyze-ir and use -polly-scops instead which for the moment prints the output of both passes. This again is not expected by some other tests, especially those with negative searches, which have been adapted. Differential Version: http://reviews.llvm.org/D12694 llvm-svn: 247288	2015-09-10 15:25:24 +00:00
Johannes Doerfert	32ae76e7f9	[NFC] Remove obsolete arguments Remove some arguments that survived the recent changes but are not used any more. llvm-svn: 247280	2015-09-10 13:12:02 +00:00
Johannes Doerfert	5b9ff8b667	Replace ScalarEvolution based domain generation This patch replaces the last legacy part of the domain generation, namely the ScalarEvolution part that was used to obtain loop bounds. We now iterate over the loops in the region and propagate the back edge condition to the header blocks. Afterwards we propagate the new information once through the whole region. In this process we simply ignore unbounded parts of the domain and thereby assume the absence of infinite loops. + This patch already identified a couple of broken unit tests we had for years. + We allow more loops already and the step to multiple exit and multiple back edges is minimal. + It allows to model the overflow checks properly as we actually visit every block in the SCoP and know where which condition is evaluated. - It is currently not compatible with modulo constraints in the domain. Differential Revision: http://reviews.llvm.org/D12499 llvm-svn: 247279	2015-09-10 13:00:06 +00:00
Johannes Doerfert	171f07ed71	Disable support for modulo expressions The support for modulo expressions is not comlete and makes the new domain generation harder. As the currently broken domain generation needs to be replaced, we will first swap in the new, fixed domain generation and make it compatible with the modulo expressions later. llvm-svn: 247278	2015-09-10 12:56:46 +00:00
Michael Kruse	7bf3944d23	Merge TempScopInfo.{cpp\|h} into ScopInfo.{cpp\|h} This prepares for a series of patches that merges TempScopInfo into ScopInfo to reduce Polly's code complexity. Only ScopInfo.{cpp\|h} will be left thereafter. Moving the code of TempScopInfo in one commit makes the mains diffs simpler to understand. In detail, merging the following classes is planned: TempScopInfo into ScopInfo TempScop into Scop IRAccess into MemoryAccess Only moving code, no functional changes intended. Differential Version: http://reviews.llvm.org/D12693 llvm-svn: 247274	2015-09-10 12:46:52 +00:00
Chandler Carruth	66ef16b289	[PM] Update Polly for the new AA infrastructure landed in r247167. llvm-svn: 247198	2015-09-09 22:13:56 +00:00
Michael Kruse	d16550de92	Fix typo: zycle -> cycle [NFC] llvm-svn: 247172	2015-09-09 18:20:31 +00:00
Johannes Doerfert	7ca8dc2d2d	Disable support for pointer expressions The support for pointer expressions is broken as it can only handle some patterns in the IslExprBuilder. We should to treat pointers in expressions the same as integers at some point and revert this patch. llvm-svn: 247147	2015-09-09 14:19:04 +00:00
Michael Kruse	da943ce613	Generate gitversion.h in autoconf builds Add a custom makefile rule to generate lib/External/isl/gitversion.h from GIT_HEAD_ID and trigger it using BULIT_SOURCES to ensure the file exists before compilation starts. The latest ISL creates gitversion.h from Makefile.am only, instead also from configure.ac in previous version. While the Polly build invokes configure, it does not invoke ISL's make such that the file was missing. Invoking ISL's make would come with additional problems such as triggering automake because of not preserved file time stamps. Re-running automake might not be successful on other system configurations for instance because it was preconfigured without --with-clang option. llvm-svn: 247142	2015-09-09 13:15:11 +00:00
Tobias Grosser	f1ac57c6cd	IslNodeBuilder: Add virtual function to obtain the schedule of an ast node Not all users of our IslNodeBuilder will attach scheduling information to the AST in the same way IslAstInfo is doing it today. By going through a virtual function when extracting the schedule of an AST node other users can provide their own functions for extract scheduling information in case they attach scheduling information in a different way to the AST nodes. No functional change for Polly itself intended. llvm-svn: 247126	2015-09-09 09:24:38 +00:00
Johannes Doerfert	717b866798	Allow PHI nodes in the region exit block While we do not need to model PHI nodes in the region exit (as it is not part of the SCoP), we need to prepare for the case that the exit block is split in code generation to create a single exiting block. If this will happen, hence if the region did not have a single exiting block before, we will model the operands of the PHI nodes as escaping scalars in the SCoP. Differential Revision: http://reviews.llvm.org/D12051 llvm-svn: 247078	2015-09-08 21:44:27 +00:00
Tobias Grosser	02e6589bda	Move more compile-time bailouts into -polly-detect-unprofitable Instead of having two separate options -polly-detect-scops-in-functions-without-loops and -polly-detect-scops-in-regions-without-loops we now just use -polly-detect-unprofitable to force the detection of scops ignoring any compile time saving bailout heuristics. llvm-svn: 247057	2015-09-08 19:46:41 +00:00
Tobias Grosser	a89dc57b41	Do not use '.' in subfunction names Certain backends, e.g. NVPTX, do not support '.' in function names. Hence, we ensure all '.' are replaced by '_' when generating function names for subfunctions. For the current OpenMP code generation, this is not strictly necessary, but future uses cases (e.g. GPU offloading) need this issue to be fixed. llvm-svn: 246980	2015-09-08 06:22:17 +00:00
Tobias Grosser	86bc93a9b2	Add option -polly-codegen-add-debug-printing When this option is enabled, Polly will emit printf calls for each scalar load/and store which dump the scalar value loaded/stored at run time. This patch also refactors the RuntimeDebugBuilder to use variadic templates when generating CPU printfs. As result, it now becomes easier to print strings that consist of a set of arguments. Also, as a single printf call is emitted, it is more likely for such strings to be emitted atomically if executed multi-threaded. llvm-svn: 246941	2015-09-06 08:47:57 +00:00
Tobias Grosser	e58d358171	RuntimeDebugPrinter: Simplify code [NFC] llvm-svn: 246940	2015-09-06 07:17:54 +00:00
Tobias Grosser	e3d8c05c5f	Add some more documentation and structure to the collection of subtree references Some of the structures are renamed, subfunction introduced to clarify the individual steps and comments are added describing their functionality. llvm-svn: 246929	2015-09-05 15:45:25 +00:00
Tobias Grosser	abcec37f64	IslNodeBuilder: Only obtain the isl_ast_build, when needed In the common case, the access functions are not modified, hence there is no need to obtain the IslAstBuild context at all. This should not only be minimally faster, but this also allows the IslNodeBuilder to work on asts that are not annotated with isl_ast_builds as long as the memory accesses are not modified. llvm-svn: 246928	2015-09-05 13:03:57 +00:00
Tobias Grosser	8eae8361fc	RegionGenerator: Do not modify GlobalMaps By inspection the update of the GlobalMaps in the RegionGenerator seems unneed, and is removed as also no test cases fail when dropping this. Johannes Doerfert confirmed that this is indeed save: "I think that code was needed when we did not use the scalar codegen by default. Now everything defined in a non-affine region should be communicated via memory and reloaded in the user block. Hence, we should be good removing this code." llvm-svn: 246926	2015-09-05 11:26:30 +00:00
Tobias Grosser	72b80672d9	OpenMP: Name the values passed to the subfunciton according to the original llvm::Values llvm-svn: 246924	2015-09-05 10:41:19 +00:00
Tobias Grosser	0d8874c0f6	OpenMP codegen: support generation of multi-dimensional access functions When computing the index expressions for new, multi-dimensional memory accesses these new index expressions may reference original llvm::Values that are not transfered into the OpenMP subfunction. Using GlobalMap we now replace references to such values with the rewritten values that have e.g. been passed to the OpenMP subfunction. llvm-svn: 246923	2015-09-05 10:32:56 +00:00
Tobias Grosser	bc13260775	BlockGenerator: Make GlobalMap a member variable The GlobalMap variable used in BlockGenerator should always reference the same list througout the entire code generation, hence we can make it a member variable to avoid passing it around through every function call. History: Before we switched to the SCEV based code generation the GlobalMap also contained a mapping form old to new induction variables, hence it was different for each ScopStmt, which is why we passed it as function argument to copyStmt. The new SCEV based code generation now uses a separate mapping called LTS -> LoopToSCEV that maps each original loop to a new loop iteration variable provided as a SCEVExpr. The GlobalMap is currently mostly used for OpenMP code generation, where references to parameters in the original function need to be rewritten to the locations of these variables after they have been passed to the subfunction. Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 246920	2015-09-05 09:56:54 +00:00
Tobias Grosser	6f73008506	Allow the import of multi-dimensional access functions Originally, we disallowed the import of multi-dimensional access functions due to our code generation not supporting the generation of new address expressions for multi-dimensional memory accesses. When building our run-time alias check infrastructure we added code generation support for multi-dimensional address calculations. Hence, we can now savely allow the import of new multi-dimensional access functions. llvm-svn: 246917	2015-09-05 07:46:47 +00:00
Tobias Grosser	166c422952	Use uppercase variable names [NFC] llvm-svn: 246916	2015-09-05 07:46:40 +00:00
Tobias Grosser	2df884f95a	ScopInfo: use project_out instead of remove_dims By just removing dimensions (and the constraints they are involved in) we may loose information about the dimensions we do not remove. By instead using project_out, we are sure all constraints on the outer dimensions are preserved. No test case, as this error condition is very unlikely to be triggered by isl's current code. We still 'fix' this, as isl gives little guarantees regarding the behavior of remove_divs. llvm-svn: 246567	2015-09-01 18:17:41 +00:00
Tobias Grosser	f560ca90ef	Update isl to isl-0.15-129-gb086c90 llvm-svn: 246552	2015-09-01 15:42:13 +00:00
Tobias Grosser	40820ca286	Fix another typo in the subloop counting ... as well as the corresponding test cases. Thank's Johannes for finding this bug. llvm-svn: 246483	2015-08-31 21:04:51 +00:00
Johannes Doerfert	5f912d3797	Do Not Model Unbounded Loops Code generation currently does not expect unbounded loops. When using ISL to compute the loop trip count, if we find that the iteration domain remains unbounded, we invalidate the Scop by creating an infeasible context. Contributed-by: Matthew Simpson <mssimpso@codeaurora.org> This fixes PR24634. Differential Revision: http://reviews.llvm.org/D12493 llvm-svn: 246477	2015-08-31 19:58:24 +00:00
Johannes Doerfert	f08bd00229	Build the domains with correct number of dimensions Instead of building domains with MaxLoopDepth dimensions, we now build the domains such that they have the right amount of dimensions all the time. llvm-svn: 246443	2015-08-31 13:56:32 +00:00
Tobias Grosser	d213d52d0e	Always use the branch instructions to model the PHI-node writes Before this commit we did this only for Arguments or Constants, but indeed an instruction may define a value a lot higher up in the dominance tree, but the actual write generally needs to happen right before branching to the PHI node. Otherwise, the writes of different branches into PHI nodes may get intermixed if they lay higher up in the dominance tree. llvm-svn: 246441	2015-08-31 13:45:54 +00:00
Tobias Grosser	050e0cbc0e	ScopDetection: Correctly count the loops in a region There is no reason the loops in a region need to touch either entry or exit block. Hence, we need to look through all loops that may touch the region as well as their children to understand if our region has at least two loops. llvm-svn: 246433	2015-08-31 12:08:11 +00:00
Tobias Grosser	44b34b0e8a	Also build scalar dependences for store instructions While ignoring read-only scalar dependences it was not necessary to consider store instructins, but as store instructions can be the target of a scalar read-only dependency we need to consider them for the construction of scalar read-only dependences. llvm-svn: 246429	2015-08-31 11:15:00 +00:00
Tobias Grosser	9f3d55cf3d	Generate scalar initialization loads at the beginning of the start BB Our OpenMP code generation generated part of its launching code directly into the start basic block and without this change the scalar initialization was run _after_ the OpenMP threads have been launched. This resulted in uninitialized scalar values to be used. llvm-svn: 246427	2015-08-31 11:06:19 +00:00
Tobias Grosser	f93451802a	OpenMP-codegen: Correctly pass function arguments to subfunctions Before we only checked if certain instructions can be expanded by us. Now we check any value, including function arguments. llvm-svn: 246425	2015-08-31 09:05:43 +00:00
Tobias Grosser	58758ef4ea	Enable modeling of scalar read-only dependences Even though these are not strictly necessary for sequential code generation, we still model both for sequential and parallel code generation to reduce the set of configurations that needs to be tested. If this turns out, against what we currently see, to be significant overhead, we can decide to limit this feature again to parallel code-generation use cases only. llvm-svn: 246420	2015-08-31 06:46:32 +00:00
Tobias Grosser	d86bf4271c	Do not model scalar references to constant values llvm-svn: 246418	2015-08-31 06:37:25 +00:00
Tobias Grosser	64c0ff4141	Add support for scalar dependences to OpenMP code generation Scalar dependences between scop statements have caused troubles during parallel code generation as we did not pass on the new stack allocation created for such scalars to the parallel subfunctions. This change now detects all scalar reads/writes in parallel subfunctions, creates the allocas for these scalar objects, passes the resulting memory locations to the subfunctions and ensures that within the subfunction requests for these memory locations will return the rewritten values. Johannes suggested as a future optimization to privatizing some of the scalars in the subfunction. llvm-svn: 246414	2015-08-31 05:52:24 +00:00
Johannes Doerfert	96425c2574	Traverse the SCoP to compute non-loop-carried domain conditions In order to compute domain conditions for conditionals we will now traverse the region in the ScopInfo once and build the domains for each block in the region. The SCoP statements can then use these constraints when they build their domain. The reason behind this change is twofold: 1) This removes a big chunk of preprocessing logic from the TempScopInfo, namely the Conditionals we used to build there. Additionally to moving this logic it is also simplified. Instead of walking the dominance tree up for each basic block in the region (as we did before), we now traverse the region only once in order to collect the domain conditions. 2) This is the first step towards the isl based domain creation. The second step will traverse the region similar to this step, however it will propagate back edge conditions. Once both are in place this conditional handling will allow multiple exit loops additional logic. Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12428 llvm-svn: 246398	2015-08-30 21:13:53 +00:00
Johannes Doerfert	b1e3bbb7c9	[FIX] Keep a copy of the Domain set in the SCEVAffinator llvm-svn: 246397	2015-08-30 19:52:06 +00:00
Tobias Grosser	2fc50df900	Do not store into a temporary twine For some reason, this causes memory corruption issues. Let's just avoid it. llvm-svn: 246396	2015-08-30 19:51:01 +00:00
Tobias Grosser	c0091a77f9	Store scalar dependences from outside the scop into alloca locations We already modeled read-only dependences to scalar values defined outside the scop as memory reads and also generated read accesses from the corresponding alloca instructions that have been used to pass these scalar values around during code generation. However, besides for PHI nodes that have already been handled, we failed to store the orignal read-only scalar values into these alloc. This commit extends the initialization of scalar values to all read-only scalar values used within the scop. llvm-svn: 246394	2015-08-30 19:19:34 +00:00
Tobias Grosser	b649e26a50	getNewScalarValue: Get ScalarMap directly from member variable [NFC] There is no need to pass the ScalarMap to getNewScalarValue as this map is (indirectly) used when calling getOrCreateScalarAlloca. llvm-svn: 246390	2015-08-30 17:37:55 +00:00
Tobias Grosser	655a4570cd	createScalarInitialization: Always store PHI-node value The current code really tries hard to use getNewScalarValue(), which checks if not the original value, but a possible copy or demoted value needs to be stored. In this calling context it seems, that we _always_ use the ScalarValue that comes from the incoming PHI node, but never any other value. As also no test cases fail, it seems right to just drop this call to getNewScalarValue and remove the parameters that are not needed any more. Johannes suggested that code like this might be needed for parallel code generation with offloading, but it was still unclear if/what exactly would be needed. As the parallel code generation does currently not support scalars at all, we will remove this code for now and add relevant code back when complitng the support of scalars in the parallel code generation. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D12470 llvm-svn: 246389	2015-08-30 17:32:39 +00:00
Tobias Grosser	e83a396b1d	Ignore debug intrinsics and do not model their potential scalar metadata reads Our code generation currently does not support scalar references to metadata values. Hence, it would crash if we try to model scalar dependences to metadata values. Fortunately, for one of the common uses, debug information, we can for now just ignore the relevant intrinsics and consequently the issue of how to model scalar dependences to metadata. llvm-svn: 246388	2015-08-30 16:57:20 +00:00
Tobias Grosser	9c0ffe3a1d	Remove some code duplication [NFC] llvm-svn: 246387	2015-08-30 16:57:15 +00:00
Tobias Grosser	fcfac082ea	Minor code style improvement [NFC] llvm-svn: 246386	2015-08-30 16:01:58 +00:00
Tobias Grosser	2985400a0e	Remove isNew from getOrCreateAlloca This commit drops some dead code. Specifically, there is no need to initialize the virtual memory locations of scalars in BlockGenerator::handleOutsideUsers, the function that initalizes the escape map that keeps track of out-of-scope uses of scalar values. We already model instructions inside the scop that are used outside the scope (escaping instructions) as scalar memory writes at the position of the instruction. As a result, the virtual memory location of this instructions is already initialized when code-generating the corresponding virtual scalar write and consequently does not need to be initialized later on when generating the set of escaping values. Code references: In TempScopInfo::buildScalarDependences we detect scalar cross-statement dependences for all instructions (including PHIs) that have uses outside of the scop's region: // Check whether or not the use is in the SCoP. if (!R->contains(UseParent)) { AnyCrossStmtUse = true; continue; } We use this information in TempScopInfo::buildAccessFunctions were we build scalar write memory accesses for all these instructions: if (!isa<StoreInst>(Inst) && buildScalarDependences(Inst, &R, NonAffineSubRegion)) { // If the Instruction is used outside the statement, we need to build the // write access. IRAccess ScalarAccess(IRAccess::MUST_WRITE, Inst, ZeroOffset, 1, true, Inst); Functions.push_back(std::make_pair(ScalarAccess, Inst)); } Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D12472 llvm-svn: 246383	2015-08-30 15:03:59 +00:00
Tobias Grosser	f8d55f7e4e	Remove some code duplication when creating Allocas [NFC] llvm-svn: 246364	2015-08-29 18:12:03 +00:00
Johannes Doerfert	b409fdc0d7	[NFC] Make SCEVAffinator work without a statement llvm-svn: 246290	2015-08-28 09:24:35 +00:00
Tobias Grosser	b79a67df78	BlockGenerator: Make scalar memory locations accessible For external users, the memory locations into which we generate scalar values may be of interest. This change introduces two functions that allow to obtain (or create) the AllocInsts for a given BasePointer. We use this change to simplify the code in BlockGenerators. llvm-svn: 246285	2015-08-28 08:23:35 +00:00
Tobias Grosser	ed21a1fc7e	Do not detect Scops with only one loop. If a region does not have more than one loop, we do not identify it as a Scop in ScopDetection. The main optimizations Polly is currently performing (tiling, preparation for outer-loop vectorization and loop fusion) are unlikely to have a positive impact on individual loops. In some cases, Polly's run-time alias checks or conditional hoisting may still have a positive impact, but those are mostly enabling transformations which LLVM already performs for individual loops. As we do not focus on individual loops, we leave them untouched to not introduce compile time regressions and execution time noise. This results in good compile time reduction (oourafft: -73.99%, smg2000: -56.25%). Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12268 llvm-svn: 246161	2015-08-27 16:55:18 +00:00
Tobias Grosser	2d1ed0bfa7	BlockGenerator: Add the possiblity to pass a set of new access functions This change allows the BlockGenerator to be reused in contexts where we want to provide different/modified isl_ast_expressions, which are not only changed to a different access relation than the original statement, but which may indeed be different for each code-generated instance of the statement. We ensure testing of this feature by moving Polly's support to import changed access functions through a jscop file to use the BlockGenerators support for generating arbitary access functions if provided. This commit should not change the behavior of Polly for now. The diff is rather large, but most changes are due to us passing the NewAccesses hash table through functions. This style, even though rather verbose, matches what is done throughout the BlockGenerator with other per-statement properties. llvm-svn: 246144	2015-08-27 07:28:16 +00:00
Johannes Doerfert	d020b77295	Use ISL to Determine Loop Trip Count Use ISL to compute the loop trip count when scalar evolution is unable to do so. Contributed-by: Matthew Simpson <mssimpso@codeaurora.org> Differential Revision: http://reviews.llvm.org/D9444 llvm-svn: 246142	2015-08-27 06:53:52 +00:00
Tobias Grosser	01c8f5f354	[Vectorizer] Detect strides in multi-dimensional arrays The original code was only correct for one-dimensional arrays, but derived incorrect strides for multi-dimensional arrays. llvm-svn: 245888	2015-08-24 22:20:46 +00:00
Tobias Grosser	39f9f30e8b	Only derive number of loop iterations for loops we can actually vectorize llvm-svn: 245870	2015-08-24 20:11:34 +00:00
Tobias Grosser	fa57e9b7e6	Make our data-locality schedule tree transforms externally accessible Other passes which perform different optimizations might be interested in also applying data-locality transformations as part of their overall transformation. llvm-svn: 245824	2015-08-24 06:01:47 +00:00
Tobias Grosser	1ac884d73a	Use marker nodes to annotate the different levels of tiling Currently, marker nodes are ignored during AST generation, but visible in the -debug-only=polly-ast output. llvm-svn: 245809	2015-08-23 09:11:00 +00:00
Tobias Grosser	75296901f7	Fix 'unused variable' warning in NASSERTS build llvm-svn: 245723	2015-08-21 19:23:21 +00:00
Roman Gareev	c49724f008	Manually check a loop form Add manual check of a loop form and return non-negative number of iterations in case of trivially vectorizable loop. llvm-svn: 245680	2015-08-21 09:08:14 +00:00
Tobias Grosser	daaed0e19f	Do not intersect with AssumedContext in calculateMinMaxAccess Originally, we intersected the iteration space with the AssumedContext before computing the minimal/maximal memory offset in our run-time alias checks. With this patch we drop this intersection as the AssumedContext can - for larger or more complex scops - become very complicated (contain many disjuncts). When intersecting an object with many disjuncts with other objects, the number of disjuncts in these other objects also increases quickly. As a result, the compile time is unnecessarily increased. This patch now drops the intersection with the assumed context to ensure we do not pay unnecessary compile time costs. With this patch we see -3.17% reduction in compile time for 3mm with default flags and -17.87% when compiling 3mm with -DPOLYBENCH_USE_C99_PROTO flag. We did not observe any regressions in LNT. Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12198 llvm-svn: 245617	2015-08-20 21:29:26 +00:00
Tobias Grosser	fc490a99f5	Do really not unroll the vector loop in combination with register tiling The previous commit lacked a test case for register tiling + pre-vectorization and we obviously got it immediately wrong. llvm-svn: 245599	2015-08-20 19:08:16 +00:00
Tobias Grosser	d83b8a83ec	Add option to control reduction detection llvm-svn: 245598	2015-08-20 19:08:11 +00:00
Tobias Grosser	40985016b2	Fix formatting llvm-svn: 245597	2015-08-20 19:08:05 +00:00
Johannes Doerfert	120de4be96	Simplify the SCoP creation and bookkeeping To avoid multiple exits and the resulting complicated conditions when creating a SCoP we now use the single hasFeasibleRuntimeContext() check to decide if a SCoP should be dismissed right after construction. If building runtime checks failed the assumed context is made infeasible, hence the optimized version will never be executed and the SCoP can be dismissed. llvm-svn: 245593	2015-08-20 18:30:08 +00:00
Johannes Doerfert	5d5b30649a	Check feasibility for the runtime check context wrt. the domain. If nothing is executed we can bail out early. Otherwise we can use the constraints that ensure at least one statement is executed for simplification. llvm-svn: 245585	2015-08-20 18:06:30 +00:00
Johannes Doerfert	4eed5bea54	Link ScopArrayInfo objects We will record if a SAI is the base of another SAI or derived from it. This will allow to reason about indirect base pointers later on and allows a clearer picture of indirection also in the SCoP dump. llvm-svn: 245584	2015-08-20 18:04:22 +00:00
Tobias Grosser	42e2489553	Add experimental support for trivial register tiling Register tiling in Polly is for now just an additional level of tiling which is fully unrolled. It is disabled by default. To make this useful for more than experiments, we still need a cost function as well as possibly further optimizations that teach LLVM to actually put some of the values we got into scalar registers. llvm-svn: 245564	2015-08-20 13:45:05 +00:00
Tobias Grosser	0483271662	Add support for two-level tiling By default we only use one level of tiling for loops, but in general tiling for multiple levels is trivial for us. Hence, we add a set of options that allow people to play with a second level of tiling. If this is profitable for some cases we can work on heuristics that allow us to identify these cases and use two-level tiling for them. llvm-svn: 245563	2015-08-20 13:45:02 +00:00
Tobias Grosser	862b9b5239	Factor out check for tileable band node. llvm-svn: 245559	2015-08-20 12:32:45 +00:00
Tobias Grosser	9bdea573bd	Introduce tileBand function to simplify code llvm-svn: 245558	2015-08-20 12:22:37 +00:00
Tobias Grosser	d891b54132	Add some forgotten isl memory annotations llvm-svn: 245557	2015-08-20 12:16:23 +00:00
Johannes Doerfert	43788c5783	Check for feasible runtime check context early Instead of generating code for an empty assumed context we bail out early. As the number of assumptions we generate increases this becomes more and more important. Additionally, this change will allow us to hide internal contexts that are only used in runtime checks e.g., a boundary context with constraints not suited for simplifications. llvm-svn: 245540	2015-08-20 05:58:56 +00:00
Tobias Grosser	b0da42fb55	Generate alias metadata even in OpenMP mode To make alias scope metadata generation work in OpenMP mode we now provide the ScopAnnotator with information about the base pointer rewrite that happens when passing arrays into the OpenMP subfunction. llvm-svn: 245451	2015-08-19 16:04:35 +00:00
Tobias Grosser	d8e3c8c665	Fix typo llvm-svn: 245441	2015-08-19 14:22:48 +00:00
Tobias Grosser	07c1c2fcc9	Make prevectorization width configurable Polly uses 'prevectorization' to enable outer loop vectorization. When vectorizing an outer loop, we strip-mine <number-of-prevec-dims> loop iterations which are than interchanged to the innermost level such that LLVM's inner loop vectorizer (or Polly's simple vectorizer) can easily vectorize this loop. The number of loop iterations to strip-mine is now configurable with the option -polly-prevect-width=<number-of-prevec-dims>. This is mostly a debugging option. We should probably add a heuristic that derives the number of prevectorization dimensions from the target data and the data types used. llvm-svn: 245424	2015-08-19 08:46:11 +00:00
Tobias Grosser	161c9081e5	Do not use negative option name Instead of -polly-no-tiling, we use -polly-tiling=false to disable tiling. llvm-svn: 245423	2015-08-19 08:22:06 +00:00
Tobias Grosser	f10f4636ff	Simplify tiling code a bit We only need to allocate the tile size vector if we actually want to perform a tiling. llvm-svn: 245422	2015-08-19 08:03:37 +00:00
Michael Kruse	acb6ade757	Move early exit to the beginning of the function If the function exits early there is no reason to enter the loop. llvm-svn: 245316	2015-08-18 17:25:48 +00:00
Roman Gareev	f2bd72e00d	Use isl_set_is_subset instead of isl_set_is_equal It helps to detect correct strides in case of parametric constraints of Stride in MemoryAccess::isStrideX. Reviewers: grosser llvm-svn: 245303	2015-08-18 16:12:05 +00:00
Tobias Grosser	cf9ebb63d6	Use schedule trees to compute dependences This patch changes Polly to compute the data-dependences on the schedule tree instead of a flat schedule representation. Calculating dependences directly on the schedule tree results in some good compile-time improvements (adi : -23.35%, 3mm : -9.57%), as the structure of the schedule can be exploited for increased efficiency. Earlier experiments with schedule tree based dependence analysis in Polly showed some compile-time regressions. These regressions arose due to the schedule tree based dependence analysis not taking into account the domain constraints of the schedule tree. As a result, the computed dependences were different and this difference caused in some cases the schedule optimizer to take a very long time. Since isl version fe865996 the schedule tree based dependence analysis takes domain constraints into account, which fixes the earlier compile-time issues. Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 245300	2015-08-18 15:05:29 +00:00
Michael Kruse	d2b0360197	Fix Codegen adding a second exit out of region executeScopConditionally would destroy a predecessor region if it the scop's entry was the region's exit block by forking it to polly.start and thus creating a secnd exit out of the region. This patch "shrinks" the predecessor region s.t. polly.split_new_and_old is not the region's exit anymore. llvm-svn: 245294	2015-08-18 13:14:42 +00:00
Johannes Doerfert	e69e1141d9	Introduce the ScopExpander as a SCEVExpander replacement The SCEVExpander cannot deal with all SCEVs Polly allows in all kinds of expressions. To this end we introduce a ScopExpander that handles the additional expressions separatly and falls back to the SCEVExpander for everything else. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12066 llvm-svn: 245288	2015-08-18 11:56:00 +00:00
Tobias Grosser	77c0f5a3b7	Drop dead and disable code from IndependentBlocks Since Polly has now support for the code generation of scalar and PHI dependences this code was unused and is now dropped. llvm-svn: 245284	2015-08-18 09:30:28 +00:00
Johannes Doerfert	d86f2157e5	Add a field to the memory access class for a related value. The new field in the MemoryAccess allows us to track a value related to that access: - For real memory accesses the value is the loaded result or the stored value. - For straigt line scalar accesses it is the access instruction itself. - For PHI operand accesses it is the operand value. We use this value to simplify code which deduced information about the value later in the Polly pipeline and was known to be error prone. Reviewers: grosser, Meinsersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12062 llvm-svn: 245213	2015-08-17 10:58:17 +00:00
Tobias Grosser	c5bcf246d1	Fix Polly after SCEV port to new pass manager This fixes compilation after LLVM commit r245193. llvm-svn: 245211	2015-08-17 10:57:08 +00:00
Johannes Doerfert	e1fa6da356	[FIX] Create location if a needed value was not yet demoted This allows the code generation to continue working even if a needed value (that is reloaded anyway) was not yet demoted. Instead of failing it will now create the location for future demotion to memory and load from that location. The stores will use the same location and by construction execute before the load even if the textual order in the generated AST is otherwise. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12072 llvm-svn: 245203	2015-08-17 09:38:46 +00:00
Johannes Doerfert	45545ff782	Build the ScopStmt domain in-place. This will build the statement domains in-place, hence using the ScopStmt::Domain member instead of some intermediate isl_set. llvm-svn: 245179	2015-08-16 14:36:01 +00:00

... 4 5 6 7 8 ...

1659 Commits