llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Kruse	fd9c89e84b	Fix non-affine generated entering node not being recognized as dominating Scalar reloads in the generated entering block were not recognized as dominating the subregions locks when there were multiple entering nodes. This resulted in values defined in there not being copied. As a fix, we unconditionally add the BBMap of the generated entering node to the generated entry. This fixes part of llvm.org/PR25439. llvm-svn: 252445	2015-11-09 05:00:30 +00:00
Johannes Doerfert	188542fda9	[FIX] Initialize incoming scalar memory locations for PHIs llvm-svn: 252437	2015-11-09 00:21:21 +00:00
Johannes Doerfert	80c716df2b	[NFC] Remove unused variable. llvm-svn: 252436	2015-11-09 00:21:04 +00:00
Johannes Doerfert	f85ad0411f	[FIX] Carefully simplify assumptions in the presence of error blocks If a SCoP contains error blocks we cannot use the domain constraints to simplify the assumptions as the domain is already influenced by the assumptions we took. Before this patch we did that and some assumptions became self-fulfilling as they were implied by the domain constraints. llvm-svn: 252424	2015-11-08 20:16:39 +00:00
Johannes Doerfert	a768624f14	[FIX] Introduce different SAI objects for scalar and memory accesses Even if a scalar and memory access have the same base pointer, we cannot use one SAI object as the type but also the number of dimensions are wrong. For the attached test case this caused a crash in the invariant load hoisting, though it could cause various other problems too. This fixes bug 25428 and a execution time bug in MallocBench/cfrac. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 252422	2015-11-08 19:12:05 +00:00
Johannes Doerfert	3797707695	[FIX] Use unreachable to indicate dead code and repair dominance When we bail out early we make the partially build new code path practically dead, though it was not unreachable. To remove dominance problems we now make it not only dead but also prevent the control flow to join with the original code path, thus allow to use original values after the SCoP without any PHI nodes. This fixes bug 25447. llvm-svn: 252420	2015-11-08 17:57:41 +00:00
Johannes Doerfert	1dd6e37af4	Verify IR after we bail out The bail out in r252412 left the code generation without verifying the (so far) generated IR. This will change now and ensure we always run the verifier. Suggested-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 252419	2015-11-08 16:34:17 +00:00
Johannes Doerfert	c4898504ea	[FIX] Bail out if there is a dependence cycle between invariant loads While the program cannot cause a dependence cycle between invariant loads, additional constraints (e.g., to ensure finite loops) can introduce them. It is hard to detect them in the SCoP description, thus we will only check for them at code generation time. If such a recursion is detected we will bail out the code generation and place a "false" runtime check to guarantee the original code is used. This fixes bug 25443. llvm-svn: 252412	2015-11-07 19:46:04 +00:00
Johannes Doerfert	44483c5599	[FIX] Remove all invariant load occurences from own execution context llvm-svn: 252411	2015-11-07 19:45:27 +00:00
Michael Kruse	0651480b97	Fix non-affine region dominance of implicitely stored values After loop versioning, a dominance check of a non-affine subregion's exit node causes the dominance check to always fail on any block in the subregion if it shares the same exit block with the scop. The subregion's exit block has become polly_merge_new_and_old, which also receives the control flow of the generated code. This would cause that any value for implicit stores is assumed to be not from the scop. We check dominance with the generated exit node instead. This fixes llvm.org/PR25438 llvm-svn: 252375	2015-11-07 00:36:50 +00:00
Duncan P. N. Exon Smith	b8f58b53dd	polly/ADT: Remove implicit ilist iterator conversions, NFC Remove all the implicit ilist iterator conversions from polly, in preparation for making them illegal in ADT. There was one oddity I came across: at line 95 of lib/CodeGen/LoopGenerators.cpp, there was a post-increment `Builder.GetInsertPoint()++`. Since it was a no-op, I removed it, but I admit I wonder if it might be a bug (both before and after this change)? Perhaps it should be a pre-increment? llvm-svn: 252357	2015-11-06 22:56:54 +00:00
Tobias Grosser	712229ec59	Add missing '%loadPolly' to test case llvm-svn: 252302	2015-11-06 14:03:35 +00:00
Michael Kruse	ddb6528ba6	Fix reuse of non-dominating synthesized value in subregion exit We were adding all generated values in non-affine subregions to be used for the subregions generated exit block. The thought was that only values that are dominating the original exit block can be used there. But it is possible for synthesizable values to be expanded in any block. If the same values is also used for implicit writes, it would try to reuse already synthesized values even if not dominating the exit block. The fix is to only add values to the list of values usable in the exit block only if it is dominating the exit block. This fixes llvm.org/PR25412. llvm-svn: 252301	2015-11-06 13:51:24 +00:00
Tobias Grosser	6578f001bf	Adjust debug metadata to LLVM changes in 252219 llvm-svn: 252273	2015-11-06 06:27:39 +00:00
Tobias Grosser	f1bfd75221	ScopInfo: Allocate globally unique memory access identifiers Before this commit memory reference identifiers have only been unique per basic block, but not per (non-affine) ScopStmt. This commit now uses the MemoryAccess base pointer to uniquely identify each Memory access. llvm-svn: 252200	2015-11-05 20:15:37 +00:00
Tobias Grosser	4bf262a0f2	RunTimeDebugBuilder: Allocate memory _after_ knowing how much is needed This fixes a memory corruption issue, where we accessed more memory than actually allocated. llvm-svn: 252197	2015-11-05 19:43:34 +00:00
Michael Kruse	27149cf32d	Use per-BB value maps for non-exit BBs For generating scalar writes of non-affine subregions, all except phi writes are generated in the exit block. The phi writes are generated in the incoming block for which we errornously used the same BBMap. This can conflict if a value for one block is synthesized, and then reused for another block which is not dominated by the first block. This is fixed by using block-specific BBMaps for phi writes. llvm-svn: 252172	2015-11-05 16:17:17 +00:00
Michael Kruse	f714d470d7	Fix escaping value to subregion entry node phi An incoming value from a block the is not inside the scop is an external use, even if the phi is inside the scop. A previous fix in r251208 did not apply if the phi is inside a non-affine subregion. We move the check for this phi case before the non-affine subregion check. llvm-svn: 252157	2015-11-05 13:18:43 +00:00
Johannes Doerfert	22892687f7	[FIX] Simplify and correct preloading of base pointer origin To simplify and correct the preloading of a base pointer origin, e.g., the base pointer for the current indirect invariant load, we now just check if there is an invariant access class that involves the base pointer of the current class. llvm-svn: 251962	2015-11-03 19:15:33 +00:00
Johannes Doerfert	eca9e890b9	Remove read-only statements from the SCoP We do not need to model read-only statements in the SCoP as they will not cause any side effects that are visible to the outside anyway. Removing them should safe us time and might even simplify the ASTs we generate. Differential Revision: http://reviews.llvm.org/D14272 llvm-svn: 251948	2015-11-03 16:54:49 +00:00
Johannes Doerfert	e071f6d637	[NFC] Name invariant load parameters after base pointer This just makes the debug output nices sometimes. llvm-svn: 251947	2015-11-03 16:49:59 +00:00
Johannes Doerfert	475d8e3f42	[FIX] Ensure base pointer origin was preloaded already If a base pointer of a preloaded value has a base pointer origin, thus it is an indirect invariant load, we have to make sure the base pointer origin is preloaded first. llvm-svn: 251946	2015-11-03 16:49:02 +00:00
Johannes Doerfert	d6fc0701ee	[FIX] Carefully rewrite parameters wrt. invariant equivalence classes ScalarEvolution doesn't allow the operands of an AddRec to be variant in the loop of the AddRec. When we rewrite parameter SCEVs it might seem like the new SCEV violates this property and ScalarEvolution will trigger an assertion. To avoid this we move the start part out of an AddRec when we rewrite it, thus avoid the operands to be possibly variant completely. llvm-svn: 251945	2015-11-03 16:47:58 +00:00
Johannes Doerfert	3181c2ef72	[FIX] Correctly update SAI base pointer If a base pointer load is preloaded, we have change the base pointer of the derived SAI. However, as the derived SAI relationship is is coarse grained, we need to check if we actually preloaded the base pointer or a different element of the base pointer SAI array. llvm-svn: 251881	2015-11-03 01:42:59 +00:00
Johannes Doerfert	dca2837b76	[FIX] Do not crash in the presence of infinite loops. llvm-svn: 251870	2015-11-03 00:28:07 +00:00
Johannes Doerfert	907456fe04	[FIX] Use appropriately sized types for big constants llvm-svn: 251869	2015-11-03 00:26:22 +00:00
Tobias Grosser	8286b83f97	ScopInfo: Bail out in case of mismatching array dimension sizes In some cases different memory accesses access the very same array using a different multi-dimensional array layout where the same dimensions have different sizes. Instead of asserting when encountering this issue, we gracefully bail out for this scop. This fixes llvm.org/PR25252 llvm-svn: 251791	2015-11-02 11:29:32 +00:00
Tobias Grosser	f423c1200f	Remove old and redundant options We remove -polly-detect-unprofitable and -polly-no-early-exit. Both have been superseeded by -polly-process-unprofitable and were only kept as aliases for our buildbots to continue to work. As all buildbots have been moved to the new options, we can now remove the old ones for good. llvm-svn: 251787	2015-11-02 10:13:32 +00:00
Tobias Grosser	3e9560200f	RegionGenerator: Clear local maps after statement construction These maps are only needed during the construction of a single region statement. Clearing them is important, as we otherwise get an assert in case some of the referenced values are erased before the RegionGenerator is deleted. llvm-svn: 251341	2015-10-26 20:41:53 +00:00
Tobias Grosser	4dc909cbed	tests: Add test cases for LLVM commit r251267 This fixes llvm.org/PR25242 llvm-svn: 251268	2015-10-25 22:56:42 +00:00
Tobias Grosser	bf45e74254	ScopDetect: Bail out for non-simple memory accesses Volatile or atomic memory accesses are currently not supported. Neither did we think about any special handling needed nor do we support the unknown instructions the alias set tracker turns them into sometimes. Before this patch, us not supporting unkown instructions in an alias set caused the following assertion failures: Assertion `AG.size() > 1 && "Alias groups should contain at least two accesses"' failed llvm-svn: 251234	2015-10-25 13:48:40 +00:00
Tobias Grosser	27e19a022e	Fix typo llvm-svn: 251231	2015-10-25 12:05:14 +00:00
Tobias Grosser	4d935cd9aa	tests: Add test case forgotten in 251191 llvm-svn: 251228	2015-10-25 10:55:40 +00:00
Tobias Grosser	907090c37c	ScopDetection: Update DetectionContextMap accordingly When verifying if a scop is still valid we rerun all analysis, but did not update DetectionContextMap. This change ensures that information, e.g. about non-affine regions, is correctly updated llvm-svn: 251227	2015-10-25 10:55:35 +00:00
Tobias Grosser	51174cca30	ScopDetection: Do not crash if we find zero array size candidates for delinearization llvm-svn: 251226	2015-10-25 08:40:44 +00:00
Tobias Grosser	5528dcdf25	ScopDetection: Always refuse multi-dimensional memory accesses with 'undef' in the size expression. We previously only checked if the size expression is 'undef', but allowed size expressions of the form 'undef * undef' by accident. After this change we now require size expressions to be affine which implies no 'undef' appears anywhere in the expression. llvm-svn: 251225	2015-10-25 08:40:38 +00:00
Tobias Grosser	baffa091dd	ScopInfo: PHI-node uses in the EntryNode with an incoming BB that is not part of the Region are external. During code generation we split off the parts of the PHI nodes in the entry block, which have incoming blocks that are not part of the region. As these split-off PHI nodes then are external uses, we consequently also need to model these uses in ScopInfo. llvm-svn: 251208	2015-10-24 20:55:27 +00:00
Tobias Grosser	ffd6b3bb29	Add a missing '-S' llvm-svn: 251199	2015-10-24 19:02:01 +00:00
Tobias Grosser	a3f6edaee1	BlockGenerator: Do not assert when finding model PHI nodes defined outside the scop Such PHI nodes can not only appear in the ExitBlock of the Scop, but indeed any scalar PHI node above the scop and used in the scop is modeled as scalar read access. llvm-svn: 251198	2015-10-24 19:01:09 +00:00
Tobias Grosser	27d742da59	BlockGenerator: Directly handle multi-exit PHI nodes This change adds code to directly code-generate multi-exit PHI nodes, instead of trying to reuse the EscapeMap infrastructure for this. Using escape maps adds a level of indirection that is hard to understand and - more importantly - breaks in certain cases. Specifically, the original code relied on simplifyRegion() to split the original PHI node in two PHI nodes, one merging the values coming from within the scop and a second that merges the first PHI node with the values that come from outside the scop. To generate code the first PHI node is then just handled like any other in-scop value that is used somewhere outside the scop. This fails for the case where all values from inside the scop are identical, as the first PHI node is in such cases automatically simplified and eliminated by LLVM right at construction. As a result, there is no instruction that can be pass to the EscapeMap handling, which means the references in the second PHI node are not updated and may still reference values from within the original scop that do not dominate it. Our new code iterates directly over all modeled ScopArrayInfo objects that represent multi-exit PHI nodes and generates code for them without relying on the EscapeMap infrastructure. Hence, it works also for the case where the first PHI node is eliminated. llvm-svn: 251191	2015-10-24 17:41:29 +00:00
Tobias Grosser	c73d8b0e18	ScopInfo: Drop unnecessary code This case has already been taken care of in r250622 and was then accidentally again committed in 250625. llvm-svn: 251156	2015-10-23 22:36:22 +00:00
Johannes Doerfert	654c3284f4	[FIX] Do not hoist nested variant base pointers This fixes bug 25249. llvm-svn: 250958	2015-10-21 22:14:57 +00:00
Tobias Grosser	ca7f5bb767	Full/partial tile separation for vectorization We isolate full tiles from partial tiles to be able to, for example, vectorize loops with parametric lower and/or upper bounds. If we use -polly-vectorizer=stripmine, we can see execution-time improvements: correlation from 1m7361s to 0m5720s (-67.05 %), covariance from 1m5561s to 0m5680s (-63.50 %), ary3 from 2m3201s to 1m2361s (-46.72 %), CrystalMk from 8m5565s to 7m4285s (-13.18 %). The current full/partial tile separation increases compile-time more than necessary. As a result, we see in compile time regressions, for example, for 3mm from 0m6320s to 0m9881s (56.34%). Some of this compile time increase is expected as we generate more IR and consequently more time is spent in the LLVM backends. However, a first investiagation has shown that a larger portion of compile time is unnecessarily spent inside Polly's parallelism detection and could be eliminated by propagating existing knowledge about vector loop parallelism. Before enabling -polly-vectorizer=stripmine by default, it is necessary to address this compile-time issue. Contributed-by: Roman Gareev <gareevroman@gmail.com> Reviewers: jdoerfert, grosser Subscribers: grosser, #polly Differential Revision: http://reviews.llvm.org/D13779 llvm-svn: 250809	2015-10-20 09:12:21 +00:00
Michael Kruse	48ea8efd59	Correct typo in CHECK line Thanks Tobias for the hint. llvm-svn: 250695	2015-10-19 10:51:20 +00:00
Michael Kruse	dc12222287	Synthesize phi arguments in incoming block New values were always synthesized in the block of the instruction that needed them. This is incorrect for PHI node whose' value must be defined in the respective incoming block. This patch temporarily moves the builder's insert point to the incoming block while synthesizing phi node arguments. This fixes PR25241 (http://llvm.org/bugs/show_bug.cgi?id=25241) llvm-svn: 250693	2015-10-19 09:19:25 +00:00
Johannes Doerfert	9c28bfa72c	[FIX] Only constant integer branch conditions are always affine There are several different kinds of constants that could occur in a branch condition, however we can only handle the most interesting one namely constant integers. To this end we have to treat others as non-affine. This fixes bug 25244. llvm-svn: 250669	2015-10-18 22:56:42 +00:00
Johannes Doerfert	30c2265f98	[FIX] Normalize loops outside the SCoP during schedule generation We build the schedule based on a traversal of the region and accumulate information for each loop in it. The total schedule is associated with the loop surrounding the SCoP, though it can happen that there are blocks in the SCoP which are part of loops that are only partially in the SCoP. Instead of associating information with them (they are not part of the SCoP and consequently are not modeled) we have to associate the schedule information with the surrounding loop if any. This fixes bug 25240. llvm-svn: 250668	2015-10-18 21:17:11 +00:00
Johannes Doerfert	b864c2c3c9	[FIX] Do not try to hoist "empty" accesses Accesses that have a relative offset (in bytes) that is not divisible by the type size (in bytes) will be represented as empty in the SCoP description. This is on its own not good but it also crashed the invariant load hoisting. This patch will fix the latter problem while the former should be addressed too. This fixes bug 25236. llvm-svn: 250664	2015-10-18 19:50:18 +00:00
Johannes Doerfert	bc7cff4c18	[FIX] Do not hoist invariant pointers with non-loaded base ptr in SCoP If the base pointer of a load is invariant and defined in the SCoP but not loaded we cannot hoist the load as we would not hoist the base pointer definition. This fixes bug 25237. llvm-svn: 250663	2015-10-18 19:49:25 +00:00
Johannes Doerfert	af3e301a67	[FIX] Restructure invariant load equivalence classes Sorting is replaced by a demand driven code generation that will pre-load a value when it is needed or, if it was not needed before, at some point determined by the order of invariant accesses in the program. Only in very little cases this demand driven pre-loading will kick in, though it will prevent us from generating faulty code. An example where it is needed is shown in: test/ScopInfo/invariant_loads_complicated_dependences.ll Invariant loads that appear in parameters but are not on the top-level (e.g., the parameter is not a SCEVUnknown) will now be treated correctly. Differential Revision: http://reviews.llvm.org/D13831 llvm-svn: 250655	2015-10-18 12:39:19 +00:00
Johannes Doerfert	d8b6ad255f	[FIX] Cast preloaded values Preloaded values have to match the type of their counterpart in the original code and not the type of the base array. llvm-svn: 250654	2015-10-18 12:36:42 +00:00
Johannes Doerfert	01978cfa0c	Remove independent blocks pass Polly can now be used as a analysis only tool as long as the code generation is disabled. However, we do not have an alternative to the independent blocks pass in place yet, though in the relevant cases this does not seem to impact the performance much. Nevertheless, a virtual alternative that allows the same transformations without changing the input region will follow shortly. llvm-svn: 250652	2015-10-18 12:28:00 +00:00
Tobias Grosser	b8d27aab7d	Revert to original BlockGenerator::getOrCreateAlloca(MemoryAccess &Access) Expressing this in terms of BlockGenerator::getOrCreateAlloca(const ScopArrayInfo *Array) does not work as the MemoryAccess BasePtr is in case of invariant load hoisting different to the ScopArrayInfo BasePtr. Until this is investigated and fixed, we move back to code that just uses the baseptr of MemoryAccess. llvm-svn: 250637	2015-10-18 00:51:13 +00:00
Tobias Grosser	a4f0988df5	BlockGenerator: Add getOrCreateAlloca(const ScopArrayInfo *Array) This allows the caller to get the alloca locations of an array without the need to thank if Array is a PHI or a non-PHI Array. We directly make use of this in BlockGenerator::getOrCreateAlloca(MemoryAccess &Access). llvm-svn: 250628	2015-10-17 22:16:00 +00:00
Michael Kruse	8c1e5ec86a	Return nullptr if MemoryAccess list is empty Other places (e.g. hoistInvariantLoads) assume that an empty lookup will return nullptr. The situation can currently not arise because MemoryAccesses are not removed before hoistInvariantLoads. llvm-svn: 250627	2015-10-17 22:11:07 +00:00
Tobias Grosser	05d7fa79b6	Format comment properly While clang-format takes care that the line-length is not surpassed, the resulting comments sometimes look not optimal. We re-flow the text in the comment to avoid these ugly single-word lines. llvm-svn: 250626	2015-10-17 21:46:28 +00:00
Michael Kruse	225f0d1ee2	Load/Store scalar accesses before/after the statement itself Instead of generating implicit loads within basic blocks, put them before the instructions of the statment itself, including non-affine subregions. The region's entry node is dominating all blocks in the region and therefore the loaded value will be available there. Implicit writes in block-stmts were already stored back at the end of the block. Now, also generate the stores of non-affine subregions when leaving the statement, i.e. in the exiting block. This change is required for array-mapped implicits ("De-LICM") to ensure that there are no dependencies of demoted scalars within statments. Statement load all required values, operator on copied in registers, and then write back the changed value to the demoted memory. Lifetimes analysis within statements becomes unecessary. Differential Revision: http://reviews.llvm.org/D13487 llvm-svn: 250625	2015-10-17 21:36:00 +00:00
Michael Kruse	01cb379fed	Avoid unnecessay .s2a write access when used only in PHIs Accesses for exit node phis will be handled separately by buildPHIAccesses if there is more than one exiting edge, buildScalarDependences does not need to create additional SCALAR accesses. This is a corrected version of r250517, which was reverted in r250607. Differential Revision: http://reviews.llvm.org/D13848 llvm-svn: 250622	2015-10-17 21:07:08 +00:00
Tobias Grosser	ffa2446f28	BlockGenerator: Register outside users of scalars directly Instead of checking at code generation time for each ScopStmt if a scalar has external uses, we just iterate over the ScopArrayInfo descriptions we have and check each of these for possible external uses. Besides being somehow clearer, this approach has the benefit that we will always create valid LLVM-IR even in case we disable the code generation of ScopStmt bodies e.g. for testing purposes. llvm-svn: 250608	2015-10-17 08:54:13 +00:00
Tobias Grosser	3839b422e6	Revert "Avoid unnecessay .s2a write access when used only in PHIs" This reverts commit r250606 due to some bugs it introduced. After these bugs have been resolved, we will add it back to tree. llvm-svn: 250607	2015-10-17 08:54:05 +00:00
Tobias Grosser	26e59ee746	Drop unused parameter from handleOutsideUsers llvm-svn: 250606	2015-10-17 08:25:54 +00:00
Michael Kruse	e71893d580	Add testcase for r250517 llvm-svn: 250518	2015-10-16 15:17:26 +00:00
Michael Kruse	aeceab770e	Avoid unnecessay .s2a write access when used only in PHIs PHI accesses will be handled separately by buildPHIAccesses, buildScalarDependences does not need to create additional accesses. llvm-svn: 250517	2015-10-16 15:14:40 +00:00
Tobias Grosser	b860289dbd	Add ScopInfo test case for r250411 llvm-svn: 250439	2015-10-15 18:26:06 +00:00
Tobias Grosser	473a5c3253	test: Correctly check for branch statements In r250408 'CHECK-NEXT: br' lines were removed as they also matched a '%polly.subregion.iv.inc' instruction and did consequently not check what they were supposed to check. However, without these lines we can not test that the .s2a instructions that are not any more generated since r250411 really are not emitted. Hence, we add back the CHECK-NEXT lines to ensure there are really no instructions generated between the store that we check for and the branch at the end of the basic block. To ensure we do not match too early, we now check for 'br i1' or 'br label'. llvm-svn: 250435	2015-10-15 18:04:20 +00:00
Michael Kruse	668af71b82	Do not add accesses for intra-ScopStmt scalar def-use chains When pulling a llvm::Value to be written as a PHI write, the former code did only check whether it is within the same basic block, but it could also be the same non-affine subregion. In that case some unecessary pair of MemoryAccesses would have been created. Two unit test were explicitely checking for the unecessary writes, including the comments that the writes are unecessary. llvm-svn: 250411	2015-10-15 14:45:48 +00:00
Michael Kruse	90428328ee	Remove "CHECK: br" from some unit tests They happen to match %polly.subregion.iv.inc = add i32 %polly.subregion.iv, 1 ^^ ^^ that is, are misleading in what they actually check. llvm-svn: 250408	2015-10-15 14:40:40 +00:00
Tobias Grosser	5da48f392c	Add -sort-includes to our automatic source code formatting llvm-svn: 250393	2015-10-15 12:18:37 +00:00
Tobias Grosser	0e3a6b13a4	Sort includes using 'clang-format -sort-includes' llvm-svn: 250392	2015-10-15 12:17:36 +00:00
Michael Kruse	b987be12bf	Add testcase for SCEV explansion in non-affine subregions When sharing the same map from old to new value, CodeGeneration would reuse the same new value for each basic block. However, the SCEV expander might emit code in a basic block that does not dominate a use of the SCEV in another basic block. This test checks whether both such blocks have their own expanded new values. llvm-svn: 250389	2015-10-15 10:40:14 +00:00
Tobias Grosser	6b948d5efb	[tests] More testing for PHI-nodes in non-affine regions We harden one test case by ensuring no additional stores may possibly be introduced between the stores we check for and the basic block terminator statements. We also add a test case for the situation where a value that is passed from a non-affine region to a PHI node does not dominate the exit of the non-affine region. This case has come up in patch reviews, so we make sure it is properly handled today and in the future. llvm-svn: 250217	2015-10-13 20:03:09 +00:00
Tobias Grosser	f30be2f370	RegisterPasses: Optionally run inliner before Polly This will allow us to optimize C++ template code with Polly. This support is mostly for debugging purpose and individual experiments. The ultimate goal is still to run Polly later in the pass manager when inlining already happened. llvm-svn: 250092	2015-10-12 20:03:44 +00:00
Tobias Grosser	d17183f20f	Use EP_ModuleOptimizerEarly to run early polly passes, instead of llvm::PassManagerBuilder::EP_EarlyAsPossible. This will allow us to run actual module passes in Polly's canonicalization sequence, but should otherwise have only little impact. llvm-svn: 250091	2015-10-12 20:03:41 +00:00
Tobias Grosser	e2c8275346	ScopInfo: Allow simple 'AddRec * Parameter' products in delinearization We also allow such products for cases where 'Parameter' is loaded within the scop, but where we can dynamically verify that the value of 'Parameter' remains unchanged during the execution of the scop. This change relies on Polly's new RequiredILS tracking infrastructure recently contributed by Johannes. llvm-svn: 250019	2015-10-12 08:02:30 +00:00
Tobias Grosser	3c037fd396	Delete leftover function declaration The function's definition was already removed in r247289. llvm-svn: 250015	2015-10-12 05:14:03 +00:00
Tobias Grosser	1c55e2b7f3	Also add -polly-no-early-exit back until LNT is restarted llvm-svn: 249975	2015-10-11 13:49:27 +00:00
Tobias Grosser	3ecb71696b	Add back -polly-detect-unprofitable as alias of -polly-process-unprofitable This flag was still used in our LNT server. We leave it until it has been removed from LNT as well. llvm-svn: 249973	2015-10-11 13:39:17 +00:00
Johannes Doerfert	9b1f9c8b61	Allow eager evaluated binary && and \|\| conditions The domain generation can handle lazy && and \|\| by default but eager evaluated expressions were dismissed as non-affine. With this patch we will allow arbitrary combinations of and/or bit-operations in the conditions of branches. Differential Revision: http://reviews.llvm.org/D13624 llvm-svn: 249971	2015-10-11 13:21:03 +00:00
Johannes Doerfert	f363ed9804	[NFC] Move helper functions to ScopHelper Helper functions in the BlockGenerators.h/cpp introduce dependences from the frontend to the backend of Polly. As they are used in ScopDetection, ScopInfo, etc. we move them to the ScopHelper file. llvm-svn: 249919	2015-10-09 23:40:24 +00:00
David Blaikie	91e113d1dd	Remove some unused variables in -Asserts builds llvm-svn: 249866	2015-10-09 18:22:18 +00:00
Johannes Doerfert	697fdf891c	Consolidate invariant loads If a (assumed) invariant location is loaded multiple times we generated a parameter for each location. However, this caused compile time problems for several benchmarks (e.g., 445_gobmk in SPEC2006 and BT in the NAS benchmarks). Additionally, the code we generate is suboptimal as we preload the same location multiple times and perform the same checks on all the parameters that refere to the same value. With this patch we consolidate the invariant loads in three steps: 1) During SCoP initialization required invariant loads are put in equivalence classes based on their pointer operand. One representing load is used to generate a parameter for the whole class, thus we never generate multiple parameters for the same location. 2) During the SCoP simplification we remove invariant memory accesses that are in the same equivalence class. While doing so we build the union of all execution domains as it is only important that the location is at least accessed once. 3) During code generation we only preload one element of each equivalence class with the unified execution domain. All others are mapped to that preloaded value. Differential Revision: http://reviews.llvm.org/D13338 llvm-svn: 249853	2015-10-09 17:12:26 +00:00
Johannes Doerfert	f7e2967293	[FIX] Add missing projection for invariant load domains This was left out from the original patch proposed in http://reviews.llvm.org/D13195 even though it is needed to define an order invariant loads are hoisted. llvm-svn: 249680	2015-10-08 11:05:57 +00:00
Johannes Doerfert	c7ab83dfb7	Remove unused flag polly-allow-non-scev-backedge-taken-count Drop an unused flag polly-allow-non-scev-backedge-taken-count and also its occurrences from the tests. Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D13400 llvm-svn: 249675	2015-10-08 10:05:48 +00:00
Johannes Doerfert	c06b7d67cd	Expose the detection context to ScopDetection users ScopDetection users are interested in the detection context and access these via different get-methods. However, not all information was exposed though the number of maps to hold it was increasing steadily. With this change only the detection contexts the rejection log and the ValidRegions set are mapped. The former is needed, the second could be integrated in the first and the ValidRegions set is only needed for the deterministic order of the regions. llvm-svn: 249614	2015-10-07 20:46:06 +00:00
Johannes Doerfert	08d90a3cee	Treat conditionally executed non-pure calls as errors This replaces the support for user defined error functions by a heuristic that tries to determine if a call to a non-pure function should be considered "an error". If so the block is assumed not to be executed at runtime. While treating all non-pure function calls as errors will allow a lot more regions to be analyzed, it will also cause us to dismiss a lot again due to an infeasible runtime context. This patch tries to limit that effect. A non-pure function call is considered an error if it is executed only in conditionally with regards to a cheap but simple heuristic. llvm-svn: 249611	2015-10-07 20:32:43 +00:00
Johannes Doerfert	d8dd8630b2	[NFC] Make LoopInfo a member and simplify arguments llvm-svn: 249609	2015-10-07 20:31:36 +00:00
Johannes Doerfert	09e3697f44	Allow invariant loads in the SCoP description This patch allows invariant loads to be used in the SCoP description, e.g., as loop bounds, conditions or in memory access functions. First we collect "required invariant loads" during SCoP detection that would otherwise make an expression we care about non-affine. To this end a new level of abstraction was introduced before SCEVValidator::isAffineExpr() namely ScopDetection::isAffine() and ScopDetection::onlyValidRequiredInvariantLoads(). Here we can decide if we want a load inside the region to be optimistically assumed invariant or not. If we do, it will be marked as required and in the SCoP generation we bail if it is actually not invariant. If we don't it will be a non-affine expression as before. At the moment we optimistically assume all "hoistable" (namely non-loop-carried) loads to be invariant. This causes us to expand some SCoPs and dismiss them later but it also allows us to detect a lot we would dismiss directly if we would ask e.g., AliasAnalysis::canBasicBlockModify(). We also allow potential aliases between optimistically assumed invariant loads and other pointers as our runtime alias checks are sound in case the loads are actually invariant. Together with the invariant checks this combination allows to handle a lot more than LICM can. The code generation of the invariant loads had to be extended as we can now have dependences between parameters and invariant (hoisted) loads as well as the other way around, e.g., test/Isl/CodeGen/invariant_load_parameters_cyclic_dependence.ll First, it is important to note that we cannot have real cycles but only dependences from a hoisted load to a parameter and from another parameter to that hoisted load (and so on). To handle such cases we materialize llvm::Values for parameters that are referred by a hoisted load on demand and then materialize the remaining parameters. Second, there are new kinds of dependences between hoisted loads caused by the constraints on their execution. If a hoisted load is conditionally executed it might depend on the value of another hoisted load. To deal with such situations we sort them already in the ScopInfo such that they can be generated in the order they are listed in the Scop::InvariantAccesses list (see compareInvariantAccesses). The dependences between hoisted loads caused by indirect accesses are handled the same way as before. llvm-svn: 249607	2015-10-07 20:17:36 +00:00
Johannes Doerfert	521dd5842f	Move the ValueMapT declaration out of BlockGenerator Value maps are created and used in many places and it is not always possible to include CodeGen/Blockgenerators.h. To this end, ValueMapT now lives in the ScopHelper.h which does not have any dependences itself. This patch also replaces uses of different other value map types with the ValueMapT. llvm-svn: 249606	2015-10-07 20:15:56 +00:00
Tobias Grosser	6bab9f800b	IRBuilder: Use Map.lookup instead of Map.find [NFC] This simplifies the code. Suggested-by: Johannes Doerfert llvm-svn: 249545	2015-10-07 13:35:20 +00:00
Tobias Grosser	d79cb0378b	IRBuilder: Ensure we do not add empty map elements Do not use "Map[Key] == nullptr" to check if a Key is in the map, but use "Map.find(Key) == Map.end()". Map[Key] always adds Key into the map, a side-effect we do not want. Found by inspection. This is hard to test outside of a targetted unit test, which seems too much overhead for this individual issue. llvm-svn: 249544	2015-10-07 13:19:06 +00:00
Tobias Grosser	73725b8ba5	IRBuilder: Simplify code and reduce indention [NFC] llvm-svn: 249543	2015-10-07 13:19:03 +00:00
Tobias Grosser	369c1d663b	test: Add example of scalar that is reused accross loop llvm-svn: 249527	2015-10-07 09:00:29 +00:00
David Blaikie	ba2e4857b1	Fix configure checks when applied by the latest clang Clang's been taught to warn on more things here (unused values when calling pure functions and ignoring their result, for example). It might be better to figure out how to have cmake compile these tests without -Werror/without any warnings enabled. But this'll do for now & I don't know enough about cmake to fix it any other way, or to understand the tradeoffs in that space. llvm-svn: 249472	2015-10-06 21:45:06 +00:00
Tobias Grosser	3593739cb2	IslAst: Give some hints why code generation might have been skipped llvm-svn: 249427	2015-10-06 16:10:46 +00:00
Tobias Grosser	575aca8d43	Introduce -polly-process-unprofitable This single option replaces -polly-detect-unprofitable and -polly-no-early-exit and is supposed to be the only option that disables compile-time heuristics that aim to bail out early on scops that are believed to not benefit from Polly optimizations. Suggested-by: Johannes Doerfert llvm-svn: 249426	2015-10-06 16:10:29 +00:00
Tobias Grosser	f4ee371e60	tests: Drop -polly-detect-unprofitable and -polly-no-early-exit These flags are now always passed to all tests and need to be disabled if not needed. Disabling these flags, rather than passing them to almost all tests, significantly simplfies our RUN: lines. llvm-svn: 249422	2015-10-06 15:36:44 +00:00
Tobias Grosser	4fdcf7b813	test: By default disable Polly's compile-time profitability heuristics llvm-svn: 249420	2015-10-06 15:30:26 +00:00
Tobias Grosser	935f62cf0d	tests: Explicitly state if profitability tests should be used Polly's profitability heuristic saves compile time by skipping trivial scops or scops were we know no good optimization can be applied. For almost all our tests this heuristic makes little sense as we aim for minimal test cases when testing functionality. Hence, in almost all cases this heuristic is better be disabled. In preparation of disabling Polly's compile time heuristic by default in the test suite we first explicitly enable it in the couple of test cases that really use it (or run with/without heuristic side-by-side). llvm-svn: 249418	2015-10-06 15:19:35 +00:00
Tobias Grosser	1ac26d06fe	test: Disable profitability heuristics to unfail LICM test case This test case was XFAILed under the assumption Polly is unable to detect the scop. However, disabling Polly's profitability heuristics is sufficient to detect this scop. llvm-svn: 249414	2015-10-06 15:10:19 +00:00
Tobias Grosser	d76603fbe7	test: sdiv in loop bounds is supported since a while By disabling our scop-profitability heuristics this becomes also visible in some older test cases. llvm-svn: 249411	2015-10-06 14:59:31 +00:00
Tobias Grosser	b73c695aba	tests: Drop outdated and unused lit variable llvm-svn: 249401	2015-10-06 13:50:20 +00:00
Johannes Doerfert	f17a78ef63	Remove non-executed statements during SCoP simplifcation A statement with an empty domain complicates the invariant load hoisting and does not help any subsequent analysis or transformation. In fact it might introduce parameter dimensions or increase the schedule dimensionality. To this end, we remove statements with an empty domain early in the SCoP simplification. llvm-svn: 249276	2015-10-04 15:00:05 +00:00
Johannes Doerfert	634909c2c9	[FIX] Domain generation for non-affine loops llvm-svn: 249275	2015-10-04 14:57:41 +00:00
Johannes Doerfert	f61df69423	[FIX] Count affine loops correctly The "unprofitable" heuristic was broken and counted boxed loops even though we do not represent and optimize them. llvm-svn: 249274	2015-10-04 14:56:08 +00:00
Johannes Doerfert	757a32b5b3	[FIX] Approximate non-affine loops correctly Before isValidCFG() could hide the fact that a loop is non-affine by over-approximation. This is problematic if a subregion of the loop contains an exit/latch block and is over-approximated. Now we do not over-approximate in the isValidCFG function if we check loop control. If such control is non-affine the whole loop is over-approximated, not only a subregion. llvm-svn: 249273	2015-10-04 14:54:27 +00:00
Johannes Doerfert	30ffb6fcb6	[FIX] Check loop latches for valid control too. This patch cannot be tested on its own as the isValidCFG currently hides the fact that control is actually non-affine with over-approximation. This will be corrected in the next patch and a test for non-affine latches will be added. llvm-svn: 249272	2015-10-04 14:53:18 +00:00
Johannes Doerfert	8dba07770f	[NFC] Remove unused classes llvm-svn: 249271	2015-10-04 14:52:43 +00:00
Tobias Grosser	d78616f98a	Make ScopAnnotator a function-local variable to ensure it is freed at each run When the ScopAnnotator was a class member variable some of the maps it contains have not been properly cleared. As a result we had dangling pointers to llvm::Value(s) which got detected by the AssertingVH we recently added. No test case as this issue is hard to reproduce reliably as subsequent optimizations need to delete some of the llvm::Values we still keep in our lists. llvm-svn: 249269	2015-10-04 11:19:13 +00:00
Tobias Grosser	ecdfae6b15	IRBuilder: Use AssertingVH llvm-svn: 249268	2015-10-04 10:18:56 +00:00
Tobias Grosser	5762b31df4	Use AssertingVH for ValueToValue Maps By using AssertingVH we will see assertions in case Values to which still pointers in our maps exists are deleted. This is very useful as we previously had some bugs that were caused by such stale Value pointers. llvm-svn: 249267	2015-10-04 10:18:49 +00:00
Tobias Grosser	2f1acac610	BlockGenerator: Use plain Value * instead of const Value * The use of const qualified Value pointers prevents the use of AssertingVH. We could probably think of adding const support to AssertingVH, but as const correctness seems to currently provide limited benefit in Polly, we do not do this yet. llvm-svn: 249266	2015-10-04 10:18:45 +00:00
Tobias Grosser	9646e3fe4b	BlockGenerators: Use auto to be less sensitive to type changes llvm-svn: 249265	2015-10-04 10:18:39 +00:00
Tobias Grosser	f4bb7a6a4d	Consolidate the different ValueMapTypes we are using There have been various places where llvm::DenseMap<const llvm::Value , llvm::Value > types have been defined, but all types have been expected to be identical. We make this more clear by consolidating the different types and use BlockGenerator::ValueMapT wherever there is a need for types to match BlockGenerator::ValueMapT. llvm-svn: 249264	2015-10-04 10:18:32 +00:00
Tobias Grosser	1d45c6dadd	IslExprBuilder: Use AssertingVH for IdToValueTy llvm-svn: 249239	2015-10-03 17:20:00 +00:00
Tobias Grosser	b28ee0fbb0	ScopInfo: Use AssertingVH in maps By using asserting value handles, we will get assertions when we forget to clear any of the Value maps instead of difficult to debug undefined behavior. llvm-svn: 249238	2015-10-03 17:19:53 +00:00
Tobias Grosser	e9cb5a0983	BlockGenerator: Use AssertingVH in maps By using asserting value handles, we will get assertions when we forget to clear any of the Value maps instead of difficult to debug undefined behavior. llvm-svn: 249237	2015-10-03 17:19:49 +00:00
Michael Kruse	afe0670863	Bail-out early if all statements have been simplified away Treat the scop as invalid instead of creating dummy domains. llvm-svn: 249151	2015-10-02 16:33:27 +00:00
Johannes Doerfert	3e7d171866	[FIX] Repair broken commit The last invariant load fix was based on a later patch not polly/master, thus needs to be adjusted. llvm-svn: 249145	2015-10-02 15:35:03 +00:00
Johannes Doerfert	8930f4846c	[FIX] Do not hoist from inside a non-affine subregion We have to skip accesses in non-affine subregions during hoisting as they might not be executed under the same condition as the entry of the non-affine subregion. llvm-svn: 249139	2015-10-02 14:51:00 +00:00
Michael Kruse	cac948ef46	Earlier creation of ScopStmt objects This moves the construction of ScopStmt to the beginning of the ScopInfo pass. The late creation was a result of the earlier separation of ScopInfo and TempScopInfo. This will avoid introducing more ScopStmt-like maps in future commits. The AccFuncMap will also be removed in some future commit. DomainMap might also be included into ScopStmt. The order in which ScopStmt are created changes and initially creates empty statements that are removed in a simplification. Differential Revision: http://reviews.llvm.org/D13341 llvm-svn: 249132	2015-10-02 13:53:07 +00:00
Johannes Doerfert	911951f4f8	Hand down referenced & globally mapped values to the subfunction If a value is globally mapped (IslNodeBuilder::ValueMap) and referenced in the code that will be put into a subfunction, we hand down the new value to the subfunction. This patch also removes code that handed down all invariant loads to the subfunction. Instead, only needed invariant loads are given to the subfunction. There are two possible reasons for an invariant load to be handed down: 1) The invariant load is used in a block that is placed in the subfunction but which is not the parent of the load. In this case, the scalar access that will read the loaded value, will cause its base pointer (the preloaded value) to be handed down to the subfunction. 2) The invariant load is defined and used in a block that is placed in the subfunction. With this patch we will hand down the preloaded value to the subfunction as the invariant load is globally mapped to that value. llvm-svn: 249126	2015-10-02 13:11:27 +00:00
Johannes Doerfert	478a7de18b	[NFC] Make the ScopDetection analysis a member of the Scop class llvm-svn: 249125	2015-10-02 13:09:31 +00:00
Johannes Doerfert	f56738041e	Make the SCoP generation resistent wrt. error blocks When error blocks are not terminated by an unreachable they have successors that might only be reachable via error blocks. Additionally, branches in error blocks are not checked during SCoP detection, thus we might not be able to handle them. With this patch we do not try to model error block exit conditions. Anything that is only reachable via error blocks is ignored too, as it will not be executed in the optimized version of the SCoP anyway. llvm-svn: 249099	2015-10-01 23:48:18 +00:00
Johannes Doerfert	f80f3b0449	Allow user defined error functions The user can provide function names with -polly-error-functions=name1,name2,name3 that will be treated as error functions. Any call to them is assumed not to be executed. This feature is mainly for developers to play around with the new "error block" feature. llvm-svn: 249098	2015-10-01 23:45:51 +00:00
Johannes Doerfert	850d346302	[FIX] Parallel codegen for invariant loads Hand down all preloaded values to the parallel subfunction. llvm-svn: 249010	2015-10-01 13:40:36 +00:00
Johannes Doerfert	e46925f324	[FIX] Erase stall results during the SCoP detection With this patch we erase cached results for regions that are invalid as early as possible. If we do not (as before), it is possible that two expanded regions will have the same address and the tracked results for both are mixed. Currently this would "only" cause pessimism in later passes but that will change when we allow invariant loads in the SCoP. Additionally, it triggers non-deterministic results as we might dismiss a later region because of results cached for an earlier one. There is no test case as the problem occurs only non-deterministically. llvm-svn: 249000	2015-10-01 10:59:14 +00:00
Johannes Doerfert	47128f3af7	[FIX] Reintroduce an include needed to locally compile Polly llvm-svn: 248947	2015-09-30 21:19:44 +00:00
Johannes Doerfert	59984322c3	[FIX] Handle identity mappings in the ScopExpander If the VMap in the ScopExpander contains identity mappings we now ignore the mapping. Reported-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 248946	2015-09-30 21:12:12 +00:00
Tobias Grosser	fd9d07eac1	Drop unneeded includes llvm-svn: 248937	2015-09-30 20:20:42 +00:00
Johannes Doerfert	bb9387444c	[FIX] Remove unknown variable llvm-svn: 248926	2015-09-30 17:31:31 +00:00
Johannes Doerfert	c0729a3216	Move remapping functionality in the ScopExpander Because we handle more than SCEV does it is not possible to rewrite an expression on the top-level using the SCEVParameterRewriter only. With this patch we will do the rewriting on demand only and also recursively, thus not only on the top-level. llvm-svn: 248916	2015-09-30 16:52:03 +00:00
Johannes Doerfert	6206d7a67c	[FIX] Clear all maps between runs llvm-svn: 248915	2015-09-30 16:51:05 +00:00
Tobias Grosser	aff56c8a78	Reapply "BlockGenerator: Generate synthesisable instructions only on-demand" Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instructions and works more reliably than first inserting them and then deleting them later on. This commit was reverted in r248860 due to a remaining miscompile, where we forgot to synthesis the operand values that were referenced from scalar writes. test/Isl/CodeGen/scalar-store-from-same-bb.ll tests that we do this now correctly. llvm-svn: 248900	2015-09-30 13:36:54 +00:00
Tobias Grosser	33cb9f9419	BlockGenerator: Extract value synthesis into its own function [NFC] This will allow us to reuse this code in a subsequent commit. llvm-svn: 248893	2015-09-30 11:56:19 +00:00
Johannes Doerfert	ebfd72493c	[NFC] Extract materialization of parameters llvm-svn: 248882	2015-09-30 09:52:08 +00:00
Johannes Doerfert	ef19ead20e	[FIX] Use escape logic for invariant loads Before we unconditinoally forced all users outside the SCoP to use the preloaded value. However, if the SCoP is not executed due to the runtime checks, we need to use the original value because it might not be invariant in the first place. llvm-svn: 248881	2015-09-30 09:43:20 +00:00
Michael Kruse	76e924d31b	Assign scop directly This makes ScopInfo's scop member available earlier to other methods which will make some planned changes simpler. No behavioral change intended llvm-svn: 248879	2015-09-30 09:16:07 +00:00
Johannes Doerfert	c1db67e218	Identify and hoist definitively invariant loads As a first step in the direction of assumed invariant loads (loads that are not written in some context) we now detect and hoist definitively invariant loads. These invariant loads will be preloaded in the code generation and used in the optimized version of the SCoP. If the load is only conditionally executed the preloaded version will also only be executed under the same condition, hence we will never access memory that wouldn't have been accessed otherwise. This is also the most distinguishing feature to licm. As hoisting can make statements empty we will simplify the SCoP and remove empty statements that would otherwise cause artifacts in the code generation. Differential Revision: http://reviews.llvm.org/D13194 llvm-svn: 248861	2015-09-29 23:47:21 +00:00
Johannes Doerfert	f6343d74ef	Revert "BlockGenerator: Generate synthesisable instructions only on-demand" This reverts commit 07830c18d789ee72812d5b5b9b4f8ce72ebd4207. The commit broke at least one test in lnt, MultiSource/Benchmarks/Ptrdist/bc/number.c was miss compiled and the test produced a wrong result. One Polly test case that was added later was adjusted too. llvm-svn: 248860	2015-09-29 23:43:40 +00:00
Tobias Grosser	81f005617e	Replace default destructors by {} destructors Hope this fixes the buildbots for now. llvm-svn: 248823	2015-09-29 19:52:09 +00:00
Tobias Grosser	7dce3a60f2	Add default region generator This hopefully helps to to address the following compile error on our buildbots BlockGenerators.h:683:7: error: looser throw specifier for ‘virtual polly::RegionGenerator::~RegionGenerator()’ BlockGenerators.h:164:11: error: overriding ‘virtual polly::BlockGenerator::~BlockGenerator() noexcept (true)` llvm-svn: 248812	2015-09-29 18:07:17 +00:00
Tobias Grosser	98b3ee50ff	Codegen: Support memory accesses with different types Every once in a while we see code that accesses memory with different types, e.g. to perform operations on a piece of memory using type 'float', but to copy data to this memory using type 'int'. Modeled in C, such codes look like: void foo(float A[], float B[]) { for (long i = 0; i < 100; i++) (int )(&A[i]) = (int )(&B[i]); for (long i = 0; i < 100; i++) A[i] += 10; } We already used the correct types during normal operations, but fall back to our detected type as soon as we import changed memory access functions. For these memory accesses we may generate invalid IR due to a mismatch between the element type of the array we detect and the actual type used in the memory access. To address this issue, we always cast the newly created address of a memory access back to the type of the memory access where the address will be used. llvm-svn: 248781	2015-09-29 06:44:38 +00:00
David Blaikie	6163f67ad0	Remove unnecessary default dtor. The base dtor is already virtual and the derived dtor adds nothing. llvm-svn: 248765	2015-09-29 00:12:50 +00:00
David Blaikie	8e9ea2a439	Make Polly -Wdeprecated clean by explicitly making BlockGenerator copy constructible This is a bit of an awkward API and I'm not sure what the right solution is. Having a publicly copy constructible base class makes it easy to accidentally slice derived objects in a number of contexts. llvm-svn: 248764	2015-09-29 00:00:29 +00:00
Tobias Grosser	95e59aaa54	OpenMP: Name addresses in subfunction structure While debugging, this makes it easier to understand due to which memory reference these stores have been introduced. llvm-svn: 248717	2015-09-28 16:46:38 +00:00
Tobias Grosser	28b9a14b07	BlockGenerator: Generate synthesisable instructions only on-demand Instructions which we can synthesis from a SCEV expression are not generated directly, but only when they are used as an operand of another instruction. This avoids generating unnecessary instruction and works more reliably than first inserting them and then deleting them later on. Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> Differential Revision: http://reviews.llvm.org/D13208 llvm-svn: 248712	2015-09-28 13:47:50 +00:00
Michael Kruse	5c0f97d537	Improve comments related to MemoryAccess::MemoryOrigin; NFC llvm-svn: 248705	2015-09-28 10:06:50 +00:00
Johannes Doerfert	58a7c75c86	[NFC] Add accidentally removed comment line llvm-svn: 248704	2015-09-28 09:48:53 +00:00
Johannes Doerfert	9a132f36c3	Allow switch instructions in SCoPs This patch allows switch instructions with affine conditions in the SCoP. Also switch instructions in non-affine subregions are allowed. Both did not require much changes to the code, though there was some refactoring needed to integrate them without code duplication. In the llvm-test suite the number of profitable SCoPs increased from 135 to 139 but more importantly we can handle more benchmarks and user inputs without preprocessing. Differential Revision: http://reviews.llvm.org/D13200 llvm-svn: 248701	2015-09-28 09:33:22 +00:00
Tobias Grosser	f223cdf17e	[tests] Add memory writes to make this scop not trivially empty llvm-svn: 248697	2015-09-28 07:37:06 +00:00

1 2 3 4 5 ...

2047 Commits