llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	652f780894	CodeGeneration: Add back verification of generated code This got accidentally dropped in r260025 llvm-svn: 260857	2016-02-14 20:56:49 +00:00
Tobias Grosser	5e27ec33ea	Revert "[ScopDectect] Allow memory accesses with different element types by default" This reverts commit https://llvm.org/svn/llvm-project/polly/trunk@260853 We unfortunately still have two bugs left which show only up with -polly-process-unprofitable and which I forgot to test before committing. llvm-svn: 260854	2016-02-14 19:59:29 +00:00
Tobias Grosser	5f105f9bdd	[ScopDectect] Allow memory accesses with different element types by default First support for this feature was committed in r259784. Support for loop invariant load hoisting with different types was added by Johannes Doerfert in r260045. This fixed the last known bug. llvm-svn: 260853	2016-02-14 19:11:16 +00:00
Hongbin Zheng	fec328083a	Use unique_ptr to manage Scop inside ScopInfo. llvm-svn: 260821	2016-02-13 15:13:02 +00:00
Hongbin Zheng	660f3ccfa5	Move AccFuncMap from ScopInfo into Scop Since the origin AccFuncMap in ScopInfo is used by the underlying Scop only, and it must stay alive until we delete the Scop. It will be better if we simply move the origin AccFuncMap in ScopInfo into the Scop class. llvm-svn: 260820	2016-02-13 15:12:58 +00:00
Hongbin Zheng	192f69a0fb	Do not carry LoopInfo along with a Scop. Make Scop become more portable such that we can use it in a CallGraphSCC pass. The first step is to drop the analyses that are only used during Scop construction. This patch drop LoopInfo from Scop. llvm-svn: 260819	2016-02-13 15:12:54 +00:00
Hongbin Zheng	f53ffa6dc3	Do not carry DominatorTree along with a Scop. Make Scop become more portable such that we can use it in a CallGraphSCC pass. The first step is to drop the analyses that are only used during Scop construction. This patch drop DominatorTree from Scop. llvm-svn: 260818	2016-02-13 15:12:51 +00:00
Hongbin Zheng	7dddfba7dc	Do not carry ScopDetection along with a Scop. Make Scop become more portable such that we can use it in a CallGraphSCC pass. The first step is to drop the analyses that are only used during Scop construction. This patch drop ScopDecection from Scop. llvm-svn: 260817	2016-02-13 15:12:47 +00:00
Johannes Doerfert	96e5471139	Separate invariant equivalence classes by type We now distinguish invariant loads to the same memory location if they have different types. This will cause us to pre-load an invariant location once for each type that is used to access it. However, we can thereby avoid invalid casting, especially if an array is accessed though different typed/sized invariant loads. This basically reverts the changes in r260023 but keeps the test cases. llvm-svn: 260045	2016-02-07 17:30:13 +00:00
Johannes Doerfert	e708790c59	[FIX] Two "off-by-one" error in constant range usage llvm-svn: 260031	2016-02-07 13:59:03 +00:00
Johannes Doerfert	adeab372ca	Simplify code [NFC] llvm-svn: 260030	2016-02-07 13:57:32 +00:00
Tobias Grosser	8ebdc2dd53	Make memory accesses with different element types optional We also disable this feature by default, as there are still some issues in combination with invariant load hoisting that slipped through my initial testing. llvm-svn: 260025	2016-02-07 08:48:57 +00:00
Tobias Grosser	46bafbd0fe	Do not yet consider loads with non-canonical element size for load hoisting. Invariant load hoisting of memory accesses with non-canonical element types lacks support for equivalence classes that contain elements of different width/size. This support should be added, but to get our buildbots back to green, we disable load hoisting for memory accesses with non-canonical element size for now. llvm-svn: 260023	2016-02-07 08:11:36 +00:00
Tobias Grosser	107cd5f5f6	IslNodeBuilder: Invariant load hoisting of elements with differing sizes Always use access-instruction pointer type to load the invariant values. Otherwise mismatches between ScopArrayInfo element type and memory access element type will result in invalid casts. These type mismatches are after r259784 a lot more common and also arise with types of different size, which have not been handled before. Interestingly, this change actually simplifies the code, as we now have only one code path that is always taken, rather then a standard code path for the common case and a "fixup" code path that replaces the standard code path in case of mismatching types. llvm-svn: 260009	2016-02-06 21:23:39 +00:00
Michael Kruse	2e02d560aa	Follow uses to create value MemoryAccesses The previously implemented approach is to follow value definitions and create write accesses ("push defs") while searching for uses. This requires the same relatively validity- and requirement conditions to be replicated at multiple locations (PHI instructions, other instructions, uses by PHIs). We replace this by iterating over the uses in a SCoP ("pull in requirements"), and add writes only when at least one read has been added. It turns out to be simpler code because each use is only iterated over once and writes are added for the first access that reads it. We need another iteration to identify escaping values (uses not in the SCoP), which also makes the difference between such accesses more obvious. As a side-effect, the order of scalar MemoryAccess can change. Differential Revision: http://reviews.llvm.org/D15706 llvm-svn: 259987	2016-02-06 09:19:40 +00:00
Tobias Grosser	d840fc7277	Support accesses with differently sized types to the same array This allows code such as: void multiple_types(char Short, char Float, char Double) { for (long i = 0; i < 100; i++) { Short[i] = (short )&Short[2 i]; Float[i] = (float )&Float[4 * i]; Double[i] = (double )&Double[8 * i]; } } To model such code we use as canonical element type of the modeled array the smallest element type of all original array accesses, if type allocation sizes are multiples of each other. Otherwise, we use a newly created iN type, where N is the gcd of the allocation size of the types used in the accesses to this array. Accesses with types larger as the canonical element type are modeled as multiple accesses with the smaller type. For example the second load access is modeled as: { Stmt_bb2[i0] -> MemRef_Float[o0] : 4i0 <= o0 <= 3 + 4i0 } To support code-generating these memory accesses, we introduce a new method getAccessAddressFunction that assigns each statement instance a single memory location, the address we load from/store to. Currently we obtain this address by taking the lexmin of the access function. We may consider keeping track of the memory location more explicitly in the future. We currently do _not_ handle multi-dimensional arrays and also keep the restriction of not supporting accesses where the offset expression is not a multiple of the access element type size. This patch adds tests that ensure we correctly invalidate a scop in case these accesses are found. Both types of accesses can be handled using the very same model, but are left to be added in the future. We also move the initialization of the scop-context into the constructor to ensure it is already available when invalidating the scop. Finally, we add this as a new item to the 2.9 release notes Reviewers: jdoerfert, Meinersbur Differential Revision: http://reviews.llvm.org/D16878 llvm-svn: 259784	2016-02-04 13:18:42 +00:00
Tobias Grosser	a7f1e04031	Update to isl-0.16.1-20-gee54b48 This includes some (optional) improvements to the isl scheduler, which we do not use yet, as well as a fix for a bug previously also affecting Polly: commit 662ee9b7d45ebeb7629b239d3ed43442e25bf87c Author: Sven Verdoolaege <skimo@kotnet.org> Date: Mon Jan 25 16:59:32 2016 +0100 isl_basic_map_realign: perform Gaussian elimination on result Many parts of isl assume that Gaussian elimination has been applied to the equality constraints. In particular singleton_extract_point makes this assumption. The input to singleton_extract_point may have undergone parameter alignment. This parameter alignment (ultimately performed by isl_basic_map_realign) therefore needs to make sure the result preserves this property llvm-svn: 259757	2016-02-04 06:19:12 +00:00
Tobias Grosser	bea6c54723	Properly build shared libraries if LLVM_LINK_LLVM_DYLIB is enabled. Contributed-by: Jack Howarth <howarthjw@gmail.com> llvm-svn: 259659	2016-02-03 16:29:04 +00:00
Tobias Grosser	e2c31210b2	Revert "Support loads with differently sized types from a single array" This reverts commit (@259587). It needs some further discussions. llvm-svn: 259629	2016-02-03 05:53:27 +00:00
Tobias Grosser	5d3fc1ea43	Support loads with differently sized types from a single array We support now code such as: void multiple_types(char Short, char Float, char Double) { for (long i = 0; i < 100; i++) { Short[i] = (short )&Short[2 i]; Float[i] = (float )&Float[4 * i]; Double[i] = (double )&Double[8 * i]; } } To support such code we use as element type of the modeled array the smallest element type of all original array accesses. Accesses with larger types are modeled as multiple accesses with the smaller type. For example the second load access is modeled as: { Stmt_bb2[i0] -> MemRef_Float[o0] : 4i0 <= o0 <= 3 + 4i0 } To support jscop-rewritable memory accesses we need each statement instance to only be assigned a single memory location, which will be the address at which we load the value. Currently we obtain this address by taking the lexmin of the access function. We may consider keeping track of the memory location more explicitly in the future. llvm-svn: 259587	2016-02-02 22:05:29 +00:00
Tobias Grosser	db543ede7e	ScopInfo: Split memory access construction into different cases We create separate functions for fixed-size multi-dimensional, parameteric-sized multi-dimensional, as well as single-dimensional memory accesses to reduce the complexity of a large monolithic function. Suggested-by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 259522	2016-02-02 16:46:49 +00:00
Tobias Grosser	5d51afedd8	ScopInfo: Do not track element-size as additional array size dimension [NFC] There is no need to pass the size of the elements as the last size dimension to ScopArrayInfo. This information is already available through the ElementType. Tracking it twice is not only redundant but may result in inconsistencies. llvm-svn: 259521	2016-02-02 16:46:45 +00:00
Johannes Doerfert	800e17a75c	Add const keyword to MemoryAccess argument [NFC] llvm-svn: 259504	2016-02-02 14:16:01 +00:00
Johannes Doerfert	a1f291ed18	Rename the DataLayout member [NFC] llvm-svn: 259502	2016-02-02 14:15:13 +00:00
Johannes Doerfert	a87810351c	Remove helper function [NFC] llvm-svn: 259501	2016-02-02 14:14:40 +00:00
Johannes Doerfert	c36d39b7a8	Remove unnecessary getAnalysis call [NFC] llvm-svn: 259500	2016-02-02 14:14:20 +00:00
Tobias Grosser	cc77950b2b	ScopInfo: Rename AccessType -> ElementType to unify naming with method declaration llvm-svn: 259495	2016-02-02 13:22:54 +00:00
Tobias Grosser	c2fd8b411d	ScopInfo: Correct schedule construction For schedule generation we assumed that the reverse post order traversal used by the domain generation is sufficient, however it is not. Once a loop is discovered, we have to completely traverse it, before we can generate the schedule for any block/region that is only reachable through a loop exiting block. To this end, we add a "loop stack" that will keep track of loops we discovered during the traversal but have not yet traversed completely. We will never visit a basic block (or region) outside the most recent (thus smallest) loop in the loop stack but instead queue such blocks (or regions) in a waiting list. If the waiting list is not empty and (might) contain blocks from the most recent loop in the loop stack the next block/region to visit is drawn from there, otherwise from the reverse post order iterator. We exploit the new property of loops being always completed before additional loops are processed, by removing the LoopSchedules map and instead keep all information in LoopStack. This clarifies that we indeed always only keep a stack of in-process loops, but will never keep incomplete schedules for an arbitrary set of loops. As a result, we can simplify some of the existing code. This patch also adds some more documentation about how our schedule construction works. This fixes http://llvm.org/PR25879 This patch is an modified version of Johannes Doerfert's initial fix. Differential Revision: http://reviews.llvm.org/D15679 llvm-svn: 259354	2016-02-01 11:54:13 +00:00
Tobias Grosser	ccbe383594	ScopInfo: Drop dead code in schedule description In https://llvm.org/svn/llvm-project/polly/trunk@251870 code was committed to avoid a failure in the presence of infinite loops, but the test case committed along with this change passes without the actual change. I looked back into the code and also checked with the original committer (Johannes), but could not find the reason why the code is needed. The introduction of LoopStacks for buildSchedule in one of the next commits will make it even more clear that this code is not needed, but I remove this ahead of time to facilitate bisecting in case I missed something. llvm-svn: 259347	2016-02-01 10:07:43 +00:00
Tobias Grosser	0dd4a9a9f4	ScopInfo: use std::distance to shorten code [NFC] llvm-svn: 259337	2016-02-01 01:55:08 +00:00
Craig Topper	7fb6e47101	Replace utostr_32 use with utostr to match removal from llvm. llvm-svn: 259333	2016-01-31 20:36:20 +00:00
Tobias Grosser	ed873cd3c1	cmake: Add additional libraries to fix the darwin shared library build darwin requires the additional linkages of... LLVMBitReader LLVMMCParser LLVMObject LLVMProfileData LLVMTarget LLVMVectorize as the darwin requires all of the weak undefined symbols in a library to be resolved when linking it against an executable (unless -Wl,-undefined,dynamic_lookup is used to override the default behavior of -Wl,-undefined,error). Contributed-by: Jack Howarth llvm-svn: 259332	2016-01-31 20:25:46 +00:00
Michael Kruse	26311f00e8	Remove autotools build system The autotools build system is based on and requires LLVM's autotools build system to work, which has been depricated and finally removed in r258861. Consequently we also remove the autotools build system from Polly. Differential Revision: http://reviews.llvm.org/D16655 llvm-svn: 259041	2016-01-28 12:00:33 +00:00
Michael Kruse	fd46308de4	ScopInfo: Never add read accesses for synthesizable values Before adding a MK_Value READ MemoryAccess, check whether the read is necessary or synthesizable. Synthesizable values are later generated by the SCEVExpander and therefore do not need to be transferred explicitly. This can happen because the check for synthesizability has presumbly been forgotten in the case where a phi's incoming value has been defined in a different statement. Differential Revision: http://reviews.llvm.org/D15687 llvm-svn: 258998	2016-01-27 22:51:56 +00:00
Michael Kruse	70131d3416	Introduce MemAccInst helper class; NFC MemAccInst wraps the common members of LoadInst and StoreInst. Also use of this class in: - ScopInfo::buildMemoryAccess - BlockGenerator::generateLocationAccessed - ScopInfo::addArrayAccess - Scop::buildAliasGroups - Replace every use of polly::getPointerOperand Reviewers: jdoerfert, grosser Differential Revision: http://reviews.llvm.org/D16530 llvm-svn: 258947	2016-01-27 17:09:17 +00:00
Michael Kruse	ee6a4fc680	Unique phi write accesses Ensure that there is at most one phi write access per PHINode and ScopStmt. In particular, this would be possible for non-affine subregions with multiple exiting blocks. We replace multiple MAY_WRITE accesses by one MUST_WRITE access. The written value is constructed using a PHINode of all exiting blocks. The interpretation of the PHI WRITE's "accessed value" changed from the incoming value to the PHI like for PHI READs since there is no unique incoming value. Because region simplification shuffles around PHI nodes -- particularly with exit node PHIs -- the PHINodes at analysis time does not always exist anymore in the code generation pass. We instead remember the incoming block/value pair in the MemoryAccess. Differential Revision: http://reviews.llvm.org/D15681 llvm-svn: 258809	2016-01-26 13:33:27 +00:00
Michael Kruse	ad28e5a589	Unique value read accesses Keep at most one value read MemoryAccess per value and statement; multiple generated loads do not have any additional effect. As one such MemoryAccess can cater multiple uses within the statement, the AccessInstruction property is not unique any more and set to nullptr. Differential Revision: http://reviews.llvm.org/D15510 llvm-svn: 258808	2016-01-26 13:33:15 +00:00
Michael Kruse	436db620e7	Unique value write accesses Ensure there is at most one write access per definition of an llvm::Value. Keep track of already created value write access by using a (dense) map. Replace addValueWriteAccess by ensureValueStore which can be uses more liberally without worrying to add redundant accesses. It will be used, e.g. in a logical correspondant for value reads -- ensureValueReload -- to ensure that the expected definition has been written when loading it. Differential Revision: http://reviews.llvm.org/D15483 llvm-svn: 258807	2016-01-26 13:33:10 +00:00
Johannes Doerfert	6f50c29ab2	[FIX] Domain generation error due to loops in non-affine regions llvm-svn: 258803	2016-01-26 11:03:25 +00:00
Johannes Doerfert	432658d7b8	[FIX] Build correct domain for non-affine region SCoPs llvm-svn: 258802	2016-01-26 11:01:41 +00:00
Tobias Grosser	f2cdd144e5	BlockGenerators: Replace getNewScalarValue with getNewValue Both functions implement the same functionality, with the difference that getNewScalarValue assumes that globals and out-of-scop scalars can be directly reused without loading them from their corresponding stack slot. This is correct for sequential code generation, but causes issues with outlining code e.g. for OpenMP code generation. getNewValue handles such cases correctly. Hence, we can replace getNewScalarValue with getNewValue. This is not only more future proof, but also eliminates a bunch of code. The only functionality that was available in getNewScalarValue that is lost is the on-demand creation of scalar values. However, this is not necessary any more as scalars are always loaded at the beginning of each basic block and will consequently always be available when scalar stores are generated. As this was not the case in older versions of Polly, it seems the on-demand loading is just some older code that has not yet been removed. Finally, generateScalarLoads also generated loads for values that are loop invariant, available in GlobalMap and which are preferred over the ones loaded in generateScalarLoads. Hence, we can just skip the code generation of such scalar values, avoiding the generation of dead code. Differential Revision: http://reviews.llvm.org/D16522 llvm-svn: 258799	2016-01-26 10:01:35 +00:00
Tobias Grosser	5c7f16be6b	BlockGenerators: Avoid redundant map lookup [NFC] llvm-svn: 258660	2016-01-24 14:16:59 +00:00
Tobias Grosser	c9abde8c51	ScopInfo: Simplify code by folding definition into if llvm-svn: 258632	2016-01-23 20:23:06 +00:00
Tobias Grosser	1c3a6d7808	ScopDetection: Do not detect regions with irreducible control as scops Polly currently does not support irreducible control and it is probably not worth supporting. This patch adds code that checks for irreducible control and refuses regions containing irreducible control. Polly traditionally had rather restrictive checks on the control flow structure which would have refused irregular control, but within the last couple of months most of the control flow restrictions have been removed. As part of this generalization we accidentally allowed irregular control flow. Contributed-by: Karthik Senthil and Ajith Pandel llvm-svn: 258497	2016-01-22 09:44:37 +00:00
Sumanth Gundapaneni	4b1472fb2b	Fix the error in non-asserts Release mode build. NFC llvm-svn: 258318	2016-01-20 15:41:30 +00:00
Johannes Doerfert	370cf00c9f	Make sure we preserve alignment information after hoisting invariant load In Polly, after hoisting loop invariant loads outside loop, the alignment information for hoisted loads are missing, this patch restore them. Contributed-by: Lawrence Hu <lawrence@codeaurora.org> Differential Revision: http://reviews.llvm.org/D16160 llvm-svn: 258105	2016-01-19 00:17:21 +00:00
Michael Kruse	dc8508e72a	Do not check JSON alignment of scalar accesses When importing a schedule, do not verify the load/store alignment of scalar accesses. Scalar loads/store are always created newly in code generation with no alignment restrictions. Previously, scalar alignment was checked if the access instruction happened to be a LoadInst or StoreInst, but only its array (MK_Array) access is relevant. This will be implicitly unit-tested when the access instruction of a value read can be nullptr. Differential Revision: http://reviews.llvm.org/D15680 llvm-svn: 257904	2016-01-15 16:49:33 +00:00
Michael Kruse	959a8dc39f	Update to ISL 0.16.1 llvm-svn: 257898	2016-01-15 15:54:45 +00:00
Michael Kruse	f7d5e40860	Add option to update-isl.sh for specific commit This is used e.g. to update specifically to ISL 0.16.1 although newer revisions exist in the official repository. llvm-svn: 257895	2016-01-15 15:41:59 +00:00
Roman Gareev	b0c4e49a37	Fix of r257495. Remove redundant "FPM->add(createDemoteRegisterToMemoryPass());" llvm-svn: 257514	2016-01-12 20:47:48 +00:00
Roman Gareev	6ebc01c973	We do not need to schedule another loop interchange pass after Polly, as Polly should perform loop interchanges itself. This also fixes a bug we see due to the "loop-interchange" pass producing incorrect IR when compiling linpack-pc.c from the LLVM test-suite with "-polly-position=before-vectorizer". Reviewed-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 257495	2016-01-12 17:59:06 +00:00
Roman Gareev	10595a1739	Call assumeNoOutOfBound only in updateDimensionality Call assumeNoOutOfBound only in updateDimensionality to process situations when new dimensions are added and new bounds checks are required. Contributed-by: Tobias Grosser, Gareev Roman llvm-svn: 257170	2016-01-08 14:01:59 +00:00
Tobias Grosser	8362c26113	Define buildScheduleRec on RegionNodes and pull out the tree traversal [NFC] This change clarifies that for Not-NonAffine-SubRegions we actually iterate over the subnodes and for both NonAffine-SubRegions and BasicBlocks, we perform the schedule construction. As a result, the tree traversal becomes trivial, the special case for a scop consisting just of a single non-affine region disappears and the indentation of the code is reduced. No functional change intended. llvm-svn: 256940	2016-01-06 15:30:06 +00:00
Johannes Doerfert	f9711ef922	Extract constant parts of the schedule generation [NFC] llvm-svn: 256931	2016-01-06 12:59:23 +00:00
Tobias Grosser	05e71b9f61	ScopInfo: use getStmtForRegionNode to simplify code slightly llvm-svn: 256914	2016-01-06 05:18:20 +00:00
Michael Kruse	58fa3bb63a	Do not store scalar accesses in InstructionToAccess At code generation, scalar reads are generated before the other statement's instructions, respectively scalar writes after them, in contrast to array accesses which are "executed" with the instructions they are linked to. Therefore it makes sense to not map the scalar accesses to a place of execution. Follow-up patches will also remove some of the directs links from a scalar access to a single instruction, such that only having array accesses in InstructionToAccess ensures consistency. Differential Revision: http://reviews.llvm.org/D13676 llvm-svn: 256298	2015-12-22 23:25:11 +00:00
Johannes Doerfert	5dced2693e	Refactor canSynthesize in the BlockGenerators [NFC] llvm-svn: 256269	2015-12-22 19:08:49 +00:00
Johannes Doerfert	28f8ac1db2	Treat inline assembly as a constant in the code generation. llvm-svn: 256267	2015-12-22 19:08:24 +00:00
Johannes Doerfert	42df8d1db6	Reduce indention in BlockGenerator::trySynthesizeNewValue [NFC] llvm-svn: 256266	2015-12-22 19:08:01 +00:00
Tobias Grosser	c6424ae46c	ScopDetection: Simplify std::distance(....) to BB->size() Suggested by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 256260	2015-12-22 17:38:59 +00:00
Tobias Grosser	fcabb155c1	BlockGenerators: Remove unnecessary const_cast llvm-svn: 256227	2015-12-22 01:41:25 +00:00
Tobias Grosser	c900633d60	ScopInfo: Small improvement to schedule construction [NFC] We clarify that certain code is only executed if LSchedule is != nullptr. Previously some of these functions have been executed, but they only passed a nullptr through. This caused some confusion when reading the code. llvm-svn: 256209	2015-12-21 23:01:53 +00:00
Tobias Grosser	cbf7ae8fef	ScopInfo: Polish the implementation of mapToDimension Besides improving the documentation and the code we now assert in case the input is invalid (N < 0) and also do not any more return a nullptr in case USet is empty. This should make the code more readable. llvm-svn: 256208	2015-12-21 22:45:53 +00:00
Tobias Grosser	c1a269bf0e	Add option to assume single-loop scops with sufficient compute are profitable If a loop has a sufficiently large amount of compute instruction in its loop body, it is unlikely that our rewrite of the loop iterators introduces large performance changes. As Polly can also apply beneficical optimizations (such as parallelization) to such loop nests, we mark them as profitable. This option is currently "disabled" by default, but can be used to run experiments. If enabled by setting it e.g. to 40 instructions, we currently see some compile-time increases on LNT without any significant run-time changes. llvm-svn: 256199	2015-12-21 21:00:43 +00:00
Tobias Grosser	5624d3c978	Adjust formatting to clang-format changes in 256149 llvm-svn: 256151	2015-12-21 12:38:56 +00:00
Tobias Grosser	97fc5bb7f7	ScopDetect: Extract profitability check into subfunction .. and add some documentation. We also simplify the code by dropping an early check that is also covered by the the later checks. This might have a small compile time impact, but as the scops that are skipped are small we should probably only add this back in the unlikely case that this has a notable compile-time cost. No functional change intended. llvm-svn: 256149	2015-12-21 12:14:48 +00:00
Tobias Grosser	594882573e	ScopInfo: Return immediately if scop is unprofitable and marked invalid As we already log an error when calling invalid, scops unprofitable scops are in any case marked invalid, but returning immediately safes (a tiny bit of) compile time and is consistent with our use of 'invalid' in the remainder of the file. Found by inspection. llvm-svn: 256140	2015-12-21 09:09:44 +00:00
Tobias Grosser	bfaf1ae309	ScopInfo: Return in case we found an invalid array size Without this return we still log the incorrect array size (and do not detect this scop), but we would unnecessarily continue to verify that access functions are affine. As we do not need to do this, we can return right ahead and consequently safe compile time. This issue was found by inspection. llvm-svn: 256139	2015-12-21 09:09:39 +00:00
Tobias Grosser	949e8c6ac6	ScopInfo: Check for the existance of a single memory accesses Instead of counting all array memory accesses associated with a load instruction, we now explicitly check that the single array access that could (potentially) be associated with a load instruction does not exist. This helps to document the current behavior of Polly where load instructions can indeed have at most one associated array access. In the unlikely case this changes in the future, we add an assert for the case where two load accesses would prevent us to return a single memory access, but we still should communicate that not all array memory accesses have been removed. This addresses post-commit comments from Johannes Doerfert for commit 255776. llvm-svn: 256136	2015-12-21 07:10:39 +00:00
Johannes Doerfert	30e2307f61	[FIX] Schedule generation for block exiting multiple loops. This fixes bug PR25604. llvm-svn: 256125	2015-12-20 17:12:22 +00:00
Michael Kruse	8fc2896ee0	Compile fix: Use "&&" operator instead of "and" llvm-svn: 256124	2015-12-20 14:42:32 +00:00
Tobias Grosser	75dc40c3be	ScopInfo: Bail out in case of complex branch structures Scops that contain many complex branches are likely to result in complex domain conditions that consist of a large (> 100) number of conjucts. Transforming such domains is expensive and unlikely to result in efficient code. To avoid long compile times we detect this case and skip such scops. In the future we may improve this by either using non-affine subregions to hide such complex condition structures or by exploiting in certain cases properties (e.g., dominance) that allow us to construct the domains of a scop in a way that results in a smaller number improving conjuncts. Example of a code that results in complex iteration spaces: loop.header / \| \ \ A0 A2 A4 \ \ / \ / \ A1 A3 \ / \ / \ \| B0 B2 B4 \| \ / \ / \| B1 B3 ^ / \ / \ \| C0 C2 C4 \| \ / \ / / C1 C3 / \ / / loop backedge llvm-svn: 256123	2015-12-20 13:31:48 +00:00
Roman Gareev	8aa437503c	Fix delinearization of fortran arrays The patch fixes Bug 25759 produced by inappropriate handling of unsigned maximum SCEV expressions by SCEVRemoveMax. Without a fix, we get an infinite loop and a segmentation fault, if we try to process, for example, '((-1 + (-1 * %b1)) umax {(-1 + (-1 * %yStart)),+,-1}<%.preheader>)'. It also fixes a potential issue related to signed maximum SCEV expressions. Tested-by: Roman Gareev <gareevroman@gmail.com> Fixed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: http://reviews.llvm.org/D15563 llvm-svn: 255922	2015-12-17 20:37:17 +00:00
Tobias Grosser	deb4abb409	Fix formatting llvm-svn: 255892	2015-12-17 13:04:30 +00:00
Tobias Grosser	eb2eebe486	ScopGraphPrinter: Only show functions that contain at least one scop When running 'clang -O3 -mllvm -polly -mllvm -polly-show' we now only show the CFGs of functions with at least one detected scop. For larger files/projects this reduces the number of graphs printed significantly and is likely what developers want to see. The new option -polly-view-all enforces all graphs to be printed and the exiting option -poll-view-only limites the graph printing to functions that match a certain pattern. This patch requires https://llvm.org/svn/llvm-project/llvm/trunk@255889 (and vice versa) to compile correctly. llvm-svn: 255891	2015-12-17 12:55:26 +00:00
Tobias Grosser	10120189ab	ScopInfo: Directly store MemoryAccessList in InstructionToAccess This avoids the need for explicit memory management, simplifies the code and also fixes a memory leak in removeMemoryAccesses. llvm-svn: 255777	2015-12-16 16:14:03 +00:00
Tobias Grosser	2ed317383b	ScopInfo: Introduce getNumberOfArrayAccesses Use the new function to clarify that we indeed only want to know it at least one array access is associated with an instruction. llvm-svn: 255776	2015-12-16 16:14:00 +00:00
Tobias Grosser	35ec5fbb8c	ScopInfo: Use getArrayAccessFor in reduction detection Load instructions may possibly be related to multiple memory accesses, but we are only interested in the array read access that describes the memory location the load instructions loads from. By using getArrayAccessfor we ensure to always obtain the right memory access. This issue was found by inspection without having a failing test case. llvm-svn: 255716	2015-12-15 23:50:04 +00:00
Tobias Grosser	184a4926b3	BlockGenerator: Use getArrayAccessFor for vector code generation getAccessFor does not guarantee a certain access to be returned in case an instruction is related to multiple accesses. However, in the vector code generation we want to know the stride of the array access of a store instruction. By using getArrayAccessFor we ensure we always get the correct memory access. This patch fixes a potential bug, but I was unable to produce a failing test case. Several existing test cases cover this code, but all of them already passed out of luck (or the specific but not-guaranteed order in which we build memory accesses). llvm-svn: 255715	2015-12-15 23:50:01 +00:00
Tobias Grosser	a69d4f0d83	VectorBlockGenerator: Generate scalar loads for vector statements When generating scalar loads/stores separately the vector code has not been updated. This commit adds code to generate scalar loads for vector code as well as code to assert in case scalar stores are encountered within a vector loop. llvm-svn: 255714	2015-12-15 23:49:58 +00:00
Tobias Grosser	0921477248	ScopInfo: Look up first (and only) array access When rewriting the access functions of load/store statements, we are only interested in the actual array memory location. The current code just took the very first memory access, which could be a scalar or an array access. As a result, we failed to update access functions even though this was requested via .jscop. llvm-svn: 255713	2015-12-15 23:49:53 +00:00
Michael Kruse	5bbc0e1888	Fix typos; NFC llvm-svn: 255580	2015-12-14 23:41:32 +00:00
Tobias Grosser	9bd0dad926	BlockGenerator: Do not use fast-path for external constants This change should not change the behavior of Polly today, but it allows external constants to be remapped e.g. when targetting multiple LLVM modules. llvm-svn: 255506	2015-12-14 16:19:59 +00:00
Tobias Grosser	6f764bbd9c	BlockGenerator: Drop unneeded const_casts llvm-svn: 255505	2015-12-14 16:19:54 +00:00
Tobias Grosser	f4f6870ff2	Revert "Always treat scalar writes as MUST_WRITEs" This reverts commit r255471. Johannes raised in the post-commit review of r255471 the concern that PHI writes in non-affine regions with two exiting blocks are not really MUST_WRITE, but we just know that at least one out of the set of all possible PHI writes will be executed. Modeling all PHI nodes as MUST_WRITEs is probably save, but adding the needed documentation for such a special case is probably not worth the effort. Michael will be proposing a new patch that ensures only a single PHI_WRITE is created for non-affine regions, which - besides other benefits - should also allow us to use a single well-defined MUST_WRITE for such PHI writes. (This is not a full revert, but the condition and documentation have been slightly extended) llvm-svn: 255503	2015-12-14 15:05:37 +00:00
Michael Kruse	34e1122a0d	Rename addScalar(Read\|Write)Access to addValue(Read\|Write)Access Adapt the method names to the new scheme introduced in r255467. llvm-svn: 255474	2015-12-13 22:47:43 +00:00
Michael Kruse	e934725f1d	Check guaranteed execution by using DominatorTree Before this commit, only the region's entry block was assumed to always execute in a non-affine subregion. We replace this by a test whether it dominates the exit block (this necessarily includes the entry block) which should be more accurate. llvm-svn: 255473	2015-12-13 22:10:40 +00:00
Michael Kruse	daf669418c	Store DominatorTree as a field in ScopInfo This harmonizes DT with the other analyses in ScopInfo and makes it available for use in its methods. llvm-svn: 255472	2015-12-13 22:10:37 +00:00
Michael Kruse	b06e3029d1	Always treat scalar writes as MUST_WRITEs LLVM's IR guarantees that a value definition occurs before any use, and also the value of a PHI must be one of the incoming values, "written" in one of the incoming blocks. Hence, such writes are never conditional in the context of a non-affine subregion. llvm-svn: 255471	2015-12-13 22:10:32 +00:00
Tobias Grosser	29f38ab732	ScopInfo: Split out invariant load hoisting into multiple functions [NFC] This reduces indentation and makes the code more readable. llvm-svn: 255468	2015-12-13 21:00:40 +00:00
Tobias Grosser	a535dff471	ScopInfo: Harmonize the different array kinds Over time different vocabulary has been introduced to describe the different memory objects in Polly, resulting in different - often inconsistent - naming schemes in different parts of Polly. We now standartize this to the following scheme: KindArray, KindValue, KindPHI, KindExitPHI \| ------- isScalar -----------\| In most cases this naming scheme has already been used previously (this minimizes changes and ensures we remain consistent with previous publications). The main change is that we remove KindScalar to clearify the difference between a scalar as a memory object of kind Value, PHI or ExitPHI and a value (former KindScalar) which is a memory object modeling a llvm::Value. We also move all documentation to the Kind* enum in the ScopArrayInfo class, remove the second enum in the MemoryAccess class and update documentation to be formulated from the perspective of the memory object, rather than the memory access. The terms "Implicit"/"Explicit", formerly used to describe memory accesses, have been dropped. From the perspective of memory accesses they described the different memory kinds well - especially from the perspective of code generation - but just from the perspective of a memory object it seems more straightforward to talk about scalars and arrays, rather than explicit and implicit arrays. The last comment is clearly subjective, though. A less subjective reason to go for these terms is the historic use both in mailing list discussions and publications. llvm-svn: 255467	2015-12-13 19:59:01 +00:00
Michael Kruse	b8d2644732	Print "null" for ISL objects that are nullptr Use it to print "null" if a MemoryAccess's access relation is not available instead of printing nothing. Suggested-by: Johannes Doerfert llvm-svn: 255466	2015-12-13 19:35:26 +00:00
Michael Kruse	a902ba6f1e	Reuse ScopStmt::isEmpty() function Introduce a function getStmtForRegionNode() to the corresponding ScopStmt of a RegionNode. We can use it to call the existing ScopStmt::isEmpty() function instead of searching for accesses. llvm-svn: 255465	2015-12-13 19:21:45 +00:00
Michael Kruse	e3ec4563b2	Check if access relation is available before printing Currently MemoryAccesses are not printed before the access relations are available, but might be printed during gdb sessions. llvm-svn: 255464	2015-12-13 18:47:02 +00:00
Tobias Grosser	8d4f6267f9	ScopInfo: Add helper function to invalidate a scop llvm-svn: 255430	2015-12-12 09:52:26 +00:00
Tobias Grosser	2d3d4ec860	executeScopConditionally: Introduce special exiting block When introducing separate control flow for the original and optimized code we introduce now a special 'ExitingBlock': \ / EnteringBB \| SplitBlock---------\ _____\|_____ \| / EntryBB \ StartBlock \| (region) \| \| \_ExitingBB_/ ExitingBlock \| \| MergeBlock---------/ \| ExitBB / \ This 'ExitingBlock' contains code such as the final_reloads for scalars, which previously were just added to whichever statement/loop_exit/branch-merge block had been generated last. Having an explicit basic block makes it easier to find these constructs when looking at the CFG. llvm-svn: 255107	2015-12-09 11:38:22 +00:00
Tobias Grosser	31441c4614	ScopInfo: Add MemoryAccess::isScalar() Suggested-by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 254921	2015-12-07 18:06:08 +00:00
Tobias Grosser	a5d9e65e17	Update isl to isl-0.15-142-gf101714 This update brings in improvements to isl's 'isolate' option that reduce the number of code versions generated. This results in both code-size and compile time reduction for outer loop vectorization. Thanks to Roman Garev and Sven Verdoolaege for working on this improvement. llvm-svn: 254706	2015-12-04 08:46:14 +00:00
Michael Kruse	aed1e03888	Update isl to isl-0.15-140-g9279e30 The motivation is to fix a compilation error with Visual Studio 2013. See http://reviews.llvm.org/D14886. Thanks to Sumanth Gundapaneni for finding the issue and suggesting a patch. llvm-svn: 254498	2015-12-02 09:35:04 +00:00
Michael Kruse	054729b420	Add script to update isl The script will checkout the most recent master from http://repo.or.cz/isl.git into /tmp, create a distribution tarball, and extract it as replacement of lib/External/isl. After that it can be committed to the Polly repository. llvm-svn: 254497	2015-12-02 09:34:46 +00:00
Tobias Grosser	2fd89da90d	Remove non-debug printing of domain set Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D15094 llvm-svn: 254343	2015-11-30 22:59:41 +00:00
Tobias Grosser	3a6ac9f9b5	ScopInfo: Further simplify code Acc==MA implies Acc->getAccessInstruction() == MA->getAccessInstruction(). Suggested as post-commit review for 254305 by Michael Kruse. llvm-svn: 254327	2015-11-30 21:13:43 +00:00
Tobias Grosser	ef9ca5db16	ScopInfo: Replace while/iterator construct with std::remove_if The use of C++'s high-level iterator functionality instead of two while loops and explicit iterator handling improves readability of this code. Proposed-by: Michael Kruse <llvm@meinersbur.de> Differential Revision: http://reviews.llvm.org/D15068 llvm-svn: 254305	2015-11-30 17:20:40 +00:00
Michael Kruse	4c86a1d97b	IR cleanup after CodeGeneration Re-run canonicalization passes after Polly's code generation. The set of passes currently added here are nearly all the passes between --polly-position=early and --polly-position=before-vectorizer, i.e. all passes that would usually run after Polly. In order to run these only if Polly actually modified the code, we add a function attribute "polly-optimzed" to a function that contains generated code. The cleanup pass is skipped if the function does not have this attribute. There is no support by the (legacy) PassManager to run passes only under some conditions. One could have wrapped all transformation passes to run only when CodeGeneration changed the code, but the analyses would run anyway. This patch creates an independent pass manager. The disadvantages are that all analyses have to re-run even if preserved and it does not honor compiler switches like the PassManagerBuilder does. Differential Revision: http://reviews.llvm.org/D14333 llvm-svn: 254150	2015-11-26 12:36:25 +00:00
Michael Kruse	cba170e4d0	Introduce origin/kind for exit PHI node accesses Previously, accesses that originate from PHI nodes in the exit block were registered as SCALAR. In some context they are treated as scalars, but it makes a difference in others. We used to check whether the AccessInstruction is a terminator to differentiate the cases. This patch introduces an MemoryAccess origin EXIT_PHI and a ScopArrayInfo kind KIND_EXIT_PHI to make this case more explicit. No behavioural change intended. Differential Revision: http://reviews.llvm.org/D14688 llvm-svn: 254149	2015-11-26 12:26:06 +00:00
Tobias Grosser	2f8e43d677	ScopInfo: Add support for delinearizing fortran arrays gfortran (and fortran in general?) does not compute the address of an array element directly from the array sizes (e.g., %s0, %s1), but takes first the maximum of the sizes and 0 (e.g., max(0, %s0)) before multiplying the resulting value with the per-dimension array subscript expressions. To successfully delinearize index expressions as we see them in fortran, we first filter 'smax' expressions out of the SCEV expression, use them to guess array size parameters and only then continue with the existing delinearization. llvm-svn: 253995	2015-11-24 17:06:38 +00:00
Tobias Grosser	4927c8e606	ScopInfo: Add option to ignore integer wrapping llvm-svn: 253976	2015-11-24 12:50:02 +00:00
Tobias Grosser	d68ba42556	ScopInfo: Split hasAffineMemoryAccesses() into multiple functions [NFC] This makes the overall code more readable. llvm-svn: 253951	2015-11-24 05:00:36 +00:00
Tobias Grosser	5ef2bc316d	Use SmallVector instead of std::vector This was proposed as post-commit review comment for commit r253818. Suggested by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 253862	2015-11-23 10:18:23 +00:00
Tobias Grosser	9737c7b431	ScopInfo: Remove domains of error blocks (and blocks they dominate) early on Trying to build up access functions for any of these blocks is likely to fail, as error blocks may contain invalid/non-representable instructions, and blocks dominated by error blocks may reference such instructions, which wil also cause failures. As all of these blocks are anyhow assumed to not be executed, we can just remove them early on. This fixes http://llvm.org/PR25596 llvm-svn: 253818	2015-11-22 11:06:51 +00:00
Tobias Grosser	fb3fb0a309	isl: Update to isl-0.15-136-g4d5654a The most interesting change for Polly in this isl update is 4d5654af which in certain cases can speed up the construction of run-time checks from an isl set consisting of several disjuncts significantly. llvm-svn: 253794	2015-11-21 20:48:39 +00:00
Johannes Doerfert	a2a74f09fc	Do not enforce lcssa At some point we enforced lcssa for the loop surrounding the entry block. This is not only questionable as it does not check any other loop but also not needed any more. llvm-svn: 253789	2015-11-21 17:00:02 +00:00
Johannes Doerfert	dec27df588	[FIX] Get the correct loop that surrounds a region llvm-svn: 253788	2015-11-21 16:56:13 +00:00
Tobias Grosser	b39c96aa19	ScopInfo: Ensure unique names for parameter names coming from load instructions In case the original parameter instruction does not have a name, but it comes from a load instruction where the base pointer has a name we used the name of the load instruction to give some more intuition of where the parameter came from. To ensure this works also through GEPs which may have complex offsets, we originally just dropped the offsets and _only_ used the base pointer name. As this can result in multiple parameters to get the same name, we now prefix the parameter ID to ensure parameter names are unique. This will make it easier to understand debug output. This change does not affect correctness, as parameter IDs (even of the same name) can always be distinguished through the SCEV pointer stored inside them. llvm-svn: 253330	2015-11-17 11:54:51 +00:00
Tobias Grosser	f920fb19f1	ScopInfo: Reset compute operations before starting compute out region Without this change we may start to refuse scops in larger compilation units just because a lot of code has already been compiled earlier. Found by inspection. I do not yet have a good test case for this. llvm-svn: 253050	2015-11-13 16:56:13 +00:00
Johannes Doerfert	a4b77c079b	[FIX] Bail if access function is not divisible by element size. llvm-svn: 252942	2015-11-12 20:15:32 +00:00
Johannes Doerfert	55b3d8b831	Consistenly use getTypeAllocSize for size estimation. Only when we check for wrapping we want to use the store size, for all other cases we use the alloc size now. Suggested by: Tobias Grosser <tobias@grosser.es> llvm-svn: 252941	2015-11-12 20:15:08 +00:00
Tobias Grosser	2ac2338a03	ScopInfo: Add function to retrieve all memory accesses in a scop llvm-svn: 252894	2015-11-12 14:07:13 +00:00
Tobias Grosser	e5a3514e4a	ScopInfo: Use lambda functions to reduce code duplication. llvm-svn: 252893	2015-11-12 14:07:09 +00:00
Tobias Grosser	bc29e0b27c	RegionGenerator: Only introduce subregion.ivs for loops fully within a subregion IVs of loops for which the loop header is in the subregion, but not the entire loop may be incremented outside of the subregion and can consequently not be kept private to the subregion. Instead, they need to and are modeled as virtual loops in the iteration domains. As this is the case, generating new subregion induction variables for such loops is not needed and indeed wrong as they would hide the virtual induction variables modeled in the scop. This fixes a miscompile in MultiSource/Benchmarks/Ptrdist/bc and MultiSource/Benchmarks/nbench/. Thanks Michael and Johannes for their investiagations and helpful observations regarding this bug. llvm-svn: 252860	2015-11-12 07:34:09 +00:00
Johannes Doerfert	2af10e2eed	Use parameter constraints provided via llvm.assume If an llvm.assume dominates the SCoP entry block and the assumed condition can be expressed as an affine inequality we will now add it to the context. Differential Revision: http://reviews.llvm.org/D14413 llvm-svn: 252851	2015-11-12 03:25:01 +00:00
Johannes Doerfert	d84493e52e	Emit remarks for taken assumptions Differential Revision: http://reviews.llvm.org/D14412 llvm-svn: 252848	2015-11-12 02:33:38 +00:00
Johannes Doerfert	0cf4e0aa42	Emit remark about aliasing pointers llvm-svn: 252847	2015-11-12 02:32:51 +00:00
Johannes Doerfert	48fe86f1ff	Emit SCoP source location as remark during ScopInfo This removes a similar feature from ScopDetection, though with -polly-report that feature present twice anyway. llvm-svn: 252846	2015-11-12 02:32:32 +00:00
Johannes Doerfert	fdbf201fc9	[FIX] Do not generate code for parameters referencing dead values Check if a value that is referenced by a parameter is dead and do not generate code for the parameter in such a case. llvm-svn: 252813	2015-11-11 22:40:51 +00:00
Tobias Grosser	e19fca4525	ScopInfo: Bailing out means assigning isl_set_empty to the AssumedContext I got this the other way around in 252750. Thank you Johannes for noticing. llvm-svn: 252795	2015-11-11 20:21:39 +00:00
Tobias Grosser	910cf26811	ScopInfo: Do not try to model the memory accesses in an error block Error blocks may contain arbitrary instructions, among them some which we can not modeled correctly. As we do not generate ScopStmts for error blocks anyhow there is no point in trying to generate access functions for them. This fixes llvm.org/PR25494 llvm-svn: 252794	2015-11-11 20:15:49 +00:00
Johannes Doerfert	9d7899ef9c	[NFC] Simplify code llvm-svn: 252792	2015-11-11 20:01:31 +00:00
Tobias Grosser	316b5b2507	ScopInfo: Remove unused return value [NFC] llvm-svn: 252780	2015-11-11 19:28:14 +00:00
Tobias Grosser	a52b4da039	ScopInfo: Free set after compute out has been triggered In certain cases isl will not free the return values of operations for which a computeout has been triggered. Hence, make sure we free it explicitly. No test, as I did not manage to reduce one yet. llvm-svn: 252766	2015-11-11 17:59:53 +00:00
Tobias Grosser	4cd07b1188	ScopInfo: Bound compute time spent in boundary context construction For complex inputs our current approach of construction the boundary context may in rare cases become computationally so expensive that it is better to abort. This change adds a compute out check that bounds the compuations we spend on boundary context construction and bails out if this limit is reached. We can probably make our boundary construction algorithm more efficient, but this requires some more investigation and probably also some additional changes to isl. Until these have been added, we bound the compile time to ensure our buildbots are green. llvm-svn: 252758	2015-11-11 17:34:02 +00:00
Tobias Grosser	20a4c0c205	ScopInfo: Limit the number of disjuncts in assumed context In certain rare cases (mostly -polly-process-unprofitable on large sequences of conditions - often without any loop), we see some compile-time timeouts due to the construction of an overly complex assumption context. This change limits the number of disjuncts to 150 (adjustable), to prevent us from creating assumptions contexts that are too large for even the compilation to finish. The limit has been choosen as large as possible to make sure we do not unnecessarily drop test coverage. If such cases also appear in -polly-process-unprofitable=false mode we may need to think about this again, as the current limitations may still allow assumptions that are way to complex to be checked profitably at run-time. There is also certainly room for improvement regarding how (and how efficient) we construct an assumed context, but this requires some more thinking. This completes llvm.org/PR25458 llvm-svn: 252750	2015-11-11 16:22:36 +00:00
Tobias Grosser	e3d1f1c0b0	ScopDetection: Tighten the check for always executed 'error blocks' Basic blocks that are always executed can not be error blocks as their execution can not possibly be an unlikely event. In this commit we tighten the check if an error block to basic blcoks that do not dominate the exit condition, but that dominate all exiting blocks of the scop. llvm-svn: 252726	2015-11-11 13:25:13 +00:00
Tobias Grosser	b12b006c4b	ScopDetection: Do not allow blocks to reference operands in error blocks r252713 introduced a couple of regressions due to later basic blocks refering to instructions defined in error blocks which have not yet been modeled. This commit is currently just encoding limitations of our modeling and code generation backends to ensure correctness. In theory, we should be able to generate and optimize such regions, as everything that is dominated by an error region is assumed to not be executed anyhow. We currently just lack the code to make this happen in practice. llvm-svn: 252725	2015-11-11 12:44:18 +00:00
Tobias Grosser	b76cd3cc56	ScopInfo: Pass domain constraints through error blocks Previously, we just skipped error blocks during scop construction. With this change we make sure we can construct domains for error blocks such that these domains can be forwarded to subsequent basic blocks. This change ensures that basic blocks that post-dominate and are dominated by a basic block that branches to an error condition have the very same iteration domain as the branching basic block. Before, this change we would construct a domain that excludes all error conditions. Such domains could become _very_ complex and were undesirable to build. Another solution would have been to drop these constraints using a dominance/post-dominance check instead of modeling the error blocks. Such a solution could also work in case of unreachable statements or infinite loops in the scop. However, as we currently (to my believe incorrectly) model unreachable basic blocks in the post-dominance tree, such a solution is not yet feasible and requires first a change to LLVM's post-dominance tree construction. This commit addresses the most sever compile time issue reported in: http://llvm.org/PR25458 llvm-svn: 252713	2015-11-11 08:42:20 +00:00
Johannes Doerfert	dcfedf3505	[FIX] Cast pre-loaded values correctly or reload them with adjusted type. Especially for structs, the SAI object of a base pointer does not describe all the types that the user might expect when he loads from that base pointer. While we will still cast integers and pointers we will now reload the value with the correct type if floating point and non-floating point values are involved. However, there are now TODOs where we use bitcasts instead of a proper conversion or reloading. This fixes bug 25479. llvm-svn: 252706	2015-11-11 06:20:25 +00:00
Johannes Doerfert	fc4bfc465a	[FIX] Create empty invariant equivalence classes We now create all invariant equivalence classes for required invariant loads instead of creating them on-demand. This way we can check if a parameter references an invariant load that is actually not executed and was therefor not materialized. If that happens the parameter is not materialized either. This fixes bug 25469. llvm-svn: 252701	2015-11-11 04:30:07 +00:00
Tobias Grosser	6abc75af4c	ScopInfo: Introduce ArrayKind Since 252422 we do not only distinguish two ScopArrayInfo kinds, PHI nodes and others, but work with three kind of ScopArrayInfo objects. SCALAR, PHI and ARRAY objects. Instead of keeping two boolean flags isPHI and isScalar and wonder what an ScopArrayInfo object of kind (!isScalar && isPHI) is, we list now explicitly the three different possible types of memory objects. This change also allows us to remove the confusing nested pairs that have been used in ArrayInfoMapTy. llvm-svn: 252620	2015-11-10 17:31:31 +00:00
Tobias Grosser	b43cc62fe1	stringFromIslObj: Do not crash when printing 'null' objects No test case, as this code path is currently only used for debugging. llvm-svn: 252609	2015-11-10 15:09:44 +00:00
Tobias Grosser	262538435f	ScopInfo: Make getDimensionSize better reflect which dimensions carry sizes In polly the first dimensions of an array as well as all scalars do not carry any size information. This commit makes this explicit in the interface of getDimensionSize. Before this commit getDimensionSize(0) returned the size of the first dimension that carried a size. After this commit getDimensionSize(i) will either return the size of dimension 'i' or assert in case 'i' does not carry a size or does not exist at all. This very same behaviour was already present in getDimensionSizePw(). This commit also adds assertions that ensure getDimensionSizePw() is called appropriately. llvm-svn: 252607	2015-11-10 14:24:21 +00:00
Tobias Grosser	4ea2e07a60	ScopInfo: Make printing of ScopArrayInfo more similar to declarations in C Memory references are now printed as follows: Old New Scalars: i64 MemRef_val[] i64 MemRef_val; Arrays: i64 MemRef_A[][%m][%o][8] i64 MemRef_A[*][%m][%o]; We do not print any more information about the element size in the type. Such information has already been available in a comment after the scalar/array declaration. It was redundant and did not match well with what people were used from C. llvm-svn: 252602	2015-11-10 14:02:54 +00:00
Michael Kruse	c993739e0d	Fix non-affine generated entering node not being recognized as dominating Scalar reloads in the generated entering block were not recognized as dominating the subregions locks when there were multiple entering nodes. This resulted in values defined in there not being copied. As a fix, we unconditionally add the BBMap of the generated entering node to the generated entry. This fixes part of llvm.org/PR25439. This reverts 252449 and reapplies r252445. Its test was failing indeterministically due to r252375 which was reverted in r252522. llvm-svn: 252540	2015-11-09 23:33:40 +00:00
Michael Kruse	d6fb6f1b0c	Fix dominance when subregion exit is outside scop The dominance of the generated non-affine subregion block was based on the scop's merge block, therefore resulted in an invalid DominanceTree. It resulted in some values as assumed to be unusable in the actual generated exit block. We detect the case that the exit block has been moved and decide dominance using the BB at the original exit. If we create another exit node, that exit nodes is dominated by the one generated from where the original exit resides. This fixes llvm.org/PR25438 and part of llvm.org/PR25439. llvm-svn: 252526	2015-11-09 23:07:38 +00:00
Michael Kruse	ebffcbeefa	Revert r252375 "Fix non-affine region dominance of implicitely stored values" It introduced indeterminism as it was iterating over an address-indexed hashtable. The corresponding bug PR25438 will be fixed in a successive commit. llvm-svn: 252522	2015-11-09 22:37:29 +00:00
Johannes Doerfert	7a6e292d86	[FIX] Use same alloca for invariant loads and the scalar users llvm-svn: 252451	2015-11-09 06:28:45 +00:00
Johannes Doerfert	544b23a1ef	Revert "Fix non-affine generated entering node not being recognized as dominating" This reverts commit 9775824b265e574fc541e975d64d3e270243b59d due to a failing unit test. Please check and correct the unit test and commit again. llvm-svn: 252449	2015-11-09 06:04:05 +00:00
Michael Kruse	fd9c89e84b	Fix non-affine generated entering node not being recognized as dominating Scalar reloads in the generated entering block were not recognized as dominating the subregions locks when there were multiple entering nodes. This resulted in values defined in there not being copied. As a fix, we unconditionally add the BBMap of the generated entering node to the generated entry. This fixes part of llvm.org/PR25439. llvm-svn: 252445	2015-11-09 05:00:30 +00:00
Johannes Doerfert	188542fda9	[FIX] Initialize incoming scalar memory locations for PHIs llvm-svn: 252437	2015-11-09 00:21:21 +00:00
Johannes Doerfert	80c716df2b	[NFC] Remove unused variable. llvm-svn: 252436	2015-11-09 00:21:04 +00:00
Johannes Doerfert	f85ad0411f	[FIX] Carefully simplify assumptions in the presence of error blocks If a SCoP contains error blocks we cannot use the domain constraints to simplify the assumptions as the domain is already influenced by the assumptions we took. Before this patch we did that and some assumptions became self-fulfilling as they were implied by the domain constraints. llvm-svn: 252424	2015-11-08 20:16:39 +00:00
Johannes Doerfert	a768624f14	[FIX] Introduce different SAI objects for scalar and memory accesses Even if a scalar and memory access have the same base pointer, we cannot use one SAI object as the type but also the number of dimensions are wrong. For the attached test case this caused a crash in the invariant load hoisting, though it could cause various other problems too. This fixes bug 25428 and a execution time bug in MallocBench/cfrac. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 252422	2015-11-08 19:12:05 +00:00
Johannes Doerfert	3797707695	[FIX] Use unreachable to indicate dead code and repair dominance When we bail out early we make the partially build new code path practically dead, though it was not unreachable. To remove dominance problems we now make it not only dead but also prevent the control flow to join with the original code path, thus allow to use original values after the SCoP without any PHI nodes. This fixes bug 25447. llvm-svn: 252420	2015-11-08 17:57:41 +00:00
Johannes Doerfert	1dd6e37af4	Verify IR after we bail out The bail out in r252412 left the code generation without verifying the (so far) generated IR. This will change now and ensure we always run the verifier. Suggested-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 252419	2015-11-08 16:34:17 +00:00
Johannes Doerfert	c4898504ea	[FIX] Bail out if there is a dependence cycle between invariant loads While the program cannot cause a dependence cycle between invariant loads, additional constraints (e.g., to ensure finite loops) can introduce them. It is hard to detect them in the SCoP description, thus we will only check for them at code generation time. If such a recursion is detected we will bail out the code generation and place a "false" runtime check to guarantee the original code is used. This fixes bug 25443. llvm-svn: 252412	2015-11-07 19:46:04 +00:00
Johannes Doerfert	44483c5599	[FIX] Remove all invariant load occurences from own execution context llvm-svn: 252411	2015-11-07 19:45:27 +00:00
Michael Kruse	0651480b97	Fix non-affine region dominance of implicitely stored values After loop versioning, a dominance check of a non-affine subregion's exit node causes the dominance check to always fail on any block in the subregion if it shares the same exit block with the scop. The subregion's exit block has become polly_merge_new_and_old, which also receives the control flow of the generated code. This would cause that any value for implicit stores is assumed to be not from the scop. We check dominance with the generated exit node instead. This fixes llvm.org/PR25438 llvm-svn: 252375	2015-11-07 00:36:50 +00:00
Duncan P. N. Exon Smith	b8f58b53dd	polly/ADT: Remove implicit ilist iterator conversions, NFC Remove all the implicit ilist iterator conversions from polly, in preparation for making them illegal in ADT. There was one oddity I came across: at line 95 of lib/CodeGen/LoopGenerators.cpp, there was a post-increment `Builder.GetInsertPoint()++`. Since it was a no-op, I removed it, but I admit I wonder if it might be a bug (both before and after this change)? Perhaps it should be a pre-increment? llvm-svn: 252357	2015-11-06 22:56:54 +00:00
Michael Kruse	ddb6528ba6	Fix reuse of non-dominating synthesized value in subregion exit We were adding all generated values in non-affine subregions to be used for the subregions generated exit block. The thought was that only values that are dominating the original exit block can be used there. But it is possible for synthesizable values to be expanded in any block. If the same values is also used for implicit writes, it would try to reuse already synthesized values even if not dominating the exit block. The fix is to only add values to the list of values usable in the exit block only if it is dominating the exit block. This fixes llvm.org/PR25412. llvm-svn: 252301	2015-11-06 13:51:24 +00:00
Tobias Grosser	f1bfd75221	ScopInfo: Allocate globally unique memory access identifiers Before this commit memory reference identifiers have only been unique per basic block, but not per (non-affine) ScopStmt. This commit now uses the MemoryAccess base pointer to uniquely identify each Memory access. llvm-svn: 252200	2015-11-05 20:15:37 +00:00
Tobias Grosser	4bf262a0f2	RunTimeDebugBuilder: Allocate memory _after_ knowing how much is needed This fixes a memory corruption issue, where we accessed more memory than actually allocated. llvm-svn: 252197	2015-11-05 19:43:34 +00:00
Michael Kruse	27149cf32d	Use per-BB value maps for non-exit BBs For generating scalar writes of non-affine subregions, all except phi writes are generated in the exit block. The phi writes are generated in the incoming block for which we errornously used the same BBMap. This can conflict if a value for one block is synthesized, and then reused for another block which is not dominated by the first block. This is fixed by using block-specific BBMaps for phi writes. llvm-svn: 252172	2015-11-05 16:17:17 +00:00
Michael Kruse	f714d470d7	Fix escaping value to subregion entry node phi An incoming value from a block the is not inside the scop is an external use, even if the phi is inside the scop. A previous fix in r251208 did not apply if the phi is inside a non-affine subregion. We move the check for this phi case before the non-affine subregion check. llvm-svn: 252157	2015-11-05 13:18:43 +00:00
Johannes Doerfert	22892687f7	[FIX] Simplify and correct preloading of base pointer origin To simplify and correct the preloading of a base pointer origin, e.g., the base pointer for the current indirect invariant load, we now just check if there is an invariant access class that involves the base pointer of the current class. llvm-svn: 251962	2015-11-03 19:15:33 +00:00
Johannes Doerfert	eca9e890b9	Remove read-only statements from the SCoP We do not need to model read-only statements in the SCoP as they will not cause any side effects that are visible to the outside anyway. Removing them should safe us time and might even simplify the ASTs we generate. Differential Revision: http://reviews.llvm.org/D14272 llvm-svn: 251948	2015-11-03 16:54:49 +00:00
Johannes Doerfert	e071f6d637	[NFC] Name invariant load parameters after base pointer This just makes the debug output nices sometimes. llvm-svn: 251947	2015-11-03 16:49:59 +00:00
Johannes Doerfert	475d8e3f42	[FIX] Ensure base pointer origin was preloaded already If a base pointer of a preloaded value has a base pointer origin, thus it is an indirect invariant load, we have to make sure the base pointer origin is preloaded first. llvm-svn: 251946	2015-11-03 16:49:02 +00:00
Johannes Doerfert	d6fc0701ee	[FIX] Carefully rewrite parameters wrt. invariant equivalence classes ScalarEvolution doesn't allow the operands of an AddRec to be variant in the loop of the AddRec. When we rewrite parameter SCEVs it might seem like the new SCEV violates this property and ScalarEvolution will trigger an assertion. To avoid this we move the start part out of an AddRec when we rewrite it, thus avoid the operands to be possibly variant completely. llvm-svn: 251945	2015-11-03 16:47:58 +00:00
Johannes Doerfert	3181c2ef72	[FIX] Correctly update SAI base pointer If a base pointer load is preloaded, we have change the base pointer of the derived SAI. However, as the derived SAI relationship is is coarse grained, we need to check if we actually preloaded the base pointer or a different element of the base pointer SAI array. llvm-svn: 251881	2015-11-03 01:42:59 +00:00
Johannes Doerfert	dca2837b76	[FIX] Do not crash in the presence of infinite loops. llvm-svn: 251870	2015-11-03 00:28:07 +00:00
Johannes Doerfert	907456fe04	[FIX] Use appropriately sized types for big constants llvm-svn: 251869	2015-11-03 00:26:22 +00:00
Tobias Grosser	8286b83f97	ScopInfo: Bail out in case of mismatching array dimension sizes In some cases different memory accesses access the very same array using a different multi-dimensional array layout where the same dimensions have different sizes. Instead of asserting when encountering this issue, we gracefully bail out for this scop. This fixes llvm.org/PR25252 llvm-svn: 251791	2015-11-02 11:29:32 +00:00
Tobias Grosser	f423c1200f	Remove old and redundant options We remove -polly-detect-unprofitable and -polly-no-early-exit. Both have been superseeded by -polly-process-unprofitable and were only kept as aliases for our buildbots to continue to work. As all buildbots have been moved to the new options, we can now remove the old ones for good. llvm-svn: 251787	2015-11-02 10:13:32 +00:00
Tobias Grosser	3e9560200f	RegionGenerator: Clear local maps after statement construction These maps are only needed during the construction of a single region statement. Clearing them is important, as we otherwise get an assert in case some of the referenced values are erased before the RegionGenerator is deleted. llvm-svn: 251341	2015-10-26 20:41:53 +00:00
Tobias Grosser	bf45e74254	ScopDetect: Bail out for non-simple memory accesses Volatile or atomic memory accesses are currently not supported. Neither did we think about any special handling needed nor do we support the unknown instructions the alias set tracker turns them into sometimes. Before this patch, us not supporting unkown instructions in an alias set caused the following assertion failures: Assertion `AG.size() > 1 && "Alias groups should contain at least two accesses"' failed llvm-svn: 251234	2015-10-25 13:48:40 +00:00
Tobias Grosser	27e19a022e	Fix typo llvm-svn: 251231	2015-10-25 12:05:14 +00:00
Tobias Grosser	907090c37c	ScopDetection: Update DetectionContextMap accordingly When verifying if a scop is still valid we rerun all analysis, but did not update DetectionContextMap. This change ensures that information, e.g. about non-affine regions, is correctly updated llvm-svn: 251227	2015-10-25 10:55:35 +00:00
Tobias Grosser	51174cca30	ScopDetection: Do not crash if we find zero array size candidates for delinearization llvm-svn: 251226	2015-10-25 08:40:44 +00:00
Tobias Grosser	5528dcdf25	ScopDetection: Always refuse multi-dimensional memory accesses with 'undef' in the size expression. We previously only checked if the size expression is 'undef', but allowed size expressions of the form 'undef * undef' by accident. After this change we now require size expressions to be affine which implies no 'undef' appears anywhere in the expression. llvm-svn: 251225	2015-10-25 08:40:38 +00:00
Tobias Grosser	baffa091dd	ScopInfo: PHI-node uses in the EntryNode with an incoming BB that is not part of the Region are external. During code generation we split off the parts of the PHI nodes in the entry block, which have incoming blocks that are not part of the region. As these split-off PHI nodes then are external uses, we consequently also need to model these uses in ScopInfo. llvm-svn: 251208	2015-10-24 20:55:27 +00:00
Tobias Grosser	a3f6edaee1	BlockGenerator: Do not assert when finding model PHI nodes defined outside the scop Such PHI nodes can not only appear in the ExitBlock of the Scop, but indeed any scalar PHI node above the scop and used in the scop is modeled as scalar read access. llvm-svn: 251198	2015-10-24 19:01:09 +00:00
Tobias Grosser	27d742da59	BlockGenerator: Directly handle multi-exit PHI nodes This change adds code to directly code-generate multi-exit PHI nodes, instead of trying to reuse the EscapeMap infrastructure for this. Using escape maps adds a level of indirection that is hard to understand and - more importantly - breaks in certain cases. Specifically, the original code relied on simplifyRegion() to split the original PHI node in two PHI nodes, one merging the values coming from within the scop and a second that merges the first PHI node with the values that come from outside the scop. To generate code the first PHI node is then just handled like any other in-scop value that is used somewhere outside the scop. This fails for the case where all values from inside the scop are identical, as the first PHI node is in such cases automatically simplified and eliminated by LLVM right at construction. As a result, there is no instruction that can be pass to the EscapeMap handling, which means the references in the second PHI node are not updated and may still reference values from within the original scop that do not dominate it. Our new code iterates directly over all modeled ScopArrayInfo objects that represent multi-exit PHI nodes and generates code for them without relying on the EscapeMap infrastructure. Hence, it works also for the case where the first PHI node is eliminated. llvm-svn: 251191	2015-10-24 17:41:29 +00:00
Tobias Grosser	c73d8b0e18	ScopInfo: Drop unnecessary code This case has already been taken care of in r250622 and was then accidentally again committed in 250625. llvm-svn: 251156	2015-10-23 22:36:22 +00:00
Johannes Doerfert	654c3284f4	[FIX] Do not hoist nested variant base pointers This fixes bug 25249. llvm-svn: 250958	2015-10-21 22:14:57 +00:00
Tobias Grosser	ca7f5bb767	Full/partial tile separation for vectorization We isolate full tiles from partial tiles to be able to, for example, vectorize loops with parametric lower and/or upper bounds. If we use -polly-vectorizer=stripmine, we can see execution-time improvements: correlation from 1m7361s to 0m5720s (-67.05 %), covariance from 1m5561s to 0m5680s (-63.50 %), ary3 from 2m3201s to 1m2361s (-46.72 %), CrystalMk from 8m5565s to 7m4285s (-13.18 %). The current full/partial tile separation increases compile-time more than necessary. As a result, we see in compile time regressions, for example, for 3mm from 0m6320s to 0m9881s (56.34%). Some of this compile time increase is expected as we generate more IR and consequently more time is spent in the LLVM backends. However, a first investiagation has shown that a larger portion of compile time is unnecessarily spent inside Polly's parallelism detection and could be eliminated by propagating existing knowledge about vector loop parallelism. Before enabling -polly-vectorizer=stripmine by default, it is necessary to address this compile-time issue. Contributed-by: Roman Gareev <gareevroman@gmail.com> Reviewers: jdoerfert, grosser Subscribers: grosser, #polly Differential Revision: http://reviews.llvm.org/D13779 llvm-svn: 250809	2015-10-20 09:12:21 +00:00
Michael Kruse	dc12222287	Synthesize phi arguments in incoming block New values were always synthesized in the block of the instruction that needed them. This is incorrect for PHI node whose' value must be defined in the respective incoming block. This patch temporarily moves the builder's insert point to the incoming block while synthesizing phi node arguments. This fixes PR25241 (http://llvm.org/bugs/show_bug.cgi?id=25241) llvm-svn: 250693	2015-10-19 09:19:25 +00:00
Johannes Doerfert	9c28bfa72c	[FIX] Only constant integer branch conditions are always affine There are several different kinds of constants that could occur in a branch condition, however we can only handle the most interesting one namely constant integers. To this end we have to treat others as non-affine. This fixes bug 25244. llvm-svn: 250669	2015-10-18 22:56:42 +00:00
Johannes Doerfert	30c2265f98	[FIX] Normalize loops outside the SCoP during schedule generation We build the schedule based on a traversal of the region and accumulate information for each loop in it. The total schedule is associated with the loop surrounding the SCoP, though it can happen that there are blocks in the SCoP which are part of loops that are only partially in the SCoP. Instead of associating information with them (they are not part of the SCoP and consequently are not modeled) we have to associate the schedule information with the surrounding loop if any. This fixes bug 25240. llvm-svn: 250668	2015-10-18 21:17:11 +00:00
Johannes Doerfert	b864c2c3c9	[FIX] Do not try to hoist "empty" accesses Accesses that have a relative offset (in bytes) that is not divisible by the type size (in bytes) will be represented as empty in the SCoP description. This is on its own not good but it also crashed the invariant load hoisting. This patch will fix the latter problem while the former should be addressed too. This fixes bug 25236. llvm-svn: 250664	2015-10-18 19:50:18 +00:00
Johannes Doerfert	bc7cff4c18	[FIX] Do not hoist invariant pointers with non-loaded base ptr in SCoP If the base pointer of a load is invariant and defined in the SCoP but not loaded we cannot hoist the load as we would not hoist the base pointer definition. This fixes bug 25237. llvm-svn: 250663	2015-10-18 19:49:25 +00:00
Johannes Doerfert	af3e301a67	[FIX] Restructure invariant load equivalence classes Sorting is replaced by a demand driven code generation that will pre-load a value when it is needed or, if it was not needed before, at some point determined by the order of invariant accesses in the program. Only in very little cases this demand driven pre-loading will kick in, though it will prevent us from generating faulty code. An example where it is needed is shown in: test/ScopInfo/invariant_loads_complicated_dependences.ll Invariant loads that appear in parameters but are not on the top-level (e.g., the parameter is not a SCEVUnknown) will now be treated correctly. Differential Revision: http://reviews.llvm.org/D13831 llvm-svn: 250655	2015-10-18 12:39:19 +00:00
Johannes Doerfert	d8b6ad255f	[FIX] Cast preloaded values Preloaded values have to match the type of their counterpart in the original code and not the type of the base array. llvm-svn: 250654	2015-10-18 12:36:42 +00:00
Johannes Doerfert	01978cfa0c	Remove independent blocks pass Polly can now be used as a analysis only tool as long as the code generation is disabled. However, we do not have an alternative to the independent blocks pass in place yet, though in the relevant cases this does not seem to impact the performance much. Nevertheless, a virtual alternative that allows the same transformations without changing the input region will follow shortly. llvm-svn: 250652	2015-10-18 12:28:00 +00:00
Tobias Grosser	b8d27aab7d	Revert to original BlockGenerator::getOrCreateAlloca(MemoryAccess &Access) Expressing this in terms of BlockGenerator::getOrCreateAlloca(const ScopArrayInfo *Array) does not work as the MemoryAccess BasePtr is in case of invariant load hoisting different to the ScopArrayInfo BasePtr. Until this is investigated and fixed, we move back to code that just uses the baseptr of MemoryAccess. llvm-svn: 250637	2015-10-18 00:51:13 +00:00
Tobias Grosser	a4f0988df5	BlockGenerator: Add getOrCreateAlloca(const ScopArrayInfo *Array) This allows the caller to get the alloca locations of an array without the need to thank if Array is a PHI or a non-PHI Array. We directly make use of this in BlockGenerator::getOrCreateAlloca(MemoryAccess &Access). llvm-svn: 250628	2015-10-17 22:16:00 +00:00
Tobias Grosser	05d7fa79b6	Format comment properly While clang-format takes care that the line-length is not surpassed, the resulting comments sometimes look not optimal. We re-flow the text in the comment to avoid these ugly single-word lines. llvm-svn: 250626	2015-10-17 21:46:28 +00:00
Michael Kruse	225f0d1ee2	Load/Store scalar accesses before/after the statement itself Instead of generating implicit loads within basic blocks, put them before the instructions of the statment itself, including non-affine subregions. The region's entry node is dominating all blocks in the region and therefore the loaded value will be available there. Implicit writes in block-stmts were already stored back at the end of the block. Now, also generate the stores of non-affine subregions when leaving the statement, i.e. in the exiting block. This change is required for array-mapped implicits ("De-LICM") to ensure that there are no dependencies of demoted scalars within statments. Statement load all required values, operator on copied in registers, and then write back the changed value to the demoted memory. Lifetimes analysis within statements becomes unecessary. Differential Revision: http://reviews.llvm.org/D13487 llvm-svn: 250625	2015-10-17 21:36:00 +00:00
Michael Kruse	01cb379fed	Avoid unnecessay .s2a write access when used only in PHIs Accesses for exit node phis will be handled separately by buildPHIAccesses if there is more than one exiting edge, buildScalarDependences does not need to create additional SCALAR accesses. This is a corrected version of r250517, which was reverted in r250607. Differential Revision: http://reviews.llvm.org/D13848 llvm-svn: 250622	2015-10-17 21:07:08 +00:00
Tobias Grosser	ffa2446f28	BlockGenerator: Register outside users of scalars directly Instead of checking at code generation time for each ScopStmt if a scalar has external uses, we just iterate over the ScopArrayInfo descriptions we have and check each of these for possible external uses. Besides being somehow clearer, this approach has the benefit that we will always create valid LLVM-IR even in case we disable the code generation of ScopStmt bodies e.g. for testing purposes. llvm-svn: 250608	2015-10-17 08:54:13 +00:00
Tobias Grosser	3839b422e6	Revert "Avoid unnecessay .s2a write access when used only in PHIs" This reverts commit r250606 due to some bugs it introduced. After these bugs have been resolved, we will add it back to tree. llvm-svn: 250607	2015-10-17 08:54:05 +00:00
Tobias Grosser	26e59ee746	Drop unused parameter from handleOutsideUsers llvm-svn: 250606	2015-10-17 08:25:54 +00:00

... 2 3 4 5 6 ...

1659 Commits