llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	a1f291ed18	Rename the DataLayout member [NFC] llvm-svn: 259502	2016-02-02 14:15:13 +00:00
Johannes Doerfert	a87810351c	Remove helper function [NFC] llvm-svn: 259501	2016-02-02 14:14:40 +00:00
Johannes Doerfert	c36d39b7a8	Remove unnecessary getAnalysis call [NFC] llvm-svn: 259500	2016-02-02 14:14:20 +00:00
Tobias Grosser	cc77950b2b	ScopInfo: Rename AccessType -> ElementType to unify naming with method declaration llvm-svn: 259495	2016-02-02 13:22:54 +00:00
Tobias Grosser	f5e239c4c3	www: Mark full/partial tile separation for vectorization as 'done' Roman contributed this llvm-svn: 259486	2016-02-02 09:55:59 +00:00
Tobias Grosser	0720570eee	www: Update statistic code llvm-svn: 259485	2016-02-02 09:55:55 +00:00
Tobias Grosser	6c2ae4592d	www: update formatting of polyhedral.info news llvm-svn: 259483	2016-02-02 09:30:01 +00:00
Tobias Grosser	542625eb6e	www: Fix overlapping lines II llvm-svn: 259477	2016-02-02 08:49:34 +00:00
Tobias Grosser	0dcafe94f8	Avoid overlapping lines llvm-svn: 259476	2016-02-02 08:47:56 +00:00
Tobias Grosser	c2fd8b411d	ScopInfo: Correct schedule construction For schedule generation we assumed that the reverse post order traversal used by the domain generation is sufficient, however it is not. Once a loop is discovered, we have to completely traverse it, before we can generate the schedule for any block/region that is only reachable through a loop exiting block. To this end, we add a "loop stack" that will keep track of loops we discovered during the traversal but have not yet traversed completely. We will never visit a basic block (or region) outside the most recent (thus smallest) loop in the loop stack but instead queue such blocks (or regions) in a waiting list. If the waiting list is not empty and (might) contain blocks from the most recent loop in the loop stack the next block/region to visit is drawn from there, otherwise from the reverse post order iterator. We exploit the new property of loops being always completed before additional loops are processed, by removing the LoopSchedules map and instead keeping all information in LoopStack. This clarifies that we indeed always only keep a stack of in-process loops, but will never keep incomplete schedules for an arbitrary set of loops. As a result, we can simplify some of the existing code. This patch also adds some more documentation about how our schedule construction works. This fixes http://llvm.org/PR25879 This patch is a modified version of Johannes Doerfert's initial fix. Differential Revision: http://reviews.llvm.org/D15679 llvm-svn: 259354	2016-02-01 11:54:13 +00:00
Tobias Grosser	ccbe383594	ScopInfo: Drop dead code in schedule description In https://llvm.org/svn/llvm-project/polly/trunk@251870 code was committed to avoid a failure in the presence of infinite loops, but the test case committed along with this change passes without the actual change. I looked back into the code and also checked with the original committer (Johannes), but could not find the reason why the code is needed. The introduction of LoopStacks for buildSchedule in one of the next commits will make it even more clear that this code is not needed, but I remove this ahead of time to facilitate bisecting in case I missed something. llvm-svn: 259347	2016-02-01 10:07:43 +00:00
Tobias Grosser	0dd4a9a9f4	ScopInfo: use std::distance to shorten code [NFC] llvm-svn: 259337	2016-02-01 01:55:08 +00:00
Craig Topper	7fb6e47101	Replace utostr_32 use with utostr to match removal from llvm. llvm-svn: 259333	2016-01-31 20:36:20 +00:00
Tobias Grosser	ed873cd3c1	cmake: Add additional libraries to fix the darwin shared library build darwin requires the additional linkages of... LLVMBitReader LLVMMCParser LLVMObject LLVMProfileData LLVMTarget LLVMVectorize as darwin requires all of the weak undefined symbols in a library to be resolved when linking it against an executable (unless -Wl,-undefined,dynamic_lookup is used to override the default behavior of -Wl,-undefined,error). Contributed-by: Jack Howarth llvm-svn: 259332	2016-01-31 20:25:46 +00:00
Michael Kruse	26311f00e8	Remove autotools build system The autotools build system is based on and requires LLVM's autotools build system to work, which has been deprecated and finally removed in r258861. Consequently we also remove the autotools build system from Polly. Differential Revision: http://reviews.llvm.org/D16655 llvm-svn: 259041	2016-01-28 12:00:33 +00:00
Michael Kruse	fd46308de4	ScopInfo: Never add read accesses for synthesizable values Before adding a MK_Value READ MemoryAccess, check whether the read is necessary or synthesizable. Synthesizable values are later generated by the SCEVExpander and therefore do not need to be transferred explicitly. This can happen because the check for synthesizability has presumably been forgotten in the case where a phi's incoming value has been defined in a different statement. Differential Revision: http://reviews.llvm.org/D15687 llvm-svn: 258998	2016-01-27 22:51:56 +00:00
Michael Kruse	70131d3416	Introduce MemAccInst helper class; NFC MemAccInst wraps the common members of LoadInst and StoreInst. Also use of this class in: - ScopInfo::buildMemoryAccess - BlockGenerator::generateLocationAccessed - ScopInfo::addArrayAccess - Scop::buildAliasGroups - Replace every use of polly::getPointerOperand Reviewers: jdoerfert, grosser Differential Revision: http://reviews.llvm.org/D16530 llvm-svn: 258947	2016-01-27 17:09:17 +00:00
Michael Kruse	ee6a4fc680	Unique phi write accesses Ensure that there is at most one phi write access per PHINode and ScopStmt. In particular, this would be possible for non-affine subregions with multiple exiting blocks. We replace multiple MAY_WRITE accesses by one MUST_WRITE access. The written value is constructed using a PHINode of all exiting blocks. The interpretation of the PHI WRITE's "accessed value" changed from the incoming value to the PHI like for PHI READs since there is no unique incoming value. Because region simplification shuffles around PHI nodes -- particularly with exit node PHIs -- the PHINodes present at analysis time do not always exist anymore in the code generation pass. We instead remember the incoming block/value pair in the MemoryAccess. Differential Revision: http://reviews.llvm.org/D15681 llvm-svn: 258809	2016-01-26 13:33:27 +00:00
Michael Kruse	ad28e5a589	Unique value read accesses Keep at most one value read MemoryAccess per value and statement; multiple generated loads do not have any additional effect. As one such MemoryAccess can cater multiple uses within the statement, the AccessInstruction property is not unique any more and set to nullptr. Differential Revision: http://reviews.llvm.org/D15510 llvm-svn: 258808	2016-01-26 13:33:15 +00:00
Michael Kruse	436db620e7	Unique value write accesses Ensure there is at most one write access per definition of an llvm::Value. Keep track of already created value write access by using a (dense) map. Replace addValueWriteAccess by ensureValueStore which can be used more liberally without worrying about adding redundant accesses. It will be used, e.g. in a logical correspondent for value reads -- ensureValueReload -- to ensure that the expected definition has been written when loading it. Differential Revision: http://reviews.llvm.org/D15483 llvm-svn: 258807	2016-01-26 13:33:10 +00:00
Johannes Doerfert	6f50c29ab2	[FIX] Domain generation error due to loops in non-affine regions llvm-svn: 258803	2016-01-26 11:03:25 +00:00
Johannes Doerfert	432658d7b8	[FIX] Build correct domain for non-affine region SCoPs llvm-svn: 258802	2016-01-26 11:01:41 +00:00
Tobias Grosser	f2cdd144e5	BlockGenerators: Replace getNewScalarValue with getNewValue Both functions implement the same functionality, with the difference that getNewScalarValue assumes that globals and out-of-scop scalars can be directly reused without loading them from their corresponding stack slot. This is correct for sequential code generation, but causes issues with outlining code e.g. for OpenMP code generation. getNewValue handles such cases correctly. Hence, we can replace getNewScalarValue with getNewValue. This is not only more future proof, but also eliminates a bunch of code. The only functionality that was available in getNewScalarValue that is lost is the on-demand creation of scalar values. However, this is not necessary any more as scalars are always loaded at the beginning of each basic block and will consequently always be available when scalar stores are generated. As this was not the case in older versions of Polly, it seems the on-demand loading is just some older code that has not yet been removed. Finally, generateScalarLoads also generated loads for values that are loop invariant, available in GlobalMap and which are preferred over the ones loaded in generateScalarLoads. Hence, we can just skip the code generation of such scalar values, avoiding the generation of dead code. Differential Revision: http://reviews.llvm.org/D16522 llvm-svn: 258799	2016-01-26 10:01:35 +00:00
Tobias Grosser	232905089e	test: Name instructions in a test case [NFC] llvm-svn: 258662	2016-01-24 17:51:37 +00:00
Tobias Grosser	5c7f16be6b	BlockGenerators: Avoid redundant map lookup [NFC] llvm-svn: 258660	2016-01-24 14:16:59 +00:00
Tobias Grosser	c9abde8c51	ScopInfo: Simplify code by folding definition into if llvm-svn: 258632	2016-01-23 20:23:06 +00:00
Tobias Grosser	1c3a6d7808	ScopDetection: Do not detect regions with irreducible control as scops Polly currently does not support irreducible control and it is probably not worth supporting. This patch adds code that checks for irreducible control and refuses regions containing irreducible control. Polly traditionally had rather restrictive checks on the control flow structure which would have refused irregular control, but within the last couple of months most of the control flow restrictions have been removed. As part of this generalization we accidentally allowed irregular control flow. Contributed-by: Karthik Senthil and Ajith Pandel llvm-svn: 258497	2016-01-22 09:44:37 +00:00
Tobias Grosser	b3a9538e95	Remove irreducible control flow from test case The test case we look at does not necessarily require irreducible control flow, but a normal loop is sufficient to create a non-affine region containing more than one basic block that dominates the exit node. We replace this irreducible control flow with a normal loop for the following reasons: 1) This is easier to understand 2) We will subsequently commit a patch that ensures Polly does not process irreducible control flow. Within non-affine regions, we could possibly handle irreducible control flow. llvm-svn: 258496	2016-01-22 09:33:33 +00:00
Tobias Grosser	40038d2d1e	www: Update links to LLVM bug tracker Polly recently got its own product in LLVM's bug tracker, which will make it easier for people to file Polly bugs. This change updates the bugtracker links on the Polly website. llvm-svn: 258494	2016-01-22 08:19:54 +00:00
Sumanth Gundapaneni	4b1472fb2b	Fix the error in non-asserts Release mode build. NFC llvm-svn: 258318	2016-01-20 15:41:30 +00:00
Johannes Doerfert	370cf00c9f	Make sure we preserve alignment information after hoisting invariant load In Polly, after hoisting loop-invariant loads outside the loop, the alignment information for the hoisted loads is missing; this patch restores it. Contributed-by: Lawrence Hu <lawrence@codeaurora.org> Differential Revision: http://reviews.llvm.org/D16160 llvm-svn: 258105	2016-01-19 00:17:21 +00:00
Michael Kruse	dc8508e72a	Do not check JSON alignment of scalar accesses When importing a schedule, do not verify the load/store alignment of scalar accesses. Scalar loads/stores are always created newly in code generation with no alignment restrictions. Previously, scalar alignment was checked if the access instruction happened to be a LoadInst or StoreInst, but only its array (MK_Array) access is relevant. This will be implicitly unit-tested when the access instruction of a value read can be nullptr. Differential Revision: http://reviews.llvm.org/D15680 llvm-svn: 257904	2016-01-15 16:49:33 +00:00
Michael Kruse	959a8dc39f	Update to ISL 0.16.1 llvm-svn: 257898	2016-01-15 15:54:45 +00:00
Michael Kruse	f7d5e40860	Add option to update-isl.sh for specific commit This is used e.g. to update specifically to ISL 0.16.1 although newer revisions exist in the official repository. llvm-svn: 257895	2016-01-15 15:41:59 +00:00
Michael Kruse	5a9a65e43f	Prepare unit tests for update to ISL 0.16 ISL 0.16 will change how sets are printed which breaks 117 unit tests that text-compare printed sets. This patch re-formats most of these unit tests using a script and small manual editing on top of that. When actually updating ISL, most work is done by just re-running the script to adapt to the changed output. Some tests that compare IR and tests with single CHECK-lines that can be easily updated manually are not included here. The re-format script will also be committed afterwards. The per-test formatter invocation command-line options will not be added in the near future because it is ad hoc and would overwrite the manual edits. Ideally it also shouldn't be required anymore because ISL's set printing has become more stable in 0.16. Differential Revision: http://reviews.llvm.org/D16095 llvm-svn: 257851	2016-01-15 00:48:42 +00:00
Roman Gareev	b0c4e49a37	Fix of r257495. Remove redundant "FPM->add(createDemoteRegisterToMemoryPass());" llvm-svn: 257514	2016-01-12 20:47:48 +00:00
Roman Gareev	6ebc01c973	We do not need to schedule another loop interchange pass after Polly, as Polly should perform loop interchanges itself. This also fixes a bug we see due to the "loop-interchange" pass producing incorrect IR when compiling linpack-pc.c from the LLVM test-suite with "-polly-position=before-vectorizer". Reviewed-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 257495	2016-01-12 17:59:06 +00:00
Roman Gareev	10595a1739	Call assumeNoOutOfBound only in updateDimensionality Call assumeNoOutOfBound only in updateDimensionality to process situations when new dimensions are added and new bounds checks are required. Contributed-by: Tobias Grosser, Gareev Roman llvm-svn: 257170	2016-01-08 14:01:59 +00:00
Tobias Grosser	8362c26113	Define buildScheduleRec on RegionNodes and pull out the tree traversal [NFC] This change clarifies that for Not-NonAffine-SubRegions we actually iterate over the subnodes and for both NonAffine-SubRegions and BasicBlocks, we perform the schedule construction. As a result, the tree traversal becomes trivial, the special case for a scop consisting just of a single non-affine region disappears and the indentation of the code is reduced. No functional change intended. llvm-svn: 256940	2016-01-06 15:30:06 +00:00
Johannes Doerfert	f9711ef922	Extract constant parts of the schedule generation [NFC] llvm-svn: 256931	2016-01-06 12:59:23 +00:00
Tobias Grosser	05e71b9f61	ScopInfo: use getStmtForRegionNode to simplify code slightly llvm-svn: 256914	2016-01-06 05:18:20 +00:00
Tobias Grosser	c28ae257c0	TODO: Polly can handle boolean expressions (Open->Done) The necessary support was committed by Johannes in r249971. llvm-svn: 256826	2016-01-05 11:48:59 +00:00
Tobias Grosser	100ef6b30c	TODO: We do not use -independent-blocks any more (Open -> Done) llvm-svn: 256825	2016-01-05 11:45:26 +00:00
Tobias Grosser	ce9ef39c10	IslExprBuilder: Provide PointerLikeTypeTraits for isl_id Providing an explicit PointerLikeTypeTraits implementation became necessary since LLVM started in https://llvm.org/svn/llvm-project/llvm/trunk@256620 to automatically derive the pointer alignment from the pointer element type, which does not work for incomplete types as used by isl. To ensure our code still compiles, we provide an instantiation of PointerLikeTypeTraits for isl_id which assumes no minimal alignment. isl pointers are likely to have a "higher" alignment. We can exploit this later in case this becomes performance relevant. llvm-svn: 256650	2015-12-30 20:11:48 +00:00
Tobias Grosser	5a90a985b9	ScopDetection: Add include to make complete type of llvm::Region available This is necessary for the recent ptr-traits work in LLVM, which now requires complete type definitions. llvm-svn: 256649	2015-12-30 20:11:43 +00:00
Michael Kruse	58fa3bb63a	Do not store scalar accesses in InstructionToAccess At code generation, scalar reads are generated before the other statement's instructions, respectively scalar writes after them, in contrast to array accesses which are "executed" with the instructions they are linked to. Therefore it makes sense to not map the scalar accesses to a place of execution. Follow-up patches will also remove some of the direct links from a scalar access to a single instruction, such that only having array accesses in InstructionToAccess ensures consistency. Differential Revision: http://reviews.llvm.org/D13676 llvm-svn: 256298	2015-12-22 23:25:11 +00:00
Johannes Doerfert	5dced2693e	Refactor canSynthesize in the BlockGenerators [NFC] llvm-svn: 256269	2015-12-22 19:08:49 +00:00
Johannes Doerfert	28f8ac1db2	Treat inline assembly as a constant in the code generation. llvm-svn: 256267	2015-12-22 19:08:24 +00:00
Johannes Doerfert	42df8d1db6	Reduce indentation in BlockGenerator::trySynthesizeNewValue [NFC] llvm-svn: 256266	2015-12-22 19:08:01 +00:00
Tobias Grosser	c6424ae46c	ScopDetection: Simplify std::distance(....) to BB->size() Suggested by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 256260	2015-12-22 17:38:59 +00:00
Tobias Grosser	fcabb155c1	BlockGenerators: Remove unnecessary const_cast llvm-svn: 256227	2015-12-22 01:41:25 +00:00
Tobias Grosser	c900633d60	ScopInfo: Small improvement to schedule construction [NFC] We clarify that certain code is only executed if LSchedule is != nullptr. Previously some of these functions have been executed, but they only passed a nullptr through. This caused some confusion when reading the code. llvm-svn: 256209	2015-12-21 23:01:53 +00:00
Tobias Grosser	cbf7ae8fef	ScopInfo: Polish the implementation of mapToDimension Besides improving the documentation and the code we now assert in case the input is invalid (N < 0) and also do not any more return a nullptr in case USet is empty. This should make the code more readable. llvm-svn: 256208	2015-12-21 22:45:53 +00:00
Tobias Grosser	c1a269bf0e	Add option to assume single-loop scops with sufficient compute are profitable If a loop has a sufficiently large number of compute instructions in its loop body, it is unlikely that our rewrite of the loop iterators introduces large performance changes. As Polly can also apply beneficial optimizations (such as parallelization) to such loop nests, we mark them as profitable. This option is currently "disabled" by default, but can be used to run experiments. If enabled by setting it e.g. to 40 instructions, we currently see some compile-time increases on LNT without any significant run-time changes. llvm-svn: 256199	2015-12-21 21:00:43 +00:00
Tobias Grosser	5624d3c978	Adjust formatting to clang-format changes in 256149 llvm-svn: 256151	2015-12-21 12:38:56 +00:00
Tobias Grosser	97fc5bb7f7	ScopDetect: Extract profitability check into subfunction .. and add some documentation. We also simplify the code by dropping an early check that is also covered by the later checks. This might have a small compile time impact, but as the scops that are skipped are small we should probably only add this back in the unlikely case that this has a notable compile-time cost. No functional change intended. llvm-svn: 256149	2015-12-21 12:14:48 +00:00
Tobias Grosser	594882573e	ScopInfo: Return immediately if scop is unprofitable and marked invalid As we already log an error when calling invalid, unprofitable scops are in any case marked invalid, but returning immediately saves (a tiny bit of) compile time and is consistent with our use of 'invalid' in the remainder of the file. Found by inspection. llvm-svn: 256140	2015-12-21 09:09:44 +00:00
Tobias Grosser	bfaf1ae309	ScopInfo: Return in case we found an invalid array size Without this return we still log the incorrect array size (and do not detect this scop), but we would unnecessarily continue to verify that access functions are affine. As we do not need to do this, we can return right ahead and consequently save compile time. This issue was found by inspection. llvm-svn: 256139	2015-12-21 09:09:39 +00:00
Tobias Grosser	949e8c6ac6	ScopInfo: Check for the existence of a single memory access Instead of counting all array memory accesses associated with a load instruction, we now explicitly check that the single array access that could (potentially) be associated with a load instruction does not exist. This helps to document the current behavior of Polly where load instructions can indeed have at most one associated array access. In the unlikely case this changes in the future, we add an assert for the case where two load accesses would prevent us from returning a single memory access, but we still should communicate that not all array memory accesses have been removed. This addresses post-commit comments from Johannes Doerfert for commit 255776. llvm-svn: 256136	2015-12-21 07:10:39 +00:00
Johannes Doerfert	30e2307f61	[FIX] Schedule generation for block exiting multiple loops. This fixes bug PR25604. llvm-svn: 256125	2015-12-20 17:12:22 +00:00
Michael Kruse	8fc2896ee0	Compile fix: Use "&&" operator instead of "and" llvm-svn: 256124	2015-12-20 14:42:32 +00:00
Tobias Grosser	75dc40c3be	ScopInfo: Bail out in case of complex branch structures Scops that contain many complex branches are likely to result in complex domain conditions that consist of a large (> 100) number of conjuncts. Transforming such domains is expensive and unlikely to result in efficient code. To avoid long compile times we detect this case and skip such scops. In the future we may improve this by either using non-affine subregions to hide such complex condition structures or by exploiting in certain cases properties (e.g., dominance) that allow us to construct the domains of a scop in a way that results in a smaller number of conjuncts. Example of a code that results in complex iteration spaces: [CFG diagram: loop.header followed by chained diamond-shaped branches through blocks A0-A4, B0-B4 and C0-C4, ending at the loop backedge] llvm-svn: 256123	2015-12-20 13:31:48 +00:00
Roman Gareev	22803d4488	Fix of a comment. llvm-svn: 255923	2015-12-17 20:47:10 +00:00
Roman Gareev	8aa437503c	Fix delinearization of fortran arrays The patch fixes Bug 25759 produced by inappropriate handling of unsigned maximum SCEV expressions by SCEVRemoveMax. Without a fix, we get an infinite loop and a segmentation fault, if we try to process, for example, '((-1 + (-1 * %b1)) umax {(-1 + (-1 * %yStart)),+,-1}<%.preheader>)'. It also fixes a potential issue related to signed maximum SCEV expressions. Tested-by: Roman Gareev <gareevroman@gmail.com> Fixed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: http://reviews.llvm.org/D15563 llvm-svn: 255922	2015-12-17 20:37:17 +00:00
Tobias Grosser	deb4abb409	Fix formatting llvm-svn: 255892	2015-12-17 13:04:30 +00:00
Tobias Grosser	eb2eebe486	ScopGraphPrinter: Only show functions that contain at least one scop When running 'clang -O3 -mllvm -polly -mllvm -polly-show' we now only show the CFGs of functions with at least one detected scop. For larger files/projects this reduces the number of graphs printed significantly and is likely what developers want to see. The new option -polly-view-all enforces all graphs to be printed and the existing option -polly-view-only limits the graph printing to functions that match a certain pattern. This patch requires https://llvm.org/svn/llvm-project/llvm/trunk@255889 (and vice versa) to compile correctly. llvm-svn: 255891	2015-12-17 12:55:26 +00:00
Tobias Grosser	10120189ab	ScopInfo: Directly store MemoryAccessList in InstructionToAccess This avoids the need for explicit memory management, simplifies the code and also fixes a memory leak in removeMemoryAccesses. llvm-svn: 255777	2015-12-16 16:14:03 +00:00
Tobias Grosser	2ed317383b	ScopInfo: Introduce getNumberOfArrayAccesses Use the new function to clarify that we indeed only want to know if at least one array access is associated with an instruction. llvm-svn: 255776	2015-12-16 16:14:00 +00:00
Tobias Grosser	04d4964462	ScopInfo: Delete code that has been made dead by previous commits In case this functionality is needed in later patches, it should probably be (re)added by these patches. llvm-svn: 255717	2015-12-15 23:50:09 +00:00
Tobias Grosser	35ec5fbb8c	ScopInfo: Use getArrayAccessFor in reduction detection Load instructions may possibly be related to multiple memory accesses, but we are only interested in the array read access that describes the memory location the load instruction loads from. By using getArrayAccessFor we ensure to always obtain the right memory access. This issue was found by inspection without having a failing test case. llvm-svn: 255716	2015-12-15 23:50:04 +00:00
Tobias Grosser	184a4926b3	BlockGenerator: Use getArrayAccessFor for vector code generation getAccessFor does not guarantee a certain access to be returned in case an instruction is related to multiple accesses. However, in the vector code generation we want to know the stride of the array access of a store instruction. By using getArrayAccessFor we ensure we always get the correct memory access. This patch fixes a potential bug, but I was unable to produce a failing test case. Several existing test cases cover this code, but all of them already passed by luck (or the specific but not-guaranteed order in which we build memory accesses). llvm-svn: 255715	2015-12-15 23:50:01 +00:00
Tobias Grosser	a69d4f0d83	VectorBlockGenerator: Generate scalar loads for vector statements When generating scalar loads/stores separately the vector code has not been updated. This commit adds code to generate scalar loads for vector code as well as code to assert in case scalar stores are encountered within a vector loop. llvm-svn: 255714	2015-12-15 23:49:58 +00:00
Tobias Grosser	0921477248	ScopInfo: Look up first (and only) array access When rewriting the access functions of load/store statements, we are only interested in the actual array memory location. The current code just took the very first memory access, which could be a scalar or an array access. As a result, we failed to update access functions even though this was requested via .jscop. llvm-svn: 255713	2015-12-15 23:49:53 +00:00
Michael Kruse	5bbc0e1888	Fix typos; NFC llvm-svn: 255580	2015-12-14 23:41:32 +00:00
Tobias Grosser	9bd0dad926	BlockGenerator: Do not use fast-path for external constants This change should not change the behavior of Polly today, but it allows external constants to be remapped e.g. when targeting multiple LLVM modules. llvm-svn: 255506	2015-12-14 16:19:59 +00:00
Tobias Grosser	6f764bbd9c	BlockGenerator: Drop unneeded const_casts llvm-svn: 255505	2015-12-14 16:19:54 +00:00
Tobias Grosser	f4f6870ff2	Revert "Always treat scalar writes as MUST_WRITEs" This reverts commit r255471. Johannes raised in the post-commit review of r255471 the concern that PHI writes in non-affine regions with two exiting blocks are not really MUST_WRITE, but we just know that at least one out of the set of all possible PHI writes will be executed. Modeling all PHI nodes as MUST_WRITEs is probably safe, but adding the needed documentation for such a special case is probably not worth the effort. Michael will be proposing a new patch that ensures only a single PHI_WRITE is created for non-affine regions, which - besides other benefits - should also allow us to use a single well-defined MUST_WRITE for such PHI writes. (This is not a full revert, but the condition and documentation have been slightly extended) llvm-svn: 255503	2015-12-14 15:05:37 +00:00
Michael Kruse	e0d135c536	Add unit test for r255473 Check that memory accesses in non-affine regions that are always executed are MUST_WRITE. llvm-svn: 255500	2015-12-14 14:53:30 +00:00
Michael Kruse	34e1122a0d	Rename addScalar(Read|Write)Access to addValue(Read|Write)Access Adapt the method names to the new scheme introduced in r255467. llvm-svn: 255474	2015-12-13 22:47:43 +00:00
Michael Kruse	e934725f1d	Check guaranteed execution by using DominatorTree Before this commit, only the region's entry block was assumed to always execute in a non-affine subregion. We replace this by a test whether it dominates the exit block (this necessarily includes the entry block) which should be more accurate. llvm-svn: 255473	2015-12-13 22:10:40 +00:00
Michael Kruse	daf669418c	Store DominatorTree as a field in ScopInfo This harmonizes DT with the other analyses in ScopInfo and makes it available for use in its methods. llvm-svn: 255472	2015-12-13 22:10:37 +00:00
Michael Kruse	b06e3029d1	Always treat scalar writes as MUST_WRITEs LLVM's IR guarantees that a value definition occurs before any use, and also the value of a PHI must be one of the incoming values, "written" in one of the incoming blocks. Hence, such writes are never conditional in the context of a non-affine subregion. llvm-svn: 255471	2015-12-13 22:10:32 +00:00
Tobias Grosser	29f38ab732	ScopInfo: Split out invariant load hoisting into multiple functions [NFC] This reduces indentation and makes the code more readable. llvm-svn: 255468	2015-12-13 21:00:40 +00:00
Tobias Grosser	a535dff471	ScopInfo: Harmonize the different array kinds Over time different vocabulary has been introduced to describe the different memory objects in Polly, resulting in different - often inconsistent - naming schemes in different parts of Polly. We now standardize this to the following scheme: KindArray, KindValue, KindPHI, KindExitPHI (the last three are covered by isScalar). In most cases this naming scheme has already been used previously (this minimizes changes and ensures we remain consistent with previous publications). The main change is that we remove KindScalar to clarify the difference between a scalar as a memory object of kind Value, PHI or ExitPHI and a value (former KindScalar) which is a memory object modeling a llvm::Value. We also move all documentation to the Kind* enum in the ScopArrayInfo class, remove the second enum in the MemoryAccess class and update documentation to be formulated from the perspective of the memory object, rather than the memory access. The terms "Implicit"/"Explicit", formerly used to describe memory accesses, have been dropped. From the perspective of memory accesses they described the different memory kinds well - especially from the perspective of code generation - but just from the perspective of a memory object it seems more straightforward to talk about scalars and arrays, rather than explicit and implicit arrays. The last comment is clearly subjective, though. A less subjective reason to go for these terms is the historic use both in mailing list discussions and publications. llvm-svn: 255467	2015-12-13 19:59:01 +00:00
Michael Kruse	b8d2644732	Print "null" for ISL objects that are nullptr Use it to print "null" if a MemoryAccess's access relation is not available instead of printing nothing. Suggested-by: Johannes Doerfert llvm-svn: 255466	2015-12-13 19:35:26 +00:00
Michael Kruse	a902ba6f1e	Reuse ScopStmt::isEmpty() function Introduce a function getStmtForRegionNode() to get the corresponding ScopStmt of a RegionNode. We can use it to call the existing ScopStmt::isEmpty() function instead of searching for accesses. llvm-svn: 255465	2015-12-13 19:21:45 +00:00
Michael Kruse	e3ec4563b2	Check if access relation is available before printing Currently MemoryAccesses are not printed before the access relations are available, but might be printed during gdb sessions. llvm-svn: 255464	2015-12-13 18:47:02 +00:00
Tobias Grosser	8d4f6267f9	ScopInfo: Add helper function to invalidate a scop llvm-svn: 255430	2015-12-12 09:52:26 +00:00
Tobias Grosser	2d3d4ec860	executeScopConditionally: Introduce special exiting block When introducing separate control flow for the original and optimized code we now introduce a special 'ExitingBlock': \ / EnteringBB \| SplitBlock---------\ _____\|_____ \| / EntryBB \ StartBlock \| (region) \| \| \_ExitingBB_/ ExitingBlock \| \| MergeBlock---------/ \| ExitBB / \ This 'ExitingBlock' contains code such as the final_reloads for scalars, which previously were just added to whichever statement/loop_exit/branch-merge block had been generated last. Having an explicit basic block makes it easier to find these constructs when looking at the CFG. llvm-svn: 255107	2015-12-09 11:38:22 +00:00
Tobias Grosser	87a44d29a2	test: Fix misspelled test line llvm-svn: 255106	2015-12-09 11:38:08 +00:00
Tobias Grosser	31441c4614	ScopInfo: Add MemoryAccess::isScalar() Suggested-by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 254921	2015-12-07 18:06:08 +00:00
Tobias Grosser	a5d9e65e17	Update isl to isl-0.15-142-gf101714 This update brings in improvements to isl's 'isolate' option that reduce the number of code versions generated. This results in both code-size and compile time reduction for outer loop vectorization. Thanks to Roman Garev and Sven Verdoolaege for working on this improvement. llvm-svn: 254706	2015-12-04 08:46:14 +00:00
Michael Kruse	aed1e03888	Update isl to isl-0.15-140-g9279e30 The motivation is to fix a compilation error with Visual Studio 2013. See http://reviews.llvm.org/D14886. Thanks to Sumanth Gundapaneni for finding the issue and suggesting a patch. llvm-svn: 254498	2015-12-02 09:35:04 +00:00
Michael Kruse	054729b420	Add script to update isl The script will checkout the most recent master from http://repo.or.cz/isl.git into /tmp, create a distribution tarball, and extract it as replacement of lib/External/isl. After that it can be committed to the Polly repository. llvm-svn: 254497	2015-12-02 09:34:46 +00:00
Tobias Grosser	2fd89da90d	Remove non-debug printing of domain set Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D15094 llvm-svn: 254343	2015-11-30 22:59:41 +00:00
Tobias Grosser	3a6ac9f9b5	ScopInfo: Further simplify code Acc==MA implies Acc->getAccessInstruction() == MA->getAccessInstruction(). Suggested as post-commit review for 254305 by Michael Kruse. llvm-svn: 254327	2015-11-30 21:13:43 +00:00
Tobias Grosser	ef9ca5db16	ScopInfo: Replace while/iterator construct with std::remove_if The use of C++'s high-level iterator functionality instead of two while loops and explicit iterator handling improves readability of this code. Proposed-by: Michael Kruse <llvm@meinersbur.de> Differential Revision: http://reviews.llvm.org/D15068 llvm-svn: 254305	2015-11-30 17:20:40 +00:00
Michael Kruse	4c86a1d97b	IR cleanup after CodeGeneration Re-run canonicalization passes after Polly's code generation. The set of passes currently added here are nearly all the passes between --polly-position=early and --polly-position=before-vectorizer, i.e. all passes that would usually run after Polly. In order to run these only if Polly actually modified the code, we add a function attribute "polly-optimzed" to a function that contains generated code. The cleanup pass is skipped if the function does not have this attribute. There is no support by the (legacy) PassManager to run passes only under some conditions. One could have wrapped all transformation passes to run only when CodeGeneration changed the code, but the analyses would run anyway. This patch creates an independent pass manager. The disadvantages are that all analyses have to re-run even if preserved and it does not honor compiler switches like the PassManagerBuilder does. Differential Revision: http://reviews.llvm.org/D14333 llvm-svn: 254150	2015-11-26 12:36:25 +00:00
Michael Kruse	cba170e4d0	Introduce origin/kind for exit PHI node accesses Previously, accesses that originate from PHI nodes in the exit block were registered as SCALAR. In some contexts they are treated as scalars, but it makes a difference in others. We used to check whether the AccessInstruction is a terminator to differentiate the cases. This patch introduces a MemoryAccess origin EXIT_PHI and a ScopArrayInfo kind KIND_EXIT_PHI to make this case more explicit. No behavioural change intended. Differential Revision: http://reviews.llvm.org/D14688 llvm-svn: 254149	2015-11-26 12:26:06 +00:00
Tobias Grosser	2f8e43d677	ScopInfo: Add support for delinearizing Fortran arrays gfortran (and Fortran in general?) does not compute the address of an array element directly from the array sizes (e.g., %s0, %s1), but first takes the maximum of the sizes and 0 (e.g., max(0, %s0)) before multiplying the resulting value with the per-dimension array subscript expressions. To successfully delinearize index expressions as we see them in Fortran, we first filter 'smax' expressions out of the SCEV expression, use them to guess array size parameters and only then continue with the existing delinearization. llvm-svn: 253995	2015-11-24 17:06:38 +00:00
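
For commit ef9ca5db16 above (replacing a while/iterator construct with std::remove_if), the following is a minimal C++ sketch of the erase-remove idiom. It is not the actual ScopInfo code: the Access struct, its InstId field, and removeAccessesOf are hypothetical names chosen only for illustration.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical stand-in for an access record; this is NOT Polly's MemoryAccess.
struct Access {
  int InstId; // assumed identifier of the instruction the access belongs to
};

// Remove every access that belongs to the instruction with the given id.
// std::remove_if moves the elements to keep to the front (preserving their
// relative order) and returns the new logical end; erase() then drops the tail.
void removeAccessesOf(std::vector<Access> &Accesses, int Id) {
  auto NewEnd =
      std::remove_if(Accesses.begin(), Accesses.end(),
                     [Id](const Access &A) { return A.InstId == Id; });
  Accesses.erase(NewEnd, Accesses.end());
}
```

Compared with a hand-written loop, this form avoids manual iterator invalidation handling, which is presumably the readability gain the commit message refers to.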


2142 Commits