llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	01978cfa0c	Remove independent blocks pass Polly can now be used as a analysis only tool as long as the code generation is disabled. However, we do not have an alternative to the independent blocks pass in place yet, though in the relevant cases this does not seem to impact the performance much. Nevertheless, a virtual alternative that allows the same transformations without changing the input region will follow shortly. llvm-svn: 250652	2015-10-18 12:28:00 +00:00
Tobias Grosser	05d7fa79b6	Format comment properly While clang-format takes care that the line-length is not surpassed, the resulting comments sometimes look not optimal. We re-flow the text in the comment to avoid these ugly single-word lines. llvm-svn: 250626	2015-10-17 21:46:28 +00:00
Michael Kruse	225f0d1ee2	Load/Store scalar accesses before/after the statement itself Instead of generating implicit loads within basic blocks, put them before the instructions of the statment itself, including non-affine subregions. The region's entry node is dominating all blocks in the region and therefore the loaded value will be available there. Implicit writes in block-stmts were already stored back at the end of the block. Now, also generate the stores of non-affine subregions when leaving the statement, i.e. in the exiting block. This change is required for array-mapped implicits ("De-LICM") to ensure that there are no dependencies of demoted scalars within statments. Statement load all required values, operator on copied in registers, and then write back the changed value to the demoted memory. Lifetimes analysis within statements becomes unecessary. Differential Revision: http://reviews.llvm.org/D13487 llvm-svn: 250625	2015-10-17 21:36:00 +00:00
Michael Kruse	01cb379fed	Avoid unnecessay .s2a write access when used only in PHIs Accesses for exit node phis will be handled separately by buildPHIAccesses if there is more than one exiting edge, buildScalarDependences does not need to create additional SCALAR accesses. This is a corrected version of r250517, which was reverted in r250607. Differential Revision: http://reviews.llvm.org/D13848 llvm-svn: 250622	2015-10-17 21:07:08 +00:00
Tobias Grosser	3839b422e6	Revert "Avoid unnecessay .s2a write access when used only in PHIs" This reverts commit r250606 due to some bugs it introduced. After these bugs have been resolved, we will add it back to tree. llvm-svn: 250607	2015-10-17 08:54:05 +00:00
Michael Kruse	aeceab770e	Avoid unnecessay .s2a write access when used only in PHIs PHI accesses will be handled separately by buildPHIAccesses, buildScalarDependences does not need to create additional accesses. llvm-svn: 250517	2015-10-16 15:14:40 +00:00
Michael Kruse	668af71b82	Do not add accesses for intra-ScopStmt scalar def-use chains When pulling a llvm::Value to be written as a PHI write, the former code did only check whether it is within the same basic block, but it could also be the same non-affine subregion. In that case some unecessary pair of MemoryAccesses would have been created. Two unit test were explicitely checking for the unecessary writes, including the comments that the writes are unecessary. llvm-svn: 250411	2015-10-15 14:45:48 +00:00
Johannes Doerfert	9b1f9c8b61	Allow eager evaluated binary && and \|\| conditions The domain generation can handle lazy && and \|\| by default but eager evaluated expressions were dismissed as non-affine. With this patch we will allow arbitrary combinations of and/or bit-operations in the conditions of branches. Differential Revision: http://reviews.llvm.org/D13624 llvm-svn: 249971	2015-10-11 13:21:03 +00:00
Johannes Doerfert	f363ed9804	[NFC] Move helper functions to ScopHelper Helper functions in the BlockGenerators.h/cpp introduce dependences from the frontend to the backend of Polly. As they are used in ScopDetection, ScopInfo, etc. we move them to the ScopHelper file. llvm-svn: 249919	2015-10-09 23:40:24 +00:00
David Blaikie	91e113d1dd	Remove some unused variables in -Asserts builds llvm-svn: 249866	2015-10-09 18:22:18 +00:00
Johannes Doerfert	697fdf891c	Consolidate invariant loads If a (assumed) invariant location is loaded multiple times we generated a parameter for each location. However, this caused compile time problems for several benchmarks (e.g., 445_gobmk in SPEC2006 and BT in the NAS benchmarks). Additionally, the code we generate is suboptimal as we preload the same location multiple times and perform the same checks on all the parameters that refere to the same value. With this patch we consolidate the invariant loads in three steps: 1) During SCoP initialization required invariant loads are put in equivalence classes based on their pointer operand. One representing load is used to generate a parameter for the whole class, thus we never generate multiple parameters for the same location. 2) During the SCoP simplification we remove invariant memory accesses that are in the same equivalence class. While doing so we build the union of all execution domains as it is only important that the location is at least accessed once. 3) During code generation we only preload one element of each equivalence class with the unified execution domain. All others are mapped to that preloaded value. Differential Revision: http://reviews.llvm.org/D13338 llvm-svn: 249853	2015-10-09 17:12:26 +00:00
Johannes Doerfert	f7e2967293	[FIX] Add missing projection for invariant load domains This was left out from the original patch proposed in http://reviews.llvm.org/D13195 even though it is needed to define an order invariant loads are hoisted. llvm-svn: 249680	2015-10-08 11:05:57 +00:00
Johannes Doerfert	08d90a3cee	Treat conditionally executed non-pure calls as errors This replaces the support for user defined error functions by a heuristic that tries to determine if a call to a non-pure function should be considered "an error". If so the block is assumed not to be executed at runtime. While treating all non-pure function calls as errors will allow a lot more regions to be analyzed, it will also cause us to dismiss a lot again due to an infeasible runtime context. This patch tries to limit that effect. A non-pure function call is considered an error if it is executed only in conditionally with regards to a cheap but simple heuristic. llvm-svn: 249611	2015-10-07 20:32:43 +00:00
Johannes Doerfert	d8dd8630b2	[NFC] Make LoopInfo a member and simplify arguments llvm-svn: 249609	2015-10-07 20:31:36 +00:00
Johannes Doerfert	09e3697f44	Allow invariant loads in the SCoP description This patch allows invariant loads to be used in the SCoP description, e.g., as loop bounds, conditions or in memory access functions. First we collect "required invariant loads" during SCoP detection that would otherwise make an expression we care about non-affine. To this end a new level of abstraction was introduced before SCEVValidator::isAffineExpr() namely ScopDetection::isAffine() and ScopDetection::onlyValidRequiredInvariantLoads(). Here we can decide if we want a load inside the region to be optimistically assumed invariant or not. If we do, it will be marked as required and in the SCoP generation we bail if it is actually not invariant. If we don't it will be a non-affine expression as before. At the moment we optimistically assume all "hoistable" (namely non-loop-carried) loads to be invariant. This causes us to expand some SCoPs and dismiss them later but it also allows us to detect a lot we would dismiss directly if we would ask e.g., AliasAnalysis::canBasicBlockModify(). We also allow potential aliases between optimistically assumed invariant loads and other pointers as our runtime alias checks are sound in case the loads are actually invariant. Together with the invariant checks this combination allows to handle a lot more than LICM can. The code generation of the invariant loads had to be extended as we can now have dependences between parameters and invariant (hoisted) loads as well as the other way around, e.g., test/Isl/CodeGen/invariant_load_parameters_cyclic_dependence.ll First, it is important to note that we cannot have real cycles but only dependences from a hoisted load to a parameter and from another parameter to that hoisted load (and so on). To handle such cases we materialize llvm::Values for parameters that are referred by a hoisted load on demand and then materialize the remaining parameters. Second, there are new kinds of dependences between hoisted loads caused by the constraints on their execution. If a hoisted load is conditionally executed it might depend on the value of another hoisted load. To deal with such situations we sort them already in the ScopInfo such that they can be generated in the order they are listed in the Scop::InvariantAccesses list (see compareInvariantAccesses). The dependences between hoisted loads caused by indirect accesses are handled the same way as before. llvm-svn: 249607	2015-10-07 20:17:36 +00:00
Johannes Doerfert	f17a78ef63	Remove non-executed statements during SCoP simplifcation A statement with an empty domain complicates the invariant load hoisting and does not help any subsequent analysis or transformation. In fact it might introduce parameter dimensions or increase the schedule dimensionality. To this end, we remove statements with an empty domain early in the SCoP simplification. llvm-svn: 249276	2015-10-04 15:00:05 +00:00
Johannes Doerfert	634909c2c9	[FIX] Domain generation for non-affine loops llvm-svn: 249275	2015-10-04 14:57:41 +00:00
Johannes Doerfert	8dba07770f	[NFC] Remove unused classes llvm-svn: 249271	2015-10-04 14:52:43 +00:00
Michael Kruse	afe0670863	Bail-out early if all statements have been simplified away Treat the scop as invalid instead of creating dummy domains. llvm-svn: 249151	2015-10-02 16:33:27 +00:00
Johannes Doerfert	8930f4846c	[FIX] Do not hoist from inside a non-affine subregion We have to skip accesses in non-affine subregions during hoisting as they might not be executed under the same condition as the entry of the non-affine subregion. llvm-svn: 249139	2015-10-02 14:51:00 +00:00
Michael Kruse	cac948ef46	Earlier creation of ScopStmt objects This moves the construction of ScopStmt to the beginning of the ScopInfo pass. The late creation was a result of the earlier separation of ScopInfo and TempScopInfo. This will avoid introducing more ScopStmt-like maps in future commits. The AccFuncMap will also be removed in some future commit. DomainMap might also be included into ScopStmt. The order in which ScopStmt are created changes and initially creates empty statements that are removed in a simplification. Differential Revision: http://reviews.llvm.org/D13341 llvm-svn: 249132	2015-10-02 13:53:07 +00:00
Johannes Doerfert	478a7de18b	[NFC] Make the ScopDetection analysis a member of the Scop class llvm-svn: 249125	2015-10-02 13:09:31 +00:00
Johannes Doerfert	f56738041e	Make the SCoP generation resistent wrt. error blocks When error blocks are not terminated by an unreachable they have successors that might only be reachable via error blocks. Additionally, branches in error blocks are not checked during SCoP detection, thus we might not be able to handle them. With this patch we do not try to model error block exit conditions. Anything that is only reachable via error blocks is ignored too, as it will not be executed in the optimized version of the SCoP anyway. llvm-svn: 249099	2015-10-01 23:48:18 +00:00
Michael Kruse	76e924d31b	Assign scop directly This makes ScopInfo's scop member available earlier to other methods which will make some planned changes simpler. No behavioral change intended llvm-svn: 248879	2015-09-30 09:16:07 +00:00
Johannes Doerfert	c1db67e218	Identify and hoist definitively invariant loads As a first step in the direction of assumed invariant loads (loads that are not written in some context) we now detect and hoist definitively invariant loads. These invariant loads will be preloaded in the code generation and used in the optimized version of the SCoP. If the load is only conditionally executed the preloaded version will also only be executed under the same condition, hence we will never access memory that wouldn't have been accessed otherwise. This is also the most distinguishing feature to licm. As hoisting can make statements empty we will simplify the SCoP and remove empty statements that would otherwise cause artifacts in the code generation. Differential Revision: http://reviews.llvm.org/D13194 llvm-svn: 248861	2015-09-29 23:47:21 +00:00
Johannes Doerfert	58a7c75c86	[NFC] Add accidentally removed comment line llvm-svn: 248704	2015-09-28 09:48:53 +00:00
Johannes Doerfert	9a132f36c3	Allow switch instructions in SCoPs This patch allows switch instructions with affine conditions in the SCoP. Also switch instructions in non-affine subregions are allowed. Both did not require much changes to the code, though there was some refactoring needed to integrate them without code duplication. In the llvm-test suite the number of profitable SCoPs increased from 135 to 139 but more importantly we can handle more benchmarks and user inputs without preprocessing. Differential Revision: http://reviews.llvm.org/D13200 llvm-svn: 248701	2015-09-28 09:33:22 +00:00
Johannes Doerfert	f32f5f2305	Remove obsolete check This check was needed at some point but seems not useful anymore. Only one adjustment in the domain generation was needed to cope with the cases this check prevented from happening before. llvm-svn: 248695	2015-09-28 01:30:37 +00:00
Tobias Grosser	a43b6e935c	Drop unused variable llvm-svn: 248687	2015-09-27 17:54:50 +00:00
Michael Kruse	caac2b6930	Fix typo llvm-svn: 248670	2015-09-26 15:51:44 +00:00
Johannes Doerfert	c6987c18de	[FIX] Use the surrounding loop for non-affine SCoP regions When the whole SCoP is a non-affine region we need to use the surrounding loop in the construction of the schedule as that is the one that will be looked up after the schedule generation. This fixes bug 24947 llvm-svn: 248667	2015-09-26 13:41:43 +00:00
Tobias Grosser	99c70dd8d1	Ensure memory accesses to the same array have identical dimensionality When recovering multi-dimensional memory accesses, it may happen that different accesses to the same base array are recovered with different dimensionality. This patch ensures that the dimensionalities are unified by adding zero valued dimensions to acesses with lower dimensionality. When starting to model fixed-size arrays as multi-dimensional in 247906, this has not been taken care of. llvm-svn: 248662	2015-09-26 08:55:54 +00:00
Michael Kruse	8d0b734e71	Let MemoryAccess remember its purpose There are three possible reasons to add a memory memory access: For explicit load and stores, for llvm::Value defs/uses, and to emulate PHI nodes (the latter two called implicit accesses). Previously MemoryAccess only stored IsPHI. Register accesses could be identified through the isScalar() method if it was no IsPHI. isScalar() determined the number of dimensions of the underlaying array, scalars represented by zero dimensions. For the work on de-LICM, implicit accesses can have more than zero dimensions, making the distinction of isScalars() useless, hence now stored explicitly in the MemoryAccess. Instead, we replace it by isImplicit() and avoid the term scalar for zero-dimensional arrays as it might be confused with llvm::Value which are also often referred to as scalars (or alternatively, as registers). No behavioral change intended, under the condition that it was impossible to create explicit accesses to zero-dimensional "arrays". llvm-svn: 248616	2015-09-25 21:21:00 +00:00
Michael Kruse	33d6c0bbc5	Use per-Purpose overloads for MemoryAccess creation This makes the intent of each created object clearer and allows to add more specific asserts. The bug fixed in r248535 has been discovered this way. No functional change intended; everything should behave as before. llvm-svn: 248603	2015-09-25 18:53:27 +00:00
Tobias Grosser	c2bb0cbe00	Sort includes using Chandler's sort_includes.py script llvm-svn: 248568	2015-09-25 09:49:19 +00:00
Tobias Grosser	da95a4a7c7	Handle read-only scalars used in PHI-nodes correctly This change addresses three issues: - Read only scalars that enter a PHI node through an edge that comes from outside the scop are not modeled any more, as such PHI nodes will always be initialized to this initial value right before the SCoP is entered. - For PHI nodes that depend on a scalar value that is defined outside the scop, but where the scalar values is passed through an edge that itself comes from a BB that is part of the region, we introduce in this basic block a read of the out-of-scop value to ensure it's value is available to write it into the PHI alloc location. - Read only uses of scalars by PHI nodes are ignored in the general read only handling code, as they are taken care of by the general PHI node modeling code. llvm-svn: 248535	2015-09-24 20:59:59 +00:00
Michael Kruse	26ed65e00d	Fix comparison signed/unsigned mismatch warning; NFC llvm-svn: 248520	2015-09-24 17:32:49 +00:00
Michael Kruse	2d0ece960f	Remove Analysis Output of TempScopInfo After the merge of TempScopInfo into ScopInfo the analysis output remained because of the existing unit tests. These remains are removed and the units tests converted to match the equivalent output of ScopInfo's analysis output. The unit tests are also moved into the directory of ScopInfo tests. Differential Revision: http://reviews.llvm.org/D13116 llvm-svn: 248485	2015-09-24 11:41:21 +00:00
Tobias Grosser	b1c39429d9	Do not model delinearized and linearized access relation for a single access A missing return statement that previously did not have a visibly negative effect caused after some data-structure changes in r248024 multi-dimensional accesses to be modeled both multi-dimensional as well as linearized. This commit adds the missing return to avoid the incorrect double modeling as well as the compile time increases it caused. llvm-svn: 248171	2015-09-21 16:19:25 +00:00
Johannes Doerfert	6a72a2af13	Use <nsw> AddRecs in the affinator to avoid bounded assumptions If we encounter a <nsw> tagged AddRec for a loop we know the trip count of that loop has to be bounded or the semantics is undefined anyway. Hence, we only need to add unbounded assumptions if no such AddRec is known. llvm-svn: 248128	2015-09-20 16:59:23 +00:00
Johannes Doerfert	707a406078	Add bounded loop assumption So far we ignored the unbounded parts of the iteration domain, however we need to assume they do not occure at all to remain sound if they do. llvm-svn: 248126	2015-09-20 16:38:19 +00:00
Johannes Doerfert	f2cc86edae	Simplify domain generation We now add loop carried information during the second traversal of the region instead of in a intermediate step in-between. This makes the generation simpler, removes code and should even be faster. llvm-svn: 248125	2015-09-20 16:15:32 +00:00
Johannes Doerfert	06c57b594c	Allow loops with multiple back edges In order to allow multiple back edges we: - compute the conditions under which each back edge is taken - build the union over all these conditions, thus the condition that any back edge is taken - apply the same logic to the union we applied to a single back edge llvm-svn: 248120	2015-09-20 15:00:20 +00:00
Michael Kruse	84f70acd68	Remove unused variable Dimension [NFC] llvm-svn: 248026	2015-09-18 20:03:32 +00:00
Michael Kruse	e2bccbbfb2	Merge IRAccess into MemoryAccess All MemoryAccess objects will be owned by ScopInfo::AccFuncMap which previously stored the IRAccess objects. Instead of creating new MemoryAccess objects, the already created ones are reused, but their order might be different now. Some fields of IRAccess and MemoryAccess had the same meaning and are merged. This is the last step of fusioning TempScopInfo.{h\|cpp} and ScopInfo.{h.cpp}. Some refactoring might still make sense. Differential Revision: http://reviews.llvm.org/D12843 llvm-svn: 248024	2015-09-18 19:59:43 +00:00
Tobias Grosser	6f36d9ab01	Delinearize multi-dimensional arrays through bitcasts In some cases instcombine introduces bitcasts that slightly obfuscate the multi-dimensionality of an array. This patch teaches our fixed-size delinearization how to look through bitcasts. llvm-svn: 247928	2015-09-17 20:16:21 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Tobias Grosser	faf8f6f62e	Extract function that derives the index expressions of a GEP instruction [NFC] We currently use this functionality to add run-time assumptions that check its in-bound property. llvm-svn: 247893	2015-09-17 15:47:52 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00
Johannes Doerfert	cef616fe2d	Use blocks instead of domains in SCEVAffinator Due to the new domain generation, the SCoP keeps track of the domain for all blocks, thus the SCEVAffinator can now work with blocks to avoid duplication of the domains. llvm-svn: 247731	2015-09-15 22:49:04 +00:00

1 2 3 4 5 ...

305 Commits