llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	91ad092bb2	[NFC] Remove unused SCoP diagnostic llvm-svn: 248694	2015-09-28 01:29:44 +00:00
Tobias Grosser	a43b6e935c	Drop unused variable llvm-svn: 248687	2015-09-27 17:54:50 +00:00
Michael Kruse	caac2b6930	Fix typo llvm-svn: 248670	2015-09-26 15:51:44 +00:00
Johannes Doerfert	c6987c18de	[FIX] Use the surrounding loop for non-affine SCoP regions When the whole SCoP is a non-affine region we need to use the surrounding loop in the construction of the schedule as that is the one that will be looked up after the schedule generation. This fixes bug 24947 llvm-svn: 248667	2015-09-26 13:41:43 +00:00
Tobias Grosser	99c70dd8d1	Ensure memory accesses to the same array have identical dimensionality When recovering multi-dimensional memory accesses, it may happen that different accesses to the same base array are recovered with different dimensionality. This patch ensures that the dimensionalities are unified by adding zero valued dimensions to acesses with lower dimensionality. When starting to model fixed-size arrays as multi-dimensional in 247906, this has not been taken care of. llvm-svn: 248662	2015-09-26 08:55:54 +00:00
Michael Kruse	8d0b734e71	Let MemoryAccess remember its purpose There are three possible reasons to add a memory memory access: For explicit load and stores, for llvm::Value defs/uses, and to emulate PHI nodes (the latter two called implicit accesses). Previously MemoryAccess only stored IsPHI. Register accesses could be identified through the isScalar() method if it was no IsPHI. isScalar() determined the number of dimensions of the underlaying array, scalars represented by zero dimensions. For the work on de-LICM, implicit accesses can have more than zero dimensions, making the distinction of isScalars() useless, hence now stored explicitly in the MemoryAccess. Instead, we replace it by isImplicit() and avoid the term scalar for zero-dimensional arrays as it might be confused with llvm::Value which are also often referred to as scalars (or alternatively, as registers). No behavioral change intended, under the condition that it was impossible to create explicit accesses to zero-dimensional "arrays". llvm-svn: 248616	2015-09-25 21:21:00 +00:00
Michael Kruse	33d6c0bbc5	Use per-Purpose overloads for MemoryAccess creation This makes the intent of each created object clearer and allows to add more specific asserts. The bug fixed in r248535 has been discovered this way. No functional change intended; everything should behave as before. llvm-svn: 248603	2015-09-25 18:53:27 +00:00
Tobias Grosser	c2bb0cbe00	Sort includes using Chandler's sort_includes.py script llvm-svn: 248568	2015-09-25 09:49:19 +00:00
Tobias Grosser	da95a4a7c7	Handle read-only scalars used in PHI-nodes correctly This change addresses three issues: - Read only scalars that enter a PHI node through an edge that comes from outside the scop are not modeled any more, as such PHI nodes will always be initialized to this initial value right before the SCoP is entered. - For PHI nodes that depend on a scalar value that is defined outside the scop, but where the scalar values is passed through an edge that itself comes from a BB that is part of the region, we introduce in this basic block a read of the out-of-scop value to ensure it's value is available to write it into the PHI alloc location. - Read only uses of scalars by PHI nodes are ignored in the general read only handling code, as they are taken care of by the general PHI node modeling code. llvm-svn: 248535	2015-09-24 20:59:59 +00:00
Michael Kruse	26ed65e00d	Fix comparison signed/unsigned mismatch warning; NFC llvm-svn: 248520	2015-09-24 17:32:49 +00:00
Michael Kruse	2d0ece960f	Remove Analysis Output of TempScopInfo After the merge of TempScopInfo into ScopInfo the analysis output remained because of the existing unit tests. These remains are removed and the units tests converted to match the equivalent output of ScopInfo's analysis output. The unit tests are also moved into the directory of ScopInfo tests. Differential Revision: http://reviews.llvm.org/D13116 llvm-svn: 248485	2015-09-24 11:41:21 +00:00
Johannes Doerfert	e526de5a47	Make MIN_LOOP_TRIP_COUNT a static constant llvm-svn: 248192	2015-09-21 19:10:11 +00:00
Tobias Grosser	b1c39429d9	Do not model delinearized and linearized access relation for a single access A missing return statement that previously did not have a visibly negative effect caused after some data-structure changes in r248024 multi-dimensional accesses to be modeled both multi-dimensional as well as linearized. This commit adds the missing return to avoid the incorrect double modeling as well as the compile time increases it caused. llvm-svn: 248171	2015-09-21 16:19:25 +00:00
Johannes Doerfert	6a72a2af13	Use <nsw> AddRecs in the affinator to avoid bounded assumptions If we encounter a <nsw> tagged AddRec for a loop we know the trip count of that loop has to be bounded or the semantics is undefined anyway. Hence, we only need to add unbounded assumptions if no such AddRec is known. llvm-svn: 248128	2015-09-20 16:59:23 +00:00
Johannes Doerfert	707a406078	Add bounded loop assumption So far we ignored the unbounded parts of the iteration domain, however we need to assume they do not occure at all to remain sound if they do. llvm-svn: 248126	2015-09-20 16:38:19 +00:00
Johannes Doerfert	f2cc86edae	Simplify domain generation We now add loop carried information during the second traversal of the region instead of in a intermediate step in-between. This makes the generation simpler, removes code and should even be faster. llvm-svn: 248125	2015-09-20 16:15:32 +00:00
Johannes Doerfert	06c57b594c	Allow loops with multiple back edges In order to allow multiple back edges we: - compute the conditions under which each back edge is taken - build the union over all these conditions, thus the condition that any back edge is taken - apply the same logic to the union we applied to a single back edge llvm-svn: 248120	2015-09-20 15:00:20 +00:00
Johannes Doerfert	7175bdfbe4	Add loop trip count based heuristic for SCoP detection As we currently do not perform any optimizations that targets (or is even aware) small trip counts we will skip them when we count the loops in a region. llvm-svn: 248119	2015-09-20 14:56:54 +00:00
Johannes Doerfert	b276bde162	[NFC] Remove obsolete diagnostic for unstructured control flow llvm-svn: 248118	2015-09-20 14:55:50 +00:00
Michael Kruse	84f70acd68	Remove unused variable Dimension [NFC] llvm-svn: 248026	2015-09-18 20:03:32 +00:00
Michael Kruse	e2bccbbfb2	Merge IRAccess into MemoryAccess All MemoryAccess objects will be owned by ScopInfo::AccFuncMap which previously stored the IRAccess objects. Instead of creating new MemoryAccess objects, the already created ones are reused, but their order might be different now. Some fields of IRAccess and MemoryAccess had the same meaning and are merged. This is the last step of fusioning TempScopInfo.{h\|cpp} and ScopInfo.{h.cpp}. Some refactoring might still make sense. Differential Revision: http://reviews.llvm.org/D12843 llvm-svn: 248024	2015-09-18 19:59:43 +00:00
Tobias Grosser	6f36d9ab01	Delinearize multi-dimensional arrays through bitcasts In some cases instcombine introduces bitcasts that slightly obfuscate the multi-dimensionality of an array. This patch teaches our fixed-size delinearization how to look through bitcasts. llvm-svn: 247928	2015-09-17 20:16:21 +00:00
Tobias Grosser	0537f41de5	Do not use the assumed context in the dependence analysis any more This information is implicitly available through the multi-dimensionality of memory accesses. This reduces compile time for 3mm from 430ms to 400ms and should generally benefit compile time for cases where the assumed context is complex. llvm-svn: 247907	2015-09-17 17:28:19 +00:00
Tobias Grosser	5fd8c0961e	Model fixed-size multi-dimensional arrays if possible multi-dimensional If the GEP instructions give us enough insights, model scalar accesses as multi-dimensional (and generate the relevant run-time checks to ensure correctness). This will allow us to simplify the dependence computation in a subsequent commit. llvm-svn: 247906	2015-09-17 17:28:15 +00:00
Tobias Grosser	faf8f6f62e	Extract function that derives the index expressions of a GEP instruction [NFC] We currently use this functionality to add run-time assumptions that check its in-bound property. llvm-svn: 247893	2015-09-17 15:47:52 +00:00
Tobias Grosser	e375d8058a	Add option to enable/disable reduction usage in dependence analysis llvm-svn: 247781	2015-09-16 09:50:17 +00:00
Johannes Doerfert	883f8c1d2f	Use modulo semantic to generate non-integer-overflow assumptions This will allow to generate non-wrap assumptions for integer expressions that are part of the SCoP. We compare the common isl representation of the expression with one computed with modulo semantic. For all parameter combinations they are not equal we can have integer overflows. The nsw flags are respected when the modulo representation is computed, nuw and nw flags are ignored for now. In order to not increase compile time to much, the non-wrap assumptions are collected in a separate boundary context instead of the assumed context. This helps compile time as the boundary context can become complex and it is therefor not advised to use it in other operations except runtime check generation. However, the assumed context is e.g., used to tighten dependences. While the boundary context might help to tighten the assumed context it is doubtful that it will help in practice (it does not effect lnt much) as the boundary (or no-wrap assumptions) only restrict the very end of the possible value range of parameters. PET uses a different approach to compute the no-wrap context, though lnt runs have shown that this version performs slightly better for us. llvm-svn: 247732	2015-09-15 22:52:53 +00:00
Johannes Doerfert	cef616fe2d	Use blocks instead of domains in SCEVAffinator Due to the new domain generation, the SCoP keeps track of the domain for all blocks, thus the SCEVAffinator can now work with blocks to avoid duplication of the domains. llvm-svn: 247731	2015-09-15 22:49:04 +00:00
Johannes Doerfert	b20f151d56	Coalesce the constructed domains early llvm-svn: 247728	2015-09-15 22:11:49 +00:00
Michael Kruse	2846877d88	Replace some SmallVector-typed parameters by ArrayRef ArrayRef avoids making implementation details such as the number of stack elements to be part of the function signature. llvm-svn: 247572	2015-09-14 15:45:33 +00:00
Tobias Grosser	0b13890042	Fix formatting llvm-svn: 247549	2015-09-14 11:38:06 +00:00
Johannes Doerfert	e114dc024e	[FIX] Handle error blocks in non-affine regions correctly llvm-svn: 247545	2015-09-14 11:15:58 +00:00
Johannes Doerfert	40fa56f59f	[FIX] Allow the whole SCoP to be a non-affine subregion llvm-svn: 247544	2015-09-14 11:15:07 +00:00
Johannes Doerfert	ca1e38fa43	Propagate exit conditions as described in the PET paper At some point we build loop trip counts using this method. It was replaced by a simpler trick that works only for affine (e.g., not modulo) constraints and relies on the removal of unbounded parts. In order to allow modulo constrains again we go back to the former, more accurate method. llvm-svn: 247540	2015-09-14 11:12:52 +00:00
Michael Kruse	9d08009dff	Merge TempScop into Scop Summary: TempScop is basically a holder for AccFuncMap, the dictionary from BasicBlocks to IRAccess lists. We move the list into polly::Scop and remove the polly::TempScop class. There is one small change in behavior: If ScopInfo finds that its AssumedContext is impossible, it bails out by deleting the Scop object. The TempScop::print (invoked with opt -polly-scops -analyze) cannot print the AccFuncMap anymore as it would with a separate TempScop. Differential Revision: http://reviews.llvm.org/D12803 llvm-svn: 247480	2015-09-11 21:41:48 +00:00
Johannes Doerfert	90db75ed24	Runtime error check elimination Hoist runtime checks in the loop nest if they guard an "error" like event. Such events are recognized as blocks with an unreachable terminator or a call to the ubsan function that deals with out of bound accesses. Other "error" events can be added easily. We will ignore these blocks when we detect/model/optmize and code generate SCoPs but we will make sure that they would not have been executed using the assumption framework. llvm-svn: 247310	2015-09-10 17:51:27 +00:00
Johannes Doerfert	f4fa9879fb	[FIX] Do not assume only one loop can be left at a time llvm-svn: 247291	2015-09-10 15:53:59 +00:00
Johannes Doerfert	b68cffb5df	Allow general loops with one latch As we do not rely on ScalarEvolution any more we do not need to get the backedge taken count. Additionally, our domain generation handles everything that is affine and has one latch and our ScopDetection will over-approximate everything else. This change will therefor allow loops with: - one latch - exiting conditions that are affine Additionally, it will not check for structured control flow anymore. Hence, loops and conditionals are not necessarily single entry single exit regions any more. Differential Version: http://reviews.llvm.org/D12758 llvm-svn: 247289	2015-09-10 15:27:46 +00:00
Michael Kruse	d868b5d509	Merge TempScopInfo into ScopInfo The TempScopInfo (-polly-analyze-ir) pass is removed and its work taken over by ScopInfo (-polly-scops). Several tests depend on -polly-analyze-ir and use -polly-scops instead which for the moment prints the output of both passes. This again is not expected by some other tests, especially those with negative searches, which have been adapted. Differential Version: http://reviews.llvm.org/D12694 llvm-svn: 247288	2015-09-10 15:25:24 +00:00
Johannes Doerfert	32ae76e7f9	[NFC] Remove obsolete arguments Remove some arguments that survived the recent changes but are not used any more. llvm-svn: 247280	2015-09-10 13:12:02 +00:00
Johannes Doerfert	5b9ff8b667	Replace ScalarEvolution based domain generation This patch replaces the last legacy part of the domain generation, namely the ScalarEvolution part that was used to obtain loop bounds. We now iterate over the loops in the region and propagate the back edge condition to the header blocks. Afterwards we propagate the new information once through the whole region. In this process we simply ignore unbounded parts of the domain and thereby assume the absence of infinite loops. + This patch already identified a couple of broken unit tests we had for years. + We allow more loops already and the step to multiple exit and multiple back edges is minimal. + It allows to model the overflow checks properly as we actually visit every block in the SCoP and know where which condition is evaluated. - It is currently not compatible with modulo constraints in the domain. Differential Revision: http://reviews.llvm.org/D12499 llvm-svn: 247279	2015-09-10 13:00:06 +00:00
Michael Kruse	7bf3944d23	Merge TempScopInfo.{cpp\|h} into ScopInfo.{cpp\|h} This prepares for a series of patches that merges TempScopInfo into ScopInfo to reduce Polly's code complexity. Only ScopInfo.{cpp\|h} will be left thereafter. Moving the code of TempScopInfo in one commit makes the mains diffs simpler to understand. In detail, merging the following classes is planned: TempScopInfo into ScopInfo TempScop into Scop IRAccess into MemoryAccess Only moving code, no functional changes intended. Differential Version: http://reviews.llvm.org/D12693 llvm-svn: 247274	2015-09-10 12:46:52 +00:00
Chandler Carruth	66ef16b289	[PM] Update Polly for the new AA infrastructure landed in r247167. llvm-svn: 247198	2015-09-09 22:13:56 +00:00
Johannes Doerfert	7ca8dc2d2d	Disable support for pointer expressions The support for pointer expressions is broken as it can only handle some patterns in the IslExprBuilder. We should to treat pointers in expressions the same as integers at some point and revert this patch. llvm-svn: 247147	2015-09-09 14:19:04 +00:00
Johannes Doerfert	717b866798	Allow PHI nodes in the region exit block While we do not need to model PHI nodes in the region exit (as it is not part of the SCoP), we need to prepare for the case that the exit block is split in code generation to create a single exiting block. If this will happen, hence if the region did not have a single exiting block before, we will model the operands of the PHI nodes as escaping scalars in the SCoP. Differential Revision: http://reviews.llvm.org/D12051 llvm-svn: 247078	2015-09-08 21:44:27 +00:00
Tobias Grosser	02e6589bda	Move more compile-time bailouts into -polly-detect-unprofitable Instead of having two separate options -polly-detect-scops-in-functions-without-loops and -polly-detect-scops-in-regions-without-loops we now just use -polly-detect-unprofitable to force the detection of scops ignoring any compile time saving bailout heuristics. llvm-svn: 247057	2015-09-08 19:46:41 +00:00
Tobias Grosser	6f73008506	Allow the import of multi-dimensional access functions Originally, we disallowed the import of multi-dimensional access functions due to our code generation not supporting the generation of new address expressions for multi-dimensional memory accesses. When building our run-time alias check infrastructure we added code generation support for multi-dimensional address calculations. Hence, we can now savely allow the import of new multi-dimensional access functions. llvm-svn: 246917	2015-09-05 07:46:47 +00:00
Tobias Grosser	166c422952	Use uppercase variable names [NFC] llvm-svn: 246916	2015-09-05 07:46:40 +00:00
Tobias Grosser	2df884f95a	ScopInfo: use project_out instead of remove_dims By just removing dimensions (and the constraints they are involved in) we may loose information about the dimensions we do not remove. By instead using project_out, we are sure all constraints on the outer dimensions are preserved. No test case, as this error condition is very unlikely to be triggered by isl's current code. We still 'fix' this, as isl gives little guarantees regarding the behavior of remove_divs. llvm-svn: 246567	2015-09-01 18:17:41 +00:00
Tobias Grosser	40820ca286	Fix another typo in the subloop counting ... as well as the corresponding test cases. Thank's Johannes for finding this bug. llvm-svn: 246483	2015-08-31 21:04:51 +00:00
Johannes Doerfert	5f912d3797	Do Not Model Unbounded Loops Code generation currently does not expect unbounded loops. When using ISL to compute the loop trip count, if we find that the iteration domain remains unbounded, we invalidate the Scop by creating an infeasible context. Contributed-by: Matthew Simpson <mssimpso@codeaurora.org> This fixes PR24634. Differential Revision: http://reviews.llvm.org/D12493 llvm-svn: 246477	2015-08-31 19:58:24 +00:00
Johannes Doerfert	f08bd00229	Build the domains with correct number of dimensions Instead of building domains with MaxLoopDepth dimensions, we now build the domains such that they have the right amount of dimensions all the time. llvm-svn: 246443	2015-08-31 13:56:32 +00:00
Tobias Grosser	d213d52d0e	Always use the branch instructions to model the PHI-node writes Before this commit we did this only for Arguments or Constants, but indeed an instruction may define a value a lot higher up in the dominance tree, but the actual write generally needs to happen right before branching to the PHI node. Otherwise, the writes of different branches into PHI nodes may get intermixed if they lay higher up in the dominance tree. llvm-svn: 246441	2015-08-31 13:45:54 +00:00
Tobias Grosser	050e0cbc0e	ScopDetection: Correctly count the loops in a region There is no reason the loops in a region need to touch either entry or exit block. Hence, we need to look through all loops that may touch the region as well as their children to understand if our region has at least two loops. llvm-svn: 246433	2015-08-31 12:08:11 +00:00
Tobias Grosser	44b34b0e8a	Also build scalar dependences for store instructions While ignoring read-only scalar dependences it was not necessary to consider store instructins, but as store instructions can be the target of a scalar read-only dependency we need to consider them for the construction of scalar read-only dependences. llvm-svn: 246429	2015-08-31 11:15:00 +00:00
Tobias Grosser	58758ef4ea	Enable modeling of scalar read-only dependences Even though these are not strictly necessary for sequential code generation, we still model both for sequential and parallel code generation to reduce the set of configurations that needs to be tested. If this turns out, against what we currently see, to be significant overhead, we can decide to limit this feature again to parallel code-generation use cases only. llvm-svn: 246420	2015-08-31 06:46:32 +00:00
Tobias Grosser	d86bf4271c	Do not model scalar references to constant values llvm-svn: 246418	2015-08-31 06:37:25 +00:00
Johannes Doerfert	96425c2574	Traverse the SCoP to compute non-loop-carried domain conditions In order to compute domain conditions for conditionals we will now traverse the region in the ScopInfo once and build the domains for each block in the region. The SCoP statements can then use these constraints when they build their domain. The reason behind this change is twofold: 1) This removes a big chunk of preprocessing logic from the TempScopInfo, namely the Conditionals we used to build there. Additionally to moving this logic it is also simplified. Instead of walking the dominance tree up for each basic block in the region (as we did before), we now traverse the region only once in order to collect the domain conditions. 2) This is the first step towards the isl based domain creation. The second step will traverse the region similar to this step, however it will propagate back edge conditions. Once both are in place this conditional handling will allow multiple exit loops additional logic. Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12428 llvm-svn: 246398	2015-08-30 21:13:53 +00:00
Tobias Grosser	e83a396b1d	Ignore debug intrinsics and do not model their potential scalar metadata reads Our code generation currently does not support scalar references to metadata values. Hence, it would crash if we try to model scalar dependences to metadata values. Fortunately, for one of the common uses, debug information, we can for now just ignore the relevant intrinsics and consequently the issue of how to model scalar dependences to metadata. llvm-svn: 246388	2015-08-30 16:57:20 +00:00
Tobias Grosser	9c0ffe3a1d	Remove some code duplication [NFC] llvm-svn: 246387	2015-08-30 16:57:15 +00:00
Johannes Doerfert	b409fdc0d7	[NFC] Make SCEVAffinator work without a statement llvm-svn: 246290	2015-08-28 09:24:35 +00:00
Tobias Grosser	ed21a1fc7e	Do not detect Scops with only one loop. If a region does not have more than one loop, we do not identify it as a Scop in ScopDetection. The main optimizations Polly is currently performing (tiling, preparation for outer-loop vectorization and loop fusion) are unlikely to have a positive impact on individual loops. In some cases, Polly's run-time alias checks or conditional hoisting may still have a positive impact, but those are mostly enabling transformations which LLVM already performs for individual loops. As we do not focus on individual loops, we leave them untouched to not introduce compile time regressions and execution time noise. This results in good compile time reduction (oourafft: -73.99%, smg2000: -56.25%). Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12268 llvm-svn: 246161	2015-08-27 16:55:18 +00:00
Johannes Doerfert	d020b77295	Use ISL to Determine Loop Trip Count Use ISL to compute the loop trip count when scalar evolution is unable to do so. Contributed-by: Matthew Simpson <mssimpso@codeaurora.org> Differential Revision: http://reviews.llvm.org/D9444 llvm-svn: 246142	2015-08-27 06:53:52 +00:00
Tobias Grosser	01c8f5f354	[Vectorizer] Detect strides in multi-dimensional arrays The original code was only correct for one-dimensional arrays, but derived incorrect strides for multi-dimensional arrays. llvm-svn: 245888	2015-08-24 22:20:46 +00:00
Tobias Grosser	daaed0e19f	Do not intersect with AssumedContext in calculateMinMaxAccess Originally, we intersected the iteration space with the AssumedContext before computing the minimal/maximal memory offset in our run-time alias checks. With this patch we drop this intersection as the AssumedContext can - for larger or more complex scops - become very complicated (contain many disjuncts). When intersecting an object with many disjuncts with other objects, the number of disjuncts in these other objects also increases quickly. As a result, the compile time is unnecessarily increased. This patch now drops the intersection with the assumed context to ensure we do not pay unnecessary compile time costs. With this patch we see -3.17% reduction in compile time for 3mm with default flags and -17.87% when compiling 3mm with -DPOLYBENCH_USE_C99_PROTO flag. We did not observe any regressions in LNT. Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12198 llvm-svn: 245617	2015-08-20 21:29:26 +00:00
Tobias Grosser	d83b8a83ec	Add option to control reduction detection llvm-svn: 245598	2015-08-20 19:08:11 +00:00
Tobias Grosser	40985016b2	Fix formatting llvm-svn: 245597	2015-08-20 19:08:05 +00:00
Johannes Doerfert	120de4be96	Simplify the SCoP creation and bookkeeping To avoid multiple exits and the resulting complicated conditions when creating a SCoP we now use the single hasFeasibleRuntimeContext() check to decide if a SCoP should be dismissed right after construction. If building runtime checks failed the assumed context is made infeasible, hence the optimized version will never be executed and the SCoP can be dismissed. llvm-svn: 245593	2015-08-20 18:30:08 +00:00
Johannes Doerfert	5d5b30649a	Check feasibility for the runtime check context wrt. the domain. If nothing is executed we can bail out early. Otherwise we can use the constraints that ensure at least one statement is executed for simplification. llvm-svn: 245585	2015-08-20 18:06:30 +00:00
Johannes Doerfert	4eed5bea54	Link ScopArrayInfo objects We will record if a SAI is the base of another SAI or derived from it. This will allow to reason about indirect base pointers later on and allows a clearer picture of indirection also in the SCoP dump. llvm-svn: 245584	2015-08-20 18:04:22 +00:00
Johannes Doerfert	43788c5783	Check for feasible runtime check context early Instead of generating code for an empty assumed context we bail out early. As the number of assumptions we generate increases this becomes more and more important. Additionally, this change will allow us to hide internal contexts that are only used in runtime checks e.g., a boundary context with constraints not suited for simplifications. llvm-svn: 245540	2015-08-20 05:58:56 +00:00
Roman Gareev	f2bd72e00d	Use isl_set_is_subset instead of isl_set_is_equal It helps to detect correct strides in case of parametric constraints of Stride in MemoryAccess::isStrideX. Reviewers: grosser llvm-svn: 245303	2015-08-18 16:12:05 +00:00
Tobias Grosser	cf9ebb63d6	Use schedule trees to compute dependences This patch changes Polly to compute the data-dependences on the schedule tree instead of a flat schedule representation. Calculating dependences directly on the schedule tree results in some good compile-time improvements (adi : -23.35%, 3mm : -9.57%), as the structure of the schedule can be exploited for increased efficiency. Earlier experiments with schedule tree based dependence analysis in Polly showed some compile-time regressions. These regressions arose due to the schedule tree based dependence analysis not taking into account the domain constraints of the schedule tree. As a result, the computed dependences were different and this difference caused in some cases the schedule optimizer to take a very long time. Since isl version fe865996 the schedule tree based dependence analysis takes domain constraints into account, which fixes the earlier compile-time issues. Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 245300	2015-08-18 15:05:29 +00:00
Johannes Doerfert	d86f2157e5	Add a field to the memory access class for a related value. The new field in the MemoryAccess allows us to track a value related to that access: - For real memory accesses the value is the loaded result or the stored value. - For straigt line scalar accesses it is the access instruction itself. - For PHI operand accesses it is the operand value. We use this value to simplify code which deduced information about the value later in the Polly pipeline and was known to be error prone. Reviewers: grosser, Meinsersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12062 llvm-svn: 245213	2015-08-17 10:58:17 +00:00
Tobias Grosser	c5bcf246d1	Fix Polly after SCEV port to new pass manager This fixes compilation after LLVM commit r245193. llvm-svn: 245211	2015-08-17 10:57:08 +00:00
Johannes Doerfert	45545ff782	Build the ScopStmt domain in-place. This will build the statement domains in-place, hence using the ScopStmt::Domain member instead of some intermediate isl_set. llvm-svn: 245179	2015-08-16 14:36:01 +00:00
Tobias Grosser	8a9c2353f9	Add -polly-context option to provide additional context information This option allows the user to provide additional information about parameter values as an isl_set. To specify that N has the value 1024, we can provide the context -polly-context='[N] -> {: N = 1024}'. llvm-svn: 245175	2015-08-16 10:19:29 +00:00
Michael Kruse	82a1c7de09	Make TempScopInfo a RegionPass This modifies the order in which Polly passes are executed. Assuming a function has two scops (A and B), the order before was: FunctionPassManager ScopDetection IndependentBlocks TempScopInfo for A and B RegionPassManager ScopInfo for A DependenceInfo for A IslScheduleOptimizer for A IslAstInfo for A CodeGeneration for A ScopInfo for B DependenceInfo for B IslScheduleOptimizer for B IslAstInfo for B CodeGeneration for B After this patch: FunctionPassManager ScopDetection IndependentBlocks RegionPassManager TempScopInfo for A ScopInfo for A DependenceInfo for A IslScheduleOptimizer for A IslAstInfo for A CodeGeneration for A TempScopInfo for B ScopInfo for B DependenceInfo for B IslScheduleOptimizer for B IslAstInfo for B CodeGeneration for B TempScopInfo for B might store information and references to the IR that CodeGeneration for A might modify. Changing the order ensures that the IR is not modified from the analysis of a region until code generation. Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12014 llvm-svn: 245091	2015-08-14 20:10:27 +00:00
Tobias Grosser	d46fd5ed95	Make the dimension sizes of in ScopArrayInfo available as isl_pw_affs This makes it easier to reason about the size of an array dimension with isl. llvm-svn: 244757	2015-08-12 15:27:16 +00:00
Johannes Doerfert	574182d394	Expose the SCEVAffinator and make it a member of a SCoP. This change has three major advantages: - The ScopInfo becomes smaller. - It allows to use the SCEVAffinator from outside the ScopInfo. - A member object allows state which in turn allows e.g., caching. Differential Revision: http://reviews.llvm.org/D9099 llvm-svn: 244730	2015-08-12 10:19:50 +00:00
Tobias Grosser	a77cea49d1	Always model PHI nodes in scop (if not in same nonaffine subregion) Before we only modeled PHI nodes if at least one incoming basic block was itself part of the region, now we always model them except if all of their operands are part of a single non-affine subregion which we model as a black-box. This change only affects PHI nodes in the entry block, that have exactly one incoming edge. Before this change, we did not model them and as a result code generation would not know how to code generate them. With this change, code generation can code generate them like any other PHI node. This issue was exposed by r244606. Before this change simplifyRegion would have moved these PHI nodes out of the SCoP, so we would never have tried to code generate them. We could implement this behavior again, but changing the IR after the scop has been modeled and transformed always adds a risk of us invalidating earlier analysis results. It seems more save and overall also more consistent to just model and handle this one-entry-edge PHI nodes like any other PHI node in the scop. Solution proposed by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 244721	2015-08-12 07:48:54 +00:00
Michael Kruse	046dde40c5	[Polly] Refactor buildScop Summary: The extracted function buildBBScopStmt will be needed later to be invoked individually on the region's exit block. Reviewers: grosser, jdoerfert Subscribers: jdoerfert, llvm-commits, pollydev Projects: #polly Differential Revision: http://reviews.llvm.org/D11878 llvm-svn: 244443	2015-08-10 13:01:57 +00:00
Tobias Grosser	dcc3b435ab	Optionally model read-only scalars Even though read-only accesses to scalars outside of a scop do not need to be modeled to derive valid transformations or to generate valid sequential code, but information about them is useful when we considering memory footprint analysis and/or kernel offloading. llvm-svn: 243981	2015-08-04 13:54:20 +00:00
Tobias Grosser	ac3a95f347	Do not add spaces into memory-access identifiers llvm-svn: 243889	2015-08-03 17:53:21 +00:00
Tobias Grosser	14e67e43a7	Add option -polly-view-only If set, this option instructs -view-scops and -polly-show to only print functions that contain the specified string in their name. This allows to look at the scops of a specific function in a large .ll file, without flooding the screen with .dot graphs. llvm-svn: 243882	2015-08-03 16:39:56 +00:00
Tobias Grosser	6213913244	Use the branch instruction to define the location of a PHI-node write We use the branch instruction as the location at which a PHI-node write takes place, instead of the PHI-node itself. This allows us to identify the basic-block in a region statement which is on the incoming edge of the PHI-node and for which the write access was originally introduced. As a result we can, during code generation, avoid generating PHI-node write accesses for basic blocks that do not preceed the PHI node without having to look at the IR again. This change fixes a bug which was introduced in r243420, when we started to explicitly model PHI-node reads and writes, but dropped some additional checks that where still necessary during code generation to not emit PHI-node writes for basic-blocks that are not on incoming edges of the original PHI node. Compared to the code before r243420 the new code does not need to inspect the IR any more and we also do not generate multiple redundant writes. llvm-svn: 243852	2015-08-02 16:17:41 +00:00
Tobias Grosser	d2d15a8c65	Dependences: Zero pad the schedule map The schedule map we derive from a schedule tree map may map statements into schedule spaces of different dimensionality. This change adds zero padding to ensure just a single schedule space is used and the translation from a union_map to an isl_multi_union_pw_aff does not fail. llvm-svn: 243849	2015-08-02 13:30:33 +00:00
Michael Kruse	471a5e3388	Move computations out of constructors It is common practice to keep constructors lightweight. The reasons include: - The vtable during the constructor's execution is set to the static type of the object, not to the vtable of the derived class. That is, method calls behave differently in constructors and ordinary methods. This way it is possible to call unimplemented methods of abstract classes, which usually results in a segmentation fault. - If an exception is thrown in the constructor, the destructor is not called, potentially leaking memory. - Code in constructors cannot be called in a regular way, e.g. from non-constructor methods of derived classes. - Because it is common practice, people may not expect the constructor to do more than initializing data and skip them when looking for bugs. Not all of these are applicable to LLVM (e.g. exceptions are disabled). This patch refactors out the computational work in the constructors of Scop and IslAst into regular init functions and introduces static create-functions as replacement. Differential revision: http://reviews.llvm.org/D11491 Reviewers: grosser, jdoerfert llvm-svn: 243677	2015-07-30 19:27:04 +00:00
Tobias Grosser	80e237bd53	Do not detect scops that are delinearized to arrays with "undef" size Such codes are not interesting to optimize and most likely never appear in the normal compilation flow. However, they show up during test case reduction with bugpoint and trigger -- without this change -- an assert in polly::MemoryAccess::foldAccess(). It is better to detect them in ScopDetection itself and just bail out. Contributed-by: Utpal Bora <cs14mtech11017@iith.ac.in> Reviewers: grosser Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11425 llvm-svn: 243515	2015-07-29 13:52:05 +00:00
Tobias Grosser	d20aba7047	Remove some dead code llvm-svn: 243465	2015-07-28 19:16:46 +00:00
Tobias Grosser	922452285a	Keep track of ScopArrayInfo objects that model PHI node storage Summary: When translating PHI nodes into memory dependences during code generation we require two kinds of memory. 'Normal memory' as for all scalar dependences and 'PHI node memory' to store the incoming values of the PHI node. With this patch we now mark and track these two kinds of memories, which we previously incorrectly marked as a single memory object. Being aware of PHI node storage makes code generation easier, as we do not need to guess what kind of storage a scalar reference requires. This simplifies the code nicely. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11554 llvm-svn: 243420	2015-07-28 14:53:44 +00:00
Johannes Doerfert	210b09aa21	Remove explicit heap allocation to fix and prevent memory leaks llvm-svn: 243245	2015-07-26 13:14:38 +00:00
Tobias Grosser	bb853c24b1	Fix formatting of recent alias-analysis commit llvm-svn: 243215	2015-07-25 12:31:03 +00:00
Johannes Doerfert	338b42c329	Removed redundant alias checks generated during run time. As specified in PR23888, run-time alias check generation is expensive in terms of compile-time. This reduces the compile time by computing minimal/maximal access only once for each base pointer Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 243024	2015-07-23 17:04:54 +00:00
Tobias Grosser	808cd69a92	Use schedule trees to represent execution order of statements Instead of flat schedules, we now use so-called schedule trees to represent the execution order of the statements in a SCoP. Schedule trees make it a lot easier to analyze, understand and modify properties of a schedule, as specific nodes in the tree can be choosen and possibly replaced. This patch does not yet fully move our DependenceInfo pass to schedule trees, as some additional performance analysis is needed here. (In general schedule trees should be faster in compile-time, as the more structured representation is generally easier to analyze and work with). We also can not yet perform the reduction analysis on schedule trees. For more information regarding schedule trees, please see Section 6 of https://lirias.kuleuven.be/handle/123456789/497238 llvm-svn: 242130	2015-07-14 09:33:13 +00:00
Tobias Grosser	16c4403a91	Make non-affine statement names isl compatible Named isl sets can generally have any name if they remain within Polly, but only certain strings can be parsed by isl. The new names we create ensure that we can always copy-past isl strings from Polly to other isl tools, e.g. for debugging. llvm-svn: 241787	2015-07-09 07:31:45 +00:00
Tobias Grosser	1b13ddea50	Add first support to delinearize A[t%2][i][j] This is very preliminary support, but it seems to work for the most common case. When observing more/different test cases, we can work on generalizing this. llvm-svn: 240955	2015-06-29 14:44:22 +00:00
Tobias Grosser	23bceb2eec	Fix delinearization after it's move to ScalarEvoltion llvm-svn: 240954	2015-06-29 14:44:17 +00:00
Tobias Grosser	1a07e19180	Increase the dependence-analysis compute out As Polly got a lot faster after the small-integer-optimization imath patch, we now increase the compute out to optimize larger kernels. This should also expose additional slow-downs for us to address. In LNT this gives us a 3.4x speedup on 3mm, at a cost of a 2x increase in compile time (now 0.77s). reg_detect, oorafft and adi also show some compile time increases. This compile time cost is divided between more time in isl and more time in LLVM's backends due to increased code size (versioning and tiling). llvm-svn: 240840	2015-06-26 21:58:42 +00:00
Tobias Grosser	f54bb7743a	Drop divs before adding array-out-of-bounds assumptions In case we have modulo operations in the access function (supported since r240518), the assumptions generated to ensure array accesses remain within bounds can contain existentially quantified dimensions which results in more complex and more difficult to handle integer sets. As a result LNT's linpack benchmark started to fail due to excessive compile time. We now just drop the existentially quantified dimensions. This should be generally save, but may result in less precise assumptions which may consequently make us fall back to the original (unoptimized) code more often. In practice, these cases probably do not appear to often. I had difficulties to extract a good test case, but fortunately our LNT bots cover this one well. llvm-svn: 240775	2015-06-26 12:09:28 +00:00
Tobias Grosser	af4e809ca6	Remove code for scalar and PHI to array translation This removes old code that has been disabled since several weeks and was hidden behind the flags -disable-polly-intra-scop-scalar-to-array=false and -polly-model-phi-nodes=false. Earlier, Polly used to translate scalars and PHI nodes to single element arrays, as this avoided the need for their special handling in Polly. With Johannes' patches adding native support for such scalar references to Polly, this code is not needed any more. After this commit both -polly-prepare and -polly-independent are now mostly no-ops. Only a couple of simple transformations still remain, but they are scheduled for removal too. Thanks again to Johannes Doerfert for his nice work in making all this code obsolete. llvm-svn: 240766	2015-06-26 07:31:18 +00:00
Tobias Grosser	50165ffdee	Add support for srem instruction Remainder operations with constant divisor can be modeled as quasi-affine expression. This patch adds support for detecting and modeling them. We also add a test that ensures they are correctly code generated. This patch was extracted from a larger patch contributed by Johannes Doerfert in http://reviews.llvm.org/D5293 llvm-svn: 240518	2015-06-24 04:13:29 +00:00
Chandler Carruth	afa4ea7121	[PM/AA] Update Polly for r239886 which removed UnknownSize from AliasAnalysis by getting it from its newly canonical home of MemoryLocation::UnknownSize. llvm-svn: 239890	2015-06-17 08:29:32 +00:00
Michael Kruse	dd6011c494	Orthography: substracting -> subtracting This is a test commit for being granted commit-after-approval access. llvm-svn: 239725	2015-06-15 10:52:28 +00:00
Tobias Grosser	d8308fbed9	Avoid the use of std::map emplace This functionality does not yet seem to exist on all buildbots. llvm-svn: 239133	2015-06-05 05:52:15 +00:00
Tobias Grosser	d5d93ecd03	Use owning pointers to avoid memory leaks This fixes a memory leak caused by us not freeing the expanded region nodes. llvm-svn: 239061	2015-06-04 17:59:54 +00:00
Tobias Grosser	a5c092d844	Store ArrayShape in shared_ptr and MemAccs as actual objects This fixes two more memory leaks. llvm-svn: 239050	2015-06-04 16:03:16 +00:00
Tobias Grosser	ff22e9e5f6	Store comparison objects as objects, not pointers This fixes a memory leak. If we store the actual objects we can not forget to free them. llvm-svn: 239033	2015-06-04 11:44:09 +00:00
Chandler Carruth	bdb4a39ad8	Update Polly for the AA/MemoryLocation refactoring. llvm-svn: 239008	2015-06-04 03:49:46 +00:00
Tobias Grosser	d6a50b3a1e	Add DEBUG output to -polly-scops pass llvm-svn: 238644	2015-05-30 06:26:21 +00:00
Tobias Grosser	b2f399264d	Update isl to 93b8e43d This update brings mostly interface cleanups, but also fixes two bugs in imath (a memory leak, some undefined behavior). llvm-svn: 238422	2015-05-28 13:32:11 +00:00
Tobias Grosser	57411e3fc6	Drop const in front of iterator David Blaikie: "find returns an iterator by value, so it's just added complexity/strangeness to then use reference lifetime extension to give it the same semantics as if you'd used a value type instead of a reference type." llvm-svn: 238294	2015-05-27 06:51:34 +00:00
Tobias Grosser	7c3bad52dd	Use value semantics for list of ScopStmt(s) instead of std::owningptr David Blaike suggested this as an alternative to the use of owningptr(s) for our memory management, as value semantics allow to avoid the additional interface complexity caused by owningptr while still providing similar memory consistency guarantees. We could also have used a std::vector, but the use of std::vector would yield possibly changing pointers which currently causes problems as for example the memory accesses carry pointers to their parent statements. Such pointers should not change. Reviewer: jblaikie, jdoerfert Differential Revision: http://reviews.llvm.org/D10041 llvm-svn: 238290	2015-05-27 05:16:57 +00:00
Tobias Grosser	eeb9f3ce15	Drop unnecessary 'this->' pointers llvm-svn: 238257	2015-05-26 21:37:31 +00:00
Tobias Grosser	2d7611f45e	Remove unnecessary indirection through SCEV llvm-svn: 238092	2015-05-23 05:58:30 +00:00
Tobias Grosser	ab6714464a	Use unique_ptr to clarify ownershop of ScopArrayInfoMap llvm-svn: 238091	2015-05-23 05:58:27 +00:00
Tobias Grosser	679dfafd33	Use unique_ptr to clarify ownership of ScopStmt llvm-svn: 238090	2015-05-23 05:14:09 +00:00
Tobias Grosser	ac60f4594f	Enable scalar and PHI code generation for Polly The feature itself has been committed by Johannes in r238070. As this is the way forward, we now enable it to ensure we get test coverage. Thank you Johannes for this nice work! llvm-svn: 238088	2015-05-23 03:34:41 +00:00
Johannes Doerfert	ecff11dcfb	Add scalar and phi code generation To reduce compile time and to allow more and better quality SCoPs in the long run we introduced scalar dependences and PHI-modeling. This patch will now allow us to generate code if one or both of those options are set. While the principle of demoting scalars as well as PHIs to memory in order to communicate their value stays the same, this allows to delay the demotion till the very end (the actual code generation). Consequently: - We __almost__ do not modify the code if we do not generate code for an optimized SCoP in the end. Thus, the early exit as well as the unprofitable option will now actually preven us from introducing regressions in case we will probably not get better code. - Polly can be used as a "pure" analyzer tool as long as the code generator is set to none. - The original SCoP is almost not touched when the optimized version is placed next to it. Runtime regressions if the runtime checks chooses the original are not to be expected and later optimizations do not need to revert the demotion for that part. - We will generate direct accesses to the demoted values, thus there are no "trivial GEPs" that select the first element of a scalar we demoted and treated as an array. Differential Revision: http://reviews.llvm.org/D7513 llvm-svn: 238070	2015-05-22 23:43:58 +00:00
Tobias Grosser	1b6ea573f2	Replace low-level constraint building with higher level functions Instead of explicitly building constraints and adding them to our maps we now use functions like map_order_le to add the relevant information to the maps. llvm-svn: 237934	2015-05-21 19:02:44 +00:00
Tobias Grosser	a8512b1784	Add diagnostic for unsigned integer comparisions llvm-svn: 237800	2015-05-20 15:37:11 +00:00
Tobias Grosser	9a6bef8ba4	Drop redundant condition This condition was accidentally introduced in r211875. llvm-svn: 237796	2015-05-20 15:04:27 +00:00
Tobias Grosser	49ad36ca16	Add printing and testing to ScopArrayInfo Being here, we extend the interface to return the element type and not a pointer to the element type. We also provide a function to get the size (in bytes) of the elements stored in this array. We currently still store the element size as an innermost dimension in ScopArrayInfo, which is somehow inconsistent and should be addressed in future patches. llvm-svn: 237779	2015-05-20 08:05:31 +00:00
Tobias Grosser	4ac75ba449	Remove need for separate loop index counter Suggested-by: Johannes Doerfert llvm-svn: 237441	2015-05-15 12:24:12 +00:00
Tobias Grosser	e29d31ce5a	Simplify string formatting Suggested-by: Johannes Doerfert llvm-svn: 237440	2015-05-15 12:24:09 +00:00
Tobias Grosser	6f48e0fd2b	Give each memory access a reference ID This reference ID is handy for use cases where we need to identify individual memory accesses (e.g. to modify their access functions). This is a reworked version of a patch originally developed by Yabin Hu as part of his summer of code project. llvm-svn: 237431	2015-05-15 09:58:32 +00:00
Tobias Grosser	cd524dc51d	Add explicit #includes for used isl features llvm-svn: 236931	2015-05-09 09:36:38 +00:00
Tobias Grosser	ba0d09227c	Sort include directives Upcoming revisions of isl require us to include header files explicitly, which have previously been already transitively included. Before we add them, we sort the existing includes. Thanks to Chandler for sort_includes.py. A simple, but very convenient script. llvm-svn: 236930	2015-05-09 09:13:42 +00:00
Tobias Grosser	e71ed19841	Add iterators for the ArrayInfo objects of the scop This patch also changes the implementation of the ArrayInfoMap to a MapVector which will ensure that iterating over the list of ArrayInfo objects gives predictable results. The single loop that currently enumerates the ArrayInfo objects only frees the individual objectes, hence a possibly changing iteration order does not affect the outcome. The added robustness is for future users of this interface. llvm-svn: 236583	2015-05-06 10:05:20 +00:00
Johannes Doerfert	8983031b5e	[FIX] Invalid recognition of multidimensional access In the lnt benchmark MultiSource/Benchmarks/MallocBench/gs/gs with scalar and PHI modeling we detected the multidimensional accesses with sizes variant in the SCoP. This will check the sizes for validity. llvm-svn: 236395	2015-05-03 16:03:01 +00:00
Tobias Grosser	a63b7cee66	Adding debug location information to Polly's JSCOP and dot exports This change adds location information for the detected regions in Polly when the required debug information is available. The JSCOP output format is extended with a "location" field which contains the information in the format "source.c:start-end" The dot output is extended to contain the location information for each nested region in the analyzed function. As part of this change, the existing getDebugLocation function has been moved into lib/Support/ScopLocation.cpp to avoid having to include polly/ScopDetectionDiagnostics.h. Differential Revision: http://reviews.llvm.org/D9431 Contributed-by: Roal Jordans <r.jordans@tue.nl> llvm-svn: 236393	2015-05-03 05:21:36 +00:00
Duncan P. N. Exon Smith	ddf3a0ef38	Update polly for LLVM rename of debug info metadata with DI* prefix Ran the same rename-md-di-prefix.sh script attached to PR23080 as in LLVM r236120 and CFE r236121. llvm-svn: 236127	2015-04-29 17:02:14 +00:00
Johannes Doerfert	8f8af43fef	Use all available range information for parameters In the following even full-range information will help to avoid runtime checks for wrapping integers, hence we enable it now. llvm-svn: 235823	2015-04-26 20:07:21 +00:00
Johannes Doerfert	d5d8f67dc5	Use the original no-wrap flags for normalized AddRecs llvm-svn: 235822	2015-04-26 19:55:21 +00:00
Tobias Grosser	654af8f62f	Fix some formatting issues llvm-svn: 235381	2015-04-21 11:42:01 +00:00
Tobias Grosser	5483931117	Rename 'scattering' to 'schedule' In Polly we used both the term 'scattering' and the term 'schedule' to describe the execution order of a statement without actually distinguishing between them. We now uniformly use the term 'schedule' for the execution order. This corresponds to the terminology of isl. History: CLooG introduced the term scattering as the generated code can be used as a sequential execution order (schedule) or as a parallel dimension enumerating different threads of execution (placement). In Polly and/or isl the term placement was never used, but we uniformly refer to an execution order as a schedule and only later introduce parallelism. When doing so we do not talk about about specific placement dimensions. llvm-svn: 235380	2015-04-21 11:37:25 +00:00
Tobias Grosser	b69d16ab24	Pass an actual schedule to the isl_union_access [NFC] This change is a step towards using a single isl_schedule object throughout Polly. At the moment the schedule is just constructed from the flat isl_union_map that defines the schedule. Later we will obtain it directly from the scop and potentially obtain a schedule with a non-trivial internal structure that will allow faster dependence analysis. llvm-svn: 235378	2015-04-21 11:01:34 +00:00
Tobias Grosser	93fdc12358	Move to isl's new dependence analysis interface [NFC] isl_union_map_compute_flow() has been replaced by isl_union_access_info_compute_flow(). This change does not intend to change funcitonality, yet. However, it will allow us to pass in subsequent changes schedule trees to the dependence analysis instead of flat schedules. This should speed up dependence analysis for important cases significantly. llvm-svn: 235373	2015-04-21 08:47:29 +00:00
Duncan P. N. Exon Smith	f06f583f1f	Fix polly build after LLVM r235327 llvm-svn: 235343	2015-04-20 20:28:43 +00:00
Tobias Grosser	6f8fd8e98f	Dependences: Allow to disable dependences computeout llvm-svn: 235271	2015-04-19 07:07:26 +00:00
Tobias Grosser	4b6aa6eb8e	Clear InsnToMemAcc map in ScopDetection after each runOnFunction Otherwise, instructions in different functions that share the same pointer (due to earlier modifications), might get assigned incorrect memory access information (belonging to instructions in previous functions), which can result in arbitrary memory corruption and assertion failures. This fixes llvm.org/PR23160 and possibly also llvm.org/PR23167. Note: InsnToMemAcc is a global variable that should never have existed in the first place. We will clean this up in a subsequent patch. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> Debugged-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 235254	2015-04-18 11:01:25 +00:00
David Blaikie	556ffb7806	[opaque pointer types] Explicit non-pointer type for call expressions (migration for recent LLVM change to textual IR for calls) llvm-svn: 235146	2015-04-16 23:24:52 +00:00
Johannes Doerfert	f8206cf6d4	Allow loops in non-affine subregions -- SCoP Modeling This will allow the ScopInfo to build the polyhedral representation for non-affine regions that contain loops. Such loops are basically not visible in the SCoP representation. Accesses that are variant in such loops are therefor represented as non-affine accesses. Differential Revision: http://reviews.llvm.org/D8153 llvm-svn: 234713	2015-04-12 22:58:40 +00:00
Johannes Doerfert	f3e98f44e3	Allow loops in non-affine subregions -- SCoP Detection This will allow the ScopDetection to detect non-affine regions that contain loops. All loops contained will be collected and are accessible to later passes in order to adjust the access functions. As the loops are non-affine and will not be part of the polyhedral representation later, all accesses that are variant in these loops have to be over approximated as non-affine accesses. They are therefore handled the same way as other non-affine accesses. Additionally, we do not count non-affine loops for the profitability heuristic, thus a region with only a non-affine loop will only be detected if the general detection of loop free regions is enabled. Differential Revision: http://reviews.llvm.org/D8152 llvm-svn: 234711	2015-04-12 22:52:20 +00:00
Duncan P. N. Exon Smith	7cf5fedbad	Fix polly build after LLVM r234263 llvm-svn: 234266	2015-04-07 00:29:05 +00:00
Tobias Grosser	f4c24b29e2	Make run-time alias check generation deterministic llvm-svn: 234117	2015-04-05 13:11:54 +00:00
Duncan P. N. Exon Smith	c7ba53f9a9	DebugInfo: Use the new DebugLoc API from r233573 This should fix the build [1] after r233599 removed the old API. [1]: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly-parallel-fast/builds/5265 llvm-svn: 233605	2015-03-30 21:47:17 +00:00
Tobias Grosser	619190d5a7	Delinearization of expressions that contain array size parameters This allows us to delinerize code such as: A[][n] for (i for (j A[i][n-j-1] = ... which would previously have been delinearize to an access A[i+1][-j-1]. To recover the correct access we apply the piecewise expression: { A[i][j] -> A[i-1][i+N]: i < 0; A[i][j] -> A[i][i]: i >= 0} This approach generalizes to higher dimensions. llvm-svn: 233566	2015-03-30 17:22:28 +00:00
Tobias Grosser	6e6c7e014a	Add forgotten underscore llvm-svn: 233534	2015-03-30 12:22:39 +00:00
Tobias Grosser	4f663aae09	Add some missing isl prefixes llvm-svn: 233533	2015-03-30 11:52:59 +00:00
Tobias Grosser	aa660a9957	Gist-simplify access relations in the context of domain constraints This simplifies already one test case and is needed for upcoming improvements to our delinearization. llvm-svn: 233507	2015-03-30 00:07:50 +00:00
Tobias Grosser	eefe941619	Avoid nullptr dereference in the ScopGraphPrinter Reported-by: http://buildd-clang.debian.net/scan-build llvm-svn: 233505	2015-03-29 22:00:54 +00:00
Johannes Doerfert	be40996cfe	Strip constant factors from SCoP parameters This will strip the constant factor of a parameter befor we add it to the SCoP. As a result the access functions are simplified, e.g., for the attached test case. llvm-svn: 233501	2015-03-29 20:45:09 +00:00
Tobias Grosser	715007216c	Bail out if too many alias run-time-check comparisions would be needed This fixes a crash observed in ffmpeg. llvm-svn: 233480	2015-03-28 15:11:14 +00:00
Tobias Grosser	50d4e2ebf1	Use an early return on failure llvm-svn: 233479	2015-03-28 14:50:32 +00:00
Tobias Grosser	6e084ccda3	Shorten user report message slightly llvm-svn: 231633	2015-03-09 06:59:16 +00:00
Johannes Doerfert	e4bd53bd0c	[FIX] Use the correct functions to extract the LB/UB from a range The current tests will continue to cover this code and more will be added when non-affine loops are supported. llvm-svn: 231606	2015-03-08 19:49:50 +00:00
Tobias Grosser	6973cb6ec7	Enable delinearization by default - second try After having fixed the LNT bugs in the previous commits, lets reenable the delinearization. llvm-svn: 231595	2015-03-08 15:21:18 +00:00
Johannes Doerfert	6a4d81c1f6	Add end user report message for unprofitable regions [NFC] llvm-svn: 231593	2015-03-08 15:11:50 +00:00
Tobias Grosser	9d3c0b655a	Revert "Enable delinearization by default" This reverts commit 231590. Apparantly we have three more issues left in oggenc, smg2000 and linpack. llvm-svn: 231591	2015-03-08 12:57:31 +00:00
Tobias Grosser	16c701d0c9	Enable delinearization by default The performance test case just committed was the last open issue I was aware of. We enable this by default to increase test coverage and to possibly trigger reports of issues yet unknown. llvm-svn: 231590	2015-03-08 12:30:37 +00:00
Tobias Grosser	140b394e10	Fix compilation after 'Make DataLayout Non-Optional in the Module' commit llvm-svn: 231362	2015-03-05 09:48:20 +00:00
Tobias Grosser	1fa434992b	Fix leftover Dependences.cpp -> DependenceInfo.cpp llvm-svn: 231355	2015-03-05 06:52:42 +00:00
Johannes Doerfert	7e6424ba5a	Create a dependence struct to hold dependence information for a SCoP. The new Dependences struct in the DependenceInfo holds all information that was formerly part of the DependenceInfo. It also provides the same interface for the user to access this information. This is another step to a more general ScopPass interface that does allow multiple SCoPs to be "in flight". llvm-svn: 231327	2015-03-05 00:43:48 +00:00
Johannes Doerfert	f6557f98a2	Rename the Dependences pass to DependenceInfo [NFC] We rename the Dependences pass to DependenceInfo as a first step to a caching pass policy. The new DependenceInfo pass will later provide "Dependences" for a SCoP. To keep consistency the test folder is renamed too. llvm-svn: 231308	2015-03-04 22:43:40 +00:00
Johannes Doerfert	3f21e27ad3	Simplify dropConstantScheduleDims [NFC] llvm-svn: 231304	2015-03-04 22:23:21 +00:00
Johannes Doerfert	d239aac2ee	Do not model scalar accesses in non-affine subregions If a scalar was defined and used only in a non-affine subregion we do not need to model the accesses. However, if the scalar was defined inside the region and escapes the region we have to model the access. The same is true if the scalar was defined outside and used inside the region. llvm-svn: 230960	2015-03-02 14:06:01 +00:00
Johannes Doerfert	2495cfe01d	[Refactor] Simplify ScopPass interface llvm-svn: 230899	2015-03-01 18:43:50 +00:00
Johannes Doerfert	3fe584d64f	[Refactor] Add a Scop & as argument to printScop This is the first step in the interface simplification. llvm-svn: 230897	2015-03-01 18:40:25 +00:00
Johannes Doerfert	0200f2507e	[Cleanup] Remove unused passes llvm-svn: 230896	2015-03-01 18:38:35 +00:00
Johannes Doerfert	a36842f837	Allow non-affine control flow by default With the patches r230325, r230329 and r230340 we can handle non-affine control flow in (loop-free) subregions. As all LLVM test-suite tests pass and we get ~20% more non-trivial SCoPs, we activate it now by default. llvm-svn: 230624	2015-02-26 11:09:24 +00:00
Johannes Doerfert	65971a8e72	Omit a default argument llvm-svn: 230337	2015-02-24 16:02:16 +00:00
Johannes Doerfert	6cad9c4746	[FIX] Some comments llvm-svn: 230335	2015-02-24 16:00:29 +00:00
Johannes Doerfert	ff9d1980a7	Allow non-affine control flow -- SCoP Modeling This allows us to model non-affine regions in the SCoP representation. SCoP statements can now describe either basic blocks or non-affine regions. In the latter case all accesses in the region are accumulated for the statement and write accesses, except in the entry, have to be marked as may-write. Differential Revision: http://reviews.llvm.org/D7846 llvm-svn: 230329	2015-02-24 12:00:50 +00:00
Johannes Doerfert	e70449400f	Add ScalarEvolution bounds to non-affine access functions llvm-svn: 230328	2015-02-24 11:58:30 +00:00
Johannes Doerfert	ba65c1672a	Allow non-affine control flow -- SCoP Detection With this patch we allow the SCoP detection to detect regions as SCoPs which have non-affine control flow inside. All non-affine regions are tracked and later accessible to the ScopInfo. As there is no real difference, non-affine branches as well as floating point branches are covered (and both called non-affine control flow). However, the detection is restricted to overapproximate only loop free regions. llvm-svn: 230325	2015-02-24 11:45:21 +00:00
Johannes Doerfert	4f8ac3d123	Use ScalarEvolution to create tight bounds on the parameters llvm-svn: 230230	2015-02-23 16:15:51 +00:00
Johannes Doerfert	fb79a96d42	[NFC] Unify the use of Context.CurRegion llvm-svn: 230222	2015-02-23 14:18:28 +00:00
Johannes Doerfert	ca08c44a46	Remove leftover code llvm-svn: 230140	2015-02-21 16:18:28 +00:00
Tobias Grosser	f567e1aab7	Fix formatting llvm-svn: 229920	2015-02-19 22:16:12 +00:00
Johannes Doerfert	3f1c285294	[REFACTOR] Simplify the SCoP detection interface a bit llvm-svn: 229879	2015-02-19 18:11:50 +00:00
Tobias Grosser	d4f232f03c	Drop left over code from FinalRead statement removal. We removed FinalRead statements in r152319. llvm-svn: 229867	2015-02-19 16:24:57 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
Johannes Doerfert	b6755bbd5f	Overaproximate divisions in for the alias checks. Alias checks might become costly if there are divisions that complicate the description of the accessed locations. By overaproximating them we get fairly accurate results without the huge compile time cost. llvm-svn: 229252	2015-02-14 12:00:06 +00:00
Tobias Grosser	a906ee754d	Drop an assert and XFAIL two test cases This gets the buildbot green to avoid further emails. Johannes will fix this later in the evening. llvm-svn: 228862	2015-02-11 18:46:33 +00:00
Johannes Doerfert	7ceb040213	Add early exits for SCoPs we did not optimize This allows us to skip ast and code generation if we did not optimize a SCoP and will not generate parallel or alias annotations. The initial heuristic to exit is simple but allows improvements later on. All failing test cases have been modified to disable early exit, thus to keep their coverage. Differential Revision: http://reviews.llvm.org/D7254 llvm-svn: 228851	2015-02-11 17:25:09 +00:00
Johannes Doerfert	1f87f485b1	Model scalar writes with uses outside the SCoP These write are important as they will force the scheduling and code generation of an otherwise trivial statement and also impose an order of execution needed to guarantee the correct final value for a scalar in a loop. Added test case modeled after ClamAV/clamscan. llvm-svn: 228847	2015-02-11 17:02:52 +00:00
Johannes Doerfert	b9d18887d7	Allow signed devision in access functions llvm-svn: 228833	2015-02-11 14:54:50 +00:00
Johannes Doerfert	97235c691a	[FIX] Special case for branch users of scalar values llvm-svn: 228832	2015-02-11 14:52:52 +00:00
Johannes Doerfert	be9c91173f	[Refactor] Use only one BlockGenerator for a SCoP This change has two main purposes: 1) We do not use a static interface to hide an object we create and destroy for every basic block we copy. 2) We allow the BlockGenerator to store information between calls to the copyBB method. This will ease scalar/phi code generation later on. While a lot of method signatures were changed this should not cause any real behaviour change. Differential Revision: http://reviews.llvm.org/D7467 llvm-svn: 228443	2015-02-06 21:39:31 +00:00
Johannes Doerfert	0ff23ec544	Model PHI nodes without demoting them This allows us to model PHI nodes in the polyhedral description without demoting them. The modeling however will result in the same accesses as the demotion would have introduced. Differential Revision: http://reviews.llvm.org/D7415 llvm-svn: 228433	2015-02-06 20:13:15 +00:00
Johannes Doerfert	4f33706b53	[NFC] Remove some unnecessary local objects llvm-svn: 227844	2015-02-02 19:41:30 +00:00
Johannes Doerfert	9282076ece	[NFC] Drop the "scattering" tuple name llvm-svn: 227801	2015-02-02 13:45:54 +00:00
Johannes Doerfert	9e3a5db000	[FIX] Debug build + instrinsic handling The ignored intrinsics needed to be ignored in three other places as well. Tests and lnt pass now. llvm-svn: 227092	2015-01-26 15:55:54 +00:00
Johannes Doerfert	3f500fa2f6	Support for math/misc intrinsics The support is currently limited as we only allow them in the input but do not emit them in the transformed SCoP due to the possible semantic changes. Differential Revision: http://reviews.llvm.org/D5225 llvm-svn: 227054	2015-01-25 18:07:30 +00:00
Chandler Carruth	f557987b15	[PM] Update Polly following LLVM r226373 which refactors LoopInfo in preparation for the new pass manager. llvm-svn: 226374	2015-01-17 14:16:56 +00:00
David Peixotto	dc0a11c21f	Fix maxLoopDepth computation in ScopInfo The max loop depth was incorrectly computed for scops that contain a block from a loop but do not contain the entire loop. We need to check that the full loop is contained in the region when computing the max loop depth. These scops occur when a region containing an inner loop is expanded to include some blocks from the outer loop, but it cannot be fully expanded to contain the outer loop because the region containing the outer loop is invalid. Differential Revision: http://reviews.llvm.org/D6913 llvm-svn: 225812	2015-01-13 18:31:55 +00:00
Tobias Grosser	bfbc3690bb	Add experimental support for unsigned expressions This support is still incomplete and consequently hidden behind a switch that needs to be enabled. One problem is ATM that we incorrectly interpret very large unsigned values as negative values even if used in an unsigned comparision. llvm-svn: 225480	2015-01-09 00:01:33 +00:00
Tobias Grosser	55bc4c0767	Add support for pointer types in expressions llvm-svn: 225464	2015-01-08 19:26:53 +00:00
Tobias Grosser	021eaef044	Fix a crash in our error reporting AF = dyn_cast<SCEVAddRecExpr>(Pair.second) may be NULL for some SCEVs that we do not support. When reporting the error we still want to pass a pointer that is known to always be non-NULL. I do not yet have a test case for this, unfortunately. llvm-svn: 225461	2015-01-08 19:03:10 +00:00
Tobias Grosser	3f29619614	Drop all constant scheduling dimensions Schedule dimensions that have the same constant value accross all statements do not carry any information, but due to the increased dimensionality of the schedule cost compile time. To not pay this cost, we remove constant dimensions if possible. llvm-svn: 225067	2015-01-01 23:01:11 +00:00
Andreas Simbuerger	6492b700c3	(diagnostics) Fix typo. Thanks Albert Cohen. llvm-svn: 224584	2014-12-19 16:49:39 +00:00
Tobias Grosser	11e3873516	Dead code elimination: Update dependences after eliminating code Without updating dependences we may lose implicit transitive dependences for which all explicit dependences have gone through the statement iterations we have just eliminated. No test case. We should probably implement a -verify-dependences option. This fixes llvm.org/PR21227 llvm-svn: 224459	2014-12-17 21:13:55 +00:00
Tobias Grosser	05fdab9362	Simplify computation of reduction dependences This simplifies the construction of the input for the reduction dependence computation and at the same time removes an assumption that expects the schedule to be of 2D + 1 form (the odd dimensions giving textual order, the even dimensions the loop iterations). llvm-svn: 223621	2014-12-07 22:51:45 +00:00
Johannes Doerfert	305fed96e6	Drop Cloog support This commit drops the Cloog support for Polly. The scripts and documentation are changed to only use isl as prerequisity. In the code all Cloog specific parts have been removed and all relevant tests have been ported to the isl backend when it was created. llvm-svn: 223141	2014-12-02 19:26:58 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop the support the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Hongbin Zheng	c5447f4c3b	Do not incorrectly set the inverted flag. In TempScopInfo::buildCondition we extract the conditions to guard the BB in addition of loop bounds. This means we should only consider the conditions in the paths (in CFG) that do not contain cycles (loops). At the same time, we set the invert flag if the FalseBB of the current branch dominates our target BB to indicate that we reach the target BB with an inverted condition from the current branch. In this case, the path from the FalseBB contains a cycle if the FalseBB is the target of a backedge. The conditions implied by such a path should not be consider. We can identify such a case by checking if the TrueBB also dominates our target BB, which means we can also reach our target BB from the TrueBB, without going through the backedge. llvm-svn: 222907	2014-11-28 03:26:06 +00:00
Tobias Grosser	7b50beebe4	Assume GetElementPtr offsets to be inbounds In case a GEP instruction references into a fixed size array e.g., an access A[i][j] into an array A[100x100], LLVM-IR does not guarantee that the subscripts always compute values that are within array bounds. We now derive the set of parameter values for which all accesses are within bounds and add the assumption that the scop is only every executed with this set of parameter values. Example: void foo(float A[][20], long n, long m { for (long i = 0; i < n; i++) for (long j = 0; j < m; j++) A[i][j] = ... This loop yields out-of-bound accesses if m is at least 20 and at the same time at least one iteration of the outer loop is executed. Hence, we assume: n <= 0 or m <= 20. Doing so simplifies the dependence analysis problem, allows us to perform more optimizations and generate better code. TODO: The location where the GEP instruction is executed is not necessarily the location where the memory is actually accessed. As a result scanning for GEP[s] is imprecise. Even though this is not a correctness problem, this imprecision may result in missed optimizations or non-optimal run-time checks. In polybench where this mismatch between parametric loop bounds and fixed size arrays is common, we see with this patch significant reductions in compile time (up to 50%) and execution time (up to 70%). We see two significant compile time regressions (fdtd-2d, jacobi-2d-imper), and one execution time regression (trmm). Both regressions arise due to additional optimizations that have been enabled by this patch. They can be addressed in subsequent commits. http://reviews.llvm.org/D6369 llvm-svn: 222754	2014-11-25 10:51:12 +00:00
Tobias Grosser	983997ff6a	Fix typo llvm-svn: 222559	2014-11-21 19:39:38 +00:00
Johannes Doerfert	9c14737b9b	Simplify the SCEVAffinator [NFC] llvm-svn: 222360	2014-11-19 15:36:59 +00:00
Tobias Grosser	c98a8fccf7	Use nullptr instead of '0' for pointers llvm-svn: 221982	2014-11-14 11:12:31 +00:00
Johannes Doerfert	80ef110cca	[Refactor][NFC] Generalize the creation of ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D6031 llvm-svn: 221512	2014-11-07 08:31:31 +00:00
Johannes Doerfert	21aa3dc014	[FIX] Move the statistic code to fix the build. llvm-svn: 221038	2014-11-01 01:30:11 +00:00
Johannes Doerfert	5ad8a6a588	Remove the LoopBounds from the TempScop class. We will use ScalarEvolution in the ScopInfo.cpp to get the loop trip count, not cache it in the TempScop object. Differential Revision: http://reviews.llvm.org/D6070 llvm-svn: 221035	2014-11-01 01:14:56 +00:00
Johannes Doerfert	e3da05ac32	Remove the MaxLoopDepth attribute from the TempScop class Now MaxLoopDepth only lives in Scops not in TempScops anymore. This is the first part of a series of changes to make TempScops obsolete. Differential Revision: http://reviews.llvm.org/D6069 llvm-svn: 221026	2014-11-01 00:12:13 +00:00
Johannes Doerfert	75bd66e51d	[Refactor][NFC] Remove unused argument. llvm-svn: 221016	2014-10-31 23:16:02 +00:00
Johannes Doerfert	7c494217f3	[Refactor][NFC] Map basic blocks to SCoP statements. This will simplify the construction of domains and the modeling of PHI's. llvm-svn: 221015	2014-10-31 23:13:39 +00:00
Tobias Grosser	a5605d34b9	Fix typo llvm-svn: 220870	2014-10-29 19:58:28 +00:00
Daniel Jasper	8a1dea0ced	Fix polly formatting after recent clang-format behavior change. llvm-svn: 220711	2014-10-27 19:45:31 +00:00
Tobias Grosser	1d5c6c71d1	Fix typo llvm-svn: 220446	2014-10-22 23:26:48 +00:00
Tobias Grosser	f084edd0b4	Use braces in multi-statement DEBUG() code [NFC] By adding braces into the DEBUG statement we can make clang-format format code such as: DEBUG(stmt1(); stmt2()) as multi-line code: DEBUG({ stmt1(); stmt2(); }); This makes control-flow in debug statements easier to read. llvm-svn: 220441	2014-10-22 23:00:03 +00:00
David Peixotto	8da2b93d9f	Change the RegionSet type to a SetVector This patch changes the RegionSet type used in ScopDetection from a std::set to a llvm::SetVector. The reason for the change is to ensure deterministic output when printing the result of the analysis. We had a windows buildbot failure for the modified test because the output was coming in a different order. Only one test case needed to be modified for this change. We could use CHECK-DAG directives instead of CHECK in the analysis test cases because the actual order of scops does not matter, but I think that change should be done in a separate patch that modifies all the appliciable tests. I simply modified the test to reflect the expected deterministic output. Differential Revision: http://reviews.llvm.org/D5897 llvm-svn: 220423	2014-10-22 20:39:07 +00:00
Johannes Doerfert	a05214fb5d	[Refactor][NfC] ReportLevel should be used as a bool not an int llvm-svn: 219864	2014-10-15 23:24:28 +00:00
Johannes Doerfert	a99130f042	[Refactor][NfC] Simplify and clean the handling of (new) access relations This patch does not change the semantic on it's own. However, the dependence analysis as well as dce will now use the newest available access relation for each memory access, thus if at some point the json importer or any other pass will run before those two and set a new access relation the behaviour will be different. In general it is unclear if the dependence analysis and dce should be run on the old or new access functions anyway. If we need to access the original access function from the outside later, we can expose the getter again. Differential Revision: http://reviews.llvm.org/D5707 llvm-svn: 219612	2014-10-13 12:58:03 +00:00
Johannes Doerfert	341a15a64b	Use the new access function (if present) to compute the access stride. Differential Revision: http://reviews.llvm.org/D5661 llvm-svn: 219499	2014-10-10 14:28:46 +00:00
Johannes Doerfert	4c7ce479ea	[Refactor] Rename MemoryAccess::Type as it clashes with llvm::Type llvm-svn: 219294	2014-10-08 10:11:33 +00:00
Johannes Doerfert	74f6869435	[Fix] Ignore forwarding alias sets in the alias set tracker. llvm-svn: 219275	2014-10-08 02:23:48 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the pieceweise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two thing: 1) Handle the case the condition of a select is not a i1 type like C. 2) Try to simplify the pieceweise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constrains. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	f1ee2622be	[Fix] Dead statements should not confuse the RTC generation This fixes http://llvm.org/bugs/show_bug.cgi?id=21166 . Differential Revision: http://reviews.llvm.org/D5623 llvm-svn: 219131	2014-10-06 17:43:00 +00:00
Benjamin Kramer	ae81abfa38	Try to unbreak the polly build. llvm-svn: 219080	2014-10-05 11:58:57 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class allows to store information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Johannes Doerfert	eeab05a084	[RTC] Use the domain to split alias groups. We use a parametric abstraction of the domain to split alias groups if accesses cannot be executed under the same parameter evaluation. The two test cases check that we can remove alias groups if the pointers which might alias are never accessed under the same parameter evaluation and that the minimal/maximal accesses are not global but with regards to the parameter evaluation. Differential Revision: http://reviews.llvm.org/D5436 llvm-svn: 218758	2014-10-01 12:42:37 +00:00
Johannes Doerfert	13771738d3	[RTC] Split alias groups according to read only base addresses If there are multiple read only base addresses in an alias group we can split it into multiple alias groups each with only one read only access. This way we might reduce the number of comparisons significantly as it grows linear in the number of alias groups but exponential in their size. Differential Revision: http://reviews.llvm.org/D5435 llvm-svn: 218757	2014-10-01 12:40:46 +00:00
Johannes Doerfert	928229fda6	Disable runtime alias checks when we ignore aliasing. This is just a optimization to save the compile time and execution time for runtime alias checks if the user guarantees no aliasing all together. llvm-svn: 218613	2014-09-29 17:06:29 +00:00
Johannes Doerfert	9143d67aba	[RTC] Bail if too many parameters are involved in a RTC access. If too many parameters are involved in accesses used to create RTCs we might end up with enormous compile times and RTC expressions. The reason is that the lexmin/lexmax is dependent on all these parameters and isl might need to create a case for every "ordering" of them (e.g., p0 <= p1 <= p2, p1 <= p0 <= p2, ...). The exact number of parameters allowed in accesses is defined by the command line option -polly-rtc-max-parameters=XXX and set by default to 8. Differential Revision: http://reviews.llvm.org/D5500 llvm-svn: 218566	2014-09-27 11:02:39 +00:00
Tobias Grosser	1eedb67fa6	We do not support alias checks for base pointers defined inside the SCoP The run-time alias check places code that involves the base pointer at the beginning of the SCoP. This breaks if the base pointer is defined inside the SCoP. Hence, we can only create a run-time alias check if we are sure the base pointer is not an instruction defined inside the scop. If it is we refuse to handle the SCoP. This commit should unbreak most of our current LNT failures. Differential Revision: http://reviews.llvm.org/D5483 llvm-svn: 218412	2014-09-24 21:04:29 +00:00
Tobias Grosser	3ee7cdab53	Report possible aliasing deterministically This commit drops a call to std::sort, which sorted the base pointers that possibly alias according to the address at which their corresponding llvm::Value was allocated. There does not seem to be any good reason, why those pointers should be (re)sorted and this only makes the output indeterministic. llvm-svn: 218052	2014-09-18 14:45:43 +00:00
Johannes Doerfert	b164c795b7	[RTC] Runtime Alias Checks for the ISL backend This change will build all alias groups (minimal/maximal accesses to possible aliasing base pointers) we have to check before we can assume an alias free environment. It will also use these to create Runtime Alias Checks (RTC) in the ISL code generation backend, thus allow us to optimize SCoPs despite possibly aliasing pointers when this backend is used. This feature will be enabled for the isl code generator, e.g., --polly-code-generator=isl, but disabled for: - The cloog code generator (still the default). - The case delinearization is enabled. - The case non-affine accesses are allowed. llvm-svn: 218046	2014-09-18 11:17:17 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IstAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Tobias Grosser	230acc4445	Delinearize _all_ accesses to a multi-dimensional array Even though we previously correctly detected the multi-dimensional access pattern for accesses with a certain base address, we only delinearized non-affine accesses to this address. Affine accesses have not been touched and remained as single dimensional accesses. The result was an inconsistent description of accesses to the same array, with some being one dimensional and some being multi-dimensional. This patch ensures that all accesses are delinearized with the same dimensionality as soon as a single one of them has been detected as non-affine. While writing this patch, it became evident that the options -polly-allow-nonaffine and -polly-detect-keep-going have not been properly supported in case delinearization has been turned on. This patch adds relevant test coverage and addresses these issues as well. We also added some more documentation to the functions that are modified in this patch. This fixes llvm.org/PR20123 Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217728	2014-09-13 14:47:55 +00:00
Tobias Grosser	bcd4efffa7	Check that the elements of an array have the same size At the moment we assume that only elements of identical size are stored/loaded to a certain base pointer. This patch adds logic to the scop detection to verify this. Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217727	2014-09-13 14:47:40 +00:00
Johannes Doerfert	8f7124ceb9	[Polly] Move MAY/MUST write property to IRAccess Differential Revision: http://reviews.llvm.org/D5223 llvm-svn: 217665	2014-09-12 11:00:49 +00:00
Tobias Grosser	31f3da3d2a	No need to check for non-existing std::map elements It seems we added guards to check for non-existing std::map elements to make sure they are default constructed before first accessed. Besides, the code being wrong because of checking Context.NonAffineAccesses[BasePointer].size() instead of Context.cound(BasePointer), such a check is also not necessary as std::map takes care of this already. From the std::map documentation: "If k does not match the key of any element in the container, the function inserts a new element with that key and returns a reference to its mapped value. Notice that this always increases the container size by one, even if no mapped value is assigned to the element (the element is constructed using its default constructor)." llvm-svn: 217506	2014-09-10 14:38:12 +00:00
Johannes Doerfert	9e7b17b0d4	Added arcanist linters and cleaned errors and warnings Arcanist (arc) will now always run linters before uploading any new commit to Phabricator. All errors/warnings (or their absence) will be shown in the web interface together with a explanation by the commiter (arcanist will ask the commiter if the build was not clean). The linters include: - clang-format - spelling check - permissions check (aka. chmod) - filename check - merge conflict marker check Note, that their scope is sometimes limited (see .arclint for details). This commit also fixes all errors and warnings these linters reported, namely: - spelling mistakes and typos - executable permissions for various text files Differential Revision: http://reviews.llvm.org/D4916 llvm-svn: 215871	2014-08-18 00:40:13 +00:00
Andreas Simbuerger	6bf77979e0	Diagnostic: Provide end-user message for non-affine loop bound errors llvm-svn: 215832	2014-08-17 10:09:15 +00:00
Andreas Simbuerger	d46b935267	Diagnostic: Provide end-user message for non-affine access function errors llvm-svn: 215831	2014-08-17 10:09:11 +00:00
Andreas Simbuerger	f29f625748	Diagnostic: Provide end-user message for aliasing errors This will spill out information about LLVM-internals. However, in cases where the name of the Value matches the name of the array in the source, we provide more useful information. In cases where we spill internals, the information still might help the user to pin down the correct arrays. The problem we face here is: The error is pinned to the debug location of one of the offending values out of the alias set instead of all of them. The more information we give the user about the set of aliasing pointers the better. llvm-svn: 215830	2014-08-17 10:09:07 +00:00
Andreas Simbuerger	3efe40b8d3	Diagnostic: Enable error tracking by default llvm-svn: 215829	2014-08-17 10:09:03 +00:00
Tobias Grosser	f4daf34496	Revert "Added support for modulo expressions" This reverts commit 215684. The intention of the commit is great, but unfortunately it seems to be the cause of 14 LNT test suite failures: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly/builds/116 To make our buildbots and performance testers green until this issue is solved, we temporarily revert this commit. llvm-svn: 215816	2014-08-16 09:08:55 +00:00

... 3 4 5 6 7 ...

770 Commits