llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	e6e3c9246a	Check late for profitability Before this patch we only expanded valid __and__ profitable region. Therefor we did not allow the expansion to create a profitable region from a non-profitable one. With this patch we will remember and expand all valid regions and check for profitability only at the end. This patch increases the number of valid SCoPs in the LLVM-TS and SPEC 2000/2006 by 28% (from 303 to 390), including the hot loop in hmmer. llvm-svn: 269343	2016-05-12 20:21:50 +00:00
Johannes Doerfert	6c7639b380	Cleanup rejection log handling [NFC] This patch cleans up the rejection log handling during the ScopDetection. It consists of two interconnected parts: - We keep all detection contexts for a function in order to provide more information to the user, e.g., about the rejection of extended/intermediate regions. - We remove the mutable "RejectLogs" member as the information is available through the detection contexts. llvm-svn: 269323	2016-05-12 18:50:01 +00:00
Johannes Doerfert	5c2b556b13	Bring some comments up to date [NFC] llvm-svn: 269301	2016-05-12 15:15:50 +00:00
Johannes Doerfert	6f1bb7a9d9	Support truncate operations Truncate operations are basically modulo operations, thus we can model them that way. However, for large types we assume the operand to fit in the new type size instead of introducing a modulo with a very large constant. llvm-svn: 269300	2016-05-12 15:13:49 +00:00
Johannes Doerfert	404a0f81ea	Check overflows in RTCs and bail accordingly We utilize assumptions on the input to model IR in polyhedral world. To verify these assumptions we version the code and guard it with a runtime-check (RTC). However, since the RTCs are themselves generated from the polyhedral representation we generate them under the same assumptions that they should verify. In other words, the guarantees that we try to provide with the RTCs do not hold for the RTCs themselves. To this end it is necessary to employ a different check for the RTCs that will verify the assumptions did hold for them too. Differential Revision: http://reviews.llvm.org/D20165 llvm-svn: 269299	2016-05-12 15:12:43 +00:00
Johannes Doerfert	27d12d3d1f	Invalidate unprofitable SCoPs after creation If a profitable run is performed we will check if the SCoP seems to be profitable after creation but before e.g., dependence are computed. This is needed as SCoP detection only approximates the actual SCoP representation. In the end this should allow us to be less conservative during the SCoP detection while keeping the compile time in check. llvm-svn: 269074	2016-05-10 16:38:09 +00:00
Johannes Doerfert	bf9473b2d8	Weaken profitability constraints during ScopDetection Regions with one affine loop can be profitable if the loop is distributable. To this end we will allow them to be treated as profitable if they contain at least two non-trivial basic blocks. llvm-svn: 269064	2016-05-10 14:42:30 +00:00
Johannes Doerfert	ede4ecaefb	[FIX] Cleanup isl objects prior to early exit llvm-svn: 269061	2016-05-10 14:01:21 +00:00
Johannes Doerfert	2b92a0e4ee	Handle llvm.assume inside the SCoP The assumption attached to an llvm.assume in the SCoP needs to be combined with the domain of the surrounding statement but can nevertheless be used to refine the context. This fixes the problems mentioned in PR27067. llvm-svn: 269060	2016-05-10 14:00:57 +00:00
Johannes Doerfert	297c720d15	Propagate complexity problems during domain generation [NFC] This patches makes the propagation of complexity problems during domain generation consistent. Additionally, it makes it less likely to encounter ill-formed domains later, e.g., during schedule generation. llvm-svn: 269055	2016-05-10 13:06:42 +00:00
Johannes Doerfert	14b1cf35b5	[FIX] Create error-restrictions late Before this patch we generated error-restrictions only for error-blocks, thus blocks (or regions) containing a not represented function call. However, the same reasoning is needed if the invalid domain of a statement subsumes its actual domain. To this end we move the generation of error-restrictions after the propagation of the invalid domains. Consequently, error-statements are now defined more general as statements that are assumed to be not executed. Additionally, we do not record an empty domain for such statements but a nullptr instead. This allows to distinguish between error-statements and dead-statements. llvm-svn: 269053	2016-05-10 12:42:26 +00:00
Johannes Doerfert	2640454d1c	Refactor simplifySCoP [NFC] Remove obsolete code and decrease the indention in the Scop::simplifySCoP() function. llvm-svn: 269049	2016-05-10 12:19:47 +00:00
Johannes Doerfert	a60ad845c0	Simplify the internal representation according to the context [NFC] We now use context information to simplify the domains and access functions of the SCoP instead of just aligning them with the parameter space. llvm-svn: 269048	2016-05-10 12:18:22 +00:00
Johannes Doerfert	e243753a4d	Simplify access relation for invariant loads early [NFC] llvm-svn: 269046	2016-05-10 11:59:59 +00:00
Johannes Doerfert	5f173d414e	Prevent complex access ranges with low number of pieces. Previously we checked the number of pieces to decide whether or not a invariant load was to complex to be generated. However, there are cases when e.g., divisions cause the complexity to spike regardless of the number of pieces. To this end we now check the number of totally involved dimensions which will increase with the number of pieces but also the number of divisions. llvm-svn: 269045	2016-05-10 11:46:57 +00:00
Johannes Doerfert	56b377644a	Expose interpretAsUnsigned in the SCEVAffinator [NFC] This exposes the functionality to interpret a SCEV, or better the piece-wise function created from the SCEV, as an unsigned value instead of a signed one. llvm-svn: 269044	2016-05-10 11:45:46 +00:00
Tobias Grosser	1022ca5646	Codegen: Enable the detection of min/max expressions Min/max expressions are easier to read and can in some cases also result in more concise IR that is generated as the min/max --- when lowered to a cmp+select pattern -- commonly has a simpler condition then the ternary condition isl would normally generate. llvm-svn: 268855	2016-05-07 08:03:44 +00:00
Tobias Grosser	7ec06a86c1	test: Use CHECK-NEXT to not miss instructions in test output llvm-svn: 268854	2016-05-07 08:03:32 +00:00
Tobias Grosser	6b49f17764	Update isl to isl-0.17-5-g57dc5ff This update fixes an assertion in the isl scheduler. llvm-svn: 268853	2016-05-07 07:41:25 +00:00
Michael Kruse	e0b34f366f	Update to ISL 0.17. This release includes sevaral improvments compared to the previous version isl-0.16.1-145-g243bf7c (from the ISL 0.17 announcement): - optionally combine SCCs incrementally in scheduler - optionally maximize coincidence in scheduler - optionally avoid loop coalescing in scheduler - minor AST generator improvements - improve support for expansions in schedule trees llvm-svn: 268500	2016-05-04 14:41:36 +00:00
Michael Kruse	f7a4a94d05	Typo: ToComplex -> TooComplex. NFC. llvm-svn: 268224	2016-05-02 12:25:36 +00:00
Michael Kruse	bc150127ae	Rename Conjuncts -> Disjunctions. NFC. The check for complexity compares the number of polyhedra in a set, which are combined by disjunctions (union, "OR"), not conjunctions (intersection, "AND"). llvm-svn: 268223	2016-05-02 12:25:18 +00:00
Michael Kruse	315aa3278e	[ScheduleOptimizer] Add -polly-opt-outer-coincidence option. Add a command line switch to set the isl_options_set_schedule_outer_coincidence option. ISL then tries to build schedules where the outer member of a band satisfies the coincidence constraints. In practice this allows loop skewing for more parallelism in inner loops. llvm-svn: 268222	2016-05-02 11:35:27 +00:00
Johannes Doerfert	90f5fed10b	[WWW] Mark task as done and me as owner of some task llvm-svn: 268221	2016-05-02 11:21:30 +00:00
Michael Kruse	2d3ff2a5ba	Typo: isToComplex -> isTooComplex. NFC. llvm-svn: 268220	2016-05-02 10:44:20 +00:00
Tobias Grosser	c4a80160b0	doc: A source code with Polly does not use a separate module (by default) llvm-svn: 268034	2016-04-29 12:35:46 +00:00
Johannes Doerfert	172dd8b923	Allow unsigned divisions After zero-extend operations and unsigned comparisons we now allow unsigned divisions. The handling is basically the same as for signed division, except the interpretation of the operands. As the divisor has to be constant in both cases we can simply interpret it as an unsigned value without additional complexity in the representation. For the dividend we could choose from the different representation schemes introduced for zero-extend operations but for now we will simply use an assumption. llvm-svn: 268032	2016-04-29 11:53:35 +00:00
Johannes Doerfert	ba9725ff41	Refactor SCEVAffinator [NFC] llvm-svn: 268031	2016-04-29 11:52:30 +00:00
Tobias Grosser	2937b59393	ScopInfo: Add option to control abort on isl errors For debugging it is often convenient to not abort at the very first memory management error. This option allows to control this behavior at run-time. llvm-svn: 268030	2016-04-29 11:43:20 +00:00
Johannes Doerfert	99ec00a2bb	[FIX] Typo llvm-svn: 268027	2016-04-29 10:47:07 +00:00
Johannes Doerfert	64c69f79fb	[FIX] Prevent division/modulo by zero in parameters -- test case This commits a test case for r268023. llvm-svn: 268026	2016-04-29 10:45:39 +00:00
Johannes Doerfert	3e48ee2ab9	[FIX] Unsigned comparisons change invalid domain It does not suffice to take a global assumptions for unsigned comparisons but we also need to adjust the invalid domain of the statements guarded by such an assumption. To this end we allow to specialize the getPwAff call now in order to indicate unsigned interpretation. llvm-svn: 268025	2016-04-29 10:44:41 +00:00
Johannes Doerfert	bfaa63a82e	[FIX] Prevent division/modulo by zero in parameters When we materialize parameter SCEVs we did so without considering the side effects they might have, e.g., both division and modulo are undefined if the right hand side is zero. This is a problem because we potentially extended the domain under which we evaluate parameters, thus we might have introduced such undefined behaviour. To prevent that from happening we will now guard divisions and modulo operations in the parameters with a compare and select. llvm-svn: 268023	2016-04-29 10:36:58 +00:00
Johannes Doerfert	8475d1c163	[FIX] Correct assumption simplification Assumptions and restrictions can both be simplified with the domain of a statement but not the same way. After this patch we will correctly distinguish them. llvm-svn: 267885	2016-04-28 14:32:58 +00:00
Tobias Grosser	947dbe3aae	test: Make test case independent of earlier instructions Instead of matching for %6, we use a regexp to match for the result strings. This test case caused unrelated noise in http://reviews.llvm.org/D15722. llvm-svn: 267875	2016-04-28 12:36:39 +00:00
Tobias Grosser	2e27a0f5fd	BlockGenerator: Drop leftover debug statement llvm-svn: 267874	2016-04-28 12:31:05 +00:00
Johannes Doerfert	8ab2803b63	[FIX] Propagate execution domain of invariant loads If the base pointer of an invariant load is is loaded conditionally, that condition needs to hold for the invariant load too. The structure of the program will imply this for domain constraints but not for imprecisions in the modeling. To this end we will propagate the execution context of base pointers during code generation and thus ensure the derived pointer does not access an invalid base pointer. llvm-svn: 267707	2016-04-27 12:49:11 +00:00
Johannes Doerfert	792374b941	Allow unsigned comparisons With this patch we will optimistically assume that the result of an unsigned comparison is the same as the result of the same comparison interpreted as signed. llvm-svn: 267559	2016-04-26 14:33:12 +00:00
Johannes Doerfert	323ab3975b	[FIX] Adjust assumption space for zext instructions llvm-svn: 267552	2016-04-26 12:44:01 +00:00
Johannes Doerfert	b2885799d1	Do not use the number of parameters in the complexity check llvm-svn: 267532	2016-04-26 09:20:41 +00:00
Johannes Doerfert	625bb1fc10	Do not add but record signed-unsigned assumptions llvm-svn: 267528	2016-04-26 09:16:36 +00:00
Johannes Doerfert	9cc8340fea	Extract some constant factors from "SCEVAddExprs" Additive expressions can have constant factors too that we can extract and thereby simplify the internal representation. For now we do compute the gcd of all constant factors but only extract the same (possibly negated) factor if there is one. llvm-svn: 267445	2016-04-25 19:09:10 +00:00
Johannes Doerfert	d5c369f460	Do not check all GEPs for assumptions Before, we checked all GEPs in a statement in order to derive out-of-bound assumptions. However, this can not only introduce new parameters but it is also not clear what we can learn from GEPs that are not immediately used in a memory accesses inside the SCoP. As this case is very rare, no actual change in the behaviour is expected. llvm-svn: 267442	2016-04-25 18:55:15 +00:00
Johannes Doerfert	c78ce7dc21	Only add user assumptions on known parameters [NFC] Before, assumptions derived from llvm.assume could reference new parameters that were not known to the SCoP before. These were neither beneficial to the representation nor to the user that reads the emitted remark. Now we project them out and keep only user assumptions on known parameters. Nevertheless, the new parameters are still part of the SCoPs parameter space as the SCEVAffinator currently adds them on demand. llvm-svn: 267441	2016-04-25 18:51:27 +00:00
Johannes Doerfert	4e3bb7b98c	Refactor Scop parameter handling The new handling is consistent with the remaining code, e.g., we do not create a new parameter id for each lookup call but copy an existing one. Additionally, we now use the implicit order defined by the Parameters set instead of an explicit one defined in a map. llvm-svn: 267423	2016-04-25 16:15:13 +00:00
Johannes Doerfert	c3596284c3	Model zext-extend instructions A zero-extended value can be interpreted as a piecewise defined signed value. If the value was non-negative it stays the same, otherwise it is the sum of the original value and 2^n where n is the bit-width of the original (or operand) type. Examples: zext i8 127 to i32 -> { [127] } zext i8 -1 to i32 -> { [256 + (-1)] } = { [255] } zext i8 %v to i32 -> [v] -> { [v] \| v >= 0; [256 + v] \| v < 0 } However, LLVM/Scalar Evolution uses zero-extend (potentially lead by a truncate) to represent some forms of modulo computation. The left-hand side of the condition in the code below would result in the SCEV "zext i1 <false, +, true>for.body" which is just another description of the C expression "i & 1 != 0" or, equivalently, "i % 2 != 0". for (i = 0; i < N; i++) if (i & 1 != 0 /* == i % 2 /) / do something / If we do not make the modulo explicit but only use the mechanism described above we will get the very restrictive assumption "N < 3", because for all values of N >= 3 the SCEVAddRecExpr operand of the zero-extend would wrap. Alternatively, we can make the modulo in the operand explicit in the resulting piecewise function and thereby avoid the assumption on N. For the example this would result in the following piecewise affine function: { [i0] -> [(1)] : 2floor((-1 + i0)/2) = -1 + i0; [i0] -> [(0)] : 2*floor((i0)/2) = i0 } To this end we can first determine if the (immediate) operand of the zero-extend can wrap and, in case it might, we will use explicit modulo semantic to compute the result instead of emitting non-wrapping assumptions. Note that operands with large bit-widths are less likely to be negative because it would result in a very large access offset or loop bound after the zero-extend. To this end one can optimistically assume the operand to be positive and avoid the piecewise definition if the bit-width is bigger than some threshold (here MaxZextSmallBitWidth). We choose to go with a hybrid solution of all modeling techniques described above. For small bit-widths (up to MaxZextSmallBitWidth) we will model the wrapping explicitly and use a piecewise defined function. However, if the bit-width is bigger than MaxZextSmallBitWidth we will employ overflow assumptions and assume the "former negative" piece will not exist. llvm-svn: 267408	2016-04-25 14:01:36 +00:00
Johannes Doerfert	517d8d2f94	Check only loop control of loops that are part of the region This also removes a duplicated line of code in the region generator that caused a SPEC benchmark to fail with the new SCoPs. llvm-svn: 267404	2016-04-25 13:37:24 +00:00
Johannes Doerfert	a4dd8ef40f	Initialize the invalid domain of an access with an empty set llvm-svn: 267403	2016-04-25 13:36:23 +00:00
Johannes Doerfert	e4459a24cc	Do not propagate invalid domains over back edges llvm-svn: 267402	2016-04-25 13:34:50 +00:00
Johannes Doerfert	f560b3d2db	Introduce a parameter set type [NFC] llvm-svn: 267401	2016-04-25 13:33:07 +00:00

1 2 3 4 5 ...

2383 Commits