llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	50165ffdee	Add support for srem instruction Remainder operations with constant divisor can be modeled as quasi-affine expression. This patch adds support for detecting and modeling them. We also add a test that ensures they are correctly code generated. This patch was extracted from a larger patch contributed by Johannes Doerfert in http://reviews.llvm.org/D5293 llvm-svn: 240518	2015-06-24 04:13:29 +00:00
Tobias Grosser	a608569856	Replace srem by function call in nonaffine test cases This makes the test cases nonaffine even if Polly some days gains support for the srem instruction, an instruction which is currently not modeled but which can clearly be modeled statically. A call to a function without definition will always remain non-affine, as there is just insufficient static information for it to be modeled more precisely. llvm-svn: 240458	2015-06-23 20:55:05 +00:00
Tobias Grosser	aa9f575ae1	Adjust to personality function change in 239940 llvm-svn: 239992	2015-06-18 05:02:11 +00:00
Tobias Grosser	8199c722c7	Disable output for test case that does not need output llvm-svn: 239060	2015-06-04 17:59:51 +00:00
Tobias Grosser	22adfb4373	Mark sdivs as 'exact' instead of lowering them ourselves LLVM's instcombine already translates power-of-two sdivs that are known to be exact to fast ashr instructions. Hence, there is no need to add this logic ourselves. Pointed-out-by: Johannes Doerfert llvm-svn: 239025	2015-06-04 07:45:09 +00:00
Tobias Grosser	5cf7860704	Ensure memory access mappings are defined for full domain We now verify that memory access functions imported via JSON are indeed defined for the full iteration domain. Before this change we accidentally imported memory mappings such as i -> i / 127, which only defined a mapped for values of i that are evenly divisible by 127, but which did not define any mapping for the remaining values, with the result that isl just generated an access expression that had undefined behavior for all the unmapped values. In the incorrect test cases, we now either use floor(i/127) or we use p/127 and provide the information that p is indeed a multiple of 127. llvm-svn: 239024	2015-06-04 07:44:35 +00:00
Tobias Grosser	244c8297cf	Lower signed-divisions without rounding to ashr instructions llvm-svn: 238929	2015-06-03 15:14:58 +00:00
Tobias Grosser	cb73f150d4	Translate power-of-two floor-division into ashr Power-of-two floor divisions can be translated into an arithmetic shift operation. This allows us to replace a complex lowering that requires division operations: %pexp.fdiv_q.0 = sub i64 %21, 128 %pexp.fdiv_q.1 = add i64 %pexp.fdiv_q.0, 1 %pexp.fdiv_q.2 = icmp slt i64 %21, 0 %pexp.fdiv_q.3 = select i1 %pexp.fdiv_q.2, i64 %pexp.fdiv_q.1, i64 %21 %pexp.fdiv_q.4 = sdiv i64 %pexp.fdiv_q.3, 128 with a simple ashr: %polly.fdiv_q.shr = ashr i64 %21, 7 llvm-svn: 238905	2015-06-03 06:31:30 +00:00
Tobias Grosser	cdb38e5625	Exploit non-negative numerators isl marks known non-negative numerators in modulo (and soon also division) operations. We now exploit this by generating unsigned operations. This is beneficial as unsigned operations with power-of-two denominators will be translated by isl to fast bitshift or bitwise and operations. llvm-svn: 238577	2015-05-29 17:08:19 +00:00
Tobias Grosser	c825fae020	Tighten the PHI modeling test cases While looking through the test cases I realized we did not have a CHECK line for a duplicate memory access which we may want to eliminate later. To ensure we do not have (or later introduce) unnecessary memory accesses, we now tighten the test cases to look for such a pattern (and add the CHECK: line that shows the redundant memory access). llvm-svn: 238227	2015-05-26 18:05:45 +00:00
Tobias Grosser	268205939f	Make use of scalar/phi code generation explicit in the tests This ensures we pass all tests independently of how we set the options -disable-polly-intra-scop-scalar-to-array and -polly-model-phi-nodes. (At least if we enable both or disable both. Enabling them individually makes little sense, as they will hopefully disappear soon anyhow). llvm-svn: 238087	2015-05-23 03:34:35 +00:00
Johannes Doerfert	ecff11dcfb	Add scalar and phi code generation To reduce compile time and to allow more and better quality SCoPs in the long run we introduced scalar dependences and PHI-modeling. This patch will now allow us to generate code if one or both of those options are set. While the principle of demoting scalars as well as PHIs to memory in order to communicate their value stays the same, this allows to delay the demotion till the very end (the actual code generation). Consequently: - We __almost__ do not modify the code if we do not generate code for an optimized SCoP in the end. Thus, the early exit as well as the unprofitable option will now actually preven us from introducing regressions in case we will probably not get better code. - Polly can be used as a "pure" analyzer tool as long as the code generator is set to none. - The original SCoP is almost not touched when the optimized version is placed next to it. Runtime regressions if the runtime checks chooses the original are not to be expected and later optimizations do not need to revert the demotion for that part. - We will generate direct accesses to the demoted values, thus there are no "trivial GEPs" that select the first element of a scalar we demoted and treated as an array. Differential Revision: http://reviews.llvm.org/D7513 llvm-svn: 238070	2015-05-22 23:43:58 +00:00
Tobias Grosser	5db5d2da13	Use base-pointer address space when creating new access functions llvm-svn: 237785	2015-05-20 11:02:12 +00:00
Tobias Grosser	49ad36ca16	Add printing and testing to ScopArrayInfo Being here, we extend the interface to return the element type and not a pointer to the element type. We also provide a function to get the size (in bytes) of the elements stored in this array. We currently still store the element size as an innermost dimension in ScopArrayInfo, which is somehow inconsistent and should be addressed in future patches. llvm-svn: 237779	2015-05-20 08:05:31 +00:00
Sunil Srivastava	19be68f088	Changed renaming of local symbols by inserting a dot before the numeric suffix. Modified two test cases to adjust to the above change in renaming. These two files were causing the buildbot failure in Polly, #30204 for example. Details in http://reviews.llvm.org/D9483 This checkin goes with r237150 and r237151 llvm-svn: 237203	2015-05-12 22:44:24 +00:00
Tobias Grosser	09d3069740	Rename IslCodeGeneration to CodeGeneration Besides class, function and file names, we also change the command line option from -polly-codegen-isl to just -polly-codegen. The isl postfix is a leftover from the times when we still had the CLooG based -polly-codegen. Today it is just redundant and we drop it. llvm-svn: 237099	2015-05-12 07:45:52 +00:00
Tobias Grosser	3e6070ef03	Update isl to c3892bebc0 Various smaller improvements and bugfixes. llvm-svn: 236932	2015-05-09 09:37:30 +00:00
Johannes Doerfert	8983031b5e	[FIX] Invalid recognition of multidimensional access In the lnt benchmark MultiSource/Benchmarks/MallocBench/gs/gs with scalar and PHI modeling we detected the multidimensional accesses with sizes variant in the SCoP. This will check the sizes for validity. llvm-svn: 236395	2015-05-03 16:03:01 +00:00
Duncan P. N. Exon Smith	ddf3a0ef38	Update polly for LLVM rename of debug info metadata with DI* prefix Ran the same rename-md-di-prefix.sh script attached to PR23080 as in LLVM r236120 and CFE r236121. llvm-svn: 236127	2015-04-29 17:02:14 +00:00
Tobias Grosser	6325cd2fcd	Remove flag '-polly-annotate-alias-scopes' This option is enabled since a long time and there does not seem to be a situation in which we would not want to print alias scopes. Remove this option to reduce the set of command-line option combinations that may expose bugs. llvm-svn: 235861	2015-04-27 10:43:10 +00:00
Johannes Doerfert	8f8af43fef	Use all available range information for parameters In the following even full-range information will help to avoid runtime checks for wrapping integers, hence we enable it now. llvm-svn: 235823	2015-04-26 20:07:21 +00:00
Johannes Doerfert	d5d8f67dc5	Use the original no-wrap flags for normalized AddRecs llvm-svn: 235822	2015-04-26 19:55:21 +00:00
Tobias Grosser	173ecab705	Remove target triples from test cases I just learned that target triples prevent test cases to be run on other architectures. Polly test cases are until now sufficiently target independent to not require any target triples. Hence, we drop them. llvm-svn: 235384	2015-04-21 14:28:02 +00:00
Tobias Grosser	5483931117	Rename 'scattering' to 'schedule' In Polly we used both the term 'scattering' and the term 'schedule' to describe the execution order of a statement without actually distinguishing between them. We now uniformly use the term 'schedule' for the execution order. This corresponds to the terminology of isl. History: CLooG introduced the term scattering as the generated code can be used as a sequential execution order (schedule) or as a parallel dimension enumerating different threads of execution (placement). In Polly and/or isl the term placement was never used, but we uniformly refer to an execution order as a schedule and only later introduce parallelism. When doing so we do not talk about about specific placement dimensions. llvm-svn: 235380	2015-04-21 11:37:25 +00:00
Tobias Grosser	094999bb55	Drop unneccessary -basicaa passes in DependenceInfo test cases llvm-svn: 235374	2015-04-21 09:17:52 +00:00
David Blaikie	556ffb7806	[opaque pointer types] Explicit non-pointer type for call expressions (migration for recent LLVM change to textual IR for calls) llvm-svn: 235146	2015-04-16 23:24:52 +00:00
Johannes Doerfert	f8206cf6d4	Allow loops in non-affine subregions -- SCoP Modeling This will allow the ScopInfo to build the polyhedral representation for non-affine regions that contain loops. Such loops are basically not visible in the SCoP representation. Accesses that are variant in such loops are therefor represented as non-affine accesses. Differential Revision: http://reviews.llvm.org/D8153 llvm-svn: 234713	2015-04-12 22:58:40 +00:00
Johannes Doerfert	c3e91b4d51	[FIX] Change old diagnostic output llvm-svn: 234712	2015-04-12 22:53:33 +00:00
Johannes Doerfert	f3e98f44e3	Allow loops in non-affine subregions -- SCoP Detection This will allow the ScopDetection to detect non-affine regions that contain loops. All loops contained will be collected and are accessible to later passes in order to adjust the access functions. As the loops are non-affine and will not be part of the polyhedral representation later, all accesses that are variant in these loops have to be over approximated as non-affine accesses. They are therefore handled the same way as other non-affine accesses. Additionally, we do not count non-affine loops for the profitability heuristic, thus a region with only a non-affine loop will only be detected if the general detection of loop free regions is enabled. Differential Revision: http://reviews.llvm.org/D8152 llvm-svn: 234711	2015-04-12 22:52:20 +00:00
Duncan P. N. Exon Smith	7431fb0257	Upgrade testcases after LLVM r234181 Until r234181 we were silently upgrading old `@llvm.dbg` intrinsics. Fix testcases in polly that were relying on that. llvm-svn: 234192	2015-04-06 18:25:51 +00:00
Tobias Grosser	02cf69a6ed	Make -polly-no-tiling work again llvm-svn: 234125	2015-04-05 21:52:21 +00:00
Tobias Grosser	eb18649ead	Sign-extend in case of non-matching bitwidth This change ensures that we sign-extend integer types in case non-matching operands are encountered when generating a multi-dimensional access offset. This fixes http://llvm.org/PR23124 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234122	2015-04-05 17:36:42 +00:00
Tobias Grosser	2a586c387b	Do not assume all multi-parameter products are affine As soon as one operand of the product is invalid, the entire product is invalid. This happens for example if one of the operands is not loop-invariant. This fixes http://llvm.org/PR23125 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com llvm-svn: 234119	2015-04-05 14:57:50 +00:00
Tobias Grosser	7527e3f59c	Do not use the POLLY vector code generator if only strip-mining is requested This fixes http://llvm.org/PR23127 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234113	2015-04-05 06:53:21 +00:00
Tobias Grosser	fe4bb1c81b	[tests] Use -polly-vectorizer=polly directly instead of defining a lit variable %vector-opt. llvm-svn: 234112	2015-04-05 06:53:11 +00:00
Tobias Grosser	4f6bceface	Do not scale tile loops We now generate tile loops as: for (int c1 = 0; c1 <= 47; c1 += 1) for (int c2 = 0; c2 <= 47; c2 += 1) for (int c3 = 0; c3 <= 31; c3 += 1) for (int c4 = 0; c4 <= 31; c4 += 4) #pragma simd for (int c5 = c4; c5 <= c4 + 3; c5 += 1) Stmt_for_body3(32 * c1 + c3, 32 * c2 + c5); instead of for (int c1 = 0; c1 <= 1535; c1 += 32) for (int c2 = 0; c2 <= 1535; c2 += 32) for (int c3 = 0; c3 <= 31; c3 += 1) for (int c4 = 0; c4 <= 31; c4 += 4) #pragma simd for (int c5 = c4; c5 <= c4 + 3; c5 += 1) Stmt_for_body3(c1 + c3, c2 + c5); Run-time performance-wise this makes little difference, but this gives a large reduction in compile time (10-30% on 17 LNT benchmarks). Apparently the isl AST generator is not yet very efficient in generating the latter. llvm-svn: 233675	2015-03-31 07:52:36 +00:00
Tobias Grosser	d654eeb862	Drop some CLooG leftovers llvm-svn: 233572	2015-03-30 17:56:50 +00:00
Tobias Grosser	619190d5a7	Delinearization of expressions that contain array size parameters This allows us to delinerize code such as: A[][n] for (i for (j A[i][n-j-1] = ... which would previously have been delinearize to an access A[i+1][-j-1]. To recover the correct access we apply the piecewise expression: { A[i][j] -> A[i-1][i+N]: i < 0; A[i][j] -> A[i][i]: i >= 0} This approach generalizes to higher dimensions. llvm-svn: 233566	2015-03-30 17:22:28 +00:00
Tobias Grosser	aa660a9957	Gist-simplify access relations in the context of domain constraints This simplifies already one test case and is needed for upcoming improvements to our delinearization. llvm-svn: 233507	2015-03-30 00:07:50 +00:00
Johannes Doerfert	be40996cfe	Strip constant factors from SCoP parameters This will strip the constant factor of a parameter befor we add it to the SCoP. As a result the access functions are simplified, e.g., for the attached test case. llvm-svn: 233501	2015-03-29 20:45:09 +00:00
Tobias Grosser	715007216c	Bail out if too many alias run-time-check comparisions would be needed This fixes a crash observed in ffmpeg. llvm-svn: 233480	2015-03-28 15:11:14 +00:00
Tobias Grosser	6794238c70	Code generate parameters and run-time checks after branching new code region When creating parameters the SCEVexpander may introduce new induction variables, that possibly create scalar dependences in the original scop, before we code generate the scop. The resulting scalar dependences may then inhibit correct code generation of the scop. To prevent this, we first version the code without a run-time check and only then introduce new parameters and the run-time condition. The if-condition that guards the original scop from being modified by the SCEVexpander. This change causes some test case changes as the run-time conditions are now introduced in the split basic block rather than in the entry basic block. This fixes http://llvm.org/PR22069 Test case reduced by: Karthik Senthil llvm-svn: 233477	2015-03-28 09:34:40 +00:00
Tobias Grosser	17778eb826	Drop redundant run line in check llvm-svn: 233476	2015-03-28 09:34:34 +00:00
Tobias Grosser	2873645c51	Drop -polly-vectorizer-unroll-only option This options was earlier used for experiments with the vectorizer, but to my knowledge is not really used anymore. If anybody needs this, we can always reintroduce this feature. llvm-svn: 232934	2015-03-23 07:00:36 +00:00
Tobias Grosser	bbb4cec2e8	Use schedule trees to perform post-scheduling transformations Replacing the old band_tree based code with code that is based on the new schedule tree [1] interface makes applying complex schedule transformations a lot more straightforward. We now do not need to reason about the meaning of flat schedules, but can use a more straightforward tree structure. We do not yet exploit this a lot in the current code, but hopefully we will be able to do so soon. This change also allows us to drop some code, as isl now provides some higher level interfaces to apply loop transformations such as tiling. This change causes some small test case changes as isl uses a slightly different way to perform loop tiling, but no significant functional changes are intended. [1] http://impact.gforge.inria.fr/impact2014/papers/impact2014-verdoolaege.pdf llvm-svn: 232911	2015-03-22 12:06:39 +00:00
Tobias Grosser	9715b7c592	Add forgotten 'FileCheck' to tiling test cases These test cases did not verify the CHECK lines at all. We add the FileCheck and also fix some broken CHECK lines. Being here, we extend the checks to cover the whole loop structure. llvm-svn: 232710	2015-03-19 07:39:34 +00:00
Duncan P. N. Exon Smith	0353f279f1	Fix debug info now that the verifier is on `i32 0` isn't a valid type, and `!{i32 0}` isn't an empty array. Needed because of r232505. llvm-svn: 232514	2015-03-17 18:23:38 +00:00
David Blaikie	4a54fae8cb	Test case updates for explicit type parameter to the gep operator llvm-svn: 232186	2015-03-13 18:21:20 +00:00
Tobias Grosser	f2716ea7d5	Add -polly-vectorizer=stripmine By strip-mining outer loops to the innermost level we can enable LLVM's loop vectorizer to vectorize outer loops. llvm-svn: 232100	2015-03-12 20:48:07 +00:00
Tobias Grosser	bb4126470a	Drop option to prepare code for the BB vectorizer The BB vectorizer is deprecated and there is no point in generating code for it any more. This option was introduced when there was not yet any loop vectorizer in sight. Now being matured, Polly should target the loop vectorizer. llvm-svn: 232099	2015-03-12 20:47:58 +00:00

1 2 3 4 5 ...

419 Commits