llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	5f860fdfe9	Do not run GPGPU test cases without nvptx target Tag the GPGPU codegen test cases as unsupported if the nvptx target is not included in the current llvm build. Contributed-by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 208779	2014-05-14 14:18:14 +00:00
Sebastian Pop	c5c1055e3f	do not build llc and lli for polly test llvm-svn: 208619	2014-05-12 19:43:20 +00:00
Sebastian Pop	e8863b8f00	correct the delinearization failing case collect terms from affine and non affine memory accesses llvm-svn: 208616	2014-05-12 19:02:02 +00:00
Sebastian Pop	fcf68758b8	unxfail passing testcase llvm-svn: 208233	2014-05-07 18:01:32 +00:00
Tobias Grosser	f56af204b9	Add delinearization testcase for ivs that do not follow the loop order This is a test case that is currently failing, but that should start working with an upcoming version of our delinearization pass. llvm-svn: 207678	2014-04-30 17:49:22 +00:00
Tobias Grosser	841009a2cc	We missed two files in the last commit. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206901	2014-04-22 15:57:30 +00:00
Tobias Grosser	0d11dbabc4	Fixed missing cloog test with automake/configure build setup Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206900	2014-04-22 15:30:43 +00:00
Tobias Grosser	954939842f	Really fix the load case. Commit r206510 falsely advertised to fix the load cases, even though it only fixed the store case. This commit adds the same fix for the load case including the missing test coverage. llvm-svn: 206577	2014-04-18 09:46:35 +00:00
Tobias Grosser	50fd7010d8	Ensure a scalar pointer when issuing a vector load Even tough we may want to generate a vector load, the address from which to load still is a scalar. Make sure even if previous address computations may have been vectorized, that the addresses are also available as scalars. This fixes http://llvm.org/PR19469 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206510	2014-04-17 23:13:49 +00:00
Tobias Grosser	75b76729ab	Fix for vector codegen in OpenMP subfunctions Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206332	2014-04-15 22:30:06 +00:00
Tobias Grosser	364c136d08	Dependences: Do not fail in case a schedule eliminates all dependences The following example shows a non-parallel loop void f(int a[]) { int i; for (i = 0; i < 10; ++i) A[i] = A[i+5]; } which, in case we import a schedule that limits the iteration domain to 0 <= i < 5, becomes parallel. Previously we crashed in such cases, now we just recognize it as parallel. This fixes http://llvm.org/PR19435 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206318	2014-04-15 20:14:57 +00:00
Tobias Grosser	efc3013544	Codegeneration: Free memory correctly when using -polly-vectorizer=polly This fixes PR19421. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206156	2014-04-14 08:33:24 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Tobias Grosser	79baa21242	ScopInfo: Scalar accesses are zero dimensional llvm-svn: 205958	2014-04-10 08:38:02 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	64b95123ef	Delete trivial PHI nodes (aka stack slot sharing) During code preperation trivial PHI nodes (mainly introduced by lcssa) are deleted to decrease the number of introduced allocas (==> dependences). However simply replacing them by their only incoming value would cause the independent block pass to introduce new allocas. To prevent this we try to share stack slots during code preperarion, hence to reuse a already created alloca 'to demote' the trivial PHI node. This works if we know that the value stored in this alloca will be the incoming value of the trivial PHI at the end of the predecessor block of this trivial PHI. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 205320	2014-04-01 16:01:33 +00:00
Tobias Grosser	5fa36c0ff6	Updated test/create_ll.sh to work with old & new clang versions. We explicitly specifying all filenames instead of assuming some naming convention used by clang and opt. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 204726	2014-03-25 15:50:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Tobias Grosser	8111a0ae7d	autoconf: Fix module loading in tests llvm-svn: 203925	2014-03-14 13:27:26 +00:00
Sebastian Pop	7537be92f4	add -load polly.so only when not LINK_POLLY_INTO_TOOLS llvm-svn: 203888	2014-03-14 04:04:36 +00:00
Rafael Espindola	80f20133d4	Fix polly tests to not include aliases to declarations. llvm-svn: 203721	2014-03-12 21:48:42 +00:00
Sebastian Pop	1b57e8f028	add dependence of check-polly on llc to avoid an error when directly doing ninja check-polly after cmake 'Could not find llc in .../ninja/bin'. llvm-svn: 203696	2014-03-12 18:55:25 +00:00
Tobias Grosser	4ba60fe9eb	ScheduleOptimizer: Fix prevectorization. In case we are at the innermost band, we try to prepare for vectorization. This means, we look for the innermost parallel loop and strip mine this loop to the innermost level using a strip-mine factor corresponding to the number of vector iterations. For whatever reason, the code that implemented this feature was broken. We now added a comment, a test case and obviously also the right code. llvm-svn: 203544	2014-03-11 06:27:36 +00:00
Tobias Grosser	e655754d57	Update CLooG and some test cases This is necessary to avoid test failures in the CLooG test suite due to the recent isl update. We also need to update two polly test cases which rely on a certain order in the textual description that isl chooses for its sets and maps. Changes here are not often, but we should probably switch to a check that verifies such maps are semantically equivalent instead of represented identically. llvm-svn: 203476	2014-03-10 17:31:22 +00:00
Tobias Grosser	37c9b8e0f2	Emit llvm.loop metadata for parallel loops For now we only mark innermost loops for the loop vectorizer. We could later also mark not-innermost loops to enable the introduction of openmp parallelism. llvm-svn: 202854	2014-03-04 14:59:00 +00:00
Tobias Grosser	356faa8f09	Dead code elimination: Schedule another approximative step before actual DCE In 'obsequi' we have a scop in which the current dead code elimination works, but the generated code is way too complex. To avoid this trouble (and to not disable the DCE entirely) we add an additional approximative step before the actual dead code elimination. This should fix one of the two current nightly-test issues. Polly could be improved to handle 'obsequi' by teaching it to introduce only a single parameter for (%1 and zext %1) which halves the number of parameters and allows polly to derive a simpler representation for the set of live iterations. However, this needs some time to investigate. I will commit a test case as soon as we have a reduced one. llvm-svn: 202010	2014-02-24 08:52:20 +00:00
Tobias Grosser	472d3b7037	codegen: Update LoopInfo correctly Add the 'polly.start' basic block to the loop that surrounds the scop we just codegenerate. This fixes PR13441 llvm-svn: 202000	2014-02-24 00:50:49 +00:00
Tobias Grosser	38c36ea18e	Do not fail in case we do not have valid dependences In case we do not have valid dependences, we do not run dead code elimination or the schedule optimizer. This fixes an infinite loop in the dead code elimination (PR12110). llvm-svn: 201982	2014-02-23 15:15:44 +00:00
Tobias Grosser	88640d2b47	Use -polly-codegen-isl in isl-codegen test Reported-by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 201902	2014-02-21 23:08:54 +00:00
Tobias Grosser	817d51dd1b	DCE: Switch to hybrid precise-unprecise analysis Instead of giving a choice between a precise (but possibly very complex) analysis and an approximative analysis we now use a hybrid approach which uses N precise steps followed by one approximating step. The precision of the analysis can be changed by increasing N. With a default of 'N' = 2, we get fully precise results for our current test cases and should not run into performance problems for more complex test cases. We can adjust this value when we got more experience with this dead code elimination. llvm-svn: 201888	2014-02-21 20:51:46 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Tobias Grosser	37eb422f69	Add polyhedral dead code elimination. This pass eliminates loop iterations that compute results that are not used later on. This can help e.g. in D, where the default zero-initialization is often unnecessary if right after new values are assigned to an array. Contributed-by: Peter Conn <conn.peter@gmail.com> llvm-svn: 201817	2014-02-20 21:43:54 +00:00
Tobias Grosser	d6aafa7c2e	Do not track location of scalar dependences in ScopInfo We do not have a use for this information at the moment. If we need this at some point, the "instruction -> access" mapping needs to be enhanced as a single instruction could then possibly perform multiple accesses. This patch allows us to build the polyhedral information for scops with scalar dependences. llvm-svn: 201815	2014-02-20 21:29:09 +00:00
Tobias Grosser	a1689937ba	Check scops a second time before working on them In rare cases the modification of one scop can effect the validity of other scops, as code generation of an earlier scop may make the scalar evolution functions derived for later scops less precise. The example that triggered this patch was a scop that contained an 'or' expression as follows: %add13710 = or i32 %j.19, 1 --> {(1 + (4 * %l)),+,2}<nsw><%for.body81> Scev could only analyze the 'or' as it knew %j.19 is a multiple of 2. This information was not available after the first scop was code generated (or independent-blocks was run on it) and SCEV could not derive a precise SCEV expression any more. This means we could not any more code generate this SCoP. My current understanding is that there is always the risk that an earlier code generation change invalidates later scops. As the example we have seen here is difficult to avoid, we use this occasion to guard us against all such invalidations. This patch "solves" this issue by verifying right before we start working on a detected scop, if this scop is in fact still valid. This adds a certain overhead. However the verification we run is anyways very fast and secondly it is only run on detected scops. So the overhead should not be very large. As a later optimization we could detect scops only on demand, such that we need to run scop-detections always only a single time. This should fix the single last failure in the LLVM test-suite for the new scev-based code generation. llvm-svn: 201593	2014-02-18 18:49:49 +00:00
Tobias Grosser	933edd04af	IndependentBlocks: Do not assert for PHI nodes outside of scops There does not seem to be a reason that we can not support PHI nodes outside of the scop that reference values within the SCoP. Or at least, the attached test case seems to do the right thing. We remove the assert for now. llvm-svn: 200427	2014-01-29 23:08:10 +00:00
Tobias Grosser	28a70c543d	ScopDetect: Transitively remove all children after region expansion In rare cases, a region R which is itself not valid has an indirect child region that is valid. When R becomes part of a valid region by expansion of another region, then all children of R have to be erased from the set of valid regions. This patch ensures that indirect children are erased in addition to direct children. Contributed-by: Armin Groesslinger <armin.groesslinger@uni-passau.de> Tobias: I added a reduced test case and adjusted the logic of the patch to only recurse until the first child is found. llvm-svn: 200411	2014-01-29 19:05:30 +00:00
Tobias Grosser	458fb78cfa	Check if array base addresses are invariant Array base addresses need to be invariant in the region considered. The base address has to be computed outside the region, or, when it is computed inside, the value must not change with the iterations of the loops. For example, when a two-dimensional array is represented as a pointer to pointers the base address A[i] in an access A[i][j] changes with i; therefore, such regions have to be rejected. Contributed by: Armin Größlinger <armin.groesslinger@uni-passau.de> llvm-svn: 200314	2014-01-28 12:58:58 +00:00
Tobias Grosser	5b5daab9f1	Add more test cases to check loop invariance of the base pointer. llvm-svn: 200305	2014-01-28 10:29:17 +00:00
Tobias Grosser	24d7e669b3	Do not test polybench with 'make check-polly' Those test cases should be tested in the LLVM test suite. For Polly we should extract regression tests for the individual passes. llvm-svn: 200206	2014-01-27 10:37:33 +00:00
Tobias Grosser	54646f7fab	Remove other unneccessary uses of -O3 in the test suite The polly test suite is now -O3 clean. llvm-svn: 200205	2014-01-27 10:37:06 +00:00
Tobias Grosser	a7fea8386c	Do not run -O3 to canonicalize test case This is not only not necessary, but in case -03 changes this can actually cause arbitrarily failing test cases such as, e.g., a recent change by Chandler that caused -O3 to unroll the loop body, which made the loop we wanted to detect disappear and consequently this test case fail. llvm-svn: 200204	2014-01-27 10:23:12 +00:00
Tobias Grosser	b917f47fc4	Dependences: Bound the time dependence calculation is allowed to take Count the number of computational steps that have been used to solve the dependence problem and abort in case we reach the "compute-out". This ensures we do not hang forever in cases the dependence problem is too difficult to solve. There is just a single case in the LLVM test-suite that runs into the compute-out. Even in this case, we can probably coalesce some of the parameters (i32 b, i32 b zext i64, ...) to simplify the problem enough to not hit the compute out. However, for now we set the compute out in place to address the general issue. The compute out was choosen such that it stops on a recent laptop after about 8 seconds. llvm-svn: 200156	2014-01-26 19:38:34 +00:00
Tobias Grosser	0d43646f93	Adjust test case to changed cloog output llvm-svn: 199587	2014-01-19 11:53:51 +00:00
Tobias Grosser	8519f897e7	Report detected scops using the new diagnostics We now report the following: $ polly-clang -O3 -mllvm -polly -mllvm -polly-report test.c -c \ -gline-tables-only note: Polly detected an optimizable loop region (scop) in function 'foo' test.c:2: Start of scop test.c:3: End of scop note: Polly detected an optimizable loop region (scop) in function 'bar' test.c:9: Start of scop test.c:13: End of scop llvm-svn: 197558	2013-12-18 10:49:53 +00:00
Tobias Grosser	7b6f9ba572	ScopValidator: smax expressions are no parameters This fixes PR18155 which is a regression introduced in 152913. llvm-svn: 196827	2013-12-09 21:51:46 +00:00
Tobias Grosser	7d66a19fe4	test: Remove use of defaultOpts llvm-svn: 196826	2013-12-09 21:51:31 +00:00
Tobias Grosser	54ee0ba74d	IslCodegen: Support for run-time conditions llvm-svn: 194948	2013-11-17 03:18:25 +00:00
Tobias Grosser	e86109f508	ScopInfo: Add support for AssumedContext When constructing a scop sometimes the exact representation of a statement or condition would be very complex, but there is a common case which is a lot simpler, but which is only valid under certain assumptions. The assumed context records the assumptions taken during the construction of this scop and that need to be code generated as a run-time test. At the moment, we do not yet model any assumptions, but only added the AssumedContext as well as the isl-ast generation support. As a next step, this needs to be hooked up with the isl code generation. if (1) /* run-time condition / { / optimized code / } else { / original code */ } llvm-svn: 193652	2013-10-29 21:05:49 +00:00
Tobias Grosser	4f8c0877e8	This test case requires assertions llvm-svn: 192530	2013-10-12 09:15:56 +00:00

1 2 3 4 5

210 Commits