llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	5e275bc83a	[Refactor] Create nicer test cases from C/C++ Insert a header into the new testcase containing a sample RUN line, a FIXME, and an XFAIL. Then insert the formatted C code and finally the LLVM-IR without attributes, the module ID or the target triple. llvm-svn: 211612	2014-06-24 17:02:53 +00:00
Yabin Hu	cc91169fd7	Remove use of llvm.codegen intrinsic for GPGPU codegen We use llvm.codegen intrinsic to generate code for embedded LLVM-IR strings. The reason we introduced such an intrinsic is that previous clang/opt tools were NOT linked with various LLVM targets and their AsmParsers and AsmPrinters. Since clang/opt are now linked with all the needed libraries, we no longer need the llvm.codegen intrinsic. llvm-svn: 211573	2014-06-24 08:11:36 +00:00
Johannes Doerfert	f1906138b4	Model statement wise reduction dependences + Collect reduction dependences + Introduced TYPE_RED in Dependences.h which can be used to obtain the reduction dependences + Used TYPE_RED to prevent parallelization while we do not have a privatizing code generation + Relax the dependences for non-parallel code generation + Add privatization dependences to ensure correctness + 12 Test cases to check for reduction and privatization dependences llvm-svn: 211369	2014-06-20 16:37:11 +00:00
Johannes Doerfert	da80386700	Missing reduction detection test cases llvm-svn: 211235	2014-06-18 23:08:14 +00:00
Tobias Grosser	f4fcbf4097	Test delinearization of 2D diagonal matrix llvm-svn: 210538	2014-06-10 14:48:17 +00:00
Tobias Grosser	be7eaddc69	Adjust another test case to not access out of bounds llvm-svn: 210208	2014-06-04 19:41:47 +00:00
Tobias Grosser	5416a0395f	Adjust multidim test cases to not access out-of-bound memory We do this currently only for test cases where we have integer offsets that clearly access array dimensions out-of-bound. -; for (long i = 0; i < n; i++) -; for (long j = 0; j < m; j++) -; for (long k = 0; k < o; k++) +; for (long i = 0; i < n - 3; i++) +; for (long j = 4; j < m; j++) +; for (long k = 0; k < o - 7; k++) ; A[i+3][j-4][k+7] = 1.0; This will be helpful if we later want to simplify the access functions under the assumption that they do not access memory out of bounds. llvm-svn: 210179	2014-06-04 11:47:54 +00:00
Sebastian Pop	422e33f363	record delinearization result and reuse it in polyhedral translation Without this patch, the testcase would fail on the delinearization of the second array: ; void foo(long n, long m, long o, double A[n][m][o]) { ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) { ; A[i+3][j-4][k+7] = 1.0; ; A[i][0][k] = 2.0; ; } ; } ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[3 + i0, -4 + i1, 7 + i2] }; ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; Here is the output of FileCheck on the testcase without this patch: ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; ^ <stdin>:26:2: note: possible intended match here [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[o0] }; ^ It is possible to find a good delinearization for A[i][0][k] only in the context of the delinearization of both array accesses. There are two ways to delinearize together all array subscripts touching the same base address: either duplicate the code from scop detection to first gather all array references and then run the delinearization; or as implemented in this patch, use the same delinearization info that we computed during scop detection. llvm-svn: 210117	2014-06-03 18:16:31 +00:00
Johannes Doerfert	c3958b214c	Added option for n-dimensional rectangular tiling + CL-option --polly-tile-sizes=<int,...,int> The i'th value is used as a tile size for dimension i, if there is no i'th value, the value of --polly-default-tile-size is used + CL-option --polly-default-tile-size=int Used if no tile size is given for a dimension i + 3 Simple testcases llvm-svn: 209753	2014-05-28 17:21:02 +00:00
Tobias Grosser	5f860fdfe9	Do not run GPGPU test cases without nvptx target Tag the GPGPU codegen test cases as unsupported if the nvptx target is not included in the current llvm build. Contributed-by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 208779	2014-05-14 14:18:14 +00:00
Sebastian Pop	c5c1055e3f	do not build llc and lli for polly test llvm-svn: 208619	2014-05-12 19:43:20 +00:00
Sebastian Pop	e8863b8f00	correct the delinearization failing case collect terms from affine and non affine memory accesses llvm-svn: 208616	2014-05-12 19:02:02 +00:00
Sebastian Pop	fcf68758b8	unxfail passing testcase llvm-svn: 208233	2014-05-07 18:01:32 +00:00
Tobias Grosser	f56af204b9	Add delinearization testcase for ivs that do not follow the loop order This is a test case that is currently failing, but that should start working with an upcoming version of our delinearization pass. llvm-svn: 207678	2014-04-30 17:49:22 +00:00
Tobias Grosser	841009a2cc	We missed two files in the last commit. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206901	2014-04-22 15:57:30 +00:00
Tobias Grosser	0d11dbabc4	Fixed missing cloog test with automake/configure build setup Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206900	2014-04-22 15:30:43 +00:00
Tobias Grosser	954939842f	Really fix the load case. Commit r206510 falsely advertised to fix the load cases, even though it only fixed the store case. This commit adds the same fix for the load case including the missing test coverage. llvm-svn: 206577	2014-04-18 09:46:35 +00:00
Tobias Grosser	50fd7010d8	Ensure a scalar pointer when issuing a vector load Even though we may want to generate a vector load, the address from which to load still is a scalar. Make sure even if previous address computations may have been vectorized, that the addresses are also available as scalars. This fixes http://llvm.org/PR19469 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206510	2014-04-17 23:13:49 +00:00
Tobias Grosser	75b76729ab	Fix for vector codegen in OpenMP subfunctions Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206332	2014-04-15 22:30:06 +00:00
Tobias Grosser	364c136d08	Dependences: Do not fail in case a schedule eliminates all dependences The following example shows a non-parallel loop void f(int A[]) { int i; for (i = 0; i < 10; ++i) A[i] = A[i+5]; } which, in case we import a schedule that limits the iteration domain to 0 <= i < 5, becomes parallel. Previously we crashed in such cases, now we just recognize it as parallel. This fixes http://llvm.org/PR19435 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206318	2014-04-15 20:14:57 +00:00
Tobias Grosser	efc3013544	Codegeneration: Free memory correctly when using -polly-vectorizer=polly This fixes PR19421. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206156	2014-04-14 08:33:24 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Tobias Grosser	79baa21242	ScopInfo: Scalar accesses are zero dimensional llvm-svn: 205958	2014-04-10 08:38:02 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	64b95123ef	Delete trivial PHI nodes (aka stack slot sharing) During code preparation trivial PHI nodes (mainly introduced by lcssa) are deleted to decrease the number of introduced allocas (==> dependences). However simply replacing them by their only incoming value would cause the independent block pass to introduce new allocas. To prevent this we try to share stack slots during code preparation, hence to reuse an already created alloca 'to demote' the trivial PHI node. This works if we know that the value stored in this alloca will be the incoming value of the trivial PHI at the end of the predecessor block of this trivial PHI. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 205320	2014-04-01 16:01:33 +00:00
Tobias Grosser	5fa36c0ff6	Updated test/create_ll.sh to work with old & new clang versions. We explicitly specify all filenames instead of assuming some naming convention used by clang and opt. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 204726	2014-03-25 15:50:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Tobias Grosser	8111a0ae7d	autoconf: Fix module loading in tests llvm-svn: 203925	2014-03-14 13:27:26 +00:00
Sebastian Pop	7537be92f4	add -load polly.so only when not LINK_POLLY_INTO_TOOLS llvm-svn: 203888	2014-03-14 04:04:36 +00:00
Rafael Espindola	80f20133d4	Fix polly tests to not include aliases to declarations. llvm-svn: 203721	2014-03-12 21:48:42 +00:00
Sebastian Pop	1b57e8f028	add dependence of check-polly on llc to avoid an error when directly doing ninja check-polly after cmake 'Could not find llc in .../ninja/bin'. llvm-svn: 203696	2014-03-12 18:55:25 +00:00
Tobias Grosser	4ba60fe9eb	ScheduleOptimizer: Fix prevectorization. In case we are at the innermost band, we try to prepare for vectorization. This means, we look for the innermost parallel loop and strip mine this loop to the innermost level using a strip-mine factor corresponding to the number of vector iterations. For whatever reason, the code that implemented this feature was broken. We now added a comment, a test case and obviously also the right code. llvm-svn: 203544	2014-03-11 06:27:36 +00:00
Tobias Grosser	e655754d57	Update CLooG and some test cases This is necessary to avoid test failures in the CLooG test suite due to the recent isl update. We also need to update two polly test cases which rely on a certain order in the textual description that isl chooses for its sets and maps. Changes here are not often, but we should probably switch to a check that verifies such maps are semantically equivalent instead of represented identically. llvm-svn: 203476	2014-03-10 17:31:22 +00:00
Tobias Grosser	37c9b8e0f2	Emit llvm.loop metadata for parallel loops For now we only mark innermost loops for the loop vectorizer. We could later also mark not-innermost loops to enable the introduction of openmp parallelism. llvm-svn: 202854	2014-03-04 14:59:00 +00:00
Tobias Grosser	356faa8f09	Dead code elimination: Schedule another approximative step before actual DCE In 'obsequi' we have a scop in which the current dead code elimination works, but the generated code is way too complex. To avoid this trouble (and to not disable the DCE entirely) we add an additional approximative step before the actual dead code elimination. This should fix one of the two current nightly-test issues. Polly could be improved to handle 'obsequi' by teaching it to introduce only a single parameter for (%1 and zext %1) which halves the number of parameters and allows polly to derive a simpler representation for the set of live iterations. However, this needs some time to investigate. I will commit a test case as soon as we have a reduced one. llvm-svn: 202010	2014-02-24 08:52:20 +00:00
Tobias Grosser	472d3b7037	codegen: Update LoopInfo correctly Add the 'polly.start' basic block to the loop that surrounds the scop we just codegenerate. This fixes PR13441 llvm-svn: 202000	2014-02-24 00:50:49 +00:00
Tobias Grosser	38c36ea18e	Do not fail in case we do not have valid dependences In case we do not have valid dependences, we do not run dead code elimination or the schedule optimizer. This fixes an infinite loop in the dead code elimination (PR12110). llvm-svn: 201982	2014-02-23 15:15:44 +00:00
Tobias Grosser	88640d2b47	Use -polly-codegen-isl in isl-codegen test Reported-by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 201902	2014-02-21 23:08:54 +00:00
Tobias Grosser	817d51dd1b	DCE: Switch to hybrid precise-unprecise analysis Instead of giving a choice between a precise (but possibly very complex) analysis and an approximative analysis we now use a hybrid approach which uses N precise steps followed by one approximating step. The precision of the analysis can be changed by increasing N. With a default of 'N' = 2, we get fully precise results for our current test cases and should not run into performance problems for more complex test cases. We can adjust this value when we got more experience with this dead code elimination. llvm-svn: 201888	2014-02-21 20:51:46 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Tobias Grosser	37eb422f69	Add polyhedral dead code elimination. This pass eliminates loop iterations that compute results that are not used later on. This can help e.g. in D, where the default zero-initialization is often unnecessary if right after new values are assigned to an array. Contributed-by: Peter Conn <conn.peter@gmail.com> llvm-svn: 201817	2014-02-20 21:43:54 +00:00
Tobias Grosser	d6aafa7c2e	Do not track location of scalar dependences in ScopInfo We do not have a use for this information at the moment. If we need this at some point, the "instruction -> access" mapping needs to be enhanced as a single instruction could then possibly perform multiple accesses. This patch allows us to build the polyhedral information for scops with scalar dependences. llvm-svn: 201815	2014-02-20 21:29:09 +00:00
Tobias Grosser	a1689937ba	Check scops a second time before working on them In rare cases the modification of one scop can affect the validity of other scops, as code generation of an earlier scop may make the scalar evolution functions derived for later scops less precise. The example that triggered this patch was a scop that contained an 'or' expression as follows: %add13710 = or i32 %j.19, 1 --> {(1 + (4 * %l)),+,2}<nsw><%for.body81> Scev could only analyze the 'or' as it knew %j.19 is a multiple of 2. This information was not available after the first scop was code generated (or independent-blocks was run on it) and SCEV could not derive a precise SCEV expression any more. This means we could no longer code generate this SCoP. My current understanding is that there is always the risk that an earlier code generation change invalidates later scops. As the example we have seen here is difficult to avoid, we use this occasion to guard us against all such invalidations. This patch "solves" this issue by verifying right before we start working on a detected scop, if this scop is in fact still valid. This adds a certain overhead. However, the verification we run is very fast anyway, and it is only run on detected scops. So the overhead should not be very large. As a later optimization we could detect scops only on demand, such that we need to run scop-detections always only a single time. This should fix the single last failure in the LLVM test-suite for the new scev-based code generation. llvm-svn: 201593	2014-02-18 18:49:49 +00:00
Tobias Grosser	933edd04af	IndependentBlocks: Do not assert for PHI nodes outside of scops There does not seem to be a reason that we can not support PHI nodes outside of the scop that reference values within the SCoP. Or at least, the attached test case seems to do the right thing. We remove the assert for now. llvm-svn: 200427	2014-01-29 23:08:10 +00:00
Tobias Grosser	28a70c543d	ScopDetect: Transitively remove all children after region expansion In rare cases, a region R which is itself not valid has an indirect child region that is valid. When R becomes part of a valid region by expansion of another region, then all children of R have to be erased from the set of valid regions. This patch ensures that indirect children are erased in addition to direct children. Contributed-by: Armin Groesslinger <armin.groesslinger@uni-passau.de> Tobias: I added a reduced test case and adjusted the logic of the patch to only recurse until the first child is found. llvm-svn: 200411	2014-01-29 19:05:30 +00:00
Tobias Grosser	458fb78cfa	Check if array base addresses are invariant Array base addresses need to be invariant in the region considered. The base address has to be computed outside the region, or, when it is computed inside, the value must not change with the iterations of the loops. For example, when a two-dimensional array is represented as a pointer to pointers the base address A[i] in an access A[i][j] changes with i; therefore, such regions have to be rejected. Contributed by: Armin Größlinger <armin.groesslinger@uni-passau.de> llvm-svn: 200314	2014-01-28 12:58:58 +00:00
Tobias Grosser	5b5daab9f1	Add more test cases to check loop invariance of the base pointer. llvm-svn: 200305	2014-01-28 10:29:17 +00:00
Tobias Grosser	24d7e669b3	Do not test polybench with 'make check-polly' Those test cases should be tested in the LLVM test suite. For Polly we should extract regression tests for the individual passes. llvm-svn: 200206	2014-01-27 10:37:33 +00:00
Tobias Grosser	54646f7fab	Remove other unnecessary uses of -O3 in the test suite The polly test suite is now -O3 clean. llvm-svn: 200205	2014-01-27 10:37:06 +00:00
Tobias Grosser	a7fea8386c	Do not run -O3 to canonicalize test case This is not only not necessary, but in case -O3 changes this can actually cause arbitrarily failing test cases such as, e.g., a recent change by Chandler that caused -O3 to unroll the loop body, which made the loop we wanted to detect disappear and consequently this test case fail. llvm-svn: 200204	2014-01-27 10:23:12 +00:00
Tobias Grosser	b917f47fc4	Dependences: Bound the time dependence calculation is allowed to take Count the number of computational steps that have been used to solve the dependence problem and abort in case we reach the "compute-out". This ensures we do not hang forever in cases the dependence problem is too difficult to solve. There is just a single case in the LLVM test-suite that runs into the compute-out. Even in this case, we can probably coalesce some of the parameters (i32 b, i32 b zext i64, ...) to simplify the problem enough to not hit the compute out. However, for now we set the compute out in place to address the general issue. The compute out was chosen such that it stops on a recent laptop after about 8 seconds. llvm-svn: 200156	2014-01-26 19:38:34 +00:00
Tobias Grosser	0d43646f93	Adjust test case to changed cloog output llvm-svn: 199587	2014-01-19 11:53:51 +00:00
Tobias Grosser	8519f897e7	Report detected scops using the new diagnostics We now report the following: $ polly-clang -O3 -mllvm -polly -mllvm -polly-report test.c -c \ -gline-tables-only note: Polly detected an optimizable loop region (scop) in function 'foo' test.c:2: Start of scop test.c:3: End of scop note: Polly detected an optimizable loop region (scop) in function 'bar' test.c:9: Start of scop test.c:13: End of scop llvm-svn: 197558	2013-12-18 10:49:53 +00:00
Tobias Grosser	7b6f9ba572	ScopValidator: smax expressions are no parameters This fixes PR18155 which is a regression introduced in 152913. llvm-svn: 196827	2013-12-09 21:51:46 +00:00
Tobias Grosser	7d66a19fe4	test: Remove use of defaultOpts llvm-svn: 196826	2013-12-09 21:51:31 +00:00
Tobias Grosser	54ee0ba74d	IslCodegen: Support for run-time conditions llvm-svn: 194948	2013-11-17 03:18:25 +00:00
Tobias Grosser	e86109f508	ScopInfo: Add support for AssumedContext When constructing a scop sometimes the exact representation of a statement or condition would be very complex, but there is a common case which is a lot simpler, but which is only valid under certain assumptions. The assumed context records the assumptions taken during the construction of this scop and that need to be code generated as a run-time test. At the moment, we do not yet model any assumptions, but only added the AssumedContext as well as the isl-ast generation support. As a next step, this needs to be hooked up with the isl code generation. if (1) /* run-time condition */ { /* optimized code */ } else { /* original code */ } llvm-svn: 193652	2013-10-29 21:05:49 +00:00
Tobias Grosser	4f8c0877e8	This test case requires assertions llvm-svn: 192530	2013-10-12 09:15:56 +00:00
Sebastian Pop	20594a842c	use -polly-codegen-isl in tests under test/Isl llvm-svn: 192110	2013-10-07 16:43:04 +00:00
Sebastian Pop	946070f2f0	do not use -polly-cloog in a ScopInfo testcase llvm-svn: 192109	2013-10-07 16:43:00 +00:00
Tobias Grosser	3613fd7a35	ScopInfo: Correctly handle true/false conditions This is a modified version of the originally contributed patch. Contributed-by: alexandre.isoard@gmail.com llvm-svn: 190237	2013-09-07 01:54:13 +00:00
Tobias Grosser	815c635cec	[CodeGen] Fixup assert fails caused by incorrect LoopInfo update Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 189764	2013-09-02 16:13:00 +00:00
Daniel Dunbar	2bd59a2cc7	[tests] Update to use lit_config and lit package, as appropriate. llvm-svn: 188114	2013-08-09 21:54:36 +00:00
Tobias Grosser	22a155a7a6	ScopInfo: add a testcase that share parameters within nested start. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 187772	2013-08-06 04:36:45 +00:00
Tobias Grosser	e42ddb9ad3	ScopInfo: Split start value from SCEVAddRecExpr to enable parameter sharing. SCoP invariant parameters with different start values would prevent parameter sharing. For example, when compiling the following C code: void foo(float *input) { for (long j = 0; j < 8; j++) { // SCoP begin for (long i = 0; i < 8; i++) { float x = input[j * 64 + i + 1]; input[j * 64 + i] = x * x; } } } Polly would create two parameters for these memory accesses: p_0: {0,+,256} p_2: {4,+,256} [j * 64 + i + 1] => MemRef_input[o0] : 4o0 = p_1 + 4i0 [j * 64 + i] => MemRef_input[o0] : 4o0 = p_0 + 4i0 These parameters differ only in their start value. To enable parameter sharing, we split the start value from SCEVAddRecExpr, so they would share a single parameter that always has zero start value: p0: {0,+,256}<%for.cond1.preheader> [j * 64 + i + 1] => MemRef_input[o0] : 4o0 = 4 + p_1 + 4i0 [j * 64 + i] => MemRef_input[o0] : 4o0 = p_0 + 4i0 Such translation can make the polly-dependence much faster. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 187728	2013-08-05 15:14:15 +00:00
Tobias Grosser	96ef078583	Remove '-debug-only' from test case This flag was not used in the test case, but caused failures when LLVM was built without debugging. We can safely remove it. llvm-svn: 187343	2013-07-29 05:35:11 +00:00
Tobias Grosser	6e358c067a	TempScop: Actually load Polly in this test case llvm-svn: 187342	2013-07-29 05:18:09 +00:00
Tobias Grosser	7032ea6f5b	Remove second '-analyze' from command line llvm-svn: 187341	2013-07-29 05:15:33 +00:00
Tobias Grosser	85f7421731	JSONImporter: Free new schedule if found invalid In case we detect that the schedule the user wants to import is invalid we refuse it _and_ free the isl_maps containing it. Another bug found thanks to Rafael. llvm-svn: 187339	2013-07-29 05:12:01 +00:00
Tobias Grosser	880c52f56a	CodeGeneration: Fix double free in vector for We now use __isl_take to annotate the uses of the isl_set where we got the memory management wrong. Thanks to Rafael! His pipefail work hardened our test environment and exposed this bug nicely. llvm-svn: 187338	2013-07-29 01:58:07 +00:00
Rafael Espindola	cd61afb4ee	Use a slightly smaller hammer to make this pass. When first updating this test I only noticed the first RUN line. llvm-svn: 187328	2013-07-28 11:13:49 +00:00
Tobias Grosser	25f0342a68	Temporarily disable a test until I have finished the fix llvm-svn: 187305	2013-07-27 15:19:57 +00:00
Rafael Espindola	0329bb4fce	Looks like this test crashes. Add --crash to not for now. llvm-svn: 187300	2013-07-27 11:08:44 +00:00
Rafael Espindola	e559af8205	Add not to commands that fail. Polly devs: please check if these commands really should fail. llvm-svn: 187263	2013-07-26 22:49:25 +00:00
Tobias Grosser	6bcb34b180	ScopDetect: Add some test cases for sequential loops llvm-svn: 187024	2013-07-24 06:10:37 +00:00
Hongbin Zheng	63cc9467af	Ensure a correct order between memory accesses. Ensure that the scalar write access that corresponds to the result of a load instruction appears after the generic read access that corresponds to the load instruction. llvm-svn: 186419	2013-07-16 15:20:29 +00:00
Hongbin Zheng	5a772dcd84	IndependentBlock: Add option to disable scalar to array rewriting. llvm-svn: 186418	2013-07-16 15:19:33 +00:00
Tobias Grosser	6f0d6988a5	Dependences: Add a couple of basic test cases llvm-svn: 186254	2013-07-13 18:31:46 +00:00
Tobias Grosser	229d681675	Dependences: Clarify difference between value and memory based dependences We make the option a clear choice between the two analysis types and add descriptions about the difference between the two. llvm-svn: 186251	2013-07-13 17:37:55 +00:00
Sebastian Pop	784c012982	scop detection: remove an iteration over all uses reenabled reverted patch after checking that it passes without regressions on the nightly test-suite. Added testcase from Tobi. llvm-svn: 185720	2013-07-05 20:24:47 +00:00
Hongbin Zheng	8d3a888ca3	TempScop: (Partial) Implement the printDetail function. llvm-svn: 185254	2013-06-29 07:00:14 +00:00
Tobias Grosser	4f96749351	ScopInfo: Clarify may-write and must-write accesses llvm-svn: 184658	2013-06-23 05:21:18 +00:00
Tobias Grosser	3e030e178a	Correctly convert APInt to gmp values Previously this happened to work for integers up to i64, but we got it wrong for larger numbers. Fix this and add test cases to verify this keeps working. Reported by: Sven Verdoolaege <skimo at kotnet dot org> llvm-svn: 183986	2013-06-14 16:23:38 +00:00
Sebastian Pop	9d63234ad1	ScopDetect: check region entering edges are valid. When a region header is part of a loop, then all entering edges of this region should not come from the loop but outside the region. Otherwise, the loop may be only partially part of the region, which would cause troubles in handling induction variables. Currently, we can only model induction variables that are either fully part of the scop (loop induction variable) or induction variables that are scop-invariant (parameter). A loop that is only partially part of the scop causes troubles, as there is no good way to handle the induction variable in the independent blocks pass. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 183800	2013-06-11 22:20:40 +00:00
Sebastian Pop	2c9ec2e651	scop detection: do not run scop detection on regions without loops otherwise, use -polly-detect-scops-in-regions-without-loops to also detect scops in regions without loops llvm-svn: 183113	2013-06-03 16:35:37 +00:00
Tobias Grosser	93324aef17	Test that independent block pass does not transform induction variables The original test case showed a problem with the independent blocks pass and we decided to XFAIL it for now. Unfortunately the failure is not detected if we build without asserts and the verification of the independent block pass is not run. This change tests now for the actual reason of the failure and should trigger even in a non asserts build. We did not yet solve the underlying bug, but this should at least make the test suite behavior consistent. llvm-svn: 183025	2013-05-31 17:44:38 +00:00
Sebastian Pop	8fe6d11b84	scop detection: only handle functions with loops to detect scops in functions with no loops, use -polly-detect-scops-in-functions-without-loops llvm-svn: 182941	2013-05-30 17:47:32 +00:00
Sebastian Pop	359d3aa8a1	independent blocks: when moving Values, invalidate SCEV cached info llvm-svn: 182310	2013-05-20 20:02:03 +00:00
Sebastian Pop	c90ec7812e	rename make check target to match the naming convention followed in the other llvm projects llvm-svn: 182171	2013-05-17 23:04:28 +00:00
Tobias Grosser	3081b0f5ec	Update LoopInfo correctly When the Polly code generation was written we did not correctly update the LoopInfo data, but still claimed that the loop information is correct. This does not only lead to missed optimizations, but it can also cause miscompilations in case passes such as LoopSimplify are run after Polly. Reported-by: Sergei Larin <slarin@codeaurora.org> llvm-svn: 181987	2013-05-16 06:40:24 +00:00
Tobias Grosser	5db6ffd76f	LoopGenerators: Construct loops such that they are already loop rotated BeforeBB \| v GuardBB / \ __ PreHeaderBB \ / \ / \| latch HeaderBB \| \ / \ / < \ / \ / ExitBB This does not only remove the need for an explicit loop rotate pass, but it also gives us the possibility to skip the construction of the guard condition in case the loop is known to be executed at least once. We do not yet exploit this, but by implementing this analysis in the isl code generator we should be able to remove more guards than the generic loop rotate pass can. Another point is that loop rotation can introduce additional PHI nodes, which may hide that a loop can be executed in parallel. This change avoids this complication and will make it easier to move the openmp code generation into a separate pass. llvm-svn: 181986	2013-05-16 06:40:06 +00:00
Tobias Grosser	637bd63123	Move polly options into separate option category Use the new cl::OptionCategory support to move the Polly options into a separate option category. The aim is to hide most options and show by default only the options a user needs to influence '-O3 -polly'. The available options probably need some care, but here is the current status: Polly Options: Configure the polly loop optimizer -enable-polly-openmp - Generate OpenMP parallel code -polly - Enable the polly optimizer (only at -O3) -polly-no-tiling - Disable tiling in the scheduler -polly-only-func=<function-name> - Only run on a single function -polly-report - Print information about the activities of Polly -polly-vectorizer - Select the vectorization strategy =none - No Vectorization =polly - Polly internal vectorizer =unroll-only - Only grouped unroll the vectorize candidate loops =bb - The Basic Block vectorizer driven by Polly llvm-svn: 181295	2013-05-07 07:31:10 +00:00
Tobias Grosser	e8df5bd92b	IndependentBlocks: We can only reconstruct PHI nodes that are within the ScoP In the classical (non -polly-codegen-scev) mode, we assume that we can always recreate PHI nodes during code generation. This is not true. We can only reconstruct them from the polyhedral information, in case the entire loop of the PHI node is part of the SCoP and consequently the PHI node was translated in the polyhedral description. llvm-svn: 179674	2013-04-17 07:20:36 +00:00
Tobias Grosser	b5f92892d1	Remove unneeded RegionSimplify pass. We now support regions with multiple entries and multiple exits natively. Regions no longer need to be simplified to single entry and single exit. We need to XFAIL two test cases as this change increases the scop coverage and uncovers two failures in the independent blocks pass. The first failure will be fixed in a subsequent commit, the second one is in the non-default -polly-codegen-scev mode and still needs to be fixed. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179673	2013-04-17 07:20:30 +00:00
Tobias Grosser	36a01b0a28	tests: Fix 'instruction does not dominate all its uses' error The LLVM-IR of this test case was apparently incorrect. llvm-svn: 179672	2013-04-17 07:20:17 +00:00
Tobias Grosser	8edce4ee62	Support SCoPs with multiple entry edges. Regions that have multiple entry edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / for_region This for_region contains two entry edges 'then' -> 'for_region' and 'else' -> 'for_region'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively when the region is in -loop-simplify form, which means the entry block should not be a loop header. Contributed by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179586	2013-04-16 08:04:42 +00:00
Tobias Grosser	3ed2600cab	SCEVValidator: Correctly store 'k * p' as a parameter We do not only need to understand that 'k * p' is a parameter expression, but also need to store this expression in the set of parameters. Before this patch we wrongly stored the two individual parameters %k and %p. Reported by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 179485	2013-04-14 13:15:59 +00:00
Tobias Grosser	f242b806ac	ScheduleOpt: Do not crash on statements with empty iteration domains Statements with an empty iteration domain may not have a schedule assigned by the isl schedule optimizer. As Polly expects each statement to have a schedule, we keep the old schedule for such statements. This fixes http://llvm.org/PR15645 Reported-by: Johannes Doerfert <johannesdoerfert@gmx.de> llvm-svn: 179233	2013-04-10 22:48:08 +00:00
Sebastian Pop	1006614228	fix testcase llvm-svn: 179183	2013-04-10 16:44:08 +00:00
Tobias Grosser	ecb5092707	ScopDetect: Allow multiplications of the form <param> * <param> We handle these by treating this result of the multiplication as an additional parameter. llvm-svn: 179163	2013-04-10 07:42:28 +00:00
Tobias Grosser	0ee50f6ee4	Support SCoPs with multiple exit edges Regions that have multiple exit edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / after Region: if -> after This regions contains the bbs 'if', 'then', 'else', but not 'after'. It has two exit edges 'then' -> 'after' and 'else' -> 'after'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179159	2013-04-10 06:55:31 +00:00
Sebastian Pop	9f57c5b695	scop detection: properly instantiate SCEVs to the place where they are used Fix inspired from c2d4a0627e95c34a819b9d4ffb4db62daa78dade. Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When translate the data reference A[i] in statement S using scev, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S'. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10}. This used to crash in: int loopDimension = getLoopDepth(Expr->getLoop()); isl_aff LAff = isl_aff_set_coefficient_si( isl_aff_zero_on_domain(LocalSpace), isl_dim_in, loopDimension, 1); (gdb) p Expr->dump() {8,+,8}<nw><%do.body> (gdb) p getLoopDepth(Expr->getLoop()) $5 = 0 isl_space Space = isl_space_set_alloc(Ctx, 0, NbLoopSpaces); isl_local_space LocalSpace = isl_local_space_from_space(Space); As we are trying to create a memory access in a stmt that is outside all loops, LocalSpace has 0 dimensions: (gdb) p NbLoopSpaces $12 = 0 (gdb) p Statement.BB->dump() if.then: ; preds = %do.end %0 = load float %add.ptr, align 4 store float %0, float* %q.1.reg2mem, align 4 br label %if.end.single_exit and so the scev for %add.ptr should be taken at the place where it is used, i.e., it should be the value on the last iteration of the do.body loop, and not "{8,+,8}<nw><%do.body>". llvm-svn: 179148	2013-04-10 04:05:18 +00:00
Sebastian Pop	9ca6612731	IndependentBlocks: translate out of SSA all uses escaping the region llvm-svn: 179019	2013-04-08 13:05:41 +00:00
Tobias Grosser	4d96c8d714	clang-format: Many more files After this commit, polly is clang-format clean. This can be tested with 'ninja polly-check-format'. Updates to clang-format may change this, but the differences will hopefully be both small and general improvements to the formatting. We currently have some not very nice formatting for a couple of items, DEBUG() stmts for example. I believe the benefit of being clang-format clean outweights the not perfect layout of this code. llvm-svn: 177796	2013-03-23 01:05:07 +00:00
Tobias Grosser	369430ffca	codegen: properly instantiate SCEVs to the place where they are used Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When code generating S using scev based code generation, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S' and that we consequently do not know how to code generate. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10} and it can be code generated without troubles. llvm-svn: 177777	2013-03-22 23:42:53 +00:00
Tobias Grosser	8ff029ccf1	Add failing test case llvm-svn: 177645	2013-03-21 16:14:55 +00:00
Tobias Grosser	826b2af112	Remove last uses of canoncial induction variable when scev code generating We now detect scops without a canonical induction variable and can generate a polyhedral representation for them. There was no modification necessary to code generate these scops. llvm-svn: 177643	2013-03-21 16:14:50 +00:00
Tobias Grosser	5bfa4f8eb8	CodePrepare: Do not require canonical induction variables for scev based mode llvm-svn: 177593	2013-03-20 22:41:53 +00:00
Tobias Grosser	db8b8a5b8e	ScopDetect: Test case to verify that base pointers are scop invariant llvm-svn: 177582	2013-03-20 21:40:11 +00:00
Tobias Grosser	e4584f6abf	ScopDetect: Add test cases for non-simple regions llvm-svn: 177567	2013-03-20 20:02:35 +00:00
Tobias Grosser	ecfe21b792	Remove dependence on canonical induction variable When using the scev based code generation, we now do not rely on the presence of a canonical induction variable any more. This commit prepares the path to (conditionally) disable the induction variable canonicalization pass. llvm-svn: 177548	2013-03-20 18:03:18 +00:00
Tobias Grosser	d2fbbf0f74	IndependentBlocks: Add a couple of test cases. llvm-svn: 177438	2013-03-19 21:11:25 +00:00
Tobias Grosser	d4ff632fa9	ScopDetection: Add a couple of test cases llvm-svn: 177433	2013-03-19 20:15:19 +00:00
Sebastian Pop	97cb813c29	Correct function to decide if a SCEV can be ignored When doing SCEV based code generation, we ignore instructions calculating values that are fully defined by a SCEV expression. The values that are calculated by this instructions are recalculated on demand. This commit improves the check to verify if certain instructions can be ignored and recalculated on demand. llvm-svn: 177313	2013-03-18 20:21:13 +00:00
Tobias Grosser	7f54714dcc	tests: Properly check if asserts are available In my previous commits I failed to realise that my new requires lines fully disabled these tests. We now properly check if we are in an asserts build and only disable the tests if assertions are not available. Reported-by: Sean Silva <silvas@purdue.edu> llvm-svn: 176900	2013-03-12 21:27:39 +00:00
Tobias Grosser	ee9423920e	Missed on test case in the last commit llvm-svn: 176864	2013-03-12 13:39:40 +00:00
Tobias Grosser	c9a72919a5	Move tests that depend on -stats under 'requires asserts' This fixes issues caused by the following commit: r176733 | jvoung | 2013-03-08 17:56:31 -0500 Disable statistics on Release builds and move tests that depend on -stats. Reported by: Jack Howarth <howarth@bromo.med.uc.edu> llvm-svn: 176856	2013-03-12 08:45:15 +00:00
Bill Wendling	83e9312ece	Use attributes references on call/invoke instructions. llvm-svn: 175881	2013-02-22 09:29:15 +00:00
Tobias Grosser	c92c8f06ec	[isl-codegen]: Fix off by one in getNumberOfIterations We need to remove one dimension. Any is correct as long as it exists. We have choosen for whatever reason the dimension #dims - 2. This is incorrect if there is just one dimension. For CLooG this case did never happen. For isl however, the case can happen and causes undefined behavior including crashes. We choose now always the last dimension #dims - 1. We could have choosen dimension '0' but the last dimension is what we remove conceptionally in the algorithm, so it seems better to actually program it that way. While at it remove another piece of undefined behavior. llvm-svn: 174894	2013-02-11 17:52:36 +00:00
Sebastian Pop	04c4ce32ae	isl: vector code generation based on ISL ast Original patch by Tobias Grosser, slightly modified by Sebastian Pop. llvm-svn: 170420	2012-12-18 07:46:13 +00:00
Sebastian Pop	e252c85545	isl: detect vector parallelism llvm-svn: 170138	2012-12-13 16:52:41 +00:00
Tobias Grosser	e36abf6d5d	isl: Detect openmp parallelism Based on code written by Riyadh Baghdadi. llvm-svn: 170102	2012-12-13 06:24:06 +00:00
Andy Gibbs	9936b214c0	Integrate polly test-suite into an llvm "make check-all" if built as part of the whole using cmake. llvm-svn: 169487	2012-12-06 07:59:18 +00:00
Sebastian Pop	a267d9b829	adapt cloog codegen testcases to isl llvm-svn: 169161	2012-12-03 21:34:09 +00:00
Sebastian Pop	47987128b6	use -polly-ast instead of -polly-cloog llvm-svn: 169160	2012-12-03 21:33:55 +00:00
Sebastian Pop	b08a52898a	execute cloog specific testcases only with CLOOG_FOUND llvm-svn: 169159	2012-12-03 21:33:40 +00:00
Patrik Hägglund	b476cdfde5	Fix tests with broken datalayout strings. Buildbot failure at r168785. llvm-svn: 168791	2012-11-28 13:30:31 +00:00
Sebastian Pop	ee4baf3eec	do not execute the OpenMP tests when cloog is not found llvm-svn: 168724	2012-11-27 21:15:15 +00:00
Tobias Grosser	3344f733fd	test: LLVM supports now vectors of arbitrary pointers This allows Polly to vectorize more code. Fix the relevant test cases. llvm-svn: 167923	2012-11-14 08:25:52 +00:00
Tobias Grosser	38ea9cd721	Tests: Pipe test files into 'opt' Use 'opt < %s' instead of just 'opt %s' to ensure that no temporary files are created. llvm-svn: 167372	2012-11-04 16:56:20 +00:00
Tobias Grosser	dcebf1e9da	Tests: remove ModuleID lines llvm-svn: 167284	2012-11-02 06:09:20 +00:00
Tobias Grosser	41b20a62c9	Tests: move content of .c files in .ll llvm-svn: 167283	2012-11-02 06:08:39 +00:00
Tobias Grosser	3eb851f370	Remove runtime tests from polly test suite Similar to LLVM we now follow the policy of only having LLVM-IR level tests in the Polly test suite. Testing for miscompilation of larger programs should be done with the llvm test suite. llvm-svn: 167255	2012-11-01 21:44:59 +00:00
Tobias Grosser	81a1c75035	Dependences: Add support to calculate memory based dependences Instead of calculating exact value (flow) dependences, it is also possible to calculate memory based dependences. Sometimes memory based dependences are a lot easier to calculate. To evaluate the benefits, we add an option to calculate memory based dependences (use -polly-value-dependences=false). llvm-svn: 167251	2012-11-01 21:28:32 +00:00
Tobias Grosser	ebe8c8cea2	Codegen: Selectively copy in array addresses for OpenMP code The detection of values that need to be copied in to the generated OpenMP subfunction also detects the array base addresses needed in the SCoP. Hence, it is not necessary to unconditionally copy all the base addresses to the generated function. Test cases are modified to reflect this change. Arrays which are global variables do not occur in the struct passed to the subfunction anymore. A test case for base address copy-in is added in copy_in_array.{c,ll}. Committed with slight modifications Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167215	2012-11-01 05:34:55 +00:00
Tobias Grosser	177982c478	CodeGen: Add scop-parameters to the OpenMP context In addition to the arrays and clast variables a SCoP statement may also refer to values defined before the SCoP or to function arguments. Detect these values and add them to the set of values passed to the function generated for OpenMP parallel execution of a clast. Committed with additional test cases and some refactoring. Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167214	2012-11-01 05:34:48 +00:00
Tobias Grosser	a17f666f99	Codegen: Copy and restore the ValueMap and ClastVars explicitly When generating OpenMP or GPGPU code the original ValueMap and ClastVars must be kept. We already recovered the original ClastVars by reverting the changes, but we did not keep the content of the ValueMap. This patch keeps now an explicit copy of both maps and restores them after generating OpenMP or GPGPU code. This is an adapted version of a patch contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167213	2012-11-01 05:34:35 +00:00
Tobias Grosser	6c8e696618	cmake: Use suffix for shared modules instead of the one for shared libraries On Linux there is no difference between shared modules and shared libaries, both are '.so' files. However, on darwin only shared modules are '.so' files. Shared libraries have the '.dynlib' suffix. Fix test cases on darwin by expecting a shared module suffix for Polly instead of a shared library suffix. This fixes PR14135 Reported by: Jack Howarth <howarth@bromo.med.uc.edu> llvm-svn: 166402	2012-10-21 21:08:29 +00:00
Tobias Grosser	28781423b2	isl scheduler: Do not fail when returning an empty band list The bug was within isl. To fix it, we simply update the isl version that is used by Polly. We still have some changes within Polly to be able to write a proper test case. Reported-by: Sameer Sahasrabuddhe <Sameer.Sahasrabuddhe@amd.com> llvm-svn: 166021	2012-10-16 07:29:19 +00:00
Tobias Grosser	c967d8e6e9	isl-codegen: Support '<' and '>' Previously isl always generated '<=' or '>='. However, in many cases '<' or '>' leads to simpler code. This commit updates isl and adds the relevant code generation support to Polly. llvm-svn: 166020	2012-10-16 07:29:13 +00:00
Tobias Grosser	6a2da6b9c8	Add test cases for multi-dimensional variable lengths arrays At the moment we can handle such arrays only by conservatively assuming that each access to such an array may touch any element in the array. It would be great if we could improve Polly/LLVM at some point, such that we can recover the multi-dimensionality of the accesses. llvm-svn: 163619	2012-09-11 14:03:19 +00:00
Tobias Grosser	ed29566c4e	ScopInfo: Align parameters when using -polly-allow-nonaffine This ensures that the isl sets/maps we operate on have the same parameter dimensions. Operations on objects with different parameter dimensions are not allow and trigger assertions. llvm-svn: 163618	2012-09-11 13:50:21 +00:00
Tobias Grosser	6217e18a7d	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with the cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. The patch was committed with smaller changes to the build system: There is now a flag to enable gpu code generation explictly. This was required as we need the llvm.codegen() patch applied on the llvm sources, to compile this feature correctly. Also, enabling gpu code generation does not require cuda. This requirement was removed to allow 'make polly-test' runs, even without an installed cuda runtime. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 161239	2012-08-03 12:50:07 +00:00
Hongbin Zheng	7aee737062	IndependentBLocks: Do not visit the same instruction twice when moving the operand tree. This patch fix Bug 13491, and the original "FIXME" in IndependentBlocks.cpp. Patched by Kevin Fan<kevin.fan@gmail.com>. llvm-svn: 161105	2012-08-01 08:46:11 +00:00
Tobias Grosser	6cc23b07e6	Revert "Add preliminary implementation for GPGPU code generation." I did not take into account, that this patch fails to compile without the llvm.codegen patch applied. This breaks buildbots. I revert this until we found a solution to commit this without buildbots complaining. This reverts commit cb43ab80e94434e780a66be3b9a6ad466822fe33. llvm-svn: 160165	2012-07-13 07:44:56 +00:00
Tobias Grosser	b299d28181	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 160164	2012-07-13 07:21:00 +00:00
Tobias Grosser	96682025c7	Add some tests for the independent blocks pass. llvm-svn: 158306	2012-06-11 10:25:12 +00:00
Tobias Grosser	18daacad61	ScopInfo: Add parameter bounds to context Derive the maximal and minimal values of a parameter from the type it has. Add this information to the scop context. This information is needed, to derive optimal types during code generation. llvm-svn: 157245	2012-05-22 10:47:27 +00:00
Hongbin Zheng	6417255283	Regression tests: Adapt the vectorize option change. llvm-svn: 156255	2012-05-06 10:22:43 +00:00
Tobias Grosser	e71c6ab54c	SCEV based code generation This is an incomplete implementation of the SCEV based code generation. When finished it will remove the need for -indvars -enable-iv-rewrite. For the moment it is still disabled. Even though it passes 'make polly-test', there are still loose ends especially in respect of OpenMP code generation. llvm-svn: 155717	2012-04-27 16:36:14 +00:00
Tobias Grosser	7c3061acdd	Make vector tests less sensible to codegen changes llvm-svn: 155438	2012-04-24 11:08:07 +00:00
Tobias Grosser	216ea58b21	ScheduleOpt: Fix crash with -enable-polly-vector llvm-svn: 154808	2012-04-16 11:06:06 +00:00
Tobias Grosser	4cb5461dae	CodeGen: Generate scalar code if vector instructions cannot be generated This fixes two crashes that appeared in case of: - A load of a non vectorizable type (e.g. float**) - An instruction that is not vectorizable (e.g. call) llvm-svn: 154586	2012-04-12 10:46:55 +00:00
Hongbin Zheng	e2107f0999	Revert "Make the "all" target depend on polly-test, so that users can run regression" This reverts commit 97bd8d50881000c11b65b0e033996ec5f57bcd15. llvm-svn: 154487	2012-04-11 07:43:24 +00:00
Tobias Grosser	84ecc47e1c	CodeGen: Allow Polly to do 'grouped unrolling', but no vector generation. Grouped unrolling means that we unroll a loop such that the different instances of a certain statement are scheduled right after each other, but we do not generate any vector code. The idea here is that we can schedule the bb vectorizer right afterwards and use it heuristics to decide when vectorization should be performed. llvm-svn: 154251	2012-04-07 06:16:08 +00:00
Tobias Grosser	0905a23806	CodeGen: Recreate old ivs with the original type To avoid overflows we still use a larger type (i64) while calculating the value of the old ivs. However, we truncate the result to the type of the old iv when providing it to the new code. A corresponding test case is added to the polly test suite. Also, a failing test case is fixed. This fixes PR12311. Contributed by: Tsingray Liu <tsingrayliu@gmail.com> llvm-svn: 153952	2012-04-03 12:24:32 +00:00
Tobias Grosser	de49ef76f6	Remove unneeded alias analysis llvm-svn: 153839	2012-04-01 16:49:48 +00:00
Tobias Grosser	89339067b0	CodeGen: Allow function parameters to be rewritten in getNewValue() When deriving new values for the statements of a SCoP, we assumed that parameter values are constant within the SCoP and consquently do not need to be rewritten. For OpenMP code generation this assumption is wrong, as such values are not available in the OpenMP subfunction and consequently also may need to be rewritten. Committed with some changes. Contributed-By: Johannes Doerfert <s9jodoer@stud.uni-saarland.de> llvm-svn: 153838	2012-04-01 16:49:45 +00:00
Hongbin Zheng	b5bf8cfa17	Make the "all" target depend on polly-test, so that users can run regression tests by simply typing "make -C tools/polly/test", like llvm's regression tests. llvm-svn: 153739	2012-03-30 09:27:16 +00:00
Hongbin Zheng	2700adebfa	Autoconf build: Try to update LLVMPolly.so before running regression tests llvm-svn: 153738	2012-03-30 09:27:07 +00:00
Tobias Grosser	900893d2d8	CodeGeneration: Proberly build the dominator tree llvm-svn: 153645	2012-03-29 13:10:26 +00:00
Hongbin Zheng	e53bdfe633	Use python script to silence the expected testcase fails on 32bit platform. llvm-svn: 153644	2012-03-29 13:10:10 +00:00
Hongbin Zheng	689e84fcec	Regession testing: Substitut POLLY_LIB_DIR, which is introduced by commit r152924, by $(LibDir). Because we assume polly built by autoconf is always in llvm tree. llvm-svn: 153642	2012-03-29 12:36:52 +00:00
Hongbin Zheng	0578aaf77c	Don't fail the lli testcases on 32bit platform. llvm-svn: 153440	2012-03-26 15:16:48 +00:00
Tobias Grosser	cf88d84d79	test: Remove memaccess prefix The prefix is not needed, as all test cases are already in a separate folder. llvm-svn: 153320	2012-03-23 08:24:04 +00:00
Tobias Grosser	d6adda3071	CodeGen: Full support for isl_pw expressions in modified access functions. This also adds support for modifiable write accesses (until now only read accesses where supported). We currently do not derive an exact type for the expression, but assume that i64 is good enough. This will be improved in future patches. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 153319	2012-03-23 08:21:22 +00:00
Tobias Grosser	3ec2abc5fb	Don't allow pointer types in affine expressions We currently do not support pointer types in affine expressions. Hence, we disallow in the SCoP detection. Later we may decide to add support for them. This fixes PR12277 Reported-By: Sebastian Pop <sebpop@gmail.com> llvm-svn: 152928	2012-03-16 16:36:47 +00:00
Hongbin Zheng	c7584ff270	Off-tree build support: Set the path of Polly's library correctly. llvm-svn: 152924	2012-03-16 14:34:20 +00:00
Hongbin Zheng	33254d1edf	Revert "Minor change: Use config.polly_obj_root to locate Polly's library," This reverts commit 7dd9b6327b54b08ece32a4607d5ac093b518b79a. llvm-svn: 152923	2012-03-16 13:49:55 +00:00
Hongbin Zheng	95c84eab5c	Minor change: Use config.polly_obj_root to locate Polly's library, so lit find Polly's library in off-tree build. llvm-svn: 152920	2012-03-16 13:24:34 +00:00
Tobias Grosser	8a5070213a	ScheduleOptimizer: Do not get dependences, if we do not calculate a schedule This solves the 'isl_ctx freed, but some objects still reference it' problem reported in PR12276. llvm-svn: 152917	2012-03-16 11:51:41 +00:00
Tobias Grosser	371badaa47	SCEVValidator: Ensure that parameters are recorded correctly This also fixes UMax where we did not correctly keep track of the parameters. Fixes PR12275. Reported-By: Sebastian Pop <sebpop@gmail.com> llvm-svn: 152913	2012-03-16 10:16:28 +00:00
Hongbin Zheng	c0f53b1c00	Polly-test: Add a cmake option "POLLY_TEST_DISABLE_BAR". We can enable this option in the configure step of Polly's builder to get more readable output from the stdio log. llvm-svn: 152910	2012-03-16 09:04:09 +00:00
Tobias Grosser	3cbe5cfff3	Remove FinalRead The FinalRead statement represented a virtual read that is executed after the SCoP. It was used when we verified the correctness of a schedule by checking if it yields the same FLOW dependences as the original code. This is only works, if we have a final read that reads all memory at the end of the SCoP. We now switched to just checking if a schedule does not introduce negative dependences and also consider WAW WAR dependences. This restricts the schedules a little bit more, but we do not have any optimizer that would calculate a more complex schedule. Hence, for now final reads are obsolete. llvm-svn: 152319	2012-03-08 15:21:51 +00:00
Tobias Grosser	df3823750e	CodeGen: Pass the scalar maps properly llvm-svn: 151916	2012-03-02 15:20:35 +00:00
Tobias Grosser	f6beec674e	CodeGen: Simplify the generation of a splat llvm-svn: 151912	2012-03-02 15:20:21 +00:00
Tobias Grosser	b61e6318ac	CodeGen: Name stmt bbs 'polly.stmt.' + OriginalName llvm-svn: 150575	2012-02-15 09:58:46 +00:00
Tobias Grosser	04eadc476e	tests: Replace . by %s llvm-svn: 150377	2012-02-13 12:29:43 +00:00
Tobias Grosser	8518bbe39f	CodeGen: Always name merge block llvm-svn: 150337	2012-02-12 12:09:46 +00:00
Tobias Grosser	0dbbdd7637	Codegen: Give split and merge basic blocks better names llvm-svn: 150335	2012-02-12 12:09:37 +00:00
Tobias Grosser	a187964bac	Support non-affine access functions in Polly. In case we can not analyze an access function, we do not discard the SCoP, but assume conservatively that all memory accesses that can be derived from our base pointer may be accessed. Patch provided by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 146972	2011-12-20 10:43:14 +00:00
Tobias Grosser	b6033396fd	ScheduleOptimizer: Do not tile bands with just one dimension llvm-svn: 146149	2011-12-08 13:02:58 +00:00
Tobias Grosser	595ec0d0e3	ClooG: Make sure ambigous schedules do not introduce complicated code Cloog continued to split the domains even after the scattering. This lead to complicated code. llvm-svn: 146033	2011-12-07 11:03:48 +00:00
Tobias Grosser	39913e3648	test: Switch to new atomic instructions This fixes the test with recent versions of LLVM that do not support the old atomic instructions any more. llvm-svn: 145402	2011-11-29 14:51:05 +00:00
Tobias Grosser	1e06003227	test: Add more dependences to cmake build llvm-svn: 145400	2011-11-29 14:50:47 +00:00
Tobias Grosser	f281702686	test: Do not hardcode '.so' as library suffix Contributed by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 145076	2011-11-22 19:40:38 +00:00
Tobias Grosser	4dca439cfc	Register Passes: Use -polly-optimizer=(isl|pocc) to switch optimizers This replaces the old option -polly-use-pocc. Also call the passes uniformly -polly-opt-pocc and -polly-opt-isl. llvm-svn: 145071	2011-11-22 19:40:19 +00:00
Tobias Grosser	8f99c167cd	ScopInfo: Use names of simple parameters to name the isl parameter dimensions. Parameters can be complex SCEV expressions, but they can also be single scalar values. If a parameters is such a simple scalar value and the value is named, use this name to name the isl parameter dimensions. llvm-svn: 144641	2011-11-15 11:38:55 +00:00
Tobias Grosser	f50fc50c80	Remove unused parameters from TempScop llvm-svn: 144232	2011-11-09 22:35:15 +00:00
Tobias Grosser	6e9f25a5d5	Remove AffineSCEVIterator We do not use it anymore. It was replaced by SCEVVisitors like the SCEVValidator. llvm-svn: 144229	2011-11-09 22:35:00 +00:00
Tobias Grosser	fb47d66a06	Remove unused code from SCEVAffFunc constructor llvm-svn: 144224	2011-11-09 22:34:39 +00:00
Tobias Grosser	5683df4a23	Remove more of SCEVAffineFunc llvm-svn: 144223	2011-11-09 22:34:34 +00:00
Tobias Grosser	db87142b26	TempScop: Remove more of the buildAffineFunction llvm-svn: 144221	2011-11-09 22:34:24 +00:00
Tobias Grosser	e6efa37e76	TempScopInfo: Remove unneeded construction of SCEVAffFunc llvm-svn: 144220	2011-11-09 22:34:18 +00:00
Tobias Grosser	60b54f19e6	Detect Parameters directly on the SCEV. Instead of using TempScop to find parameters, we detect them directly on the SCEV. This allows us to remove the TempScop parameter detection in a subsequent commit. This fixes a bug reported by Marcello Maggioni <hayarms@gmail.com> llvm-svn: 144087	2011-11-08 15:41:28 +00:00
Tobias Grosser	65fa78e975	TempScopInfo: Print the original SCEV instead of using SCEVAffFunc This is reducing the impact of SCEVAffFunc llvm-svn: 143574	2011-11-02 21:37:06 +00:00
Tobias Grosser	67707b7131	Enable prevectorization with -enable-polly-vector. This removes the separate prevector options for the Pluto and isl scheduler. llvm-svn: 142774	2011-10-23 20:59:40 +00:00
Tobias Grosser	22636bf498	Rename -enable-schedule-prevector to -polly-prevector llvm-svn: 142771	2011-10-23 20:59:29 +00:00
Tobias Grosser	2ff8723d5d	ScopDetection: Allow to limit the scop detection to a single function -polly-detect-only=<functionname> allows to limit the scop detection to a single function. llvm-svn: 142750	2011-10-23 11:17:06 +00:00
Tobias Grosser	0e27e24751	ScopInfo: Use separate function to build context llvm-svn: 141253	2011-10-06 00:03:48 +00:00
Tobias Grosser	7a5246a371	Test: Convert to new exception handling llvm-svn: 141069	2011-10-04 07:53:21 +00:00
Tobias Grosser	c92151516f	CodeGen: Support for Cast Operations in vector code generation llvm-svn: 139097	2011-09-04 11:45:52 +00:00
Tobias Grosser	7551c3000a	CodeGen: Better separate scalar and vector code generation. llvm-svn: 139095	2011-09-04 11:45:41 +00:00
Tobias Grosser	8ae9aca5cc	CodeGen: Improve naming of copied basic blocks It may happen that we generate the code of a basic block from the original scop is code generated several times. The new naming scheme reduces confusing that earlier appeared as the version numbers of the new basic blocks could have been interpreted as part of the name of the original basic block. llvm-svn: 139092	2011-09-04 11:45:22 +00:00
Tobias Grosser	c532f12965	Fix crashes due to unaligned parameters Due to the recent introduction of isl_id, parameters need now always to be aligned. This was not yet taken care of in the code path of vectorization and dependence analysis. llvm-svn: 138555	2011-08-25 08:40:59 +00:00
Tobias Grosser	604c981f40	Temporarily remove reduction support and interchange pass I am planning to eliminate the TempScopInfo pass. To simplify this I remove some features that may later be added to the ScopInfo pass. The interchange pass is currently strongly tested and furthermore ment to be replaced by the general scheduling optimizer. Reductions itself can later be added easily. llvm-svn: 138219	2011-08-21 14:57:58 +00:00
Raghesh Aloor	129e867865	Memaccess: Code generation for constant access function change Support for generating code for an access function change which is a constant is added. llvm-svn: 137603	2011-08-15 02:33:39 +00:00
Raghesh Aloor	62b13120ee	Memaccess: Codegeneration for a simple access function change Code is generated for a simple access function change imported from JSCOP file. An access of A[i] is changed to A[0]. The code for A[0] is generated directly without refering to isl function calls. llvm-svn: 136789	2011-08-03 17:02:50 +00:00
Raghesh Aloor	7a04f4f9ba	Memaccess: Display Changed Access Relation The changed access relations imported from JSCOP file is shown as output of -analyze pass. llvm-svn: 136774	2011-08-03 13:47:59 +00:00
Tobias Grosser	bd2b2c7117	Add a vect target to the polly testsuite Contributed by: Sebastian Pop <sebpop@gmail.com> llvm-svn: 136685	2011-08-02 07:22:05 +00:00
Raghesh Aloor	3cb6628d7c	MemAccess: Reading Change in Access Function This patch reads the change in access functions from imported JSCOP file. A test case is also added. llvm-svn: 134991	2011-07-12 17:14:03 +00:00
Tobias Grosser	851b96e7f0	Adapt to LLVM type system changes Remove constness of Types and do not name the structures generated in the OpenMP code. llvm-svn: 134980	2011-07-12 12:42:54 +00:00
Tobias Grosser	928b2d16a6	test: Do not pipe the .ll file into opt The construct '< %s' complicates debugging with gdb --args as the content of %s is interpreted as gdb input. llvm-svn: 134432	2011-07-05 19:13:21 +00:00
Tobias Grosser	3770157502	test: Remove redundant function definition The latest version of LLVM fails, if a function is defined twice in an LLVM bitcode file. llvm-svn: 134400	2011-07-04 23:18:17 +00:00
Tobias Grosser	8c4cfc327b	CodeGeneration: Do not delete the old version of the Scop. Instead of deleting the old code, keep it on the side in an if-branch. It will either be deleted by the dead code elimination or we can use it as fallback. llvm-svn: 131352	2011-05-14 19:01:49 +00:00
Hongbin Zheng	94c5df16e2	ScopDetection: Remember the functions generated by backend in a pointer set, so we do not re-generate code for these functions. llvm-svn: 130975	2011-05-06 02:38:20 +00:00
Hongbin Zheng	e1bd40cfbd	Partial support test polly for out of tree build. llvm-svn: 130482	2011-04-29 07:34:54 +00:00
Tobias Grosser	758053788b	Add initial version of Polly This version is equivalent to commit ba26ebece8f5be84e9bd6315611d412af797147e in the old git repository. llvm-svn: 130476	2011-04-29 06:27:02 +00:00

1469 Commits