llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	219b20e1a3	[Fix] Non i1 typed select condition for weird pw aff functions. In case the piecewise affine function used to create an isl_ast_expr had empty cases (e.g., with contradicting constraints on the parameters), it was possible that the condition of the isl_ast_expr select was not a comparison but a constant (thus of type i64). This patch does two things: 1) Handle the case that the condition of a select is not of type i1, like C does. 2) Try to simplify the piecewise affine functions for the min/max access when we generate runtime alias checks. That step can often remove empty or redundant cases as well as redundant constraints. This fixes bug: http://llvm.org/PR21167 Differential Revision: http://reviews.llvm.org/D5627 llvm-svn: 219208	2014-10-07 14:37:59 +00:00
Johannes Doerfert	f1ee2622be	[Fix] Dead statements should not confuse the RTC generation This fixes http://llvm.org/bugs/show_bug.cgi?id=21166 . Differential Revision: http://reviews.llvm.org/D5623 llvm-svn: 219131	2014-10-06 17:43:00 +00:00
Johannes Doerfert	2ef33e9f16	Allow multidimensional accesses in the IslExprBuilder. This resolved the issues with delinearized accesses that might alias, thus delinearization doesn't deactivate runtime alias checks anymore. Differential Revision: http://reviews.llvm.org/D5614 llvm-svn: 219078	2014-10-05 11:33:59 +00:00
Johannes Doerfert	1a28a8938e	Introduce the ScopArrayInfo class. This class allows to store information about the arrays in the SCoP. For each base pointer in the SCoP one object is created storing the type and dimension sizes of the array. The objects can be obtained via the SCoP, a MemoryAccess or the isl_id associated with the output dimension of a MemoryAccess (the description of what is accessed). So far we use the information in the IslExprBuilder to create the right base type before indexing into the base array. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21113 (both test cases are included). On top of that we can now build runtime alias checks for delinearized arrays as the dimension sizes are also part of the ScopArrayInfo objects. Differential Revision: http://reviews.llvm.org/D5613 llvm-svn: 219077	2014-10-05 11:32:18 +00:00
Duncan P. N. Exon Smith	52fd68980c	DI: LLVM schema change: fold constants into string Update debug info testcases for the LLVM metadata schema change in r219010 to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219019	2014-10-03 21:08:48 +00:00
Johannes Doerfert	a441783544	[Fix] Accidentally changed the type of a libgomp argument in r219003. Only subsequent patches introduced tests for the signature in the generated IR, thus the tests were wrong too and are adjusted now. llvm-svn: 219017	2014-10-03 20:40:24 +00:00
Johannes Doerfert	1356ac75d1	Put the parallel context alloca into the function entry block. We use lifetime markers to limit the actual life range (similar to clang). Differential Revision: http://reviews.llvm.org/D5582 llvm-svn: 219005	2014-10-03 19:12:05 +00:00
Johannes Doerfert	990cd4c2e2	Add option to limit the maximal number of parallel threads. Differential Revision: http://reviews.llvm.org/D5581 llvm-svn: 219004	2014-10-03 19:11:10 +00:00
Johannes Doerfert	12b355a2ce	[Refactor] Generalize parallel code generation + Generalized function names and comments + Removed OpenMP (omp) from the names and comments + Use common names (non OpenMP specific) for runtime library call creation methods + Commented the parallel code generator and all its member functions + Refactored some values and methods Differential Revision: http://reviews.llvm.org/D4990 llvm-svn: 219003	2014-10-03 19:10:13 +00:00
Johannes Doerfert	87901453d9	Align copied load/store instructions as the original. This also forbids the json importer to access other memory locations than the original instruction as we reuse the alignment of the original load/store. Differential Revision: http://reviews.llvm.org/D5560 llvm-svn: 218883	2014-10-02 16:22:19 +00:00
Johannes Doerfert	ecdf263c07	Allow to annotate alias scopes in the new SCoP. The command line flag -polly-annotate-alias-scopes controls whether or not Polly annotates alias scopes in the new SCoP (default ON). This can improve later optimizations as the new SCoP is basically an alias free environment for them. llvm-svn: 218877	2014-10-02 15:31:24 +00:00
Adrian Prantl	e6579cd9a6	Update testcase to new intrinsic format llvm-svn: 218806	2014-10-01 20:40:12 +00:00
Johannes Doerfert	c7b719fc03	Annotate LLVM-IR for all parallel loops This change allows to annotate all parallel loops with loop id metadata. Furthermore, it will annotate memory instructions with llvm.mem.parallel_loop_access metadata for all surrounding parallel loops. This is especially useful if an external paralleliser is used. This also removes the PollyLoopInfo class and comments the LoopAnnotator. A test case for multiple parallel loops is attached. llvm-svn: 218793	2014-10-01 20:10:44 +00:00
Johannes Doerfert	eeab05a084	[RTC] Use the domain to split alias groups. We use a parametric abstraction of the domain to split alias groups if accesses cannot be executed under the same parameter evaluation. The two test cases check that we can remove alias groups if the pointers which might alias are never accessed under the same parameter evaluation and that the minimal/maximal accesses are not global but with regards to the parameter evaluation. Differential Revision: http://reviews.llvm.org/D5436 llvm-svn: 218758	2014-10-01 12:42:37 +00:00
Johannes Doerfert	13771738d3	[RTC] Split alias groups according to read only base addresses If there are multiple read only base addresses in an alias group we can split it into multiple alias groups each with only one read only access. This way we might reduce the number of comparisons significantly as it grows linear in the number of alias groups but exponential in their size. Differential Revision: http://reviews.llvm.org/D5435 llvm-svn: 218757	2014-10-01 12:40:46 +00:00
Tobias Grosser	f8a678d2fd	Build domtree of new loops correctly This fixes a bug introduced in r217525. llvm-svn: 218581	2014-09-28 22:40:36 +00:00
Johannes Doerfert	9143d67aba	[RTC] Bail if too many parameters are involved in a RTC access. If too many parameters are involved in accesses used to create RTCs we might end up with enormous compile times and RTC expressions. The reason is that the lexmin/lexmax is dependent on all these parameters and isl might need to create a case for every "ordering" of them (e.g., p0 <= p1 <= p2, p1 <= p0 <= p2, ...). The exact number of parameters allowed in accesses is defined by the command line option -polly-rtc-max-parameters=XXX and set by default to 8. Differential Revision: http://reviews.llvm.org/D5500 llvm-svn: 218566	2014-09-27 11:02:39 +00:00
Tobias Grosser	1eedb67fa6	We do not support alias checks for base pointers defined inside the SCoP The run-time alias check places code that involves the base pointer at the beginning of the SCoP. This breaks if the base pointer is defined inside the SCoP. Hence, we can only create a run-time alias check if we are sure the base pointer is not an instruction defined inside the scop. If it is we refuse to handle the SCoP. This commit should unbreak most of our current LNT failures. Differential Revision: http://reviews.llvm.org/D5483 llvm-svn: 218412	2014-09-24 21:04:29 +00:00
Johannes Doerfert	77bd5ae3d9	[Fix] Allow pointer types as access elements and compare them correctly This fixes two problems which usually occur together: 1) The elements of an isl AST access expression could be pointers, not only integers, floats and vectors thereof. 2) The runtime alias checks need to compare pointers but if they are of a different type we need to cast them into a "max" type similar to the non pointer case. llvm-svn: 218113	2014-09-19 08:49:02 +00:00
Tobias Grosser	3ee7cdab53	Report possible aliasing deterministically This commit drops a call to std::sort, which sorted the base pointers that possibly alias according to the address at which their corresponding llvm::Value was allocated. There does not seem to be any good reason why those pointers should be (re)sorted and this only makes the output non-deterministic. llvm-svn: 218052	2014-09-18 14:45:43 +00:00
Johannes Doerfert	b9fb5a2cc6	[RTC] Runtime Alias Checks for the ISL backend (missing tests) Test files missing in r218046. llvm-svn: 218047	2014-09-18 11:20:36 +00:00
Johannes Doerfert	b164c795b7	[RTC] Runtime Alias Checks for the ISL backend This change will build all alias groups (minimal/maximal accesses to possible aliasing base pointers) we have to check before we can assume an alias free environment. It will also use these to create Runtime Alias Checks (RTC) in the ISL code generation backend, thus allow us to optimize SCoPs despite possibly aliasing pointers when this backend is used. This feature will be enabled for the isl code generator, e.g., --polly-code-generator=isl, but disabled for: - The cloog code generator (still the default). - The case delinearization is enabled. - The case non-affine accesses are allowed. llvm-svn: 218046	2014-09-18 11:17:17 +00:00
Johannes Doerfert	b7e4083599	Updated to isl 2c19ecd444095d6f560349018f68993bc0e03691 Changed test cases and fixed warnings. llvm-svn: 218043	2014-09-18 11:13:35 +00:00
Johannes Doerfert	0fe35dd088	[Fix] Rewire the Region after a unconditional entry edge is created We use SplitEdge to split a conditional entry edge of the SCoP region. However, SplitEdge can cause two different situations (depending on whether or not the edge is critical). This patch tests which one is present and deals with the former unhandled one. It also refactors and unifies the case we have to change the basic blocks of the SCoP to new ones (see replaceScopAndRegionEntry). llvm-svn: 217802	2014-09-15 18:34:45 +00:00
Johannes Doerfert	377a620f98	Compute and print the minimal loop carried dependency distance During the IslAst parallelism check also compute the minimal dependency distance and store it in the IslAst for node. Reviewer: sebpop Differential Revision: http://reviews.llvm.org/D4987 llvm-svn: 217729	2014-09-13 17:34:11 +00:00
Tobias Grosser	230acc4445	Delinearize _all_ accesses to a multi-dimensional array Even though we previously correctly detected the multi-dimensional access pattern for accesses with a certain base address, we only delinearized non-affine accesses to this address. Affine accesses have not been touched and remained as single dimensional accesses. The result was an inconsistent description of accesses to the same array, with some being one dimensional and some being multi-dimensional. This patch ensures that all accesses are delinearized with the same dimensionality as soon as a single one of them has been detected as non-affine. While writing this patch, it became evident that the options -polly-allow-nonaffine and -polly-detect-keep-going have not been properly supported in case delinearization has been turned on. This patch adds relevant test coverage and addresses these issues as well. We also added some more documentation to the functions that are modified in this patch. This fixes llvm.org/PR20123 Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217728	2014-09-13 14:47:55 +00:00
Tobias Grosser	bcd4efffa7	Check that the elements of an array have the same size At the moment we assume that only elements of identical size are stored/loaded to a certain base pointer. This patch adds logic to the scop detection to verify this. Differential Revision: http://reviews.llvm.org/D5329 llvm-svn: 217727	2014-09-13 14:47:40 +00:00
Tobias Grosser	3762bd34e7	Improve test coverage for non-affine access functions We now verify that such functions are correctly detected even in combination with delinearization. This change is added to ensure we have good test coverage for the subsequent delinearization fix. We also remove unnecessary instructions from the test case. llvm-svn: 217664	2014-09-12 09:07:56 +00:00
Tobias Grosser	0ef617dda0	Remove executable bit on test files Some test files had been marked executable by accident. llvm-svn: 217663	2014-09-12 09:07:50 +00:00
Johannes Doerfert	dd5c144246	Allow to generate a loop without the GuardBB This allows us to omit the GuardBB in front of created loops if we can show the loop trip count is at least one. It also simplifies the dominance relation inside the new created region. A GuardBB (even with a constant branch condition) might trigger false dominance errors during function verification. Differential Revision: http://reviews.llvm.org/D5297 llvm-svn: 217525	2014-09-10 17:33:32 +00:00
Johannes Doerfert	3826224428	[Refactor] Cleanup isl code generation Summary: + Refactor the runtime check (RTC) build function + Added helper function to create a PollyIRBuilder + Change the simplify region function to create not only unique entry and exit edges but also enforce that the entry edge is unconditional + Cleaned the IslCodeGeneration runOnScop function: - less post-creation changes of the created IR + Adjusted and added test cases Reviewers: grosser, sebpop, simbuerg, dpeixott Subscribers: llvm-commits, #polly Differential Revision: http://reviews.llvm.org/D5076 llvm-svn: 217508	2014-09-10 14:50:23 +00:00
David Peixotto	9690f3b596	Add -e to test generation script The -e flag exits the script with a non-zero code if any subcommand fails. This flag allows us to notice as early as possible if the test was not properly regenerated using a command like: $ create_ll.sh t.c && opt < t.ll -polly ... The above pattern is useful when iteratively developing a test case to guard against un-noticed syntax errors. Differential Revision: http://reviews.llvm.org/D5276 llvm-svn: 217463	2014-09-09 22:14:38 +00:00
Johannes Doerfert	8e95dc657e	[Fix] OpenMP parallel loop detection for the isl backend There was a bug in the IslAst which caused that no more outermost parallel loops were detected/checked after a parallel outermost loop of depth 1. + Test case attached llvm-svn: 217452	2014-09-09 17:03:54 +00:00
Tobias Grosser	e7e33ba13a	Always pipe in test files In Polly we used to have a mix of test cases, some that used 'opt %s' and others that used 'opt < %s'. We now change all to use 'opt < %s'. Piping in test files is preferable as it prevents temporary files from being written to disk. This brings us in line with what is usual in LLVM. llvm-svn: 216816	2014-08-30 09:15:04 +00:00
Tobias Grosser	2faa569c0a	Replace %defaultOpts with explicit pass names This replaces the use of %defaultOpts = '-basicaa -polly-prepare' with the minimal set of passes necessary for a test to succeed. Of the test cases that previously used %defaultOpts 76 test cases require none of these passes, 42 need -basicaa and only 2 need -polly-prepare. Our change makes this requirement explicit. In Polly many test cases have been using a macro '%defaultOpts' which runs a couple of preparing passes before the actual Polly test case. This macro was introduced very early in the development of Polly and originally contained a large set of canonicalization passes. However, as the need for additional canonicalization passes makes test cases harder to understand and also more fragile in terms of changes in such passes, we have long aimed to include only the minimal set of passes necessary. This patch removes the last leftovers of %defaultOpts and brings our test cases more in line with what is usual in LLVM itself. llvm-svn: 216815	2014-08-30 09:13:28 +00:00
Johannes Doerfert	9e7b17b0d4	Added arcanist linters and cleaned errors and warnings Arcanist (arc) will now always run linters before uploading any new commit to Phabricator. All errors/warnings (or their absence) will be shown in the web interface together with an explanation by the committer (arcanist will ask the committer if the build was not clean). The linters include: - clang-format - spelling check - permissions check (aka. chmod) - filename check - merge conflict marker check Note that their scope is sometimes limited (see .arclint for details). This commit also fixes all errors and warnings these linters reported, namely: - spelling mistakes and typos - executable permissions for various text files Differential Revision: http://reviews.llvm.org/D4916 llvm-svn: 215871	2014-08-18 00:40:13 +00:00
Andreas Simbuerger	6bf77979e0	Diagnostic: Provide end-user message for non-affine loop bound errors llvm-svn: 215832	2014-08-17 10:09:15 +00:00
Andreas Simbuerger	d46b935267	Diagnostic: Provide end-user message for non-affine access function errors llvm-svn: 215831	2014-08-17 10:09:11 +00:00
Andreas Simbuerger	f29f625748	Diagnostic: Provide end-user message for aliasing errors This will spill out information about LLVM-internals. However, in cases where the name of the Value matches the name of the array in the source, we provide more useful information. In cases where we spill internals, the information still might help the user to pin down the correct arrays. The problem we face here is: The error is pinned to the debug location of one of the offending values out of the alias set instead of all of them. The more information we give the user about the set of aliasing pointers the better. llvm-svn: 215830	2014-08-17 10:09:07 +00:00
Tobias Grosser	2873594709	Revert "[Refactor] Cleanup runtime code generation" This reverts commit 215466 (and 215528, a trivial formatting fix). The intention of these commits is a good one, but unfortunately they broke our LNT buildbot: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly-codegen-isl Several of the cleanup changes that have been combined in this 'fixup' are trivial and could probably be committed as obvious changes without risking to break the build. The remaining changes are little and it should be easy to figure out what went wrong. llvm-svn: 215817	2014-08-16 09:09:15 +00:00
Tobias Grosser	f4daf34496	Revert "Added support for modulo expressions" This reverts commit 215684. The intention of the commit is great, but unfortunately it seems to be the cause of 14 LNT test suite failures: http://lab.llvm.org:8011/builders/perf-x86_64-penryn-O3-polly/builds/116 To make our buildbots and performance testers green until this issue is solved, we temporarily revert this commit. llvm-svn: 215816	2014-08-16 09:08:55 +00:00
Johannes Doerfert	5130c849aa	Added support for modulo expressions The support is limited to signed modulo access and condition expressions with a constant right hand side, e.g., A[i % 2] or A[i % 9]. Test cases are modified according to this new feature and new test cases are added. Differential Revision: http://reviews.llvm.org/D4843 llvm-svn: 215684	2014-08-15 01:14:11 +00:00
Johannes Doerfert	9744c4af16	[Refactor] Cleanup runtime code generation + Use regexp in two test case. + Refactor the runtime condition build function llvm-svn: 215466	2014-08-12 18:35:54 +00:00
Johannes Doerfert	fab63f7791	[Minor] Change the number of cut lines for new tests This should cut all metadata community clang produces. llvm-svn: 215422	2014-08-12 03:31:23 +00:00
Tobias Grosser	f57d63f906	Do allow negative offsets in the outermost array dimension There is no need for either 1-dimensional or higher dimensional arrays to require positive offsets in the outermost array dimension. We originally introduced this assumption with the support for delinearizing multi-dimensional arrays. llvm-svn: 214665	2014-08-03 21:07:30 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	b5d1c322f2	Update the jscop tests and port them to isl codegen. The updated tests use a different context than the old ones did. Other than that only their path and the code generation we use changed. llvm-svn: 214657	2014-08-03 01:48:49 +00:00
Tobias Grosser	8c112d838c	Mark a GPGPU test case as XFAIL This area of code is currently not very much tested. It will hopefully be superseded by Yabin's GSoC project. llvm-svn: 214633	2014-08-02 13:37:32 +00:00
Johannes Doerfert	b41344a88f	[Fix] Annotate the IslAst with broken reductions (Missing files) + test cases of r214489. llvm-svn: 214491	2014-08-01 08:20:26 +00:00
Johannes Doerfert	32868bf4c3	Change the printing of reduction types We use the C operator representation when applicable. + Update all the test cases accordingly. llvm-svn: 214486	2014-08-01 08:13:25 +00:00
Johannes Doerfert	0eefb0258f	[Refactor] Use nicer print callback function in IslAst llvm-svn: 214447	2014-07-31 21:33:49 +00:00
Rafael Espindola	08dfd8f25f	Update for llvm change. llvm-svn: 214358	2014-07-30 23:17:15 +00:00
Tobias Grosser	924e9e0226	IslAst: Enhance parallelism detection test Add more check lines to ensure we do not accidentally generate nested openmp parallel annotations. llvm-svn: 214200	2014-07-29 19:22:46 +00:00
Johannes Doerfert	af9b1e2d80	[Refactor] Remove containsLoop to find innermost loops Use the fact that if we visit a for node first in pre and next in post order we know we did not visit any children, thus we found an innermost loop. + Test case for an innermost loop with a conditional inside llvm-svn: 213870	2014-07-24 15:59:06 +00:00
Johannes Doerfert	f6583176ab	[Refactor] Unify IslAst print methods + Add const annotations to some member functions llvm-svn: 213779	2014-07-23 18:14:43 +00:00
Johannes Doerfert	43e1eadf26	[Refactor] Use attributes to mark function as invalid for polly + Test case annotated with the new attribute + Modified test case to check if subfunctions are annotated llvm-svn: 213093	2014-07-15 21:06:48 +00:00
Johannes Doerfert	457f73eaee	Annotate reduction parallel loops in the IslAst textual output + Introduced dependency type TYPE_TC_RED to represent the transitive closure (& the reverse) of reduction dependences. These are used when we check for reduction parallel loops. + Test cases including loop reversals and modulo schedules which compute reductions in an alternated order. llvm-svn: 213019	2014-07-15 00:00:35 +00:00
Tobias Grosser	c2920ff747	DeadCodeElimination: Fix liveout computation We move back to a simple approach where the liveout is the last must-write statement for a data-location plus all may-write statements. The previous approach did not work out. We would have to consider per-data-access dependences, instead of per-statement dependences to correct it. As this adds complexity and it seems we would not gain anything over the simpler approach that we implement in this commit, I moved us back to the old approach of computing the liveout, but enhanced it to also add may-write accesses. We also fix the test case and explain why we can not perform dead code elimination in this case. llvm-svn: 212925	2014-07-14 08:32:01 +00:00
Tobias Grosser	780ce0f8e3	DeadCodeElim: Compute correct liveout for non-affine accesses Thanks to Johannes Doerfert for narrowing down the bug. Reported-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 212796	2014-07-11 07:12:10 +00:00
Tobias Grosser	5e6813d184	Derive run-time conditions for delinearization As our delinearization works optimistically, we need in some cases run-time checks that verify our optimistic assumptions. A simple example is the following code: void foo(long n, long m, long o, double A[n][m][o]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 150; j++) for (long k = 0; k < 200; k++) A[i][j][k] = 1.0; } After clang linearized the access to A and we delinearized it again to A[i][j][k] we need to ensure that we do not access the delinearized array out of bounds (this information is not available in LLVM-IR). Hence, we need to verify the following constraints at run-time: CHECK: Assumed Context: CHECK: [o, m] -> { : m >= 150 and o >= 200 } llvm-svn: 212198	2014-07-02 17:47:48 +00:00
Johannes Doerfert	f618339a37	Introduce reduction types This change is particularly useful in the code generation as we need to know which binary operator/identity element we need to combine/initialize the privatization locations. + Print the reduction type for each memory access + Adjusted the test cases to comply with the new output format and to test for the right reduction type llvm-svn: 212126	2014-07-01 20:52:51 +00:00
Johannes Doerfert	9890a05287	[FIX] Don't consider reductions which are partially outside the SCoP + Test case llvm-svn: 212080	2014-07-01 00:32:29 +00:00
Johannes Doerfert	1a62c7a34a	[Fix] Deleted renamed test after r211957 llvm-svn: 211964	2014-06-27 21:48:42 +00:00
Johannes Doerfert	e58a012094	Allow multiple reductions per statement Iterate over all store memory accesses and check for valid binary reduction candidate loads by following the operands of the stored value. For each candidate pair we check if they have the same base address and there are no other accesses which may overlap with them. This ensures that no intermediate value can escape into other memory locations or is overwritten at some point. + 17 test cases for reduction detection and reduction dependency modeling llvm-svn: 211957	2014-06-27 20:31:28 +00:00
Andreas Simbuerger	b379edbb3e	Don't expand to invalid Scops with -polly-detect-keep-going Enabling -keep-going in ScopDetection causes expansion to an invalid Scop candidate. Region A <- Valid candidate \| Region B <- Invalid candidate If -keep-going is enabled, ScopDetection would expand A to A+B because the RejectLog is never checked for errors during expansion. With this patch only A becomes a valid Scop. llvm-svn: 211875	2014-06-27 06:21:14 +00:00
Johannes Doerfert	76dd493eff	[Fix] Broken tests after r211796. llvm-svn: 211797	2014-06-26 19:29:11 +00:00
Johannes Doerfert	f8ee915deb	Use wrapped reduction dependences This change will ease the transition to multiple reductions per statement as we can now distinguish the effects of multiple reductions in the same statement. + Wrapped reduction dependences are used to compute privatization dependences + Modified test cases to account for the change llvm-svn: 211795	2014-06-26 18:44:14 +00:00
Johannes Doerfert	ea23b1d561	Hybrid dependency analysis This dependency analysis will keep track of memory accesses if they might be part of a reduction. If not, the dependences are tracked on a statement level. The main reason to do this is to reduce the compile time while being able to distinguish the effects of reduction and non-reduction accesses. + Adjusted two test cases llvm-svn: 211794	2014-06-26 18:38:08 +00:00
Andreas Simbuerger	99d4ab2b84	Add diagnostic remark for ReportVariantBasePtr llvm-svn: 211777	2014-06-26 13:33:35 +00:00
Andreas Simbuerger	5569bf300d	Support the new DiagnosticRemarks Add support for generating optimization remarks after completing the detection of Scops. The goal is to provide end-users with useful hints about opportunities that help to increase the size of the detected Scops in their code. By default the remark is unspecified and the debug location is empty. Future patches have to expand on the messages generated. This patch brings a simple test case for ReportFuncCall to demonstrate the feature. Reports all missed opportunities to increase the size/number of valid Scops: clang <...> -Rpass-missed="polly-detect" <...> opt <...> -pass-remarks-missed="polly-detect" <...> Reports beginning and end of all valid Scops: clang <...> -Rpass="polly-detect" <...> opt <...> -pass-remarks="polly-detect" <...> Differential Revision: http://reviews.llvm.org/D4171 llvm-svn: 211769	2014-06-26 10:06:40 +00:00
Tobias Grosser	50a5e6dac0	test/ScopInfo: Remove %defaultOpts and list passes explicitly Due to bad habit we sometimes used a variable %defaultOpts that listed a set of passes commonly run to prepare for Polly. None of these test cases actually needs special preparation and only two of them need the 'basicaa' to be scheduled. Scheduling the required alias analysis explicitly makes the test cases clearer. llvm-svn: 211671	2014-06-25 06:38:18 +00:00
Tobias Grosser	08031390d5	Clean up XFAILed test cases We had a set of test cases that have been incomplete and XFAILED. This patch completes a couple of the interesting ones and removes the ones which seem redundant or not sufficiently reduced to be useful. llvm-svn: 211670	2014-06-25 06:31:19 +00:00
Johannes Doerfert	5e275bc83a	[Refactor] Create nicer test cases from C/C++ Insert a header into the new testcase containing a sample RUN line, a FIXME and an XFAIL. Then insert the formatted C code and finally the LLVM-IR without attributes, the module ID or the target triple. llvm-svn: 211612	2014-06-24 17:02:53 +00:00
Yabin Hu	cc91169fd7	Remove use of llvm.codegen intrinsic for GPGPU codegen We use the llvm.codegen intrinsic to generate code for embedded LLVM-IR strings. The reason we introduced such an intrinsic is that previous clang/opt tools were NOT linked with various LLVM targets and their AsmParsers and AsmPrinters. Since clang/opt are now linked with all the needed libraries, we no longer need the llvm.codegen intrinsic. llvm-svn: 211573	2014-06-24 08:11:36 +00:00
Johannes Doerfert	f1906138b4	Model statement wise reduction dependences + Collect reduction dependences + Introduced TYPE_RED in Dependences.h which can be used to obtain the reduction dependences + Used TYPE_RED to prevent parallelization while we do not have a privatizing code generation + Relax the dependences for non-parallel code generation + Add privatization dependences to ensure correctness + 12 Test cases to check for reduction and privatization dependences llvm-svn: 211369	2014-06-20 16:37:11 +00:00
Johannes Doerfert	da80386700	Missing reduction detection test cases llvm-svn: 211235	2014-06-18 23:08:14 +00:00
Tobias Grosser	f4fcbf4097	Test delinearization of 2D diagonal matrix llvm-svn: 210538	2014-06-10 14:48:17 +00:00
Tobias Grosser	be7eaddc69	Adjust another test case to not access out of bounds llvm-svn: 210208	2014-06-04 19:41:47 +00:00
Tobias Grosser	5416a0395f	Adjust multidim test cases to not access out-of-bound memory We do this currently only for test cases where we have integer offsets that clearly access array dimensions out-of-bound. -; for (long i = 0; i < n; i++) -; for (long j = 0; j < m; j++) -; for (long k = 0; k < o; k++) +; for (long i = 0; i < n - 3; i++) +; for (long j = 4; j < m; j++) +; for (long k = 0; k < o - 7; k++) ; A[i+3][j-4][k+7] = 1.0; This will be helpful if we later want to simplify the access functions under the assumption that they do not access memory out of bounds. llvm-svn: 210179	2014-06-04 11:47:54 +00:00
Sebastian Pop	422e33f363	record delinearization result and reuse it in polyhedral translation Without this patch, the testcase would fail on the delinearization of the second array: ; void foo(long n, long m, long o, double A[n][m][o]) { ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) { ; A[i+3][j-4][k+7] = 1.0; ; A[i][0][k] = 2.0; ; } ; } ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[3 + i0, -4 + i1, 7 + i2] }; ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; Here is the output of FileCheck on the testcase without this patch: ; CHECK: [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[i0, 0, i2] }; ^ <stdin>:26:2: note: possible intended match here [n, m, o] -> { Stmt_for_body6[i0, i1, i2] -> MemRef_A[o0] }; ^ It is possible to find a good delinearization for A[i][0][k] only in the context of the delinearization of both array accesses. There are two ways to delinearize together all array subscripts touching the same base address: either duplicate the code from scop detection to first gather all array references and then run the delinearization; or as implemented in this patch, use the same delinearization info that we computed during scop detection. llvm-svn: 210117	2014-06-03 18:16:31 +00:00
Johannes Doerfert	c3958b214c	Added option for n-dimensional rectangular tiling + CL-option --polly-tile-sizes=<int,...,int> The i'th value is used as a tile size for dimension i, if there is no i'th value, the value of --polly-default-tile-size is used + CL-option --polly-default-tile-size=int Used if no tile size is given for a dimension i + 3 Simple testcases llvm-svn: 209753	2014-05-28 17:21:02 +00:00
Tobias Grosser	5f860fdfe9	Do not run GPGPU test cases without nvptx target Tag the GPGPU codegen test cases as unsupported if the nvptx target is not included in the current llvm build. Contributed-by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 208779	2014-05-14 14:18:14 +00:00
Sebastian Pop	c5c1055e3f	do not build llc and lli for polly test llvm-svn: 208619	2014-05-12 19:43:20 +00:00
Sebastian Pop	e8863b8f00	correct the delinearization failing case collect terms from affine and non affine memory accesses llvm-svn: 208616	2014-05-12 19:02:02 +00:00
Sebastian Pop	fcf68758b8	unxfail passing testcase llvm-svn: 208233	2014-05-07 18:01:32 +00:00
Tobias Grosser	f56af204b9	Add delinearization testcase for ivs that do not follow the loop order This is a test case that is currently failing, but that should start working with an upcoming version of our delinearization pass. llvm-svn: 207678	2014-04-30 17:49:22 +00:00
Tobias Grosser	841009a2cc	We missed two files in the last commit. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206901	2014-04-22 15:57:30 +00:00
Tobias Grosser	0d11dbabc4	Fixed missing cloog test with automake/configure build setup Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206900	2014-04-22 15:30:43 +00:00
Tobias Grosser	954939842f	Really fix the load case. Commit r206510 falsely advertised to fix the load cases, even though it only fixed the store case. This commit adds the same fix for the load case including the missing test coverage. llvm-svn: 206577	2014-04-18 09:46:35 +00:00
Tobias Grosser	50fd7010d8	Ensure a scalar pointer when issuing a vector load Even though we may want to generate a vector load, the address from which to load still is a scalar. Make sure even if previous address computations may have been vectorized, that the addresses are also available as scalars. This fixes http://llvm.org/PR19469 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206510	2014-04-17 23:13:49 +00:00
Tobias Grosser	75b76729ab	Fix for vector codegen in OpenMP subfunctions Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 206332	2014-04-15 22:30:06 +00:00
Tobias Grosser	364c136d08	Dependences: Do not fail in case a schedule eliminates all dependences The following example shows a non-parallel loop void f(int A[]) { int i; for (i = 0; i < 10; ++i) A[i] = A[i+5]; } which, in case we import a schedule that limits the iteration domain to 0 <= i < 5, becomes parallel. Previously we crashed in such cases, now we just recognize it as parallel. This fixes http://llvm.org/PR19435 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206318	2014-04-15 20:14:57 +00:00
Tobias Grosser	efc3013544	Codegeneration: Free memory correctly when using -polly-vectorizer=polly This fixes PR19421. Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206156	2014-04-14 08:33:24 +00:00
Sebastian Pop	cd3bb59aa2	only delinearize when the access function is not affine llvm-svn: 205971	2014-04-10 16:08:11 +00:00
Tobias Grosser	79baa21242	ScopInfo: Scalar accesses are zero dimensional llvm-svn: 205958	2014-04-10 08:38:02 +00:00
Sebastian Pop	1801668af3	delinearize memory access functions llvm-svn: 205799	2014-04-08 21:20:44 +00:00
Tobias Grosser	64b95123ef	Delete trivial PHI nodes (aka stack slot sharing) During code preparation trivial PHI nodes (mainly introduced by lcssa) are deleted to decrease the number of introduced allocas (==> dependences). However simply replacing them by their only incoming value would cause the independent block pass to introduce new allocas. To prevent this we try to share stack slots during code preparation, hence to reuse an already created alloca 'to demote' the trivial PHI node. This works if we know that the value stored in this alloca will be the incoming value of the trivial PHI at the end of the predecessor block of this trivial PHI. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 205320	2014-04-01 16:01:33 +00:00
Tobias Grosser	5fa36c0ff6	Updated test/create_ll.sh to work with old & new clang versions. We explicitly specify all filenames instead of assuming some naming convention used by clang and opt. Contributed-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 204726	2014-03-25 15:50:44 +00:00
Tobias Grosser	e275e9216b	Return conservative result in case the dependence check timed out For complex examples it may happen that we do not compute dependences. In this case we do not want to crash, but just not detect parallel loops. llvm-svn: 204470	2014-03-21 15:12:09 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Tobias Grosser	8111a0ae7d	autoconf: Fix module loading in tests llvm-svn: 203925	2014-03-14 13:27:26 +00:00
Sebastian Pop	7537be92f4	add -load polly.so only when not LINK_POLLY_INTO_TOOLS llvm-svn: 203888	2014-03-14 04:04:36 +00:00
Rafael Espindola	80f20133d4	Fix polly tests to not include aliases to declarations. llvm-svn: 203721	2014-03-12 21:48:42 +00:00
Sebastian Pop	1b57e8f028	add dependence of check-polly on llc to avoid an error when directly doing ninja check-polly after cmake 'Could not find llc in .../ninja/bin'. llvm-svn: 203696	2014-03-12 18:55:25 +00:00
Tobias Grosser	4ba60fe9eb	ScheduleOptimizer: Fix prevectorization. In case we are at the innermost band, we try to prepare for vectorization. This means, we look for the innermost parallel loop and strip mine this loop to the innermost level using a strip-mine factor corresponding to the number of vector iterations. For whatever reason, the code that implemented this feature was broken. We now added a comment, a test case and obviously also the right code. llvm-svn: 203544	2014-03-11 06:27:36 +00:00
Tobias Grosser	e655754d57	Update CLooG and some test cases This is necessary to avoid test failures in the CLooG test suite due to the recent isl update. We also need to update two polly test cases which rely on a certain order in the textual description that isl chooses for its sets and maps. Changes here are not often, but we should probably switch to a check that verifies such maps are semantically equivalent instead of represented identically. llvm-svn: 203476	2014-03-10 17:31:22 +00:00
Tobias Grosser	37c9b8e0f2	Emit llvm.loop metadata for parallel loops For now we only mark innermost loops for the loop vectorizer. We could later also mark not-innermost loops to enable the introduction of openmp parallelism. llvm-svn: 202854	2014-03-04 14:59:00 +00:00
Tobias Grosser	356faa8f09	Dead code elimination: Schedule another approximative step before actual DCE In 'obsequi' we have a scop in which the current dead code elimination works, but the generated code is way too complex. To avoid this trouble (and to not disable the DCE entirely) we add an additional approximative step before the actual dead code elimination. This should fix one of the two current nightly-test issues. Polly could be improved to handle 'obsequi' by teaching it to introduce only a single parameter for (%1 and zext %1) which halves the number of parameters and allows polly to derive a simpler representation for the set of live iterations. However, this needs some time to investigate. I will commit a test case as soon as we have a reduced one. llvm-svn: 202010	2014-02-24 08:52:20 +00:00
Tobias Grosser	472d3b7037	codegen: Update LoopInfo correctly Add the 'polly.start' basic block to the loop that surrounds the scop we just codegenerate. This fixes PR13441 llvm-svn: 202000	2014-02-24 00:50:49 +00:00
Tobias Grosser	38c36ea18e	Do not fail in case we do not have valid dependences In case we do not have valid dependences, we do not run dead code elimination or the schedule optimizer. This fixes an infinite loop in the dead code elimination (PR12110). llvm-svn: 201982	2014-02-23 15:15:44 +00:00
Tobias Grosser	88640d2b47	Use -polly-codegen-isl in isl-codegen test Reported-by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 201902	2014-02-21 23:08:54 +00:00
Tobias Grosser	817d51dd1b	DCE: Switch to hybrid precise-unprecise analysis Instead of giving a choice between a precise (but possibly very complex) analysis and an approximative analysis we now use a hybrid approach which uses N precise steps followed by one approximating step. The precision of the analysis can be changed by increasing N. With a default of 'N' = 2, we get fully precise results for our current test cases and should not run into performance problems for more complex test cases. We can adjust this value when we got more experience with this dead code elimination. llvm-svn: 201888	2014-02-21 20:51:46 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Tobias Grosser	37eb422f69	Add polyhedral dead code elimination. This pass eliminates loop iterations that compute results that are not used later on. This can help e.g. in D, where the default zero-initialization is often unnecessary if right after new values are assigned to an array. Contributed-by: Peter Conn <conn.peter@gmail.com> llvm-svn: 201817	2014-02-20 21:43:54 +00:00
Tobias Grosser	d6aafa7c2e	Do not track location of scalar dependences in ScopInfo We do not have a use for this information at the moment. If we need this at some point, the "instruction -> access" mapping needs to be enhanced as a single instruction could then possibly perform multiple accesses. This patch allows us to build the polyhedral information for scops with scalar dependences. llvm-svn: 201815	2014-02-20 21:29:09 +00:00
Tobias Grosser	a1689937ba	Check scops a second time before working on them In rare cases the modification of one scop can affect the validity of other scops, as code generation of an earlier scop may make the scalar evolution functions derived for later scops less precise. The example that triggered this patch was a scop that contained an 'or' expression as follows: %add13710 = or i32 %j.19, 1 --> {(1 + (4 * %l)),+,2}<nsw><%for.body81> Scev could only analyze the 'or' as it knew %j.19 is a multiple of 2. This information was not available after the first scop was code generated (or independent-blocks was run on it) and SCEV could not derive a precise SCEV expression any more. This means we could no longer code generate this SCoP. My current understanding is that there is always the risk that an earlier code generation change invalidates later scops. As the example we have seen here is difficult to avoid, we use this occasion to guard us against all such invalidations. This patch "solves" this issue by verifying right before we start working on a detected scop, if this scop is in fact still valid. This adds a certain overhead. However the verification we run is anyways very fast and secondly it is only run on detected scops. So the overhead should not be very large. As a later optimization we could detect scops only on demand, such that we need to run scop-detections always only a single time. This should fix the single last failure in the LLVM test-suite for the new scev-based code generation. llvm-svn: 201593	2014-02-18 18:49:49 +00:00
Tobias Grosser	933edd04af	IndependentBlocks: Do not assert for PHI nodes outside of scops There does not seem to be a reason that we can not support PHI nodes outside of the scop that reference values within the SCoP. Or at least, the attached test case seems to do the right thing. We remove the assert for now. llvm-svn: 200427	2014-01-29 23:08:10 +00:00
Tobias Grosser	28a70c543d	ScopDetect: Transitively remove all children after region expansion In rare cases, a region R which is itself not valid has an indirect child region that is valid. When R becomes part of a valid region by expansion of another region, then all children of R have to be erased from the set of valid regions. This patch ensures that indirect children are erased in addition to direct children. Contributed-by: Armin Groesslinger <armin.groesslinger@uni-passau.de> Tobias: I added a reduced test case and adjusted the logic of the patch to only recurse until the first child is found. llvm-svn: 200411	2014-01-29 19:05:30 +00:00
Tobias Grosser	458fb78cfa	Check if array base addresses are invariant Array base addresses need to be invariant in the region considered. The base address has to be computed outside the region, or, when it is computed inside, the value must not change with the iterations of the loops. For example, when a two-dimensional array is represented as a pointer to pointers the base address A[i] in an access A[i][j] changes with i; therefore, such regions have to be rejected. Contributed by: Armin Größlinger <armin.groesslinger@uni-passau.de> llvm-svn: 200314	2014-01-28 12:58:58 +00:00
Tobias Grosser	5b5daab9f1	Add more test cases to check loop invariance of the base pointer. llvm-svn: 200305	2014-01-28 10:29:17 +00:00
Tobias Grosser	24d7e669b3	Do not test polybench with 'make check-polly' Those test cases should be tested in the LLVM test suite. For Polly we should extract regression tests for the individual passes. llvm-svn: 200206	2014-01-27 10:37:33 +00:00
Tobias Grosser	54646f7fab	Remove other unnecessary uses of -O3 in the test suite The polly test suite is now -O3 clean. llvm-svn: 200205	2014-01-27 10:37:06 +00:00
Tobias Grosser	a7fea8386c	Do not run -O3 to canonicalize test case This is not only not necessary, but in case -O3 changes this can actually cause arbitrarily failing test cases such as, e.g., a recent change by Chandler that caused -O3 to unroll the loop body, which made the loop we wanted to detect disappear and consequently this test case fail. llvm-svn: 200204	2014-01-27 10:23:12 +00:00
Tobias Grosser	b917f47fc4	Dependences: Bound the time dependence calculation is allowed to take Count the number of computational steps that have been used to solve the dependence problem and abort in case we reach the "compute-out". This ensures we do not hang forever in cases the dependence problem is too difficult to solve. There is just a single case in the LLVM test-suite that runs into the compute-out. Even in this case, we can probably coalesce some of the parameters (i32 b, i32 b zext i64, ...) to simplify the problem enough to not hit the compute out. However, for now we set the compute out in place to address the general issue. The compute out was chosen such that it stops on a recent laptop after about 8 seconds. llvm-svn: 200156	2014-01-26 19:38:34 +00:00
Tobias Grosser	0d43646f93	Adjust test case to changed cloog output llvm-svn: 199587	2014-01-19 11:53:51 +00:00
Tobias Grosser	8519f897e7	Report detected scops using the new diagnostics We now report the following: $ polly-clang -O3 -mllvm -polly -mllvm -polly-report test.c -c \ -gline-tables-only note: Polly detected an optimizable loop region (scop) in function 'foo' test.c:2: Start of scop test.c:3: End of scop note: Polly detected an optimizable loop region (scop) in function 'bar' test.c:9: Start of scop test.c:13: End of scop llvm-svn: 197558	2013-12-18 10:49:53 +00:00
Tobias Grosser	7b6f9ba572	ScopValidator: smax expressions are no parameters This fixes PR18155 which is a regression introduced in 152913. llvm-svn: 196827	2013-12-09 21:51:46 +00:00
Tobias Grosser	7d66a19fe4	test: Remove use of defaultOpts llvm-svn: 196826	2013-12-09 21:51:31 +00:00
Tobias Grosser	54ee0ba74d	IslCodegen: Support for run-time conditions llvm-svn: 194948	2013-11-17 03:18:25 +00:00
Tobias Grosser	e86109f508	ScopInfo: Add support for AssumedContext When constructing a scop sometimes the exact representation of a statement or condition would be very complex, but there is a common case which is a lot simpler, but which is only valid under certain assumptions. The assumed context records the assumptions taken during the construction of this scop and that need to be code generated as a run-time test. At the moment, we do not yet model any assumptions, but only added the AssumedContext as well as the isl-ast generation support. As a next step, this needs to be hooked up with the isl code generation. if (1) /* run-time condition */ { /* optimized code */ } else { /* original code */ } llvm-svn: 193652	2013-10-29 21:05:49 +00:00
Tobias Grosser	4f8c0877e8	This test case requires assertions llvm-svn: 192530	2013-10-12 09:15:56 +00:00
Sebastian Pop	20594a842c	use -polly-codegen-isl in tests under test/Isl llvm-svn: 192110	2013-10-07 16:43:04 +00:00
Sebastian Pop	946070f2f0	do not use -polly-cloog in a ScopInfo testcase llvm-svn: 192109	2013-10-07 16:43:00 +00:00
Tobias Grosser	3613fd7a35	ScopInfo: Correctly handle true/false conditions This is a modified version of the originally contributed patch. Contributed-by: alexandre.isoard@gmail.com llvm-svn: 190237	2013-09-07 01:54:13 +00:00
Tobias Grosser	815c635cec	[CodeGen] Fixup assert fails caused by incorrect LoopInfo update Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 189764	2013-09-02 16:13:00 +00:00
Daniel Dunbar	2bd59a2cc7	[tests] Update to use lit_config and lit package, as appropriate. llvm-svn: 188114	2013-08-09 21:54:36 +00:00
Tobias Grosser	22a155a7a6	ScopInfo: add a testcase that share parameters within nested start. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 187772	2013-08-06 04:36:45 +00:00
Tobias Grosser	e42ddb9ad3	ScopInfo: Split start value from SCEVAddRecExpr to enable parameter sharing. SCoP invariant parameters with different start values would deter parameter sharing. For example, when compiling the following C code: void foo(float *input) { for (long j = 0; j < 8; j++) { // SCoP begin for (long i = 0; i < 8; i++) { float x = input[j * 64 + i + 1]; input[j * 64 + i] = x * x; } } } Polly would create two parameters for these memory accesses: p_0: {0,+,256} p_2: {4,+,256} [j * 64 + i + 1] => MemRef_input[o0] : 4o0 = p_1 + 4i0 [j * 64 + i] => MemRef_input[o0] : 4o0 = p_0 + 4i0 These parameters differ only in their start value. To enable parameter sharing, we split the start value from SCEVAddRecExpr, so they would share a single parameter that always has zero start value: p0: {0,+,256}<%for.cond1.preheader> [j * 64 + i + 1] => MemRef_input[o0] : 4o0 = 4 + p_1 + 4i0 [j * 64 + i] => MemRef_input[o0] : 4o0 = p_0 + 4i0 Such translation can make the polly-dependence much faster. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 187728	2013-08-05 15:14:15 +00:00
Tobias Grosser	96ef078583	Remove '-debug-only' from test case This flag was not used in the test case, but caused failures when LLVM was built without debugging. We can safely remove it. llvm-svn: 187343	2013-07-29 05:35:11 +00:00
Tobias Grosser	6e358c067a	TempScop: Actually load Polly in this test case llvm-svn: 187342	2013-07-29 05:18:09 +00:00
Tobias Grosser	7032ea6f5b	Remove second '-analyze' from command line llvm-svn: 187341	2013-07-29 05:15:33 +00:00
Tobias Grosser	85f7421731	JSONImporter: Free new schedule if found invalid In case we detect that the schedule the user wants to import is invalid we refuse it _and_ free the isl_maps containing it. Another bug found thanks to Rafael. llvm-svn: 187339	2013-07-29 05:12:01 +00:00
Tobias Grosser	880c52f56a	CodeGeneration: Fix double free in vector for We now use __isl_take to annotate the uses of the isl_set where we got the memory management wrong. Thanks to Rafael! His pipefail work hardened our test environment and exposed this bug nicely. llvm-svn: 187338	2013-07-29 01:58:07 +00:00
Rafael Espindola	cd61afb4ee	Use a slightly smaller hammer to make this pass. When first updating this test I only noticed the first RUN line. llvm-svn: 187328	2013-07-28 11:13:49 +00:00
Tobias Grosser	25f0342a68	Temporarily disable a test until I finish the fix llvm-svn: 187305	2013-07-27 15:19:57 +00:00
Rafael Espindola	0329bb4fce	Looks like this test crashes. Add --crash to not for now. llvm-svn: 187300	2013-07-27 11:08:44 +00:00
Rafael Espindola	e559af8205	Add not to commands that fail. Polly devs: please check if these commands really should fail. llvm-svn: 187263	2013-07-26 22:49:25 +00:00
Tobias Grosser	6bcb34b180	ScopDetect: Add some test cases for sequential loops llvm-svn: 187024	2013-07-24 06:10:37 +00:00
Hongbin Zheng	63cc9467af	Ensure a correct order between memory accesses. Ensure that the scalar write access that corresponds to the result of a load instruction appears after the generic read access that corresponds to the load instruction. llvm-svn: 186419	2013-07-16 15:20:29 +00:00
Hongbin Zheng	5a772dcd84	IndependentBlock: Add option to disable scalar to array rewriting. llvm-svn: 186418	2013-07-16 15:19:33 +00:00
Tobias Grosser	6f0d6988a5	Dependences: Add a couple of basic test cases llvm-svn: 186254	2013-07-13 18:31:46 +00:00
Tobias Grosser	229d681675	Dependences: Clarify difference between value and memory based dependences We make the option a clear choice between the two analysis types and add descriptions about the difference between the two. llvm-svn: 186251	2013-07-13 17:37:55 +00:00
Sebastian Pop	784c012982	scop detection: remove an iteration over all uses reenabled reverted patch after checking that it passes without regressions on the nightly test-suite. Added testcase from Tobi. llvm-svn: 185720	2013-07-05 20:24:47 +00:00
Hongbin Zheng	8d3a888ca3	TempScop: (Partial) Implement the printDetail function. llvm-svn: 185254	2013-06-29 07:00:14 +00:00
Tobias Grosser	4f96749351	ScopInfo: Clarify may-write and must-write accesses llvm-svn: 184658	2013-06-23 05:21:18 +00:00
Tobias Grosser	3e030e178a	Correctly convert APInt to gmp values Previously this happened to work for integers up to i64, but we got it wrong for larger numbers. Fix this and add test cases to verify this keeps working. Reported by: Sven Verdoolaege <skimo at kotnet dot org> llvm-svn: 183986	2013-06-14 16:23:38 +00:00
Sebastian Pop	9d63234ad1	ScopDetect: check region entering edges are valid. When a region header is part of a loop, then all entering edges of this region should not come from the loop but outside the region. Otherwise, the loop may be only partially part of the region, which would cause troubles in handling induction variables. Currently, we can only model induction variables that are either fully part of the scop (loop induction variable) or induction variables that are scop-invariant (parameter). A loop that is only partially part of the scop causes troubles, as there is no good way to handle the induction variable in the independent blocks pass. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 183800	2013-06-11 22:20:40 +00:00
Sebastian Pop	2c9ec2e651	scop detection: do not run scop detection on regions without loops otherwise, use -polly-detect-scops-in-regions-without-loops to also detect scops in regions without loops llvm-svn: 183113	2013-06-03 16:35:37 +00:00
Tobias Grosser	93324aef17	Test that independent block pass does not transform induction variables The original test case showed a problem with the independent blocks pass and we decided to XFAIL it for now. Unfortunately the failure is not detected if we build without asserts and the verification of the independent block pass is not run. This change tests now for the actual reason of the failure and should trigger even in a non asserts build. We did not yet solve the underlying bug, but this should at least make the test suite behavior consistent. llvm-svn: 183025	2013-05-31 17:44:38 +00:00
Sebastian Pop	8fe6d11b84	scop detection: only handle functions with loops to detect scops in functions with no loops, use -polly-detect-scops-in-functions-without-loops llvm-svn: 182941	2013-05-30 17:47:32 +00:00
Sebastian Pop	359d3aa8a1	independent blocks: when moving Values, invalidate SCEV cached info llvm-svn: 182310	2013-05-20 20:02:03 +00:00
Sebastian Pop	c90ec7812e	rename make check target to match the naming convention followed in the other llvm projects llvm-svn: 182171	2013-05-17 23:04:28 +00:00
Tobias Grosser	3081b0f5ec	Update LoopInfo correctly When the Polly code generation was written we did not correctly update the LoopInfo data, but still claimed that the loop information is correct. This does not only lead to missed optimizations, but it can also cause miscompilations in case passes such as LoopSimplify are run after Polly. Reported-by: Sergei Larin <slarin@codeaurora.org> llvm-svn: 181987	2013-05-16 06:40:24 +00:00
Tobias Grosser	5db6ffd76f	LoopGenerators: Construct loops such that they are already loop rotated [Diagram: control-flow graph of the generated loop with the blocks BeforeBB, GuardBB, PreHeaderBB, HeaderBB (with a latch back edge) and ExitBB] This does not only remove the need for an explicit loop rotate pass, but it also gives us the possibility to skip the construction of the guard condition in case the loop is known to be executed at least once. We do not yet exploit this, but by implementing this analysis in the isl code generator we should be able to remove more guards than the generic loop rotate pass can. Another point is that loop rotation can introduce additional PHI nodes, which may hide that a loop can be executed in parallel. This change avoids this complication and will make it easier to move the openmp code generation into a separate pass. llvm-svn: 181986	2013-05-16 06:40:06 +00:00
Tobias Grosser	637bd63123	Move polly options into separate option category Use the new cl::OptionCategory support to move the Polly options into a separate option category. The aim is to hide most options and show by default only the options a user needs to influence '-O3 -polly'. The available options probably need some care, but here is the current status: Polly Options: Configure the polly loop optimizer -enable-polly-openmp - Generate OpenMP parallel code -polly - Enable the polly optimizer (only at -O3) -polly-no-tiling - Disable tiling in the scheduler -polly-only-func=<function-name> - Only run on a single function -polly-report - Print information about the activities of Polly -polly-vectorizer - Select the vectorization strategy =none - No Vectorization =polly - Polly internal vectorizer =unroll-only - Only grouped unroll the vectorize candidate loops =bb - The Basic Block vectorizer driven by Polly llvm-svn: 181295	2013-05-07 07:31:10 +00:00
Tobias Grosser	e8df5bd92b	IndependentBlocks: We can only reconstruct PHI nodes that are within the SCoP In the classical (non -polly-codegen-scev) mode, we assume that we can always recreate PHI nodes during code generation. This is not true. We can only reconstruct them from the polyhedral information, in case the entire loop of the PHI node is part of the SCoP and consequently the PHI node was translated in the polyhedral description. llvm-svn: 179674	2013-04-17 07:20:36 +00:00
Tobias Grosser	b5f92892d1	Remove unneeded RegionSimplify pass. We now support regions with multiple entries and multiple exits natively. Regions no longer need to be simplified to single entry and single exit. We need to XFAIL two test cases as this change increases the scop coverage and uncovers two failures in the independent blocks pass. The first failure will be fixed in a subsequent commit, the second one is in the non-default -polly-codegen-scev mode and still needs to be fixed. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179673	2013-04-17 07:20:30 +00:00
Tobias Grosser	36a01b0a28	tests: Fix 'instruction does not dominate all its uses' error The LLVM-IR of this test case was apparently incorrect. llvm-svn: 179672	2013-04-17 07:20:17 +00:00
Tobias Grosser	8edce4ee62	Support SCoPs with multiple entry edges. Regions that have multiple entry edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / for_region This for_region contains two entry edges 'then' -> 'for_region' and 'else' -> 'for_region'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively when the region is in -loop-simplify form, which means the entry block should not be a loop header. Contributed by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179586	2013-04-16 08:04:42 +00:00
Tobias Grosser	3ed2600cab	SCEVValidator: Correctly store 'k * p' as a parameter We do not only need to understand that 'k * p' is a parameter expression, but also need to store this expression in the set of parameters. Before this patch we wrongly stored the two individual parameters %k and %p. Reported by: Sebastian Pop <spop@codeaurora.org> llvm-svn: 179485	2013-04-14 13:15:59 +00:00
Tobias Grosser	f242b806ac	ScheduleOpt: Do not crash on statements with empty iteration domains Statements with an empty iteration domain may not have a schedule assigned by the isl schedule optimizer. As Polly expects each statement to have a schedule, we keep the old schedule for such statements. This fixes http://llvm.org/PR15645 Reported-by: Johannes Doerfert <johannesdoerfert@gmx.de> llvm-svn: 179233	2013-04-10 22:48:08 +00:00
Sebastian Pop	1006614228	fix testcase llvm-svn: 179183	2013-04-10 16:44:08 +00:00
Tobias Grosser	ecb5092707	ScopDetect: Allow multiplications of the form <param> * <param> We handle these by treating this result of the multiplication as an additional parameter. llvm-svn: 179163	2013-04-10 07:42:28 +00:00
Tobias Grosser	0ee50f6ee4	Support SCoPs with multiple exit edges Regions that have multiple exit edges are very common. A simple if condition yields e.g. such a region: if / \ then else \ / after Region: if -> after This region contains the bbs 'if', 'then', 'else', but not 'after'. It has two exit edges 'then' -> 'after' and 'else' -> 'after'. Previously we scheduled the RegionSimplify pass to translate such regions into simple regions. With this patch, we now support them natively. Contributed-by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179159	2013-04-10 06:55:31 +00:00
Sebastian Pop	9f57c5b695	scop detection: properly instantiate SCEVs to the place where they are used Fix inspired from c2d4a0627e95c34a819b9d4ffb4db62daa78dade. Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When translating the data reference A[i] in statement S using scev, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S'. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10}. This used to crash in: int loopDimension = getLoopDepth(Expr->getLoop()); isl_aff *LAff = isl_aff_set_coefficient_si( isl_aff_zero_on_domain(LocalSpace), isl_dim_in, loopDimension, 1); (gdb) p Expr->dump() {8,+,8}<nw><%do.body> (gdb) p getLoopDepth(Expr->getLoop()) $5 = 0 isl_space *Space = isl_space_set_alloc(Ctx, 0, NbLoopSpaces); isl_local_space *LocalSpace = isl_local_space_from_space(Space); As we are trying to create a memory access in a stmt that is outside all loops, LocalSpace has 0 dimensions: (gdb) p NbLoopSpaces $12 = 0 (gdb) p Statement.BB->dump() if.then: ; preds = %do.end %0 = load float* %add.ptr, align 4 store float %0, float* %q.1.reg2mem, align 4 br label %if.end.single_exit and so the scev for %add.ptr should be taken at the place where it is used, i.e., it should be the value on the last iteration of the do.body loop, and not "{8,+,8}<nw><%do.body>". llvm-svn: 179148	2013-04-10 04:05:18 +00:00
Sebastian Pop	9ca6612731	IndependentBlocks: translate out of SSA all uses escaping the region llvm-svn: 179019	2013-04-08 13:05:41 +00:00
Tobias Grosser	4d96c8d714	clang-format: Many more files After this commit, polly is clang-format clean. This can be tested with 'ninja polly-check-format'. Updates to clang-format may change this, but the differences will hopefully be both small and general improvements to the formatting. We currently have some not very nice formatting for a couple of items, DEBUG() stmts for example. I believe the benefit of being clang-format clean outweighs the imperfect layout of this code. llvm-svn: 177796	2013-03-23 01:05:07 +00:00
Tobias Grosser	369430ffca	codegen: properly instantiate SCEVs to the place where they are used Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When code generating S using scev based code generation, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S' and that we consequently do not know how to code generate. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10} and it can be code generated without troubles. llvm-svn: 177777	2013-03-22 23:42:53 +00:00
Tobias Grosser	8ff029ccf1	Add failing test case llvm-svn: 177645	2013-03-21 16:14:55 +00:00
Tobias Grosser	826b2af112	Remove last uses of canonical induction variable when scev code generating We now detect scops without a canonical induction variable and can generate a polyhedral representation for them. There was no modification necessary to code generate these scops. llvm-svn: 177643	2013-03-21 16:14:50 +00:00
Tobias Grosser	5bfa4f8eb8	CodePrepare: Do not require canonical induction variables for scev based mode llvm-svn: 177593	2013-03-20 22:41:53 +00:00
Tobias Grosser	db8b8a5b8e	ScopDetect: Test case to verify that base pointers are scop invariant llvm-svn: 177582	2013-03-20 21:40:11 +00:00
Tobias Grosser	e4584f6abf	ScopDetect: Add test cases for non-simple regions llvm-svn: 177567	2013-03-20 20:02:35 +00:00
Tobias Grosser	ecfe21b792	Remove dependence on canonical induction variable When using the scev based code generation, we now do not rely on the presence of a canonical induction variable any more. This commit prepares the path to (conditionally) disable the induction variable canonicalization pass. llvm-svn: 177548	2013-03-20 18:03:18 +00:00
Tobias Grosser	d2fbbf0f74	IndependentBlocks: Add a couple of test cases. llvm-svn: 177438	2013-03-19 21:11:25 +00:00
Tobias Grosser	d4ff632fa9	ScopDetection: Add a couple of test cases llvm-svn: 177433	2013-03-19 20:15:19 +00:00
Sebastian Pop	97cb813c29	Correct function to decide if a SCEV can be ignored When doing SCEV based code generation, we ignore instructions calculating values that are fully defined by a SCEV expression. The values that are calculated by these instructions are recalculated on demand. This commit improves the check to verify if certain instructions can be ignored and recalculated on demand. llvm-svn: 177313	2013-03-18 20:21:13 +00:00
Tobias Grosser	7f54714dcc	tests: Properly check if asserts are available In my previous commits I failed to realise that my new requires lines fully disabled these tests. We now properly check if we are in an asserts build and only disable the tests if assertions are not available. Reported-by: Sean Silva <silvas@purdue.edu> llvm-svn: 176900	2013-03-12 21:27:39 +00:00
Tobias Grosser	ee9423920e	Missed one test case in the last commit llvm-svn: 176864	2013-03-12 13:39:40 +00:00
Tobias Grosser	c9a72919a5	Move tests that depend on -stats under 'requires asserts' This fixes issues caused by the following commit: r176733 | jvoung | 2013-03-08 17:56:31 -0500 Disable statistics on Release builds and move tests that depend on -stats. Reported by: Jack Howarth <howarth@bromo.med.uc.edu> llvm-svn: 176856	2013-03-12 08:45:15 +00:00
Bill Wendling	83e9312ece	Use attributes references on call/invoke instructions. llvm-svn: 175881	2013-02-22 09:29:15 +00:00
Tobias Grosser	c92c8f06ec	[isl-codegen]: Fix off by one in getNumberOfIterations We need to remove one dimension. Any is correct as long as it exists. We have chosen for whatever reason the dimension #dims - 2. This is incorrect if there is just one dimension. For CLooG this case never happened. For isl however, the case can happen and causes undefined behavior including crashes. We now always choose the last dimension #dims - 1. We could have chosen dimension '0' but the last dimension is what we remove conceptually in the algorithm, so it seems better to actually program it that way. While at it remove another piece of undefined behavior. llvm-svn: 174894	2013-02-11 17:52:36 +00:00
Sebastian Pop	04c4ce32ae	isl: vector code generation based on ISL ast Original patch by Tobias Grosser, slightly modified by Sebastian Pop. llvm-svn: 170420	2012-12-18 07:46:13 +00:00
Sebastian Pop	e252c85545	isl: detect vector parallelism llvm-svn: 170138	2012-12-13 16:52:41 +00:00
Tobias Grosser	e36abf6d5d	isl: Detect openmp parallelism Based on code written by Riyadh Baghdadi. llvm-svn: 170102	2012-12-13 06:24:06 +00:00
Andy Gibbs	9936b214c0	Integrate polly test-suite into an llvm "make check-all" if built as part of the whole using cmake. llvm-svn: 169487	2012-12-06 07:59:18 +00:00
Sebastian Pop	a267d9b829	adapt cloog codegen testcases to isl llvm-svn: 169161	2012-12-03 21:34:09 +00:00
Sebastian Pop	47987128b6	use -polly-ast instead of -polly-cloog llvm-svn: 169160	2012-12-03 21:33:55 +00:00
Sebastian Pop	b08a52898a	execute cloog specific testcases only with CLOOG_FOUND llvm-svn: 169159	2012-12-03 21:33:40 +00:00
Patrik Hägglund	b476cdfde5	Fix tests with broken datalayout strings. Buildbot failure at r168785. llvm-svn: 168791	2012-11-28 13:30:31 +00:00
Sebastian Pop	ee4baf3eec	do not execute the OpenMP tests when cloog is not found llvm-svn: 168724	2012-11-27 21:15:15 +00:00
Tobias Grosser	3344f733fd	test: LLVM supports now vectors of arbitrary pointers This allows Polly to vectorize more code. Fix the relevant test cases. llvm-svn: 167923	2012-11-14 08:25:52 +00:00
Tobias Grosser	38ea9cd721	Tests: Pipe test files into 'opt' Use 'opt < %s' instead of just 'opt %s' to ensure that no temporary files are created. llvm-svn: 167372	2012-11-04 16:56:20 +00:00
Tobias Grosser	dcebf1e9da	Tests: remove ModuleID lines llvm-svn: 167284	2012-11-02 06:09:20 +00:00
Tobias Grosser	41b20a62c9	Tests: move content of .c files in .ll llvm-svn: 167283	2012-11-02 06:08:39 +00:00
Tobias Grosser	3eb851f370	Remove runtime tests from polly test suite Similar to LLVM we now follow the policy of only having LLVM-IR level tests in the Polly test suite. Testing for miscompilation of larger programs should be done with the llvm test suite. llvm-svn: 167255	2012-11-01 21:44:59 +00:00
Tobias Grosser	81a1c75035	Dependences: Add support to calculate memory based dependences Instead of calculating exact value (flow) dependences, it is also possible to calculate memory based dependences. Sometimes memory based dependences are a lot easier to calculate. To evaluate the benefits, we add an option to calculate memory based dependences (use -polly-value-dependences=false). llvm-svn: 167251	2012-11-01 21:28:32 +00:00
Tobias Grosser	ebe8c8cea2	Codegen: Selectively copy in array addresses for OpenMP code The detection of values that need to be copied in to the generated OpenMP subfunction also detects the array base addresses needed in the SCoP. Hence, it is not necessary to unconditionally copy all the base addresses to the generated function. Test cases are modified to reflect this change. Arrays which are global variables do not occur in the struct passed to the subfunction anymore. A test case for base address copy-in is added in copy_in_array.{c,ll}. Committed with slight modifications Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167215	2012-11-01 05:34:55 +00:00
Tobias Grosser	177982c478	CodeGen: Add scop-parameters to the OpenMP context In addition to the arrays and clast variables a SCoP statement may also refer to values defined before the SCoP or to function arguments. Detect these values and add them to the set of values passed to the function generated for OpenMP parallel execution of a clast. Committed with additional test cases and some refactoring. Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167214	2012-11-01 05:34:48 +00:00
Tobias Grosser	a17f666f99	Codegen: Copy and restore the ValueMap and ClastVars explicitly When generating OpenMP or GPGPU code the original ValueMap and ClastVars must be kept. We already recovered the original ClastVars by reverting the changes, but we did not keep the content of the ValueMap. This patch keeps now an explicit copy of both maps and restores them after generating OpenMP or GPGPU code. This is an adapted version of a patch contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167213	2012-11-01 05:34:35 +00:00
Tobias Grosser	6c8e696618	cmake: Use suffix for shared modules instead of the one for shared libraries On Linux there is no difference between shared modules and shared libraries, both are '.so' files. However, on darwin only shared modules are '.so' files. Shared libraries have the '.dylib' suffix. Fix test cases on darwin by expecting a shared module suffix for Polly instead of a shared library suffix. This fixes PR14135 Reported by: Jack Howarth <howarth@bromo.med.uc.edu> llvm-svn: 166402	2012-10-21 21:08:29 +00:00
Tobias Grosser	28781423b2	isl scheduler: Do not fail when returning an empty band list The bug was within isl. To fix it, we simply update the isl version that is used by Polly. We still have some changes within Polly to be able to write a proper test case. Reported-by: Sameer Sahasrabuddhe <Sameer.Sahasrabuddhe@amd.com> llvm-svn: 166021	2012-10-16 07:29:19 +00:00
Tobias Grosser	c967d8e6e9	isl-codegen: Support '<' and '>' Previously isl always generated '<=' or '>='. However, in many cases '<' or '>' leads to simpler code. This commit updates isl and adds the relevant code generation support to Polly. llvm-svn: 166020	2012-10-16 07:29:13 +00:00
Tobias Grosser	6a2da6b9c8	Add test cases for multi-dimensional variable lengths arrays At the moment we can handle such arrays only by conservatively assuming that each access to such an array may touch any element in the array. It would be great if we could improve Polly/LLVM at some point, such that we can recover the multi-dimensionality of the accesses. llvm-svn: 163619	2012-09-11 14:03:19 +00:00
Tobias Grosser	ed29566c4e	ScopInfo: Align parameters when using -polly-allow-nonaffine This ensures that the isl sets/maps we operate on have the same parameter dimensions. Operations on objects with different parameter dimensions are not allowed and trigger assertions. llvm-svn: 163618	2012-09-11 13:50:21 +00:00
Tobias Grosser	6217e18a7d	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with the cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. The patch was committed with smaller changes to the build system: There is now a flag to enable gpu code generation explicitly. This was required as we need the llvm.codegen() patch applied on the llvm sources, to compile this feature correctly. Also, enabling gpu code generation does not require cuda. This requirement was removed to allow 'make polly-test' runs, even without an installed cuda runtime. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 161239	2012-08-03 12:50:07 +00:00
Hongbin Zheng	7aee737062	IndependentBlocks: Do not visit the same instruction twice when moving the operand tree. This patch fixes Bug 13491, and the original "FIXME" in IndependentBlocks.cpp. Patched by Kevin Fan <kevin.fan@gmail.com>. llvm-svn: 161105	2012-08-01 08:46:11 +00:00
Tobias Grosser	6cc23b07e6	Revert "Add preliminary implementation for GPGPU code generation." I did not take into account, that this patch fails to compile without the llvm.codegen patch applied. This breaks buildbots. I revert this until we found a solution to commit this without buildbots complaining. This reverts commit cb43ab80e94434e780a66be3b9a6ad466822fe33. llvm-svn: 160165	2012-07-13 07:44:56 +00:00
Tobias Grosser	b299d28181	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 160164	2012-07-13 07:21:00 +00:00
Tobias Grosser	96682025c7	Add some tests for the independent blocks pass. llvm-svn: 158306	2012-06-11 10:25:12 +00:00
Tobias Grosser	18daacad61	ScopInfo: Add parameter bounds to context Derive the maximal and minimal values of a parameter from the type it has. Add this information to the scop context. This information is needed, to derive optimal types during code generation. llvm-svn: 157245	2012-05-22 10:47:27 +00:00
Hongbin Zheng	6417255283	Regression tests: Adapt the vectorize option change. llvm-svn: 156255	2012-05-06 10:22:43 +00:00
Tobias Grosser	e71c6ab54c	SCEV based code generation This is an incomplete implementation of the SCEV based code generation. When finished it will remove the need for -indvars -enable-iv-rewrite. For the moment it is still disabled. Even though it passes 'make polly-test', there are still loose ends especially in respect of OpenMP code generation. llvm-svn: 155717	2012-04-27 16:36:14 +00:00
Tobias Grosser	7c3061acdd	Make vector tests less sensitive to codegen changes llvm-svn: 155438	2012-04-24 11:08:07 +00:00
Tobias Grosser	216ea58b21	ScheduleOpt: Fix crash with -enable-polly-vector llvm-svn: 154808	2012-04-16 11:06:06 +00:00
Tobias Grosser	4cb5461dae	CodeGen: Generate scalar code if vector instructions cannot be generated This fixes two crashes that appeared in case of: - A load of a non vectorizable type (e.g. float**) - An instruction that is not vectorizable (e.g. call) llvm-svn: 154586	2012-04-12 10:46:55 +00:00
Hongbin Zheng	e2107f0999	Revert "Make the "all" target depend on polly-test, so that users can run regression" This reverts commit 97bd8d50881000c11b65b0e033996ec5f57bcd15. llvm-svn: 154487	2012-04-11 07:43:24 +00:00
Tobias Grosser	84ecc47e1c	CodeGen: Allow Polly to do 'grouped unrolling', but no vector generation. Grouped unrolling means that we unroll a loop such that the different instances of a certain statement are scheduled right after each other, but we do not generate any vector code. The idea here is that we can schedule the bb vectorizer right afterwards and use its heuristics to decide when vectorization should be performed. llvm-svn: 154251	2012-04-07 06:16:08 +00:00
Tobias Grosser	0905a23806	CodeGen: Recreate old ivs with the original type To avoid overflows we still use a larger type (i64) while calculating the value of the old ivs. However, we truncate the result to the type of the old iv when providing it to the new code. A corresponding test case is added to the polly test suite. Also, a failing test case is fixed. This fixes PR12311. Contributed by: Tsingray Liu <tsingrayliu@gmail.com> llvm-svn: 153952	2012-04-03 12:24:32 +00:00
Tobias Grosser	de49ef76f6	Remove unneeded alias analysis llvm-svn: 153839	2012-04-01 16:49:48 +00:00
Tobias Grosser	89339067b0	CodeGen: Allow function parameters to be rewritten in getNewValue() When deriving new values for the statements of a SCoP, we assumed that parameter values are constant within the SCoP and consequently do not need to be rewritten. For OpenMP code generation this assumption is wrong, as such values are not available in the OpenMP subfunction and consequently also may need to be rewritten. Committed with some changes. Contributed-By: Johannes Doerfert <s9jodoer@stud.uni-saarland.de> llvm-svn: 153838	2012-04-01 16:49:45 +00:00
Hongbin Zheng	b5bf8cfa17	Make the "all" target depend on polly-test, so that users can run regression tests by simply typing "make -C tools/polly/test", like llvm's regression tests. llvm-svn: 153739	2012-03-30 09:27:16 +00:00
Hongbin Zheng	2700adebfa	Autoconf build: Try to update LLVMPolly.so before running regression tests llvm-svn: 153738	2012-03-30 09:27:07 +00:00
Tobias Grosser	900893d2d8	CodeGeneration: Properly build the dominator tree llvm-svn: 153645	2012-03-29 13:10:26 +00:00
Hongbin Zheng	e53bdfe633	Use python script to silence the expected testcase fails on 32bit platform. llvm-svn: 153644	2012-03-29 13:10:10 +00:00
Hongbin Zheng	689e84fcec	Regression testing: Substitute POLLY_LIB_DIR, which was introduced by commit r152924, with $(LibDir), because we assume that Polly built by autoconf is always in the llvm tree. llvm-svn: 153642	2012-03-29 12:36:52 +00:00
Hongbin Zheng	0578aaf77c	Don't fail the lli testcases on 32bit platform. llvm-svn: 153440	2012-03-26 15:16:48 +00:00
Tobias Grosser	cf88d84d79	test: Remove memaccess prefix The prefix is not needed, as all test cases are already in a separate folder. llvm-svn: 153320	2012-03-23 08:24:04 +00:00
Tobias Grosser	d6adda3071	CodeGen: Full support for isl_pw expressions in modified access functions. This also adds support for modifiable write accesses (until now only read accesses were supported). We currently do not derive an exact type for the expression, but assume that i64 is good enough. This will be improved in future patches. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 153319	2012-03-23 08:21:22 +00:00
Tobias Grosser	3ec2abc5fb	Don't allow pointer types in affine expressions We currently do not support pointer types in affine expressions. Hence, we disallow in the SCoP detection. Later we may decide to add support for them. This fixes PR12277 Reported-By: Sebastian Pop <sebpop@gmail.com> llvm-svn: 152928	2012-03-16 16:36:47 +00:00
Hongbin Zheng	c7584ff270	Off-tree build support: Set the path of Polly's library correctly. llvm-svn: 152924	2012-03-16 14:34:20 +00:00
Hongbin Zheng	33254d1edf	Revert "Minor change: Use config.polly_obj_root to locate Polly's library," This reverts commit 7dd9b6327b54b08ece32a4607d5ac093b518b79a. llvm-svn: 152923	2012-03-16 13:49:55 +00:00
Hongbin Zheng	95c84eab5c	Minor change: Use config.polly_obj_root to locate Polly's library, so lit find Polly's library in off-tree build. llvm-svn: 152920	2012-03-16 13:24:34 +00:00
Tobias Grosser	8a5070213a	ScheduleOptimizer: Do not get dependences, if we do not calculate a schedule This solves the 'isl_ctx freed, but some objects still reference it' problem reported in PR12276. llvm-svn: 152917	2012-03-16 11:51:41 +00:00
Tobias Grosser	371badaa47	SCEVValidator: Ensure that parameters are recorded correctly This also fixes UMax where we did not correctly keep track of the parameters. Fixes PR12275. Reported-By: Sebastian Pop <sebpop@gmail.com> llvm-svn: 152913	2012-03-16 10:16:28 +00:00
Hongbin Zheng	c0f53b1c00	Polly-test: Add a cmake option "POLLY_TEST_DISABLE_BAR". We can enable this option in the configure step of Polly's builder to get more readable output from the stdio log. llvm-svn: 152910	2012-03-16 09:04:09 +00:00
Tobias Grosser	3cbe5cfff3	Remove FinalRead The FinalRead statement represented a virtual read that is executed after the SCoP. It was used when we verified the correctness of a schedule by checking if it yields the same FLOW dependences as the original code. This only works if we have a final read that reads all memory at the end of the SCoP. We have now switched to just checking that a schedule does not introduce negative dependences and also consider WAW and WAR dependences. This restricts the schedules a little bit more, but we do not have any optimizer that would calculate a more complex schedule. Hence, for now final reads are obsolete. llvm-svn: 152319	2012-03-08 15:21:51 +00:00
Tobias Grosser	df3823750e	CodeGen: Pass the scalar maps properly llvm-svn: 151916	2012-03-02 15:20:35 +00:00
Tobias Grosser	f6beec674e	CodeGen: Simplify the generation of a splat llvm-svn: 151912	2012-03-02 15:20:21 +00:00
Tobias Grosser	b61e6318ac	CodeGen: Name stmt bbs 'polly.stmt.' + OriginalName llvm-svn: 150575	2012-02-15 09:58:46 +00:00
Tobias Grosser	04eadc476e	tests: Replace . by %s llvm-svn: 150377	2012-02-13 12:29:43 +00:00
Tobias Grosser	8518bbe39f	CodeGen: Always name merge block llvm-svn: 150337	2012-02-12 12:09:46 +00:00
Tobias Grosser	0dbbdd7637	Codegen: Give split and merge basic blocks better names llvm-svn: 150335	2012-02-12 12:09:37 +00:00
Tobias Grosser	a187964bac	Support non-affine access functions in Polly. In case we can not analyze an access function, we do not discard the SCoP, but assume conservatively that all memory accesses that can be derived from our base pointer may be accessed. Patch provided by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 146972	2011-12-20 10:43:14 +00:00
Tobias Grosser	b6033396fd	ScheduleOptimizer: Do not tile bands with just one dimension llvm-svn: 146149	2011-12-08 13:02:58 +00:00
Tobias Grosser	595ec0d0e3	CLooG: Make sure ambiguous schedules do not introduce complicated code CLooG continued to split the domains even after the scattering. This led to complicated code. llvm-svn: 146033	2011-12-07 11:03:48 +00:00
Tobias Grosser	39913e3648	test: Switch to new atomic instructions This fixes the test with recent versions of LLVM that do not support the old atomic instructions any more. llvm-svn: 145402	2011-11-29 14:51:05 +00:00
Tobias Grosser	1e06003227	test: Add more dependences to cmake build llvm-svn: 145400	2011-11-29 14:50:47 +00:00
Tobias Grosser	f281702686	test: Do not hardcode '.so' as library suffix Contributed by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 145076	2011-11-22 19:40:38 +00:00
Tobias Grosser	4dca439cfc	Register Passes: Use -polly-optimizer=(isl|pocc) to switch optimizers This replaces the old option -polly-use-pocc. Also call the passes uniformly -polly-opt-pocc and -polly-opt-isl. llvm-svn: 145071	2011-11-22 19:40:19 +00:00
Tobias Grosser	8f99c167cd	ScopInfo: Use names of simple parameters to name the isl parameter dimensions. Parameters can be complex SCEV expressions, but they can also be single scalar values. If a parameter is such a simple scalar value and the value is named, use this name to name the isl parameter dimensions. llvm-svn: 144641	2011-11-15 11:38:55 +00:00
Tobias Grosser	f50fc50c80	Remove unused parameters from TempScop llvm-svn: 144232	2011-11-09 22:35:15 +00:00
Tobias Grosser	6e9f25a5d5	Remove AffineSCEVIterator We do not use it anymore. It was replaced by SCEVVisitors like the SCEVValidator. llvm-svn: 144229	2011-11-09 22:35:00 +00:00
Tobias Grosser	fb47d66a06	Remove unused code from SCEVAffFunc constructor llvm-svn: 144224	2011-11-09 22:34:39 +00:00
Tobias Grosser	5683df4a23	Remove more of SCEVAffineFunc llvm-svn: 144223	2011-11-09 22:34:34 +00:00
Tobias Grosser	db87142b26	TempScop: Remove more of the buildAffineFunction llvm-svn: 144221	2011-11-09 22:34:24 +00:00
Tobias Grosser	e6efa37e76	TempScopInfo: Remove unneeded construction of SCEVAffFunc llvm-svn: 144220	2011-11-09 22:34:18 +00:00
Tobias Grosser	60b54f19e6	Detect Parameters directly on the SCEV. Instead of using TempScop to find parameters, we detect them directly on the SCEV. This allows us to remove the TempScop parameter detection in a subsequent commit. This fixes a bug reported by Marcello Maggioni <hayarms@gmail.com> llvm-svn: 144087	2011-11-08 15:41:28 +00:00
Tobias Grosser	65fa78e975	TempScopInfo: Print the original SCEV instead of using SCEVAffFunc This is reducing the impact of SCEVAffFunc llvm-svn: 143574	2011-11-02 21:37:06 +00:00
Tobias Grosser	67707b7131	Enable prevectorization with -enable-polly-vector. This removes the separate prevector options for the Pluto and isl scheduler. llvm-svn: 142774	2011-10-23 20:59:40 +00:00
Tobias Grosser	22636bf498	Rename -enable-schedule-prevector to -polly-prevector llvm-svn: 142771	2011-10-23 20:59:29 +00:00
Tobias Grosser	2ff8723d5d	ScopDetection: Allow to limit the scop detection to a single function -polly-detect-only=<functionname> allows to limit the scop detection to a single function. llvm-svn: 142750	2011-10-23 11:17:06 +00:00
Tobias Grosser	0e27e24751	ScopInfo: Use separate function to build context llvm-svn: 141253	2011-10-06 00:03:48 +00:00
Tobias Grosser	7a5246a371	Test: Convert to new exception handling llvm-svn: 141069	2011-10-04 07:53:21 +00:00
Tobias Grosser	c92151516f	CodeGen: Support for Cast Operations in vector code generation llvm-svn: 139097	2011-09-04 11:45:52 +00:00
Tobias Grosser	7551c3000a	CodeGen: Better separate scalar and vector code generation. llvm-svn: 139095	2011-09-04 11:45:41 +00:00
Tobias Grosser	8ae9aca5cc	CodeGen: Improve naming of copied basic blocks It may happen that a basic block from the original scop is code generated several times. The new naming scheme reduces the confusion that arose earlier, as the version numbers of the new basic blocks could have been interpreted as part of the name of the original basic block. llvm-svn: 139092	2011-09-04 11:45:22 +00:00
Tobias Grosser	c532f12965	Fix crashes due to unaligned parameters Due to the recent introduction of isl_id, parameters need now always to be aligned. This was not yet taken care of in the code path of vectorization and dependence analysis. llvm-svn: 138555	2011-08-25 08:40:59 +00:00
Tobias Grosser	604c981f40	Temporarily remove reduction support and interchange pass I am planning to eliminate the TempScopInfo pass. To simplify this I remove some features that may later be added to the ScopInfo pass. The interchange pass is currently strongly tested and furthermore meant to be replaced by the general scheduling optimizer. Reductions themselves can later be added easily. llvm-svn: 138219	2011-08-21 14:57:58 +00:00
Raghesh Aloor	129e867865	Memaccess: Code generation for constant access function change Support for generating code for an access function change which is a constant is added. llvm-svn: 137603	2011-08-15 02:33:39 +00:00
Raghesh Aloor	62b13120ee	Memaccess: Codegeneration for a simple access function change Code is generated for a simple access function change imported from a JSCOP file. An access of A[i] is changed to A[0]. The code for A[0] is generated directly without referring to isl function calls. llvm-svn: 136789	2011-08-03 17:02:50 +00:00
Raghesh Aloor	7a04f4f9ba	Memaccess: Display Changed Access Relation The changed access relations imported from a JSCOP file are shown as output of the -analyze pass. llvm-svn: 136774	2011-08-03 13:47:59 +00:00
Tobias Grosser	bd2b2c7117	Add a vect target to the polly testsuite Contributed by: Sebastian Pop <sebpop@gmail.com> llvm-svn: 136685	2011-08-02 07:22:05 +00:00
Raghesh Aloor	3cb6628d7c	MemAccess: Reading Change in Access Function This patch reads the change in access functions from imported JSCOP file. A test case is also added. llvm-svn: 134991	2011-07-12 17:14:03 +00:00
Tobias Grosser	851b96e7f0	Adapt to LLVM type system changes Remove constness of Types and do not name the structures generated in the OpenMP code. llvm-svn: 134980	2011-07-12 12:42:54 +00:00
Tobias Grosser	928b2d16a6	test: Do not pipe the .ll file into opt The construct '< %s' complicates debugging with gdb --args as the content of %s is interpreted as gdb input. llvm-svn: 134432	2011-07-05 19:13:21 +00:00
Tobias Grosser	3770157502	test: Remove redundant function definition The latest version of LLVM fails, if a function is defined twice in an LLVM bitcode file. llvm-svn: 134400	2011-07-04 23:18:17 +00:00
Tobias Grosser	8c4cfc327b	CodeGeneration: Do not delete the old version of the Scop. Instead of deleting the old code, keep it on the side in an if-branch. It will either be deleted by the dead code elimination or we can use it as fallback. llvm-svn: 131352	2011-05-14 19:01:49 +00:00
Hongbin Zheng	94c5df16e2	ScopDetection: Remember the functions generated by backend in a pointer set, so we do not re-generate code for these functions. llvm-svn: 130975	2011-05-06 02:38:20 +00:00
Hongbin Zheng	e1bd40cfbd	Partial support test polly for out of tree build. llvm-svn: 130482	2011-04-29 07:34:54 +00:00
Tobias Grosser	758053788b	Add initial version of Polly This version is equivalent to commit ba26ebece8f5be84e9bd6315611d412af797147e in the old git repository. llvm-svn: 130476	2011-04-29 06:27:02 +00:00

1442 Commits