llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	6213913244	Use the branch instruction to define the location of a PHI-node write We use the branch instruction as the location at which a PHI-node write takes place, instead of the PHI-node itself. This allows us to identify the basic-block in a region statement which is on the incoming edge of the PHI-node and for which the write access was originally introduced. As a result we can, during code generation, avoid generating PHI-node write accesses for basic blocks that do not preceed the PHI node without having to look at the IR again. This change fixes a bug which was introduced in r243420, when we started to explicitly model PHI-node reads and writes, but dropped some additional checks that where still necessary during code generation to not emit PHI-node writes for basic-blocks that are not on incoming edges of the original PHI node. Compared to the code before r243420 the new code does not need to inspect the IR any more and we also do not generate multiple redundant writes. llvm-svn: 243852	2015-08-02 16:17:41 +00:00
Tobias Grosser	d2d15a8c65	Dependences: Zero pad the schedule map The schedule map we derive from a schedule tree map may map statements into schedule spaces of different dimensionality. This change adds zero padding to ensure just a single schedule space is used and the translation from a union_map to an isl_multi_union_pw_aff does not fail. llvm-svn: 243849	2015-08-02 13:30:33 +00:00
Tobias Grosser	45e7944bcf	Only use instructions as insert locations for SCEVExpander SCEVExpander, which we are using during code generation, only allows instructions as insert locations, but breaks in case BasicBlock->end() iterators are passed to it due to it trying to obtain the basic block in which code should be generated by calling Instruction->getParent(), which is not defined for ->end() iterators. This change adds an assert to Polly that ensures we only pass valid instructions to SCEVExpander and it fixes one case, where we used IRBuilder->SetInsertBlock() to set an ->end() insert location which was later passed to SCEVExpander. In general, Polly is always trying to build up the CFG first, before we actually insert instructions into the CFG sceleton. As a result, each basic block should already have at least one branch instruction before we start adding code. Hence, always requiring the IRBuilder insert location to be set to a real instruction should always be possible. Thanks Utpal Bora <cs14mtech11017@iith.ac.in> for his help with test case reduction. llvm-svn: 243830	2015-08-01 09:07:57 +00:00
Duncan P. N. Exon Smith	c51714a0c6	Fix polly tests after LLVM IR change in r243774 llvm-svn: 243801	2015-07-31 23:58:50 +00:00
Tobias Grosser	80e237bd53	Do not detect scops that are delinearized to arrays with "undef" size Such codes are not interesting to optimize and most likely never appear in the normal compilation flow. However, they show up during test case reduction with bugpoint and trigger -- without this change -- an assert in polly::MemoryAccess::foldAccess(). It is better to detect them in ScopDetection itself and just bail out. Contributed-by: Utpal Bora <cs14mtech11017@iith.ac.in> Reviewers: grosser Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11425 llvm-svn: 243515	2015-07-29 13:52:05 +00:00
Tobias Grosser	b241d928bd	Rewrite getPrevectorMap using schedule trees operations Schedule trees are a lot easier to work with, for both humans and machines. For humans the more structured schedule representation is easier to reason about. Together with the more abstract isl programming interface this can result in a lot cleaner code (see this changeset). For machines, the structured schedule and the fact that we now use explicit piecewise affine expressions instead of integer maps makes it easier to generate code from this schedule tree. As a result, we can already see a slight compile-time improvement -- for 3mm from 0m0.593s to 0m0.551s seconds (-7 %). More importantly, future optimizations such as full-partial tile separation will most likely result in more streamlined code to be generated. Contributed-by: Roman Gareev <gareevroman@gmail.com> llvm-svn: 243458	2015-07-28 18:03:36 +00:00
Tobias Grosser	922452285a	Keep track of ScopArrayInfo objects that model PHI node storage Summary: When translating PHI nodes into memory dependences during code generation we require two kinds of memory. 'Normal memory' as for all scalar dependences and 'PHI node memory' to store the incoming values of the PHI node. With this patch we now mark and track these two kinds of memories, which we previously incorrectly marked as a single memory object. Being aware of PHI node storage makes code generation easier, as we do not need to guess what kind of storage a scalar reference requires. This simplifies the code nicely. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11554 llvm-svn: 243420	2015-07-28 14:53:44 +00:00
Tobias Grosser	3b10c94062	Prevectorize the schedule of the band (or the point loop in case of tiling) Contributed-by: Roman Gareev <gareevroman@gmail.com> llvm-svn: 243214	2015-07-25 12:28:56 +00:00
Michael Kruse	be16d22f04	Normalize whitespace in makefiles Tabs for rules and space for line continuations. llvm-svn: 243179	2015-07-24 23:30:31 +00:00
Michael Kruse	1bbe346cef	Make the lit configuration Python 3 compatible by using the same techniques as LLVM's lit configuration. llvm-svn: 243154	2015-07-24 20:33:22 +00:00
Michael Kruse	5e9f249c3e	Add LICM test cases These test cases check whether Polly still gives the same results if LICM runs before. Currently, it does not and therefore these cases are expected fails. llvm-svn: 243037	2015-07-23 20:05:11 +00:00
Johannes Doerfert	338b42c329	Removed redundant alias checks generated during run time. As specified in PR23888, run-time alias check generation is expensive in terms of compile-time. This reduces the compile time by computing minimal/maximal access only once for each base pointer Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 243024	2015-07-23 17:04:54 +00:00
Michael Kruse	6362f5aa0b	Unify FOLDER property of Polly targets Put all Polly targets into a single "Polly" category (i.e. solution folder). Previously there was no recognizable scheme and most categories contained just one or two targets or targets didn't belong to any category. Reviewers: grosser llvm-svn: 242779	2015-07-21 12:40:01 +00:00
Tobias Grosser	808cd69a92	Use schedule trees to represent execution order of statements Instead of flat schedules, we now use so-called schedule trees to represent the execution order of the statements in a SCoP. Schedule trees make it a lot easier to analyze, understand and modify properties of a schedule, as specific nodes in the tree can be choosen and possibly replaced. This patch does not yet fully move our DependenceInfo pass to schedule trees, as some additional performance analysis is needed here. (In general schedule trees should be faster in compile-time, as the more structured representation is generally easier to analyze and work with). We also can not yet perform the reduction analysis on schedule trees. For more information regarding schedule trees, please see Section 6 of https://lirias.kuleuven.be/handle/123456789/497238 llvm-svn: 242130	2015-07-14 09:33:13 +00:00
Tobias Grosser	16c4403a91	Make non-affine statement names isl compatible Named isl sets can generally have any name if they remain within Polly, but only certain strings can be parsed by isl. The new names we create ensure that we can always copy-past isl strings from Polly to other isl tools, e.g. for debugging. llvm-svn: 241787	2015-07-09 07:31:45 +00:00
Tobias Grosser	1b13ddea50	Add first support to delinearize A[t%2][i][j] This is very preliminary support, but it seems to work for the most common case. When observing more/different test cases, we can work on generalizing this. llvm-svn: 240955	2015-06-29 14:44:22 +00:00
Tobias Grosser	af4e809ca6	Remove code for scalar and PHI to array translation This removes old code that has been disabled since several weeks and was hidden behind the flags -disable-polly-intra-scop-scalar-to-array=false and -polly-model-phi-nodes=false. Earlier, Polly used to translate scalars and PHI nodes to single element arrays, as this avoided the need for their special handling in Polly. With Johannes' patches adding native support for such scalar references to Polly, this code is not needed any more. After this commit both -polly-prepare and -polly-independent are now mostly no-ops. Only a couple of simple transformations still remain, but they are scheduled for removal too. Thanks again to Johannes Doerfert for his nice work in making all this code obsolete. llvm-svn: 240766	2015-06-26 07:31:18 +00:00
Tobias Grosser	50165ffdee	Add support for srem instruction Remainder operations with constant divisor can be modeled as quasi-affine expression. This patch adds support for detecting and modeling them. We also add a test that ensures they are correctly code generated. This patch was extracted from a larger patch contributed by Johannes Doerfert in http://reviews.llvm.org/D5293 llvm-svn: 240518	2015-06-24 04:13:29 +00:00
Tobias Grosser	a608569856	Replace srem by function call in nonaffine test cases This makes the test cases nonaffine even if Polly some days gains support for the srem instruction, an instruction which is currently not modeled but which can clearly be modeled statically. A call to a function without definition will always remain non-affine, as there is just insufficient static information for it to be modeled more precisely. llvm-svn: 240458	2015-06-23 20:55:05 +00:00
Tobias Grosser	aa9f575ae1	Adjust to personality function change in 239940 llvm-svn: 239992	2015-06-18 05:02:11 +00:00
Tobias Grosser	8199c722c7	Disable output for test case that does not need output llvm-svn: 239060	2015-06-04 17:59:51 +00:00
Tobias Grosser	22adfb4373	Mark sdivs as 'exact' instead of lowering them ourselves LLVM's instcombine already translates power-of-two sdivs that are known to be exact to fast ashr instructions. Hence, there is no need to add this logic ourselves. Pointed-out-by: Johannes Doerfert llvm-svn: 239025	2015-06-04 07:45:09 +00:00
Tobias Grosser	5cf7860704	Ensure memory access mappings are defined for full domain We now verify that memory access functions imported via JSON are indeed defined for the full iteration domain. Before this change we accidentally imported memory mappings such as i -> i / 127, which only defined a mapped for values of i that are evenly divisible by 127, but which did not define any mapping for the remaining values, with the result that isl just generated an access expression that had undefined behavior for all the unmapped values. In the incorrect test cases, we now either use floor(i/127) or we use p/127 and provide the information that p is indeed a multiple of 127. llvm-svn: 239024	2015-06-04 07:44:35 +00:00
Tobias Grosser	244c8297cf	Lower signed-divisions without rounding to ashr instructions llvm-svn: 238929	2015-06-03 15:14:58 +00:00
Tobias Grosser	cb73f150d4	Translate power-of-two floor-division into ashr Power-of-two floor divisions can be translated into an arithmetic shift operation. This allows us to replace a complex lowering that requires division operations: %pexp.fdiv_q.0 = sub i64 %21, 128 %pexp.fdiv_q.1 = add i64 %pexp.fdiv_q.0, 1 %pexp.fdiv_q.2 = icmp slt i64 %21, 0 %pexp.fdiv_q.3 = select i1 %pexp.fdiv_q.2, i64 %pexp.fdiv_q.1, i64 %21 %pexp.fdiv_q.4 = sdiv i64 %pexp.fdiv_q.3, 128 with a simple ashr: %polly.fdiv_q.shr = ashr i64 %21, 7 llvm-svn: 238905	2015-06-03 06:31:30 +00:00
Tobias Grosser	cdb38e5625	Exploit non-negative numerators isl marks known non-negative numerators in modulo (and soon also division) operations. We now exploit this by generating unsigned operations. This is beneficial as unsigned operations with power-of-two denominators will be translated by isl to fast bitshift or bitwise and operations. llvm-svn: 238577	2015-05-29 17:08:19 +00:00
Tobias Grosser	c825fae020	Tighten the PHI modeling test cases While looking through the test cases I realized we did not have a CHECK line for a duplicate memory access which we may want to eliminate later. To ensure we do not have (or later introduce) unnecessary memory accesses, we now tighten the test cases to look for such a pattern (and add the CHECK: line that shows the redundant memory access). llvm-svn: 238227	2015-05-26 18:05:45 +00:00
Tobias Grosser	268205939f	Make use of scalar/phi code generation explicit in the tests This ensures we pass all tests independently of how we set the options -disable-polly-intra-scop-scalar-to-array and -polly-model-phi-nodes. (At least if we enable both or disable both. Enabling them individually makes little sense, as they will hopefully disappear soon anyhow). llvm-svn: 238087	2015-05-23 03:34:35 +00:00
Johannes Doerfert	ecff11dcfb	Add scalar and phi code generation To reduce compile time and to allow more and better quality SCoPs in the long run we introduced scalar dependences and PHI-modeling. This patch will now allow us to generate code if one or both of those options are set. While the principle of demoting scalars as well as PHIs to memory in order to communicate their value stays the same, this allows to delay the demotion till the very end (the actual code generation). Consequently: - We __almost__ do not modify the code if we do not generate code for an optimized SCoP in the end. Thus, the early exit as well as the unprofitable option will now actually preven us from introducing regressions in case we will probably not get better code. - Polly can be used as a "pure" analyzer tool as long as the code generator is set to none. - The original SCoP is almost not touched when the optimized version is placed next to it. Runtime regressions if the runtime checks chooses the original are not to be expected and later optimizations do not need to revert the demotion for that part. - We will generate direct accesses to the demoted values, thus there are no "trivial GEPs" that select the first element of a scalar we demoted and treated as an array. Differential Revision: http://reviews.llvm.org/D7513 llvm-svn: 238070	2015-05-22 23:43:58 +00:00
Tobias Grosser	5db5d2da13	Use base-pointer address space when creating new access functions llvm-svn: 237785	2015-05-20 11:02:12 +00:00
Tobias Grosser	49ad36ca16	Add printing and testing to ScopArrayInfo Being here, we extend the interface to return the element type and not a pointer to the element type. We also provide a function to get the size (in bytes) of the elements stored in this array. We currently still store the element size as an innermost dimension in ScopArrayInfo, which is somehow inconsistent and should be addressed in future patches. llvm-svn: 237779	2015-05-20 08:05:31 +00:00
Sunil Srivastava	19be68f088	Changed renaming of local symbols by inserting a dot before the numeric suffix. Modified two test cases to adjust to the above change in renaming. These two files were causing the buildbot failure in Polly, #30204 for example. Details in http://reviews.llvm.org/D9483 This checkin goes with r237150 and r237151 llvm-svn: 237203	2015-05-12 22:44:24 +00:00
Tobias Grosser	09d3069740	Rename IslCodeGeneration to CodeGeneration Besides class, function and file names, we also change the command line option from -polly-codegen-isl to just -polly-codegen. The isl postfix is a leftover from the times when we still had the CLooG based -polly-codegen. Today it is just redundant and we drop it. llvm-svn: 237099	2015-05-12 07:45:52 +00:00
Tobias Grosser	3e6070ef03	Update isl to c3892bebc0 Various smaller improvements and bugfixes. llvm-svn: 236932	2015-05-09 09:37:30 +00:00
Johannes Doerfert	8983031b5e	[FIX] Invalid recognition of multidimensional access In the lnt benchmark MultiSource/Benchmarks/MallocBench/gs/gs with scalar and PHI modeling we detected the multidimensional accesses with sizes variant in the SCoP. This will check the sizes for validity. llvm-svn: 236395	2015-05-03 16:03:01 +00:00
Duncan P. N. Exon Smith	ddf3a0ef38	Update polly for LLVM rename of debug info metadata with DI* prefix Ran the same rename-md-di-prefix.sh script attached to PR23080 as in LLVM r236120 and CFE r236121. llvm-svn: 236127	2015-04-29 17:02:14 +00:00
Tobias Grosser	6325cd2fcd	Remove flag '-polly-annotate-alias-scopes' This option is enabled since a long time and there does not seem to be a situation in which we would not want to print alias scopes. Remove this option to reduce the set of command-line option combinations that may expose bugs. llvm-svn: 235861	2015-04-27 10:43:10 +00:00
Johannes Doerfert	8f8af43fef	Use all available range information for parameters In the following even full-range information will help to avoid runtime checks for wrapping integers, hence we enable it now. llvm-svn: 235823	2015-04-26 20:07:21 +00:00
Johannes Doerfert	d5d8f67dc5	Use the original no-wrap flags for normalized AddRecs llvm-svn: 235822	2015-04-26 19:55:21 +00:00
Tobias Grosser	173ecab705	Remove target triples from test cases I just learned that target triples prevent test cases to be run on other architectures. Polly test cases are until now sufficiently target independent to not require any target triples. Hence, we drop them. llvm-svn: 235384	2015-04-21 14:28:02 +00:00
Tobias Grosser	5483931117	Rename 'scattering' to 'schedule' In Polly we used both the term 'scattering' and the term 'schedule' to describe the execution order of a statement without actually distinguishing between them. We now uniformly use the term 'schedule' for the execution order. This corresponds to the terminology of isl. History: CLooG introduced the term scattering as the generated code can be used as a sequential execution order (schedule) or as a parallel dimension enumerating different threads of execution (placement). In Polly and/or isl the term placement was never used, but we uniformly refer to an execution order as a schedule and only later introduce parallelism. When doing so we do not talk about about specific placement dimensions. llvm-svn: 235380	2015-04-21 11:37:25 +00:00
Tobias Grosser	094999bb55	Drop unneccessary -basicaa passes in DependenceInfo test cases llvm-svn: 235374	2015-04-21 09:17:52 +00:00
David Blaikie	556ffb7806	[opaque pointer types] Explicit non-pointer type for call expressions (migration for recent LLVM change to textual IR for calls) llvm-svn: 235146	2015-04-16 23:24:52 +00:00
Johannes Doerfert	f8206cf6d4	Allow loops in non-affine subregions -- SCoP Modeling This will allow the ScopInfo to build the polyhedral representation for non-affine regions that contain loops. Such loops are basically not visible in the SCoP representation. Accesses that are variant in such loops are therefor represented as non-affine accesses. Differential Revision: http://reviews.llvm.org/D8153 llvm-svn: 234713	2015-04-12 22:58:40 +00:00
Johannes Doerfert	c3e91b4d51	[FIX] Change old diagnostic output llvm-svn: 234712	2015-04-12 22:53:33 +00:00
Johannes Doerfert	f3e98f44e3	Allow loops in non-affine subregions -- SCoP Detection This will allow the ScopDetection to detect non-affine regions that contain loops. All loops contained will be collected and are accessible to later passes in order to adjust the access functions. As the loops are non-affine and will not be part of the polyhedral representation later, all accesses that are variant in these loops have to be over approximated as non-affine accesses. They are therefore handled the same way as other non-affine accesses. Additionally, we do not count non-affine loops for the profitability heuristic, thus a region with only a non-affine loop will only be detected if the general detection of loop free regions is enabled. Differential Revision: http://reviews.llvm.org/D8152 llvm-svn: 234711	2015-04-12 22:52:20 +00:00
Duncan P. N. Exon Smith	7431fb0257	Upgrade testcases after LLVM r234181 Until r234181 we were silently upgrading old `@llvm.dbg` intrinsics. Fix testcases in polly that were relying on that. llvm-svn: 234192	2015-04-06 18:25:51 +00:00
Tobias Grosser	02cf69a6ed	Make -polly-no-tiling work again llvm-svn: 234125	2015-04-05 21:52:21 +00:00
Tobias Grosser	eb18649ead	Sign-extend in case of non-matching bitwidth This change ensures that we sign-extend integer types in case non-matching operands are encountered when generating a multi-dimensional access offset. This fixes http://llvm.org/PR23124 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 234122	2015-04-05 17:36:42 +00:00
Tobias Grosser	2a586c387b	Do not assume all multi-parameter products are affine As soon as one operand of the product is invalid, the entire product is invalid. This happens for example if one of the operands is not loop-invariant. This fixes http://llvm.org/PR23125 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com llvm-svn: 234119	2015-04-05 14:57:50 +00:00

1 2 3 4 5 ...

436 Commits