llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Kruse	11c5e07925	canSynthesize: Remove unused argument LI. NFC. The helper function polly::canSynthesize() does not directly use the LoopInfo analysis, hence remove it from its argument list. llvm-svn: 288144	2016-11-29 15:11:04 +00:00
Tobias Grosser	df8f35b7b8	Update for clang-format change in r288119 llvm-svn: 288134	2016-11-29 12:52:08 +00:00
Tobias Grosser	278f9e7d27	[ScopInfo] Use SCEVRewriteVisitor to simplify SCEVSensitiveParameterRewriter [NFC] llvm-svn: 287984	2016-11-26 17:58:40 +00:00
Tobias Grosser	b45ae5601b	[ScopDetect] Expand statistics of the detected scops We now collect: Number of total loops Number of loops in scops Number of scops Number of scops with maximal loop depth 1 Number of scops with maximal loop depth 2 Number of scops with maximal loop depth 3 Number of scops with maximal loop depth 4 Number of scops with maximal loop depth 5 Number of scops with maximal loop depth 6 and larger Number of loops in scops (profitable scops only) Number of scops (profitable scops only) Number of scops with maximal loop depth 1 (profitable scops only) Number of scops with maximal loop depth 2 (profitable scops only) Number of scops with maximal loop depth 3 (profitable scops only) Number of scops with maximal loop depth 4 (profitable scops only) Number of scops with maximal loop depth 5 (profitable scops only) Number of scops with maximal loop depth 6 and larger (profitable scops only) These statistics are certainly completely accurate as we might drop scops when building up their polyhedral representation, but they should give a good indication of the number of scops we detect. llvm-svn: 287973	2016-11-26 07:37:46 +00:00
Tobias Grosser	5c00b0dc74	[ScopDetectionDiagnostic] Collect statistics for each diagnostic type Our original statistics were added before we introduced a more fine-grained diagnostic system, but the granularity of our statistics has never been increased accordingly. This change introduces now one statistic counter per diagnostic to enable us to collect fine-grained statistics about who certain scops are not detected. In case coarser grained statistics are needed, the user is expected to combine counters manually. llvm-svn: 287968	2016-11-26 05:53:09 +00:00
Tobias Grosser	0dcbcaa98b	[ScopDetectionDiagnostic] IrreducibleRegion is a subclasses of CFG Reflect this correctly in the RejectReasonKind enum. The definition of RejectReasonKind::IrreducibleRegion was introduced in r258497, when we started to refuse regions containing irreducible loops. llvm-svn: 287965	2016-11-26 05:08:27 +00:00
Tobias Grosser	8c21b1a50f	[ScopDetectionDiagnostic] Remove leftover RejectReasonKind for Conditions [NFC] In r248118 some diagnostics for unstructured control flow have been removed, but the corresponding RejectReasonKind was accidentally not removed. This change removes it, as it is not needed any more. llvm-svn: 287964	2016-11-26 05:08:24 +00:00
Tobias Grosser	c64269ea1b	[ScopDectionDiagnostic] Use scoped enums instead three letter prefix [NFC] This improves readability of the code. llvm-svn: 287963	2016-11-26 03:44:31 +00:00
Hongbin Zheng	4ff1c3983d	Fix typo. llvm-svn: 287819	2016-11-23 21:59:33 +00:00
Tobias Grosser	eab0943ec0	Update to isl-0.17.1-284-gbb38638 Regular maintenance update with only minor changes. llvm-svn: 287703	2016-11-22 21:31:59 +00:00
Tobias Grosser	b3c3d149b9	[CodeGen] Add flag to code-generate most memory access expressions Introduce the new flag -polly-codegen-generate-expressions which forces Polly to code generate AST expressions instead of using our SCEV based access expression generation even for cases where the original memory access relation was not changed and the SCEV based access expression could be code generated without any issue. This is an experimental option for better testing the isl ast expression generation. The default behavior of Polly remains unchanged. We also exclude a couple of cases for which the AST expression is not yet working. llvm-svn: 287694	2016-11-22 20:21:16 +00:00
Tobias Grosser	d51a945f38	[test] Simplify test case by removing unreferenced instructions [NFC] Drop instructions that do not influence the memory impact of a basic block. They are not needed to reproduce the original bug (verified) and will cause random test noise if we would decide to only model the instructions that have visible side-effects. llvm-svn: 287626	2016-11-22 07:18:57 +00:00
Tobias Grosser	88c025e82e	[test] Ensure important basic blocks in test case have side effects Add two store instructions at the end of basic blocks that are required to reproduce the original bug to ensure we always process and model these basic blocks. This makes this test case stable even in case we would decide to bail out early of basic blocks which do not modify the global state. Also add additional check lines to verify how we model the basic block. llvm-svn: 287625	2016-11-22 07:06:59 +00:00
Tobias Grosser	07ce9a0bcc	test: add more details to non-affine test case We add CHECK lines to this test case to make it easier to see the difference between affine and non-affine memory accesses. We also change the test case to use a parameteric index expression as otherwise our range analysis will understand that the non-affine memory access can only access input[1], which makes it difficult to see that the memory access is in-fact modeled as non-affine access. llvm-svn: 287623	2016-11-22 06:28:08 +00:00
Hongbin Zheng	d559da84af	Update comment for r287566 llvm-svn: 287572	2016-11-21 20:27:55 +00:00
Hongbin Zheng	dac3808873	Fix format llvm-svn: 287568	2016-11-21 20:17:00 +00:00
Hongbin Zheng	a8fb73fc0b	Split ScopInfo::addScopStmt into two versions. NFC One for adding statement for region, another one for BB llvm-svn: 287566	2016-11-21 20:09:40 +00:00
Hongbin Zheng	3ffa6f40b0	Minor change llvm-svn: 287562	2016-11-21 19:26:10 +00:00
Tobias Grosser	1f0236d8e5	[ScopDetect] Use mayReadOrWriteMemory to shorten condition llvm-svn: 287525	2016-11-21 09:07:30 +00:00
Tobias Grosser	b94e9b31d0	[ScopDetect] Remove unnecessary namespace qualifier llvm-svn: 287524	2016-11-21 09:04:45 +00:00
Johannes Doerfert	81aa6e882f	[NFC] Adjust naming scheme of statistic variables Suggested-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 287347	2016-11-18 14:37:08 +00:00
Johannes Doerfert	6cd59e9076	Probably overwritten loads should not be considered hoistable Do not assume a load to be hoistable/invariant if the pointer is used by another instruction in the SCoP that might write to memory and that is always executed. llvm-svn: 287272	2016-11-17 22:25:17 +00:00
Johannes Doerfert	50dfbc572a	[NFC] Add flag to disable error block assumptions The declaration as an "error block" is currently aggressive and not very smart. This patch allows to disable error blocks completely. This might be useful to prevent SCoP expansion to a point where the assumed context becomes infeasible, thus the SCoP has to be discarded. llvm-svn: 287271	2016-11-17 22:16:35 +00:00
Johannes Doerfert	c97654681e	[FIX] Do not try to hoist memory intrinsic Since we do not necessarily treat memory intrinsics as non-affine anymore, we have to check for them explicitly before we try to hoist an access. llvm-svn: 287270	2016-11-17 22:11:56 +00:00
Johannes Doerfert	b3265a3612	[NFC] Skip over trivial assumptions Filter trivial assumptions, thus assume { : } or restrict { : 0 = 1 }, as they clutter the user output as well as the statistics. llvm-svn: 287269	2016-11-17 22:08:40 +00:00
Johannes Doerfert	dae2e9287d	[DBG] Collect statistics about actually versioned SCoPs llvm-svn: 287267	2016-11-17 21:55:43 +00:00
Johannes Doerfert	8c5464a715	[DBG] Allow to emit the RTC value at runtime The new command line flag "polly-codegen-emit-rtc-print" can be used to place a "printf" in the generated code that will print the RTC value and the overflow state. llvm-svn: 287265	2016-11-17 21:49:19 +00:00
Johannes Doerfert	cfadb2293f	[DBG] Collect statistics about statically infeasible SCoPs llvm-svn: 287263	2016-11-17 21:44:47 +00:00
Johannes Doerfert	cd195326bf	[DBG] Collect statistics about taken assumptions llvm-svn: 287261	2016-11-17 21:41:08 +00:00
Tobias Grosser	06e1592663	Update to isl-0.17.1-267-gbf9723d This update corrects an incorrect generation of min/max expressions in the isl AST generator and a problematic nullptr dereference. llvm-svn: 287098	2016-11-16 11:06:47 +00:00
Tobias Grosser	26be8e99b6	[ScopBuilder] Drop unnecessary namespace identifiers [NFC] llvm-svn: 286781	2016-11-13 21:28:13 +00:00
Tobias Grosser	5743e8de86	[SCEVAffinator] Do not scan redundantly for parameters In r286430 "SCEVValidator: add new parameters resulting from constant extraction" we added functionality to scan for parameters after constant extraction has taken place to ensure newly created parameters are correctly registered. This addition made the already existing registration of parameters redundant. Hence, we remove the corresponding call in this commit. An alternative solution would have been to also perform constant extraction when validating SCEV expressions and to then scan for parameters when validating a SCEV expression. However, as SCEV validation is used during SCoP detection where we want to be especially fast, adding additional functionality on this hot path should be avoided if good alternatives exist. In this case, we can choose to continue to only transform SCEV expression when actually modeling them. As all transformations we perform are expected to not change the validity of the SCEV expressions, this solution seems preferable. Suggested-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286780	2016-11-13 21:28:07 +00:00
Tobias Grosser	70d2709b1a	[ScopDetect] Conservatively handle inaccessible memory alias attributes Commit r286294 introduced support for inaccessiblememonly and inaccessiblemem_or_argmemonly attributes to BasicAA, which we need to support to avoid undefined behavior. This change just refuses all calls which are annotated with these attributes, which is conservatively correct. In the future we may consider to model and support such function calls in Polly. llvm-svn: 286771	2016-11-13 19:27:24 +00:00
Tobias Grosser	a9cac6a732	[tests] Adjust test output to recent changed SCEV canonocalization [NFC] LLVM recently changed the SCEV canonicalization which changed the output of one of our GPGPU test cases. llvm-svn: 286770	2016-11-13 19:27:17 +00:00
Tobias Grosser	a2f8fa33aa	[ScopDetect] Evaluate and verify branches at branch condition, not icmp The validity of a branch condition must be verified at the location of the branch (the branch instruction), not the location of the icmp that is used in the branch instruction. When verifying at the wrong location, we may accept an icmp that is defined within a loop which itself dominates, but does not contain the branch instruction. Such loops cannot be modeled as we only introduce domain dimensions for surrounding loops. To address this problem we change the scop detection to evaluate and verify SCEV expressions at the right location. This issue has been around since at least r179148 "scop detection: properly instantiate SCEVs to the place where they are used", where we explicitly set the scope to the wrong location. Before this commit the scope was not explicitly set, which probably also resulted in the scope around the ICmp to be choosen. This resolves http://llvm.org/PR30989 Reported-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286769	2016-11-13 19:27:04 +00:00
Tobias Grosser	f67433abd9	SCEVAffinator: pass parameter-only set to addRestriction if BB=nullptr Assumptions can either be added for a given basic block, in which case the set describing the assumptions is expected to match the dimensions of its domain. In case no basic block is provided a parameter-only set is expected to describe the assumption. The piecewise expressions that are generated by the SCEVAffinator sometimes have a zero-dimensional domain (e.g., [p] -> { [] : p <= -129 or p >= 128 }), which looks similar to a parameter-only domain, but is still a set domain. This change adds an assert that checks that we always pass parameter domains to addAssumptions if BB is empty to make mismatches here fail early. We also change visitTruncExpr to always convert to parameter sets, if BB is null. This change resolves http://llvm.org/PR30941 Another alternative to this change would have been to inspect all code to make sure we directly generate in the SCEV affinator parameter sets in case of empty domains. However, this would likely complicate the code which combines parameter and non-parameter domains when constructing a statement domain. We might still consider doing this at some point, but as this likely requires several non-local changes this should probably be done as a separate refactoring. Reported-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286444	2016-11-10 11:44:10 +00:00
Tobias Grosser	d0b9173caa	IslAst: always use the context during ast generation Providing the context to the ast generator allows for additional simplifcations and -- more importantly -- allows to generate loops with only partially bounded domains, assuming the domains are bounded for all parameter configurations that are valid as defined by the context. This change fixes the crash reported in http://llvm.org/PR30956 The original reason why we did not include the context when generating an AST was that CLooG and later isl used to sometimes transfer some of the constraints that bound the size of parameters from the context into the generated AST. This resulted in operations with very large constants, which sometimes introduced problematic integer overflows. The latest versions of the isl AST generator are careful to not introduce such constants. Reported-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286442	2016-11-10 09:39:58 +00:00
Tobias Grosser	4d543d654a	SCEVValidator: add new parameters resulting from constant extraction When extracting constant expressions out of SCEVs, new parameters may be introduced, which have not been registered before. This change scans SCEV expressions after constant extraction again to make sure newly introduced parameters are registered. We may for example extract the constant '8' from the expression '((8 * ((%a * %b) + %c)) + (-8 * %a))' and obtain the expression '(((-1 + %b) * %a) + %c)'. The new expression has a new parameter '(-1 + %b) * %a)', which was not registered before, but must be registered to not crash. This closes http://llvm.org/PR30953 Reported-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286430	2016-11-10 06:45:28 +00:00
Tobias Grosser	bbaeda3fe5	Do not allow switch statements in loop latches In r248701 "Allow switch instructions in SCoPs" support for switch statements has been introduced, but support for switch statements in loop latches was incomplete. This change completely disables switch statements in loop latches. The original commit changed addLoopBoundsToHeaderDomain to support non-branch terminator instructions, but this change was incorrect: it added a check for BI != null to the if-branch of a condition, but BI was used in the else branch es well. As a result, when a non-branch terminator instruction is encounted a nullptr dereference is triggered. Due to missing test coverage, this bug was overlooked. r249273 "[FIX] Approximate non-affine loops correctly" added code to disallow switch statements for non-affine loops, if they appear in either a loop latch or a loop exit. We adapt this code to now prohibit switch statements in loop latches even if the control condition is affine. We could possibly add support for switch statements in loop latches, but such support should be evaluated and tested separately. This fixes llvm.org/PR30952 Reported-by: Eli Friedman <efriedma@codeaurora.org> llvm-svn: 286426	2016-11-10 05:20:29 +00:00
Tobias Grosser	eba86a1208	ScopInfo: only run code needed for ASSERT in DEBUG mode Suggested-by: Johannes Doerfert llvm-svn: 286338	2016-11-09 04:24:49 +00:00
Tobias Grosser	a8ca3ed06a	SCEVValidator: reduce indentation to increase readability [NFC] llvm-svn: 286217	2016-11-08 07:17:48 +00:00
Tobias Grosser	16480186f8	IslNodeBuilder: Ensure newly generated memory accesses are well-defined Add some additional asserts that ensure newly code-generated memory accesses are defined on all domain and schedule domain instances. llvm-svn: 286050	2016-11-05 21:46:01 +00:00
Tobias Grosser	744740ad91	ScopInfo: Ensure copy statement memory accesses are correct Add asserts that verify that the memory accesses of a new copy statement are defined for all domain instances the copy statement is defined for. llvm-svn: 286047	2016-11-05 21:02:43 +00:00
Tobias Grosser	9321e208e8	Update isl to isl-0.17.1-243-g24c0339 This introduces big-endian support in imath and resolves http://llvm.org/PR24632. llvm-svn: 285993	2016-11-04 11:56:48 +00:00
Hongbin Zheng	ada8544dfb	Remove POLLY_LINK_LIBS, it is not used llvm-svn: 285976	2016-11-04 00:32:32 +00:00
Michael Kruse	e1dc387731	[ScopInfo] Fix isl object leak. Fix return from function without releasing isl objects, which was introduced in r269055. llvm-svn: 285924	2016-11-03 15:19:41 +00:00
Eli Friedman	acf8006471	[Polly CodeGen] Break critical edge from RTC to original loop. This makes polly generate a CFG which is closer to what we want in LLVM IR, with a loop preheader for the original loop. This is just a cleanup, but it exposes some fragile assumptions. I'm not completely happy with the changes related to expandCodeFor; RTCBB->getTerminator() is basically a random insertion point which happens to work due to the way we generate runtime checks. I'm not sure what the right answer looks like, though. Differential Revision: https://reviews.llvm.org/D26053 llvm-svn: 285864	2016-11-02 22:32:23 +00:00
Eli Friedman	b9c6f01a81	[ScopInfo] Make memset etc. affine where possible. We don't actually check whether a MemoryAccess is affine in very many places, but one important one is in checks for aliasing. Differential Revision: https://reviews.llvm.org/D25706 llvm-svn: 285746	2016-11-01 20:53:11 +00:00
Eli Friedman	6768285dcc	Add missing test from r284848. Original commit title: [SCEVAffinator] Make precise modular math more correct. llvm-svn: 285745	2016-11-01 20:45:28 +00:00
Tobias Grosser	ebb626e4b7	[ScopDetect] Use SCEVRewriteVisitor to simplify SCEVRemoveSMax rewriter ScalarEvolution got at some pointer a SCEVRewriteVisitor. Use it to simplify our SCEVRemoveSMax visitor. llvm-svn: 285491	2016-10-29 06:19:34 +00:00
Michael Kruse	426e6f71f8	[ScopInfo] Fix: use raw source pointer. When adding an llvm.memcpy instruction to AliasSetTracker, it uses the raw source and target pointers which preserve bitcasts. MemAccInst::getPointerOperand() also returns the raw target pointers, but Scop::buildAliasGroups() did not for the source pointer. This lead to mismatches between AliasSetTracker and ScopInfo on which pointer to use. Fixed by also using raw pointers in Scop::buildAliasGroups(). llvm-svn: 285071	2016-10-25 13:37:43 +00:00
Mandeep Singh Grang	3642dd3ca4	[polly] Change SmallPtrSet which is being iterated to SmallSetVector in ScopInfo.h Summary: This will avoid non-deterministic iteration order. Reviewers: grosser, jdoerfert, zinob, mgrang Subscribers: #polly Tags: #polly Differential Revision: https://reviews.llvm.org/D25880 llvm-svn: 284883	2016-10-21 21:00:11 +00:00
Eli Friedman	286c5a76ba	[SCEVAffinator] Make precise modular math more correct. Integer math in LLVM IR is modular. Integer math in isl is arbitrary-precision. Modeling LLVM IR math correctly in isl requires either adding assumptions that math doesn't actually overflow, or explicitly wrapping the math. However, expressions with the "nsw" flag are special; we can pretend they're arbitrary-precision because it's undefined behavior if the result wraps. SCEV expressions based on IR instructions with an nsw flag also carry an nsw flag (roughly; actually, the real rule is a bit more complicated, but the details don't matter here). Before this patch, SCEV flags were also overloaded with an additional function: the ZExt code was mutating SCEV expressions as a hack to indicate to checkForWrapping that we don't need to add assumptions to the operand of a ZExt; it'll add explicit wrapping itself. This kind of works... the problem is that if anything else ever touches that SCEV expression, it'll get confused by the incorrect flags. Instead, with this patch, we make the decision about whether to explicitly wrap the math a bit earlier, basing the decision purely on the SCEV expression itself, and not its users. Differential Revision: https://reviews.llvm.org/D25287 llvm-svn: 284848	2016-10-21 18:08:02 +00:00
Mandeep Singh Grang	48e7add80f	[polly] Change SmallPtrSet which are being iterated into SmallSetVector Summary: Otherwise the lack of an iteration order results in non-determinism in codegen. Reviewers: _jdoerfert, zinob, grosser Tags: #polly Differential Revision: https://reviews.llvm.org/D25863 llvm-svn: 284845	2016-10-21 17:29:10 +00:00
Michael Kruse	3d0266b664	[cmake] Avoid warnings in feature tests. NFC. Apply the __attribute__((unused)) before the function to unambiguously apply to the function declaration. Add more casts-to-void to mark return values unused as intended. Contributed-by: Andy Gibbs <andyg1001@hotmail.co.uk> llvm-svn: 284718	2016-10-20 11:16:19 +00:00
Tobias Grosser	69ed8542f5	Update isl to isl-0.17.1-236-ga9c6cc7 This includes isl_id_to_str, which is used in Michael's upcoming DeLICM patch. llvm-svn: 284689	2016-10-20 01:59:24 +00:00
Mandeep Singh Grang	5b1abfc88e	[polly] Fix non-determinism in polly BlockGenerators Summary: Iterating over SeenBlocks which is a SmallPtrSet results in non-determinism in codegen Reviewers: jdoerfert, zinob, grosser Tags: #polly Differential Revision: https://reviews.llvm.org/D25778 llvm-svn: 284622	2016-10-19 17:56:49 +00:00
Michael Kruse	6b87504973	[test] Fix buildbot after SCEV change. Update test after commit r284501: [SCEV] Make CompareValueComplexity a little bit smarter Contributed-by: Sanjoy Das <sanjoy@playingwithpointers.com> llvm-svn: 284543	2016-10-18 22:58:09 +00:00
Eli Friedman	3c1a75bf9c	Handle multi-dimensional invariant load. If the address of a load depends on another load, make sure to emit the loads in the right order. llvm-svn: 284426	2016-10-17 21:04:26 +00:00
Michael Kruse	6a19d592da	[ScopDetect] Depend transitively on ScalarEvolution. ScopDetection might be queried by -dot-scops or -view-scops passes for which it accesses ScalarEvolution. llvm-svn: 284385	2016-10-17 13:29:20 +00:00
Michael Kruse	2ddb279a39	[test] Add missing colon. llvm-svn: 284349	2016-10-16 22:05:51 +00:00
Michael Kruse	17d5090532	[cmake] Add polly-isl-test dependency to lit tests. Also handle the in-llvm-tree case forgotten in r284339. llvm-svn: 284347	2016-10-16 21:35:57 +00:00
Michael Kruse	8dee3427f7	[cmake] Add polly-isl-test dependency to lit tests. lit recursively iterates through the test subdirectories and finds the ISL unittest. For this test to work, the polly-isl-test executable needs to be compiled. Add the polly-isl-test dependency to POLLY_TEST_DEPS. This makes check-polly and check-polly-tests work from a fresh build directory. llvm-svn: 284339	2016-10-16 18:22:02 +00:00
Michael Kruse	f0c06900ed	[test] Add -polly-unprofitable-scalar-accs to test that needs it. The test non_affine_loop_used_later.ll also tests the profability heuristic. Add the option -polly-unprofitable-scalar-accs explicitely to ensure that the test succeeds if the default value is changed. llvm-svn: 284338	2016-10-16 18:13:01 +00:00
Tobias Grosser	4b5e24df9a	cmake: avoid "zero-length gnu_printf format string" warning in gcc 6.1.1 Contributed-by: Andy Gibbs <andyg1001@hotmail.co.uk> llvm-svn: 284302	2016-10-15 05:08:12 +00:00
Michael Kruse	fa53c86dc1	[ScopInfo/CodeGen] ExitPHI reads are implicit. Under some conditions MK_Value read accessed where converted to MK_ExitPHI read accessed. This is unexpected because MK_ExitPHI read accesses are implicit after the scop execution. This behaviour was introduced in r265261, which fixed a failed assertion/crash in CodeGen. Instead, we fix this failure in CodeGen itself. createExitPHINodeMerges(), despite its name, also handles accesses of kind MK_Value, only to skip them because they access values that are usually not PHI nodes in the SCoP region's exit block. Except in the situation observed in r265261. Do not convert value accessed to ExitPHI accesses and do not handle value accesses like ExitPHI accessed in CodeGen anymore. llvm-svn: 284023	2016-10-12 16:31:09 +00:00
Michael Kruse	c9edc2ee8d	[DepInfo] Print -debug output outside of max-operations scope. ISL tries to simplify the polyhedral operations before printing its objects. This increases the operations counter and therefore can contribute to hitting the operations limit. Therefore the result could be different when -debug output is enabled, making debugging harder. llvm-svn: 283745	2016-10-10 11:45:59 +00:00
Michael Kruse	8bfba1ff46	[Support/DepInfo] Introduce IslMaxOperationsGuard and make DepInfo use it. NFC. IslMaxOperationsGuard defines a scope where ISL may abort operations because if it takes too many operations. Replace the call to the raw ISL interface by a use of the guard. IslMaxOperationsGuard provides a uniform way to define a maximal computation time for a code region in C++ using RAII. llvm-svn: 283744	2016-10-10 11:45:54 +00:00
Tobias Grosser	b270288752	Fix formatting after recent cl:: changes This fixes 'make check-polly' llvm-svn: 283693	2016-10-09 08:31:35 +00:00
Mehdi Amini	732afdd09a	Turn cl::values() (for enum) from a vararg function to using C++ variadic template The core of the change is supposed to be NFC, however it also fixes what I believe was an undefined behavior when calling: va_start(ValueArgs, Desc); with Desc being a StringRef. Differential Revision: https://reviews.llvm.org/D25342 llvm-svn: 283671	2016-10-08 19:41:06 +00:00
Hongbin Zheng	5860aef675	Define PATH_MAX on windows Differential Revision: https://reviews.llvm.org/D25372 llvm-svn: 283600	2016-10-07 20:58:20 +00:00
Michael Kruse	de42b43de0	[cmake] Unify disabling upstream project warnings. Handle MSVC, ISL and PPCG in one place. The only functional change is that warnings are also disabled for MSVC compiling PPCG (Which currently fails anyway). llvm-svn: 283547	2016-10-07 12:38:32 +00:00
Michael Kruse	4b5f6af2dc	[cmake] Move isl_test artifacts to Polly folder. Folders in Visual Studio solutions help organize the build artifacts from all LLVM projects. There is a folder to keep Polly-built files in. llvm-svn: 283546	2016-10-07 12:38:24 +00:00
Tobias Grosser	e84ee850d1	Build and run isl_test as part of check-polly Running isl tests is important to gain confidence that the isl build we created works as expected. Besides the actual isl tests, there are also isl AST generation tests shipped with isl. This change only adds support for the isl unit tests. AST generation test support is left for a later commit. There is a choice to run tests directly through the build system or in the context of lit. We choose to run tests as part of lit to as this allows us to easily set environment variables, print output only on error and generally run the tests directly from the lit command. Reviewers: brad.king, Meinersbur Subscribers: modocache, brad.king, pollydev, beanz, llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D25155 llvm-svn: 283245	2016-10-04 19:48:40 +00:00
Michael Kruse	6ab4476835	[ScopInfo] Add -polly-unprofitable-scalar-accs option. With this option one can disable the heuristic that assumes that statements with a scalar write access cannot be profitably optimized. Such a statement instances necessarily have WAW-dependences to itself. With DeLICM scalar accesses can be changed to array accesses, which can avoid these WAW-dependence. llvm-svn: 283233	2016-10-04 17:33:39 +00:00
Michael Kruse	ca7cbcca37	[ScopInfo] Scalar access do not have indirect base pointers. ScopArrayInfo used to determine base pointer origins by looking up whether the base pointer is a load. The "base pointer" for scalar accesses is the llvm::Value being accessed. This is only a symbolic base pointer, it represents the alloca variable (.s2a or .phiops) generated for it at code generation. This patch disables determining base pointer origin for scalars. A test case where this caused a crash will be added in the next commit. In that test SAI tried to get the origin base pointer that was only declared later, therefore not existing. This is probably only possible for scalars used in PHINode incoming blocks. llvm-svn: 283232	2016-10-04 17:33:34 +00:00
Michael Kruse	9116899615	[ScopInfo] Make simplifySCoP() public. NFC. This function may need to be called after the scop construction. The upcoming DeLICM will use this to cleanup statement that all write accesses have been removed from. llvm-svn: 283221	2016-10-04 14:14:33 +00:00
Tobias Grosser	5652c9c830	isl: update to isl-0.17.1-233-gc911e6a llvm-svn: 283049	2016-10-01 19:46:51 +00:00
Michael Kruse	4b0c5aea78	[CodeGen] Add assertion for indirect array index expression generation. NFC. Currently Polly cannot generate code for index expressions if the base pointer is computed within the scop. The base pointer must be generated as well, but there is no code that triggers that. Add an assertion to detect when this would occur and miscompile. The IR verifier should catch it as well. llvm-svn: 282893	2016-09-30 18:29:37 +00:00
Michael Kruse	f6f795e9ae	[Support] Complete ISL annotations to IslPtr<>. NFC. Add missing __isl_(give/take/keep) annotations to IslPtr<> and NonowningIslPtr<> methods. Because IslPtr's constructor's annotation would depend on the TakeOwnership parameter, the parameter has been removed. Caller must copy the object themselves if the do not want to take ownership. llvm-svn: 282883	2016-09-30 17:47:39 +00:00
Michael Kruse	51f514d853	[Support] Compile fix for gcc. NFC. gcc 5.4 insists on template specialization to be in a namespace polly { ... } block, instead of being prefixed with 'polly::'. Error message: root/src/llvm/tools/polly/lib/Support/GICHelper.cpp:203:54: error: specialization of ‘template<class T> void polly::IslPtr<T>::dump() const’ in different namespace [-fpermissive] template <> void polly::IslPtr<isl_##TYPE>::dump() const { \ ^ msvc14 and clang 3.8 did not complain. llvm-svn: 282874	2016-09-30 16:47:43 +00:00
Michael Kruse	55519dad62	[Support] Add (Nonowning-)IslPtr::dump(). NFC. The dump() methods can be called from a debugger instead of e.g. isl_*_dump(Var.Obj) where Var is a variable of type IslPtr/NonowningIslPtr. To ensure that the existence of the function pointers do not depdend on whether the methods are used somwhere, they are declared with external linkage. llvm-svn: 282870	2016-09-30 16:10:19 +00:00
Michael Kruse	32312d0294	[Support] Call isl__free() only on non-null pointers. NFC. Add a non-NULL check before calling the free function into functions that are supposed to be inlined. First, this is a form of partial inlining of the free function, namely the nullptr test that free has to do. Secondly, and more importantly, it allows the compiler to remove the call to isl__free() when it knows that the object is nullptr, for instance because the last call is a take(). "Consuming" the last use of an ISL object using take() (instead of copy()) is a common pattern. llvm-svn: 282864	2016-09-30 15:29:43 +00:00
Michael Kruse	888ab55140	[CodeGen] Change 'Scalar' to 'Array' in method names. NFC. generateScalarLoad() and generateScalarStore() are used for explicit (MK_Array) memory accesses, therefore the method names were misleading. The names also were similar to generateScalarLoads() and generateScalarStores() (plural forms) which indeed handle scalar accesses. Presumbly, they were originally named to contrast VectorBlockGenerator::generateLoad(). Rename the two methods to generateArrayLoad(), respectively generateArrayStore(). llvm-svn: 282861	2016-09-30 14:34:05 +00:00
Michael Kruse	77394f1394	[CodeGen] Add assertion for partial scalar accesses. NFC. The code generator always adds unconditional LoadInst and StoreInst, hence the MemoryAccess must be defined over all statement instances. llvm-svn: 282853	2016-09-30 14:01:46 +00:00
Tobias Grosser	b110296011	www: Add Loopy publication llvm-svn: 282747	2016-09-29 18:17:30 +00:00
Tobias Grosser	ae91bc7353	www: add new code coverage link to Polly website llvm-svn: 282351	2016-09-25 08:03:38 +00:00
Tobias Grosser	349d1c3368	[ScopDetection] Remove redundant checks for endless loops Summary: Both `canUseISLTripCount()` and `addOverApproximatedRegion()` contained checks to reject endless loops which are now removed and replaced by a single check in `isValidLoop()`. For reporting such loops the `ReportLoopOverlapWithNonAffineSubRegion` is renamed to `ReportLoopHasNoExit`. The test case `ReportLoopOverlapWithNonAffineSubRegion.ll` is adapted and renamed as well. The schedule generation in `buildSchedule()` is based on the following assumption: Given some block B that is contained in a loop L and a SESE region R, we assume that L is contained in R or the other way around. However, this assumption is broken in the presence of endless loops that are nested inside other loops. Therefore, in order to prevent erroneous behavior in `buildSchedule()`, r265280 introduced a corresponding check in `canUseISLTripCount()` to reject endless loops. Unfortunately, it was possible to bypass this check with -polly-allow-nonaffine-loops which was fixed by adding another check to reject endless loops in `allowOverApproximatedRegion()` in r273905. Hence there existed two separate locations that handled this case. Thank you Johannes Doerfert for helping to provide the above background information. Reviewers: Meinersbur, grosser Subscribers: _jdoerfert, pollydev Differential Revision: https://reviews.llvm.org/D24560 Contributed-by: Matthias Reisinger <d412vv1n@gmail.com> llvm-svn: 281987	2016-09-20 17:05:22 +00:00
Tobias Grosser	122d6d74f6	Fix spelling in CMakeLists llvm-svn: 281897	2016-09-19 10:55:31 +00:00
Tobias Grosser	05ee64e67a	GPGPU: add missing REQUIRES line to test case llvm-svn: 281850	2016-09-18 08:57:38 +00:00
Tobias Grosser	bc653f2031	GPGPU: Do not run mostly sequential kernels in GPU In case sequential kernels are found deeper in the loop tree than any parallel kernel, the overall scop is probably mostly sequential. Hence, run it on the CPU. llvm-svn: 281849	2016-09-18 08:31:09 +00:00
Tobias Grosser	82f2af3508	GPGPU: Dynamically ensure 'sufficient compute' Offloading to a GPU is only beneficial if there is a sufficient amount of compute that can be accelerated. Many kernels just have a very small number of dynamic compute, which means GPU acceleration is not beneficial. We compute at run-time an approximation of how many dynamic instructions will be executed and fall back to CPU code in case this number is not sufficiently large. To keep the run-time checking code simple, we over-approximate the number of instructions executed in each statement by computing the volume of the rectangular hull of its iteration space. llvm-svn: 281848	2016-09-18 06:50:35 +00:00
Tobias Grosser	cfdee6582b	GPGPU: Make test cases independent of register numbering [NFC] llvm-svn: 281847	2016-09-18 06:50:28 +00:00
Tobias Grosser	51dfc27589	GPGPU: Store back non-read-only scalars We may generate GPU kernels that store into scalars in case we run some sequential code on the GPU because the remaining data is expected to already be on the GPU. For these kernels it is important to not keep the scalar values in thread-local registers, but to store them back to the corresponding device memory objects that backs them up. We currently only store scalars back at the end of a kernel. This is only correct if precisely one thread is executed. In case more than one thread may be run, we currently invalidate the scop. To support such cases correctly, we would need to always load and store back from a corresponding global memory slot instead of a thread-local alloca slot. llvm-svn: 281838	2016-09-17 19:22:31 +00:00
Tobias Grosser	fe74a7a1f5	GPGPU: Detect read-only scalar arrays ... and pass these by value rather than by reference. llvm-svn: 281837	2016-09-17 19:22:18 +00:00
Tobias Grosser	8f86a47461	Update CFGPrinter -> CFGPrinterLegacyPass .. to match recent changes in LLVM that broke the Polly compilation. llvm-svn: 281705	2016-09-16 05:48:09 +00:00
Tobias Grosser	aaabbbf886	GPGPU: Do not assume arrays start at 0 Our alias checks precisely check that the minimal and maximal accessed elements do not overlap in a kernel. Hence, we must ensure that our host <-> device transfers do not touch additional memory locations that are not covered in the alias check. To ensure this, we make sure that the data we copy for a given array is only the data from the smallest element accessed to the largest element accessed. We also adjust the size of the array according to the offset at which the array is actually accessed. An interesting result of this is: In case array are accessed with negative subscripts ,e.g., A[-100], we automatically allocate and transfer _more_ data to cover the full array. This is important as such code indeed exists in the wild. llvm-svn: 281611	2016-09-15 14:05:58 +00:00
Roman Gareev	b3224adfb6	Perform copying to created arrays according to the packing transformation This is the fourth patch to apply the BLIS matmul optimization pattern on matmul kernels (http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf). BLIS implements gemm as three nested loops around a macro-kernel, plus two packing routines. The macro-kernel is implemented in terms of two additional loops around a micro-kernel. The micro-kernel is a loop around a rank-1 (i.e., outer product) update. In this change we perform copying to created arrays, which is the last step to implement the packing transformation. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D23260 llvm-svn: 281441	2016-09-14 06:26:09 +00:00
Tobias Grosser	e8c69bbabd	cmake: PollyPPCG depends on PollyISL This line makes BUILD_SHARED_LIBS=ON work for Polly-ACC. Without it, ld complains about missing isl symbols when constructing the shared library. llvm-svn: 281396	2016-09-13 21:09:35 +00:00
Tobias Grosser	0a893f7df4	GPGPU: Use const_cast to avoid compiler warning [NFC] llvm-svn: 281333	2016-09-13 13:22:27 +00:00

1 2 3 4 5 ...

2823 Commits