llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	10da5a0ae7	createNextIterationMap from C to C++ interface Summary: update createNextIterationMap function to new C++ interface. Reviewers: grosser, Meinersbur, jdoerfert, bollu, cs15btech11044 Reviewed By: cs15btech11044 Subscribers: llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D47102 llvm-svn: 333113	2018-05-23 18:41:40 +00:00
Philip Pfaffe	c06a6380a0	[Acc] Re-land r326643 to finally fix PR33208. Other than before, don't clear out LI entirely but only those relevant loops. llvm-svn: 333089	2018-05-23 14:52:35 +00:00
Peter Collingbourne	9a45114b3c	CodeGen: Add a dwo output file argument to addPassesToEmitFile and hook it up to dwo output. Part of PR37466. Differential Revision: https://reviews.llvm.org/D47089 llvm-svn: 332881	2018-05-21 20:16:41 +00:00
Roman Lebedev	df4fed6fe7	[polly] Drop nonexistant LLVM_PLUGIN_EXPORT macro from llvmGetPassPluginInfo() Fixes build: /build/polly/lib/Support/RegisterPasses.cpp:709:80: error: expected ';' after top level declarator extern "C" ::llvm::PassPluginLibraryInfo LLVM_ATTRIBUTE_WEAK LLVM_PLUGIN_EXPORT ^ ; Was missed in rL332796 / D47082 llvm-svn: 332814	2018-05-19 19:16:35 +00:00
Eli Friedman	e6ed0323cc	[SCEVAffinator] BB can be null; don't use it to get the LLVMContext. Fixes post-commit review comment on r332309. llvm-svn: 332775	2018-05-18 21:57:44 +00:00
Michael Kruse	d6c2ca8dd2	[DeLICM] Avoid assertion on out-of-quota. An assertion was not prepared to be passed a nullptr because the out-of-quota limit was exceeded. Bail-out before the assertion since the assertion does not apply on out-of-quote. This fixes llvm.org/PR37477. llvm-svn: 332488	2018-05-16 16:39:51 +00:00
Philip Pfaffe	9375d57202	[ScopInfo] Remove usage of isl_set_n_basic_set() Summary: This patch aims to remove the usage of old C-styled isl functions (in this case `isl_set_n_basic_set()`) in favor of new C++ isl interface based methods in `ScopInfo.cpp`. Patch by Sahil Yerawar Differential Revision: https://reviews.llvm.org/D46935 llvm-svn: 332471	2018-05-16 14:05:03 +00:00
Philip Pfaffe	d477bb9a50	[SI] Create Scop Name lazily Summary: Creating the Scop name is expensive, because creating the Region name it's derived from is expensive. So create the name lazily, because getName() is actually called rarely. This is a reiteration of r328666, which introduced a use-after-free and got reverted in r331363. Differential Revision: https://reviews.llvm.org/D46868 llvm-svn: 332359	2018-05-15 14:53:25 +00:00
Nicola Zaghen	349506a926	[polly] Update uses of DEBUG macro to LLVM_DEBUG. The DEBUG() macro is very generic so it might clash with other projects. The renaming was done as follows: - git grep -l 'DEBUG' \| xargs sed -i 's/\bDEBUG\s\?(/LLVM_DEBUG(/g' - git diff -U0 master \| ../clang/tools/clang-format/clang-format-diff.py -i -p1 -style LLVM Differential Revision: https://reviews.llvm.org/D44978 llvm-svn: 332352	2018-05-15 13:37:17 +00:00
Eli Friedman	9ae56b9a0e	[SCEVAffinator] Fix handling of pwaff complexity limit. nullptr is not a valid affine expression, and none of the callers check for null, so we eventually hit an isl error and crash. Instead, invalidate the scop and return a constant zero. Differential Revision: https://reviews.llvm.org/D46445 llvm-svn: 332309	2018-05-14 23:05:43 +00:00
Michael Kruse	e330071b43	[ScopInfo] Remove bail out condition in buildMinMaxAccess(). The condition was introduced in r267142 to mitigate a long compile-time case. In r306087, a max-computation limit was introduced that should handle the same case while leaving the max disjuncts heuristic it should have replaced intact. Today, the max disjuncts bail-out causes problems in that it prematurely stops SCoPs from being detected, e.g. in SPEC's lbm. This would hit less like if isl_set_coalesce would be called after isl_set_remove_divs (which makes more basic_set likely to be coalescable) instead of before. This patch tries to remove the premature max-disjuncts bail-out condition by using simple_hull() to reduce the computational overhead, instead of directly invalidating that SCoP. Differential Revision: https://reviews.llvm.org/D45066 Contributed-by: Sahil Girish Yerawar <cs15btech11044@iith.ac.in> llvm-svn: 331891	2018-05-09 16:23:56 +00:00
Philip Pfaffe	e9ca17e9b6	Revert "[polly] [ScopInfo] Don't pre-compute the name of the Scop's region." This reverts commit 0f9dc03765dc301fff7a52e2a0e1dd3e5f3130c5, r328666. The change introduced a use-after-free, caused by the temporary name string being destroyed after converting it to a StringRef. llvm-svn: 331363	2018-05-02 14:55:39 +00:00
Tobias Grosser	e1cadf1722	Remove keep/take/give from isl C++ bindings These functions have been legacy leftovers which we used before the official C++ bindings existed. As all uses of these legacy functions have been removed, this polly-specific extension can also be dropped. llvm-svn: 331130	2018-04-29 00:57:43 +00:00
Tobias Grosser	8dae41a1cb	Remove another set or release() calls llvm-svn: 331129	2018-04-29 00:57:38 +00:00
Tobias Grosser	d3d3d6b75d	Remove the last uses of isl::give and isl::take llvm-svn: 331126	2018-04-29 00:28:26 +00:00
Tobias Grosser	da3e8c4ba7	[DeLICM] Remove uses of isl::give llvm-svn: 331122	2018-04-28 22:11:55 +00:00
Tobias Grosser	daf68ea309	[ZoneAlgo] Remove uses of isl::give - II llvm-svn: 331121	2018-04-28 22:11:48 +00:00
Tobias Grosser	2f549fd6a9	[ZoneAlgo] Remove uses of isl::give This moves more of Polly to islpp. llvm-svn: 331120	2018-04-28 21:22:17 +00:00
Tobias Grosser	77e871aaf5	[MaximalStaticExpansion] Replace copied function with version from ISLTools llvm-svn: 331118	2018-04-28 20:42:35 +00:00
Tobias Grosser	b58928096e	Update to latest version of the isl c++ bindings The delta to the previous version is rather small, but a change in brace placement makes this a rather noisy commit. llvm-svn: 331113	2018-04-28 16:02:30 +00:00
Michael Kruse	e819fffee3	[CodeGen] Print executed statement instances at runtime. Add the options -polly-codegen-trace-stmts and -polly-codegen-trace-scalars. When enabled, adds a call to the beginning of every generated statement that prints the executed statement instance. With -polly-codegen-trace-scalars, it also prints the value of all scalars that are used in the statement, and PHIs defined in the beginning of the statement. Differential Revision: https://reviews.llvm.org/D45743 llvm-svn: 330864	2018-04-25 19:43:49 +00:00
Michael Kruse	beffdb9daa	[ScopDetect] Reject loop with multiple exit blocks. The current statement domain derivation algorithm does not (always) consider that different exit blocks of a loop can have different conditions to be reached. From the code for (int i = n; ; i-=2) { if (i <= 0) goto even; if (i <= 1) goto odd; A[i] = i; } even: A[0] = 42; return; odd: A[1] = 21; return; Polly currently derives the following domains: Stmt_even_critedge Domain := [n] -> { Stmt_even_critedge[] }; Stmt_odd Domain := [n] -> { Stmt_odd[] : (1 + n) mod 2 = 0 and n > 0 }; while the domain for the odd case is correct, Stmt_even is assumed to be executed unconditionally, which is obviously wrong. While projecting out the loop dimension in `adjustDomainDimensions`, it does not consider that there are other exit condition that have matched before. I don't know a how to fix this without changing a lot of code. Therefore This patch rejects loops with multiple exist blocks to fix the miscompile of test-suite's uuencode. The odd condition is transformed by LLVM to %cmp1 = icmp eq i64 %indvars.iv, 1 such that the project_out in adjustDomainDimensions() indeed only matches for odd n (using this condition only, we'd have an infinite loop otherwise). The even condition manifests as %cmp = icmp slt i64 %indvars.iv, 3 Because buildDomainsWithBranchConstraints() does not consider other exit conditions, it has to assume that the induction variable will eventually be lower than 3 and taking this exit. IMHO we need to reuse the algorithm that determines the number of iterations (addLoopBoundsToHeaderDomain) to determine which exit condition applies first. It has to happen in buildDomainsWithBranchConstraints() because the result will need to propagate to successor BBs. Currently addLoopBoundsToHeaderDomain() just look for union of all backedge conditions (which means leaving not the loop here). The patch in llvm.org/PR35465 changes it to look for exit conditions instead. This is required because there might be other exit conditions that do not alternatively go back to the loop header. Differential Revision: https://reviews.llvm.org/D45649 llvm-svn: 330858	2018-04-25 18:53:33 +00:00
Tobias Grosser	5fa86378aa	Update isl to isl-0.19-114-g385262af llvm-svn: 330800	2018-04-25 06:10:35 +00:00
David Blaikie	60dc462b04	Fixup Polly for an LLVM header file change. llvm-svn: 330679	2018-04-24 02:23:41 +00:00
Tobias Grosser	6135b0fe83	Update isl to isl-0.19-107-gc4fe33d8 This is a regular maintenance update. llvm-svn: 330496	2018-04-21 08:34:22 +00:00
Michael Kruse	76238aac8b	[isl++] abort() on assertion violation. Before this patch, ISL_ASSERT only printed an error message to stderr. This can be easily missed if the program continues or just fails later. To fail-early and help error diagnostics (e.g. using bugpoint), call abort() when an assertion does not hold. I seem to just have forgotten to add this abort() when I originally proposed the ISL_ASSERT macro. Suggested-By: Eli Friedman <efriedma@codeaurora.org> Differential Revision: https://reviews.llvm.org/D45171 llvm-svn: 330467	2018-04-20 18:59:13 +00:00
Michael Kruse	5369ea5dd5	Allow arbitrary function calls for debugging purposes. Add the switch -polly-debug-func to define the name of a debug function. This function is ignored for any validity check. Its purpose is to allow to observe a value after transformation by a SCoP, and to follow which statements are executed in which order. For instance, consider the following code: static void dbg_printf(int sum, int i) { fprintf(stderr, "The value of sum is %d, i=%d\n", sum, i); fflush(stderr); } void func(int n) { int sum = 0; for (int i = 0; i < 16; i+=1) { sum += i; dbg_printf(sum, i); } } Executing this after Polly's codegen with -polly-debug-func=dbg_printf reveals the new execution order and the assumed values at that point of execution. Differential Revision: https://reviews.llvm.org/D45728 llvm-svn: 330466	2018-04-20 18:55:44 +00:00
Tobias Grosser	c49f115b27	[RuntimeDebugBuilder] Do not break for 64 bit integers In r330292 this assert was turned incorrectly into an unreachable, but the correct behavior (thanks Michael) is to assert for anything that is not 64 bit, but falltrough for 64 bit. I document this in the source code. llvm-svn: 330309	2018-04-19 05:38:12 +00:00
Tobias Grosser	b20ae44ed0	[RuntimeDebugBuilder] Turn assert into an unreachable llvm-svn: 330289	2018-04-18 20:18:43 +00:00
Tobias Grosser	fcc3ad5d3c	[ScopDetect / ScopInfo] Get statistics for scops without any loop correctly Make sure we also counts scops not containing any loops. llvm-svn: 330285	2018-04-18 20:03:36 +00:00
Philip Pfaffe	8da7d1d7ee	[NewPM] Update pass registration for the LLVM plugin interface Summary: As of rL329273, LLVM has a mechanism to load new-pm plugins in opt. Use this API in Polly. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser, Meinersbur Subscribers: lksbhm, bollu, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D45484 llvm-svn: 330181	2018-04-17 07:59:46 +00:00
Tobias Grosser	7bbacbf4ca	Revert r327216 'Add isl operator overloads for isl::pw_aff' This commit requires further discussions. llvm-svn: 329825	2018-04-11 16:58:08 +00:00
Michael Kruse	4485ae0890	[CodeGen] Allow undefined loads in statement instances outside context. A check in assert-builds was meant to verify that a load provides a value in all statement instances (i.e. its domain). The domain is commonly gist'ed within the parameter context to contain fewer constraints. However, statement instances outside the context are no valid executions, hence the value provided can be undefined. Refine the check for valid loads to only needed to be defined within the SCoP context. In addition, the JSONImporter had to be changed to allow importing access relations that are broader than the current access relation, but still defined over all statement instances. This should fix the compiler crash in test-suite's oggenc of the -polly-process-unprofitable buildbot. llvm-svn: 329655	2018-04-10 01:20:51 +00:00
Michael Kruse	388730c9e0	[CodeGen] Convert BlockGenerator::generateScalarLoads to isl++. NFC. llvm-svn: 329654	2018-04-10 01:20:47 +00:00
Michael Kruse	db6f71e48d	[ScopInfo] Avoid iterator invalidation. Commit r329640 introduced the removal of all MemoryAccesses of a Scop. It accidentally continued iterating over a vector whose iterators have been invalidated by a MemoryAccess removal. Make a copy of the MemoryAccesses to remove to iterate over while removing them. llvm-svn: 329653	2018-04-10 01:20:41 +00:00
Michael Kruse	192e7f72ca	[ScopInfo] Completely remove MemoryAccesses when their parent statement is removed. Removing a statement left its MemoryAccesses in some lists and maps of the SCoP. Which lists depends on at which phase of the SCoP construction the statement is deleted. Follow-up passes could still see the already deleted MemoryAccesses by iterating through these lists/maps, resulting in an access violation. When removing a ScopStmt, also remove all its MemoryAccesses by using the same mechnism that removes a MemoryAccess. llvm-svn: 329640	2018-04-09 23:13:05 +00:00
Michael Kruse	7de61668ae	[ScopInfo] Actually remove from list. std::remove, despite its name, does not remove elements from a list, but only moves them to the end of a list. Call erase() to shorten the vector to the remaining elements. Test case included in next commit. llvm-svn: 329639	2018-04-09 23:13:01 +00:00
Michael Kruse	df8e140349	Remove immediate dominator heuristic for error block detection. This patch removes the heuristic in - Polly :: lib/Support/ScopHelper.cpp The heuristic forces blocks that directly follow a loop header to not to be considered error blocks. It was introduced in r249611 with the following commit message: > This replaces the support for user defined error functions by a > heuristic that tries to determine if a call to a non-pure function > should be considered "an error". If so the block is assumed not to be > executed at runtime. While treating all non-pure function calls as > errors will allow a lot more regions to be analyzed, it will also > cause us to dismiss a lot again due to an infeasible runtime context. > This patch tries to limit that effect. A non-pure function call is > considered an error if it is executed only in conditionally with > regards to a cheap but simple heuristic. In the code below `CCK_Abort2()` would be considered as an error block, but not `CCK_Abort1()` due to this heuristic. ``` for (int i = 0; i < n; i+=1) { if (ErrorCondition1) CCK_Abort1(); // No __attribute__((noreturn)) if (ErrorCondition2) CCK_Abort2(); // No __attribute__((noreturn)) } ``` This does not seem useful. Checking error conditions in the beginning of some work is quite common. It causes a switch default-case to be not considered an error block in SPEC's cactuBSSN. The comment justifying the heuristic mentions a "load", which does not seem to be applicable here. It has been proposed to remove the heuristic. In addition, the patch fixes the following test cases: - Polly :: ScopDetect/mod_ref_read_pointer.ll - Polly :: ScopInfo/max-loop-depth.ll - Polly :: ScopInfo/mod_ref_access_pointee_arguments.ll - Polly :: ScopInfo/mod_ref_read_pointee_arguments.ll - Polly :: ScopInfo/mod_ref_read_pointer.ll - Polly :: ScopInfo/mod_ref_read_pointers.ll The test cases failed after removing the heuristic. Differential Revision: https://reviews.llvm.org/D45274 Contributed-by: Lorenzo Chelini <l.chelini@icloud.com> llvm-svn: 329548	2018-04-09 06:07:44 +00:00
Michael Kruse	ae180b95b0	Silence msvc warning on isl. NFC. The warning is: isl_union_map.c(2041): warning C4221: nonstandard extension used: 'filter_user': cannot be initialized using address of automatic variable 'data' for the following code (and others) struct isl_un_op_drop_user_data data = { &isl_set_is_wrapping }; struct isl_un_op_control control = { .filter = &un_op_filter_drop_user, .filter_user = &data, .fn_map = &isl_set_wrapped_domain_map, }; llvm-svn: 329328	2018-04-05 18:30:44 +00:00
Huihui Zhang	71e54ccd06	[Polly][IslAst] Fix minimal dependence distance. Summary: When checking the parallelism of a scheduling dimension, we first check if excluding reduction dependences the loop is parallel or not. If the loop is not parallel, then we need to return the minimal dependence distance of all data dependences, including the previously subtracted reduction dependences. Reviewers: grosser, Meinersbur, efriedma, eli.friedman, jdoerfert, bollu Reviewed By: Meinersbur Subscribers: llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D45236 llvm-svn: 329214	2018-04-04 18:08:13 +00:00
Reid Kleckner	757c8cf615	Fix polly build after r328717 llvm-svn: 328728	2018-03-28 19:56:26 +00:00
Eli Friedman	ac4ad45177	[polly] [ScopInfo] Don't pre-compute the name of the Scop's region. This gets very expensive for basic blocks which don't have a name: it calls printAsOperand, which numbers the entire module. We don't normally need the name anyway, though; it's only used for debug dumps, so don't compute it by default. Differential Revision: https://reviews.llvm.org/D44946 llvm-svn: 328666	2018-03-27 20:51:49 +00:00
David Blaikie	fd94eee3b9	Update for LLVM header movement llvm-svn: 328169	2018-03-21 23:21:10 +00:00
Tobias Grosser	3a99893618	Adjust to clang-format changes llvm-svn: 328005	2018-03-20 17:16:32 +00:00
Mandeep Singh Grang	daec0aa71f	[polly] Change std::sort to llvm::sort in response to r327219 Summary: r327219 added wrappers to std::sort which randomly shuffle the container before sorting. This will help in uncovering non-determinism caused due to undefined sorting order of objects having the same key. To make use of that infrastructure we need to invoke llvm::sort instead of std::sort. Reviewers: grosser, efriedma, jdoerfert, bollu, sebpop Reviewed By: sebpop Subscribers: sebpop, mehdi_amini, llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D44361 llvm-svn: 327361	2018-03-13 05:25:23 +00:00
Tobias Grosser	5fdbdeb542	Revert untested changes in SCEVAffinator llvm-svn: 327221	2018-03-10 19:15:23 +00:00
Tobias Grosser	a1da86b224	Add isl operator overloads for isl::pw_aff Piecewise affine expressions have directly corresponding mathematical operators. Introduce these operators as overloads as this makes writing code with isl::pw_aff expressions more directly readable. We can now write: A = B + C instead of A = B.add(C) llvm-svn: 327216	2018-03-10 18:07:03 +00:00
Tobias Grosser	b94863001a	[ScopInfo] Do not use the set dimension ids to carry loop information isl does not guarantee that set dimension ids will be preserved, so using them to carry information is not a good idea. Furthermore, the loop information can be derived without problem from the statement itself. As this even requires less code than propagating loop information on set dimension ids, starting from this commit we just derive the loop information in collectSurroundingLoops directly from the IR. Interestingly this also results in a couple of isl sets to take a simpler representation. llvm-svn: 326664	2018-03-03 19:27:54 +00:00
Philip Pfaffe	4d50ab86e6	Revert "[Acc] Fix for PR33208" This reverts commit r326643. Fix didn't really fix anything. llvm-svn: 326656	2018-03-03 15:34:49 +00:00
Philip Pfaffe	a8f7cc8ec9	[Acc] Fix for PR33208 During codegen, Polly attempts to clear all loops from ScalarEvolution and LoopInfo, and it does so one block at a time. This causes undefined behaviour, since this way a loop header might be removed from a loop before the entire loop is erased, causing ScalarEvolution to run into an error. Instead, just delete the entire loop atomically. This fixes currently failing testcases. llvm-svn: 326643	2018-03-03 10:47:37 +00:00
Tobias Grosser	de6b342e90	isl: "isl_schedule_get_map: handle trees with divergent filter node parameters" Also un-revert (isl_pw_*_alloc: add missing check for compatible spaces, Wed Sep 6 12:18:04 2017 +0200). This patch is a proposed fix to avoid asserts due to stricter space checking within isl, which resulted in failures when converting a schedule tree to a schedule map. llvm-svn: 326073	2018-02-26 09:26:41 +00:00
Tobias Grosser	718d04c653	Use isl::manage_copy to simplify calls to isl::manage(isl_.._copy()) As part of this cleanup a couple of unnecessary isl::manage(obj.copy()) pattern are eliminated as well. We checked for all potential cleanups by scanning for: "grep -R isl::manage\( lib/ \| grep copy" llvm-svn: 325558	2018-02-20 07:26:58 +00:00
Tobias Grosser	fa8079d0dc	Update isl to isl-0.18-1047-g4a20ef8 This update: - Removes several deprecated functions (e.g., isl_band). - Improves the pretty-printing of sets by detecting modulos and "false" equalities. - Minor improvements to coalescing and increased robustness of the isl scheduler. This update does not yet include isl commit isl-0.18-90-gd00cb45 (isl_pw_*_alloc: add missing check for compatible spaces, Wed Sep 6 12:18:04 2017 +0200), as this additional check is too tight and unfortunately causes two test case failures in Polly. A patch has been submitted to isl and will be included in the next isl update for Polly. llvm-svn: 325557	2018-02-20 07:26:42 +00:00
Tobias Grosser	85476dc45a	Fix broken isl-noexceptions.h path in update-isl script llvm-svn: 325556	2018-02-20 07:24:58 +00:00
Tobias Grosser	ba4257b187	Update isl C++ bindings to latest version of isl llvm-svn: 325555	2018-02-20 07:24:55 +00:00
Tobias Grosser	5f62fafadd	Do not call band_list().dump() This is in preparation for the removal of band_list from isl. llvm-svn: 325554	2018-02-20 07:24:40 +00:00
Michael Kruse	a6716d9d81	[ScopBuilder] scalar-indep: Fix mutually referencing PHIs. Two or more PHIs mutually using each other directly or indirectly as incoming value could cause that a PHI WRITE be added before the PHI READ (i.e. it overwrites the current incoming value with the next incoming value before it being read). Fix by ensuring that the PHI WRITE and PHI READ are in the same statement. This should fix the miscompile of SingleSource/Benchmark/Misc/whetstone from the test-suite. llvm-svn: 324934	2018-02-12 21:09:40 +00:00
Michael Kruse	a43ba2d84f	[ScopBuilder] Make -polly-stmt-granularity=scalar-indep the default. Splitting basic blocks into multiple statements if there are now additional scalar dependencies gives more freedom to the scheduler, but more statements also means higher compile-time complexity. Switch to finer statement granularity, the additional compile time should be limited by the number of operations quota. The regression tests are written for the -polly-stmt-granularity=bb setting, therefore we add that flag to those tests that break with the new default. Some of the tests only fail because the statements are named differently due to a basic block resulting in multiple statements, but which are removed during simplification of statements without side-effects. Previous commits tried to reduce this effect, but it is not completely avoidable. Differential Revision: https://reviews.llvm.org/D42151 llvm-svn: 324169	2018-02-03 06:59:47 +00:00
Michael Kruse	217704f7a8	[ScopInfo] Allow epilogues to be the main statement of a BB. Do not add a "_last" suffix to the statement name if there is no (other) main statement for a basic block. In other words, it becomes the main statement itself. This further reduces the statement naming difference between -polly-stmt-granularity=bb and -polly-stmt-granularity=scalar-indep. llvm-svn: 324168	2018-02-03 05:43:00 +00:00
Michael Kruse	1a745a4ef6	Run clang-format after r324003. NFC. llvm-svn: 324112	2018-02-02 18:11:58 +00:00
Benjamin Kramer	e65c7bbe8a	Update polly for r323999. llvm-svn: 324003	2018-02-01 20:49:53 +00:00
Michael Kruse	a230f22f4b	[ScopBuilder] Prefer PHI Write accesses in the statement the incoming value is defined. Theoretically, a PHI write can be added to any statement that represents the incoming basic block. We previously always chose the last because the incoming value's definition is guaranteed to be defined. With this patch the PHI write is added to the statement that defines the incoming value. It avoids the requirement for a scalar dependency between the defining statement and the statement containing the write. As such the logic for -polly-stmt-granularity=scalar-indep that ensures that there is such scalar dependencies can be removed. Differential Revision: https://reviews.llvm.org/D42147 llvm-svn: 323284	2018-01-23 23:56:36 +00:00
Michael Kruse	1ed2bc5266	[VirtualInst] Derive correct use kind of PHI operands. NFC. VirtualUse::create is only called for MemoryKind::Value, but its consistency nonetheless checked in verifyUses(). PHI uses are always inter-stmt dependencies, which was not considered by the constructor method. The virtual and non-virtual execution paths were the same, such that verifyUses did not encounter any inconsistencies. llvm-svn: 323283	2018-01-23 23:56:25 +00:00
Michael Kruse	9cfb0ac223	[ScopBuilder] Revise statement naming when there are multiple statements per BB. The goal is to have -polly-stmt-granularity=bb and -polly-stmt-granularity=scalar-indep to have the same names if there is just one statement per basic block. This fixes a fluke when Polybench's jacobi-2d is optimized differently depending on the -polly-stmt-granularity option, although both options create the same SCoP, just with different statement names. The new naming scheme is: With -polly-use-llvm-names=0: Stmt<BBIdx as decimal><Idx within BB as letter> With -polly-use-llvm-names=1: Stmt_BBName_<Idx within BB as letter> The <Idx within BB> suffix is omitted for the main statement of a BB. The main statement is either the one containing the first store or call (those cannot be removed by the simplifyer), or if there is no such instruction, the first. If after simplification there is just a single statement left, it should be the main statement and have the same names as with -polly-stmt-granularity=bb. Differential Revision: https://reviews.llvm.org/D42136 llvm-svn: 322852	2018-01-18 15:15:50 +00:00
Michael Kruse	d6e2208671	[ScopInfo] Pass name to ScopStmt ctor. NFC. This will give control of the statement's name to the caller. Required to give -polly-stmt-granularity=scalar-indep more control over the name of the generated statement in a follow-up commit. llvm-svn: 322851	2018-01-18 15:15:38 +00:00
Eli Friedman	a75d53c83f	[polly] [ScopInfo] Don't use isl_val_get_num_si. isl_val_get_num_si crashes on overflow, so don't use it on arbitrary integers. Testcase only crashes on platforms where long is 32 bits because of the signature of isl_val_get_num_si; not sure if it's possible to write a testcase which crashes if long is 64 bits. There are a few other places in polly which use isl_val_get_num_si; they probably need to be fixed as well. I don't think polly uses any of the other "long" isl APIs in an unsafe manner. Differential Revision: https://reviews.llvm.org/D42129 llvm-svn: 322766	2018-01-17 21:59:02 +00:00
Michael Kruse	a0db63a195	[IslTools] dumpPw: Dump same structure pieces together. Print same or similar structure elements together. Previously, the value could take more importance that the space structure if visited first in the space nest tree. Before: { Left[0] -> Right[i]: i >= 0; Left[1] -> AnotherRight[i]; Left[2] -> Right[-1] } After: { Left[0] -> Right[i]: i >= 0; Left[2] -> Right[-1]; Left[1] -> AnotherRight[i] } llvm-svn: 322581	2018-01-16 18:39:42 +00:00
Michael Kruse	21de8adc36	[CMake] Use only keyword-version of target_link_library. NFC. CMake insists that for each target, one uses only the non-keyword version of target_link_library target_link_library(mytarget lib) or the one with PUBLIC/PRIVATE/INTERFACE keyword: target_link_library(mytarget PUBLIC lib) Otherwise, CMake fails with the error message: The keyword signature for target_link_libraries has already been used with the target "mytarget". All uses of target_link_libraries with a target must be either all-keyword or all-plain. Change all occurances of target_link_library to the newer keyworded version to avoid such errors. Some already have been changed in r319840, but might not be sufficient for all build configurations to build the doxygen manual. Reported-by: Tanya Lattner <tanyalattner@llvm.org> llvm-svn: 322376	2018-01-12 16:09:18 +00:00
Michael Kruse	271deb17b0	[CodeGen] Fix noalias annotations for memcpy/memmove. Memory transfer instructions take two pointers. It is not defined to which of those a noalias annotation applies. To ensure correctness, do not add noalias annotations to memcpy/memmove instructions anymore. The caused a miscompile with test-suite's MultiSource/Applications/obsequi. Since r321138, the MemCpyOpt pass would remove memcpy/memmove calls if known to copy uninitialized memory. In that case, it was initialized by another memcpy, but the annotation for the target pointer said it would not alias. The annotation was actually meant for the source pointer, which was was an alloca and could not alias with the target pointer. llvm-svn: 321371	2017-12-22 17:44:53 +00:00
Michael Kruse	5f0e8a46cf	[ScopBuilder] Split statements on encountering store instructions. Introduce -polly-stmt-granularity=store option. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D37337 llvm-svn: 320360	2017-12-11 12:51:24 +00:00
Michael Kruse	188b437fcb	[ScopBuilder] Fix typo. NFC. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D41047 llvm-svn: 320336	2017-12-10 22:56:32 +00:00
Philip Pfaffe	0969462c52	[NFC] Fix formatting llvm-svn: 319973	2017-12-06 22:01:08 +00:00
Philip Pfaffe	d98dbeeb71	Port SCEVAffinator to the isl c++ bindings Summary: Straight forward port of SCEVAffinator Reviewers: grosser, bollu, Meinersbur Reviewed By: Meinersbur Subscribers: pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D40803 llvm-svn: 319958	2017-12-06 21:02:22 +00:00
Siddharth Bhat	c0f5f4deae	Update to latest clang-format. [NFC] Differential Revision: https://reviews.llvm.org/D40791 llvm-svn: 319718	2017-12-05 00:06:09 +00:00
Philip Pfaffe	4fe21814d1	Handle Top-Level-Regions in polly::isHoistableLoad Summary: This can be seen as a follow-up on my previous differential [D33411](https://reviews.llvm.org/D33411). We received a bug report where this error was triggered. I have tried my best to recreate the issue in a minimal lit testcase which is also part of this differential. I only handle return instructions as predecessors to a virtual TLR-exit right now. From inspecting the codebase, it seems `unreachable` instructions may also be of interest here. If requested, I can extend my patches to consider them as well. I would also apply this on `ScopHelper.cpp::isErrorBlock` (see D33411), of course. Reviewers: philip.pfaffe, bollu Reviewed By: bollu Subscribers: Meinersbur, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D40492 llvm-svn: 319431	2017-11-30 13:06:10 +00:00
Michael Kruse	bfb8fa5a16	Update format after clang-format change. NFC. In r319314 clang-format changed its reflowing logic. llvm-svn: 319426	2017-11-30 12:05:48 +00:00
Davide Italiano	b0c7dee0b6	[MaximalStaticExpansion] Simplify this code a bit. NFCI. llvm-svn: 318988	2017-11-25 23:01:31 +00:00
Michael Kruse	163cacb469	[CodeGen] Detect empty domain because of parameters context. Isl does not allow generating isl_ast_expr from an isl_pw_aff that has an empty domain (i.e. has no pieces). We already detected the case if the isl_pw_aff comes with an empty domain. isl_ast_build also considers the domain empty if it is disjoint with the parameter context (e.g. parameters values that we exclude by runtime versioning). Intersect the access relation domain with the parameter context to also detect such practically empty access domains. The effective pointer used in the generated code is unimportand because it will never be executed. This fixes llvm.org/PR35362 llvm-svn: 318806	2017-11-21 22:11:10 +00:00
Michael Kruse	58166b13e0	Run polly-update-format. NFC. polly-check-format has been failing since at least r318517, due to more than one cause. llvm-svn: 318795	2017-11-21 19:25:26 +00:00
Philip Pfaffe	00fd43b327	Port ScopInfo to the isl cpp bindings Summary: Most changes are mechanical, but in one place I changed the program semantics by fixing a likely bug: In `Scop::hasFeasibleRuntimeContext()`, I'm now explicitely handling the error-case. Before, when the call to `addNonEmptyDomainConstraints()` returned a null set, this (probably) accidentally worked because isl_bool_error converts to true. I'm checking for nullptr now. Reviewers: grosser, Meinersbur, bollu Reviewed By: Meinersbur Subscribers: nemanjai, kbarton, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D39971 llvm-svn: 318632	2017-11-19 22:13:34 +00:00
Zhaoshi Zheng	ceec175dff	[NFC] Make r318597 compatible with clang-format llvm-svn: 318561	2017-11-17 22:05:19 +00:00
Philip Pfaffe	2813ce228b	[nfc] Iwyu: forward-declare/include raw_ostream in zone algo llvm-svn: 318517	2017-11-17 11:34:29 +00:00
Philip Pfaffe	8dd0f479e8	[SI] Fix a potential use-after-free Summary: There is a potential use-after-free bug in Scop::buildSchedule(Region *, LoopStackTy &, LoopInfo &). Before, we took a reference to LoopStack.back() which is a use after free, since back is popped off further below. This didn't crash before by pure chance, since LoopStack is actually a vector, and the memory isn't freed upon pop. I turned this into an iterator-based algorithm. Reviewers: grosser, bollu, Meinersbur Reviewed By: Meinersbur Subscribers: llvm-commits, pollydev Differential Revision: https://reviews.llvm.org/D39979 llvm-svn: 318415	2017-11-16 16:35:19 +00:00
Mandeep Singh Grang	02e789c9bf	[polly] Remove redundant return [NFC] Reviewers: grosser, bollu Reviewed By: grosser Subscribers: nemanjai, kbarton, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D39916 llvm-svn: 317922	2017-11-10 20:33:08 +00:00
Michael Kruse	4d3f3c7206	[ForwardOpTree] Limit isl operations of known content reload. Put the analysis part of reloadKnownContent under an isl max-operations quota scope, as has already been done for forwardKnownLoad. This should fix the aosp timeout of "GrTestUtils.cpp". llvm-svn: 317495	2017-11-06 17:48:14 +00:00
Sanjay Patel	1b5114fa52	[Analysis] update to use new fast-math API - isFast() llvm-svn: 317491	2017-11-06 16:52:31 +00:00
Michael Kruse	68821a8b91	[ZoneAlgo/ForwardOpTree] Normalize PHIs to their known incoming values. Represent PHIs by their incoming values instead of an opaque value of themselves. This allows ForwardOpTree to "look through" the PHIs and forward the incoming values since forwardings PHIs is currently not supported. This is particularly useful to cope with PHIs inserted by GVN LoadPRE. The incoming values all resolve to a load from a single array element which then can be forwarded. It should in theory also reduce spurious conflicts in value mapping (DeLICM), but I have not yet found a profitable case yet, so it is not included here. To avoid transitive closure and potentially necessary overapproximations of those, PHIs that may reference themselves are excluded from normalization and keep their opaque self-representation. Differential Revision: https://reviews.llvm.org/D39333 llvm-svn: 317008	2017-10-31 16:11:46 +00:00
Michael Kruse	ff426d974d	[DeLICM] Fix wrong assumed access execution order. ForwardOpTree may already transform a scalar access to an array accesses. The access remains implicit (isOriginalScalarKind(), meaning that the access is always executed at the begin/end of a statement), but targets an array (isLatestArrayKind(), which is unrelated to whether the execution is implicit/explicit). Fix by properly using isOriginalXXX() to determine execution order. This fixes the buildbots on MultiSource/Benchmarks/DOE-ProxyApps-C/miniGMG. llvm-svn: 316995	2017-10-31 12:50:25 +00:00
Michael Kruse	06618bf71a	[OpenMP] Fix reference collection of latest base ptrs. When collecting base pointers that need to be made available in parallel subfunctions, use the base pointer associated with the latest ScopArrayInfo, instead of the original one. llvm-svn: 316983	2017-10-31 10:28:22 +00:00
Philip Pfaffe	53c803871e	[Acc] Do not statically dispatch into IslNodeBuilder's createFor Summary: When GPUNodeBuilder creates loops inside the kernel, it dispatches to IslNodeBuilder. This however is surprisingly dangerous, since it accesses the AST Node's user through the wrong type. This patch fixes this problem by overriding createFor correctly. This fixes PR35010. Reviewers: grosser, bollu, Meinersbur Reviewed By: Meinersbur Subscribers: Meinersbur, nemanjai, pollydev, llvm-commits, kbarton Differential Revision: https://reviews.llvm.org/D39364 llvm-svn: 316872	2017-10-29 21:36:34 +00:00
Michael Kruse	cc6ea8e74f	[ForwardOpTree] Use space indention. NFC. llvm-svn: 316769	2017-10-27 14:48:34 +00:00
Michael Kruse	822dfe271b	[ForwardOpTree] Reload know values. For scalar accesses, change the access target to an array element that is known to contain the same value. This may become an alternative to forwardKnownLoad which creates new loads (and therefore closer to forwarding speculatives). Reloading does not require the known value originating from a load, but can be a store as well. Differential Revision: https://reviews.llvm.org/D39325 llvm-svn: 316766	2017-10-27 14:26:14 +00:00
Michael Kruse	b6b65834a1	[Simplify] Mark (and sweep) based on latest access relation. Previously we marked scalars based on the original access function. However, when a scalar read access is redirected, the original definition (or incoming values of a PHI) is not used anymore, and can be deleted (unless referenced by use that has not been redirected). llvm-svn: 316660	2017-10-26 12:34:36 +00:00
Michael Kruse	983fa9bf23	[ZoneAlgo] Translate addArrayWriteAccess to isl++. NFC. llvm-svn: 316459	2017-10-24 16:40:34 +00:00
Michael Kruse	25bd602b7a	[ISLTools] Translate computeReachingWrite to isl++. NFC. llvm-svn: 316445	2017-10-24 15:19:46 +00:00
Michael Kruse	19cd61dc11	[DeLICM] Do not try to map to multiple array elements. Add check and skip when the store used to determine the target accesses multiple array elements. Only a single array location should for mapping the scalar. Having multiple creates problems when deciding which element to load from. While MemoryAccess::getAddressFunction() should select just one of them, other problems arise in code that assumes that there is just one target element per statement instance. This fixes llvm.org/PR34989 This also reverts r313902 which fixed llvm.org/PR34485 also caused by a non-functional target array element. This patch avoids the situation to occur in the first place. llvm-svn: 316432	2017-10-24 13:05:24 +00:00
Adam Nemet	e0f1541f41	Rename OptimizationDiagnosticInfo.h to OptimizationRemarkEmitter.h Polly version of r315249 on LLVM trunk. llvm-svn: 315253	2017-10-09 23:49:08 +00:00
Michael Kruse	cc345e6e94	[ScopBuilder] Introduce -polly-stmt-granularity=scalar-indep option. The option splits BasicBlocks into minimal statements such that no additional scalar dependencies are introduced. The algorithm is based on a union-find structure, and unites sets if putting them into separate statements would introduce a scalar dependencies. As a consequence, instructions may be split into separate statements such their relative order is different than the statements they are in. This is accounted for instructions whose relative order matters (e.g. memory accesses). The algorithm is generic in that heuristic changes can be made relatively easily. We might relax the order requirement for read-reads or accesses to different base pointers. Forwardable instructions can be made to not cause a join. This implementation gives us a speed-up of 82% in SPEC 2006 456.hmmer benchmark by allowing loop-distribution in a hot loop such that one of the loops can be vectorized. Differential Revision: https://reviews.llvm.org/D38403 llvm-svn: 314983	2017-10-05 13:43:00 +00:00
Michael Kruse	482d3f41e5	[ScopBuilder] Introduce -polly-stmt-granularity option. NFC. The option is introduced with only one possible value -polly-stmt-granularity=bb which represents the current behaviour, which is outlined into the new function buildSequentialBlockStmts(). More options will be added in future commits. llvm-svn: 314900	2017-10-04 12:18:57 +00:00
Tobias Grosser	c52b71db15	[GPGPU] Make sure escaping invariant load hoisted scalars are preserved We make sure that the final reload of an invariant scalar memory access uses the same stack slot into which the invariant memory access was stored originally. Earlier, this was broken as we introduce a new stack slot aside of the preload stack slot, which remained uninitialized and caused our escaping loads to contain garbage. This happened due to us clearing the pre-populated values in EscapeMap after kernel code generation. We address this issue by preserving the original host values and restoring them after kernel code generation. EscapeMap is not expected to be used during kernel code generation, hence we clear it during kernel generation to make sure that any unintended uses are noticed. llvm-svn: 314894	2017-10-04 10:24:23 +00:00
Michael Kruse	4ee19603e9	[ScopBuilder] Iterate over statement instructions. NFC. Iterate over statement instructions instead over basic block instructions when creating MemoryAccesses. It allows making the creation of MemoryAccesses independent of how the basic blocks are split into multiple ScopStmts. llvm-svn: 314665	2017-10-02 11:41:33 +00:00
Michael Kruse	f5745b4e7d	[ScopBuilder] Build invariant loads separately. Create the MemoryAccesses of invariant loads separately and before all other MemoryAccesses. Invariant loads are classified as synthesizable and therefore are not contained in any statement. When iterating over all instructions of all statements, the invariant loads are consequently not processed and iterating over them separately becomes necessary. This patch can change the order in which MemoryAccesses are created, but otherwise has no functional change. Some temporary code is introduced to ensure correctness, but will be removed in the next commit. llvm-svn: 314664	2017-10-02 11:41:27 +00:00
Michael Kruse	89a6f3db02	[ScopBuilder] Build escaping dependencies separately. Instructions that compute escaping values might be synthesizable and therefore not contained in any ScopStmt. When buildAccessFunctions is changed to only iterate over the instruction list of statement, "free" instructions still need to be written. We do this after the main MemoryAccesses have been created. This can change the order in which MemoryAccesses are created, but has otherwise no functional change. llvm-svn: 314663	2017-10-02 11:41:19 +00:00
Michael Kruse	0bedec0e65	[ScopBuilder] Specialize exit block handling. NFC. Decouple handling of exit block PHIs and other MemoryAccesses. Exit PHIs only need the PHI handling part of buildAccessFunctions but requires code for skipping them in while creating other MemoryAcesses. This change will make it easier to modify how statement MemoryAccesses are created without considering the exit block special case. llvm-svn: 314662	2017-10-02 11:41:12 +00:00
Michael Kruse	e276e9f324	[ForwardOpTree] Fix out-of-quota in assertion. llvm-svn: 314661	2017-10-02 11:41:06 +00:00
Michael Kruse	c013399197	[ScopDetect] Do not add loads out of the SCoP to required invariant loads. Loads before the SCoP are always invariant within the SCoP and therefore are no "required invariant loads". An assertion failes in ScopBuilder when it finds such an invariant load. Fix by not adding such loads to the required invariant load list. This likely will cause the region to be not considered a valid SCoP. We may want to unconditionally accept instructions defined before the region as valid invariant conditions instead of rejecting them. This fixes a compilation crash of SPEC CPU2006 453.povray's render.cpp. llvm-svn: 314636	2017-10-01 22:19:28 +00:00
Tobias Grosser	2fb847fbf6	[GPGPU] Set Polly's RTC to false in case invariant load hoisting fails This matches the behavior we already have in lib/Codegen/CodeGeneration.cpp and makes sure that we fall back to the original code. It seems when invariant load hoisting was introduced to the GPGPU backend we missed to reset the RTC flag, such that kernels where invariant load hoisting failed executed the 'optimized' SCoP, which however is set to a simple 'unreachable'. Unsurprisingly, this results in hard to debug issues that are a lot of fun to debug. llvm-svn: 314624	2017-10-01 12:39:14 +00:00
Michael Kruse	ed787e7540	[Polly] Add dumpPw() and dumpExpanded() functions. NFC. These functions print a multi-line and sorted representation of unions of polyhedra. Each polyhedron (basic_{ast/map}) has its own line. First sort key is the polyhedron's hierachical space structure. Secondary sort key is the lower bound of the polyhedron, which should ensure that the polyhedral are printed in approximately ascending order. Example output of dumpPw(): [p_0, p_1, p_2] -> { Stmt0[0] -> [0, 0]; Stmt0[i0] -> [i0, 0] : 0 < i0 <= 5 - p_2; Stmt1[0] -> [0, 2] : p_1 = 1 and p_0 = -1; Stmt2[0] -> [0, 1] : p_1 >= 3 + p_0; Stmt3[0] -> [0, 3]; } In contrast dumpExpanded() prints each point in the sets, unless there is an unbounded dimension that cannot be expandend. This is useful for reduced test cases where the loop counts are set to some constant to understand a bug. Example output of dumpExpanded( { [MemRef_A[i0] -> [i1]] : (exists (e0 = floor((1 + i1)/3): i0 = 1 and 3e0 <= i1 and 3e0 >= -1 + i1 and i1 >= 15 and i1 <= 25)) or (exists (e0 = floor((i1)/3): i0 = 0 and 3e0 < i1 and 3e0 >= -2 + i1 and i1 > 0 and i1 <= 11)) }): { [MemRef_A[0] ->[1]]; [MemRef_A[0] ->[2]]; [MemRef_A[0] ->[4]]; [MemRef_A[0] ->[5]]; [MemRef_A[0] ->[7]]; [MemRef_A[0] ->[8]]; [MemRef_A[0] ->[10]]; [MemRef_A[0] ->[11]]; [MemRef_A[1] ->[15]]; [MemRef_A[1] ->[16]]; [MemRef_A[1] ->[18]]; [MemRef_A[1] ->[19]]; [MemRef_A[1] ->[21]]; [MemRef_A[1] ->[22]]; [MemRef_A[1] ->[24]]; [MemRef_A[1] ->[25]] } Differential Revision: https://reviews.llvm.org/D38349 llvm-svn: 314525	2017-09-29 15:45:40 +00:00
Michael Kruse	2dd5fa4dc7	[ScopBuilder] Fix typo. NFC. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D38322 llvm-svn: 314519	2017-09-29 15:13:05 +00:00
Philip Pfaffe	859ef1c09e	Fix the build after r314375 r314375 privatized Loop's constructor and replaced it with an Allocator. llvm-svn: 314412	2017-09-28 12:20:24 +00:00
Michael Kruse	89d2be0702	[Support] Force instantiation of isl dump() methods. NFC. In order for debuggers to be able to call an inline method, it must have been instantiated somewhere. The dump() methods are usually not used, so add an instantiation in debug builds. This allows to call .dump() on any isl++ object from the gcc/gdb and Visual Studio debugger in debug builds with assertions enabled. In optimized builds, even with assertions enabled, the dump() methods are also inlined in GICHelper.cpp, so no externally visible symbols will be available either. Differential Revision: https://reviews.llvm.org/D38198 llvm-svn: 314395	2017-09-28 09:51:04 +00:00
Tobias Grosser	1f93d0f1f9	[ScopInfo] Allow PHI nodes that reference an error block As long as these PHI nodes are only referenced by terminator instructions. llvm-svn: 314212	2017-09-26 15:00:10 +00:00
Tobias Grosser	5e531dfef4	[ScopInfo] Allow invariant loads in branch conditions In case the value used in a branch condition is a load instruction, assume this load to be invariant. llvm-svn: 314146	2017-09-25 20:27:15 +00:00
Tobias Grosser	0a62b2d887	[ScopInfo] Allow uniform branch conditions If all but one branch come from an error condition and the incoming value from this branch is a constant, we can model this branch. llvm-svn: 314116	2017-09-25 16:37:15 +00:00
Tobias Grosser	ee457594c2	[ScopDetect/Info] Look through PHIs that follow an error block In case a PHI node follows an error block we can assume that the incoming value can only come from the node that is not an error block. As a result, conditions that seemed non-affine before are now in fact affine. This is a recommit of r312663 after fixing test/Isl/CodeGen/phi_after_error_block_outside_of_scop.ll llvm-svn: 314075	2017-09-24 09:25:30 +00:00
Tobias Grosser	75d133f0ac	[IslExprBuilder] Do not generate RTC with more than 64 bit Such RTCs may introduce integer wrapping intrinsics with more than 64 bit, which are translated to library calls on AOSP that are not part of the runtime and will consequently cause linker errors. Thanks to Eli Friedman for reporting this issue and reducing the test case. llvm-svn: 314065	2017-09-23 15:32:07 +00:00
Reid Kleckner	3fc649cb76	[Support] Rename tool_output_file to ToolOutputFile, NFC This class isn't similar to anything from the STL, so it shouldn't use the STL naming conventions. llvm-svn: 314050	2017-09-23 01:03:17 +00:00
Michael Kruse	bfca5f4334	[DeLICM] Allow non-injective PHIRead->PHIWrite mapping. Remove an assertion that tests the injectivity of the PHIRead -> PHIWrite relation. That is, allow a single PHI write to be used by multiple PHI reads. This may happen due to some statements containing the PHI write not having the statement instances that would overwrite the previous incoming value due to (assumed/invalid) contexts. This result in that PHI write is mapped to multiple targets which is not supported. Codegen will select one one of the targets using getAddressFunction(). However, the runtime check should protect us from this case ever being executed. We therefore allow injective PHI relations. Additional calculations to detect/santitize this case would probably not be worth the compuational effort. This fixes llvm.org/PR34485 llvm-svn: 313902	2017-09-21 19:08:23 +00:00
Michael Kruse	6d7a7896ce	[ScopInfo] Use map for value def/PHI read accesses. Before this patch, ScopInfo::getValueDef(SAI) used getStmtFor(Instruction*) to find the MemoryAccess that writes a MemoryKind::Value. In cases where the value is synthesizable within the statement that defines, the instruction is not added to the statement's instruction list, which means getStmtFor() won't return anything. If the synthesiable instruction is not synthesiable in a different statement (due to being defined in a loop that and ScalarEvolution cannot derive its escape value), we still need a MemoryKind::Value and a write to it that makes it available in the other statements. Introduce a separate map for this purpose. This fixes MultiSource/Benchmarks/MallocBench/cfrac where -polly-simplify could not find the writing MemoryAccess for a use. The write was not marked as required and consequently was removed. Because this could in principle happen as well for PHI scalars, add such a map for PHI reads as well. llvm-svn: 313881	2017-09-21 14:23:11 +00:00
Michael Kruse	0e370cf1a7	Check whether IslAstInfo and DependenceInfo were computed for the same Scop. Since -polly-codegen reports itself to preserve DependenceInfo and IslAstInfo, we might get those analysis that were computed by a different ScopInfo for a different Scop structure. This would be unfortunate because DependenceInfo and IslAstInfo hold references to resources allocated by ScopInfo/ScopBuilder/Scop (e.g. isl_id). If -polly-codegen and DependenceInfo/IslAstInfo do not agree on which Scop to use, unpredictable things can happen. When the ScopInfo/Scop object is freed, there is a high probability that the new ScopInfo/Scop object will be created at the same heap position with the same address. Comparing whether the Scop or ScopInfo address is the expected therefore is unreliable. Instead, we compare the address of the isl_ctx object. Both, DependenceInfo and IslAstInfo must hold a reference to the isl_ctx object to ensure it is not freed before the destruction of those analyses which might happen after the destruction of the Scop/ScopInfo they refer to. Hence, the isl_ctx will not be freed and its address not reused as long there is a DependenceInfo or IslAstInfo around. This fixes llvm.org/PR34441 llvm-svn: 313842	2017-09-21 00:01:13 +00:00
Michael Kruse	8dceb76066	[ScheduleOptimizer] Fix and test schedule tree statistics. Fix walking over the schedule tree to collect its properties (Number of permutable bands etc.). Also add regression tests for these statistics. llvm-svn: 313750	2017-09-20 11:53:05 +00:00
Michael Kruse	89972e21f8	[ForwardOpTree] Allow out-of-quota in examination part of forwardTree. Computing the reaching definition in forwardTree() can take a long time if the coefficients are large. When the forwarding is carried-out (doIt==true), forwardTree() must execute entirely or not at all to get a consistent output, which means we cannot just allow out-of-quota errors to happen in the middle of the processing. We introduce the class IslQuotaScope which allows to opt-in code that is conformant and has been tested with out-of-quota events. In case of ForwardOpTree, out-of-quota is allowed during the operand tree examination, but not during the transformation. The same forwardTree() recursion is used for examination and execution, meaning that the reaching definition has already been computed in the examination tree walk and cached for reuse in the transformation tree walk. This should fix the time-out of grtestutils.ll of the asop buildbot. If the compilation still takes too long, we can reduce the max-operations allows for -polly-optree. Differential Revision: https://reviews.llvm.org/D37984 llvm-svn: 313690	2017-09-19 22:53:20 +00:00
Michael Kruse	ef8325ba50	[ForwardOpTree] Test the max operations quota. cl::opt<unsigned long> is not specialized and hence the option -polly-optree-max-ops impossible to use. Replace by supported option cl::opt<unsigned>. Also check for an error state when computing the written value, which happens when the quota runs out. llvm-svn: 313546	2017-09-18 17:43:50 +00:00
Michael Kruse	ad32de9424	[ForwardOptTree] Remove redundant simplify(). NFC. The result of computeKnown has already been simplified. llvm-svn: 313526	2017-09-18 12:28:07 +00:00
Roman Gareev	925ce50f1b	Unroll and separate the remaining parts of isolation The remaining parts produced by the full partial tile isolation can contain hot spots that are worth to be optimized. Currently, we rely on the simple loop unrolling pass, LiCM and the SLP vectorizer to optimize such parts. However, the approach can suffer from the lack of the information about aliasing that Polly provides using additional alias metadata or/and the lack of the information required by simple loop unrolling pass. This patch is the first step to optimize the remaining parts. To do it, we unroll and separate them. In case of, for instance, Intel Kaby Lake, it helps to increase the performance of the generated code from 39.87 GFlop/s to 49.23 GFlop/s. The next possible step is to avoid unrolling performed by Polly in case of isolated and remaining parts and rely only on simple loop unrolling pass and the Loop vectorizer. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D37692 llvm-svn: 312929	2017-09-11 17:46:47 +00:00
Michael Kruse	0481d78c6c	[CodegenCleanup] Update cleanup passes according (old) PassManagerBuilder. Update CodegenCleanup using the function-level passes added by populatePassManager that run between EP_EarlyAsPossible and EP_VectorizerStart in -O3. The changes in particular are: - Added pass create arguments, e.g. ExpensiveCombines for InstCombine. - Remove reroll pass. The option -reroll-loops is disabled by default. - Add passes run with UnitAtATime, which is the default. - Add instances of LibCallsShrinkWrap, TailCallElimination, SCCP (sparse conditional constant propagation), Float2Int that did not run before. - Add instances of GVN as in the default pipeline. Notes: - GVNHoist, GVNSink, NewGVN are still disabled in the -O3 pipeline. - The optimization level and other optimization parameters are not accessible outside of PassManagerBuilder, hence we cannot add passes depending on these. Differential Revision: https://reviews.llvm.org/D37571 llvm-svn: 312875	2017-09-09 21:43:49 +00:00
Reid Kleckner	b79e7a6897	Fix some unused warnings in polly llvm-svn: 312755	2017-09-07 22:46:51 +00:00
Michael Kruse	2f5cbc449a	[CodeGen] Bitcast scalar writes to actual value. The type of NewValue might change due to ScalarEvolution looking though bitcasts. The synthesized NewValue therefore becomes the type before the bitcast. llvm-svn: 312718	2017-09-07 12:15:01 +00:00
Siddharth Bhat	e2950f46c6	[PPCGCodeGen] Document pre-composition with Zero in getExtent. [NFC] It's weird at first glance that we do this, so I wrote up some documentation on why we need to perform this process. llvm-svn: 312715	2017-09-07 11:57:33 +00:00
Michael Kruse	8ee179d3b4	Revert "[ScopDetect/Info] Look through PHIs that follow an error block" This reverts commit r312410 - [ScopDetect/Info] Look through PHIs that follow an error block The commit caused generation of invalid IR due to accessing a parameter that does not dominate the SCoP. llvm-svn: 312663	2017-09-06 19:05:40 +00:00
Michael Kruse	bd84ce8931	[ZoneAlgo] Handle non-StoreInst/LoadInst MemoryAccesses including memset. Up to now ZoneAlgo considered array elements access by something else than a LoadInst or StoreInst as not analyzable. This patch removes that restriction by using the unknown ValInst to describe the written content, repectively the element type's null value in case of memset. Differential Revision: https://reviews.llvm.org/D37362 llvm-svn: 312630	2017-09-06 12:40:55 +00:00
Michael Kruse	420c4863a9	[Simplify] Actually remove unsed instruction from region header. Since r312249 instructions of a entry block of region statements are not marked as root anymore and hence can theoretically be removed if unused. Theoretically, because the instruction list was not changed. Still, MemoryAccesses for unused instructions were removed. This lead to a failed assertion in the code generator when the MemoryAccess for the still listed instruction was not found. This hould fix the Assertion failed: ArrayAccess && "No array access found for instruction!", file ScopInfo.h, line 1494 compiler crashes. llvm-svn: 312566	2017-09-05 19:44:39 +00:00
Tobias Grosser	1a695b1d6c	[CodegenCleanup] Use old GVN pass instead of NewGVN It seems NewGVN still has some problems: llvm.org/PR34452, we will switch back after they have been resolved. llvm-svn: 312480	2017-09-04 11:04:33 +00:00
Tobias Grosser	8703e38380	[ISLTools]: Move singleton to isl++ llvm-svn: 312476	2017-09-04 10:05:29 +00:00
Tobias Grosser	3575afd739	[DeLICM] Move some functions to isl++ [NFC] llvm-svn: 312475	2017-09-04 10:05:25 +00:00
Tobias Grosser	d6e0679c4e	[ForwardOp] Remove read accesses for all instructions that have been moved Before this patch, OpTree did not consider forwarding an operand tree consisting of only single LoadInst as useful. The motivation was that, like an access to a read-only variable, it would just replace one MemoryAccess by another. However, in contrast to read-only accesses, this would replace a scalar access by an array access, which is something worth doing. In addition, leaving scalar MemoryAccess is problematic in that VirtualUse prioritizes inter-Stmt use over intra-Stmt. It was possible that the same LLVM value has a MemoryAccess for accessing the remote Stmt's LoadInst as well as having the same LoadInst in its own instruction list (due to being forwarded from another operand tree). With this patch we ensure that if a LoadInst is forwarded is any operand tree, also the operand tree containing just the LoadInst is forwarded as well, which effectively removes the scalar MemoryAccess such that only the array access remains, not both. Thanks Michael for the detailed explanation. Reviewers: Meinersbur, bellu, singam-sanjay, gareevroman Subscribers: hfinkel, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D37424 llvm-svn: 312456	2017-09-03 19:52:15 +00:00
Tobias Grosser	701d943d12	[IslAst] Do not assert in case of empty min/max alias locations In certain situations, the context in the isl_ast_build could result for the min/max locations of our alias sets to become empty, which would cause an internal error in isl, which is then unable to derive a value for these expressions. Check these conditions before code generating expressions and instead assume that alias check succeeded. This is valid, as the corresponding memory accesses will not be executed under any valid context. This fixed llvm.org/PR34432. Thanks to Qirun Zhang for reporting. llvm-svn: 312455	2017-09-03 19:47:19 +00:00
Tobias Grosser	6b1e461329	[IslAst] Move buildCondition to isl++ llvm-svn: 312452	2017-09-03 18:31:44 +00:00
Tobias Grosser	99ccf05694	[ScopHelper] Do not crash on unreachable blocks This resolves llvm.org/PR34433. Thanks to Zhendong Su for reporting. llvm-svn: 312451	2017-09-03 18:01:22 +00:00
Michael Kruse	7954a221f3	[ForwardOpTree] Fix typos. NFC. llvm-svn: 312446	2017-09-03 16:09:38 +00:00
Tobias Grosser	4baedc70d1	[ScopDetect/Info] Look through PHIs that follow an error block In case a PHI node follows an error block we can assume that the incoming value can only come from the node that is not an error block. As a result, conditions that seemed non-affine before are now in fact affine. llvm-svn: 312410	2017-09-02 08:25:55 +00:00
Siddharth Bhat	3928e3f50a	[ISLNodeBuilder] Materialize Fortran array sizes of arrays without memory accesses. In Polly, we specifically add a paramter to represent the outermost dimension size of fortran arrays. We do this because this information is statically available from the fortran metadata generated by dragonegg. However, we were only materializing these parameters (meaning, creating an llvm::Value to back the isl_id) from memory accesses. This is wrong, we should materialize parameters from scop array info. It is wrong because if there is a case where we detect 2 fortran arrays, but only one of them is accessed, we may not materialize the other array's dimensions at all. This is incorrect. We fix this by looping over all `polly::ScopArrayInfo` in a scop, rather that just all `polly::MemoryAccess`. Differential Revision: https://reviews.llvm.org/D37379 llvm-svn: 312350	2017-09-01 18:55:43 +00:00
Michael Kruse	0c6c555beb	Fix Memory Access of failing tests. Mark scalar dependences for different statements belonging to same BB as 'Inter'. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D37147 llvm-svn: 312324	2017-09-01 11:36:52 +00:00
Roman Gareev	1cb3491620	Run GVN during the cleanup Currently, GVN can be necessary to eliminate redundant instructions in case of, for instance, GEMM and float type. This patch makes GVN be run during the cleanup. Reviewed-by: Tobias Grosser <tobias@grosser.es>, Michael Kruse <llvm@meinersbur.de> Differential Revision: https://reviews.llvm.org/D37340 llvm-svn: 312307	2017-09-01 06:52:28 +00:00
Tobias Grosser	04567fd480	Drop unused statistic counter llvm-svn: 312304	2017-09-01 02:17:10 +00:00
Mandeep Singh Grang	c2774a549b	[polly] Fix non-deterministic output due to iteration of unordered ScopArrayInfo Summary: This fixes the following failures in the reverse iteration builder: http://lab.llvm.org:8011/builders/reverse-iteration/builds/25 Polly :: MaximalStaticExpansion/working_deps_between_inners.ll Polly :: MaximalStaticExpansion/working_expansion_multiple_dependences_per_statement.ll Polly :: MaximalStaticExpansion/working_expansion_multiple_instruction_per_statement.ll Polly :: MaximalStaticExpansion/working_phi_expansion.ll Reviewers: simbuerg, Eugene.Zelenko, grosser, zinob, bollu Reviewed By: grosser Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37349 llvm-svn: 312273	2017-08-31 20:10:30 +00:00
Roman Gareev	6589748920	Use the information about the target cache provided by the TargetTransformInfo. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D37178 llvm-svn: 312255	2017-08-31 17:07:54 +00:00
Tobias Grosser	2307f86c47	[ForwardOpTree] Allow forwarding in the presence of region statements Summary: After region statements now also have instruction lists, this is a straightforward extension. Reviewers: Meinersbur, bollu, singam-sanjay, gareevroman Reviewed By: Meinersbur Subscribers: hfinkel, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D37298 llvm-svn: 312249	2017-08-31 16:04:49 +00:00
Siddharth Bhat	56572c6a5e	[PPCGCodeGen] Convert intrinsics to libdevice functions whenever possible. This is useful when we face certain intrinsics such as `llvm.exp.*` which cannot be lowered by the NVPTX backend while other intrinsics can. So, we would need to keep blacklists of intrinsics that cannot be handled by the NVPTX backend. It is much simpler to try and promote all intrinsics to libdevice versions. This patch makes function/intrinsic very uniform, and will always try to use a libdevice version if it exists. Differential Revision: https://reviews.llvm.org/D37056 llvm-svn: 312239	2017-08-31 13:03:37 +00:00
Tobias Grosser	c43d0360cc	[BlockGenerator] Generate entry block of regions from instruction lists The adds code generation support for the previous commit. This patch has been re-applied, after the memory issue in the previous patch has been fixed. llvm-svn: 312211	2017-08-31 03:17:35 +00:00
Tobias Grosser	bd15d13d4e	[ScopInfo] Use statement lists for entry blocks of region statements By using statement lists in the entry blocks of region statements, instruction level analyses also work on region statements. We currently only model the entry block of a region statements, as this is sufficient for most transformations the known-passes currently execute. Modeling instructions in the presence of control flow (e.g. infinite loops) is left out to not increase code complexity too much. It can be added when good use cases are found. This change set is reapplied, after a memory corruption issue had been fixed. llvm-svn: 312210	2017-08-31 03:15:56 +00:00
Tobias Grosser	d3edc16416	Revert "[ScopInfo] Use statement lists for entry blocks of region statements" This reverts commit r312128. It aused some memory issues. llvm-svn: 312209	2017-08-31 02:43:49 +00:00
Tobias Grosser	6f1f5cbb5b	Revert "[BlockGenerator] Generate entry block of regions from instruction lists" This reverts commit r312129. It caused some memory issues. llvm-svn: 312208	2017-08-31 02:43:27 +00:00
Tobias Grosser	1e34508bcc	[BlockGenerator] Generate entry block of regions from instruction lists The adds code generation support for the previous commit. llvm-svn: 312129	2017-08-30 15:08:30 +00:00
Tobias Grosser	6fbe4c8501	[ScopInfo] Use statement lists for entry blocks of region statements By using statement lists in the entry blocks of region statements, instruction level analyses also work on region statements. We currently only model the entry block of a region statements, as this is sufficient for most transformations the known-passes currently execute. Modeling instructions in the presence of control flow (e.g. infinite loops) is left out to not increase code complexity too much. It can be added when good use cases are found. llvm-svn: 312128	2017-08-30 15:08:21 +00:00
Michael Kruse	f3387836d0	[ScopBuilder/ScopInfo] Move reduction detection to ScopBuilder. NFC. Reduction detection is only executed in the SCoP building phase. Hence it fits better into ScopBuilder to separate SCoP-construction from SCoP modeling. llvm-svn: 312118	2017-08-30 13:05:08 +00:00
Michael Kruse	35aa9d862e	[ScopBuilder/ScopInfo] Move ScopStmt::collectSurroundingLoops to ScopBuilder. NFC. This method is only called in the SCoP building phase. Therefore it fits better into ScopBuilder to separate SCoP-construction from SCoP modeling. llvm-svn: 312117	2017-08-30 13:05:01 +00:00
Michael Kruse	eb83141f9e	[ScopBuilder/ScopInfo] Move ScopStmt::buildDomain to ScopBuilder. NFC. This method is only called in the SCoP building phase. Therefore it fits better into ScopBuilder to separate SCoP-construction from SCoP modeling. llvm-svn: 312116	2017-08-30 13:04:54 +00:00
Michael Kruse	a29f8c03d4	[ScopBuilder/ScopInfo] Move ScopStmt::buildAccessRelations to ScopBuilder. NFC. This method is only called in the SCoP building phase. Therefore it fits better into ScopBuilder to separate SCoP-construction from SCoP modeling. This mostly mechanical change makes ScopBuilder directly access some of ScopStmt/MemoryAccess private fields. We add ScopBuilder as a friend class and will add proper accessor functions sometime later. llvm-svn: 312115	2017-08-30 13:04:46 +00:00
Michael Kruse	f6eb3a2ed2	[ScopBuilder/ScopInfo] Move and inline Scop::init into ScopBuilder::buildScop. NFC. The method is only needed in the SCoP building phase, and doesn't need to be part of the general API. llvm-svn: 312114	2017-08-30 13:04:39 +00:00
Michael Kruse	860870b7b0	[ScopBuilder] Report to dbgs() on SCoP bailout. NFC. This allows to use -debug to see that a SCoP was found in ScopDetect, but dismissed by ScopBuilder. llvm-svn: 312113	2017-08-30 11:52:03 +00:00
Michael Kruse	591255183b	[ScopBuilder] Introduce metadata for splitting scop statement. This patch allows annotating of metadata in ir instruction (with "polly_split_after"), which specifies where to split a particular scop statement. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D36402 llvm-svn: 312107	2017-08-30 10:11:06 +00:00
Michael Kruse	99cc9ded41	Do not consider mem intrinsics as error. The intrinsics memset, memcopy and memmove do have their memory accesses modeled by ScopBuilder. Do not consider them error-case behavior. Test case will come with a future patch that requires memory intrinsics outside of error blocks. llvm-svn: 312021	2017-08-29 18:27:47 +00:00
Michael Kruse	25d3f85a43	Skip ignored intrinsics. Commit r252725 introduced a "return false" if an ignored intrinsics was found. The consequence of this was that the mere existence of an ignored intrinsic (such as llvm.dbg.value) before a call that would have qualified the block to be an error block, to not be an error block. The obvious goal was to just skip ignored intrinsics, not changing the meaning of what an error block is. llvm-svn: 312020	2017-08-29 18:27:42 +00:00
Michael Kruse	4728184342	[ZoneAlgo] More fine-grained bail-out. ZoneAlgo used to bail out for the complete SCoP if it encountered something violating its assumption. This meant the neither OpTree can forward any load nor DeLICM do anything in such cases, even if their transformations are unrelated to the violations. This patch adds a list of compatible elements (currently with the granularity of entire arrays) that can be used for analysis. OpTree and DeLICM can then check whether their transformations only concern compatible elements, and skip non-compatible ones. This will be useful for e.g. Polybench's benchmarks covariance, correlation, bicg, doitgen, durbin, gramschmidt, adi that have assumption violation, but which are not necessarily relevant for all transformations. Differential Revision: https://reviews.llvm.org/D37219 llvm-svn: 311929	2017-08-28 20:39:07 +00:00
Tobias Grosser	ee8ad1c0ff	[IslAst] Do not compare arrays in alias check which are known to be identical This possibly helps to avoid run-time check failures in the COSMO kernels. llvm-svn: 311920	2017-08-28 20:17:02 +00:00
Michael Kruse	a4f447c2a4	[PM] Properly require and preserve OptimizationRemarkEmitter. NFCI. Properly require and preserve the OptimizationRemarkEmitter for use in ScopPass. Previously one had to get the ORE from ScopDetection because CodeGeneration did not mark it as preserved. It would need to be recomputed which results in the legacy PM to throw away all previous SCoP analysis. This also changes the implementation of ScopPass::getAnalysisUsage to not unconditionally preserve all passes, but only those needed to be preserved by any SCoP pass (at least when using the legacy PM). This allows invalidating DependenceInfo (and IslAstInfo) in case the pass would cause them to change (e.g. OpTree, DeLICM, MaximalArrayExpansion) JSONImporter should also invalidate the DependenceInfo. In this patch it marks DependenceInfo as preserved anyway because some regression tests depend on it. Differential Revision: https://reviews.llvm.org/D37010 llvm-svn: 311888	2017-08-28 14:07:33 +00:00
Michael Kruse	e983e6b1c5	[ZoneAlgo] Print rejection reasons to llvm::dbgs(). NFC. llvm-svn: 311885	2017-08-28 11:22:23 +00:00
Tobias Grosser	93ab558d2e	[Detect] Consider nested loop profitable if entry block is not in loop In cases where the entry block of a scop was not contained in a loop that was part of the scop region and at the same time there was a loop surrounding the scop, we missed to count the loops in the scop and consequently did not consider the scop profitable. We correct this by only moving to the loop parent, in case the current loop is loop contained in the scop. This increases the number of loops in COSMO which we assume to be profitable from 3974 to 4981. llvm-svn: 311863	2017-08-27 21:39:25 +00:00
Eugene Zelenko	a32707d5b1	[Polly] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 311802	2017-08-25 21:35:27 +00:00
Eugene Zelenko	9248fde53a	[Polly] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 311704	2017-08-24 21:22:41 +00:00
Tobias Grosser	6d0970f64e	Revert "[polly] Fix ScopDetectionDiagnostic test failure caused by r310940" This reverts commit 950849ece9bb8fdd2b41e3ec348b9653b4e37df6. This commit broke various buildbots. llvm-svn: 311692	2017-08-24 19:47:15 +00:00
Michael Kruse	b795bfc0d4	[CodeGen] Detect impossible partial write conditions more reliably. Whether a partial write is tautological/unsatisfiable not only depends on the access domain, but also on the domain covered by its node in the AST. In the example below, there are two instances of Stmt_cond_false. It may have a partial write access that is not executed in instance Stmt_cond_false(0). for (int c0 = 0; c0 < tmp5; c0 += 1) { Stmt_for_body344(c0); if (tmp5 >= c0 + 2) Stmt_cond_false(c0); Stmt_cond_end(c0); } if (tmp5 <= 0) { Stmt_for_body344(0); Stmt_cond_false(0); Stmt_cond_end(0); } Isl cannot derive a subscript for an array element that is never accessed. This caused an error in that no subscript expression has been generated in IslNodeBuilder::createNewAccesses, but BlockGenerator expected one to exist because there is an execution of that write, just not in that ast node. Fixed by instead of determining whether the access domain is empty, inspect whether isl generated a constant "false" ast expression in the current ast node. This should fix a compiler crash of the aosp buildbot. llvm-svn: 311663	2017-08-24 14:51:35 +00:00
Siddharth Bhat	78027437e6	[Polly] [PPCGCodeGeneration] Mild refactoring of checking validity of functions in a kernel. This is a stylistic change to make the function a little more readable. Also add a debug print to show what instruction contains a use of a function we don't understand in the kernel. Differential Revision: https://reviews.llvm.org/D37058 llvm-svn: 311648	2017-08-24 09:54:15 +00:00
Andreas Simbuerger	e478e2de83	[Polly][WIP] Scalar fully indexed expansion Summary: This patch comes directly after https://reviews.llvm.org/D34982 which allows fully indexed expansion of MemoryKind::Array. This patch allows expansion for MemoryKind::Value and MemoryKind::PHI. MemoryKind::Value seems to be working with no majors modifications of D34982. A test case has been added. Unfortunatly, no "run time" checks can be done for now because as @Meinersbur explains in a comment on D34982, DependenceInfo need to be cleared and reset to take expansion into account in the remaining part of the Polly pipeline. There is no way to do that in Polly for now. MemoryKind::PHI is not working. Test case is in place, but not working. To expand MemoryKind::Array, we expand first the write and then after the reads. For MemoryKind::PHI, the idea of the current implementation is to exchange the "roles" of the read and write and expand first the read according to its domain and after the writes. But with this strategy, I still encounter the problem of union_map in new access map. For example with the following source code (source code of the test case) : ``` void mse(double A[Ni], double B[Nj]) { int i,j; double tmp = 6; for (i = 0; i < Ni; i++) { for (int j = 0; j<Nj; j++) { tmp = tmp + 2; } B[i] = tmp; } } ``` Polly gives us the following statements and memory accesses : ``` Statements { Stmt_for_body Domain := { Stmt_for_body[i0] : 0 <= i0 <= 9999 }; Schedule := { Stmt_for_body[i0] -> [i0, 0, 0] }; ReadAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_body[i0] -> MemRef_tmp_04__phi[] }; MustWriteAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_body[i0] -> MemRef_tmp_11__phi[] }; Instructions { %tmp.04 = phi double [ 6.000000e+00, %entry.split ], [ %add.lcssa, %for.end ] } Stmt_for_inc Domain := { Stmt_for_inc[i0, i1] : 0 <= i0 <= 9999 and 0 <= i1 <= 9999 }; Schedule := { Stmt_for_inc[i0, i1] -> [i0, 1, i1] }; MustWriteAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_inc[i0, i1] -> MemRef_tmp_11__phi[] }; ReadAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_inc[i0, i1] -> MemRef_tmp_11__phi[] }; MustWriteAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_inc[i0, i1] -> MemRef_add_lcssa__phi[] }; Instructions { %tmp.11 = phi double [ %tmp.04, %for.body ], [ %add, %for.inc ] %add = fadd double %tmp.11, 2.000000e+00 %exitcond = icmp ne i32 %inc, 10000 } Stmt_for_end Domain := { Stmt_for_end[i0] : 0 <= i0 <= 9999 }; Schedule := { Stmt_for_end[i0] -> [i0, 2, 0] }; MustWriteAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_end[i0] -> MemRef_tmp_04__phi[] }; ReadAccess := [Reduction Type: NONE] [Scalar: 1] { Stmt_for_end[i0] -> MemRef_add_lcssa__phi[] }; MustWriteAccess := [Reduction Type: NONE] [Scalar: 0] { Stmt_for_end[i0] -> MemRef_B[i0] }; Instructions { %add.lcssa = phi double [ %add, %for.inc ] store double %add.lcssa, double* %arrayidx, align 8 %exitcond5 = icmp ne i64 %indvars.iv.next, 10000 } } ``` and the following dependences : ``` { Stmt_for_inc[i0, 9999] -> Stmt_for_end[i0] : 0 <= i0 <= 9999; Stmt_for_inc[i0, i1] -> Stmt_for_inc[i0, 1 + i1] : 0 <= i0 <= 9999 and 0 <= i1 <= 9998; Stmt_for_body[i0] -> Stmt_for_inc[i0, 0] : 0 <= i0 <= 9999; Stmt_for_end[i0] -> Stmt_for_body[1 + i0] : 0 <= i0 <= 9998 } ``` When trying to expand this memory access : ``` { Stmt_for_inc[i0, i1] -> MemRef_tmp_11__phi[] }; ``` The new access map would look like this : ``` { Stmt_for_inc[i0, 9999] -> MemRef_tmp_11__phi_exp[i0] : 0 <= i0 <= 9999; Stmt_for_inc[i0, i1] ->MemRef_tmp_11__phi_exp[i0, 1 + i1] : 0 <= i0 <= 9999 and 0 <= i1 <= 9998 } ``` The idea to implement the expansion for PHI access is an idea from @Meinersbur and I don't understand why my implementation does not work. I should have miss something in the understanding of the idea. Contributed by: Nicolas Bonfante <nicolas.bonfante@gmail.com> Reviewers: Meinersbur, simbuerg, bollu Reviewed By: Meinersbur Subscribers: llvm-commits, pollydev, Meinersbur Differential Revision: https://reviews.llvm.org/D36647 llvm-svn: 311619	2017-08-24 00:04:45 +00:00
Michael Kruse	06ed529205	Add more statistics. Add statistics about - Which optimizations are applied - Number of loops in Scops at various stages - Number of scalar/singleton writes at various stages representative for scalar false dependencies - Number of parallel loops These will be useful to find regressions due to moving Polly further down of LLVM's pass pipeline. Differential Revision: https://reviews.llvm.org/D37049 llvm-svn: 311553	2017-08-23 13:50:30 +00:00
Michael Kruse	7fac28fa4f	[ScopDetect] Include zero-iteration loops in loop count. Loop with zero iteration are, syntactically, loops. They have been excluded from the loop counter even for the non-profitable counters. This seems to be unintentially as the sentinel value of '0' minimal iterations does exclude such loops. Fix by never considering the iteration count when the sentinel value of 0 is found. This makes the recently added NumTotalLoops couter redundant with NumLoopsOverall, which now is equivalent. Hence, NumTotalLoops is removed as well. Note: The test case 'ScopDetect/statistics.ll' effectively does not check profitability, because -polly-process-unprofitable is passed to all test cases. llvm-svn: 311551	2017-08-23 13:29:59 +00:00
Michael Kruse	99fba1fd52	[ScopInliner] Fix hidden overload warning. NFC. By exposing the the hidden member, but as private. llvm-svn: 311550	2017-08-23 13:07:43 +00:00
Michael Kruse	a1579aab46	[MaximumStaticExpansion] Avoid warning in release builds. Conditionally compile function only used in an assert(). llvm-svn: 311549	2017-08-23 12:50:02 +00:00
Michael Kruse	3044dc51cf	[PPCGCodeGen] Fix compiler warning: '<': signed/unsigned mismatch. NFC. MSVC warns about comparison between a signed and unsigned integer. The rules of C(++) define that an unsigned comparison has to be carried-out in this case. This is unlikely to be intended. Fix by assigning the loop's upper bound to a signed integer first. This also avoids repeated evaluation of the invariant upper bound. llvm-svn: 311548	2017-08-23 12:45:25 +00:00
Michael Kruse	594386e773	[ScopInfo] Remove stray semicolon. NFC. llvm-svn: 311547	2017-08-23 12:34:37 +00:00
Tobias Grosser	d680edfb98	Move include/isl-noexceptions.h to include/isl/isl-noexceptions.h llvm-svn: 311504	2017-08-22 22:04:22 +00:00
Jakub Kuderski	0ac1e585fc	[polly] Fix ScopDetectionDiagnostic test failure caused by r310940 Summary: ScopDetection used to check if a loop withing a region was infinite and emitted a diagnostic in such cases. After r310940 there's no point checking against that situation, as infinite loops don't appear in regions anymore. The test failure was observed on these two polly buildbots: http://lab.llvm.org:8011/builders/polly-arm-linux/builds/8368 http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/10310 This patch XFAILs `ReportLoopHasNoExit.ll` and turns infinite loop detection into an assert. Reviewers: grosser, sanjoy, bollu Reviewed By: grosser Subscribers: efriedma, aemerson, kristof.beyls, dberlin, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D36776 llvm-svn: 311503	2017-08-22 22:01:53 +00:00
Tobias Grosser	4a07bbe3f6	[IRBuilder] Only emit alias scop metadata for arrays, but not scalars Summary: There is no need to emit alias metadata for scalars, as basicaa will easily distinguish them from arrays. This reduces the size of the metadata we generate. This is especially useful after we moved to -polly-position=before-vectorizer, where a lot more scalar dependences are introduced, which increased the size of the alias analysis metadata and made us commonly reach the limits after which we do not emit alias metadata that have been introduced to prevent quadratic growth of this alias metadata. This improves 2mm performance from 1.5 seconds to 0.17 seconds. Reviewers: Meinersbur, bollu, singam-sanjay Reviewed By: Meinersbur Subscribers: pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D37028 llvm-svn: 311498	2017-08-22 21:58:48 +00:00
Eugene Zelenko	0c4c2ce0b0	[Polly] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 311489	2017-08-22 21:25:51 +00:00
Roman Gareev	6bfeba24d3	[NFC] Fix the broken comment. llvm-svn: 311477	2017-08-22 17:43:03 +00:00
Roman Gareev	0956a606ff	Disable the Loop Vectorizer in case of GEMM Currently, in case of GEMM and the pattern matching based optimizations, we use only the SLP Vectorizer out of two LLVM vectorizers. Since the Loop Vectorizer can get in the way of optimal code generation, we disable the Loop Vectorizer for the innermost loop using mark nodes and emitting the corresponding metadata. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D36928 llvm-svn: 311473	2017-08-22 17:38:46 +00:00
Michael Kruse	5b228bbb12	[ScopDetection] Add stat for total number of loops. The total number of loops is useful as a baseline comparing how many loops have been optimized in different configurations. llvm-svn: 311469	2017-08-22 17:09:51 +00:00
Siddharth Bhat	cb5155bf6d	[ManagedMemoryRewrite] Use `unit64_t` to store size, not `int`. llvm-svn: 311440	2017-08-22 09:30:37 +00:00
Siddharth Bhat	603544863f	[ManagedMemoryRewrite] Get size in bytes rather than in bits and dividing by 8. llvm-svn: 311439	2017-08-22 09:27:41 +00:00
Tobias Grosser	6683c81af8	test/GPGPU/invalid-kernel-assert-verifymodule.ll also requires assertions llvm-svn: 311423	2017-08-22 03:12:29 +00:00
Michael Kruse	ade14269cd	[DeLICM] Fix unused zone for writes without in-between read. The implementation of computeArrayUnused did not consider writes without reads before, except for the first write in the SCoP. This caused it to 'forget' writes directly following another write. This patch re-adds the entire reaching defintion of a write that has not been covered before by a read. This fixes Polybench 4.2 2mm where only one of the matrix-multiplication was detected. llvm-svn: 311403	2017-08-21 23:04:45 +00:00
Siddharth Bhat	a8c329b0eb	[ManagedMemoryRewrite] slightly tweak debug output style. [NFC] llvm-svn: 311361	2017-08-21 18:58:33 +00:00
Siddharth Bhat	557ce3a8b0	[ManagedMemoryRewrite] Print reasons for skipping global array to dbgs(). [NFC] llvm-svn: 311360	2017-08-21 18:52:15 +00:00
Tobias Grosser	0dd42512ff	[ZoneAlgorithm] Move computeScalarReachingDefinition to c++ llvm-svn: 311336	2017-08-21 14:19:40 +00:00
Siddharth Bhat	0a198dc18a	[ManagedMemoryRewrite] hide debug output behing DEBUG(...). [NFC] llvm-svn: 311331	2017-08-21 12:51:57 +00:00
Siddharth Bhat	7bc77e87c8	[ScopInfo] Add option to treat all function parameters as dereferencible. Dragonegg generates most function parameters as pointers to the actual parameters. However, it does not mark these parameters with the dereferencable attribute. Polly is conservative when it comes to invariant load hoisting, thus we add runtime checks to invariant load hoisted pointers when we do not know that pointers are dereferencable. This is correct behaviour, but is a performance penalty. Add a flag that allows all pointer parameters to be dereferencable. That way, polly can speculatively load-hoist paramters to functions without runtime checks. Differential Revision: https://reviews.llvm.org/D36461 llvm-svn: 311329	2017-08-21 11:57:04 +00:00
Siddharth Bhat	7b9f5ca27e	[PPCGCodeGeneration] Enable `polly-codegen-perf-monitoring` for PPCGCodegen. This feature was not enabled for `PPCGCodeGeneration`. Now that this is enabled, we can benchmark Scops that have been optimised with `-polly-codegen-ppcg` with the `-polly-codegen-perf-monitoring` option. Differential Revision: https://reviews.llvm.org/D36934 llvm-svn: 311328	2017-08-21 11:44:01 +00:00
Tobias Grosser	b09bd74da8	[GPGPU] Add llvm.powi to the libdevice supported functions These intrinsics are used in COSMO. llvm-svn: 311324	2017-08-21 09:52:08 +00:00
Tobias Grosser	5170b6627a	[GPGPU] Add log / logf to the libdevice supported functions These two functions are used in COSMO llvm-svn: 311322	2017-08-21 09:00:31 +00:00

... 2 3 4 5 6 ...

2903 Commits