llvm-project

Commit Graph

Author	SHA1	Message	Date
Siddharth Bhat	9aca1cb519	[NFC] [PPCGCodeGen] Add missing REQUIRES: pollyacc line. llvm-svn: 310354	2017-08-08 12:26:37 +00:00
Siddharth Bhat	83fe6b546d	[ScopInfo] [NFC] Typo fix. "to conservative" -> "too conservative". llvm-svn: 310353	2017-08-08 12:26:32 +00:00
Siddharth Bhat	71dfb3eb07	[Polly] [PPCGCodeGeneration] Handle failing of invariant load hoisting gracefully. To do this, we replicate what `CodeGeneration` does. We expose `markNodeUnreachable` from `CodeGeneration` to `PPCGCodeGeneration`. Differential Revision: https://reviews.llvm.org/D36457 llvm-svn: 310350	2017-08-08 12:00:59 +00:00
Michael Kruse	27c010a22e	[DeLICM] Properly handle PHI writes becoming empty partial writes. It is possible that partial writes are empty (the write is never executed). This happens when a PHINode's incoming edge is never taken, such that the incoming write becomes an empty partial write (if partial writes are enabled). The issue is that when converting the union_map to a map, its space cannot be derived from the union_map itself. Rather, we need to determine its space independently. This fixes test-suite's MultiSource/Benchmarks/ASC_Sequoia/CrystalMk. llvm-svn: 310348	2017-08-08 11:27:12 +00:00
Siddharth Bhat	8ff723dcf1	[NFC] [GPUJIT] Print line number & size information on allocateMemoryForDeviceCuda failure - It's useful to know the amount of memory asked for since, for example, asking for `0` bytes of memory is illegal. - Line number is helpful since we print the same message in the function at different points. llvm-svn: 310340	2017-08-08 09:03:27 +00:00
Tobias Grosser	327e9ecb0d	[ScheduleOptimizer] Make matmul pattern detection work with delicm output In certain cases delicm might decide to not leave the original array write in the loop body, but to remove it and instead leave a transformed phi node as write access. This commit teaches the matmul pattern detection to order the memory accesses according to when the access actually happens and use this information to detect the new pattern. This makes pattern based matmul optimization work for 2mm and 3mm in polybench 4 after polly-position=before-vectorizer has been enabled. llvm-svn: 310338	2017-08-08 06:15:15 +00:00
Tobias Grosser	50206d8f57	Change Polly's position to "before-vectorizer" Polly has traditionally always been executed at the beginning of the pass pipeline as LLVM's inliner and LICM passes introduced plenty of scalar dependences which prevented any kind of useful high-level loop optimizations later in the pass pipeline. With DeLICM now being available, Polly can also run optimizations when folded into the pass pipeline. This has the benefit that Polly should now be more effective on C++ code and as an additional bonus, no additional early canonicalization phase must be run. As a result, Polly touches the code only if it applies a transformation. Code that does not benefit from Polly is not touched and consequently will have the very same execution time as without Polly enabled. Random performance changes, as could sometimes be observed with polly-position=early, are consequently not possible any more. If performance is changed, this is due to Polly choosing to perform a transformation. If this choice is wrong, it can be fixed directly in Polly. http://polly.llvm.org/docs/Architecture.html#polly-in-the-llvm-pass-pipeline llvm-svn: 310319	2017-08-07 22:33:34 +00:00
Tobias Grosser	736c44c848	[test] Add some missing options that become necessary after the recent default changes llvm-svn: 310315	2017-08-07 22:10:23 +00:00
Tobias Grosser	32f64ed22b	[DeLICM] Enable partial writes This allows us to remove more scalar dependences. While this feature is still rather experimental, we want to give it sufficient test coverage. llvm-svn: 310314	2017-08-07 22:06:07 +00:00
Tobias Grosser	ad73f6a7b3	Enable delicm to automatically remove scalar loop carried dependences While this code is still rather experimental, we enable it by default to get better test coverage. llvm-svn: 310313	2017-08-07 22:04:20 +00:00
Tobias Grosser	a98081c9f5	[test] Add one more test case for the previous commit llvm-svn: 310312	2017-08-07 22:02:06 +00:00
Tobias Grosser	2ef378120d	[ZoneAlgo] Allow two writes that write identical values into same array slot Two write statements which write into the very same array slot generally are conflicting. However, in case the value that is written is identical, this does not cause any problem. Hence, allow such write pairs in this specific situation. llvm-svn: 310311	2017-08-07 22:01:29 +00:00
Andreas Simbuerger	81fb6b3e40	[Polly] Fully-Indexed static expansion This commit implements the initial version of fully-indexed static expansion. ``` for(int i = 0; i<Ni; i++) for(int j = 0; j<Ni; j++) S: B[j] = j; T: A[i] = B[i] ``` After the pass, we want this : ``` for(int i = 0; i<Ni; i++) for(int j = 0; j<Ni; j++) S: B[i][j] = j; T: A[i] = B[i][i] ``` For now we bail (fail) in the following cases: - Scalar access - Multiple writes per SAI - MayWrite Access - Expansion that leads to an access to the original array Furthermore: We still miss checks for escaping references to the array base pointers. A future commit will add the missing escape-checks to stay correct in those cases. The expansion is still locked behind a CLI-Option and should not yet be used. Patch contributed by: Nicholas Bonfante <bonfante.nicolas@gmail.com> Reviewers: simbuerg, Meinersbur, bollu Reviewed By: Meinersbur Subscribers: mgorny, llvm-commits, pollydev Differential Revision: https://reviews.llvm.org/D34982 llvm-svn: 310304	2017-08-07 20:54:20 +00:00
Tobias Grosser	d70ea7fed0	[GPGPU] Remove redundant constructors llvm-svn: 310284	2017-08-07 19:20:57 +00:00
Michael Kruse	70af4f579d	[ForwardOpTree] Use known array content analysis to forward load instructions. This is an addition to the -polly-optree pass that reuses the array content analysis from DeLICM to find array elements that contain the same value as the value loaded when the target statement instance is executed. The analysis is now enabled by default. The known content analysis could also be used to rematerialize any llvm::Value that was written to some array element, but currently only loads are forwarded. Differential Revision: https://reviews.llvm.org/D36380 llvm-svn: 310279	2017-08-07 18:40:29 +00:00
Tobias Grosser	305d3164f2	[ScopInfo] Make Scop::canAlwaysBeHoisted a member function llvm-svn: 310236	2017-08-07 00:10:11 +00:00
Tobias Grosser	e69b272260	[ScopInfo] Move Scop::addInvariantLoads to isl++ [NFC] llvm-svn: 310235	2017-08-06 23:50:25 +00:00
Tobias Grosser	61bd3a4840	[ScopInfo] Move Scop::getPwAffOnly to isl++ [NFC] llvm-svn: 310231	2017-08-06 21:42:38 +00:00
Tobias Grosser	31df6f31c0	[ScopInfo] Move Scop::getDomains to isl++ [NFC] llvm-svn: 310230	2017-08-06 21:42:25 +00:00
Tobias Grosser	04ec2eb8c9	[ScopInfo] Move Scop::getInvalidContext to isl++ [NFC] llvm-svn: 310229	2017-08-06 21:42:16 +00:00
Tobias Grosser	e127033f98	[ScopInfo] Move Scop::getAssumedContext to isl++ [NFC] llvm-svn: 310228	2017-08-06 21:42:09 +00:00
Tobias Grosser	232fdad4f2	[ScopInfo] Move Scop::addNonEmptyDomainConstraints to isl++ [NFC] llvm-svn: 310225	2017-08-06 20:19:26 +00:00
Tobias Grosser	b65ccc4302	[ScopInfo] Translate Scop::getParamSpace to isl++ [NFC] llvm-svn: 310224	2017-08-06 20:11:59 +00:00
Tobias Grosser	8ea1fc19b3	[ScopInfo] Translate Scop::getContext to isl++ [NFC] llvm-svn: 310221	2017-08-06 19:52:38 +00:00
Tobias Grosser	9a63570b13	[ScopInfo] Translate Scop::getIdForParam to isl++ [NFC] llvm-svn: 310220	2017-08-06 19:31:27 +00:00
Tobias Grosser	5ab39ff224	[ScopInfo] Move get*Writes/getReads/getAccesses to isl++ llvm-svn: 310219	2017-08-06 19:22:27 +00:00
Tobias Grosser	b2e6598a7f	Remove functional changes that sneaked in by accident in r308892 llvm-svn: 310218	2017-08-06 18:59:19 +00:00
Tobias Grosser	132860afe5	[ScopInfo] Move ScopStmt::setAstBuild/getAstBuild to isl++ llvm-svn: 310216	2017-08-06 17:53:04 +00:00
Tobias Grosser	6ad1640a1d	[ScopInfo] Move ScopStmt::getSchedule to isl++ llvm-svn: 310215	2017-08-06 17:45:28 +00:00
Tobias Grosser	2f3041fc6a	[ScopInfo] Move getPredecessorDomainConstraints to isl++ [NFC] llvm-svn: 310214	2017-08-06 17:31:38 +00:00
Tobias Grosser	d16f927781	[ScopInfo] Move InvariantAccess to isl++ [NFC] llvm-svn: 310213	2017-08-06 17:25:14 +00:00
Tobias Grosser	dfd20b7949	[ScopInfo] Update comments to refer to isl++ [NFC] llvm-svn: 310212	2017-08-06 17:25:09 +00:00
Tobias Grosser	27db02b247	[ScopInfo] Move ScopArrayInfo::ScopArrayInfo to isl++ [NFC] llvm-svn: 310211	2017-08-06 17:25:05 +00:00
Tobias Grosser	85048eff1a	[ScopInfo] Move ScopStmt::ScopStmt to isl++ [NFC] llvm-svn: 310210	2017-08-06 17:24:59 +00:00
Tobias Grosser	dcf8d696ff	Move ScopInfo::getDomain(), getDomainSpace(), getDomainId() to isl++ llvm-svn: 310209	2017-08-06 16:39:52 +00:00
Tobias Grosser	a9b5bbac78	Move ScopStmt::Domain to isl++ llvm-svn: 310207	2017-08-06 16:11:53 +00:00
Tobias Grosser	cb0224ad59	Update to a newer version of isl++ llvm-svn: 310206	2017-08-06 15:56:45 +00:00
Tobias Grosser	8b40f8c6c7	Update to isl-0.18-812-g565da6e This update is mostly a maintenance update, but also exposes a couple of new functions that will be needed for the next version of the isl++ bindings. llvm-svn: 310205	2017-08-06 15:51:16 +00:00
Tobias Grosser	bfee458d0f	[Scopinfo] Fix memory corruption issue that sneaked into the previous commit llvm-svn: 310204	2017-08-06 15:47:04 +00:00
Tobias Grosser	2332fa3604	[ScopInfo] Move InvalidDomain to isl++ [NFC] llvm-svn: 310203	2017-08-06 15:36:48 +00:00
Tobias Grosser	2b7479b1af	[Polly] Fix for the JSON Exporter Summary: Small patch to fix the JSON exporter. Currently, using "opt -polly-export-jscop" does not generate jscop files, but gives an error: * Error in `opt': corrupted double-linked list: 0x0000000000bc4bb0 * Updated the function getAccessRelationStr() to work with the current version of getAccessRelation(), fixing the JSON exporter Reviewers: bollu, grosser Reviewed By: grosser Subscribers: grosser, llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D36370 llvm-svn: 310199	2017-08-06 11:41:10 +00:00
Tobias Grosser	aabfbfa5fc	Add missing 'REQUIRES: pollyacc' line llvm-svn: 310197	2017-08-06 11:21:09 +00:00
Tobias Grosser	b99c11710c	[GPGPU] Make sure managed arrays are prepared at the beginning of the scop Summary: This resolves some "instruction does not dominate use" errors, as we used to prepare the arrays at the location of the first kernel, which not necessarily dominated all other kernel calls. Reviewers: Meinersbur, bollu, singam-sanjay Subscribers: nemanjai, pollydev, llvm-commits, kbarton Differential Revision: https://reviews.llvm.org/D36372 llvm-svn: 310196	2017-08-06 11:10:38 +00:00
Tobias Grosser	5b307cdb8a	[GPGPU] Rename all, not only the first libdevice function llvm-svn: 310194	2017-08-06 03:04:15 +00:00
Siddharth Bhat	e53c924b0f	[Polly] [PPCGCodeGeneration] Deal with loops outside the Scop correctly in PPCGCodeGeneration. A Scop with a loop outside it is not handled currently by PPCGCodeGeneration. The test case is such that the Scop has only one inner loop that is detected. This currently breaks codegen. The fix is to reuse the existing mechanism in `IslNodeBuilder` within `GPUNodeBuilder`. Differential Revision: https://reviews.llvm.org/D36290 llvm-svn: 310193	2017-08-06 02:39:05 +00:00
Siddharth Bhat	0caed1fbe6	[IslNodeBuilder] [NFC] Refactor creation of loop induction variables of loops outside scops. This logic is duplicated, so we refactor it into a separate function. This will be used in a later patch to teach PPCGCodeGen code generation for loops that are outside the scop. Differential Revision: https://reviews.llvm.org/D36310 llvm-svn: 310192	2017-08-06 02:07:11 +00:00
Tobias Grosser	f2068ef7dd	[Polly] Fix typo. NFC. Reviewers: grosser, Meinersbur, bollu Differential Revision: https://reviews.llvm.org/D36356 llvm-svn: 310187	2017-08-05 20:03:13 +00:00
Tobias Grosser	f9308489eb	Add forgotten CMakeLists.txt file in unit-test llvm-svn: 310177	2017-08-05 09:44:11 +00:00
Tobias Grosser	00f25d0915	Fix spelling error in previous commit llvm-svn: 310176	2017-08-05 09:39:00 +00:00
Tobias Grosser	feae3dfe9f	[unittests] Add unittest for getPartialTilePrefixes In https://reviews.llvm.org/D36278 it was pointed out that the behavior of getPartialTilePrefixes is not very well understood. To allow for a better understanding, we first provide some basic unittests. llvm-svn: 310175	2017-08-05 09:38:09 +00:00
Michael Kruse	138a3fbae1	[DeLICM] Refactor ZoneAlgorithm into ZoneAlgo.cpp. NFC. Extract ZoneAlgorithm from DeLICM.cpp into its own file. It will gain a second use by the load forwarding part of -polly-optree. llvm-svn: 310146	2017-08-04 22:51:23 +00:00
Siddharth Bhat	638316da5b	[PPCGCodeGeneration] [NFC] Log every location from which PPCGCodegen bails. This is useful when trying to understand why no GPU code was produced. Differential Revision: https://reviews.llvm.org/D36318 llvm-svn: 310103	2017-08-04 19:36:40 +00:00
Michael Kruse	a9a7086319	[ForwardOpTree] Refactor out forwardSpeculatable(). NFC. The method forwardSpeculatable forwards speculatively executable instructions and is currently the only way to forward an instruction. In the future we intend to add more methods. llvm-svn: 310056	2017-08-04 12:28:42 +00:00
Philip Pfaffe	96d2143f20	[PM] Make the new-pm passes behave more like the legacy passes Summary: Testing the new-pm passes becomes much easier once they behave more like the old passes in terms of the order in which Scops are processed and printed. This requires three changes: - ScopInfo: Use an ordered map to store scops - ScopInfo: Iterate and print Scops in reverse order to match legacy PM behaviour - ScopDetection: print function name in ScopAnalysisPrinter Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D36303 llvm-svn: 310052	2017-08-04 11:28:51 +00:00
Philip Pfaffe	6ea444e671	[NFC] Fix r310036: Appease clang-format llvm-svn: 310039	2017-08-04 08:26:45 +00:00
Philip Pfaffe	b24beb6f46	[NFC] ScopPass: Remove unused AnalysisKey from OwningInnerAnalysisManagerProxy llvm-svn: 310036	2017-08-04 08:12:31 +00:00
Michael Kruse	1046aa3148	[VirtualInstruction] Handle MetadataAsValue as constant. The compilation of bspatch.cc on the AOSP buildbot currently fails, presumably because of the occurrence of a MetadataAsValue in an operand. This kind of value can occur as operands of intrinsics, the typical example being the debug intrinsics. Polly currently ignores the debug intrinsics and it is not yet clear which other intrinsics might occur. For such cases, and to unbreak the AOSP buildbot, treat a MetadataAsValue as a constant because it can be referenced without modification in generated code. llvm-svn: 309992	2017-08-03 22:00:01 +00:00
Michael Kruse	672c011460	[VirtualInstruction] Avoid use of getStmtFor(BB). NFC. With this patch, we get rid of the last use of getStmtFor(BB). Here this is done by getting the last statement of the incoming block in case the user is a phi node; otherwise just fetching the statement comprising the instruction for which the virtual use is being created. Differential Revision: https://reviews.llvm.org/D36268 llvm-svn: 309947	2017-08-03 15:27:00 +00:00
Tobias Grosser	c1cfe0a828	Add missing REQUIRES line llvm-svn: 309943	2017-08-03 14:46:53 +00:00
Tobias Grosser	b5563c6817	Make sure that all parameter dimensions are set in schedule Summary: In case the option -polly-ignore-parameter-bounds is set, not all parameters will be added to context and domains. This is useful to keep the size of the sets and maps we work with small. Unfortunately, for AST generation it is necessary to ensure all parameters are part of the schedule tree. Hence, we modify the GPGPU code generation to make sure this is the case. To obtain the necessary information we expose a new function Scop::getFullParamSpace(). We also make a couple of functions const to be able to make SCoP::getFullParamSpace() const. Reviewers: Meinersbur, bollu, gareevroman, efriedma, huihuiz, sebpop, simbuerg Subscribers: nemanjai, kbarton, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D36243 llvm-svn: 309939	2017-08-03 13:51:15 +00:00
Michael Kruse	291fd8074e	[test] Fix test case without Polly-ACC. llvm-svn: 309938	2017-08-03 13:44:31 +00:00
Siddharth Bhat	eadf76d34a	[PPCGCodeGeneration] Construct `isl_multi_pw_aff` of PPCGArray.bounds even when polly-ignore-parameter-bounds is turned on. When we have `-polly-ignore-parameter-bounds`, `Scop::Context` does not contain all the parameters present in the program. The construction of the `isl_multi_pw_aff` requires all the individual `pw_aff` to have the same parameter dimensions. To achieve this, we used to realign every `pw_aff` with `Scop::Context`. However, in conjunction with `-polly-ignore-parameter-bounds`, this is now incorrect, since `Scop::Context` does not contain all parameters. We set this up correctly by creating a space that has all the parameters used by all the `isl_pw_aff`. Then, we realign all `isl_pw_aff` to this space. llvm-svn: 309934	2017-08-03 12:09:33 +00:00
Tobias Grosser	a195576118	Enable simplify and forward-op-tree by default These passes have been tested over the last month and should generally help to remove scalar data dependences in Polly. We enable them to give them even wider test coverage. Large performance regressions and any kind of correctness regressions are not expected. llvm-svn: 309878	2017-08-02 20:12:27 +00:00
Tobias Grosser	7b45af13ce	Move setNewAccessRelation to isl++ llvm-svn: 309871	2017-08-02 19:27:25 +00:00
Tobias Grosser	6d58804cc2	Move ScopStmt::setAccessRelation to isl++ llvm-svn: 309870	2017-08-02 19:27:16 +00:00
Tobias Grosser	18ca9e5119	Replace asserts with llvm_unreachable to clarify intent llvm-svn: 309856	2017-08-02 19:11:46 +00:00
Philip Pfaffe	33aef072c1	Fix r309826: Appease clang-format check. llvm-svn: 309853	2017-08-02 18:26:48 +00:00
Singapuram Sanjay Srivallabh	1f9ab16c4e	Fix code format on r309826 Summary: Fix code format on r309826 / D35458 Reviewers: grosser, bollu Reviewed By: grosser Subscribers: pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D36232 llvm-svn: 309845	2017-08-02 17:56:39 +00:00
Philip Pfaffe	8f1872fb27	Fix r309826: Move instantiation and specialization of OwningScopAnalysisManagerFunctionProxy to the polly namespace. When compiling with clang, explicit instantiation of the OwningScopAnalysisManagerFunctionProxy needs to happen within the polly namespace. Same goes with the specialization of its run method. llvm-svn: 309835	2017-08-02 17:25:45 +00:00
Philip Pfaffe	a70e2649ab	[Polly][PM][WIP] Polly pass registration Summary: This patch is a first attempt at registering Polly passes with the LLVM tools. Tool plugins are still unsupported, but this registration is usable from the tools if Polly is linked into them (albeit requiring minimal patches to those tools). Registration requires a small amount of machinery (the owning analysis proxies), necessary for injecting ScopAnalysisManager objects into the calling tools. This patch is marked WIP because the registration is incomplete. Parsing manual pipelines is fully supported, but default pass injection into the O3 pipeline is lacking, mostly because there is opportunity for some redesign here, I believe. The first point of order would be insertion points. I think it makes sense to run before the vectorizers. Running Polly Early, however, is weird. Mostly because it actually is the default (which to me is unexpected), and because Polly runs its own O1 pipeline. Why not instead insert it at an appropriate place somewhere after simplification happened? Running after the loop optimizers seems intuitive, but it also seems wasteful, since multiple consecutive loops might well be a single scop, and we don't need to run for all of them. My second request for comments would be regarding all those smallish helper passes we have, like PollyViewer, PollyPrinter, PollyImportJScop. Right now these are controlled by command line options, deciding whether they should be part of the Polly pipeline. What is your opinion on treating them like real passes, and have the user write an appropriate pipeline if they want to use any of them? Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D35458 llvm-svn: 309826	2017-08-02 15:52:25 +00:00
Singapuram Sanjay Srivallabh	188053af5e	Remove debug metadata from copied instruction to prevent GPUModule verification failure Summary: Remove debug metadata from instruction to be copied to prevent the source file's debug metadata being copied into GPUModule and eventually failing Module verification and ASM string codegeneration. When copying the instruction onto the Module meant for the GPU, debug metadata attached to an instruction causes all related metadata to be pulled into the Module, including the DICompileUnit, which is not listed in llvm.dbg.cu of the Module. This fails the verification of the Module and generation of the ASM string. The only debug metadata of the instruction, the DebugLoc, is unset by this patch. This patch reattempts https://reviews.llvm.org/D35630 by targeting only those instructions that are to end up in a Module meant for the GPU. Reviewers: grosser, bollu Reviewed By: grosser Subscribers: pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D36161 llvm-svn: 309822	2017-08-02 15:20:07 +00:00
Philip Pfaffe	f081ec7609	[PM] Fix proxy invalidation Summary: I made a mistake in handling transitive invalidation of analysis results. I've updated the list of preserved analyses as well as the correct result dependences. The Invalidator passed through the invalidate() path can be used to transitively invalidate analyses. It frequently happens that analysis results depend on other analyses, and thus store references to their results. When the dependee now gets invalidated, the depender needs to be invalidated as well. This is the purpose of the Invalidator object, which can be used to check whether some dependee analysis is in the process of being invalidated. I originally was checking the wrong dependee analyses, which is an actual error, you can only check analysis results that are in the cache (which they are if you've captured their reference). The invalidation I'm handling inside the proxy deals with the standard analyses the proxy passes into the Scop pipeline, since I'm capturing their reference. This checking allows us to actually preserve a couple of results outside of the proxy, since the Scop pipeline shouldn't break those, or otherwise should update them accordingly. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D36216 llvm-svn: 309811	2017-08-02 13:18:49 +00:00
Siddharth Bhat	f23bb4a8ba	[GPUJIT] Add GPUJIT APIs for allocating and freeing managed memory. We introduce `polly_mallocManaged` and `polly_freeManaged` as proxies for `cudaMallocManaged` / `cudaFree`. This is currently not used by Polly. It is auxiliary code that is used in `COSMO`. This is useful because `polly_mallocManaged` matches the signature of `malloc`, while `cudaMallocManaged` does not. We introduce `polly_freeManaged` for symmetry. We use this in COSMO to use the unified memory feature of the newer CUDA APIs (>= 6). Differential Revision: https://reviews.llvm.org/D35991 llvm-svn: 309808	2017-08-02 12:23:22 +00:00
Philip Pfaffe	ead67dbbd6	[SI][NewPM] Collect loop count statistics llvm-svn: 309807	2017-08-02 11:14:41 +00:00
Philip Pfaffe	f5a4394ad6	[SD] Set PollyUseRuntimeAliasChecks correctly llvm-svn: 309805	2017-08-02 11:08:01 +00:00
Siddharth Bhat	b1a52abd87	[GPUJIT] Teach GPUJIT to use a pre-existing CUDA context if available. On mixing the driver and runtime APIs, it is quite possible that a context already exists due to runtime API usage. In this case, Polly should try to use the same context. This patch teaches GPUJIT to detect that a context exists and how to pick up this context. Without this, calling `cudaMallocManaged`, for example, before a polly-generated kernel launch causes P100 to hang. This is a part of (https://reviews.llvm.org/D35991) that was extracted out. Differential Revision: https://reviews.llvm.org/D36162 llvm-svn: 309802	2017-08-02 09:19:42 +00:00
Michael Kruse	fd35089689	[ForwardOpTree] Execute canForwardTree also in release builds. Commit r309730 moved the call to canForwardTree into an assert(), even though this function has side-effects if its DoIt parameter is true. To avoid a warning in release builds, do an (void)Execution of its result instead. To avoid such confusion in the future, rename canForwardTree() to forwardTree(). llvm-svn: 309753	2017-08-01 22:15:04 +00:00
Michael Kruse	bc88a78cb4	[Simplify] Rewrite redundant write detection algorithm. The previous algorithm was to search for a write and the source of its value operand, and see whether the write just stores the same read value back, which includes a search whether there is another write access between them. This is O(n^2) in the max number of accesses in a statement (+ the complexity of isl comparing the access functions). The new algorithm is more similar to the one used for searching for overwrites and coalescable writes. It scans over all accesses in order of execution while tracking which array elements still have the same value since it was read. This is O(n), not counting the complexity within isl. It should be more reliable than trying to catch all non-conforming cases in the previous approach. It is also less code. We now also support if the write is a partial write of the read's domain, and to some extent non-affine subregions. Differential Revision: https://reviews.llvm.org/D36137 llvm-svn: 309734	2017-08-01 20:01:34 +00:00
Reid Kleckner	859c1e606a	Silence -Wunused-variable warning in NDEBUG builds llvm-svn: 309730	2017-08-01 19:53:01 +00:00
Michael Kruse	693ef99935	[Simplify] Improve scalability. With a lot of reads and writes to the same array in a statement, some isl sets that capture the state between accesses can become complex such that isl takes considerably more time and memory for operations on them. The problems identified were: - is_subset() takes considerable time with many disjoints in the arguments. We limit the number of disjoints to 4, any additional information is thrown away. - subtract() can lead to many disjoints. We instead assume that any array element is possibly accessed, which removes all disjoints. - subtract_domain() may lead to considerable processing, even if all elements are to be removed. Instead, we determine and remove the affected spaces manually. No behaviour is changed. llvm-svn: 309728	2017-08-01 19:39:11 +00:00
Tobias Grosser	e327eebccb	Update to isl-0.18-809-gd5b4535 This fixes some undefined behavior in the isl schedule tree code. llvm-svn: 309727	2017-08-01 19:37:50 +00:00
Siddharth Bhat	1ec9cba4e3	[NFC] Add 'REQUIRES: pollyacc' on 'test/GPGPU/invariant-load-hoisting-of-array.ll' - Should fix broken build due to `r309681`. llvm-svn: 309686	2017-08-01 14:52:18 +00:00
Siddharth Bhat	442e722c1e	[GPUJIT] Call `cuProfilerStop` before destroying the context to flush profiler cache. This is necessary to get accurate traces from `nvprof` / `nvcc`. Otherwise, we lose some profiling information. Differential Revision: https://reviews.llvm.org/D35940 llvm-svn: 309682	2017-08-01 14:36:24 +00:00
Siddharth Bhat	edf9581e4c	[PPCGCodeGeneration] Correct usage of llvm::Value with getLatestValue. It is possible that the `HostPtr` that corresponds to an array could be invariant load hoisted. Make sure we use the invariant load hoisted value by using `IslNodeBuilder::getLatestValue`. Differential Revision: https://reviews.llvm.org/D36001 llvm-svn: 309681	2017-08-01 14:26:39 +00:00
Siddharth Bhat	f2cfd2a4db	[NFC] [IslNodeBuilder, GPUNodeBuilder] Unify mechanism for looking up replacement Values. We populate `IslNodeBuilder::ValueMap` which contains replacements for `llvm::Value`s. There was no simple method to pick up a replacement if it exists, otherwise fall back to the original. Create a method `IslNodeBuilder::getLatestValue` which provides this functionality. This will be used in a later patch to fix bugs in `PPCGCodeGeneration` where the latest value is not being used. Differential Revision: https://reviews.llvm.org/D36000 llvm-svn: 309674	2017-08-01 12:15:51 +00:00
Siddharth Bhat	4d5820d171	[NFC] [PPCGCodeGeneration] Convert GPUNodeBuilder::getGridSizes to isl++. llvm-svn: 309671	2017-08-01 10:45:41 +00:00
Siddharth Bhat	ccbf4b509c	[NFC] [PPCGCodeGeneration] Convert GPUNodeBuilder::getArrayOffset to isl++. llvm-svn: 309669	2017-08-01 09:58:55 +00:00
Michael Kruse	9f6e41cdba	[ForwardOpTree] Support synthesizable values. This allows -polly-optree to move instructions that depend on synthesizable values. The difficulty for synthesizable values is that their value depends on the location. When it is moved over a loop header, and the SCEV expression depends on the loop induction variable (SCEVAddRecExpr), it would use the current induction variable instead of the last one. At the moment we cannot forward PHI nodes such that crossing the header of loops referenced by SCEVAddRecExpr is not possible (assuming the loop header has at least two incoming blocks: for entering the loop and the backedge, such that any instruction to be forwarded must have a phi between use and definition). A remaining issue is when the forwarded value is used after the loop, but is only synthesizable inside the loop. This happens e.g. if ScalarEvolution is unable to determine the number of loop iterations or the initial loop value. We do not forward in this situation. Differential Revision: https://reviews.llvm.org/D36102 llvm-svn: 309609	2017-07-31 19:46:21 +00:00
Michael Kruse	57cc92b790	[Simplify] Remove all kinds of redundant scalar writes. In addition to array and PHI writes, also allow scalar value writes. The only kind of write not allowed are writes by functions (including memcpy/memmove/memset). llvm-svn: 309582	2017-07-31 17:04:55 +00:00
Tobias Grosser	8fc6cdfb1c	[GPGPU] Add support for NVIDIA libdevice Summary: This allows us to map functions such as exp, expf, expl, for which no LLVM intrinsics exist. Instead, we link to NVIDIA's libdevice which provides high-performance implementations of a wide range of (math) functions. We currently link only a small subset, the exp, cos and copysign functions. Other functions will be enabled as needed. Reviewers: bollu, singam-sanjay Reviewed By: bollu Subscribers: tstellar, tra, nemanjai, pollydev, mgorny, llvm-commits, kbarton Tags: #polly Differential Revision: https://reviews.llvm.org/D35703 llvm-svn: 309560	2017-07-31 14:03:16 +00:00
Tobias Grosser	39977e4e76	Revert "Remove Debug metadata from copied instruction to prevent Module verification failure" This reverts commit r309490 as it triggers on our AOSP buildbot error messages of the form: inlinable function call in a function with debug info must have a !dbg location llvm-svn: 309556	2017-07-31 11:43:38 +00:00
Tobias Grosser	7639db8ed9	[IslNodeBuilder] Remove unused instruction Suggested-by: Maximilian Falkenstein <falkensm@student.ethz.ch> llvm-svn: 309533	2017-07-31 01:59:23 +00:00
Singapuram Sanjay Srivallabh	cf9a813368	Remove Debug metadata from copied instruction to prevent Module verification failure Summary: Remove debug metadata from instruction to be copied to prevent the source file's debug metadata being copied into GPUModule and eventually failing Module verification and ASM string codegeneration. When copying the instruction onto the Module meant for the GPU, debug metadata attached to an instruction causes all related metadata to be pulled into the Module, including the DICompileUnit, which is not listed in llvm.dbg.cu of the Module. This fails the verification of the Module and generation of the ASM string. The only debug metadata of the instruction, the DebugLoc, is unset by this patch. Reviewers: grosser, bollu, Meinersbur Reviewed By: grosser, bollu Subscribers: pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D35630 llvm-svn: 309490	2017-07-29 18:03:49 +00:00
Michael Kruse	ce9617f4fe	[Simplify] Implement write accesses coalescing. Write coalescing combines write accesses that - Write the same llvm::Value. - Write to the same array. - Unless they do not write anything in a statement instance (partial writes), write to the same element. - There is no other access between them that accesses the same element. This is particularly useful after DeLICM, which leaves partial writes to disjoint domains. Differential Revision: https://reviews.llvm.org/D36010 llvm-svn: 309489	2017-07-29 16:21:16 +00:00
Michael Kruse	4335c3992a	[test] Add test case for -polly-simplify. NFC. llvm-svn: 309458	2017-07-29 00:06:06 +00:00
Michael Kruse	8e41d2baab	[Simplify] Do not remove dependencies of phis within region stmts. These were wrongly assumed to be phi nodes that require MemoryKind::PHI accesses. llvm-svn: 309454	2017-07-28 23:22:32 +00:00
Michael Kruse	fd7f40961b	[VirtualInstruction] Do not iterate over a region statement's instruction list. NFC. It should be empty anyways. In this case it would even be redundant because we just add all instructions in region statements. llvm-svn: 309453	2017-07-28 23:22:23 +00:00
Adrian Prantl	99c4a5fb8e	Remove offset parameter from llvm.dbg.value intrinsics in testcase llvm-svn: 309433	2017-07-28 21:08:53 +00:00
Michael Kruse	0137d80ad4	[VirtualInstruction] Remove assertion. NFC. ScopStmt::contains is currently implemented on the basis of BasicBlock and does not take the instruction list into account. Therefore any instruction copied by -polly-optree into another statement currently triggers that assertion. Remove that assertion for now. We might re-enable it when the implementation of ScopStmt::contains changes. llvm-svn: 309421	2017-07-28 19:26:24 +00:00
Michael Kruse	c99209b4b2	[test] Fix typo in filename. NFC. llvm-svn: 309403	2017-07-28 16:57:56 +00:00
Michael Kruse	6c8f91b908	[Simplify] Fix typo in statistics output. NFC. llvm-svn: 309402	2017-07-28 16:57:51 +00:00
Michael Kruse	34a77780c5	[Simplify] Remove empty partial accesses first. NFC. So follow-up cleanups do not need special handling for such accesses. llvm-svn: 309401	2017-07-28 16:57:45 +00:00
Siddharth Bhat	4ebeb3568a	[PPCGCodeGeneration] Check that invariant load hoisting succeeded. If we fail, throw an error for now. We can gracefully handle this later. llvm-svn: 309387	2017-07-28 14:48:32 +00:00
Siddharth Bhat	0a1177b58e	[ScopDetect] add `-polly-ignore-func` flag to ignore functions by name. Ignore all functions whose names match a regex. Useful because creating a regex that does not match a string is somewhat hard. Example: https://stackoverflow.com/questions/1240275/how-to-negate-specific-word-in-regex llvm-svn: 309377	2017-07-28 11:47:24 +00:00
Tobias Grosser	c0678c016f	Add missing namespace comment llvm-svn: 309373	2017-07-28 09:33:06 +00:00
Tobias Grosser	25271b91b2	[GPGPU] Do not require the Scop::Context to have information about all parameters llvm-svn: 309368	2017-07-28 06:49:44 +00:00
Tobias Grosser	30caae6d23	[GPGPU] Fix compilation issue with latest CUDA upgrade to i128 llvm-svn: 309366	2017-07-28 06:38:49 +00:00
Hans Wennborg	ce99589225	Tiny docs fix llvm-svn: 309300	2017-07-27 18:14:00 +00:00
Tobias Grosser	adcbee5433	Update isl to isl-0.18-800-g4018f45 This fixes a bug in isl_flow where triggering the compute out could result in undefined or unexpected behavior. This fixes some recent regressions we saw in the android buildbots. Thanks Eli Friedman for reducing the corresponding test cases. llvm-svn: 309274	2017-07-27 14:48:02 +00:00
Michael Kruse	a508a4e619	[ScopBuilder/Simplify] Refactor isEscaping. NFC. ScopBuilder and Simplify (through VirtualInstruction.cpp) previously used this functionality in their own implementation. Refactor them both into a common one in the Scop class. BlockGenerator also makes use of a similar functionality, but also records outside users and takes place after region simplification. Merging it as well would be more complicated. llvm-svn: 309273	2017-07-27 14:39:52 +00:00
Michael Kruse	8a8aca4299	[Simplify] Count PHINodes in simplifiable exit nodes as escaping use. After region exit simplification, the incoming block of a phi node in the SCoP region's exit block lands outside of the region. Since we treat SCoPs as if this already happened, we need to account for that when looking for outside uses of scalars (i.e. escaping scalars). llvm-svn: 309271	2017-07-27 14:09:31 +00:00
Michael Kruse	eca86cee64	[ScopInfo] Never print instruction list of region stmts. A region statement's instruction list is always empty and ignored by the code generator. Don't give the impression that it means anything. llvm-svn: 309197	2017-07-26 22:01:33 +00:00
Michael Kruse	cedd7a74e1	[Simplify] Do not setInstructions() of region stmts. NFC. The instruction list is ignored for region statements, there is no reason to set it. llvm-svn: 309196	2017-07-26 22:01:28 +00:00
Michael Kruse	95b39da8ae	[Simplify] Fix invalid removal write for escaping values. A PHI node's incoming block is the user of its operand, not the PHI's parent. Assuming the PHINode's parent being the user led to the removal of a MemoryAccess because its use was assumed to be inside of the SCoP. llvm-svn: 309164	2017-07-26 19:58:15 +00:00
Roman Gareev	2e580538be	[ScheduleOptimizer] Translate to C++ bindings Translate the ScheduleOptimizer to use the new isl C++ bindings. Reviewed-by: Michael Kruse <llvm@meinersbur.de> Differential Revision: https://reviews.llvm.org/D35845 llvm-svn: 309119	2017-07-26 14:59:15 +00:00
Michael Kruse	1df1aac014	[ScopInfo] Avoid use of getStmtFor(BB). NFC. Since there will be no more a 1:1 correspondence between statements and basic blocks, we would like to get rid of the method getStmtFor(BB) and its uses. Here we remove one of its uses in ScopInfo by fetching the statement in which the call instruction lies. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35691 llvm-svn: 309110	2017-07-26 13:25:28 +00:00
Michael Kruse	11ed062258	[SCEVValidator] Loop exit values of loops before the SCoP are synthesizable. In the following loop: int i; for (i = 0; i < func(); i+=1) ; SCoP: for (int j = 0; j<n; j+=1) S(i, j) The value i is synthesizable in the SCoP that includes only the j-loop. This is because i is fixed within the SCoP, it is irrelevant whether it originates from another loop. This fixes a strange case where a PHI was synthesizable in a SCoP, but not its incoming value, triggering an assertion. This should fix MultiSource/Applications/sgefa/sgefa of the perf-x86_64-penryn-O3-polly-before-vectorizer-unprofitable buildbot. llvm-svn: 309109	2017-07-26 13:05:45 +00:00
Tobias Grosser	9ddcf8e6ac	Revert accidental isl changes in 308923 It seems I still had some incomplete changes in the tree when committing. In general, we only import changes from isl upstream. In this case, the changes were especially unfortunate, as they broke the error management in isl_flow.c and consequently caused regressions. Thanks to Michael Kruse for spotting this mistake. llvm-svn: 309039	2017-07-25 22:15:47 +00:00
Michael Kruse	8d89179e33	[ScopInfo] Rename ScopStmt::contains(BB) to represents(BB). NFC. In future, there will be no more a 1:1 correspondence between statements and basic blocks, the name `contains` does not correctly capture their relationship. A BB may in fact comprise multiple statements; hence we describe a statement 'representing' a basic block. Differential Revision: https://reviews.llvm.org/D35838 llvm-svn: 308982	2017-07-25 16:25:37 +00:00
Philip Pfaffe	85cc5687df	[IslAst] Untangle IslAst lit-testcases from specifics of the legacy-PM Summary: This consists of instances of two changes: - Accept any order of checks for a specific loop form, that appear in different order in the new vs legacy-PM. - Remove checks for specific regions. Reviewers: grosser Reviewed By: grosser Subscribers: pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D35837 llvm-svn: 308976	2017-07-25 15:07:42 +00:00
Michael Kruse	b6317007b4	[ScopInfo] Fix assertion for PHIs not in a region stmts entry. A PHI node within a region statement is legal, but does not have a MemoryKind::PHI access. llvm-svn: 308973	2017-07-25 13:28:39 +00:00
Siddharth Bhat	43f178bbc9	[PPCGCodeGeneration] Skip arrays with empty extent. Invariant load hoisted scalars, and arrays whose size we can statically compute to be 0 do not need to be allocated as arrays. Invariant load hoisted scalars are sent to the kernel directly as parameters. Earlier, we used to allocate `0` bytes of memory for these because our computation of size from `PPCGCodeGeneration::getArraySize` would result in `0`. Now, since we don't allocate invariant loads as arrays in PPCGCodeGeneration, this problem does not occur anymore. Differential Revision: https://reviews.llvm.org/D35795 llvm-svn: 308971	2017-07-25 12:35:36 +00:00
Tobias Grosser	d7065e5df5	Move MemoryAccess::isStride* to isl++ llvm-svn: 308927	2017-07-24 20:50:22 +00:00
Tobias Grosser	b739cb42f5	Move MemoryAccess::InvalidDomain to isl++ llvm-svn: 308923	2017-07-24 20:30:34 +00:00
Tobias Grosser	cdf471baef	Move MemoryAccess::getPwAff to isl++ llvm-svn: 308895	2017-07-24 16:36:34 +00:00
Tobias Grosser	1f6ba7e238	Move MemoryAccess::MemoryAccess to isl++ llvm-svn: 308893	2017-07-24 16:22:32 +00:00
Tobias Grosser	206e9e3b3b	Move ScopArrayInfo::getFromAccessFunction and getFromId to isl++ llvm-svn: 308892	2017-07-24 16:22:27 +00:00
Michael Kruse	54071126d8	[ForwardOpTree] Properly indent enumeration in comment. NFC. llvm-svn: 308887	2017-07-24 15:34:03 +00:00
Michael Kruse	67752076bc	[ForwardOpTree] Rename FD_CanForward to FD_CanForwardLeaf. NFC. To make the meaning and distinction to FD_CanForwardTree clearer. llvm-svn: 308886	2017-07-24 15:33:58 +00:00
Michael Kruse	d85e345ce0	[ForwardOpTree] Add comments to ForwardingDecision items. NFC. In particular, explain the difference between FD_CanForward and FD_CanForwardTree. llvm-svn: 308885	2017-07-24 15:33:53 +00:00
Michael Kruse	07e8c36dc7	[ForwardOpTree] Support read-only value uses. Read-only values (values defined before the SCoP) require special handling with -polly-analyze-read-only-scalars=true (which is the default). If active, each use of a value requires a read access. When a copied value uses a read-only value, we must also ensure that such a MemoryAccess is available or is created. Differential Revision: https://reviews.llvm.org/D35764 llvm-svn: 308876	2017-07-24 12:43:27 +00:00
Siddharth Bhat	e2699b572e	[Polly] [NFC] [ScopDetection] Make `polly-only-func` perform regex scop name match. Summary: - We were using `.count` in `StringRef`, which matches substrings. - We may want to use this for equality as well. - Generalise this, so allow regexes as a parameter to `polly-only-func`. Differential Revision: https://reviews.llvm.org/D35728 llvm-svn: 308875	2017-07-24 12:40:52 +00:00
Michael Kruse	5b8a9095e8	[ForwardOpTree] Fix mixup in comment. NFC. The cases DoIt==false and DoIt==true were mixed up. Thanks to Siddharth for noticing. llvm-svn: 308874	2017-07-24 12:39:46 +00:00
Michael Kruse	25a688165b	[ScopInfo] Fix typo in method name. NFC. prependInstrunction -> prependInstruction Thanks Nandini for noticing. llvm-svn: 308873	2017-07-24 12:39:41 +00:00
Siddharth Bhat	f7face4bc4	Convert GPUNodeBuilder::getArraySize to islcpp. Note: PPCGCodeGeneration::pollyBuildAstExprForStmt is at https://reviews.llvm.org/D35770 Differential Revision: https://reviews.llvm.org/D35771 llvm-svn: 308870	2017-07-24 09:08:21 +00:00
Siddharth Bhat	35de900917	[NFC] Move PPCGCodeGeneration::pollyBuildAstExprForStmt to isl++. Differential Revision: https://reviews.llvm.org/D35771 llvm-svn: 308869	2017-07-24 08:34:24 +00:00
Tobias Grosser	325812ac6d	Simplify: Adopt for translation of MemoryAccess::getAccessRelation For some reason this one was missed earlier. llvm-svn: 308845	2017-07-23 08:15:28 +00:00
Tobias Grosser	1959dbda75	Move MemoryAccess::get*ArrayId to isl++ llvm-svn: 308843	2017-07-23 04:08:59 +00:00
Tobias Grosser	3b196131b5	Move applyScheduleToAccessRelation to isl++ llvm-svn: 308842	2017-07-23 04:08:52 +00:00
Tobias Grosser	6a87036e0f	Move MemoryAccess::getAddressFunction to isl++ llvm-svn: 308841	2017-07-23 04:08:45 +00:00
Tobias Grosser	1515f6b937	Move MemoryAccess::NewAccessRelation to isl++ We also move related accessor functions llvm-svn: 308840	2017-07-23 04:08:38 +00:00
Tobias Grosser	22da5f087a	Move MemoryAccess::getOriginalAccessRelation to isl++ llvm-svn: 308839	2017-07-23 04:08:27 +00:00
Tobias Grosser	0c4c2eef75	Move MemoryAccess::AccessRelation to isl++ llvm-svn: 308838	2017-07-23 04:08:22 +00:00
Tobias Grosser	b6e7a85a6d	Move MemoryAccess::createBasicAccessMap to isl++ llvm-svn: 308837	2017-07-23 04:08:17 +00:00
Tobias Grosser	fe46c3ff3a	Move MemoryAccess::id to isl++ llvm-svn: 308836	2017-07-23 04:08:11 +00:00
Michael Kruse	ab8f0d57df	[Simplify] Remove partial write accesses with empty domain. If the access relation's domain is empty, the access will never be executed. We can just remove it. We only remove write accesses. Partial read accesses are not yet supported and instructions in the statement might require the llvm::Value holding the read's result to be defined. llvm-svn: 308830	2017-07-22 20:33:09 +00:00
Michael Kruse	e52ebd1ae4	[ScopInfo] Adapt indentation of instruction list printing. Change the indention of the last brace to align with the opening line. Before: Instructions { %val = fadd double %arg, 2.100000e+01 store double %val, double* %A } After: Instructions { %val = fadd double %arg, 2.100000e+01 store double %val, double* %A } llvm-svn: 308828	2017-07-22 16:44:39 +00:00
Michael Kruse	e5f4706a55	[ForwardOpTree] Support hoisted invariant loads. Hoisted loads can be trivially supported because there are no MemoryAccess to be modified, the loaded value is just available at code generation. llvm-svn: 308826	2017-07-22 14:30:02 +00:00
Michael Kruse	a6b2de3b59	[ForwardOpTree] Introduce the -polly-optree pass. This pass 'forwards' operand trees into statements that use them in order to avoid scalar dependencies. This minimal implementation handles only the case of speculatable instructions. We will successively add support for: - Hoisted loads - Read-only values - Synthesizable values - Loads - PHIs - Forwarding only parts of the tree Differential Revision: https://reviews.llvm.org/D35754 llvm-svn: 308825	2017-07-22 14:02:47 +00:00
Tobias Grosser	77eef90f50	Move ScopArrayInfo to isl++ This moves the full ScopArrayInfo class to isl++ llvm-svn: 308801	2017-07-21 23:07:56 +00:00
Philip Pfaffe	8f6c48e2aa	Untangle ScopInfo lit-testcases from specifics of the legacy-PM Summary: For the ScopInfo lit testsuite, this patch removes some dependences on output behaviour of the legacy PM. In most cases, these tests checked the tool output for labels created by the pass printer in the legacy PM. This doesn't work for the new PM anymore. Untangling the testcases is the first step to porting the testsuite for the new PM infrastructure. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: llvm-commits, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D35727 llvm-svn: 308754	2017-07-21 16:47:36 +00:00
Philipp Schaad	2f3073b5cb	[Polly][GPGPU] Added SPIR Code Generation and Corresponding Runtime Support for Intel Summary: Added SPIR Code Generation to the PPCG Code Generator. This can be invoked using the polly-gpu-arch flag value 'spir32' or 'spir64' for 32 and 64 bit code respectively. In addition to that, runtime support has been added to execute said SPIR code on Intel GPU's, where the system is equipped with Intel's open source driver Beignet (development version). This requires the cmake flag 'USE_INTEL_OCL' to be turned on, and the polly-gpu-runtime flag value to be 'libopencl'. The transformation of LLVM IR to SPIR is currently quite a hack, consisting in part of regex string transformations. Has been tested (working) with Polybench 3.2 on an Intel i7-5500U (integrated graphics chip). Reviewers: bollu, grosser, Meinersbur, singam-sanjay Reviewed By: grosser, singam-sanjay Subscribers: pollydev, nemanjai, mgorny, Anastasia, kbarton Tags: #polly Differential Revision: https://reviews.llvm.org/D35185 llvm-svn: 308751	2017-07-21 16:11:06 +00:00
Michael Kruse	e186013149	Annotate dump() functions with LLVM_DUMP_METHOD. NFC. llvm-svn: 308749	2017-07-21 15:54:13 +00:00
Michael Kruse	5d5184698d	[ScopInfo] Don't compile dump() functions into non-assert builds. NFC. This follows a convention used in LLVM. llvm-svn: 308748	2017-07-21 15:54:07 +00:00
Michael Kruse	cd4c977b8b	[ScopInfo] Print instructions in dump(). Print a statement's instruction on dump() regardless of -polly-print-instructions. dump() is supposed to be used in the debugger only and never in regression tests. While debugging, get all the information we have and we are not bound to break anything. For non-dump purposes of print, forward the setting of -polly-print-instructions as parameters. Some calls to print() had to be changed because the PollyPrintInstructions setting is only available in ScopInfo.cpp. In ScheduleOptimizer.cpp, dump() was used in regression tests. That's not what dump() is for. The print parameter "PrintInstructions" will also be useful for an explicit print SCoP pass in a future patch. llvm-svn: 308746	2017-07-21 15:35:53 +00:00
Siddharth Bhat	06d4ed6787	[NFC] [RegisterPasses] Fix typo: To early -> too early. llvm-svn: 308743	2017-07-21 15:12:03 +00:00
Siddharth Bhat	a0fb8b23e1	[NFC] [PPCGCodeGeneration] Print `verifyModule` failure to debug stream. If verifyModule fails, it is helpful to know why it failed. Add a log to the debug stream that prints the failure. llvm-svn: 308727	2017-07-21 11:21:44 +00:00
Tobias Grosser	018103d34e	Fix typo in function name Bllock -> Block llvm-svn: 308715	2017-07-21 06:00:38 +00:00
Tobias Grosser	1eeedf4829	[IslNodeBuilder] Relax complexity check in invariant loads and run it early When performing invariant load hoisting we check that invariant load expressions are not too complex. Up to this commit, we performed this check by counting the sum of dimensions in the access range as a very simple heuristic. This heuristic is a little too conservative, as it prevents hoisting for any scops with a very large number of parameters. Hence, we update the heuristic to only count existentially quantified dimensions and set dimensions. We expect this to still detect the problematic expressions in h264 because of which this check was originally introduced. For some unknown reason, this complexity check was originally committed in IslNodeBuilder. It really belongs in ScopInfo, as there is no point in optimizing a program which we could have known earlier cannot be code generated. The benefit of running the check early is that we can avoid to even hoist checks that are expensive to code generate as invariant loads. This can be seen in the changed tests, where we now indeed detect the scop, but just not invariant load hoist the complicated access. We also improve the formatting of the code, document it, and use isl++ to simplify expressions. llvm-svn: 308659	2017-07-20 19:55:19 +00:00
Tobias Grosser	54491db687	Support fabs and copysign in Polly-ACC llvm-svn: 308649	2017-07-20 18:26:34 +00:00
Michael Kruse	b936c4b332	[PPCG] Compile fix for MSVC. Visual Studio, even the 2017 version, does not support C99 VLAs. For VLA parameters, the length of the outermost dimension is not required anyway, so remove it. llvm-svn: 308643	2017-07-20 18:04:54 +00:00
Michael Kruse	1ce6791e7e	[ScopInfo] Get a list of statements for a region node. NFC. When constructing a schedule tree and there are multiple statements for a basic block, create a sequence node for these statements. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35679 llvm-svn: 308635	2017-07-20 17:18:58 +00:00
Michael Kruse	6eba4b1031	[ScopInfo] Remove dependency of Scop::getLastStmtFor(BB) on getStmtFor(BB). NFC. We are working towards removing uses of Scop::getStmtFor(BB). In this patch, we remove dependency of Scop::getLastStmtFor(BB) on getStmtFor(BB). To do so, we get the list of all statements corresponding to the BB and then fetch the last one. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35665 llvm-svn: 308633	2017-07-20 17:08:50 +00:00
Michael Kruse	c4d8f9b11a	Fix indention in comment. NFC. llvm-svn: 308632	2017-07-20 16:52:22 +00:00
Michael Kruse	3562f272cf	[ScopInfo] Use map for lookupPHIReadOf. NFC. Introduce previously missing PHIReads analogous to the already existing PHIWrites/ValueWrites/ValueReads maps. PHIReads was initially not required and the later introduced lookupPHIReadOf() used a linear search instead. With PHIReads, lookupPHIReadOf() can now also do a map lookup and remove any surprising performance/behaviour differences to lookupPHIWriteOf(), lookupValueWriteOf() and lookupValueReadOf(). llvm-svn: 308630	2017-07-20 16:47:57 +00:00
Michael Kruse	22058c3fbb	[Simplify] Remove unused instructions and accesses. Use a mark-and-sweep algorithm to find and remove unused instructions and MemoryAccesses. This is useful in particular to remove scalar writes that are never used anywhere. A scalar write in a loop induces a write-after-write dependency that prevents the loop iterations from being rescheduled. Such writes can be a result of previous transformations such as DeLICM and operand tree forwarding. It adds a new class VirtualInstruction that represents an instruction in a particular statement. At the moment an instruction can only belong to the statement that represents a BasicBlock. In the future, instructions can be in one of multiple statements representing a BasicBlock (Nandini's work), in different statements than its BasicBlock would indicate, and even multiple statements at once (by forwarding operand trees). It also integrates nicely with the VirtualUse class. ScopStmt::contains(Instruction*) currently uses the instruction's parent BasicBlock to check whether it contains the instruction. It will need to check the actual statement list when one of the aforementioned features becomes possible. Differential Revision: https://reviews.llvm.org/D35656 llvm-svn: 308626	2017-07-20 16:21:55 +00:00
Siddharth Bhat	9e3db2b756	[PPCGCodeGen] [3/3] Update PPCGCodeGen + tests to latest ppcg. This commit WILL COMPILE. 1. `PPCG` now uses `isl_multi_pw_aff` instead of an array of `pw_aff`. This needs us to adjust how we index array bounds and how we construct array bounds. 2. `PPCG` introduces two new kinds of nodes: `init_device` and `clear_device`. We should investigate what the correct way to handle these are. 3. `PPCG` has gotten smarter with its use of live range reordering, so some of the tests have a qualitative improvement. 4. `PPCG` changed its output style, so many test cases need to be updated to fit the new style for `polly-acc-dump-code` checks. Differential Revision: https://reviews.llvm.org/D35677 llvm-svn: 308625	2017-07-20 15:48:36 +00:00
Siddharth Bhat	3d4d752188	[PPCG] [2/3] Make polly specific PPCG Changes. - This commit WILL NOT COMPILE. `PPCGCodeGeneration` requires changes since some of PPCG's internal data structures have been modified. - Has polly-specific changes to PPCG. Polly exports certain functionality that is private to PPCG. It also creates stubs for large parts of the pet API as well as other functions in `ppcg/external.c` to keep the linker happy. - This commit includes changes to CMakeLists.txt. Differential Revision: https://reviews.llvm.org/D35676 llvm-svn: 308624	2017-07-20 15:48:22 +00:00
Siddharth Bhat	951515f236	[PPCG] [1/3] Bump up PPCG version to 0.07. - This commit WILL NOT COMPILE, as it checks in vanilla PPCG 0.07 - We choose to introduce this commit into the history to cleanly display the Polly-specific changes made to PPCG. Differential Revision: https://reviews.llvm.org/D35675 llvm-svn: 308623	2017-07-20 15:48:13 +00:00
Michael Kruse	4642c3ce85	[ScopBuilder] Avoid use of getStmtFor(BB). NFC. Since there will be no more a 1:1 correspondence between statements and basic blocks, we would like to get rid of the method getStmtFor(BB) and its uses. Here we remove one of its uses in ScopBuilder by fetching the statement in which the instruction lies. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35610 llvm-svn: 308610	2017-07-20 12:47:09 +00:00
Michael Kruse	0865585eab	[ScopInfo] Add support for wrap-around of integers in unsigned comparisons. This is one possible solution to implement wrap-arounds for integers in unsigned icmp operations. For example, store i32 -1, i32* %A_addr %0 = load i32, i32* %A_addr %1 = icmp ult i32 %0, 0 %1 should hold false, because under the assumption of unsigned integers, -1 should wrap around to 2^32-1. However, previously, it was assumed that the MSB (Most Significant Bit - aka the Sign bit) was never set for integers in unsigned operations. This patch modifies the buildConditionSets function in ScopInfo.cpp to give better information about the integers in these unsigned comparisons. Contributed-by: Annanay Agarwal <cs14btech11001@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35464 llvm-svn: 308608	2017-07-20 12:37:02 +00:00
Michael Kruse	89da6bbcb4	Make byref llvm::Use parameters const. NFC. llvm-svn: 308522	2017-07-19 20:41:56 +00:00
Philip Pfaffe	17b1ecfdc5	[CMake] Fix r307650: Readd missing dependency. The commit erroneously removed the dependency of the Polly tests on things like opt and FileCheck. Add that dependency back. llvm-svn: 308512	2017-07-19 19:20:58 +00:00
Eli Friedman	84c73fd9fc	[docs] Minor formatting fixes. Patch by Tarun Rajendran. llvm-svn: 308505	2017-07-19 18:18:37 +00:00
Roman Gareev	5abea0c97a	[FIX] Update test/ScheduleOptimizer/pattern-matching-based-opts_11.ll. llvm-svn: 308501	2017-07-19 18:01:51 +00:00
Roman Gareev	6531df41ae	[FIX] Fix pattern-matching-based-opts_11.ll. llvm-svn: 308499	2017-07-19 17:33:42 +00:00
Michael Kruse	8b8058072f	[ScopInfo] Integrate ScalarDefUseChain into polly::Scop. NFC. Before this patch, ScalarDefUseChain was a tool used by DeLICM to find all reads and writes of scalar accesses. It iterated once over all accesses and stores the accesses into maps. By integrating it into the Scop class, we can keep the maps up-to-date without the need for recomputing them. It will be needed for more than DeLICM in the future, such as SCoP simplification, code movement between virtual statements, and array expansion (GSoC project). Compared to ScalarUseDefChain, we save two maps by finding the ScopStmt a Def/PHIRead must reside in, and use its already existing lookup function to find the MemoryAccess. Differential Revision: https://reviews.llvm.org/D35631 llvm-svn: 308495	2017-07-19 17:11:25 +00:00
Roman Gareev	750374181b	Make the pattern matching work with modified memory accesses Some optimizations (e.g., DeLICM) can modify memory accesses (e.g., change their MemoryKind). Consequently, the pattern matching should take it into the account. Reviewed-by: Tobias Grosser <tobias@grosser.es>, Michael Kruse <llvm@meinersbur.de> Differential Revision: https://reviews.llvm.org/D33138 llvm-svn: 308494	2017-07-19 16:59:06 +00:00
Tobias Grosser	199ec4af40	[ScopInfo] Do not create entries in map if none exists Suggested-by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 308491	2017-07-19 16:31:10 +00:00
Hans Wennborg	5a1606d562	Clear release notes for 6.0.0 llvm-svn: 308477	2017-07-19 14:11:00 +00:00
Michael Kruse	629f9185bf	[Simplify] Ensure all counters are reset before next SCoP is processed. NFC. llvm-svn: 308473	2017-07-19 14:07:21 +00:00
Hans Wennborg	d3ae835c9e	Bump docs version to 6.0 llvm-svn: 308464	2017-07-19 13:48:55 +00:00
Tobias Grosser	37eadd55f7	[ScopInfo] Use AnyPHINode in tryGetValueStored() This was an oversight in the previous commit. Michael already suggested this beforehand. llvm-svn: 308440	2017-07-19 11:30:18 +00:00
Michael Kruse	bb7d22a31a	[Test] Do not pipe binary data to FileCheck. llvm-svn: 308437	2017-07-19 11:12:16 +00:00
Tobias Grosser	303bd07c6e	[ScopInfo] Introduce tryGetValueStored Summary: This makes code more readable and allows to reuse this functionality in the future at other places. Suggested-by Michael Kruse in post-commit review of r307660. Reviewers: Meinersbur, bollu, gareevroman, efriedma, huihuiz, sebpop, simbuerg Reviewed By: Meinersbur Subscribers: pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D35585 llvm-svn: 308435	2017-07-19 11:09:16 +00:00
Tobias Grosser	5c88f00765	[Polly][docs][Release Notes] Adding Information about Remarks to Release Notes and Documentation Summary: Based off of D35399 Reviewers: pollydev, llvm-commits, bollu, grosser Reviewed By: grosser Tags: #polly Contributed-by: Tarun Rajendran Differential Revision: https://reviews.llvm.org/D35596 llvm-svn: 308403	2017-07-19 01:16:55 +00:00
Michael Kruse	4dfa732750	[ScopInfo] Introduce list of statements in Scop::StmtMap. NFC. Once statements are split, a BasicBlock will comprise multiple statements. To prepare for this change in future, we introduce a list of statements in the statement map. Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35301 llvm-svn: 308318	2017-07-18 15:41:49 +00:00
Michael Kruse	6123cdce39	[CMake] FindJsoncpp.cmake: Use descriptive variable name for libjsoncpp.so path. find_library(lib) stores the result in the variable "lib", which is also cached in CMakeCache.txt. This could theoretically conflict if jsoncpp required two libraries, which each would get cached as "lib". Use a more descriptive and disambiguative "jsoncpp_${libname}" for that. llvm-svn: 308289	2017-07-18 10:10:02 +00:00
Michael Kruse	50d7b290c3	[CMake] FindJsoncpp.cmake: Use foreach variable. Use ${libname} instead of ${lib}. By a coincidence, this worked because ${lib} is also the variable used for finding the libjsoncpp.so full path. llvm-svn: 308288	2017-07-18 10:09:58 +00:00
Michael Kruse	8ab6869447	[CMake] FindJsoncpp.cmake: search pkg-config libs in default search paths. pkg_search_module(JSONCPP) should set JSONCPP_LIBDIR/JSONCPP_LIBRARY_DIRS to where the libjsoncpp.so can be found. However, on Ubuntu 14.04 LTS (Trusty Tahr) it returns /usr/lib while the libjsoncpp library can be found at /usr/lib/x86_64-linux-gnu/libjsoncpp.so. Thus, while searching for the full path of the jsoncpp library, it is not found. JSONCPP_LIBDIR is correctly set to /usr/lib/x86_64-linux-gnu on e.g., Ubuntu 16.04 LTS (Xenial Xerus). Fix by removing the NO_DEFAULT_PATH flag, in order to search the system default paths even if the library is not found in JSONCPP_LIBDIR/JSONCPP_LIBRARY_DIRS. This fixes bug llvm.org/PR33798. llvm-svn: 308287	2017-07-18 10:09:53 +00:00
Siddharth Bhat	edfef5ae8e	[NFC] [PPCGCodeGeneration] cleanup kills related code. We extended kills in Polly to handle both `phi` nodes and scalars that are not used within the Scop. Update the comments and choice of variable names to reflect this. llvm-svn: 308279	2017-07-18 09:15:16 +00:00
Eli Friedman	e737fc120e	[Polly] [OptDiag] Updating Polly Diagnostics Remarks Utilizing newer LLVM diagnostic remark API in order to enable use of opt-viewer tool. Polly Diagnostic Remarks also now appear in YAML remark file. In this patch, I've added the OptimizationRemarkEmitter into certain classes where remarks are being emitted and updated the remark emit calls themselves. I also provide each remark a BasicBlock or Instruction from where it is being called, in order to compute the hotness of the remark. Patch by Tarun Rajendran! Differential Revision: https://reviews.llvm.org/D35399 llvm-svn: 308233	2017-07-17 23:58:33 +00:00
Tobias Grosser	66e38a84be	[Polly] Avoid use of `getStmtFor(BB)` in PolyhedralInfo. NFC Summary: Since there will be no more a 1-1 correspondence between statements and basic block, we would like to get rid of the method `getStmtFor(BB)` and its uses. Here we remove one of its uses in PolyhedralInfo, as suggested by Michael Sir. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D35300 llvm-svn: 308220	2017-07-17 20:58:13 +00:00
Tobias Grosser	4556c9b8fe	[ScopInfo] Simplify new access functions under domain context Summary: We do not keep domain constraints on access functions when building the scop. Hence, for consistency reasons, it makes also sense to not include them when storing a new access function. This change results in simpler access functions that make output easier to read. This patch also helps to make DeLICMed memory accesses to be understood by our matrix multiplication pattern matching pass. Further changes to the matrix multiplication pattern matching are needed for this to work, so the corresponding test case will be added in a future commit. Reviewers: Meinersbur, bollu, gareevroman, efriedma, huihuiz, sebpop, simbuerg Subscribers: pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D35237 llvm-svn: 308215	2017-07-17 20:47:10 +00:00
Siddharth Bhat	233d717ec1	[PPCGCodeGeneration] Generate invariant loads before trying to generate IR. - We should call `preloadInvariantLoads` to make sure that code is generated for invariant loads in the kernel. Differential Revision: https://reviews.llvm.org/D35410 llvm-svn: 308187	2017-07-17 15:57:01 +00:00
Tobias Grosser	21cbcf03d3	ScopInfo: Remove not-in-DomainMap statements in separate function This separates ScopBuilder internal and ScopBuilder external functionality. llvm-svn: 308152	2017-07-16 23:55:38 +00:00
Tobias Grosser	3012a0b302	Fix typo in comment [NFC] llvm-svn: 308149	2017-07-16 22:44:17 +00:00
Tobias Grosser	8e1280b8b2	[Polly] Fix a typo [NFC] Reviewers: grosser, Meinersbur, bollu Tags: #polly Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35459 llvm-svn: 308134	2017-07-16 13:54:41 +00:00
Tobias Grosser	a3aa423fc3	[ScopDetection] If a loop is not part of a scop, none of its backedges can be This patch makes sure that in case a loop is not fully contained within a region that later forms a SCoP, none of the loop backedges are allowed to be part of the region. We currently do not support the situation where only some of a loop's backedges are part of a scop. Today, this can break both scop modeling and code generation. One such breaking test case is for example test/ScopDetectionDiagnostics/loop_partially_in_scop-2.ll, where we totally forgot to code generate some of the backedges. Fortunately, it is commonly not necessary to support these partial loops; it is far more common that either no backedge is included in a region or all loop backedges are included. This fixes a recent miscompile in MultiSource/Benchmarks/MiBench/consumer-typeset which was exposed after r306477. llvm-svn: 308113	2017-07-15 22:42:17 +00:00
Tobias Grosser	325204a30e	[Polly] Translate Scop::DomainMap to islpp Reviewers: grosser, Meinersbur, bollu Subscribers: pollydev Tags: #polly Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35453 llvm-svn: 308093	2017-07-15 12:41:32 +00:00
Tobias Grosser	13acbb91ee	[Polly] Use Isl c++ for InvalidDomainMap Reviewers: grosser, Meinersbur, bollu Subscribers: maxf, pollydev Tags: #polly Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D35308 llvm-svn: 308089	2017-07-15 09:01:31 +00:00
Tobias Grosser	231179ac70	update isl to: isl-0.18-791-ga22eb92 This is a regular maintenance update llvm-svn: 308013	2017-07-14 10:36:00 +00:00
Siddharth Bhat	03346c2701	[PPCGCodeGeneration] Fix runtime check adjustments since they make assumptions about BB layout. - There is a conditional branch that is used to switch between the old and new versions of the code. - If we detect that the build was unsuccessful, `PPCGCodeGeneration` will change the runtime check to be always set to false. - To actually reach this runtime check instruction, `PPCGCodeGeneration` was using assumptions about the layout of the BBs. - However, invariant load hoisting violates this assumption by inserting an extra basic block in the middle. - Fix the assumption on the layout by having `createScopConditionally` return the conditional branch instruction. - Use this reference to set to always-false. llvm-svn: 308010	2017-07-14 10:00:25 +00:00
Siddharth Bhat	a1b2086a33	[Invariant Loads] Do not consider invariant loads to have dependences. We need to relax constraints on invariant loads so that they do not create fake RAW dependences. So, we do not consider invariant loads as scalar dependences in a region. During these changes, it turned out that we do not consider `llvm::Value` replacements correctly within `PPCGCodeGeneration` and `ISLNodeBuilder`. The replacements dictated by `ValueMap` were not being followed in all places. This was fixed in this commit. There is no clean way to decouple this change because this bug only seems to arise when the relaxed version of invariant load hoisting was enabled. Differential Revision: https://reviews.llvm.org/D35120 llvm-svn: 307907	2017-07-13 12:18:56 +00:00
Singapuram Sanjay Srivallabh	1abd9ffa37	[PPCGCodeGen] Differentiate kernels based on their parent Scop Summary: Add a sequence number that identifies a ptx_kernel's parent Scop within a function to its name to differentiate it from other kernels produced from the same function, yet different Scops. Kernels produced from different Scops can end up having the same name. Consider a function with 2 Scops and each Scop being able to produce just one kernel. Both of these kernels have the name "kernel_0". This can lead to the wrong kernel being launched when the runtime picks a kernel from its cache based on the name alone. This patch supplements D33985, by differentiating kernels across Scops as well. Previously (even before D33985) while profiling kernels generated through JIT e.g. Julia, [[ https://groups.google.com/d/msg/polly-dev/J1j587H3-Qw/mR-jfL16BgAJ \| kernels associated with different functions, and even different SCoPs within a function, would be grouped together due to the common name ]]. This patch prevents this grouping and the kernels are reported separately. Reviewers: grosser, bollu Reviewed By: grosser Subscribers: mehdi_amini, nemanjai, pollydev, kbarton Tags: #polly Differential Revision: https://reviews.llvm.org/D35176 llvm-svn: 307814	2017-07-12 16:46:19 +00:00
Siddharth Bhat	6cbb5a478e	[NFC] [SCEVValidator] Make parameter name of `hasScalarDepsInsideRegion` consistent. `SCEV` parameter is called as `Expr` in `SCEVValidator.cpp`, as well as in other functions in `SCEVValidator.h`. llvm-svn: 307800	2017-07-12 15:32:30 +00:00
Siddharth Bhat	87fa280831	[Polly] [Tests] Update `lit.cfg` uses of `lit.util.capture` to `subprocess.check_output` - `lit.util.capture` was removed in `r306625`. - Replace `lit.util.capture` to `subprocess.check_output` as LLVM did. - LLVM revision of this change: `https://reviews.llvm.org/D35088`. Differential Revision: https://reviews.llvm.org/D35255 llvm-svn: 307765	2017-07-12 09:42:05 +00:00
Philip Pfaffe	e12d036d13	[WWW] Add a section to Getting Started about building out-of-tree llvm-svn: 307704	2017-07-11 20:37:28 +00:00
Tobias Grosser	bed2ca6eac	[Simplify] Also remove redundant writes which originally came from PHI nodes llvm-svn: 307660	2017-07-11 14:29:39 +00:00
Philip Pfaffe	54df93d60e	[Polly][CMake] Skip unit-tests in lit if gtest is not available Summary: There is a bug in the current lit configurations for the unittests. If gtest is not available, the site-config for the unit tests won't be generated. Because lit recurses through the test directory, the lit configuration for the unit tests will be discovered nevertheless, leading to a fatal error in lit. This patch semi-gracefully skips the unittests if gtest is not available. As a result, running lit now prints this: `warning: test suite 'Polly-Unit' contained no test`. If people think that this is too annoying, the alternative would be to pick apart the test directory, so that the lit testsuite discovery will always only find one configuration. In fact, both of these things could be combined. While it's certainly nice that running a single lit command runs all the tests, I suppose people use the `check-polly` make target over lit most of the time, so the difference might not be noticed. Reviewers: Meinersbur, grosser Reviewed By: grosser Subscribers: mgorny, bollu, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D34053 llvm-svn: 307651	2017-07-11 11:37:35 +00:00
Philip Pfaffe	d99c406e3d	[Polly][CMake] Use the CMake Package instead of llvm-config in out-of-tree builds Summary: As of now, Polly uses llvm-config to set up LLVM dependencies in an out-of-tree build. This is problematic for two reasons: 1) Right now, in-tree and out-of-tree builds in fact do different things. E.g., in an in-tree build, libPolly depends on a handful of LLVM libraries, while in an out-of-tree build it depends on all of them. This means that we often need to treat both paths separately. 2) I'm specifically unhappy with the way libPolly is linked right now, because it just blindly links against all the LLVM libs. That doesn't make a lot of sense. For instance, one of these libs is LLVMTableGen, which contains a command line definition of a -o option. This means that I can not link an out-of-tree libPolly into a tool which might want to offer a -o option as well. This patch (mostly) drops the use of llvm-config in favor of LLVM's exported cmake package. However, building Polly with unittests requires access to the gtest sources (in the LLVM source tree). If we're building against an LLVM installation, this source tree is unavailable and must be specified. I'm using llvm-config to provide a default in this case. Reviewers: Meinersbur, grosser Reviewed By: grosser Subscribers: tstellar, bollu, chapuni, mgorny, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D33299 llvm-svn: 307650	2017-07-11 11:24:25 +00:00
Tobias Grosser	d6bea86029	[tests] Add import-jscop-dir to lit.site.cfg.in For the previous commit I accidentally added this change to lit.site.cfg, which is autogenerated and was consequently not part of the previous commit. llvm-svn: 307648	2017-07-11 11:07:01 +00:00
Tobias Grosser	e40c0fe3f8	[tests] Set -polly-import-jscop-dir=%S always This simplifies the test cases. llvm-svn: 307645	2017-07-11 10:39:01 +00:00
Tobias Grosser	6561f78b64	[Simplify] Add test case which we currently miss llvm-svn: 307643	2017-07-11 10:30:45 +00:00
Tobias Grosser	6a4c12fb33	Always export the latest memory access relations This allows us to export the results from transformations such as DeLICM. llvm-svn: 307641	2017-07-11 10:10:13 +00:00
Tobias Grosser	153a508349	[IslAst] Print memory accesses in AST dump When providing the option "-polly-ast-print-accesses" Polly also prints the memory accesses that are generated: #pragma known-parallel for (int c0 = 0; c0 <= 1023; c0 += 4) #pragma simd for (int c1 = c0; c1 <= c0 + 3; c1 += 1) Stmt_for_body( /* read */ &MemRef_B[0] /* write */ MemRef_A[c1] ); This makes writing and debugging memory layout transformations easier. Based on a patch contributed by Thomas Lang (ETH Zurich) llvm-svn: 307579	2017-07-10 20:13:06 +00:00
Tobias Grosser	f44f005a7d	Remove freed InvalidDomains from InvalidDomainMap. Summary: Since r306667, propagateInvalidStmtDomains gets a reference to an InvalidDomainMap. As part of the branch leading to return false, the respective domain is freed. It is, however, not removed from the InvalidDomainMap, leaking a pointer to a freed object which results in a use-after-free. Fix this by removing the domain from the map before returning. We tried to derive a test case that reliably fails, but did not succeed in producing one. Hence, for now the failures in our LNT bots must be sufficient to keep this issue tested. Reviewers: grosser, Meinersbur, bollu Subscribers: bollu, nandini12396, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D34971 llvm-svn: 307499	2017-07-09 15:47:17 +00:00
Siddharth Bhat	7cd1f53ce7	[NFC] [PPCGCodeGeneration] Extend `invariant-load-hoisting-with-variable-upper-bound` test case. - Check that we have invariant accesses. - Use `-polly-use-llvm-names` for better names in the test. - Rename test function to `f` for brevity. llvm-svn: 307401	2017-07-07 14:02:27 +00:00
Siddharth Bhat	1fc7b76a2b	[NFC] [PPCGCodeGeneration] Add test for simple invariant load hoisting. - This already works, but add this to ensure that there are no regressions when I expand the invariant load hoisting ability of `PPCGCodeGeneration`. llvm-svn: 307398	2017-07-07 13:44:22 +00:00
Tobias Grosser	41f02a9960	Make create_ll work with latest LLVM [NFC] - Instead of running with -O0, we enable the highest optimization level, but then disable optimizations. This ensures that possibly important metadata is still emitted. - Update the code for attribute removal to work with latest LLVM - Do not cut an arbitrary number of lines from the LL file. It is undocumented why this was needed in the first place, and such a feature is likely to break with trivial IR changes that may come in the future. llvm-svn: 307355	2017-07-07 04:20:55 +00:00
Siddharth Bhat	761e5b9310	[Polly] [PPCGCodeGeneration] Teach `must_kills` to kill scalars that are local to the scop. - By definition, we can pass something as a `kill` to PPCG if we know that no data can flow across a kill. - This is useful for more complex examples where we have scalars that are local to a scop. - If the local is only used within a scop, we are free to kill it. Differential Revision: https://reviews.llvm.org/D35045 llvm-svn: 307260	2017-07-06 13:42:42 +00:00
Singapuram Sanjay Srivallabh	79f13b9a80	Prefix the name of the calling host function in the name of callee GPU kernel Summary: Provide more context to the name of a GPU kernel by prefixing its name with the host function that calls it. E.g. The first kernel called by `gemm` would be `FUNC_gemm_KERNEL_0`. Kernels currently follow the "kernel_#" (# = 0,1,2,3,...) nomenclature. This patch makes it easier to map host caller and device callee, especially when there are many kernels produced by Polly-ACC. Reviewers: grosser, Meinersbur, bollu, philip.pfaffe, kbarton! Reviewed By: grosser Subscribers: nemanjai, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D33985 llvm-svn: 307173	2017-07-05 16:48:21 +00:00
Siddharth Bhat	de0a534c75	[NFC] Fix breaking build by adding REQUIRES: pollyacc llvm-svn: 307165	2017-07-05 15:20:28 +00:00
Siddharth Bhat	47c7237bd8	[NFC] [ScopInfo] fix warning about construction order llvm-svn: 307164	2017-07-05 15:07:28 +00:00
Siddharth Bhat	a82f2d264a	[PPCGCodeGeneration] Teach Polly to start using live range reordering. Polly did not use PPCG's live range reordering feature. Teach PPCGCodeGeneration to use this. Documentation on this is sparse, so much of the code is conservative. We currently kill all phi nodes in a Scop by appending them to the must_kill map we pass to PPCG. I do not have a proof of correctness, but it seems to be intuitively correct. We also do not handle `array_order`, which, quoting PPCG, is: PPCG/gpu.h: "Order dependences on non-scalars." It seems to consist of RAW dependences between arrays. We need to pass this information for more complex privatization cases. Differential Revision: https://reviews.llvm.org/D34941 llvm-svn: 307163	2017-07-05 14:57:04 +00:00
Tobias Grosser	5e41458985	Bump isl to isl-0.18-768-g033b61ae Summary: This is a general maintenance update Reviewers: grosser Subscribers: srhines, fedor.sergeev, pollydev, llvm-commits Contributed-by: Maximilian Falkenstein <falkensm@student.ethz.ch> Differential Revision: https://reviews.llvm.org/D34903 llvm-svn: 307090	2017-07-04 15:54:11 +00:00
Singapuram Sanjay Srivallabh	02ca346e48	Introduce a hybrid target to generate code for either the GPU or CPU Summary: Introduce a "hybrid" `-polly-target` option to optimise code for either the GPU or CPU. When this target is selected, PPCGCodeGeneration will attempt first to optimise a Scop. If the Scop isn't modified, it is then sent to the passes that form the CPU pipeline, i.e. IslScheduleOptimizerPass, IslAstInfoWrapperPass and CodeGeneration. In case the Scop is modified, it is marked to be skipped by the subsequent CPU optimisation passes. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: kbarton, nemanjai, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D34054 llvm-svn: 306863	2017-06-30 19:42:21 +00:00
Tobias Grosser	37c8ee7611	Fix typo llvm-svn: 306791	2017-06-30 06:30:51 +00:00
Chandler Carruth	16879be0da	Update Polly to reflect a change to a clang-format patch. I'm not sure this is a great test file name based on this update, but I'll let Polly folks sort out how they want this to work long-term, I just want the bots back. llvm-svn: 306767	2017-06-29 23:58:03 +00:00
NAKAMURA Takumi	b49ca64b18	Test commit llvm-svn: 306696	2017-06-29 16:35:38 +00:00
Michael Kruse	476f855ec8	[ScopInfo] Do not use ScopStmt in Domain derivation of ScopInfo. NFC ScopStmts were being used in the computation of the Domain of the SCoPs in ScopInfo. Once statements are split, there will not be a 1-to-1 correspondence between Stmts and Basic blocks. Thus this patch avoids the use of getStmtFor() by creating a map of BB to InvalidDomain and using it to compute the domain of the statements. Contributed-by: Nanidini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D33942 llvm-svn: 306667	2017-06-29 12:47:41 +00:00
NAKAMURA Takumi	6936506f50	Test commit llvm-svn: 306657	2017-06-29 09:46:01 +00:00
Singapuram Sanjay Srivallabh	42caad0257	Initializing NVPTX backend within Polly Summary: The NVPTX backend is now initialised within Polly. A language front-end need not be modified to initialise the backend, just for Polly. Reviewers: Meinersbur, grosser Reviewed By: Meinersbur Subscribers: vchuravy, mgorny Tags: #polly Differential Revision: https://reviews.llvm.org/D31859 llvm-svn: 306649	2017-06-29 07:43:22 +00:00
Michael Kruse	b738ffa845	Heap allocation for new arrays. This patch aims to implement the option of allocating new arrays created by polly on heap instead of stack. To enable this option, a key named 'allocation' must be written in the imported json file with the value 'heap'. We need such a feature because in a next iteration, we will implement a mechanism of maximal static expansion which will need a way to allocate arrays on heap. Indeed, the expansion is very costly in terms of memory and doing the allocation on stack is not worth considering. The malloc and the free are added respectively at polly.start and polly.exiting such that there is no use-after-free (for instance in case of Scop in a loop) and such that all memory cells allocated with a malloc are free'd when we don't need them anymore. We also add : - In the class ScopArrayInfo, we add a boolean as member called IsOnHeap which represents the fact that the array is allocated on heap or not. - A new branch in the method allocateNewArrays in the ISLNodeBuilder for the case of heap allocation. allocateNewArrays now takes a BBPair containing polly.start and polly.exiting. allocateNewArrays takes these two blocks and adds the malloc and free calls respectively to polly.start and polly.exiting. - As IntPtrTy for the malloc call, we use the DataLayout one. To do that, we have modified : - createScopArrayInfo and getOrCreateScopArrayInfo such that it returns a non-const SAI, in order to be able to call setIsOnHeap in the JSONImporter. - executeScopConditionally such that it returns both start block and end block of the scop, because we need these two blocks to be able to add the malloc and the free calls at the right position. Differential Revision: https://reviews.llvm.org/D33688 llvm-svn: 306540	2017-06-28 13:02:43 +00:00
Tobias Grosser	72d2539937	Test commit llvm-svn: 306539	2017-06-28 12:58:44 +00:00
Andreas Simbuerger	6d08ec7233	[JSONImport] Check, if the size of an imported array is positive llvm-svn: 306479	2017-06-27 22:30:44 +00:00
Andreas Simbuerger	dbb0ef8e94	[NFC][CodeGen] Use the ExitBlock explicitly. Before we would 'guess' the correct location for the MergeBlock that got introduced when executing a Scop conditionally. This implicitly depends on the situation that at this point during CodeGen there will be nothing between polly.start and polly.exiting. With this commit we explicitly state that we want the block that directly follows polly.exiting. llvm-svn: 306398	2017-06-27 11:33:22 +00:00
Andreas Simbuerger	4e6eed8566	[FIX] Add %loadPolly to test This test fails, if polly is not linked into LLVM's tools. Our lit site-config already deals with this by not adding the -load option, if polly is linked into LLVM's tools. llvm-svn: 306395	2017-06-27 10:47:55 +00:00
Siddharth Bhat	65d7f72f2c	[PPCGCodeGeneration] Add flag to allow polly to fail if GPU kernel fails. - This is useful for debugging GPU code. llvm-svn: 306290	2017-06-26 14:56:56 +00:00
Siddharth Bhat	f291c8d510	[PPCGCodeGeneration] Allow intrinsics within kernels. - In D33414, if any function call was found within a kernel, we would bail out. - This is an over-approximation. This patch changes this by allowing the `llvm.sqrt.*` family of intrinsics. - This introduces an additional step when creating a separate llvm::Module for a kernel (GPUModule). We now copy function declarations from the original module to new module. - We also populate IslNodeBuilder::ValueMap so it replaces the function references to the old module to the ones in the new module (GPUModule). Differential Revision: https://reviews.llvm.org/D34145 llvm-svn: 306284	2017-06-26 13:12:06 +00:00
Andreas Simbuerger	256070d85c	[NFC] Return both polly.start and polly.exiting from executeScopConditionally. This commit returns both the start and the exit block that are created by executeScopConditionally. In a future commit we will make use of the exit block. Before we would have to use the implicit property that there won't be any code generated between polly.start and polly.exiting at the time of use to find the correct block ('polly.exiting'). All usage locations are semantically unchanged. llvm-svn: 306283	2017-06-26 12:17:11 +00:00
Tobias Grosser	2927cb7520	[tests] Add forgotten pollyacc REQUIRES line llvm-svn: 306273	2017-06-26 06:07:40 +00:00
Siddharth Bhat	a12f807f33	[PPCGCodeGeneration] Enable GPU code generation with invariant loads. The condition that disallowed code generation in PPCGCodeGeneration with invariant loads is not required. I haven't been able to construct a counterexample where this generates invalid code. Differential Revision: https://reviews.llvm.org/D34604 llvm-svn: 306245	2017-06-25 14:48:24 +00:00
Tobias Grosser	812bc3c983	Test commit llvm-svn: 306244	2017-06-25 14:22:32 +00:00
Tobias Grosser	1b9d1bcc6d	[ScopInfo] Bound the number of array disjuncts in run-time bounds checks This reduces the compilation time of one reduced test case from Android from 16 seconds to 100 mseconds (we bail out), without negatively impacting any other test case we currently have. We still saw occasionally compilation timeouts on the AOSP buildbot. Hopefully, those will go away with this change. llvm-svn: 306235	2017-06-25 06:32:00 +00:00
Roman Gareev	c4a4d04717	[FIX] A small addition to r305675. llvm-svn: 306234	2017-06-25 06:30:11 +00:00
Tobias Grosser	c948178af8	Update to latest clang-format changes llvm-svn: 306203	2017-06-24 05:23:10 +00:00
Michael Kruse	7604d9add5	[ScopBuilder] Pass ScopStmts around instead of BasicBlocks. NFC. During the construction of MemoryAccesses in ScopBuilder, BasicBlocks were used in function parameters, assuming that the ScopStmt can be directly derived from it. This won't be true anymore once we split BasicBlocks into multiple ScopStmt. As a preparation for such a change in the future, we instead pass the ScopStmt and avoid the use of getStmtFor(). There are two occasions where a kind of mapping from BasicBlock to ScopStmt is still required. 1. Get the statement representing the incoming block of a `PHINode` using `getLastStmtOf`. 2. One statement is required to write a scalar to be readable by those which need it. This is most often the statement which contains its definition, which we get using `getStmtFor(Instruction*)`. Differential Revision: https://reviews.llvm.org/D34369 llvm-svn: 306132	2017-06-23 17:55:36 +00:00
Tobias Grosser	78a7a6cddf	Bail out early in case we see an invalid runtime context in buildAliasGroups llvm-svn: 306088	2017-06-23 08:05:31 +00:00
Tobias Grosser	57a1d36d98	Hoist buildMinMaxAccess computeout to cover full alias-group This allows us to bail out both in case the lexmin/max computation is too expensive, but also in case the cumulative cost across an alias group is too expensive. This is an improvement of r303404, which did not seem to be sufficient to keep the Android Buildbot quiet. llvm-svn: 306087	2017-06-23 08:05:27 +00:00

3667 Commits