llvm-project

Commit Graph

Author	SHA1	Message	Date
Singapuram Sanjay Srivallabh	1abd9ffa37	[PPCGCodeGen] Differentiate kernels based on their parent Scop Summary: Add a sequence number that identifies a ptx_kernel's parent Scop within a function to it's name to differentiate it from other kernels produced from the same function, yet different Scops. Kernels produced from different Scops can end up having the same name. Consider a function with 2 Scops and each Scop being able to produce just one kernel. Both of these kernels have the name "kernel_0". This can lead to the wrong kernel being launched when the runtime picks a kernel from its cache based on the name alone. This patch supplements D33985, by differentiating kernels across Scops as well. Previously (even before D33985) while profiling kernels generated through JIT e.g. Julia, [[ https://groups.google.com/d/msg/polly-dev/J1j587H3-Qw/mR-jfL16BgAJ \| kernels associated with different functions, and even different SCoPs within a function, would be grouped together due to the common name ]]. This patch prevents this grouping and the kernels are reported separately. Reviewers: grosser, bollu Reviewed By: grosser Subscribers: mehdi_amini, nemanjai, pollydev, kbarton Tags: #polly Differential Revision: https://reviews.llvm.org/D35176 llvm-svn: 307814	2017-07-12 16:46:19 +00:00
Siddharth Bhat	6cbb5a478e	[NFC] [SCEVValidator] Make parameter name of `hasScalarDepsInsideRegion` consistent. `SCEV` parameter is called as `Expr` in `SCEVValidator.cpp`, as well as in other functions in `SCEVValidator.h`. llvm-svn: 307800	2017-07-12 15:32:30 +00:00
Siddharth Bhat	87fa280831	[Polly] [Tests] Update `lit.cfg` uses of `lit.util.capture` to `subprocess.check_output` - `lit.util.capture` was removed in `r306625`. - Replace `lit.util.capture` to `subprocess.check_output` as LLVM did. - LLVM revision of this change: `https://reviews.llvm.org/D35088`. Differential Revision: https://reviews.llvm.org/D35255 llvm-svn: 307765	2017-07-12 09:42:05 +00:00
Philip Pfaffe	e12d036d13	[WWW] Add a section to Getting Started about building out-of-tree llvm-svn: 307704	2017-07-11 20:37:28 +00:00
Tobias Grosser	bed2ca6eac	[Simplify] Also remove redundant writes which originally came from PHI nodes llvm-svn: 307660	2017-07-11 14:29:39 +00:00
Philip Pfaffe	54df93d60e	[Polly][CMake] Skip unit-tests in lit if gtest is not available Summary: There is a bug in the current lit configurations for the unittests. If gtest is not available, the site-config for the unit tests won't be generated. Because lit recurses through the test directory, the lit configuration for the unit tests will be discovered nevertheless, leading to a fatal error in lit. This patch semi-gracefully skips the unittests if gtest is not available. As a result, running lit now prints this: `warning: test suite 'Polly-Unit' contained no test`. If people think that this is too annoying, the alternative would be to pick apart the test directory, so that the lit testsuite discovery will always only find one configuration. In fact, both of these things could be combined. While it's certainly nice that running a single lit command runs all the tests, I suppose people use the `check-polly` make target over lit most of the time, so the difference might not be noticed. Reviewers: Meinersbur, grosser Reviewed By: grosser Subscribers: mgorny, bollu, pollydev, llvm-commits Tags: #polly Differential Revision: https://reviews.llvm.org/D34053 llvm-svn: 307651	2017-07-11 11:37:35 +00:00
Philip Pfaffe	d99c406e3d	[Polly][CMake] Use the CMake Package instead of llvm-config in out-of-tree builds Summary: As of now, Polly uses llvm-config to set up LLVM dependencies in an out-of-tree build. This is problematic for two reasons: 1) Right now, in-tree and out-of-tree builds in fact do different things. E.g., in an in-tree build, libPolly depends on a handful of LLVM libraries, while in an out-of-tree build it depends on all of them. This means that we often need to treat both paths seperately. 2) I'm specifically unhappy with the way libPolly is linked right now, because it just blindly links against all the LLVM libs. That doesn't make a lot of sense. For instance, one of these libs is LLVMTableGen, which contains a command line definition of a -o option. This means that I can not link an out-of-tree libPolly into a tool which might want to offer a -o option as well. This patch (mostly) drop the use of llvm-config in favor of LLVMs exported cmake package. However, building Polly with unittests requires access to the gtest sources (in the LLVM source tree). If we're building against an LLVM installation, this source tree is unavailable and must specified. I'm using llvm-config to provide a default in this case. Reviewers: Meinersbur, grosser Reviewed By: grosser Subscribers: tstellar, bollu, chapuni, mgorny, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D33299 llvm-svn: 307650	2017-07-11 11:24:25 +00:00
Tobias Grosser	d6bea86029	[tests] Add import-jscop-dir to lit.site.cfg.in For the previous commit I accidentally added this change to lit.site.cfg, which is autogenerated and was consequently not part of the previous commit. llvm-svn: 307648	2017-07-11 11:07:01 +00:00
Tobias Grosser	e40c0fe3f8	[tests] Set -polly-import-jscop-dir=%S always This simplifies the test cases. llvm-svn: 307645	2017-07-11 10:39:01 +00:00
Tobias Grosser	6561f78b64	[Simplify] Add test case which we currently miss llvm-svn: 307643	2017-07-11 10:30:45 +00:00
Tobias Grosser	6a4c12fb33	Always export the latest memory access relations This allows us to export the results from transformations such as DeLICM. llvm-svn: 307641	2017-07-11 10:10:13 +00:00
Tobias Grosser	153a508349	[IslAst] Print memory accesses in AST dump When providing the option "-polly-ast-print-accesses" Polly also prints the memory accesses that are generated: #pragma known-parallel for (int c0 = 0; c0 <= 1023; c0 += 4) #pragma simd for (int c1 = c0; c1 <= c0 + 3; c1 += 1) Stmt_for_body( /* read / &MemRef_B[0] / write */ MemRef_A[c1] ); This makes writing and debugging memory layout transformations easier. Based on a patch contributed by Thomas Lang (ETH Zurich) llvm-svn: 307579	2017-07-10 20:13:06 +00:00
Tobias Grosser	f44f005a7d	Remove freed InvalidDomains from InvalidDomainMap. Summary: Since r306667, propagateInvalidStmtDomains gets a reference to an InvalidDomainMap. As part of the branch leading to return false, the respective domain is freed. It is, however, not removed from the InvalidDomainMap, leaking a pointer to a freed object which results in a use-after-free. Fix this be removing the domain from the map before returning. We tried to derive a test case that reliably failes, but did not succeed in producing one. Hence, for now the failures in our LNT bots must be sufficient to keep this issue tested. Reviewers: grosser, Meinersbur, bollu Subscribers: bollu, nandini12396, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D34971 llvm-svn: 307499	2017-07-09 15:47:17 +00:00
Siddharth Bhat	7cd1f53ce7	[NFC] [PPCGCodeGeneration] Extend `invariant-load-hoisting-with-variable-upper-bound` test case. - Check that we have invariant accesses. - Use `-polly-use-llvm-names` for better names in the test. - Rename test function to `f` for brevity. llvm-svn: 307401	2017-07-07 14:02:27 +00:00
Siddharth Bhat	1fc7b76a2b	[NFC] [PPCGCodeGeneration] Add test for simple invariant load hoisting. - This already works, but add this to ensure that there is no regressions when I expand the invariant load hoisting ability of `PPCGCodeGeneration`. llvm-svn: 307398	2017-07-07 13:44:22 +00:00
Tobias Grosser	41f02a9960	Make create_ll work with latest LLVM [NFC] - Instead of running with -O0, we enable the highest optimization level, but then disable optimizations. This ensures that possibly important metadata is still emitted. - Update the code for attribute removal to work with latest LLVM - Do not cut an arbitrary number of lines from the LL file. It is undocumented why this was needed at the first place, and such a feature is likely to break with trivial IR changes that may come in the future. llvm-svn: 307355	2017-07-07 04:20:55 +00:00
Siddharth Bhat	761e5b9310	[Polly] [PPCGCodeGeneration] Teach `must_kills` to kill scalars that are local to the scop. - By definition, we can pass something as a `kill` to PPCG if we know that no data can flow across a kill. - This is useful for more complex examples where we have scalars that are local to a scop. - If the local is only used within a scop, we are free to kill it. Differential Revision: https://reviews.llvm.org/D35045 llvm-svn: 307260	2017-07-06 13:42:42 +00:00
Singapuram Sanjay Srivallabh	79f13b9a80	Prefix the name of the calling host function in the name of callee GPU kernel Summary: Provide more context to the name of a GPU kernel by prefixing its name with the host function that calls it. E.g. The first kernel called by `gemm` would be `FUNC_gemm_KERNEL_0`. Kernels currently follow the "kernel_#" (# = 0,1,2,3,...) nomenclature. This patch makes it easier to map host caller and device callee, especially when there are many kernels produced by Polly-ACC. Reviewers: grosser, Meinersbur, bollu, philip.pfaffe, kbarton! Reviewed By: grosser Subscribers: nemanjai, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D33985 llvm-svn: 307173	2017-07-05 16:48:21 +00:00
Siddharth Bhat	de0a534c75	[NFC] Fix breaking build by adding REQUIRES: pollyacc llvm-svn: 307165	2017-07-05 15:20:28 +00:00
Siddharth Bhat	47c7237bd8	[NFC] [ScopInfo] fix warning about construction order llvm-svn: 307164	2017-07-05 15:07:28 +00:00
Siddharth Bhat	a82f2d264a	[PPCGCodeGeneration] Teach Polly to start using live range reordering. Polly did not use PPCG's live range reordering feature. Teach PPCGCodeGeneration to use this. Documentation on this is sparse, so much of the code is conservative. We currently kill all phi nodes in a Scop by appending them to the must_kill map we pass to PPCG. I do not have a proof of correctness, but it seems to be intuitively correct. We also do not handle `array_order`, which, quoting PPCG, is: PPCG/gpu.h: "Order dependences on non-scalars." It seems to consist of RAW dependences between arrays. We need to pass this information for more complex privatization cases. Differential Revision: https://reviews.llvm.org/D34941 llvm-svn: 307163	2017-07-05 14:57:04 +00:00
Tobias Grosser	5e41458985	Bump isl to isl-0.18-768-g033b61ae Summary: This is a general maintenance update Reviewers: grosser Subscribers: srhines, fedor.sergeev, pollydev, llvm-commits Contributed-by: Maximilian Falkenstein <falkensm@student.ethz.ch> Differential Revision: https://reviews.llvm.org/D34903 llvm-svn: 307090	2017-07-04 15:54:11 +00:00
Singapuram Sanjay Srivallabh	02ca346e48	Introduce a hybrid target to generate code for either the GPU or CPU Summary: Introduce a "hybrid" `-polly-target` option to optimise code for either the GPU or CPU. When this target is selected, PPCGCodeGeneration will attempt first to optimise a Scop. If the Scop isn't modified, it is then sent to the passes that form the CPU pipeline, i.e. IslScheduleOptimizerPass, IslAstInfoWrapperPass and CodeGeneration. In case the Scop is modified, it is marked to be skipped by the subsequent CPU optimisation passes. Reviewers: grosser, Meinersbur, bollu Reviewed By: grosser Subscribers: kbarton, nemanjai, pollydev Tags: #polly Differential Revision: https://reviews.llvm.org/D34054 llvm-svn: 306863	2017-06-30 19:42:21 +00:00
Tobias Grosser	37c8ee7611	Fix typo llvm-svn: 306791	2017-06-30 06:30:51 +00:00
Chandler Carruth	16879be0da	Update Polly to reflect a change to a clang-format patch. I'm not sure this is a great test file name based on this update, but I'll let Polly folks sort out how they want this to work long-term, I just want tho bots back. llvm-svn: 306767	2017-06-29 23:58:03 +00:00
NAKAMURA Takumi	b49ca64b18	Test commit llvm-svn: 306696	2017-06-29 16:35:38 +00:00
Michael Kruse	476f855ec8	[ScopInfo] Do not use ScopStmt in Domain derivation of ScopInfo. NFC ScopStmts were being used in the computation of the Domain of the SCoPs in ScopInfo. Once statements are split, there will not be a 1-to-1 correspondence between Stmts and Basic blocks. Thus this patch avoids the use of getStmtFor() by creating a map of BB to InvalidDomain and using it to compute the domain of the statements. Contributed-by: Nanidini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D33942 llvm-svn: 306667	2017-06-29 12:47:41 +00:00
NAKAMURA Takumi	6936506f50	Test commit llvm-svn: 306657	2017-06-29 09:46:01 +00:00
Singapuram Sanjay Srivallabh	42caad0257	Initializing NVPTX backend within Polly Summary: The NVPTX backend is now initialised within Polly. A language front-end need not be modified to initialise the backend, just for Polly. Reviewers: Meinersbur, grosser Reviewed By: Meinersbur Subscribers: vchuravy, mgorny Tags: #polly Differential Revision: https://reviews.llvm.org/D31859 llvm-svn: 306649	2017-06-29 07:43:22 +00:00
Michael Kruse	b738ffa845	Heap allocation for new arrays. This patch aims to implement the option of allocating new arrays created by polly on heap instead of stack. To enable this option, a key named 'allocation' must be written in the imported json file with the value 'heap'. We need such a feature because in a next iteration, we will implement a mechanism of maximal static expansion which will need a way to allocate arrays on heap. Indeed, the expansion is very costly in terms of memory and doing the allocation on stack is not worth considering. The malloc and the free are added respectively at polly.start and polly.exiting such that there is no use-after-free (for instance in case of Scop in a loop) and such that all memory cells allocated with a malloc are free'd when we don't need them anymore. We also add : - In the class ScopArrayInfo, we add a boolean as member called IsOnHeap which represents the fact that the array in allocated on heap or not. - A new branch in the method allocateNewArrays in the ISLNodeBuilder for the case of heap allocation. allocateNewArrays now takes a BBPair containing polly.start and polly.exiting. allocateNewArrays takes this two blocks and add the malloc and free calls respectively to polly.start and polly.exiting. - As IntPtrTy for the malloc call, we use the DataLayout one. To do that, we have modified : - createScopArrayInfo and getOrCreateScopArrayInfo such that it returns a non-const SAI, in order to be able to call setIsOnHeap in the JSONImporter. - executeScopConditionnaly such that it return both start block and end block of the scop, because we need this two blocs to be able to add the malloc and the free calls at the right position. Differential Revision: https://reviews.llvm.org/D33688 llvm-svn: 306540	2017-06-28 13:02:43 +00:00
Tobias Grosser	72d2539937	Test commit llvm-svn: 306539	2017-06-28 12:58:44 +00:00
Andreas Simbuerger	6d08ec7233	[JSONImport] Check, if the size of an imported array is positive llvm-svn: 306479	2017-06-27 22:30:44 +00:00
Andreas Simbuerger	dbb0ef8e94	[NFC][CodeGen] Use the ExitBlock explicitly. Before we would 'guess' the correct location for the MergeBlock that got introduced when executing a Scop conditionally. This implicitly depends on the situation that at this point during CodeGen there will be nothing between polly.start and polly.exiting. With this commit we explicitly state that we want the block that directly follows polly.exiting. llvm-svn: 306398	2017-06-27 11:33:22 +00:00
Andreas Simbuerger	4e6eed8566	[FIX] Add %loadPolly to test This test fails, if polly is not linked into LLVM's tools. Our lit site-config already deals with this by not adding the -load option, if polly is linked into LLVM's tools. llvm-svn: 306395	2017-06-27 10:47:55 +00:00
Siddharth Bhat	65d7f72f2c	[PPCGCodeGeneration] Add flag to allow polly to fail in GPU kernel fails. - This is useful for debugging GPU code. llvm-svn: 306290	2017-06-26 14:56:56 +00:00
Siddharth Bhat	f291c8d510	[PPCGCodeGeneration] Allow intrinsics within kernels. - In D33414, if any function call was found within a kernel, we would bail out. - This is an over-approximation. This patch changes this by allowing the `llvm.sqrt.*` family of intrinsics. - This introduces an additional step when creating a separate llvm::Module for a kernel (GPUModule). We now copy function declarations from the original module to new module. - We also populate IslNodeBuilder::ValueMap so it replaces the function references to the old module to the ones in the new module (GPUModule). Differential Revision: https://reviews.llvm.org/D34145 llvm-svn: 306284	2017-06-26 13:12:06 +00:00
Andreas Simbuerger	256070d85c	[NFC] Return both polly.start and polly.exiting from executeScopConditionally. This commit returns both the start and the exit block that are created by executeScopConditionally. In a future commit we will make use of the exit block. Before we would have to use the implicit property that there won't be any code generated between polly.start and polly.exiting at the time of use to find the correct block ('polly.exiting'). All usage location are semantically unchanged. llvm-svn: 306283	2017-06-26 12:17:11 +00:00
Tobias Grosser	2927cb7520	[tests] Add forgotten pollyacc REQUIRES line llvm-svn: 306273	2017-06-26 06:07:40 +00:00
Siddharth Bhat	a12f807f33	[PPCGCodeGeneration] Enable GPU code generation with invariant loads. The condition that disallowed code generation in PPCGCodeGeneration with invariant loads is not required. I haven't been able to construct a counterexample where this generates invalid code. Differential Revision: https://reviews.llvm.org/D34604 llvm-svn: 306245	2017-06-25 14:48:24 +00:00
Tobias Grosser	812bc3c983	Test commit llvm-svn: 306244	2017-06-25 14:22:32 +00:00
Tobias Grosser	1b9d1bcc6d	[ScopInfo] Bound the number of array disjuncts in run-time bounds checks This reduces the compilation time of one reduced test case from Android from 16 seconds to 100 mseconds (we bail out), without negatively impacting any other test case we currently have. We still saw occasionally compilation timeouts on the AOSP buildbot. Hopefully, those will go away with this change. llvm-svn: 306235	2017-06-25 06:32:00 +00:00
Roman Gareev	c4a4d04717	[FIX] A small addition to r305675. llvm-svn: 306234	2017-06-25 06:30:11 +00:00
Tobias Grosser	c948178af8	Update to latest clang-format changes llvm-svn: 306203	2017-06-24 05:23:10 +00:00
Michael Kruse	7604d9add5	[ScopBuilder] Pass ScopStmts around instead of BasicBlocks. NFC. During the construction of MemoryAccesses in ScopBuilder, BasicBlocks were used in function parameters, assuming that the ScopStmt an be directly derived from it. This won't be true anymore once we split BasicBlocks into multiple ScopStmt. As a preparation for such a change in the future, we instead pass the ScopStmt and avoid the use of getStmtFor(). There are two occasions where a kind of mapping from BasicBlock to ScopStmt is still required. 1. Get the statement representing the incoming block of a `PHINode` using `getLastStmtOf`. 2. One statement is required to write a scalar to be readable by those which need it. This is most often the statement which contains its definition, which we get using `getStmtFor(Instruction*)`. Differential Revision: https://reviews.llvm.org/D34369 llvm-svn: 306132	2017-06-23 17:55:36 +00:00
Tobias Grosser	78a7a6cddf	Bail out early in case we see an invalid runtime context in buildAliasGroups llvm-svn: 306088	2017-06-23 08:05:31 +00:00
Tobias Grosser	57a1d36d98	Hoist buildMinMaxAccess computeout to cover full alias-group This allows us to bail out both in case the lexmin/max computation is too expensive, but also in case the commulative cost across an alias group is too expensive. This is an improvement of r303404, which did not seem to be sufficient to keep the Android Buildbot quiet. llvm-svn: 306087	2017-06-23 08:05:27 +00:00
Tobias Grosser	8f23fb8486	[islpp] Move buildMinMaxAccess[es] to C++ [NFC] llvm-svn: 306086	2017-06-23 08:05:20 +00:00
Eli Friedman	5e589ea4b1	[ScopInfo] Fix crash with sum of invariant load and AddRec. r303971 added an assertion that SCEV addition involving an AddRec and a SCEVUnknown must involve a dominance relation: either the SCEVUnknown value dominates the AddRec's loop, or the AddRec's loop header dominates the SCEVUnknown. This is generally fine for most usage of SCEV because it isn't possible to write an expression in IR which would violate it, but it's a bit inconvenient here for polly. To solve the issue, just avoid creating a SCEV expression which triggers the asssertion. I'm not really happy with this solution, but I don't have any better ideas. Fixes https://bugs.llvm.org/show_bug.cgi?id=33464. Differential Revision: https://reviews.llvm.org/D34259 llvm-svn: 305864	2017-06-20 22:53:02 +00:00
Reid Kleckner	df2b283bf9	Fix -Wsign-compare in ScopInfo.cpp llvm::Loop::getNumBlocks returns an unsigned int, not a long. llvm-svn: 305717	2017-06-19 17:44:02 +00:00
Tobias Grosser	dcd94e3e93	[ScheduleOptimizer] Fix minor typo [NFC] llvm-svn: 305709	2017-06-19 16:55:48 +00:00

1 2 3 4 5 ...

3263 Commits