llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	e114dc024e	[FIX] Handle error blocks in non-affine regions correctly llvm-svn: 247545	2015-09-14 11:15:58 +00:00
Michael Kruse	d868b5d509	Merge TempScopInfo into ScopInfo The TempScopInfo (-polly-analyze-ir) pass is removed and its work taken over by ScopInfo (-polly-scops). Several tests depend on -polly-analyze-ir and use -polly-scops instead which for the moment prints the output of both passes. This again is not expected by some other tests, especially those with negative searches, which have been adapted. Differential Version: http://reviews.llvm.org/D12694 llvm-svn: 247288	2015-09-10 15:25:24 +00:00
Michael Kruse	7bf3944d23	Merge TempScopInfo.{cpp\|h} into ScopInfo.{cpp\|h} This prepares for a series of patches that merges TempScopInfo into ScopInfo to reduce Polly's code complexity. Only ScopInfo.{cpp\|h} will be left thereafter. Moving the code of TempScopInfo in one commit makes the mains diffs simpler to understand. In detail, merging the following classes is planned: TempScopInfo into ScopInfo TempScop into Scop IRAccess into MemoryAccess Only moving code, no functional changes intended. Differential Version: http://reviews.llvm.org/D12693 llvm-svn: 247274	2015-09-10 12:46:52 +00:00
Chandler Carruth	66ef16b289	[PM] Update Polly for the new AA infrastructure landed in r247167. llvm-svn: 247198	2015-09-09 22:13:56 +00:00
Tobias Grosser	f1ac57c6cd	IslNodeBuilder: Add virtual function to obtain the schedule of an ast node Not all users of our IslNodeBuilder will attach scheduling information to the AST in the same way IslAstInfo is doing it today. By going through a virtual function when extracting the schedule of an AST node other users can provide their own functions for extract scheduling information in case they attach scheduling information in a different way to the AST nodes. No functional change for Polly itself intended. llvm-svn: 247126	2015-09-09 09:24:38 +00:00
Johannes Doerfert	717b866798	Allow PHI nodes in the region exit block While we do not need to model PHI nodes in the region exit (as it is not part of the SCoP), we need to prepare for the case that the exit block is split in code generation to create a single exiting block. If this will happen, hence if the region did not have a single exiting block before, we will model the operands of the PHI nodes as escaping scalars in the SCoP. Differential Revision: http://reviews.llvm.org/D12051 llvm-svn: 247078	2015-09-08 21:44:27 +00:00
Tobias Grosser	a89dc57b41	Do not use '.' in subfunction names Certain backends, e.g. NVPTX, do not support '.' in function names. Hence, we ensure all '.' are replaced by '_' when generating function names for subfunctions. For the current OpenMP code generation, this is not strictly necessary, but future uses cases (e.g. GPU offloading) need this issue to be fixed. llvm-svn: 246980	2015-09-08 06:22:17 +00:00
Tobias Grosser	86bc93a9b2	Add option -polly-codegen-add-debug-printing When this option is enabled, Polly will emit printf calls for each scalar load/and store which dump the scalar value loaded/stored at run time. This patch also refactors the RuntimeDebugBuilder to use variadic templates when generating CPU printfs. As result, it now becomes easier to print strings that consist of a set of arguments. Also, as a single printf call is emitted, it is more likely for such strings to be emitted atomically if executed multi-threaded. llvm-svn: 246941	2015-09-06 08:47:57 +00:00
Tobias Grosser	e58d358171	RuntimeDebugPrinter: Simplify code [NFC] llvm-svn: 246940	2015-09-06 07:17:54 +00:00
Tobias Grosser	e3d8c05c5f	Add some more documentation and structure to the collection of subtree references Some of the structures are renamed, subfunction introduced to clarify the individual steps and comments are added describing their functionality. llvm-svn: 246929	2015-09-05 15:45:25 +00:00
Tobias Grosser	abcec37f64	IslNodeBuilder: Only obtain the isl_ast_build, when needed In the common case, the access functions are not modified, hence there is no need to obtain the IslAstBuild context at all. This should not only be minimally faster, but this also allows the IslNodeBuilder to work on asts that are not annotated with isl_ast_builds as long as the memory accesses are not modified. llvm-svn: 246928	2015-09-05 13:03:57 +00:00
Tobias Grosser	8eae8361fc	RegionGenerator: Do not modify GlobalMaps By inspection the update of the GlobalMaps in the RegionGenerator seems unneed, and is removed as also no test cases fail when dropping this. Johannes Doerfert confirmed that this is indeed save: "I think that code was needed when we did not use the scalar codegen by default. Now everything defined in a non-affine region should be communicated via memory and reloaded in the user block. Hence, we should be good removing this code." llvm-svn: 246926	2015-09-05 11:26:30 +00:00
Tobias Grosser	72b80672d9	OpenMP: Name the values passed to the subfunciton according to the original llvm::Values llvm-svn: 246924	2015-09-05 10:41:19 +00:00
Tobias Grosser	0d8874c0f6	OpenMP codegen: support generation of multi-dimensional access functions When computing the index expressions for new, multi-dimensional memory accesses these new index expressions may reference original llvm::Values that are not transfered into the OpenMP subfunction. Using GlobalMap we now replace references to such values with the rewritten values that have e.g. been passed to the OpenMP subfunction. llvm-svn: 246923	2015-09-05 10:32:56 +00:00
Tobias Grosser	bc13260775	BlockGenerator: Make GlobalMap a member variable The GlobalMap variable used in BlockGenerator should always reference the same list througout the entire code generation, hence we can make it a member variable to avoid passing it around through every function call. History: Before we switched to the SCEV based code generation the GlobalMap also contained a mapping form old to new induction variables, hence it was different for each ScopStmt, which is why we passed it as function argument to copyStmt. The new SCEV based code generation now uses a separate mapping called LTS -> LoopToSCEV that maps each original loop to a new loop iteration variable provided as a SCEVExpr. The GlobalMap is currently mostly used for OpenMP code generation, where references to parameters in the original function need to be rewritten to the locations of these variables after they have been passed to the subfunction. Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de> llvm-svn: 246920	2015-09-05 09:56:54 +00:00
Tobias Grosser	9f3d55cf3d	Generate scalar initialization loads at the beginning of the start BB Our OpenMP code generation generated part of its launching code directly into the start basic block and without this change the scalar initialization was run _after_ the OpenMP threads have been launched. This resulted in uninitialized scalar values to be used. llvm-svn: 246427	2015-08-31 11:06:19 +00:00
Tobias Grosser	f93451802a	OpenMP-codegen: Correctly pass function arguments to subfunctions Before we only checked if certain instructions can be expanded by us. Now we check any value, including function arguments. llvm-svn: 246425	2015-08-31 09:05:43 +00:00
Tobias Grosser	64c0ff4141	Add support for scalar dependences to OpenMP code generation Scalar dependences between scop statements have caused troubles during parallel code generation as we did not pass on the new stack allocation created for such scalars to the parallel subfunctions. This change now detects all scalar reads/writes in parallel subfunctions, creates the allocas for these scalar objects, passes the resulting memory locations to the subfunctions and ensures that within the subfunction requests for these memory locations will return the rewritten values. Johannes suggested as a future optimization to privatizing some of the scalars in the subfunction. llvm-svn: 246414	2015-08-31 05:52:24 +00:00
Tobias Grosser	2fc50df900	Do not store into a temporary twine For some reason, this causes memory corruption issues. Let's just avoid it. llvm-svn: 246396	2015-08-30 19:51:01 +00:00
Tobias Grosser	c0091a77f9	Store scalar dependences from outside the scop into alloca locations We already modeled read-only dependences to scalar values defined outside the scop as memory reads and also generated read accesses from the corresponding alloca instructions that have been used to pass these scalar values around during code generation. However, besides for PHI nodes that have already been handled, we failed to store the orignal read-only scalar values into these alloc. This commit extends the initialization of scalar values to all read-only scalar values used within the scop. llvm-svn: 246394	2015-08-30 19:19:34 +00:00
Tobias Grosser	b649e26a50	getNewScalarValue: Get ScalarMap directly from member variable [NFC] There is no need to pass the ScalarMap to getNewScalarValue as this map is (indirectly) used when calling getOrCreateScalarAlloca. llvm-svn: 246390	2015-08-30 17:37:55 +00:00
Tobias Grosser	655a4570cd	createScalarInitialization: Always store PHI-node value The current code really tries hard to use getNewScalarValue(), which checks if not the original value, but a possible copy or demoted value needs to be stored. In this calling context it seems, that we _always_ use the ScalarValue that comes from the incoming PHI node, but never any other value. As also no test cases fail, it seems right to just drop this call to getNewScalarValue and remove the parameters that are not needed any more. Johannes suggested that code like this might be needed for parallel code generation with offloading, but it was still unclear if/what exactly would be needed. As the parallel code generation does currently not support scalars at all, we will remove this code for now and add relevant code back when complitng the support of scalars in the parallel code generation. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D12470 llvm-svn: 246389	2015-08-30 17:32:39 +00:00
Tobias Grosser	e83a396b1d	Ignore debug intrinsics and do not model their potential scalar metadata reads Our code generation currently does not support scalar references to metadata values. Hence, it would crash if we try to model scalar dependences to metadata values. Fortunately, for one of the common uses, debug information, we can for now just ignore the relevant intrinsics and consequently the issue of how to model scalar dependences to metadata. llvm-svn: 246388	2015-08-30 16:57:20 +00:00
Tobias Grosser	9c0ffe3a1d	Remove some code duplication [NFC] llvm-svn: 246387	2015-08-30 16:57:15 +00:00
Tobias Grosser	fcfac082ea	Minor code style improvement [NFC] llvm-svn: 246386	2015-08-30 16:01:58 +00:00
Tobias Grosser	2985400a0e	Remove isNew from getOrCreateAlloca This commit drops some dead code. Specifically, there is no need to initialize the virtual memory locations of scalars in BlockGenerator::handleOutsideUsers, the function that initalizes the escape map that keeps track of out-of-scope uses of scalar values. We already model instructions inside the scop that are used outside the scope (escaping instructions) as scalar memory writes at the position of the instruction. As a result, the virtual memory location of this instructions is already initialized when code-generating the corresponding virtual scalar write and consequently does not need to be initialized later on when generating the set of escaping values. Code references: In TempScopInfo::buildScalarDependences we detect scalar cross-statement dependences for all instructions (including PHIs) that have uses outside of the scop's region: // Check whether or not the use is in the SCoP. if (!R->contains(UseParent)) { AnyCrossStmtUse = true; continue; } We use this information in TempScopInfo::buildAccessFunctions were we build scalar write memory accesses for all these instructions: if (!isa<StoreInst>(Inst) && buildScalarDependences(Inst, &R, NonAffineSubRegion)) { // If the Instruction is used outside the statement, we need to build the // write access. IRAccess ScalarAccess(IRAccess::MUST_WRITE, Inst, ZeroOffset, 1, true, Inst); Functions.push_back(std::make_pair(ScalarAccess, Inst)); } Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D12472 llvm-svn: 246383	2015-08-30 15:03:59 +00:00
Tobias Grosser	f8d55f7e4e	Remove some code duplication when creating Allocas [NFC] llvm-svn: 246364	2015-08-29 18:12:03 +00:00
Tobias Grosser	b79a67df78	BlockGenerator: Make scalar memory locations accessible For external users, the memory locations into which we generate scalar values may be of interest. This change introduces two functions that allow to obtain (or create) the AllocInsts for a given BasePointer. We use this change to simplify the code in BlockGenerators. llvm-svn: 246285	2015-08-28 08:23:35 +00:00
Tobias Grosser	2d1ed0bfa7	BlockGenerator: Add the possiblity to pass a set of new access functions This change allows the BlockGenerator to be reused in contexts where we want to provide different/modified isl_ast_expressions, which are not only changed to a different access relation than the original statement, but which may indeed be different for each code-generated instance of the statement. We ensure testing of this feature by moving Polly's support to import changed access functions through a jscop file to use the BlockGenerators support for generating arbitary access functions if provided. This commit should not change the behavior of Polly for now. The diff is rather large, but most changes are due to us passing the NewAccesses hash table through functions. This style, even though rather verbose, matches what is done throughout the BlockGenerator with other per-statement properties. llvm-svn: 246144	2015-08-27 07:28:16 +00:00
Tobias Grosser	39f9f30e8b	Only derive number of loop iterations for loops we can actually vectorize llvm-svn: 245870	2015-08-24 20:11:34 +00:00
Tobias Grosser	1ac884d73a	Use marker nodes to annotate the different levels of tiling Currently, marker nodes are ignored during AST generation, but visible in the -debug-only=polly-ast output. llvm-svn: 245809	2015-08-23 09:11:00 +00:00
Tobias Grosser	75296901f7	Fix 'unused variable' warning in NASSERTS build llvm-svn: 245723	2015-08-21 19:23:21 +00:00
Roman Gareev	c49724f008	Manually check a loop form Add manual check of a loop form and return non-negative number of iterations in case of trivially vectorizable loop. llvm-svn: 245680	2015-08-21 09:08:14 +00:00
Johannes Doerfert	43788c5783	Check for feasible runtime check context early Instead of generating code for an empty assumed context we bail out early. As the number of assumptions we generate increases this becomes more and more important. Additionally, this change will allow us to hide internal contexts that are only used in runtime checks e.g., a boundary context with constraints not suited for simplifications. llvm-svn: 245540	2015-08-20 05:58:56 +00:00
Tobias Grosser	b0da42fb55	Generate alias metadata even in OpenMP mode To make alias scope metadata generation work in OpenMP mode we now provide the ScopAnnotator with information about the base pointer rewrite that happens when passing arrays into the OpenMP subfunction. llvm-svn: 245451	2015-08-19 16:04:35 +00:00
Tobias Grosser	d8e3c8c665	Fix typo llvm-svn: 245441	2015-08-19 14:22:48 +00:00
Michael Kruse	acb6ade757	Move early exit to the beginning of the function If the function exits early there is no reason to enter the loop. llvm-svn: 245316	2015-08-18 17:25:48 +00:00
Michael Kruse	d2b0360197	Fix Codegen adding a second exit out of region executeScopConditionally would destroy a predecessor region if it the scop's entry was the region's exit block by forking it to polly.start and thus creating a secnd exit out of the region. This patch "shrinks" the predecessor region s.t. polly.split_new_and_old is not the region's exit anymore. llvm-svn: 245294	2015-08-18 13:14:42 +00:00
Johannes Doerfert	e69e1141d9	Introduce the ScopExpander as a SCEVExpander replacement The SCEVExpander cannot deal with all SCEVs Polly allows in all kinds of expressions. To this end we introduce a ScopExpander that handles the additional expressions separatly and falls back to the SCEVExpander for everything else. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12066 llvm-svn: 245288	2015-08-18 11:56:00 +00:00
Johannes Doerfert	d86f2157e5	Add a field to the memory access class for a related value. The new field in the MemoryAccess allows us to track a value related to that access: - For real memory accesses the value is the loaded result or the stored value. - For straigt line scalar accesses it is the access instruction itself. - For PHI operand accesses it is the operand value. We use this value to simplify code which deduced information about the value later in the Polly pipeline and was known to be error prone. Reviewers: grosser, Meinsersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12062 llvm-svn: 245213	2015-08-17 10:58:17 +00:00
Tobias Grosser	c5bcf246d1	Fix Polly after SCEV port to new pass manager This fixes compilation after LLVM commit r245193. llvm-svn: 245211	2015-08-17 10:57:08 +00:00
Johannes Doerfert	e1fa6da356	[FIX] Create location if a needed value was not yet demoted This allows the code generation to continue working even if a needed value (that is reloaded anyway) was not yet demoted. Instead of failing it will now create the location for future demotion to memory and load from that location. The stores will use the same location and by construction execute before the load even if the textual order in the generated AST is otherwise. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12072 llvm-svn: 245203	2015-08-17 09:38:46 +00:00
Johannes Doerfert	ddb83d0f6d	Remove trivially true condition llvm-svn: 245174	2015-08-16 08:35:40 +00:00
Tobias Grosser	234a48270e	AST Generation Paper published in TOPLAS The July issue of TOPLAS contains a 50 page discussion of the AST generation techniques used in Polly. This discussion gives not only an in-depth description of how we (re)generate an imperative AST from our polyhedral based mathematical program description, but also gives interesting insights about: - Schedule trees: A tree-based mathematical program description that enables us to perform loop transformations on an abstract level, while issues like the generation of the correct loop structure and loop bounds will be taken care of by our AST generator. - Polyhedral unrolling: We discuss techniques that allow the unrolling of non-trivial loops in the context of parameteric loop bounds, complex tile shapes and conditionally executed statements. Such unrolling support enables the generation of predicated code e.g. in the context of GPGPU computing. - Isolation for full/partial tile separation: We discuss native support for handling full/partial tile separation and -- in general -- native support for isolation of boundary cases to enable smooth code generation for core computations. - AST generation with modulo constraints: We discuss how modulo mappings are lowered to efficient C/LLVM code. - User-defined constraint sets for run-time checks We discuss how arbitrary sets of constraints can be used to automatically create run-time checks that ensure a set of constrainst actually hold. This feature is very useful to verify at run-time various assumptions that have been taken program optimization. Polyhedral AST generation is more than scanning polyhedra Tobias Grosser, Sven Verdoolaege, Albert Cohen ACM Transations on Programming Languages and Systems (TOPLAS), 37(4), July 2015 llvm-svn: 245157	2015-08-15 09:34:33 +00:00
Michael Kruse	82a1c7de09	Make TempScopInfo a RegionPass This modifies the order in which Polly passes are executed. Assuming a function has two scops (A and B), the order before was: FunctionPassManager ScopDetection IndependentBlocks TempScopInfo for A and B RegionPassManager ScopInfo for A DependenceInfo for A IslScheduleOptimizer for A IslAstInfo for A CodeGeneration for A ScopInfo for B DependenceInfo for B IslScheduleOptimizer for B IslAstInfo for B CodeGeneration for B After this patch: FunctionPassManager ScopDetection IndependentBlocks RegionPassManager TempScopInfo for A ScopInfo for A DependenceInfo for A IslScheduleOptimizer for A IslAstInfo for A CodeGeneration for A TempScopInfo for B ScopInfo for B DependenceInfo for B IslScheduleOptimizer for B IslAstInfo for B CodeGeneration for B TempScopInfo for B might store information and references to the IR that CodeGeneration for A might modify. Changing the order ensures that the IR is not modified from the analysis of a region until code generation. Reviewers: grosser Differential Revision: http://reviews.llvm.org/D12014 llvm-svn: 245091	2015-08-14 20:10:27 +00:00
Tobias Grosser	0164b8ff70	Enable code generation of scalar dependences from function arguments This change extends the BlockGenerator to not only allow Instructions as base elements of scalar dependences, but any llvm::Value. This allows us to code-generate scalar dependences which reference function arguments, as they arise when moddeling read-only scalar dependences. llvm-svn: 244874	2015-08-13 08:07:39 +00:00
Michael Kruse	9c483c5834	Assign regions to all BBs from CodeGeneration In order to have a valid region analysis, we assign all newly created blocks to the parent of the scop's region. This is correct for any pre-existing regions (including the scop's region and its parent), but does not discover any region inside the generated code. For Polly this is not necessary because we do not want to re-run Polly on its own generated code anyway. Reviewers: grosser Part of Differential Revision: http://reviews.llvm.org/D11867 llvm-svn: 244608	2015-08-11 14:47:37 +00:00
Michael Kruse	22370884c4	Revise the simplification of regions The previous code had several problems: For newly created BasicBlocks it did not (always) call RegionInfo::setRegionFor in order to update its analysis. At the moment RegionInfo does not verify its BBMap, but will in the future. This is fixed by determining the region new BBs belong to and set it accordingly. The new executeScopConditionally() requires accurate getRegionFor information. Which block is created by SplitEdge depends on the incoming and outgoing edges of the blocks it connects, which makes handling its output more difficult than it needs to be. Especially for finding which block has been created an to assign a region to it for the setRegionFor problem above. This patch uses an implementation for splitEdge that always creates a block between the predecessor and successor. simplifyRegion has also been simplified by using SplitBlockPredecessors instead of SplitEdge. Isolating the entries and exits have been refectored into individual functions. Previously simplifyRegion did more than just ensuring that there is only one entering and one exiting edge. It ensured that the entering block had no other outgoing edge which was necessary for executeScopConditionally(). Now the latter uses the alternative splitEdge implementation which can handle this situation so simplifyRegion really only needs to simplify the region. Also, executeScopConditionally assumed that there can be no PHI nodes in blocks with one incoming edge. This is wrong and LCSSA deliberately produces such edges. However, previous passes ensured that there can be no such PHIs in exit nodes, but which will no longer hold in the future. The new code that the property that it preserves the identity of region block (the property that the memory address of the BasicBlock containing the instructions remains the same; new blocks only contain PHI nodes and a terminator), especially the entry block. As a result, there is no need to update the reference to the BasicBlock of ScopStmt that contain its instructions because they have been moved to other basic blocks. Reviewers: grosser Part of Differential Revision: http://reviews.llvm.org/D11867 llvm-svn: 244606	2015-08-11 14:39:21 +00:00
Tobias Grosser	c186ac7aea	BlockGenerator: Do not store 'store' statements in BBMap A store statement has no return value and can consequently not be referenced from another statement. llvm-svn: 244576	2015-08-11 08:13:15 +00:00
Michael Kruse	9bb8ef03a2	Add an assertion Check whether a block is a direct predecessor. llvm-svn: 244401	2015-08-08 18:10:54 +00:00

1 2 3 4 5 ...

354 Commits