llvm-project

Commit Graph

Author	SHA1	Message	Date
Jennifer Chukwu	c5c7a493ef	Fixed Typos Fixed typo errors in release notes of Polly 13 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D100588	2021-04-28 08:55:03 +05:30
Michael Kruse	286677870b	[Polly][ManualOpt] Match interpretation of unroll metadata to LoopUnrolls's. We previously had a different interpretation of unroll transformation attributes than how LoopUnroll interpreted it. In particular, llvm.loop.unroll.enable was needed explicitly to enable it and disabling metadata was ignored. Additionally, it required that either full unrolling or an unroll factor to be specified or fail otherwise. An unroll factor is still required, but the transformation is ignored with the hope that LoopUnroll is going to apply the unrolling, since Polly currently does not implement an heuristic. Fixes llvm.org/PR50109	2021-04-24 04:30:19 -05:00
Fangrui Song	a7b7e7b187	[Polly] Fix stage 2 clang -Wdeprecated-copy after D79714	2021-04-24 00:56:23 -07:00
Krzysztof Parzyszek	a7f33d720b	[Polly] Explicitly include all std headers There were some recent header changes, and now some std classes are no longer available in ScopDetection.cpp with the pre-existing includes.	2021-04-21 12:52:11 -05:00
Michael Kruse	89b59345ee	[PollyACC] Fix declaration/stub definition mismatch. NFC. external.c defines stub functions that are never used because of how Polly uses PPCG. Unfortunately, they are declared as functions without return values or parameters which does not match their declarations. Since they are never called, this was usually not a problem, but an LTO build gets confused with differently declared functions, or in case of pet_options_args, a global variable declaration that is defined as a function Resolve by including the declaring headers in external.c which forces the declaration and definition to match at compile-time. This fixes llvm.org/50021	2021-04-21 11:52:43 -05:00
Michael Kruse	6048d1d19c	[PollyACC] Configure PollyPPCG only if needed. The PollyPPCG library is only needed when POLLY_ENABLE_GPGPU_CODEGEN=ON. If disabled, the library target is still created, but not linked against anything. This change does not add create the PollyPPCG build target if not needed. Motivated by llvm.org/PR50021	2021-04-21 01:08:01 -05:00
Michael Kruse	90e5ce0b0d	[PollyACC] Fix implicit function definitions. NFC. The isl_id_* have been in used without including the correspodning isl/id.h header. According to rules in C, a function is defined implicitly when first used with an assumed int return type (32 bits on 64 bit systems). But the implementation returns a pointer (64 bits on 64 bit systems). Is usually has no consequence because the return value is stored in a registers that is 64 bits (RAX) and the optimizer does not truncate its value before using it again as a pointer value. However, LTO optimizers will be rightfull;y confused. Fix by including <isl/id.h> This fixes llvm.org/PR50021	2021-04-21 01:08:00 -05:00
Roman Lebedev	2aff4f7f57	[polly] Fix check-polly after SCEVExpander PtrToInt fixes	2021-04-19 19:10:55 +03:00
patacca	4170d6cdd5	[Polly][Ast] Partial refactoring of IslAst and IslAstInfo to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class `IslAst` - `getAst()` `getRunCondition()` `buildRunCondition()` - Removed the destructor in favor of the default one - Change the type of the attribute `IslAst.RunCondition` to `isl::ast_expr` - Change the type of the attribute `IslAst.Root` to `isl::ast_node` - Change the order of attributes in class `IslAst` to reflect the data dependencies so that the destructor won't complain - Refactoring the following methods of class `IslAstInfo` - `getAst()` `getRunCondition()` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D100265	2021-04-16 00:40:26 -05:00
Fangrui Song	acf7e55783	[Polly] Fix PM invalidate usage after D100519	2021-04-15 18:41:20 -07:00
Hamza Sood	0a92aff721	Replace uses of std::iterator with explicit using This patch removes all uses of `std::iterator`, which was deprecated in C++17. While this isn't currently an issue while compiling LLVM, it's useful for those using LLVM as a library. For some reason there're a few places that were seemingly able to use `std` functions unqualified, which no longer works after this patch. I've updated those places, but I'm not really sure why it worked in the first place. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D67586	2021-04-12 10:47:14 -07:00
patacca	82fbc5d45b	[Polly] Partially refactoring of IslAstInfo and IslNodeBuilder to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class IslAstInfo - isParallel() isExecutedInParallel() isReductionParallel() getSchedule() getMinimalDependenceDistance() getBrokenReductions() - Refactoring the following methods of class IslNodeBuilder - getReferencesInSubtree() getScheduleForAstNode() - Refactoring function getBrokenReductionsStr() - Fixed the mismatching function declaration for getScheduleForAstNode() Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D99971	2021-04-10 21:28:02 -05:00
Abhina Sreeskantharajan	82b3e28e83	[SystemZ][z/OS][Windows] Add new OF_TextWithCRLF flag and use this flag instead of OF_Text Problem: On SystemZ we need to open text files in text mode. On Windows, files opened in text mode adds a CRLF '\r\n' which may not be desirable. Solution: This patch adds two new flags - OF_CRLF which indicates that CRLF translation is used. - OF_TextWithCRLF = OF_Text \| OF_CRLF indicates that the file is text and uses CRLF translation. Developers should now use either the OF_Text or OF_TextWithCRLF for text files and OF_None for binary files. If the developer doesn't want carriage returns on Windows, they should use OF_Text, if they do want carriage returns on Windows, they should use OF_TextWithCRLF. So this is the behaviour per platform with my patch: z/OS: OF_None: open in binary mode OF_Text : open in text mode OF_TextWithCRLF: open in text mode Windows: OF_None: open file with no carriage return OF_Text: open file with no carriage return OF_TextWithCRLF: open file with carriage return The Major change is in llvm/lib/Support/Windows/Path.inc to only set text mode if the OF_CRLF is set. ``` if (Flags & OF_CRLF) CrtOpenFlags \|= _O_TEXT; ``` These following files are the ones that still use OF_Text which I left unchanged. I modified all these except raw_ostream.cpp in recent patches so I know these were previously in Binary mode on Windows. ./llvm/lib/Support/raw_ostream.cpp ./llvm/lib/TableGen/Main.cpp ./llvm/tools/dsymutil/DwarfLinkerForBinary.cpp ./llvm/unittests/Support/Path.cpp ./clang/lib/StaticAnalyzer/Core/HTMLDiagnostics.cpp ./clang/lib/Frontend/CompilerInstance.cpp ./clang/lib/Driver/Driver.cpp ./clang/lib/Driver/ToolChains/Clang.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99426	2021-04-06 07:23:31 -04:00
patacca	812ce7f9be	[Polly] Refactoring isInnermost() from isl to use the C++ wrapper Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring isInnermost() to take C++ bindings instead of the plain isl C api. - Addition of manage_copy() when needed to get the reference for the isl_ast_node object Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D99841	2021-04-05 21:16:52 -05:00
David Blaikie	30df6d5d6a	Preprocessor conditionalize some assert-only functions to suppress -Wunused-function	2021-04-03 14:03:43 -07:00
Fangrui Song	927050af53	[Polly] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds	2021-03-24 19:56:43 -07:00
Nikita Popov	7d91d81c6b	[polly] Fix build This produced a compile error with GCC: llvm-project/polly/lib/Transform/ScheduleOptimizer.cpp:1220:49: error: cannot convert ‘bool’ to ‘llvm::TargetTransformInfo::RegisterKind’ 1220 \| RegisterBitwidth = TTI->getRegisterBitWidth(true);	2021-03-24 17:46:46 +01:00
Michael Kruse	40653c8bb9	[Polly][DCE] clang-format DeadCodeElimination.h. NFC. The targets polly-update-format and check-polly-format process new files only after running cmake again. This is why it was missed in the last commit.	2021-03-24 09:33:14 -05:00
Michael Kruse	8796451d6e	[Polly] Port DeadCodeElim to the NewPM.	2021-03-24 01:01:29 -05:00
Michael Kruse	5f58aae8f3	[Polly][CodeGen] Allow nesting of BandAttr mark without loop. BandAttr markers are added as parents of schedule tree bands. These also appear as markers its equivalent AST, but a band does not necessarily corresponds to a loop in this. Iterations may be peeled or the loop being unrolled (e.g. if it has just one iteration). In such cases it may happend that there is not loop between a BandAttr marker and the marker for a loop nested in the former parent band/loop. Handle the situation by giving priority to the inner marker over the outer. Fixes the polly-x86_64-linux-test-suite buildbot.	2021-03-16 16:17:07 -05:00
Michael Kruse	f51427afb5	[Polly][Unroll] Fix unroll_double test. We enumerated the cross product Domain x Scatter, but sorted only be the scatter key. In case there are are multiple statement instances per scatter value, the order between statement instances of the same loop iteration was undefined. Propertly enumerate and sort only by the scatter value, and group the domains using the scatter dimension again. Thanks to Leonard Chan for the report.	2021-03-16 09:00:42 -05:00
Michael Kruse	9c486eb348	[Polly] Fix deprecation warning. NFC. IRBuilder::CreateLoad without type parameter was deprecated in `6312c538` to prepare for opaque pointers.	2021-03-15 14:31:16 -05:00
Michael Kruse	3f170eb197	[Polly][Optimizer] Apply user-directed unrolling. Make Polly look for unrolling metadata (https://llvm.org/docs/TransformMetadata.html#loop-unrolling) that is usually only interpreted by the LoopUnroll pass and apply it to the SCoP's schedule. While not that useful by itself (there already is an unroll pass), it introduces mechanism to apply arbitrary loop transformation directives in arbitrary order to the schedule. Transformations are applied until no more directives are found. Since ISL's rescheduling would discard the manual transformations and it is assumed that when the user specifies the sequence of transformations, they do not want any other transformations to apply. Applying user-directed transformations can be controlled using the `-polly-pragma-based-opts` switch and is enabled by default. This does not influence the SCoP detection heuristic. As a consequence, loop that do not fulfill SCoP requirements or the initial profitability heuristic will be ignored. `-polly-process-unprofitable` can be used to disable the latter. Other than manually editing the IR, there is currently no way for the user to add loop transformations in an order other than the order in the default pipeline, or transformations other than the one supported by clang's LoopHint. See the `unroll_double.ll` test as example that clang currently is unable to emit. My own extension of `#pragma clang loop` allowing an arbitrary order and additional transformations is available here: https://github.com/meinersbur/llvm-project/tree/pragma-clang-loop. An effort to upstream this functionality as `#pragma clang transform` (because `#pragma clang loop` has an implicit transformation order defined by the loop pipeline) is D69088. Additional transformations from my downstream pragma-clang-loop branch are tiling, interchange, reversal, unroll-and-jam, thread-parallelization and array packing. Unroll was chosen because it uses already-defined metadata and does not require correctness checks. Reviewed By: sebastiankreutzer Differential Revision: https://reviews.llvm.org/D97977	2021-03-15 13:05:39 -05:00
Kevin Zhou	b7df372cdc	[Polly] Refactoring astScheduleDimIsParallel to take the C++ wrapper object. NFC Polly currently needs to be slowly refactor to use the C++ wrapper objects to handle the reference counters automatically. I took the function of astScheduleDimIsParallel and refactored it so that it uses the C++ wrapper function as much as possible. There are some problems with the IsParallel since it expects the C objects, so the C++ wrapper functions must be .release() and .get() first before they are able to be used with IsParallel. When checking the ReductionDependencies Parallelism with the Build's Schedule, I opted to keep the union map as a C object rather than a C++ object. Eventually, changes will need to be made to IsParallel to refactor it to the C++ wrappers. When this is done, this function will also need to be slightly refactored to not use the C object. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98455	2021-03-15 12:08:50 -05:00
Roman Lebedev	78b8ce40ef	Reland [SCEV] Improve modelling for (null) pointer constants This reverts commit `329aeb5db4`, and relands commit `61f006ac65`. This is a continuation of D89456. As it was suggested there, now that SCEV models `PtrToInt`, we can try to improve SCEV's pointer handling. In particular, i believe, i will need this in the future to further fix `SCEVAddExpr`operation type handling. This removes special handling of `ConstantPointerNull` from `ScalarEvolution::createSCEV()`, and add constant folding into `ScalarEvolution::getPtrToIntExpr()`. This way, `null` constants stay as such in SCEV's, but gracefully become zero integers when asked. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98147	2021-03-13 16:05:34 +03:00
Roman Lebedev	329aeb5db4	Temporairly evert "[SCEV] Improve modelling for (null) pointer constants" This appears to have broken ubsan bot: https://lab.llvm.org/buildbot/#/builders/85/builds/3062 https://reviews.llvm.org/D98147#2623549 It looks like LSR needs some kind of a change around insertion point handling. Reverting until i have a fix. This reverts commit `61f006ac65`.	2021-03-13 09:10:28 +03:00
Nikita Popov	42eb658f65	[OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) This removes some (but not all) uses of type-less CreateGEP() and CreateInBoundsGEP() APIs, which are incompatible with opaque pointers. There are a still a number of tricky uses left, as well as many more variation APIs for CreateGEP.	2021-03-12 21:01:16 +01:00
Roman Lebedev	61f006ac65	[SCEV] Improve modelling for (null) pointer constants This is a continuation of D89456. As it was suggested there, now that SCEV models `PtrToInt`, we can try to improve SCEV's pointer handling. In particular, i believe, i will need this in the future to further fix `SCEVAddExpr`operation type handling. This removes special handling of `ConstantPointerNull` from `ScalarEvolution::createSCEV()`, and add constant folding into `ScalarEvolution::getPtrToIntExpr()`. This way, `null` constants stay as such in SCEV's, but gracefully become zero integers when asked. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98147	2021-03-12 22:11:58 +03:00
Nikita Popov	ff9b37e95f	[Polly] Remove uses of type-less CreateLoad() APIs (NFC) These are incompatible with opaque pointers and are going away. Explicitly specify the loaded type instead.	2021-03-11 18:39:20 +01:00
Nikita Popov	46354bac76	[OpaquePtrs] Remove some uses of type-less CreateLoad APIs (NFC) Explicitly pass loaded type when creating loads, in preparation for the deprecation of these APIs. There are still a couple of uses left.	2021-03-11 14:40:57 +01:00
Leonard Chan	baf637dcde	Rename top-level LICENSE.txt files to LICENSE.TXT This makes all the license filenames uniform across subprojects. Differential Revision: https://reviews.llvm.org/D98380	2021-03-10 21:26:24 -08:00
Roman Lebedev	f449e5ef9b	[NFCI] Fix polly tests after `b46c085d2b` That commit changed SCEVExpander to emit intrinsics instead of icmp+select, but i forgot about polly, and i'm not sure if any bots complained.	2021-03-07 20:44:04 +03:00
Michael Kruse	b85c98b4c5	[Polly][Codegen] Emit access group metadata. Emit llvm.loop.parallel_accesses metadata instead of llvm.mem.parallel_loop_access. The latter is deprecated because it assumes that LoopIDs are persistent, which they are not. We also emit parallel access metadata for all surrounding parallel loops, not just the innermost parallel.	2021-03-04 03:58:03 -06:00
Prateek Pardeshi	50e34497ac	[Polly] Refabricating IsOutermostParallel() from Integer Set Libarary(ISL) to take the C++ wrapper Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: * Refabricating IsOutermostParallel() to take C++ bindings instead of reference-counting in C isl lib. * Addition of manage_copy() to be used as reference for C objects instead of IsOutermostParallel() Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D97751	2021-03-03 01:49:37 -06:00
Kevin Zhou	1ab2753d4c	[Polly] Refactoring IsInnermostParallel() in ISL to take the C++ wrapper object. NFC Currently, the IslAst library is a C library that would be incompatible with the rest of the LLVM because LLVM is written in C++. I took one function, IsInnermostParallel(), and refactored it so that it would take the C++ wrapper object instead of using reference counters with the C ISL library. As well, all the references that use IsInnermostParallel() will use manage_copy() since they are still expecting the C object. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D97425	2021-02-26 18:41:44 -06:00
Harmen Stoppels	a54f160b3a	Prefer /usr/bin/env xxx over /usr/bin/xxx where xxx = perl, python, awk Allow users to use a non-system version of perl, python and awk, which is useful in certain package managers. Reviewed By: JDevlieghere, MaskRay Differential Revision: https://reviews.llvm.org/D95119	2021-02-25 11:32:27 +01:00
Michael Kruse	91c472c86c	[Polly] Fix test after D96534.	2021-02-19 12:49:29 -06:00
Michael Kruse	ea37ee5bc4	[Polly] Update IslAstInfo::getNodePayload to use isl C++ interface. NFC.	2021-02-18 17:53:32 -06:00
Michael Kruse	ab0556bb20	[Polly] Regenerate isl-noexceptions.h. Regenerate the C++ wrapper header from the current isl version's headers. The most notable change is that some dimension sizes are represented by an isl_size (instead of unsigned), which is a signed int. Additionally, some function may return -1 in case of an error which already had been fixed in the past. The C++ may no return -1 instead of UINT_MAX which caused the problems. Some types in Polly had been changed from unsigned to isl_size (that were not already auto) and some loops/comparision had to be changed to avoid unsigned/signed comparison warnings.	2021-02-14 19:17:54 -06:00
Michael Kruse	d09491895f	[Polly] Invalidate passes after Scop processing in NewPM. ScopDetection's DetectionContext holds AssertionVH for RequiredInvariantLoads. An assertion is thrown if the handle's value is erased and the ScopDetection is not yet invalidated. The ScopDetection must remain valid durting the ScopPassManager. Enusure that all Scop analyses are free'd when the ScopPass manager is done. If IR generation has happened, also invalidate all other passes to avoid possible issues because, like for the legacy pass manager, Polly does not yet perfectly preserve them.	2021-02-14 17:32:29 -06:00
Michael Kruse	089421ba9a	[Polly] Test all optimization levels.	2021-02-14 00:31:10 -06:00
Michael Kruse	95ef556bd1	[Polly] Preserve DetectionContext references. DetectionContext objects are stored as values in a DenseMap. When the DenseMap reaches its maximum load factor, it is resized and all its objects moved to a new memory allocation. Unfortunately Scop object have a reference to its DetectionContext. When the DenseMap resizes, all the DetectionContexts reference now point to invalid memory, even if caused by an unrelated DetectionContext. Even worse, NewPM's ScopPassManager called isMaxRegionInScop with the Verify=true parameter before each pass. This caused the old DetectionContext to be removed an a new on created and re-verified. Of course, the Scop object was already created pointing to the old DetectionContext. Because the new DetectionContext would usually be stored at the same position in the DenseMap, the reference would usually reference the new DetectionContext of the same Region. Usually. If not, the old position still points to memory in the DenseMap allocation (unless also a resizing occurs) such that tools like Valgrind and AddressSanitizer would not be able to diagnose this. Instead of storing the DetectionContext inside the DenseMap, use a std::unique_ptr to a DetectionContext allocation, i.e. it will not move around anymore. This also allows use to remove the very strange DetectionContext(const DetectionContext &&) copy/move(?) constructor. DetectionContext objects now are neither copied nor moved. As a result, every re-verification of a DetectionContext will use a new allocation. Therefore, once a Scop object has been created using a DetectionContext, it must not be re-verified (the Scop data structure requires its underlying Region to not change before code generation anyway). The NewPM may call isMaxRegionInScop only with Validate=false parameter.	2021-02-13 03:36:09 -06:00
Michael Kruse	f0f5afc4dd	[Polly] Remove unused declaration. NFC.	2021-02-12 02:20:31 -06:00
Michael Kruse	9b123cde63	[Polly] Sanitize optimization levels. The description of the -polly switch stated that it was only enabled with -O3. This was a lie, the optimization level was ignored. Only at -O0 Polly was not added to the pass pipeline because the pass builder, but only because the extension points were not triggered. In the NewPM, the VectorizerStart extensions point is actually trigger even with -O0 which leads to the following crash: Assertion `Level != OptimizationLevel::O0 && "Must request optimizations!"' failed. We sanitize the optimization levels using the following rules for both pass mangers: 1. Only enable Polly if optimizing at all (-O1, -O2 or -O3). 2. Do not enable Polly when optimizing for size. 3. Ignore the optimization level for diagnostic passes (printer, viewer or JScop-exporter). 4. If only diagnostic passes enabled, skip the code-generation. 5. Fix the description of the -polly command line option.	2021-02-11 23:07:48 -06:00
Michael Kruse	7387f33bfe	[Polly] Hide IslScheduleOptimizer implementation from header. NFC. These are implementation details of the IslScheduleOptimizer pass implementation and not use anywhere else. Hence, we can move them to the cpp file and into an anonymous namespace. Only getPartialTilePrefixes is, aside from the pass itself, used externally (by the ScheduleOptimizerTest) and moved into the polly namespace.	2021-02-11 21:02:29 -06:00
Michael Kruse	606aa622b2	Revert "[AssumptionCache] Avoid dangling llvm.assume calls in the cache" This reverts commit `b7d870eae7` and the subsequent fix "[Polly] Fix build after AssumptionCache change (D96168)" (commit `e6810cab09`). It caused indeterminism in the output, such that e.g. the polly-x86_64-linux buildbot failed accasionally.	2021-02-11 12:17:38 -06:00
Michael Kruse	23753c6088	[Polly] Hide Simplify implementation from header. NFC. Move SimplifiyVisitor from Simplify.h to Simplify.cpp. It is not relevant for applying the pass in either the NewPM or the legacyPM. Rename it to SimplifyImpl to account for that. This is possible due its state not being necessary to be preserved between runs and thefore SimplifyImpl not needed to be held in the pass object. Instead, SimplifyImpl is only instatiated for the current Scop. In the NewPM as a function-local variable, and in the legacy PM inside a llvm::Optional object because the state must be preserved between the printScop (invoked by opt -analyze) and the most recent runOnScop calls.	2021-02-10 22:11:52 -06:00
Michael Kruse	91ca9adc9e	[Polly] Avoid "using namespace llvm" in public headers. NFC. "using namespace" pollutes the namespace of every file that includes such a header and universally considered a bad thing. Even the variant namespace polly { using namespace llvm; } (previously used by LoopGenerators.h) imports more symbols than the file is in control of. The header may include a fixed set of files from LLVM, but the header itself may by be included together with other headers from LLVM. For instance, LLVM's MemorySSA.h and Polly's ScopInfo.h both declare a class 'MemoryAccess' which may conflict. Instead of prefixing everything in Polly's header files, this patch adds 'using' statements to import only the symbols that are actually referenced in Polly. This approach is also used by MLIR to import commonly used symbols into the mlir namespace. This patch also puts the symbols declared in IslNodeBuilder.h into the Polly namespace to also be able to use the imported symbols.	2021-02-10 20:58:33 -06:00
Michael Kruse	d50f92a4f0	[Polly] Added dedicated test for working -O3 pipeline. Test the NewPM as well as the legacy PM.	2021-02-10 13:25:56 -06:00
Michael Kruse	11511ee343	[Polly] Do not use -O3 pipeline for single pass test.	2021-02-10 13:25:56 -06:00
Fangrui Song	564788ddce	[Polly] Fix -Wunused-lambda-capture	2021-02-10 09:19:05 -08:00
Fangrui Song	89e257bd62	[Polly] Fix -DPOLLY_ENABLE_GPGPU_CODEGEN=off build after `222d380d2f`	2021-02-10 09:17:13 -08:00
Michael Kruse	222d380d2f	[Polly] Make the NewPM pass pipeline more similar to the legacy's. Even though it has some oddities, both pipelines should be as similar as possible. Also use report_fatal_error instead of assertions to ensure a proper failure in release builds for unsupported options. This finalizes the patch serious to make Polly run in the default configuration when using the NewPM by default.	2021-02-09 23:56:22 -06:00
Michael Kruse	08bab4b030	[Polly] Make NewPM's IslAstAnalysis more similar to the legacy IslAstInfoWrapperPass. In particular, print the ast with -debug-only=polly-ast, print a per-scop header with print<polly-ast> and force-add the analysis with -polly-code-generation=ast.	2021-02-09 23:56:21 -06:00
Michael Kruse	13f758a805	[Polly] Improve Simplify pass PM integration. 1. LegacyPM: Rename SimplifyLegacyPass to SimplifyWrapperPass. 2. LegacyPM: Complete create/init functions in LinkAllPasses.h 3. NewPM: Only invalidate non-Scop passes if changed. 4. NewPM: Add to default pass pipeline. 5. NewPM: Print -analyze header for each print<polly-simplify>	2021-02-09 23:56:21 -06:00
Michael Kruse	e200df952b	[Polly] Port IslScheduleOptimizer to the NewPM.	2021-02-09 23:56:21 -06:00
Michael Kruse	5ff963fd11	[Polly] Add TargetTransformInfo to ScopPass standard analysis results. TargetTransformInfo is required by IslScheduleOptimizer, as ScopPass. Unfortunately it is not possible to get arbitrary larger-unit analyses in for as ScopPass. Loop passes also already use TargetTransformInfo as LoopStandardAnalysisResults, hence wei might expect it to be available to Scop passes as well.	2021-02-09 23:56:20 -06:00
Michael Kruse	e7b9e43c9a	[Polly] Register pass-instrumentation for NewPM's Scop level. The pass-instrumentation pass is implicitly execute by the NewPM whenever a new analysis runs. Not registering it will cause the crash whenever a scop pass requests an analysis. For instance this is the case for the IstAstAnalysis requesting the DependenceAnalsis result.	2021-02-09 23:56:20 -06:00
Michael Kruse	b687fc9122	[Polly] Port PruneUnprofitable to the NewPM.	2021-02-09 23:56:20 -06:00
Michael Kruse	7903d594ea	[Polly] Port DeLICM to the NewPM.	2021-02-09 23:56:19 -06:00
Michael Kruse	4c64d8ee3a	[Polly] Port ForwardOpTree to the NewPM.	2021-02-09 23:56:19 -06:00
Michael Kruse	3dcb535115	[Polly] Remove use of -O3 in regression test. In addition to that regression tests should not test the intire pass pipeline (unless they are testing the pipeline itself), the Polly-ACC currently does not support the new pass manager. If enabled by default, such tests will therefore fail. Use the -polly-gpu-runtime and -polly-gpu-arch options also as default values for the PPCGCodeGeneration pass. This requires to move the option to be moved from the pipeline-building Register passes to the PPCGCodeGeneration implementation. Fixes the spir-typesize.ll buildbot fail.	2021-02-09 18:13:35 -06:00
Fangrui Song	e6810cab09	[Polly] Fix build after AssumptionCache change (D96168)	2021-02-06 11:56:15 -08:00
Arthur Eubanks	781a1b1e36	[test] Pin spir-codegen.ll to legacy PM -polly-enable-delicm is not supported under the new PM but is tested here: Assertion `!EnableDeLICM && "This option is not implemented"' failed.	2021-02-03 19:37:32 -08:00
xgupta	94fac81fcc	[Branch-Rename] Fix some links According to the [[ https://foundation.llvm.org/docs/branch-rename/ \| status of branch rename ]], the master branch of the LLVM repository is removed on 28 Jan 2021. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95766	2021-02-01 16:43:21 +05:30
Tom Stellard	5369517d20	Bump the trunk major version to 13 and clear the release notes.	2021-01-26 19:37:55 -08:00
Michael Kruse	3b9677e1ec	[Polly] Track defined behavior for PHI predecessor computation. ZoneAlgorithms's computePHI relies on being provided with consistent a schedule to compute the statement prodecessors of a statement containing PHINodes. Otherwise unexpected results such as PHI nodes with multiple predecessors can occur which would result in problems in the algorithms expecting consistent data. In the added test case, statement instances are scrubbed from the SCoP their execution would result in undefined behavior (Due to a nsw overflow). As already being undefined behavior in LLVM-IR, neither AssumedContext nor InvalidContext are updated, giving computePHI no means to avoid these cases. Intoduce a new SCoP property, the DefinedBehaviorContext, that among the runtime-checked conditions, also tracks the assumptions not needing a runtime check, in particular those affecting the assumed control flow. This replaces the manual combination of the 3 other contexts that was already done in computePHI and setNewAccessRelation. Currently, the only additional assumption is that loop induction variables will nsw flag for not wrap, but potentially more can be added. Use in hasFeasibleRuntimeContext, isl::ast_build and gisting are other potential uses. To limit computational complexity, the DefinedBehaviorContext is not availabe if it grows too large (atm hardcoded to 8 disjuncts). Possible other fixes include bailing out in computePHI when inconsistencies are detected, choose an arbitrary value for inconsistent cases (since it is undefined behavior anyways), or make the code receiving the result from ComputePHI handle inconsistent data. All of them reduce the quality of implementation having to bail out more often and disabling the ability to assert on actually wrong results. This fixes llvm.org/PR48783.	2021-01-23 13:03:49 -06:00
Michael Kruse	02e8a5ad3c	[Polly] Allow param sets for dumpPw().	2021-01-23 13:03:48 -06:00
Michael Kruse	de0457a013	[Polly] Clean up hasFeasibleRuntimeContext.	2021-01-23 13:03:48 -06:00
Michael Kruse	a5b895110f	[Polly] Gist new access relations using the SCoP context. This simplifies the access relations.	2021-01-23 13:03:48 -06:00
Arthur Eubanks	cabe1b1124	[polly][NewPM][test] Fix polly tests under -enable-new-pm In preparation for turning on opt's -enable-new-pm by default, this pins uses of passes via the legacy "opt -passname" with pass names beginning with "polly-" and "polyhedral-info" to the legacy PM. Many of these tests use -analyze, which isn't supported in the new PM. (This doesn't affect uses of "opt -passes=passname"). rL240766 accidentally removed `-polly-prepare` in phi_not_grouped_at_top.ll, and it also doesn't use the output of -analyze. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D94266	2021-01-19 12:38:58 -08:00
Michael Kruse	842314b5f0	[Polly] Update isl to isl-0.23-61-g24e8cd12. This fixes llvm.org/PR48554 Some test cases had to be updated because the hash function for union_maps have been changed which affects the output order.	2021-01-19 12:01:31 -06:00
Wei Mi	86341247c4	[NFC] Rename ThinLTOPhase to ThinOrFullLTOPhase and move it from PassBuilder.h to Pass.h. In some compiler passes like SampleProfileLoaderPass, we want to know which LTO/ThinLTO phase the pass is in. Currently the phase is represented in enum class PassBuilder::ThinLTOPhase, so it is only available in PassBuilder and it also cannot represent phase in full LTO. The patch extends it to include full LTO phases and move it from PassBuilder.h to Pass.h, then it is much easier for PassBuilder to communiate with each pass about current LTO phase. Differential Revision: https://reviews.llvm.org/D94613	2021-01-13 15:55:40 -08:00
Juneyoung Lee	278aa65cc4	[IR] Let IRBuilder's CreateVectorSplat/CreateShuffleVector use poison as placeholder This patch updates IRBuilder to create insertelement/shufflevector using poison as a placeholder. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93793	2020-12-30 04:21:04 +09:00
Michael Kruse	bc633fe46b	[Polly] Consider InvalidContext to determine partial READ. MemoryAccess::setNewAccessRelation() in assert-builds checks whether the access relation for a READ has a memory location for every instance of the domain. Otherwise, we would not have value to load from. That check already considered that instances outside the Scop's context do not matter since they are never executed (or would be undefined behavior). In this patch also take instances of the InvalidContext into account, as these can also be assumed to never occur. InvalidContext was introduced to avoid the computational complexity of subtracting restrictions from the AssumedContext. However, this additional check in setNewAccessRelation is only done in assert-builds. The assertion case with an InvalidContext may occur with DeLICM on a conditionally infinite loops, as it is the case in the following code: for (int i = 0; i < n; i+=b) vreg = ...; Dest = vreg; The loop is infinite when b=0, and [b] -> { : b = 0 } is part of the InvalidContext. When DeLICM tries to map the memory for %vreg to Dest, there is no store instance that uses the value of vreg when b = 0, hence no location to map it to. However, the case is irrelevant since Polly's runtime condition check ensures that this is never case. Fixes llvm.org/PR48445	2020-12-10 22:25:19 -06:00
Michael Kruse	6249bfeefe	[Polly][CodeGen] Remove use of ScalarEvolution. ScalarEvolution::getSCEV cannot be used during codegen. ScalarEvolution assumes a stable IR and control flow which is under construction during Polly's CodeGen. In particular, it uses DominatorTree for compute the backedge taken count. However the DominatorTree is not updated during codegen. In this case, SCEV was used to determine the base pointer of an array access. Replace it by our own function. Polly generates only GEP and BitCasts for array acceses, i.e. it is sufficient to handle these to to find the base pointer. Fixes llvm.org/PR48422	2020-12-07 15:21:51 -06:00
dfukalov	2ce38b3f03	[NFC] Reduce include files dependency. 1. Removed #include "...AliasAnalysis.h" in other headers and modules. 2. Cleaned up includes in AliasAnalysis.h. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D92489	2020-12-03 18:25:05 +03:00
Nick Lewycky	fe43168348	Creating a named struct requires only a Context and a name, but looking up a struct by name requires a Module. The method on Module merely accesses the LLVMContextImpl and no data from the module itself, so this patch moves getTypeByName to a static method on StructType that takes a Context and a name. There's a small number of users of this function, they are all updated. This updates the C API adding a new method LLVMGetTypeByName2 that takes a context and a name. Differential Revision: https://reviews.llvm.org/D78793	2020-11-30 11:34:12 -08:00
Nikita Popov	4df8efce80	[AA] Split up LocationSize::unknown() Currently, we have some confusion in the codebase regarding the meaning of LocationSize::unknown(): Some parts (including most of BasicAA) assume that LocationSize::unknown() only allows accesses after the base pointer. Some parts (various callers of AA) assume that LocationSize::unknown() allows accesses both before and after the base pointer (but within the underlying object). This patch splits up LocationSize::unknown() into LocationSize::afterPointer() and LocationSize::beforeOrAfterPointer() to make this completely unambiguous. I tried my best to determine which one is appropriate for all the existing uses. The test changes in cs-cs.ll in particular illustrate a previously clearly incorrect AA result: We were effectively assuming that argmemonly functions were only allowed to access their arguments after the passed pointer, but not before it. I'm pretty sure that this was not intentional, and it's certainly not specified by LangRef that way. Differential Revision: https://reviews.llvm.org/D91649	2020-11-26 18:39:55 +01:00
Nikita Popov	eb995e9332	[Polly] Use LocationSize::unknown() (NFC) Avoid MemoryLocation::UnknownSize when we're initializing a LocationSize.	2020-11-19 20:26:43 +01:00
Michael Kruse	33ca0b0ea6	[Polly] Add override qualifiers. Silence compiler warnings that suggest to add override qualifiers.	2020-11-16 21:29:37 -06:00
Michael Kruse	fc115f2e73	[Polly] Move SimplifyVisitor into polly namespace. Declarations in headers should not be in the anonymous namespace. Compilers also warn about the use of <anon namespace>::SimplifyVisitor as a public field in polly::SimplifyPass and polly::SimplifyPrinterPass.	2020-11-16 18:59:08 -06:00
Michael Kruse	243511a24e	[Polly] Fix memory leak.	2020-11-12 20:04:17 -06:00
Michael Kruse	c8a0e27cfb	[Polly][OpTree] Fix mid-processing change of access kind. Operand tree forwarding can cause the change of an access kind; in particular change from a scalar kind to an array kind if the scalar dependency is not necessary. Such an access cannot and doesn't need to be forwarded anymore. Fixes llvm.org/PR48034	2020-11-11 16:21:48 -06:00
Michael Kruse	c1cf51e777	[Polly][OpTree] Better report applied changes. Print to dbgs() any taken action. Also, read-only scalars do not require any action unless -polly-analyze-read-only-scalars=true is used. Better refect this by using ForwardingAction::triviallyForwardable and thus not bumping the statistics.	2020-11-11 16:21:48 -06:00
Michael Kruse	e408935bb5	[Polly][ScopBuilder] Use only modeled instructions to compute statement granularity. ScopBuilder distributes independent instructions between statements. Only modeled (e.g. not synthesizable) instructions are represented. To compute independence, non-modeled instructions were used in some parts of determining instruction independence, which could lead to the re-introduction of non-model instructions. In particular, required invariant loads could be added to instruction list, which then led to redundant MemoryAccesses for such a load. This fixes llvm.org/PR48059.	2020-11-10 15:30:16 -06:00
Fangrui Song	98031b664c	[polly] Fix -Wunused-lambda-capture and -Wunused-variable	2020-11-02 20:35:26 -08:00
Roman Lebedev	b4916918e5	[SCEV] SCEVPtrToIntExpr simplifications If we've got an SCEVPtrToIntExpr(op), where op is not an SCEVUnknown, we want to sink the SCEVPtrToIntExpr into an operand, so that the operation is performed on integers, and eventually we end up with just an `SCEVPtrToIntExpr(SCEVUnknown)`. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D89692	2020-10-30 11:13:35 +03:00
Roman Lebedev	81fc53a36a	[SCEV] Introduce SCEVPtrToIntExpr (PR46786) And use it to model LLVM IR's `ptrtoint` cast. This is essentially an alternative to D88806, but with no chance for all the problems it caused due to having the cast as implicit there. (see rG7ee6c402474a2f5fd21c403e7529f97f6362fdb3) As we've established by now, there are at least two reasons why we want this: * It will allow SCEV to actually model the `ptrtoint` casts and their operands, instead of treating them as `SCEVUnknown` * It should help with initial problem of PR46786 - this should eventually allow us to not loose pointer-ness of an expression in more cases As discussed in [[ https://bugs.llvm.org/show_bug.cgi?id=46786 \| PR46786 ]], in principle, we could just extend `SCEVUnknown` with a `is ptrtoint` cast, because `ScalarEvolution::getPtrToIntExpr()` should sink the cast as far down into the expression as possible, so in the end we should always end up with `SCEVPtrToIntExpr` of `SCEVUnknown`. But i think that it isn't the best solution, because it doesn't really matter from memory consumption side - there probably won't be that many `SCEVPtrToIntExpr`s for it to matter, and it allows for much better discoverability. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D89456	2020-10-30 11:13:35 +03:00
Duncan P. N. Exon Smith	d4c667c9af	Avoid unnecessary uses of `MDNode::getTemporary`, NFC This is a long-delayed follow-up to `5e5b85098d`. `TempMDNode` includes a bunch of machinery for RAUW, and should only be used when necessary. RAUW wasn't being used in any of these cases... it was just a placeholder for a self-reference. Where the real node was using `MDNode::getDistinct`, just replace the temporary argument with `nullptr`. Where the real node was using `MDNode::get`, the `replaceOperandWith` call was "promoting" the node to a distinct one implicitly due to self-reference detection in `MDNode::handleChangedOperand`. The `TempMDNode` was serving a purpose by delaying uniquing, but it's way simpler to just call `MDNode::getDistinct` in the first place. Note that using a self-reference at all in these places is a hold-over from before `distinct` metadata existed. It was an old trick to create distinct nodes. It would be intrusive to change, including bitcode upgrades, etc., and it's harmless so I'm not sure there's much value in removing it from existing schemas. After this commit it still has a tiny memory cost (in the extra metadata operand) but no more overhead in construction. Differential Revision: https://reviews.llvm.org/D90079	2020-10-26 17:03:25 -04:00
Sylvestre Ledru	0784e17f1b	Remove .svn from exclude list as we moved to git Reviewed By: emaste Differential Revision: https://reviews.llvm.org/D89859	2020-10-21 16:09:21 +02:00
Fangrui Song	2213a354b9	[Polly] Delete unused lambda capture after `7175cffb21`	2020-10-20 18:34:52 -07:00
Michael Kruse	7175cffb21	[Polly] Reuse multiple uses in operand tree. Recursively traversing the operand tree leads to an exponential blowup if instructions are used multiple times due to every path leading to an additional copy of the instructions after forwarding. This problem was marked as a TODO in the code and was reported as a bug in llvm.org/PR47340. Fix by caching already visited instructions and returning the cached version when already visited. Instead of calling forwardTree() twice, return a ForwardingAction structure that contains a lambda which will carry-out the forwarding when requested. The lambdas are executed in reverse-postorder to mimic the previous recursive calls unless there is a reuse. Fixes llvm.org/PR47340	2020-10-20 18:05:35 -05:00
Christopher Tetreault	2eac8ce820	Get the address space within getVectorPtrTy getVectorPtrTy is private to VectorBlockGenerator, and all uses query the address space from the passed-in pointer prior to calling it. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D89745	2020-10-20 14:42:10 -07:00
Mark Schimmel	8e570abf10	Polly - specify address space when creating a pointer to a vector type Polly incorrectly dropped the address space specified for a load instruction when it vectorized the code. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D88907	2020-10-14 11:17:15 -05:00
Roman Lebedev	7ee6c40247	Revert "Reland "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown"" and it's follow-ups While we haven't encountered an earth-shattering problem with this yet, by now it is pretty evident that trying to model the ptr->int cast implicitly leads to having to update every single place that assumed no such cast could be needed. That is of course the wrong approach. Let's back this out, and re-attempt with some another approach, possibly one originally suggested by Eli Friedman in https://bugs.llvm.org/show_bug.cgi?id=46786#c20 which should hopefully spare us this pain and more. This reverts commits `1fb6104293`, `7324616660`, `aaafe350bb`, `e92a8e0c74`. I've kept&improved the tests though.	2020-10-14 16:09:18 +03:00
Roman Lebedev	1fb6104293	Reland "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown" This relands commit `1c021c64ca` which was reverted in commit `17cec6a11a` because an assertion was being triggered, since `BuildConstantFromSCEV()` wasn't updated to handle the case where the constant we want to truncate is actually a pointer. I was unsuccessful in coming up with a test case where we'd end there with constant zext/sext of a pointer, so i didn't handle those cases there until there is a test case. Original commit message: While we indeed can't treat them as no-ops, i believe we can/should do better than just modelling them as `unknown`. `inttoptr` story is complicated, but for `ptrtoint`, it seems straight-forward to model it just as a zext-or-trunc of unknown. This may be important now that we track towards making inttoptr/ptrtoint casts not no-op, and towards preventing folding them into loads/etc (see D88979/D88789/D88788) Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D88806	2020-10-12 23:02:55 +03:00
Hans Wennborg	17cec6a11a	Revert `1c021c64c` "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown" > While we indeed can't treat them as no-ops, i believe we can/should > do better than just modelling them as `unknown`. `inttoptr` story > is complicated, but for `ptrtoint`, it seems straight-forward > to model it just as a zext-or-trunc of unknown. > > This may be important now that we track towards > making inttoptr/ptrtoint casts not no-op, > and towards preventing folding them into loads/etc > (see D88979/D88789/D88788) > > Reviewed By: mkazantsev > > Differential Revision: https://reviews.llvm.org/D88806 It caused the following assert during Chromium builds: llvm/lib/IR/Constants.cpp:1868: static llvm::Constant llvm::ConstantExpr::getTrunc(llvm::Constant , llvm::Type *, bool): Assertion `C->getType()->isIntOrIntVectorTy() && "Trunc operand must be integer"' failed. See code review for a link to a reproducer. This reverts commit `1c021c64ca`.	2020-10-12 18:39:35 +02:00
Roman Lebedev	1c021c64ca	[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown While we indeed can't treat them as no-ops, i believe we can/should do better than just modelling them as `unknown`. `inttoptr` story is complicated, but for `ptrtoint`, it seems straight-forward to model it just as a zext-or-trunc of unknown. This may be important now that we track towards making inttoptr/ptrtoint casts not no-op, and towards preventing folding them into loads/etc (see D88979/D88789/D88788) Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D88806	2020-10-12 11:04:03 +03:00
Arthur Eubanks	6dcbea877b	[NewPM] Use PassInstrumentation for -verify-each This removes "VerifyEachPass" parameters from a lot of functions which is nice. Don't verify after special passes or VerifierPass. This introduces verification on loop and cgscc passes, verifying the corresponding function/module. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D88764	2020-10-07 19:24:25 -07:00

1 2 3 4 5 ...

4192 Commits