llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Drewniak	ddc2eb0ada	[mlir] Adds getUpperBound() to LoopLikeInterface. getUpperBound is analogous to getLowerBound(), except for the upper bound, and is used in range analysis. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124020	2022-04-19 19:56:44 +00:00
Alex Zinenko	0eb403ad1b	[mlir][transform] Introduce transform.sequence op Sequence is an important transform combination primitive that just indicates transform ops being applied in a row. The simplest version fails immediately if any transformation in the sequence fails. Introducing this operation allows one to start placing transform IR within other IR. Depends On D123135 Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123664	2022-04-19 21:41:02 +02:00
Denys Petrov	e37726beb2	[analyzer] Implemented RangeSet::Factory::castTo function to perform promotions, truncations and conversions. Summary: Handle casts for ranges working similarly to APSIntType::apply function but for the whole range set. Support promotions, truncations and conversions. Example: promotion: char [0, 42] -> short [0, 42] -> int [0, 42] -> llong [0, 42] truncation: llong [4295033088, 4295033130] -> int [65792, 65834] -> short [256, 298] -> char [0, 42] conversion: char [-42, 42] -> uint [0, 42]U[4294967254, 4294967295] -> short[-42, 42] Differential Revision: https://reviews.llvm.org/D103094	2022-04-19 22:34:03 +03:00
Ashay Rane	25c218be36	[MLIR] Add function to create BFloat16 array attribute This patch adds a new function `mlirDenseElementsAttrBFloat16Get()`, which accepts the shaped type, the number of BFloat16 values, and a pointer to an array of BFloat16 values, each of which is a `uint16_t` value. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D123981	2022-04-19 19:27:06 +00:00
Jonas Paulsson	0f8c626723	[BuildLibCalls] Introduce getOrInsertLibFunc() for use when building libcalls. A new set of overloaded functions named getOrInsertLibFunc() are now supposed to be used instead of getOrInsertFunction() when building a libcall from within an LLVM optimizer(). The idea is that this new function also makes sure that any mandatory argument attributes are added to the function prototype (after calling getOrInsertFunction()). inferLibFuncAttributes() is renamed to inferNonMandatoryLibFuncAttrs() as it only adds attributes that are not necessary for correctness but merely helping with later optimizations. Generally, the front end is responsible for building a correct function prototype with the needed argument attributes. If the middle end however is the one creating the call, e.g. when replacing one libcall with another, it then must take this responsibility. This continues the work of properly handling argument extension if required by the target ABI when building a lib call. getOrInsertLibFunc() now does this for all libcalls currently built by any LLVM optimizer. It is expected that when in the future a new optimization builds a new libcall with an integer argument it is to be added to getOrInsertLibFunc() with the proper handling. Note that not all targets have it in their ABI to sign/zero extend integer arguments to the full register width, but this will be done selectively as determined by getExtAttrForI32Param(). Review: Eli Friedman, Nikita Popov, Dávid Bolvanský Differential Revision: https://reviews.llvm.org/D123198	2022-04-19 21:22:07 +02:00
Sanjay Patel	8a9c70fc01	[InstCombine] C0 shift (X add nuw C) --> (C0 shift C) shift X With 'nuw' we can convert the increment of the shift amount into a pre-shift (constant fold) of the shifted constant: https://alive2.llvm.org/ce/z/FkTyR2 Fixes issue #41976	2022-04-19 15:21:34 -04:00
Sanjay Patel	a9aa14e0cb	[InstCombine] add tests for shift-of-add with constants; NFC	2022-04-19 15:21:34 -04:00
Kirill Stoimenov	ab99a414ef	[ASan] Removed checks if the tested functions were emitted. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D124030	2022-04-19 19:20:52 +00:00
Vasileios Porpodas	8d4b5e0833	[NFC][SLP] Improved description of getShallowScore() and getScoreAtLevelRec() Differential Revision: https://reviews.llvm.org/D124027	2022-04-19 12:15:36 -07:00
Yaxun (Sam) Liu	800f26386c	[CUDA][HIP] Fix delete operator for -fopenmp When new operator is called in OpenMP parallel region, delete operator is resolved and checked. Due to similar issue fixed by https://reviews.llvm.org/D121765, when resolving delete operator, the caller was not determined correctly, which results in error as shown in https://godbolt.org/z/jKhd8qKos. This patch fixes the issue in a similar way as https://reviews.llvm.org/D121765 Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D123976	2022-04-19 14:28:03 -04:00
Andrew Litteken	3de29ad209	[IRSim] Ignore debug instructions when creating canonical numbering When constructing canonical relationships between two regions, the first instruction of a basic block from the first region is used to find the corresponding basic block from the second region. However, debug instructions are not included in similarity matching, and therefore do not have a canonical numbering. This patch makes sure to ignore the debug instructions when finding the first instruction in a basic block. Reviewer: paquette Differential Revision: https://reviews.llvm.org/D123903	2022-04-19 13:18:28 -05:00
Fangrui Song	06cafd045e	[Go] Remove PopulateLTOPassManager binding after D123882	2022-04-19 11:16:27 -07:00
Nico Weber	f6b2ddbf38	[compiler-rt] Use ld64 flag -lto_library instead of DYLD_LIBRARY_PATH Makes bin/llvm-lit \ projects/compiler-rt/test/profile/Profile-arm64/instrprof-darwin-dead-strip.c pass on my machine. Without this change, ld64 complains that the bitcode was generated by LLVM 15 while the reader is 13.1 -- the version of Xcode on my machine. Looks like the DYLD_LIBRARY_PATH technique isn't working. -lto_library was added back in ld64-136, which was in Xcode 4.6, which was released over 10 years ago. So relying on it should be safe by now. Differential Revision: https://reviews.llvm.org/D124018	2022-04-19 13:54:57 -04:00
Mehdi Amini	83892d76f4	Print custom assembly on pass failure by default The printer is now resilient to invalid IR and will already automatically fallback to the generic form on invalid IR. Using the generic printer on pass failure was a conservative option before the printer was made failsafe. Reviewed By: lattner, rriddle, jpienaar, bondhugula Differential Revision: https://reviews.llvm.org/D123915	2022-04-19 17:29:08 +00:00
Kadir Cetinkaya	1aa3a54921	[clangd] Don't include version string in update tasks This increases cardinality of span latency metrics. Currently this is shown to the user via file status updates as `Running Update (x)`; after this change we'll only display `Running Update`. This also affects logs in case of a crash, but contents and version number for inputs are printed separately in that case already. Differential Revision: https://reviews.llvm.org/D124013	2022-04-19 19:27:04 +02:00
Mehdi Amini	2d6335421f	Apply clang-tidy fixes for llvm-qualified-auto in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-19 17:20:57 +00:00
Mehdi Amini	f9735be7e2	Apply clang-tidy fixes for performance-unnecessary-value-param in ControlFlowInterfaces.cpp (NFC)	2022-04-19 17:20:57 +00:00
Sanjay Patel	5f7c385498	[InstCombine] add tests for freeze of partial undef vector constants; NFC	2022-04-19 12:41:50 -04:00
Nikita Popov	f2d955a8a4	[OCaml] Fix pass builder test The LTO API has been removed.	2022-04-19 18:34:53 +02:00
Dmitry Makogon	084ad1ebee	[Test] Add more tests showing duplicate PHIs generated by RS4GC (NFC) This adds more tests with derived pointers.	2022-04-19 23:05:50 +07:00
Nikita Popov	dbe6d85b8b	[PPCGCodeGeneration] Look for function instead of function pointer type What this code is actually interested in are references to functions. Use of a function pointer type is being used as an imprecise proxy for that.	2022-04-19 17:59:34 +02:00
Nikita Popov	880014b593	[PPCGCodeGeneration] Avoid another pointer element type access Use an API that returns both the address and the element type, and use that for the load type.	2022-04-19 17:26:33 +02:00
David Green	cc03414125	[PerfectShuffle] Remove unused variables from D123386. NFC	2022-04-19 16:22:04 +01:00
Florian Hahn	4026b718b8	[VPlan] Remove unused SCEV forward declaration (NFC).	2022-04-19 17:16:17 +02:00
Nikita Popov	ee6bd28f23	[PPCGCodeGeneration] Avoid pointer element type access Pass through the ArrayTy instead.	2022-04-19 17:09:34 +02:00
Kirill Stoimenov	64c929ec09	[ASan] Fixed a reporting bug in (load\|store)N functions which would print unknown-crash instead of the proper error message when the data access is unaligned. Reviewed By: kda, eugenis Differential Revision: https://reviews.llvm.org/D123643	2022-04-19 15:07:17 +00:00
Jonas Paulsson	4aa5dc15f0	[SystemZ] Handle SystemZ specific inline assembly address operands. Handle ZQ, ZR, ZS and ZT inline assembly operand constraints. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D110267	2022-04-19 16:55:45 +02:00
Tom Ritter	82f3ed9904	[analyzer] Expose Taint.h to plugins Reviewed By: NoQ, xazax.hun, steakhal Differential Revision: https://reviews.llvm.org/D123155	2022-04-19 16:55:01 +02:00
gbreynoo	42865819b2	[llvm-ar][test] Rename two tests and use correct thin command Two tests used the term "full archive" rather than "regular", these have been updated including the test names. They now also use --thin rather than the deprecated T. This change was made in preparation of D123142. Differential Revision: https://reviews.llvm.org/D123778	2022-04-19 15:13:37 +01:00
Qiongsi Wu	2512a875cc	[clang] Adding Platform/Architecture Specific Resource Header Installation Targets The goal of this patch is to improve distribution build's flexibility to include only applicable header files. Currently, the clang-resource-headers target contains nearly all the files in clang/lib/Headers. Most of these files are platform specific (e.g. immintrin.h is x86 specific). A distribution build will have to either include all the headers for all the platforms, or not include any headers. For example, if a distribution build for powerpc includes the clang-resource-headers target, it will include all the x86 specific headers, even though the x86 specific headers cannot be used. This patch breaks up the clang-resource-headers list into a core list and platform specific lists. With the patch, a distribution build can now include the ppc-resource-headers to include the headers applicable to the powerpc platform. Specifically, one can now have cmake ... LLVM_DISTRIBUTION_COMPONENTS="clang;ppc-resource-headers" ... ../llvm ninja install-distribution then installs the powerpc headers. Similarly, one can do cmake ... LLVM_DISTRIBUTION_COMPONENTS="clang;x86-resource-headers" ... ../llvm to include headers applicable to the x86 platform in a distribution installation. To implement this behaviour, the patch does two things: * It breaks up the long header file list into a core list and platform specific lists. * It adds numerous platform specific installation targets. Differential Revision: https://reviews.llvm.org/D123498	2022-04-19 10:10:07 -04:00
David Spickett	218b5c8394	[clang][AArch64] Remove BTI after setjmp from release notes This is now going into 14.0.2 as 571c7d8f6dae1a8797ae3271c0c09fc648b1940b so will not be new in clang-15.	2022-04-19 13:49:55 +00:00
David Green	73dc996428	[AArch64] Add lane moves to PerfectShuffle tables This teaches the perfect shuffle tables about lane inserts, that can help reduce the cost of many entries. Many of the shuffle masks are one-away from being correct, and a simple lane move can be a lot simpler than trying to use ext/zip/etc. Because they are not exactly like the other masks handled in the perfect shuffle tables, they require special casing to generate them, with a special InsOp Operator. The lane to insert into is encoded as the RHSID, and the move from is grabbed from the original mask. This helps reduce the maximum perfect shuffle entry cost to 3, with many more shuffles being generatable in a single instruction. Differential Revision: https://reviews.llvm.org/D123386	2022-04-19 14:49:50 +01:00
Alexey Bataev	7adfa31bc6	[SLP][NFC]Add a test for reducing same values, NFC.	2022-04-19 06:48:21 -07:00
Alexey Bataev	883571928c	Revert "[SLP]Improve reductions analysis and emission, part 1." This reverts commit `0e1f4d4d3c` to fix a crash reported in PR54976	2022-04-19 06:17:03 -07:00
Kirill Bobyrev	bdf0b757d5	[clangd] IncludeCleaner: Add filtering mechanism This introduces filtering out inclusions based on the resolved path. This mechanism will be important for disabling warnings for headers that we can not diagnose correctly yet. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D123488	2022-04-19 14:56:27 +02:00
Joseph Huber	0f8b8d79af	[OpenMP][Docs] Remove old 14.0 release information Summary: This patch removes the OpenMP sections in the release notes. These will be filled once the release is close and implementations are finalized.	2022-04-19 08:45:51 -04:00
Joseph Huber	944b25aee3	[OpenMP] Make Xopenmp-target args compile-only to silence warnings Summary: Previously we needed the `Xopenmp-target=` option during the linking phase so the old offloading driver knew which items to extract and link for the device. Now that the new driver has become the default this is no longer necessary and will cause a warning to be emitted for the unused argument. This should be silenced to avoid noise.	2022-04-19 08:42:43 -04:00
Arnab Dutta	12f55cac69	[MLIR][GPU] Add canonicalizer for gpu.memcpy Fold away gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D121279	2022-04-19 17:54:00 +05:30
David Green	cc9495f679	[AArch64] Only mark cost 1 perfect shuffles as legal The perfect shuffle tables encode a cost of either 0 (a nop-copy) or 1 (a single instruction) with a cost encoding of 0 in the upper 2 bits. All perfect shuffles with any cost are then marked as legal shuffles though (the maximum encoded cost is 3), which can confuse the DAG combiner into thinking the shuffles are cheaper than they should be. Limiting legal shuffles to single instructions seems to do better in most cases, producing fewer instructions for complex shuffles. There are some cases that now become tbl, which may be better or worse depending on whether the instruction is in a loop and the tbl load can be hoisted out. Differential Revision: https://reviews.llvm.org/D123377	2022-04-19 12:58:55 +01:00
Roy Jacobson	76410040b9	Revert "[Concepts] Fix overload resolution bug with constrained candidates" This reverts commit `454d1df942`.	2022-04-19 07:51:21 -04:00
Florian Hahn	a65f2730d2	[VPlan] Expand induction step in VPlan pre-header. This patch moves SCEV expansion of steps used by VPWidenIntOrFpInductionRecipes to the pre-header using VPExpandSCEVRecipe. This ensures that those steps are expanded while the CFG is in a valid state. Previously, SCEV expansion may happen during vector body code-generation, during which the CFG may be invalid, causing issues with SCEV expansion. Depends on D122095. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D122096	2022-04-19 13:06:39 +02:00
David Green	50af82701c	[AArch64] Cost all perfect shuffles entries as cost 1 A brief introduction to perfect shuffles - AArch64 NEON has a number of shuffle operations - dups, zips, exts, movs etc that can in some way shuffle around the lanes of a vector. Given a shuffle of size 4 with 2 inputs, some shuffle masks can be easily codegen'd to a single instruction. A <0,0,1,1> mask for example is a zip LHS, LHS. This is great, but some masks are not so simple, like a <0,0,1,2>. It turns out we can generate that from zip LHS, <0,2,0,2>, having generated <0,2,0,2> from uzp LHS, LHS, producing the result in 2 instructions. It is not obvious from a given mask how to get there though. So we have a simple program (PerfectShuffle.cpp in the util folder) that can scan through all combinations of 4-element vectors and generate the perfect combination of results needed for each shuffle mask (for some definition of perfect). This is run offline to generate a table that is queried for generating shuffle instructions. (Because the table could get quite big, it is limited to 4 element vectors). In the perfect shuffle tables zip, unz and trn shuffles were being cost as 2, which is higher than needed and skews the perfect shuffle tables to create inefficient combinations. This sets them to 1 and regenerates the tables. The codegen will usually be better and the costs should be more precise (but it can get less second-order re-use of values from multiple shuffles; these cases should be fixed up in subsequent patches). Differential Revision: https://reviews.llvm.org/D123379	2022-04-19 12:05:05 +01:00
Alban Bridonneau	8daffd1dfb	Fix SLP score for out of order contiguous loads SLP uses the distance between pointers to optimize the getShallowScore. However the current code misses the case where we are trying to vectorize for VF=4, and the distance between pointers is 2. In that case the returned score reflects the case of contiguous loads, when it's not actually contiguous. The attached unit tests have 5 loads, where the program order is not the same as the offset order in the GEPs. So, the choice of which 4 loads to bundle together matters. If we pick the first 4, then we can vectorize with VF=4. If we pick the last 4, then we can only vectorize with VF=2. This patch makes a more conservative choice, to consider all distances>1 to not be a case of contiguous load, and give those cases a lower score. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D123516	2022-04-19 11:58:01 +01:00
Dmitry Preobrazhensky	e01dbabdd1	[AMDGPU][MC] Corrected error message "image data size does not match dmask and tfe" Differential Revision: https://reviews.llvm.org/D123929	2022-04-19 13:52:58 +03:00
Balazs Benics	7984189826	[analyzer] Remove HasAlphaDocumentation tablegen enum value D121387 simplified the doc url generation process, so we no longer need the HasAlphaDocumentation enum entry. This patch removes that. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D121459	2022-04-19 12:14:27 +02:00
Balazs Benics	744e2a3e22	[analyzer] ClangSA should tablegen doc urls referring to the main doc page AFAIK we should prefer https://clang.llvm.org/docs/analyzer/checkers.html to https://clang-analyzer.llvm.org/{available_checks,alpha_checks}.html This patch will ensure that the doc urls produced by tablegen for the ClangSA, will use the new url. Nothing else will be changed. Reviewed By: martong, Szelethus, ASDenysPetrov Differential Revision: https://reviews.llvm.org/D121387	2022-04-19 12:14:27 +02:00
Balazs Benics	63c4ca9d14	[analyzer] Turn missing tablegen doc entry of a checker into fatal error It turns out all checkers explicitly mention the `Documentation<>`. It makes sense to demand this, so emit a fatal tablegen error if such happens. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D122244	2022-04-19 12:14:27 +02:00
Balazs Benics	b7c988811d	[analyzer][NFC] Introduce the checker package separator character Reviewed By: martong, ASDenysPetrov Differential Revision: https://reviews.llvm.org/D122243	2022-04-19 12:14:27 +02:00
David Spickett	68e73eaee6	[lldb] Handle empty search string in "memory find" Given that you'd never find empty string, just error. Also add a test that an invalid expr generates an error. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D123793	2022-04-19 09:19:38 +00:00
Sven van Haastregt	f3ee0afc67	[OpenCL] opencl-c.h: Add const to get_image_num_samples Align with the `-fdeclare-opencl-builtins` option and other get_image_* builtins which have the const attribute. Differential Revision: https://reviews.llvm.org/D122728	2022-04-19 10:16:44 +01:00
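
The promotion, truncation and conversion examples quoted in commit e37726beb2 (RangeSet::Factory::castTo) can be checked with a small stand-alone sketch. This is not the analyzer's implementation: the helper names `cast_range` and `cast_set` are hypothetical, and the sketch assumes two's-complement wrapping and fixed widths of char=8, short=16, int=32 and llong=64 bits.

```python
def cast_range(lo, hi, bits, signed):
    """Cast the closed integer range [lo, hi] to a `bits`-wide integer type.

    Returns a sorted list of (lo, hi) ranges in the target type. The result
    has two ranges when the cast wraps around the target type's boundary.
    """
    size = 1 << bits
    tmin = -(size >> 1) if signed else 0
    tmax = tmin + size - 1
    if hi - lo + 1 >= size:
        # The source range covers every value of the target type.
        return [(tmin, tmax)]

    def wrap(v):
        # Two's-complement wrap of a single value into the target type.
        v %= size
        if signed and v >= size >> 1:
            v -= size
        return v

    new_lo, new_hi = wrap(lo), wrap(hi)
    if new_lo <= new_hi:
        return [(new_lo, new_hi)]
    # The range crossed the boundary of the target type: split it in two.
    return [(tmin, new_hi), (new_lo, tmax)]


def cast_set(ranges, bits, signed):
    """Cast a set of ranges, merging overlapping or adjacent results."""
    pieces = sorted(
        piece
        for lo, hi in ranges
        for piece in cast_range(lo, hi, bits, signed)
    )
    merged = []
    for lo, hi in pieces:
        if merged and lo <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], hi))
        else:
            merged.append((lo, hi))
    return merged


# Truncation chain from the commit message: llong -> int -> short -> char.
as_int = cast_set([(4295033088, 4295033130)], 32, True)
as_short = cast_set(as_int, 16, True)
as_char = cast_set(as_short, 8, True)

# Conversion chain from the commit message: char -> uint -> short.
as_uint = cast_set([(-42, 42)], 32, False)
back_to_short = cast_set(as_uint, 16, True)
```

Under these assumptions the truncation chain yields [65792, 65834], then [256, 298], then [0, 42], and the conversion of char [-42, 42] to uint splits into [0, 42] and [4294967254, 4294967295] before merging back to [-42, 42] as a short, matching the values in the commit message.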

