llvm-project

Commit Graph

Author	SHA1	Message	Date
Dávid Bolvanský	9aee07abd0	[InstCombine] X - usub.sat(X, Y) => umin(X, Y) Pattern regressed in LLVM 9 with the introduction of usub.sat. Fixes https://bugs.llvm.org/show_bug.cgi?id=42178#c2 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101184	2021-04-23 21:13:07 +02:00
Pooja Yadav	0764c8af76	[Docs] Updated LLVM_TARGETS_TO_BUILD section in GettingStarted.rst Updated LLVM_TARGETS_TO_BUILD under https://llvm.org/docs/GettingStarted.html#local-llvm-configuration. Differential Revision: https://reviews.llvm.org/D101101	2021-04-24 00:31:43 +05:30
Alexander Belyaev	5291a7a3c7	[mlir] Add block arguments for input/output operands of 'linalg.tiled_loop`. Differential Revision: https://reviews.llvm.org/D101186	2021-04-23 20:55:20 +02:00
Nico Weber	a61891d491	[lld/mac] Support more flags for --reproduce I went through the callers of `readFile()` and `addFile()` in Driver.cpp and checked that the options that use them all get rewritten in the --reproduce response file. -(un)exported_symbols_list and -bundle_loader weren't, so add them. Also spruce up the test for reproduce a bit and actually try linking with the exptracted repro archive. Motivated by the response file in PR50098 complaining abou the -exported_symbols_list path being wrong :) Differential Revision: https://reviews.llvm.org/D101182	2021-04-23 14:40:24 -04:00
Peter Collingbourne	f2819ee6cc	scudo: Work around gcc 8 conversion warning. Should fix: https://lab.llvm.org/buildbot#builders/99/builds/2953	2021-04-23 11:26:12 -07:00
Hongtao Yu	5f2d730073	[CSSPGO] Fix incorrect prorating indirect call distribution factor that leads to target count loss. Pseudo probe distribution factor is used to scale down profile samples to avoid misleading the counts inference due to the usage of "maximum" in `getBlockWeight`. For callsites, the scaling down can come from code duplication prior to the sample profile loader (prelink or postlink), or due to the indirect call promotion in sample loader inliner. This patch fixes an issue in sample loader ICP where the leftover indirect callsite scaling down causes the loss of non-promoted call target samples unexpectedly. While the scaling down is to favor BFI/BPI with accurate an callsite count, it doesn't fit in the current distribution factor that represents code duplication changes. Ideally, we would need two factors, one is for code duplication, the other is for ICP. However this seems over complicated. I'm going to trade one usage (callsite counts) for the other (call target counts). Seeing perf win on one benchmark (mcf) of SPEC2017 with others unchanged. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D100993	2021-04-23 11:09:22 -07:00
Mitch Phillips	f1a47181f5	[hwasan] Remove untagging of kernel-consumed memory Now that page aliasing for x64 has landed, we don't need to worry about passing tagged pointers to libc, and thus D98875 removed it. Unfortunately, we still test on aarch64 devices that don't have the kernel tagged address ABI (https://reviews.llvm.org/D98875#2649269). All the memory that we pass to the kernel in these tests is from global variables. Instead of having architecture-specific untagging mechanisms for this memory, let's just not tag the globals. Reviewed By: eugenis, morehouse Differential Revision: https://reviews.llvm.org/D101121	2021-04-23 11:04:36 -07:00
Fangrui Song	a92dbadffe	[OpenMP] Fix -Wdeprecated-copy	2021-04-23 10:49:19 -07:00
Mitch Phillips	caea37b37e	Revert "[X86][AMX] Try to hoist AMX shapes' def" This reverts commit `90118563ad`. Reason: Broke the MSan buildbots. https://lab.llvm.org/buildbot/#/builders/5/builds/6967/steps/9/logs/stdio More details can be found in the original phabricator review: https://reviews.llvm.org/D101067	2021-04-23 10:42:26 -07:00
Sanjay Patel	e10d7d455d	[InstCombine] fold 'not' of ctpop in parity pattern As discussed in https://llvm.org/PR50096 , we could convert the 'not' into a 'sub' and see the same fold. That's because we already have another demanded bits optimization for 'sub'. We could add a related transform for odd-number-of-type-bits, but that seems unlikely to be practical. https://alive2.llvm.org/ce/z/TWJZXr	2021-04-23 13:23:24 -04:00
Sanjay Patel	d5175005ab	[InstCombine] add test for ctpop; NFC Goes with 2912f42a / PR50096.	2021-04-23 13:23:24 -04:00
Mitch Phillips	a683abe5c0	[Scudo] Use GWP-ASan's aligned allocations and fixup postalloc hooks. This patch does a few cleanup things: 1. The non-standalone scudo has a problem where GWP-ASan allocations may not meet alignment requirements where Scudo was requested to have alignment >= 16. Use the new GWP-ASan API to fix this. 2. The standalone variant loses some debugging information inside of GWP-ASan because we ask GWP-ASan to allocate an aligned size in the frontend. This means reports end up with 'UaF on a 16-byte allocation' for a 1-byte allocation with 16-byte alignment. Also use the new API to fix this. 3. Add post-alloc hooks for GWP-ASan intercepted allocations, and add stats tracking for GWP-ASan allocations. 4. Add a small test that checks the alignment of the frontend allocator, so that it can be used under GWP-ASan torture mode. 5. Add GWP-ASan torture mode as a testing configuration to catch these regressions. Depends on D94830, D95889. Reviewed By: cryptoad Differential Revision: https://reviews.llvm.org/D95884	2021-04-23 10:07:36 -07:00
Chris Hamilton	cae3b70ceb	[PR49761] Fix variadic arg handling in matcher Mishandling of variadic arguments in a function call caused a crash (runtime assert fail) in bugprone-infinite-loop tidy checker. Fix is to limit argument matching to the lesser of the number of variadic params in the prototype or the number of actual args in the call. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D101108	2021-04-23 12:07:14 -05:00
Teresa Johnson	10b781fb03	Mark type test intrinsics as speculatable to fix inline cost There is already code in InlineCost.cpp to identify and ignore ephemeral values (llvm.assume intrinsics and other side-effect free instructions only feeding the assumes). However, because llvm.type.test intrinsics were not marked speculatable, they and any instructions specifically feeding the type test (typically a bitcast) were being counted towards the instruction cost when inlining. This was causing profile matching issues in some cases when enabling -fwhole-program-vtables for whole program devirtualization. According to the language reference, the speculatable attribute means: "the function does not have any effects besides calculating its result and does not have undefined behavior". I see no reason why type tests cannot be marked with this attribute. There are 2 test changes: llvm/test/Transforms/Inline/ephemeral.ll: I added a type test intrinsic here to verify the fix. Also, I found the test was not actually testing what it originally intended. Many of the existing instructions were optimized away by -Oz, and the cost of inlining was negative due to the benefit of removing the call. So I changed the test to simply invoke the inline pass and check the number of instructions computed by InlineCost. I also fixed an instruction that was not actually used anywhere. llvm/test/Transforms/SimplifyCFG/no-md-sink.ll needed to be made more robust to code changes that reordered the metadata. Differential Revision: https://reviews.llvm.org/D101180	2021-04-23 10:02:31 -07:00
Snehasish Kumar	3da0aeea08	[NFC] Use hasSection instead of getSection().empty() Use the optimized check hasSection() instead of calling getSection().empty(). Originally suggested in D101004, but was dropped in the commit.	2021-04-23 10:00:38 -07:00
Stephen Kelly	df82fa8d9b	[AST] Update tests to query for introspection support	2021-04-23 17:51:10 +01:00
Louis Dionne	a3ab5120fd	[libc++] Rewrite the tuple constructors to be strictly Standards conforming This nasty patch rewrites the tuple constructors to match those defined by the Standard. We were previously providing several extensions in those constructors - those extensions are removed by this patch. The issue with those extensions is that we've had numerous bugs filed against us over the years for problems essentially caused by them. As a result, people are unable to use tuple in ways that are blessed by the Standard, all that for the perceived benefit of providing them extensions that they never asked for. Since this is an API break, I communicated it in the release notes. I do not foresee major issues with this break because I don't think the extensions are too widely relied upon, but we can ship it and see if we get complaints before the next LLVM release - that will give us some amount of information regarding how much use these extensions have. Differential Revision: https://reviews.llvm.org/D96523	2021-04-23 12:46:37 -04:00
Jeremy Morse	7deb970efb	Drop a REQUIRES: lldb on a dexter regression test As this is a test that actually gets to operating the debugger, it needs to be limited to scenarios where the debugger is available. (We'll file this in the set of things Dexter doesn't handle gracefully..)	2021-04-23 17:41:38 +01:00
Craig Topper	3064a63b2b	[RISCV] Remove GetVRegNoV0 from the output register class of masked compare pseudo instructions. Theses instructions are allowed to write v0 when they are masked. We'll still never use v0 because of the earlyclobber constraint so this doesn't really help anything. It just makes the definitions correct. While I was there remove an unused multiclass I noticed. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D101118	2021-04-23 09:33:29 -07:00
Craig Topper	fae1d31c09	[RISCV] Have assembler check that the temp register is different than dest register for vmsgeu.vx pseudo. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D101015	2021-04-23 09:33:29 -07:00
Peter Collingbourne	0a5576ecf0	scudo: Store header on deallocation before retagging memory. From a cache perspective it's better to store the header immediately after loading it. If we delay this operation until after we've retagged it's more likely that our header will have been evicted from the cache and we'll need to fetch it again in order to perform the compare-exchange operation. For similar reasons, store the deallocation stack before retagging instead of afterwards. Differential Revision: https://reviews.llvm.org/D101137	2021-04-23 09:32:16 -07:00
Florian Hahn	89c4dda076	[VPlan] Add GraphTraits impl to traverse through VPRegionBlock. This patch adds a new iterator to traverse through VPRegionBlocks and a GraphTraits specialization using the iterator to traverse through VPRegionBlocks. Because there is already a GraphTraits specialization for VPBlockBase * and co, a new VPBlockRecursiveTraversalWrapper helper is introduced. This allows us to provide a new GraphTraits specialization for that type. Users can use the new recursive traversal by using this wrapper. The graph trait visits both the entry block of a region, as well as all its successors. Exit blocks of a region implicitly have their parent region's successors. This ensures all blocks in a region are visited before any blocks in a successor region when doing a reverse post-order traversal of the graph. Reviewed By: a.elovikov Differential Revision: https://reviews.llvm.org/D100175	2021-04-23 17:26:47 +01:00
Johannes Doerfert	17330a3cb1	[OpenMP] Avoid reading uninitialized parallel level values In a last minute change request for `a2dbfb6b72` we introduced a read of the uninitialized parallel level value in SPMD-mode. We go back to initializing the array early and checking for an adjusted level. Found by the miniqmc unit tests: https://cdash.qmcpack.org/CDash/viewTest.php?onlyfailed&buildid=203434 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101123	2021-04-23 11:21:58 -05:00
Johannes Doerfert	cbe8b57a67	[Clang] Allow the combination of loader_uninitialized and address spaces When an object is allocated in a non-default address space we do not need to check for a constructor if it is not initialized and has a trivial constructor (which we won't call then). Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100929	2021-04-23 11:21:52 -05:00
Dávid Bolvanský	3b71de41cc	[libcxx] Fixed build break on buildbots with -Werror	2021-04-23 18:16:38 +02:00
Sebastian Neubauer	3366d81153	[AMDGPU] Save WWM registers in functions The values of registers in inactive lanes needs to be saved during function calls. Save all registers used for whole wave mode, similar to how it is done for VGPRs that are used for SGPR spilling. Differential Revision: https://reviews.llvm.org/D99429 Reapply with fixed tests on window.	2021-04-23 18:09:24 +02:00
Paul C. Anagnostopoulos	d9187f50b9	[TableGen] [docs] Improve BNF for the 'multiclass' statement [NFC]	2021-04-23 12:05:52 -04:00
Nemanja Ivanovic	6725b90a02	[PowerPC] Add vec_ctsl and vec_ctul to altivec.h These are added for compatibility with XLC. They are similar to vec_cts and vec_ctu except that the result is a doubleword vector regardless of the parameter type.	2021-04-23 11:03:38 -05:00
Dave Lee	638d84b60b	[cmake] Configure policy CMP0116 Using `cmake` >=3.20 results in many warnings about this new policy. This change silences the warnings by explicitly declaring use of the "OLD" behavior. This policy currently affects only one place: the `tablegen()` function in `TableGen.cmake`. Differential Revision: https://reviews.llvm.org/D101083	2021-04-23 08:57:40 -07:00
Simon Pilgrim	043bc88dba	[CostModel][X86] Improve v2f32 fadd reduction cost This was being reported as a similar cost to v4f32 when its a lot cheaper (just a shufps+addps).	2021-04-23 16:56:13 +01:00
Nico Weber	fcf59cc917	fix comment typo to cycle bots	2021-04-23 11:45:49 -04:00
Gabor Marton	a7cb951fa4	[Analyzer][StdLibraryFunctionsChecker] Describe arg constraints In this patch, I provide a detailed explanation for each argument constraint. This explanation is added in an extra 'note' tag, which is displayed alongside the warning. Since these new notes describe clearly the constraint, there is no need to provide the number of the argument (e.g. 'Arg3') within the warning. However, I decided to keep the name of the constraint in the warning (but this could be a subject of discussion) in order to be able to identify the different kind of constraint violations easily in a bug database (e.g. CodeChecker). Differential Revision: https://reviews.llvm.org/D101060	2021-04-23 17:27:54 +02:00
Stephen Kelly	35918bcb6f	[AST] Sort introspection results without instantiating other data Avoid string allocation in particular, but also avoid attempting to impose any particular ordering based on formatted results. Differential Revision: https://reviews.llvm.org/D101054	2021-04-23 16:21:01 +01:00
Andrzej Warzynski	2f67267a93	[flang] Switch from %f18 to %flang_fc1 in a test This patch updates the final test that can be shared between the old and the new Flang drivers and that has not been ported yet. %f18 (always expanded as `f18`) is replaced with %flang_fc1 (expanded as either `f18` or `flang-new -fc1`, depending on `FLANG_BUILD_NEW_DRIVER`). This test should've been updated in https://reviews.llvm.org/D100309, but I missed it then. That's because this test contains non-ascii characters and `grep -I %f18` (as well as other grep-like tools) skips it because it's interpreted as a data/binary file. In fact, it's just a text file with non-ascii chars. Since this is an obvious omission from D100309 (reviewed, accepted and merged), I'm sending this without a review to reduce the noise on Phabricator.	2021-04-23 15:10:07 +00:00
Sander de Smalen	f9a50f04ba	[TTI] NFC: Change getIntImmCost[Inst\|Intrin] to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Differential Revision: https://reviews.llvm.org/D100565	2021-04-23 16:06:36 +01:00
Sander de Smalen	43ace8b5ce	[TTI] NFC: Change getScalingFactorCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Differential Revision: https://reviews.llvm.org/D100564	2021-04-23 16:06:36 +01:00
Sander de Smalen	008a072ded	[TTI] NFC: Change getMemcpyCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Differential Revision: https://reviews.llvm.org/D100563	2021-04-23 16:06:35 +01:00
Sander de Smalen	9ba07f37f8	[TTI] NFC: Change getGEPCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Differential Revision: https://reviews.llvm.org/D100562	2021-04-23 16:06:35 +01:00
Sander de Smalen	e0edfa052f	[TTI] NFC: Change getAddressComputationCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Differential Revision: https://reviews.llvm.org/D100561	2021-04-23 16:06:35 +01:00
dfukalov	9ab17a60eb	[TTI] NFC: Use InstructionCost to store ScalarizationCost in IntrinsicCostAttributes. This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D101151	2021-04-23 18:02:00 +03:00
Daniil Fukalov	f79d055791	[TTI] Fix ScalarizationCost initialization. In cases when ScalarizationCostPassed has no value, UINT_MAX is actually used for cost estimation in `return ScalarCalls * ScalarCost + ScalarizationCost`. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D101099	2021-04-23 17:59:59 +03:00
Joe Ellis	c19c0ad681	[AArch64][SVE] Fix bug in lowering of fixed-length integer vector divides The function AArch64TargetLowering::LowerFixedLengthVectorIntDivideToSVE previously assumed the operands were full vectors, but this is not always true. This function would produce bogus if the division operands are not full vectors, resulting in miscompiles when dividing 8-bit or 16-bit vectors. The fix is to perform an extend + div + truncate for non-full vectors, instead of the usual unpacking and unzipping logic. This is an additive change which reduces the non-full integer vector divisions to a pattern recognised by the existing lowering logic. For future reference, an example of code that would miscompile before this patch is below: 1 int8_t foo(unsigned N, int8_t a, int8_t b, int8_t *c) { 2 int8_t result = 0; 3 for (int i = 0; i < N; ++i) { 4 result += (a[i] / b[i]) / c[i]; 5 } 6 return result; 7 } Differential Revision: https://reviews.llvm.org/D100370	2021-04-23 14:55:10 +00:00
Jay Foad	5802cbefc1	[AMDGPU] Fix typo in implicit operand lists Several tests had a typo where they mentioned sgpr17 twice instead of sgpr17 and sgpr27. This had a significant effect on the "scavenge_sgpr_pei_no_sgprs" tests because there was actually an sgpr available, namely sgpr27. Differential Revision: https://reviews.llvm.org/D100960	2021-04-23 15:44:17 +01:00
Sebastian Neubauer	22d99cb63f	Revert "[AMDGPU] Save WWM registers in functions" This reverts commit `91464c30bf`. Seems to break tests on windows.	2021-04-23 16:38:50 +02:00
Piotr Sobczak	83a3395b30	[AMDGPU][NFC] Update auto-gen test Most likely the "glc" was not added to the test when the volatile loads started generating those bits.	2021-04-23 16:33:16 +02:00
Krzysztof Parzyszek	8ebdb58aac	[Hexagon] Remove redundant HVX intrinsic selection patterns, NFC Deleted HexagonMapAsm2IntrinV65.gen.td that wasn't included anywhere, moved V6_vrmpy_rtt patterns to HexagonIntrinsics.td. Touch CMakeLists.txt to force re-cmake (somehow the unused file was listed as a dependency in the generated makefiles).	2021-04-23 09:28:08 -05:00
Sebastian Neubauer	91464c30bf	[AMDGPU] Save WWM registers in functions The values of registers in inactive lanes needs to be saved during function calls. Save all registers used for whole wave mode, similar to how it is done for VGPRs that are used for SGPR spilling. Differential Revision: https://reviews.llvm.org/D99429	2021-04-23 16:09:31 +02:00
Paul C. Anagnostopoulos	9d609adcb0	[TableGen] Correct some comments in the TableGen parser [NFC] Differential Revision: https://reviews.llvm.org/D101088	2021-04-23 09:53:31 -04:00
Simon Pilgrim	c2da0cdff5	[X86] Add Win32/64 mulo test coverage Part of an investigation to solve the windows regressions caused by rG13ec913bdf50	2021-04-23 14:51:42 +01:00
Paul C. Anagnostopoulos	6a067cdb06	[TableGen] [docs] Improve description of NAME in Programmer's Reference Also use "parent class" consistently and add a note about the term. Differential Revision: https://reviews.llvm.org/D100867	2021-04-23 09:49:17 -04:00

1 2 3 4 5 ...

386472 Commits All Branches Search

386472 Commits

All Branches