llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	96352e0a1b	AMDGPU/GlobalISel: Handle LDS with relocations case	2020-01-29 08:18:55 -08:00
Elia Geretto	ab2300bc15	[PassManagerBuilder] Remove global extension when a plugin is unloaded This commit fixes PR39321. GlobalExtensions is not guaranteed to be destroyed when optimizer plugins are unloaded. If it is indeed destroyed after a plugin is dlclose-d, the destructor of the corresponding ExtensionFn is not mapped anymore, causing a call to unmapped memory during destruction. This commit guarantees that extensions coming from external plugins are removed from GlobalExtensions when the plugin is unloaded if GlobalExtensions has not been destroyed yet. Differential Revision: https://reviews.llvm.org/D71959	2020-01-29 16:15:45 +00:00
Connor Abbott	87d98c1495	AMDGPU: Fix handling of infinite loops in fragment shaders Summary: Due to the fact that kill is just a normal intrinsic, even though it's supposed to terminate the thread, we can end up with provably infinite loops that are actually supposed to end successfully. The AMDGPUUnifyDivergentExitNodes pass breaks up these loops, but because there's no obvious place to make the loop branch to, it just makes it return immediately, which skips the exports that are supposed to happen at the end and hangs the GPU if all the threads end up being killed. While it would be nice if the fact that kill terminates the thread were modeled in the IR, I think that the structurizer as-is would make a mess if we did that when the kill is inside control flow. For now, we just add a null export at the end to make sure that it always exports something, which fixes the immediate problem without penalizing the more common case. This means that we sometimes do two "done" exports when only some of the threads enter the discard loop, but from tests the hardware seems ok with that. This fixes dEQP-VK.graphicsfuzz.while-inside-switch with radv. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70781	2020-01-29 17:13:25 +01:00
Matt Arsenault	94e8ef4d4c	AMDGPU/GlobalISel: Look through copies for source modifiers When all VOP instructions are legalized to VGPRs, any SGPR source modifiers will have a copy in the way.	2020-01-29 08:08:13 -08:00
Stanislav Mekhanoshin	c2ad7ee1a9	[AMDGPU] override isHighLatencyDef SIMachineScheduler uses isHighLatencyInstruction with the same sematincs, but TargetInstrInfo has virtual isHighLatencyDef method, so override it instead. Added FLAT to the list of high latency opcodes and a check for mayLoad since stores are not technically high latency in terms of data dependency. This change did not produce any visible impact on our tests. Differential Revision: https://reviews.llvm.org/D73582	2020-01-29 08:01:29 -08:00
Matt Arsenault	f717483acd	GlobalISel: Assert on invalid bitcast in MIRBuilder The other casts validate, so this should too.	2020-01-29 07:49:39 -08:00
Matt Arsenault	752e2e245a	AMDGPU/GlobalISel: Rewrite fadd select tests Convert to the style most others use with one test instruction per function, and use an implicit use to ensure the result register class is constrained. Change-Id: I6109148b0e3c80aa5535796a37abca583c19a936	2020-01-29 07:49:38 -08:00
Benjamin Kramer	01213f9070	[clang-tidy] Initialize token before handing it to the lexer Found by msan.	2020-01-29 16:48:57 +01:00
Simon Pilgrim	79748add70	Fix MSVC lamdba default capture mode warning. NFCI.	2020-01-29 15:47:04 +00:00
Hans Wennborg	31e07692d7	Work around PR44697 in CrashRecoveryContext	2020-01-29 16:35:07 +01:00
Matt Arsenault	24ab761a60	LLT: Add changeNumElements This is the element analog of changeElementType/changeElementSize	2020-01-29 07:32:07 -08:00
LLVM GN Syncbot	df8f2774b6	[gn build] Port `9a08a3fab9`	2020-01-29 15:15:45 +00:00
Connor Abbott	08b205bb48	Revert "AMDGPU: Fix handling of infinite loops in fragment shaders" This reverts commit `0994c485e6`.	2020-01-29 16:14:52 +01:00
Connor Abbott	13ab22ab22	Revert "AMDGPU: Fix AMDGPUUnifyDivergentExitNodes with no normal returns" This reverts commit `323bfde20c`.	2020-01-29 16:14:49 +01:00
Adam Balogh	9a08a3fab9	[Analyzer] Split container modeling from iterator modeling Iterator modeling depends on container modeling, but not vice versa. This enables the possibility to arrange these two modeling checkers into separate layers. There are several advantages for doing this: the first one is that this way we can keep the respective modeling checkers moderately simple and small. Furthermore, this enables creation of checkers on container operations which only depend on the container modeling. Thus iterator modeling can be disabled together with the iterator checkers if they are not needed. Since many container operations also affect iterators, container modeling also uses the iterator library: it creates iterator positions upon calling the `begin()` or `end()` method of a containter (but propagation of the abstract position is left to the iterator modeling), shifts or invalidates iterators according to the rules upon calling a container modifier and rebinds the iterator to a new container upon `std::move()`. Iterator modeling propagates the abstract iterator position, handles the relations between iterator positions and models iterator operations such as increments and decrements. Differential Revision: https://reviews.llvm.org/D73547	2020-01-29 16:10:45 +01:00
Whitney Tsang	da58e68fdf	[LoopFusion] Move instructions from FC1.Preheader to FC0.Preheader when proven safe. Summary: Currently LoopFusion give up when the second loop nest preheader is not empty. For example: for (int i = 0; i < 100; ++i) {} x+=1; for (int i = 0; i < 100; ++i) {} The above example should be safe to fuse. This PR moves instructions in FC1 preheader (e.g. x+=1; ) to FC0 preheader, which then LoopFusion is able to fuse them. Reviewer: kbarton, Meinersbur, jdoerfert, dmgreen, fhahn, hfinkel, bmahjour, etiotto Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D71821	2020-01-29 15:06:11 +00:00
Kazushi (Jam) Marukawa	0bec0e7151	[VE] udiv/sdiv/urem/srem/mul isel patterns Summary: udiv/sdiv/urem/srem/mul integer isel patterns and tests. Pretend for now that integer division were always cheap in HW. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73623	2020-01-29 15:59:50 +01:00
Guillaume Chatelet	c2dcdf95eb	[libc] Fix benchmarks CMakeLists.txt Summary: This is a follow up on https://reviews.llvm.org/rGaba80d0734d1#886881. `target_link_options` requires CMake>=3.13. Reviewers: abrachet Subscribers: mgorny, MaskRay, tschuett, libc-commits Tags: #libc-project Differential Revision: https://reviews.llvm.org/D73452	2020-01-29 15:56:47 +01:00
Nicolas Vasilache	ea1e3369f7	[mlir][Linalg] Introduce folding patterns to remove certain MemRefCastOp Summary: Canonicalization and folding patterns in StandardOps may interfere with the needs of Linalg. This revision introduces specific foldings for dynamic memrefs that can be proven to be static. Very concretely: Determines whether it is possible to fold it away in the parent Linalg op: ```mlir %1 = memref_cast %0 : memref<8x16xf32> to memref<?x?xf32> %2 = linalg.slice %1 ... : memref<?x?xf32> ... // or %1 = memref_cast %0 : memref<8x16xf32, affine_map<(i, j)->(16 * i + j)>> to memref<?x?xf32> linalg.generic(%1 ...) : memref<?x?xf32> ... ``` into ```mlir %2 = linalg.slice %0 ... : memref<8x16xf32> ... // or linalg.generic(%0 ... : memref<8x16xf32, affine_map<(i, j)->(16 * i + j)>> ``` Reviewers: ftynse, aartbik, jsetoain, tetuante, asaadaldien Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73565	2020-01-29 09:52:51 -05:00
Matt Arsenault	02adfb5155	AMDGPU/GlobalISel: Manually select scalar f64 G_FNEG This should be no problem to support with a pattern, but it turns out there are just too many yaks to shave. The main problem is in the DAG emitter, which I have no desire to sink effort into fixing. If we had a bit to disable patterns in the DAG importer, fixing the GlobalISelEmitter is more manageable.	2020-01-29 06:49:16 -08:00
Matt Arsenault	a9af1dc34d	Analysis: Add max recursison to isDereferenceableAndAlignedPointer Fixes stack overflow in test/CodeGen/X86/large-gep-chain.ll when store lowering starts adding dereferenceable flags.	2020-01-29 06:48:24 -08:00
Matt Arsenault	c5c1bb3374	GlobalISel: Lower G_WRITE_REGISTER	2020-01-29 06:48:24 -08:00
Mikael Holmén	2103e08b3f	More fixes of implicit std::string conversions	2020-01-29 15:29:46 +01:00
Connor Abbott	323bfde20c	AMDGPU: Fix AMDGPUUnifyDivergentExitNodes with no normal returns Summary: The code was assuming in a few places that if there was only one exit from the function that it was a normal return, which is invalid. It could be an infinite loop, in which case we still need to insert the usual fake edge so that the null export happens. This fixes shaders that end with an infinite loop that discards. Reviewers: arsenm, nhaehnle, critson Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71192	2020-01-29 15:08:46 +01:00
Connor Abbott	0994c485e6	AMDGPU: Fix handling of infinite loops in fragment shaders Summary: Due to the fact that kill is just a normal intrinsic, even though it's supposed to terminate the thread, we can end up with provably infinite loops that are actually supposed to end successfully. The AMDGPUUnifyDivergentExitNodes pass breaks up these loops, but because there's no obvious place to make the loop branch to, it just makes it return immediately, which skips the exports that are supposed to happen at the end and hangs the GPU if all the threads end up being killed. While it would be nice if the fact that kill terminates the thread were modeled in the IR, I think that the structurizer as-is would make a mess if we did that when the kill is inside control flow. For now, we just add a null export at the end to make sure that it always exports something, which fixes the immediate problem without penalizing the more common case. This means that we sometimes do two "done" exports when only some of the threads enter the discard loop, but from tests the hardware seems ok with that. This fixes dEQP-VK.graphicsfuzz.while-inside-switch with radv. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70781	2020-01-29 15:08:46 +01:00
Haojian Wu	fce8983a3c	[clangd] Remove the temporary alias for clangd::DiagnosticConsumer. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73619	2020-01-29 14:50:58 +01:00
Sanjay Patel	87f6314f8c	[InstCombine] canonicalize splat shuffle after cmp cmp (splat V1, M), SplatC --> splat (cmp V1, SplatC'), M As discussed in PR44588: https://bugs.llvm.org/show_bug.cgi?id=44588 ...we try harder to push shuffles after binops than after compares. This patch handles the special (but presumably most common case) of splat shuffles. If both operands are splats, then we can do the comparison on the non-splat inputs followed by splat of the compare. That should take care of the regression noted in D73411. There's another potential fold requested in PR37463 to scalarize the compare, but that's another patch (and it's not clear if we can do that without the ability to undo it later): https://bugs.llvm.org/show_bug.cgi?id=37463 Differential Revision: https://reviews.llvm.org/D73575	2020-01-29 08:34:29 -05:00
Sanne Wouda	2939fc13c8	[AArch64] Add IR intrinsics for sq(r)dmulh_lane(q) Summary: Currently, sqdmulh_lane and friends from the ACLE (implemented in arm_neon.h), are represented in LLVM IR as a (by vector) sqdmulh and a vector of (repeated) indices, like so: %shuffle = shufflevector <4 x i16> %v, <4 x i16> undef, <4 x i32> <i32 3, i32 3, i32 3, i32 3> %vqdmulh2.i = tail call <4 x i16> @llvm.aarch64.neon.sqdmulh.v4i16(<4 x i16> %a, <4 x i16> %shuffle) When %v's values are known, the shufflevector is optimized away and we are no longer able to select the lane variant of sqdmulh in the backend. This defeats a (hand-coded) optimization that packs several constants into a single vector and uses the lane intrinsics to reduce register pressure and trade-off materialising several constants for a single vector load from the constant pool, like so: int16x8_t v = {2,3,4,5,6,7,8,9}; a = vqdmulh_laneq_s16(a, v, 0); b = vqdmulh_laneq_s16(b, v, 1); c = vqdmulh_laneq_s16(c, v, 2); d = vqdmulh_laneq_s16(d, v, 3); [...] In one microbenchmark from libjpeg-turbo this accounts for a 2.5% to 4% performance difference. We could teach the compiler to recover the lane variants, but this would likely require its own pass. (Alternatively, "volatile" could be used on the constants vector, but this is a bit ugly.) This patch instead implements the following LLVM IR intrinsics for AArch64 to maintain the original structure through IR optmization and into instruction selection: - sqdmulh_lane - sqdmulh_laneq - sqrdmulh_lane - sqrdmulh_laneq. These 'lane' variants need an additional register class. The second argument must be in the lower half of the 64-bit NEON register file, but only when operating on i16 elements. Note that the existing patterns for shufflevector and sqdmulh into sqdmulh_lane (etc.) remain, so code that does not rely on NEON intrinsics to generate these instructions is not affected. 
This patch also changes clang to emit these IR intrinsics for the corresponding NEON intrinsics (AArch64 only). Reviewers: SjoerdMeijer, dmgreen, t.p.northover, rovka, rengolin, efriedma Reviewed By: efriedma Subscribers: kristof.beyls, hiraditya, jdoerfert, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71469	2020-01-29 13:25:23 +00:00
Sjoerd Meijer	f719b0ba13	[MVE][MC] evaluateBranch: add missing MVE opcode This adds some missing MVE opcodes to evaluateBranch, which results in llvm-objdump being able to print the PC relative branch target as an annotation. Differential Revision: https://reviews.llvm.org/D73553	2020-01-29 13:19:45 +00:00
Kazushi (Jam) Marukawa	6b587ee23c	[VE] Isel patterns for fp32/64 and i32/64 conversion Summary: fp32/64 <> signed/unsigned i32/64 conversion isel patterns and tests (This patch depends on `fsub` implemented by https://reviews.llvm.org/D73540 ) Reviewers: arsenm, craig.topper, rengolin, k-ishizaka Reviewed By: arsenm Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Tags: #ve, #llvm Differential Revision: https://reviews.llvm.org/D73544	2020-01-29 14:10:22 +01:00
Sanne Wouda	cbc45e4e75	Regenerate aarch64-neon-2velem.c CHECK lines	2020-01-29 13:03:27 +00:00
Sanne Wouda	4ec2a26732	Fix clang test build	2020-01-29 13:03:27 +00:00
Karasev Nikita	d5dfd1350e	Add TagDecl AST matcher	2020-01-29 07:58:31 -05:00
Georgii Rymar	e6b55cbcdc	[yaml2obj][obj2yaml] - Add lost test cases. It is a part of https://reviews.llvm.org/D71872 which was lost somehow during relanding after being reverted: https://reviews.llvm.org/rG7570d387c21935b58afa67cb9ee17250e38721fa	2020-01-29 15:40:35 +03:00
Martin Probst	a324fcf1ae	clang-format: insert trailing commas into containers. Summary: This change adds an option to insert trailing commas into container literals. For example, in JavaScript: const x = [ a, b, ^~~~~ inserted if missing. ] This is implemented as a seperate post-processing pass after formatting (because formatting might change whether the container literal does or does not wrap). This keeps the code relatively simple and orthogonal, though it has the notable drawback that the newly inserted comma is not taken into account for formatting decisions (e.g. it might exceed the 80 char limit). To avoid exceeding the ColumnLimit, a comma is only inserted if it fits into the limit. Trailing comma insertion conceptually conflicts with argument bin-packing: inserting a comma disables bin-packing, so we cannot do both. clang-format rejects FormatStyle configurations that do both with this change. Reviewers: krasimir, MyDeveloperDay Subscribers: cfe-commits Tags: #clang	2020-01-29 13:23:54 +01:00
Kerry McLaughlin	3cf80822a9	[AArch64][SVE] Add SVE2 intrinsics for uniform DSP operations Summary: Implements the following intrinsics: - sqrdmlah, sqrdmlsh, sqrdmulh & sqdmulh - [s\|u]hadd, [s\|u]hsub, [s\|u]rhadd & [s\|u]hsubr - urecpe, ursqrte, sqabs & sqneg Reviewers: sdesmalen, efriedma, dancgr, cameron.mcinally Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73493	2020-01-29 12:03:15 +00:00
Sam Parker	dc0d84f09e	[NFC][ARM] Add test	2020-01-29 06:59:21 -05:00
Haojian Wu	17fadeffcc	[clangd][vscode] Update lsp dependencies to pickup the progress support in LSP 3.15 Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73612	2020-01-29 12:58:53 +01:00
Haojian Wu	e864f93766	[clangd] Replace raw lexer code with token buffer in prepare rename. Summary: there is a slight behavior change in this patch: - before: `in^t a;`, returns our internal error message (no symbol at given location) - after: `in^t a, returns null, and client displays their message (e.g. e.g. "the element can't be renamed" in vscode). both are sensible according LSP, and we'd save one `rename` call in the later case. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73610	2020-01-29 12:57:18 +01:00
Sam McCall	bcb3e42fdf	[clangd] Go-to-definition on 'override' jumps to overridden method(s) Reviewers: kadircet Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73367	2020-01-29 12:43:52 +01:00
Sam McCall	6f6952780b	[clangd] add CODE_OWNERS Reviewers: klimek Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73537	2020-01-29 12:43:19 +01:00
Peter Smith	0b4a047bfb	[LLD][ELF][ARM] Do not substitute BL/BLX for non STT_FUNC symbols. D73474 disabled the generation of interworking thunks for branch relocations to non STT_FUNC symbols. This patch handles the case of BL and BLX instructions to non STT_FUNC symbols. LLD would normally look at the state of the caller and the callee and write a BL if the states are the same and a BLX if the states are different. This patch disables BL/BLX substitution when the destination symbol does not have type STT_FUNC. This brings our behavior in line with GNU ld which may prevent difficult to diagnose runtime errors when switching to lld. Differential Revision: https://reviews.llvm.org/D73542	2020-01-29 11:42:25 +00:00
David Truby	63c8972562	[MLIR] Add OpenMP dialect with barrier operation Summary: Barrier is a simple operation that takes no arguments and returns nothing, but implies a side effect (synchronization of all threads) Reviewers: jdoerfert Subscribers: mgorny, guansong, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72400	2020-01-29 11:34:58 +00:00
Kadir Cetinkaya	7830c2d44f	[clangd] Get rid of delayed template parsing Summary: No need to pass fno-delayed-template-parsing as the opposite flag is only passed to cc1 when abi is set to msvc. Sending as a follow-up to D73613 to keep changes in the release branch minimal. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73615	2020-01-29 12:12:45 +01:00
Kadir Cetinkaya	55b0e9c9d5	[clangd][Hover] Make tests hermetic by setting target triplet Summary: Fixes https://bugs.llvm.org/show_bug.cgi?id=44696 Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73613	2020-01-29 12:12:45 +01:00
Benjamin Kramer	0ee4b027d3	Fix an implicit conversion in clang-tidy. GCC 5 complains about it.	2020-01-29 12:05:35 +01:00
Momchil Velikov	ac21535460	[ARM] Add documentation for -march= and -mfpu= command line options Differential Revision: https://reviews.llvm.org/D73459	2020-01-29 10:39:01 +00:00
Kerry McLaughlin	bd33a46213	[AArch64][SVE] Add SVE2 intrinsics for pairwise arithmetic Summary: Implements the following intrinsics: - addp - smaxp, sminp, umaxp & uminp - sadalp & uadalp Reviewers: dancgr, efriedma, sdesmalen, c-rhodes Reviewed By: c-rhodes Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73347	2020-01-29 10:31:31 +00:00
James Henderson	7116e431c0	[DebugInfo] Make most debug line prologue errors non-fatal to parsing Many of the debug line prologue errors are not inherently fatal. In most cases, we can make reasonable assumptions and carry on. This patch does exactly that. In the case of length problems, the approach of "assume stated length is correct" is taken which means the offset might need adjusting. This is a relanding of `b94191fe`, fixing an LLD test and the LLDB build. Reviewed by: dblaikie, labath Differential Revision: https://reviews.llvm.org/D72158	2020-01-29 10:23:41 +00:00
Pavel Labath	7a6ebb5ba3	[lldb] More windows StringRef fixes I don't have a windows build around, so I am just going by the buildbot messages.	2020-01-29 11:15:20 +01:00

341086 Commits
