llvm-project

Commit Graph

Author	SHA1	Message	Date
Connor Abbott	323bfde20c	AMDGPU: Fix AMDGPUUnifyDivergentExitNodes with no normal returns Summary: The code was assuming in a few places that if there was only one exit from the function that it was a normal return, which is invalid. It could be an infinite loop, in which case we still need to insert the usual fake edge so that the null export happens. This fixes shaders that end with an infinite loop that discards. Reviewers: arsenm, nhaehnle, critson Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71192	2020-01-29 15:08:46 +01:00
Connor Abbott	0994c485e6	AMDGPU: Fix handling of infinite loops in fragment shaders Summary: Due to the fact that kill is just a normal intrinsic, even though it's supposed to terminate the thread, we can end up with provably infinite loops that are actually supposed to end successfully. The AMDGPUUnifyDivergentExitNodes pass breaks up these loops, but because there's no obvious place to make the loop branch to, it just makes it return immediately, which skips the exports that are supposed to happen at the end and hangs the GPU if all the threads end up being killed. While it would be nice if the fact that kill terminates the thread were modeled in the IR, I think that the structurizer as-is would make a mess if we did that when the kill is inside control flow. For now, we just add a null export at the end to make sure that it always exports something, which fixes the immediate problem without penalizing the more common case. This means that we sometimes do two "done" exports when only some of the threads enter the discard loop, but from tests the hardware seems ok with that. This fixes dEQP-VK.graphicsfuzz.while-inside-switch with radv. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70781	2020-01-29 15:08:46 +01:00
Haojian Wu	fce8983a3c	[clangd] Remove the temporary alias for clangd::DiagnosticConsumer. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73619	2020-01-29 14:50:58 +01:00
Sanjay Patel	87f6314f8c	[InstCombine] canonicalize splat shuffle after cmp cmp (splat V1, M), SplatC --> splat (cmp V1, SplatC'), M As discussed in PR44588: https://bugs.llvm.org/show_bug.cgi?id=44588 ...we try harder to push shuffles after binops than after compares. This patch handles the special (but presumably most common case) of splat shuffles. If both operands are splats, then we can do the comparison on the non-splat inputs followed by splat of the compare. That should take care of the regression noted in D73411. There's another potential fold requested in PR37463 to scalarize the compare, but that's another patch (and it's not clear if we can do that without the ability to undo it later): https://bugs.llvm.org/show_bug.cgi?id=37463 Differential Revision: https://reviews.llvm.org/D73575	2020-01-29 08:34:29 -05:00
Sanne Wouda	2939fc13c8	[AArch64] Add IR intrinsics for sq(r)dmulh_lane(q) Summary: Currently, sqdmulh_lane and friends from the ACLE (implemented in arm_neon.h), are represented in LLVM IR as a (by vector) sqdmulh and a vector of (repeated) indices, like so: %shuffle = shufflevector <4 x i16> %v, <4 x i16> undef, <4 x i32> <i32 3, i32 3, i32 3, i32 3> %vqdmulh2.i = tail call <4 x i16> @llvm.aarch64.neon.sqdmulh.v4i16(<4 x i16> %a, <4 x i16> %shuffle) When %v's values are known, the shufflevector is optimized away and we are no longer able to select the lane variant of sqdmulh in the backend. This defeats a (hand-coded) optimization that packs several constants into a single vector and uses the lane intrinsics to reduce register pressure and trade-off materialising several constants for a single vector load from the constant pool, like so: int16x8_t v = {2,3,4,5,6,7,8,9}; a = vqdmulh_laneq_s16(a, v, 0); b = vqdmulh_laneq_s16(b, v, 1); c = vqdmulh_laneq_s16(c, v, 2); d = vqdmulh_laneq_s16(d, v, 3); [...] In one microbenchmark from libjpeg-turbo this accounts for a 2.5% to 4% performance difference. We could teach the compiler to recover the lane variants, but this would likely require its own pass. (Alternatively, "volatile" could be used on the constants vector, but this is a bit ugly.) This patch instead implements the following LLVM IR intrinsics for AArch64 to maintain the original structure through IR optmization and into instruction selection: - sqdmulh_lane - sqdmulh_laneq - sqrdmulh_lane - sqrdmulh_laneq. These 'lane' variants need an additional register class. The second argument must be in the lower half of the 64-bit NEON register file, but only when operating on i16 elements. Note that the existing patterns for shufflevector and sqdmulh into sqdmulh_lane (etc.) remain, so code that does not rely on NEON intrinsics to generate these instructions is not affected. This patch also changes clang to emit these IR intrinsics for the corresponding NEON intrinsics (AArch64 only). Reviewers: SjoerdMeijer, dmgreen, t.p.northover, rovka, rengolin, efriedma Reviewed By: efriedma Subscribers: kristof.beyls, hiraditya, jdoerfert, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71469	2020-01-29 13:25:23 +00:00
Sjoerd Meijer	f719b0ba13	[MVE][MC] evaluateBranch: add missing MVE opcode This adds some missing MVE opcodes to evaluateBranch, which results in llvm-objdump being able to print the PC relative branch target as an annotation. Differential Revision: https://reviews.llvm.org/D73553	2020-01-29 13:19:45 +00:00
Kazushi (Jam) Marukawa	6b587ee23c	[VE] Isel patterns for fp32/64 and i32/64 conversion Summary: fp32/64 <> signed/unsigned i32/64 conversion isel patterns and tests (This patch depends on `fsub` implemented by https://reviews.llvm.org/D73540 ) Reviewers: arsenm, craig.topper, rengolin, k-ishizaka Reviewed By: arsenm Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Tags: #ve, #llvm Differential Revision: https://reviews.llvm.org/D73544	2020-01-29 14:10:22 +01:00
Sanne Wouda	cbc45e4e75	Regenerate aarch64-neon-2velem.c CHECK lines	2020-01-29 13:03:27 +00:00
Sanne Wouda	4ec2a26732	Fix clang test build	2020-01-29 13:03:27 +00:00
Karasev Nikita	d5dfd1350e	Add TagDecl AST matcher	2020-01-29 07:58:31 -05:00
Georgii Rymar	e6b55cbcdc	[yaml2obj][obj2yaml] - Add lost test cases. It is a part of https://reviews.llvm.org/D71872 which was lost somehow during relanding after being reverted: https://reviews.llvm.org/rG7570d387c21935b58afa67cb9ee17250e38721fa	2020-01-29 15:40:35 +03:00
Martin Probst	a324fcf1ae	clang-format: insert trailing commas into containers. Summary: This change adds an option to insert trailing commas into container literals. For example, in JavaScript: const x = [ a, b, ^~~~~ inserted if missing. ] This is implemented as a seperate post-processing pass after formatting (because formatting might change whether the container literal does or does not wrap). This keeps the code relatively simple and orthogonal, though it has the notable drawback that the newly inserted comma is not taken into account for formatting decisions (e.g. it might exceed the 80 char limit). To avoid exceeding the ColumnLimit, a comma is only inserted if it fits into the limit. Trailing comma insertion conceptually conflicts with argument bin-packing: inserting a comma disables bin-packing, so we cannot do both. clang-format rejects FormatStyle configurations that do both with this change. Reviewers: krasimir, MyDeveloperDay Subscribers: cfe-commits Tags: #clang	2020-01-29 13:23:54 +01:00
Kerry McLaughlin	3cf80822a9	[AArch64][SVE] Add SVE2 intrinsics for uniform DSP operations Summary: Implements the following intrinsics: - sqrdmlah, sqrdmlsh, sqrdmulh & sqdmulh - [s\|u]hadd, [s\|u]hsub, [s\|u]rhadd & [s\|u]hsubr - urecpe, ursqrte, sqabs & sqneg Reviewers: sdesmalen, efriedma, dancgr, cameron.mcinally Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73493	2020-01-29 12:03:15 +00:00
Sam Parker	dc0d84f09e	[NFC][ARM] Add test	2020-01-29 06:59:21 -05:00
Haojian Wu	17fadeffcc	[clangd][vscode] Update lsp dependencies to pickup the progress support in LSP 3.15 Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73612	2020-01-29 12:58:53 +01:00
Haojian Wu	e864f93766	[clangd] Replace raw lexer code with token buffer in prepare rename. Summary: there is a slight behavior change in this patch: - before: `in^t a;`, returns our internal error message (no symbol at given location) - after: `in^t a, returns null, and client displays their message (e.g. e.g. "the element can't be renamed" in vscode). both are sensible according LSP, and we'd save one `rename` call in the later case. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73610	2020-01-29 12:57:18 +01:00
Sam McCall	bcb3e42fdf	[clangd] Go-to-definition on 'override' jumps to overridden method(s) Reviewers: kadircet Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73367	2020-01-29 12:43:52 +01:00
Sam McCall	6f6952780b	[clangd] add CODE_OWNERS Reviewers: klimek Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73537	2020-01-29 12:43:19 +01:00
Peter Smith	0b4a047bfb	[LLD][ELF][ARM] Do not substitute BL/BLX for non STT_FUNC symbols. D73474 disabled the generation of interworking thunks for branch relocations to non STT_FUNC symbols. This patch handles the case of BL and BLX instructions to non STT_FUNC symbols. LLD would normally look at the state of the caller and the callee and write a BL if the states are the same and a BLX if the states are different. This patch disables BL/BLX substitution when the destination symbol does not have type STT_FUNC. This brings our behavior in line with GNU ld which may prevent difficult to diagnose runtime errors when switching to lld. Differential Revision: https://reviews.llvm.org/D73542	2020-01-29 11:42:25 +00:00
David Truby	63c8972562	[MLIR] Add OpenMP dialect with barrier operation Summary: Barrier is a simple operation that takes no arguments and returns nothing, but implies a side effect (synchronization of all threads) Reviewers: jdoerfert Subscribers: mgorny, guansong, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72400	2020-01-29 11:34:58 +00:00
Kadir Cetinkaya	7830c2d44f	[clangd] Get rid of delayed template parsing Summary: No need to pass fno-delayed-template-parsing as the opposite flag is only passed to cc1 when abi is set to msvc. Sending as a follow-up to D73613 to keep changes in the release branch minimal. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73615	2020-01-29 12:12:45 +01:00
Kadir Cetinkaya	55b0e9c9d5	[clangd][Hover] Make tests hermetic by setting target triplet Summary: Fixes https://bugs.llvm.org/show_bug.cgi?id=44696 Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73613	2020-01-29 12:12:45 +01:00
Benjamin Kramer	0ee4b027d3	Fix an implicit conversion in clang-tidy. GCC 5 complains about it.	2020-01-29 12:05:35 +01:00
Momchil Velikov	ac21535460	[ARM] Add documentation for -march= and -mfpu= command line options Differential Revision: https://reviews.llvm.org/D73459	2020-01-29 10:39:01 +00:00
Kerry McLaughlin	bd33a46213	[AArch64][SVE] Add SVE2 intrinsics for pairwise arithmetic Summary: Implements the following intrinsics: - addp - smaxp, sminp, umaxp & uminp - sadalp & uadalp Reviewers: dancgr, efriedma, sdesmalen, c-rhodes Reviewed By: c-rhodes Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73347	2020-01-29 10:31:31 +00:00
James Henderson	7116e431c0	[DebugInfo] Make most debug line prologue errors non-fatal to parsing Many of the debug line prologue errors are not inherently fatal. In most cases, we can make reasonable assumptions and carry on. This patch does exactly that. In the case of length problems, the approach of "assume stated length is correct" is taken which means the offset might need adjusting. This is a relanding of `b94191fe`, fixing an LLD test and the LLDB build. Reviewed by: dblaikie, labath Differential Revision: https://reviews.llvm.org/D72158	2020-01-29 10:23:41 +00:00
Pavel Labath	7a6ebb5ba3	[lldb] More windows StringRef fixes I don't have a windows build around, so I am just going by the buildbot messages.	2020-01-29 11:15:20 +01:00
Kazushi (Jam) Marukawa	f6bb58542a	[VE] fp32/64 fadd/fsub/fdiv/fmul isel patterns Summary: fp32/64 fadd/fsub/fdiv/fmul isel patterns and tests. Reviewers: arsenm, craig.topper, rengolin, k-ishizaka Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D73540	2020-01-29 11:00:56 +01:00
David Stenberg	6a2413c435	[ARM64] Debug info for structure argument missing DW_AT_location Summary: Prevent eliminating dbg_val due to COPY. Fixes this https://bugs.llvm.org/show_bug.cgi?id=40709 Patch by: Kamlesh Kumar (kamleshbhalui) Reviewers: aprantl, dblaikie, vsk, dsanders Reviewed By: dsanders Subscribers: dstenb, kristof.beyls, hiraditya, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D73159	2020-01-29 10:56:23 +01:00
Benjamin Kramer	4e3f4f03f3	[ASTMatchers] StringRef'ify hasName This was just inconvenient, and we make a copy anyways.	2020-01-29 10:53:08 +01:00
Simon Moll	93bbe7b2b5	[VE][fix] (more) explicit StringRef to std::string	2020-01-29 10:46:59 +01:00
Jay Foad	ad08c01d6c	[AMDGPU] Simplify DS and SM cases in getMemOperandsWithOffset Summary: This removes a couple of unnecessary isReg checks, now that memOpsHaveSameBasePtr can handle FI operands, but is otherwise NFC. Reviewers: arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73485	2020-01-29 09:43:24 +00:00
Simon Moll	d53840ad39	[VE][fix] Explicit StringRef to std::string conversion Adapt to changes of "[ADT] Make StringRef's std::string conversion operator explicit" (`777180a32`).	2020-01-29 10:34:28 +01:00
Haojian Wu	0d893fda43	[clangd] Add a symbol-name-based blacklist for rename. Summary: This patch adds a simple mechanism to disallow global rename on std symbols. We might extend it to other symbols, e.g. protobuf. Reviewers: kadircet Subscribers: mgorny, ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73450	2020-01-29 10:32:40 +01:00
Benjamin Kramer	757bdc64d3	Fix clang unnittest build with GCC 5	2020-01-29 10:30:36 +01:00
Pavel Labath	e06444d982	[lldb] Fix windows build for the StringRef conversion operator change "operator std::string()" is now explicit.	2020-01-29 10:08:40 +01:00
Fangrui Song	800a0f81e9	[ARC] Fix ARCTargetMachine after `777180a32b`	2020-01-29 00:59:16 -08:00
Sam Parker	ac30ea2f87	[RDA][ARM] Move functionality into RDA Add several new helpers to RDA: - hasLocalDefBefore - isRegDefinedAfter - isSafeToDefRegAt And move two bits of logic from ARMLowOverheadLoops into RDA: - isSafeToMove - isSafeToRemove Both of these have some wrappers too to make them more convienent to use. Differential Revision: https://reviews.llvm.org/D73460	2020-01-29 03:27:47 -05:00
Raphael Isemann	ab8b22d1c2	[lldb] Don't create duplicate declarations when completing a forward declaration with a definition from another source Summary: I noticed this strange line in `ASTImporterDelegate::ImportDefinitionTo` which doesn't make a lot of sense: ``` to_tag->setCompleteDefinition(from_tag->isCompleteDefinition()); ``` It forcibly sets the imported TagDecl to be defined if the source TagDecl was defined. This doesn't make any sense as in this code we already forced the ASTImporter to import the definition so this should always be a no-op. Turns out this is hiding two bugs: 1. The way we handle forward declarations in the debug info that might be completed later is that we import them and then mark them as having external lexical storage. This makes Clang ask for the definition later when it needs it (at which point we hopefully have the definition around and can complete it). However, this is currently not completing the forward decls with external storage but instead creates a duplicated decl in the target AST which is then defined. The forward decl is kept incomplete after the import and we just forcibly make it a definition of the record without any content with our workaround. The TestSharedLib* tests is only passing because of this. 2. Minimal import of lambdas is broken and never imports the definition it seems. That appears to be a bug in the ASTImporter which gives the definition of lambda's some special treatment. TestLambdas.py is actually broken but is passing because of this workaround. This patch fixes the first bug by forcing the ASTImporter to import to the target forward declaration. We can't delete the workaround as the second bug is still around but that will be a follow up review for the ASTImporter. However it will get rid of all the duplicated RecordDecls that are in our expression AST that are strangely defined but don't have any of the fields they are supposed to have. Reviewers: shafik, labath Reviewed By: shafik Subscribers: aprantl, abidh, JDevlieghere, lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D73345	2020-01-29 09:20:47 +01:00
Raphael Isemann	a5fb2e371e	[lldb] Complete return types of CXXMethodDecls to prevent crashing due to covariant return types Summary: Currently we crash in Clang's CodeGen when we call functions with covariant return types with this assert: ``` Assertion failed: (DD && "queried property of class with no definition"), function data, file clang/include/clang/AST/DeclCXX.h, line 433. ``` when calling `clang::CXXRecordDecl::isDerivedFrom` from the `ItaniumVTableBuilder`. Clang seems to assume that the underlying record decls of covariant return types are already completed. This is true during a normal Clang invocation as there the type checker will complete both decls when checking if the overloaded function is valid (i.e., the return types are covariant). When we minimally import our AST into the expression in LLDB we don't do this type checking (which would complete the record decls) and we end up trying to access the invalid record decls from CodeGen which makes us trigger the assert. This patch just completes the underlying types of ptr/ref return types of virtual function so that the underlying records are complete and we behave as Clang expects us to do. Fixes rdar://38048657 Reviewers: lhames, shafik Reviewed By: shafik Subscribers: abidh, JDevlieghere, lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D73024	2020-01-29 09:08:35 +01:00
Raphael Isemann	a497e1b5ea	[lldb] Use CompletionRequest in REPL::CompleteCode and remove translation code to old API Any REPL client should just move to CompletionRequest instead of relying on the translation code from the old API, so let's remove that translation code.	2020-01-29 08:56:32 +01:00
Fangrui Song	bc15bf66dc	[X86] matchAdd: don't fold a large offset into a %rip relative address For `ret i64 add (i64 ptrtoint (i32* @foo to i64), i64 1701208431)`, ``` X86DAGToDAGISel::matchAdd ... // AM.setBaseReg(CurDAG->getRegister(X86::RIP, MVT::i64)); if (!matchAddressRecursively(N.getOperand(0), AM, Depth+1) && // Try folding offset but fail; there is a symbolic displacement, so offset cannot be too large !matchAddressRecursively(Handle.getValue().getOperand(1), AM, Depth+1)) return false; ... // Try again after commuting the operands. // AM.Disp = Val; foldOffsetIntoAddress() does not know there will be a symbolic displacement if (!matchAddressRecursively(Handle.getValue().getOperand(1), AM, Depth+1) && // AM.setBaseReg(CurDAG->getRegister(X86::RIP, MVT::i64)); !matchAddressRecursively(Handle.getValue().getOperand(0), AM, Depth+1)) // Succeeded! Produced leaq sym+disp(%rip),... return false; ``` `foldOffsetIntoAddress()` currently does not know there is a symbolic displacement and can fold a large offset. The produced `leaq sym+disp(%rip), %rax` instruction is relocated by an R_X86_64_PC32. If disp is large and sym+disp-rip>=2**31, there will be a relocation overflow. This approach is still not elegant. Unfortunately the isRIPRelative interface is a bit clumsy. I tried several solutions and eventually picked this one. Differential Revision: https://reviews.llvm.org/D73606	2020-01-28 22:30:52 -08:00
Johannes Doerfert	76843ba37f	[Attributor][Fix] Initialize unused but loaded variable This hopefully un-breaks: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/38333	2020-01-28 23:52:16 -06:00
Johannes Doerfert	ea5fabe60c	[Attributor] Reuse existing logic to avoid duplication There was a TODO in AAValueConstantRangeArgument to reuse AAArgumentFromCallSiteArguments. We now do this by allowing new States to be build from the bestState.	2020-01-28 23:45:59 -06:00
Johannes Doerfert	224085409d	[Attributor][FIX] Treat invalidated attributes as changed If we invalidate an attribute we need to inform all dependent ones even if the fixpoint state is not invalid. Before we only continued invalidation if the fixpoint state was invalid, now we signal a change in case the fixpoint state is valid. The test case was already included in D71620 but the problem was hiding because it only manifested with the old PM (for that input).	2020-01-28 23:40:41 -06:00
Johannes Doerfert	53992c7bf7	[Attributor] Modularize AANoAliasCallSiteArgument to simplify extensions This patch modularizes the way we check for no-alias call site arguments by putting the existing logic into helper functions. The reasoning was not changed but special cases for readonly/readnone were added.	2020-01-28 23:39:29 -06:00
Johannes Doerfert	24ae77eebf	[Attributor] Mark a non-defined `null` pointer as `noalias` If `null` is not defined we cannot access it, hence the pointer is `noalias`. While this is not helpful on it's own it simplifies later deductions that can skip over already known `noalias` pointers in certain situations.	2020-01-28 23:09:37 -06:00
Johannes Doerfert	6626d1b7c0	[Attributor][NFC] Remove ugly and unneeded cast	2020-01-28 22:54:31 -06:00
Johannes Doerfert	02bd8180fc	[Attributor][NFC] Improve debug messages	2020-01-28 22:53:19 -06:00
Johannes Doerfert	b6dbd0f71f	[Attributor][NFC] Internalize helper function	2020-01-28 22:50:34 -06:00

1 2 3 4 5 ...

340863 Commits All Branches Search

340863 Commits

All Branches