llvm-project

Commit Graph

Author	SHA1	Message	Date
Nico Weber	877073bc1c	[gn build] (manually) merge `47edf5bafb`	2020-03-10 10:22:39 -04:00
Mikhail Maltsev	47edf5bafb	[ARM,CDE] Generalize MVE intrinsics infrastructure to support CDE Summary: This patch generalizes the existing code to support CDE intrinsics which will share some properties with existing MVE intrinsics (some of the intrinsics will be polymorphic and accept/return values of MVE vector types). Specifically the patch: * Adds new tablegen backends -gen-arm-cde-builtin-def, -gen-arm-cde-builtin-codegen, -gen-arm-cde-builtin-sema, -gen-arm-cde-builtin-aliases, -gen-arm-cde-builtin-header based on existing MVE backends. * Renames the '__clang_arm_mve_alias' attribute into '__clang_arm_builtin_alias' (it will be used with CDE intrinsics as well as MVE intrinsics) * Implements semantic checks for the coprocessor argument of the CDE intrinsics as well as the existing coprocessor intrinsics. * Adds one CDE intrinsic __arm_cx1 to test the above changes Reviewers: simon_tatham, MarkMurrayARM, ostannard, dmgreen Reviewed By: simon_tatham Subscribers: sdesmalen, mgorny, kristof.beyls, danielkiss, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75850	2020-03-10 14:03:16 +00:00
Jonas Paulsson	c2dafe12dc	[SimplifyCFG] Skip merging return blocks if it would break a CallBr. SimplifyCFG should not merge empty return blocks and leave a CallBr behind with a duplicated destination since the verifier will then trigger an assert. This patch checks for this case and avoids the transformation. CodeGenPrepare has a similar check which also has a FIXME comment about why this is needed. It seems perhaps better if these two passes would eventually instead update the CallBr instruction instead of just checking and avoiding. This fixes https://bugs.llvm.org/show_bug.cgi?id=45062. Review: Craig Topper Differential Revision: https://reviews.llvm.org/D75620	2020-03-10 14:59:13 +01:00
Sanjay Patel	6e60e1025f	[InstCombine] regenerate test checks; NFC tmp -> t because 'tmp' tends to cause problems for the auto-generation script.	2020-03-10 09:57:41 -04:00
Simon Pilgrim	e71fb46a8f	[TargetLowering] SimplifyDemandedVectorElts - add DemandedElts mask to ISD::BITCAST SimplifyDemandedBits call. This fixes most of the regressions introduced in the rG4bc6f6332028 bugfix. The vector-trunc.ll issue should be fixed by D66004.	2020-03-10 13:39:10 +00:00
Pavel Labath	6b37c476a2	[lldb] Improve test failure messages in vscode tests A couple of tests sporadically fail on these assertions, but the error messages do not give a clue as to what has actually happened. Improve them so that we can better understand what is going wrong.	2020-03-10 14:32:45 +01:00
Sanjay Patel	467eec0910	[InstCombine] fold gep-of-select-of-constants (PR45084) As shown in: https://bugs.llvm.org/show_bug.cgi?id=45084 ...we failed to combine a gep with constant indexes with a pointer operand that is a select of constants. Differential Revision: https://reviews.llvm.org/D75807	2020-03-10 09:25:13 -04:00
Sanjay Patel	5b465ad290	[InstCombine] add/adjust tests for select-gep; NFC Goes with D75807	2020-03-10 09:25:13 -04:00
Florian Hahn	2d6ecf4648	[SLP] Support vectorizing functions provided by vector libs. It seems like the SLPVectorizer is currently not aware of vector versions of functions provided by libraries like Accelerate [1]. This patch updates SLPVectorizer to use the same infrastructure the LoopVectorizer uses to detect vectorizable library functions. For calls, it computes the cost of an intrinsic call (existing behavior) and the cost of a vector function library call, if available. Like LoopVectorizer, it assumes the cost of the vector function is simply the cost of a call to a vector function. [1] https://developer.apple.com/documentation/accelerate Reviewers: ABataev, RKSimon, spatel Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D75878	2020-03-10 13:10:50 +00:00
Pavel Labath	1ca1e08e75	[lldb] Break up CommandObjectDisassemble::DoExecute The function consisted of a complicated set of conditions to compute the address ranges which are to be disassembled (depending on the mode selected by command line switches). This patch creates a separate function for each mode, so that DoExecute is only left with the task of figuring out how to dump the relevant ranges. This is NFC-ish, except for one change in the error message, which is actually an improvement.	2020-03-10 14:03:16 +01:00
Pavel Labath	d00dff88b4	[lldb] Make UnwindLLDB a non-plugin Summary: This is the only real unwinder, and things have been this way for quite a long time. At this point, the class has accumulated so many features it is unlikely that anyone will want to reimplement the whole thing. The class is also fairly closely coupled (through UnwindPlans and FuncUnwinders) with a lot of other lldb components that it is hard to imagine a different unwinder implementation being substantially different without reimplementing all of those. The existing unwinding functionality is nonetheless fairly complex and there is space for adding more structure to it, but I believe a more worthwhile effort would be to take the existing UnwindLLDB class and try to break it down and introduce extension/customization points, instead of writing a brand new Unwind implementation. Reviewers: jasonmolenda, JDevlieghere, xiaobai Subscribers: mgorny, lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D75848	2020-03-10 13:56:15 +01:00
Nathan James	1fc5be0669	[NFC] Tweak OptionsUtils	2020-03-10 12:50:58 +00:00
David Bozier	6e2804ce6b	[LLD] Add support for --unique option Summary: Places orphan sections into a unique output section. This prevents the merging of orphan sections of the same name. Matches behaviour of GNU ld --unique. --unique=pattern is not implemented. Motivated user case shown in the test has 2 local symbols as they would appear if C++ source has been compiled with -ffunction-sections. The merging of these sections in the case of a partial link (-r) may limit the effectiveness of -gc-sections of a subsequent link. Reviewers: espindola, jhenderson, bd1976llvm, edd, andrewng, JonChesterfield, MaskRay, grimar, ruiu, psmith Reviewed By: MaskRay, grimar Subscribers: emaste, arichardson, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75536	2020-03-10 12:20:21 +00:00
Simon Pilgrim	5cbddf7cbc	[X86][SSE] Add more accurate costs for fmaxnum/fminnum codegen Based off llvm-mca reports on codegen in llvm\test\CodeGen\X86\fmaxnum.ll + llvm\test\CodeGen\X86\fminnum.ll	2020-03-10 11:59:40 +00:00
Djordje Todorovic	3e47f87e64	[NFC][llvm-dwarfdump] Always use 'const Twine &' According to the Twine.h comment, the Twines should only be used as const references in arguments. Differential Revision: https://reviews.llvm.org/D75727	2020-03-10 12:58:59 +01:00
Simon Pilgrim	9b05596eff	[SLPVectorizer][X86] Add fmaxnum/fminnum tests	2020-03-10 11:18:28 +00:00
Simon Pilgrim	0b1dc6016f	[CostModel][X86] Add fmaxnum/fminnum costs tests	2020-03-10 11:18:27 +00:00
Simon Pilgrim	b9b96adcf5	[X86][SSE] Add SSE41 coverage for fmaxnum/fminnum tests	2020-03-10 11:18:27 +00:00
alex-t	39e1a90784	[AMDGPU] SI_INDIRECT_DST_V* pseudos expansion should place EXEC restore to separate basic block Summary: When SI_INDIRECT_DST_V* pseudos has indexes in VGPR, they get expanded into the self-looped basic block that modifies EXEC in a loop. To keep EXEC consistent it is stored before and then re-stored after the pseudo expansion result. %95:vreg_512 = SI_INDIRECT_DST_V16 %93:vreg_512(tied-def 0), %94:sreg_32, 0, killed %1500:vgpr_32 results to s_mov_b64 s[6:7], exec BB0_16: v_readfirstlane_b32 s8, v28 v_cmp_eq_u32_e32 vcc, s8, v28 s_and_saveexec_b64 vcc, vcc s_set_gpr_idx_on s8, gpr_idx(DST) v_mov_b32_e32 v6, v25 s_set_gpr_idx_off s_xor_b64 exec, exec, vcc s_cbranch_execnz BB0_16 ; %bb.17: s_mov_b64 exec, s[6:7] The bug appeared in case this expansion occurs in the ELSE block of the CF. Originally %110:vreg_512 = SI_INDIRECT_DST_V16 %103:vreg_512(tied-def 0), %85:vgpr_32, 0, %107:vgpr_32, %112:sreg_64 = SI_ELSE %108:sreg_64, %bb.19, 0, implicit-def dead $exec, implicit-def dead $scc, implicit $exec expanded to ****************** <== here exec has "THEN" context s_mov_b64 s[6:7], exec BB0_16: v_readfirstlane_b32 s8, v28 v_cmp_eq_u32_e32 vcc, s8, v28 s_and_saveexec_b64 vcc, vcc s_set_gpr_idx_on s8, gpr_idx(DST) v_mov_b32_e32 v6, v25 s_set_gpr_idx_off s_xor_b64 exec, exec, vcc s_cbranch_execnz BB0_16 ; %bb.17: s_or_saveexec_b64 s[4:5], s[4:5] <-- exec mask is restored for "ELSE" but immediately overwritten. s_mov_b64 exec, s[6:7] The rest of the "ELSE" block is executed not by the workitems which constitute the "else mask" but by those which constitute "then mask" SILowerControlFlow::emitElse always considers the basic block begin() as an insertion point for s_or_saveexec. Proposed fix: The SI_INDIRECT_DST_V* procedure should split the reminder block to create landing pad for the EXEC restoration. Reviewers: rampitec, vpykhtin, nhaehnle Reviewed By: vpykhtin Subscribers: arsenm, kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75472	2020-03-10 14:04:22 +03:00
Kerry McLaughlin	0bba37a320	[AArch64][SVE] Add SVE intrinsics for address calculations Summary: Adds the @llvm.aarch64.sve.adr[b\|h\|w\|d] intrinsics Reviewers: sdesmalen, andwar, efriedma, dancgr, cameron.mcinally, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75858	2020-03-10 10:53:37 +00:00
James Greenhalgh	f0de8d0940	[Arm] Do not lower vmax/vmin to Neon instructions On some Arm cores there is a performance penalty when forwarding from an S register to a D register. Calculating VMAX in a D register creates false forwarding hazards, so don't do that unless we're on a core which specifically asks for it. Patch by James Greenhalgh Differential Revision: https://reviews.llvm.org/D75248	2020-03-10 10:48:48 +00:00
Simon Pilgrim	18c19441d1	[X86][AVX] combineX86ShuffleChain - combine binary shuffles to X86ISD::VPERM2X128 For pre-AVX512 targets, combine binary shuffles to X86ISD::VPERM2X128 if possible. This mainly helps optimize the blend(extract_subvector(x,1),y) pattern. At some point soon we're going to have make a decision about when to combine AVX512 shuffles more aggressively - we bail out if there is any change in element size (to protect predicate mask merging) which means we miss out on a lot of optimizations.	2020-03-10 10:44:28 +00:00
Adam Balogh	20a3d64c88	[Analyzer][NFC] Change parameter of NoteTag lambdas to PathSensitiveBugReport Lambdas creating path notes using NoteTags still take BugReport as their parameter. Since path notes obviously only appear in PathSensitiveBugReports it is straightforward that lambdas of NoteTags take PathSensitiveBugReport as their parameter. Differential Revision: https://reviews.llvm.org/D75898	2020-03-10 11:30:28 +01:00
Clement Courbet	30477197b3	[ExpandMemCmp][NFC] Add more tests.	2020-03-10 11:34:19 +01:00
Florian Hahn	b53907bfed	[SLP] Precommit vector library test for D75878.	2020-03-10 10:17:34 +00:00
Sam Parker	ff9ac33e1e	[ARM][MVE] Validate tail predication values Iterate through the loop and check that the observable values produced are the same whether tail predication happens or not. We want to find out if the tail-predicated version of this loop will produce the same values as the loop in its original form. For this to be true, the newly inserted implicit predication must not change the the (observable) results. We're doing this because many instructions in the loop will not be predicated and so the conversion from VPT predication to tail predication can result in different values being produced, because of falsely predicated lanes not being updated in the converted form. A masked load, whether through VPT or tail predication, will write zeros to any of the falsely predicated bytes. So, from the loads, we know that the false lanes are zeroed and here we're trying to track that those false lanes remain zero, or where they change, the differences are masked away by their user(s). All MVE loads and stores have to be predicated, so we know that any load operands, or stored results are equivalent already. Other explicitly predicated instructions will perform the same operation in the original loop and the tail-predicated form too. Because of this, we can insert loads, stores and other predicated instructions into our KnownFalseZeros set and build from there. Differential Revision: https://reviews.llvm.org/D75452	2020-03-10 09:59:01 +00:00
Jonathan Coe	0c28a0938c	[clang-format] Correct indentation for `[key] = value,` entries in C# object initialisers Restores content of commit `cb3f20d27c` reverted in commit `5a101f3773` with a corrected commit message. Summary: Do not use continuation indent for '[' in blocks in C# code. Reviewers: krasimir Reviewed By: krasimir Subscribers: cfe-commits Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D75747	2020-03-10 09:36:52 +00:00
Jonathan Coe	5a101f3773	Revert "[clang-format] Correct indentation for `[key] = value,` entries in C++ object initialisers" Commit message says "C++" where it should say "C#". This reverts commit `cb3f20d27c`.	2020-03-10 09:30:34 +00:00
Djordje Todorovic	5aa5c943f7	Reland "[DebugInfo] Enable the debug entry values feature by default" Differential Revision: https://reviews.llvm.org/D73534	2020-03-10 09:15:06 +01:00
Dmitry Vyukov	a72dc86cdd	tsan: tsan_interface.h: make constants static Note that in C++ the static keyword is implicit for const objects. In C however it is not, and we get clashes at link time after including the header from more than one C file: lib.a(bar.o):(.rodata+0x0): multiple definition of `__tsan_mutex_linker_init' lib.a(foo.o):(.rodata+0x0): first defined here lib.a(bar.o):(.rodata+0xc): multiple definition of `__tsan_mutex_not_static' lib.a(foo.o):(.rodata+0xc): first defined here <snip> Indeed both foo.o and bar.o define the clashing symbols: $ nm foo.o <snip> 0000000000000000 R __tsan_mutex_linker_init 000000000000000c R __tsan_mutex_not_static <snip> Fix it by explicitly making the constants static. Reviewed-in: https://reviews.llvm.org/D75820 Author: cota (Emilio G. Cota)	2020-03-10 09:13:41 +01:00
Craig Topper	ef4f939d38	[X86] Remove isel patterns for (X86VBroadcast (i16 (trunc (i32 (load))))). Replace with a DAG combine to form VBROADCAST_LOAD. isTypeDesirableForOp prevents loads from being shrunk to i16 by DAG combine. Because of this we can't just match the broadcast and a scalar load. So look for broadcast+truncate+load and form a vbroadcast_load during DAG combine. This replaces what was previously done as an isel pattern and I think fixes it so we won't change the size of a volatile load. But my main motivation is just to clean up our isel patterns.	2020-03-10 00:07:07 -07:00
Puyan Lotfi	4b8af31f63	[llvm][MIRVRegNamer] Avoid collisions across constant pool indices. When hashing on MachineOperand::MO_ConstantPoolIndex, now MIR-Canon and MIRVRegNamer will no longer result in a hash collision. Differential Revision: https://reviews.llvm.org/D74449	2020-03-10 01:13:20 -04:00
Siva Chandra Reddy	550be40515	[libc] Add simple implementations of mtx_lock and mtx_unlock. These functions only support locking and unlocking of plain mutexes. They will be extended in future changes to handled recursive and timed mutexes. Reviewers: phosek Differential Revision: https://reviews.llvm.org/D74653	2020-03-09 21:56:02 -07:00
Siva Chandra Reddy	fd8c133613	[libc] Take 2: Add linux implementations of thrd_create and thrd_join functions. The following are the differences from the first version: 1. The kernel does not copy the stack for the new thread (it cannot). The previous version missed this fact. In this new version, the new thread's start args are copied on to the new stack in a known location so that the new thread can sniff them out. 2. A start args sniffer for x86_64 has been added. 2. Default stack size has been increased to 64KB. Reviewers: abrachet, phosek Differential Revision: https://reviews.llvm.org/D75818	2020-03-09 21:28:11 -07:00
Mehdi Amini	f80c6d8dec	Fix MLIR build when NVPTX backend is not configured in The GPUToCUDA conversion needs to conditionally link it in.	2020-03-10 04:11:49 +00:00
Matt Arsenault	627bb31a28	AMDGPU/GlobalISel: Avoid illegal vector exts for add/sub/mul When expanding scalar packed operations, we should not introduce illegal vector casts LegalizerHelper introduces. We're not in a legalizer context, and there's no RegBankSelect apply or legalize worklist.	2020-03-09 23:42:17 -04:00
Matt Arsenault	ed72bcae34	AMDGPU/GlobalISel: Fix mishandling SGPR v2s16 add/sub/mul We weren't considering the packed case correctly, and this was passing through to the selector. The selector only checked the size, so this would incorrectly compile to a single 32-bit scalar add. As usual, the LegalizerHelper is somewhat awkward to use from applyMappingImpl. I think this is the first place we've needed multi-step legalization here though.	2020-03-09 22:51:54 -04:00
Lang Hames	3f981cdde9	[MC] Allow Stackmap sections after DWARF in MachO. Summary: Mixing stackmaps and DWARF in a single file triggers an assertion in MCMachOStreamer as stackmap sections are emitted in AsmPrinter::emitEndOfAsmFile after the DWARF sections have already been emitted. Since it should be safe to emit stackmap sections after DWARF sections this patch relaxes the assertion to allow that. Reviewers: aprantl, dblaikie, echristo Subscribers: hiraditya, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75836	2020-03-09 18:33:32 -07:00
Uday Bondhugula	e241573d59	[mlir] NFC: remove IntegerValueSet / MutableIntegerSet Summary: - these are unused and really not needed now given flat affine constraints Differential Revision: https://reviews.llvm.org/D75792	2020-03-09 20:55:55 -04:00
Nathan James	97572fa6e9	[NFC] use hasAnyOperatorName and hasAnyOverloadedOperatorName functions in clang-tidy matchers	2020-03-10 00:42:21 +00:00
Wouter van Oortmerssen	a7a3751775	[WebAssembly] Fixed FrameBaseLocal not being set. Summary: Fixes: https://bugs.llvm.org/show_bug.cgi?id=44920 WebAssemblyRegColoring may merge the vreg that currently represents the FrameBase with one representing an argument. WebAssemblyExplicitLocals picks up the corresponding local when a vreg is first added to the Reg2Local mapping, except when it is an argument instruction which are handled separately. Note that this does not change that vregs representing the FrameBase may get merged, it is not clear to me that this may have other effects we may want to avoid? Reviewers: dschuff Reviewed By: dschuff Subscribers: azakai, sbc100, hiraditya, aheejin, sunfish, llvm-commits, jgravelle-google Tags: #llvm Differential Revision: https://reviews.llvm.org/D75718	2020-03-09 17:29:36 -07:00
Nathan James	77eec38626	[ASTMatchers] Add hasAnyOverloadedOperatorName matcher Reviewers: aaron.ballman Reviewed By: aaron.ballman Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75841	2020-03-10 00:11:27 +00:00
George Burgess IV	174c3eb69f	[x86][slh] Move isDataInvariant* functions Patch by Zola Bridges! From the review: """ I moved these functions to X86InstrInfo.cpp, so they are available from another pass. In addition, this is a step toward resolving the FIXME to move this metadata to the instruction tables. This is the final step to make these two data invariance checks available for non-SLH passes. The other two steps were here: - https://reviews.llvm.org/D70283 - https://reviews.llvm.org/D75650 Tested via llvm-lit llvm/test/CodeGen/X86/speculative-load-hardening* """ Differential Revision: https://reviews.llvm.org/D75654	2020-03-09 17:07:44 -07:00
George Burgess IV	bb0ec1daff	[x86][slh][NFC] Rm redundant liveness check Patch by Zola Bridges! From the review: """ In this changeset (https://reviews.llvm.org/D70283), I added a liveness check everywhere the isDataInvariant* functions were used, so that I could safely delete the checks within the function. I mistakenly left that deletion out of the patch. The result is that the same condition is checked twice for some instructions which is functionally fine, but not good. This change deletes the redundant check that I intended to delete in the last change. This is the second of three patches that will make the data invariance checks available for non-SLH passes and enable the FIXMEs related to moving this metadata to the instruction tables to be resolved. Tested via llvm-lit llvm/test/CodeGen/X86/speculative-load-hardening* """ Differential Revision: https://reviews.llvm.org/D75650	2020-03-09 17:07:44 -07:00
Richard Smith	6333cc2a12	Revert "PR45083: Mark statement expressions as being dependent if they contain" This reverts commit `2669e41b7b`, which was pushed by mistake.	2020-03-09 17:03:56 -07:00
Richard Smith	51fab8f36f	Mark test function as 'weak' to prevent interprocedural CSE. A recent change to MemorySSA caused LLVM to start optimizing the call to 'f(x)' into just 'x', despite the 'noinline' attribute. So try harder to prevent this optimization from firing.	2020-03-09 17:01:07 -07:00
Richard Smith	2669e41b7b	PR45083: Mark statement expressions as being dependent if they contain dependent constructs. We previously assumed they were neither value- nor instantiation-dependent under any circumstances, which would lead to crashes and other misbehavior. This doesn't match GCC's behavior (where statement expressions appear to be treated as value-dependent if they appear in a dependent context), but seems to be the best thing we can do in the short term: it turns out to be remarkably difficult for us to correctly determine whether we are in a dependent context (and it's not even possible in some cases, such as in a generic lambda where we might not have seen the 'auto' yet). This was previously reverted in `8e4a867` for rejecting some code, but that code was invalid and Clang was previously incorrectly accepting it.	2020-03-09 16:57:07 -07:00
Matt Morehouse	d93303b783	[ASan] Enable set_shadow_test.c on Windows. It looks like the recent -fno-common is making it pass now.	2020-03-09 16:09:28 -07:00
River Riddle	b10c662514	[mlir][SideEffects] Replace the old SideEffects dialect interface with the newly added op interfaces/traits. Summary: The old interface was a temporary stopgap to allow for implementing simple LICM that took side effects of region operations into account. Now that MLIR has proper support for specifying memory effects, this interface can be deleted. Differential Revision: https://reviews.llvm.org/D74441	2020-03-09 16:02:21 -07:00
George Burgess IV	cfc3e7f458	[cmake] Strip quotes in compiler-rt/lib/crt; error if checks fail Patch by Zhizhou Yang! In his own words: """ Similar change to CMakeLists as r372312. After r372209, compiler command line may include argument with quotes: ``` -fprofile-instr-use="/foo/bar.profdata" ``` And it causes a hidden failure with execute_process later: Could not read profile "/foo/bar.profdata": No such file or directory. In this particular case, the check for .init_array will fail silently and creates a PGO-ed binary with bad .init_array section in compiler-rt. Bug details can be found in PR45022 """ Differential Revision: https://reviews.llvm.org/D75065	2020-03-09 15:52:39 -07:00

1 2 3 4 5 ...

344825 Commits All Branches Search

344825 Commits

All Branches