llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam Clegg	2322a93709	[WebAssembly] MC: Refactor section creation code Remove the use of default argument in favor of a separate startCustomSection method. Differential Revision: https://reviews.llvm.org/D45794 llvm-svn: 330632	2018-04-23 19:16:19 +00:00
Peter Collingbourne	5ab4a4793e	Reland r329956, "AArch64: Introduce a DAG combine for folding offsets into addresses.", with a fix for the bot failure. This reland includes a check to prevent the DAG combiner from folding an offset that is smaller than the existing one. This can cause oscillations between two possible DAGs, which was the cause of the hang and later assertion failure observed on the lnt-ctmark-aarch64-O3-flto bot. http://green.lab.llvm.org/green/job/lnt-ctmark-aarch64-O3-flto/2024/ Original commit message: > This is a code size win in code that takes offseted addresses > frequently, such as C++ constructors that typically need to compute > an offseted address of a vtable. This reduces the size of Chromium > for Android's .text section by 108KB. Differential Revision: https://reviews.llvm.org/D45199 llvm-svn: 330630	2018-04-23 19:09:34 +00:00
Daniel Neilson	cc45e923c5	[DSE] Teach the pass that atomic memory intrinsics are stores. Summary: This change teaches DSE that the atomic memory intrinsics are stores that can be eliminated, and can allow other stores to be eliminated. This change specifically does not teach DSE that these intrinsics can be partially eliminated (i.e. length reduced, and dest/src changed); that will be handled in another change. Reviewers: mkazantsev, skatkov, apilipenko, efriedma, rsmith Reviewed By: efriedma Subscribers: dmgreen, llvm-commits Differential Revision: https://reviews.llvm.org/D45535 llvm-svn: 330629	2018-04-23 19:06:49 +00:00
Alex Shlyapnikov	a2b4f9b4d4	[HWASan] Switch back to fixed shadow mapping for x86-64 For now switch back to fixed shadow mapping for x86-64 due to the issues with IFUNC linking on older binutils. More details will be added to https://bugs.chromium.org/p/chromium/issues/detail?id=835864 Differential Revision: https://reviews.llvm.org/D45840 llvm-svn: 330623	2018-04-23 18:14:39 +00:00
Vedant Kumar	f17720633b	[SelectionDAG] Dump debug locs in SDNodes This helps debug issues where selection-dag assigns the wrong location to an instruction. Differential Revision: https://reviews.llvm.org/D45913 llvm-svn: 330618	2018-04-23 17:18:24 +00:00
Reid Kleckner	e160d51b42	Fix -Wtautological-compare warning with npos on Windows llvm-svn: 330614	2018-04-23 16:47:27 +00:00
Matt Arsenault	b21f9592be	AMDGPU: Move a flawed assert when spilling SGPRs It's possible to validly spill the frame offset register in a call sequence to a VGPR. There are definitely issues with SGPR spilling to memory, so move the assert later. llvm-svn: 330612	2018-04-23 16:13:30 +00:00
Simon Pilgrim	8cd01aaa0f	[X86] Replace x87 instregex with instrs if they only match one instruction llvm-svn: 330611	2018-04-23 16:10:50 +00:00
Adrian Prantl	bbe980dfe1	Fix computeSymbolSizes SEGFAULT on invalid file We use llvm-symbolizer in some production systems, and we run it against all possibly related files, including some that are not ELF. We noticed that for some of those invalid files, llvm-symbolizer would crash with SEGFAULT. Here is an example of such a file. It is due to that in computeSymbolSizes, a loop uses condition for (unsigned I = 0, N = Addresses.size() - 1; I < N; ++I) { where if Addresses.size() is 0, N would overflow and causing the loop to access invalid memory. Instead of patching the loop conditions, the commit makes so that the function returns early if Addresses is empty. Validated by checking that llvm-symbolizer no longer crashes. Patch by Teng Qin! Differential Revision: https://reviews.llvm.org/D44285 llvm-svn: 330610	2018-04-23 16:08:01 +00:00
Matt Arsenault	adc59d7076	AMDGPU: Assign enum name to stack ID Also assert that it is correct for SGPRs. There is currently a bug where stack slot coloring replaces SGPR spill FIs with one with the default ID, which results in a more confusing assert later about a dead object. llvm-svn: 330607	2018-04-23 15:51:26 +00:00
Matt Arsenault	488476c6eb	StackSlotColoring: Fix missing skipFunction check llvm-svn: 330606	2018-04-23 15:51:21 +00:00
Daniel Neilson	9863b48d4e	[SelectionDAG] Refactor lowering of atomic memory intrinsics. Summary: This just refactors the lowering of the atomic memory intrinsics to more closely match the code patterns used in the lowering of the non-atomic memory intrinsics. Specifically, we encapsulate the lowering in SelectionDAG::getAtomicMem*() functions rather than embedding the code directly in the SelectionDAGBuilder code. llvm-svn: 330603	2018-04-23 15:40:37 +00:00
Robert Widmann	6978db7800	[LLVM-C] DIBuilderBindings for Subrange and Arrays Summary: Move Go bindings for subranges and DINode arrays. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45933 llvm-svn: 330594	2018-04-23 14:29:33 +00:00
Robert Widmann	b02fe644d4	[LLVM-C] Finish Up Scope Bindings Summary: Adds bindings for Module and NameSpace scopes and LLVMDIBuilderCreateForwardDecl, a counterpart to LLVMDIBuilderCreateReplaceableCompositeType. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45934 llvm-svn: 330591	2018-04-23 13:51:43 +00:00
Simon Pilgrim	455d0b2cfe	[X86] Remove instregex matching from CLAC/STAC. Note - noticed this as the STAC case as it was unintentionally matching against STACK pseudo instructions. llvm-svn: 330588	2018-04-23 13:24:17 +00:00
Nico Weber	77c5471d9f	List cpp file only once (was added in 147117 and 147117 as build fix each). llvm-svn: 330587	2018-04-23 13:11:51 +00:00
Nicolai Haehnle	cbebba4917	AMDGPU: Fix SDWA peephole for V_AND_B32 Summary: Found by inspection. We care about the operand that doesn't contain the immediate. I believe this is currently not hit because we fold 0xff / 0xffff immediates only later. Change-Id: Ic3cf8538bc7da5eff3200d96eccf9d339e6345a7 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45886 llvm-svn: 330586	2018-04-23 13:06:03 +00:00
Nicolai Haehnle	5a995664f0	AMDGPU: Fix a corner case crash in SIOptimizeExecMasking Summary: See the new test case; this is really unlikely to happen with real code, but I ran into this while attempting to bugpoint-reduce a different issue. Change-Id: I9ade1dc1aa8fd9c4d9fc83661d7b80e310b5c4a6 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45885 llvm-svn: 330585	2018-04-23 13:05:50 +00:00
Nico Weber	5d53aed419	Consistently sort add_subdirectory calls in lib/Target/*/CMakeLists.txt llvm-svn: 330584	2018-04-23 12:49:34 +00:00
Sander de Smalen	7893f722b2	[AArch64][SVE] Asm: Support for contiguous, non-faulting LDNF1 (scalar+imm) load instructions Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45684 llvm-svn: 330583	2018-04-23 12:43:19 +00:00
Max Kazantsev	91f481665e	[LoopRotate] Fix incorrect SCEV invalidation in loop rotation LoopRotate only invalidates innermost loops while the changes that it makes may also affert any of this parents. With patch rL329047, SCEV becomes much smarter about calculation of exit counts for outer loops, so we cannot assume that they are not affected. Differential Revision: https://reviews.llvm.org/D45945 llvm-svn: 330582	2018-04-23 12:33:31 +00:00
Simon Pilgrim	0a334a8668	[X86] Remove unnecessary MMX reg-mem InstRW scheduler overrides. llvm-svn: 330581	2018-04-23 11:57:15 +00:00
Max Kazantsev	acda4c0f18	[LoopUnroll] Fix potentially incorrect SCEV invalidation in UnrollRuntime Current runtime unrolling invalidates parent loop saying that it might have changed after the inner loop has changed, but it doesn't bother to do the same to its parents. With patch rL329047, SCEV becomes much smarter about calculation of exit counts for outer loops. We might need to invalidate not only the immediate parent, but also any of its parents as well. There is no clear evidence that there is some miscompile happening because of this (at least I don't have such test), but the common sense says that the current code is wrong. Differential Revision: https://reviews.llvm.org/D45940 Reviewed By: chandlerc llvm-svn: 330577	2018-04-23 10:39:38 +00:00
Max Kazantsev	b1137c42fa	[LoopSimplify] Fix incorrect SCEV invalidation In the function `simplifyOneLoop` we optimistically assume that changes in the inner loop only affect this very loop and have no impact on its parents. In fact, after rL329047 has been merged, we can now calculate exit counts for outer loops which may depend on inner loops. Thus, we need to invalidate all parents when we do something to a loop. There is an evidence of incorrect behavior of `simplifyOneLoop`: when we insert `SE->verify()` check in the end of this funciton, it fails on a bunch of existing test, in particular: LLVM :: Transforms/LoopUnroll/peel-loop-not-forced.ll LLVM :: Transforms/LoopUnroll/peel-loop-pgo.ll LLVM :: Transforms/LoopUnroll/peel-loop.ll LLVM :: Transforms/LoopUnroll/peel-loop2.ll Note that previously we have fixed issues of this variety, see rL328483. This patch makes this function invalidate the outermost loop properly. Differential Revision: https://reviews.llvm.org/D45937 Reviewed By: chandlerc llvm-svn: 330576	2018-04-23 10:32:37 +00:00
Sander de Smalen	1b6d374422	[AArch64][SVE] Asm: Support for structured ST2, ST3 and ST4 (scalar+imm) store instructions. Reviewers: fhahn, rengolin, javed.absar, SjoerdMeijer, t.p.northover, echristo, evandro, huntergr Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45681 llvm-svn: 330565	2018-04-23 07:50:35 +00:00
Chandler Carruth	bf7190a154	[PM/LoopUnswitch] Remove a buggy assert in the new loop unswitch. The condition this was asserting doesn't actually hold. I've added comments to explain why, removed the assert, and added a fun test case reduced out of 403.gcc. llvm-svn: 330564	2018-04-23 06:58:36 +00:00
Craig Topper	3f1d538165	[X86] Add VEX_WIG to VEX encoded version of VCMPPSY/VCMPPDY. llvm-svn: 330563	2018-04-23 04:50:01 +00:00
Chandler Carruth	b525424118	[PM/LoopUnswitch] Fix comment typo. NFC. llvm-svn: 330560	2018-04-23 00:48:42 +00:00
Simon Pilgrim	326594bc92	[X86][Znver1] Remove unnecessary BMI1 ANDN InstRW overrides. llvm-svn: 330558	2018-04-22 21:37:08 +00:00
Robert Widmann	12e367b6db	[LLVM-C] Add DIBuilder Bindings For Variable Creation Summary: Wrap LLVMDIBuilderCreateAutoVariable, LLVMDIBuilderCreateParameterVariable, LLVMDIBuilderCreateExpression, and move and correct LLVMDIBuilderInsertDeclareBefore and LLVMDIBuilderInsertDeclareAtEnd from the Go bindings to the C bindings. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: harlanhaskins, whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45928 llvm-svn: 330555	2018-04-22 19:24:44 +00:00
Simon Pilgrim	06e16541ba	[X86] Remove unnecessary WriteFBlend/WriteBlend InstRW overrides. Fixed a lot of the default classes which were being completely overridden. llvm-svn: 330554	2018-04-22 18:35:53 +00:00
Simon Pilgrim	091680b6e7	[X86] Remove unnecessary WriteFMul/WriteFRcp/WriteFRsqrt InstRW overrides. llvm-svn: 330553	2018-04-22 18:09:50 +00:00
Simon Pilgrim	b362d02229	[X86] Remove unnecessary CVT instrw overrides. llvm-svn: 330552	2018-04-22 17:54:58 +00:00
Sanjay Patel	30be665e82	[PatternMatch] allow undef elements when matching a vector zero This is the last step in getting constant pattern matchers to allow undef elements in constant vectors. I'm adding a dedicated m_ZeroInt() function and building m_Zero() from that. In most cases, calling code can be updated to use m_ZeroInt() directly when there's no need to match pointers, but I'm leaving that efficiency optimization as a follow-up step because it's not always clear when that's ok. There are just enough icmp folds in InstSimplify that can be used for integer or pointer types, that we probably still want a generic m_Zero() for those cases. Otherwise, we could eliminate it (and possibly add a m_NullPtr() as an alias for isa<ConstantPointerNull>()). We're conservatively returning a full zero vector (zeroinitializer) in InstSimplify/InstCombine on some of these folds (see diffs in InstSimplify), but I'm not sure if that's actually necessary in all cases. We may be able to propagate an undef lane instead. One test where this happens is marked with 'TODO'. llvm-svn: 330550	2018-04-22 17:07:44 +00:00
Simon Pilgrim	c7f9b183c2	[X86][SkylakeServer] Remove unnecessary PMULLD instrw overrides. llvm-svn: 330549	2018-04-22 16:51:12 +00:00
Simon Pilgrim	3e8640a93a	[X86][Atom] Remove unnecessary scalar/vector load/move instrw overrides. llvm-svn: 330548	2018-04-22 16:49:35 +00:00
Simon Pilgrim	ef8d3ae4b5	[X86] Fix (completely overridden) WriteFHAdd/WritePHAdd classes to allow us to remove unnecessary instrw overrides. llvm-svn: 330546	2018-04-22 15:25:59 +00:00
Simon Pilgrim	2fd8269c6f	[X86][MMX][SSE] Tag missed PHADD/PHSUB instructions with WritePHAdd llvm-svn: 330545	2018-04-22 15:02:23 +00:00
Simon Pilgrim	96855ec39e	[X86] Remove unnecessary WriteFVarBlend/WriteVarBlend InstRW overrides. This also fixes some of the ReadAfterLd issues due to InstRW. llvm-svn: 330544	2018-04-22 14:43:12 +00:00
Simon Pilgrim	a41ae2f005	[X86] Fix WriteMPSAD/WritePSADBW values to allow us to remove unnecessary instrw overrides. llvm-svn: 330542	2018-04-22 10:39:16 +00:00
Simon Pilgrim	523fd335b1	[X86][SandyBridge] Remove unnecessary WritePOPCNTLd overrides by fixing load latency. llvm-svn: 330541	2018-04-22 10:03:52 +00:00
Jonas Devlieghere	578c049497	[Support] Fix prefix logic in WithColor. When a prefix is passed, we need to print a colon a space after it, not just the prefix. llvm-svn: 330535	2018-04-22 08:01:01 +00:00
Craig Topper	e958c7270e	[X86] Change TB to PS on LFENCE instruction. This matches the other FENCE instructions. llvm-svn: 330533	2018-04-22 03:15:02 +00:00
Craig Topper	2a28336f34	[X86] Remove OpSizeIgnore, it's not implemented any differently than OpSizeFixed. llvm-svn: 330532	2018-04-22 01:24:58 +00:00
Craig Topper	e33ed7d667	[X86] Remove DATA32_PREFIX. Hack the printing for DATA16_PREFIX to print 'data32' in 16-bit mode. Hack the asm parser to convert 'data32' to 'data16' in 16-bit mode. Improve the error messages to match GNU assembler. This also allows us to remove the hack from the disassembler table building. llvm-svn: 330531	2018-04-22 00:52:02 +00:00
Simon Pilgrim	37334ea67a	[X86] Strip unnecessary prefetch + vector move/load instrw overrides from scheduler models. llvm-svn: 330527	2018-04-21 21:59:36 +00:00
Jonas Devlieghere	0c1b29540c	[Support] Add optional prefix to convenience helpers in WithColor. Several tools prefix the error/warning/note output with the name of the tool. One such tool is LLD for example. This commit adds as an optional 'Prefix' argument to the convenience helpers. llvm-svn: 330526	2018-04-21 21:36:11 +00:00
Simon Pilgrim	920802cc50	[X86] Strip unnecessary WriteCvtF2I instrw overrides from scheduler models. llvm-svn: 330525	2018-04-21 21:16:44 +00:00
Simon Pilgrim	825ead950e	[X86] Strip unnecessary broadcast/shuffle256 instrw overrides from scheduler models. llvm-svn: 330523	2018-04-21 20:45:12 +00:00
Simon Pilgrim	58ddaeabe2	[X86][AVX] VPERM2F128/VINSERTF128 should be a shuffle256 schedule like VPERM2I128/VINSERTI128 llvm-svn: 330522	2018-04-21 20:04:24 +00:00
Simon Pilgrim	74ccc6a303	[X86] Strip unnecessary vector integer math, shift-imm, extend, shuffle, pack/unpack instruction instrw overrides from scheduler models. llvm-svn: 330521	2018-04-21 19:11:55 +00:00
Craig Topper	fe59bea07b	[X86] Add DAG combine to turn (trunc (srl (mul ext, ext), 16) into PMULHW/PMULHUW. Ultimately I want to use this to remove the intrinsics for these instructions. llvm-svn: 330520	2018-04-21 18:39:21 +00:00
Craig Topper	05242bf691	[X86] Add SchedWrites for LDMXCSR/STMXCSR. llvm-svn: 330517	2018-04-21 18:07:36 +00:00
Simon Pilgrim	44278f6598	[X86][Haswell] Strip unnecessary WriteFAdd/WriteFHAdd instruction instrw overrides. llvm-svn: 330514	2018-04-21 16:20:28 +00:00
Simon Pilgrim	a80df0999f	[X86][Broadwell] Remove unnecessary VORPD/VORPS instrw override - missed in D45629 llvm-svn: 330513	2018-04-21 16:17:47 +00:00
Simon Pilgrim	93b102cd45	[X86] Strip unnecessary WriteFRcp/WriteFRsqrt instruction instrw overrides from scheduler models. The required the default skylake schedules to be updated - these were being completely overriden by the InstRW and the existing values not used at all. llvm-svn: 330510	2018-04-21 15:16:59 +00:00
Simon Pilgrim	2193524fb4	[X86] Strip unnecessary WriteFShuffle instruction instrw overrides from scheduler models. llvm-svn: 330508	2018-04-21 14:56:56 +00:00
Simon Pilgrim	f7f84a0ca3	[X86][SandyBridge] Strip unnecessary MOVQ/CVT instruction instrw overrides. llvm-svn: 330505	2018-04-21 14:03:40 +00:00
Simon Pilgrim	02fc375a22	[X86] Strip unnecessary MMX instruction instrw overrides from scheduler models. llvm-svn: 330503	2018-04-21 12:15:42 +00:00
Simon Pilgrim	c0f654f18e	[X86] Strip unnecessary x87 instruction instrw overrides from scheduler models. llvm-svn: 330501	2018-04-21 11:25:02 +00:00
Hiroshi Inoue	33486787cb	[PowerPC] fix incorrect vectorization of abs() on POWER9 Vectorized loops with abs() returns incorrect results on POWER9. This patch fixes it. For example the following code returns negative result if input values are negative though it sums up the absolute value of the inputs. int vpx_satd_c(const int16_t *coeff, int length) { int satd = 0; for (int i = 0; i < length; ++i) satd += abs(coeff[i]); return satd; } This problem causes test failures for libvpx. For vector absolute and vector absolute difference on POWER9, LLVM generates VABSDUW (Vector Absolute Difference Unsigned Word) instruction or variants. Since these instructions are for unsigned integers, we need adjustment for signed integers. For abs(sub(a, b)), we generate VABSDUW(a+0x80000000, b+0x80000000). Otherwise, abs(sub(-1, 0)) returns 0xFFFFFFFF(=-1) instead of 1. For abs(a), we generate VABSDUW(a+0x80000000, 0x80000000). Differential Revision: https://reviews.llvm.org/D45522 llvm-svn: 330497	2018-04-21 09:32:17 +00:00
Eli Friedman	0644130612	[AArch64] Don't crash trying to resolve __stack_chk_guard. In certain cases, the compiler might try to merge __stack_chk_guard with another global variable. (Or someone could theoretically define __stack_chk_guard as an alias.) In that case, make sure we don't crash. Differential Revision: https://reviews.llvm.org/D45746 llvm-svn: 330495	2018-04-21 00:07:46 +00:00
Shoaib Meenai	106df7dd20	[ObjCARC] Take BlockColors by const reference. NFC llvm-svn: 330489	2018-04-20 22:14:45 +00:00
Shoaib Meenai	d64b83266b	[ObjCARC] Account for funclet token in storeStrong transform When creating a call to storeStrong in ObjCARCContract, ensure the call gets the correct funclet token, otherwise WinEHPrepare will turn the call (and all subsequent instructions) into unreachable. We already have logic to do this for the ARC autorelease elision marker; factor that out into a common function that's used for both. These are the only two places in this transform that create call instructions. Differential Revision: https://reviews.llvm.org/D45857 llvm-svn: 330487	2018-04-20 22:11:03 +00:00
Simon Pilgrim	d14d2e7b18	[X86] Add WriteFSign/WriteFLogic scheduler classes Split the fp and integer vector logical instruction scheduler classes - older CPUs especially often handled these on different pipes. This unearthed a couple of things that are also handled in this patch: (1) We were tagging avx512 fp logic ops as WriteFAdd, probably because of the lack of WriteFLogic (2) SandyBridge had integer logic ops only using Port5, when afaict they can use Ports015. (3) Cleaned up x86 FCHS/FABS scheduling as they are typically treated as fp logic ops. Differential Revision: https://reviews.llvm.org/D45629 llvm-svn: 330480	2018-04-20 21:16:05 +00:00
Alex Shlyapnikov	99cf54baa6	[HWASan] Introduce non-zero based and dynamic shadow memory (LLVM). Summary: Support the dynamic shadow memory offset (the default case for user space now) and static non-zero shadow memory offset (-hwasan-mapping-offset option). Keeping the the latter case around for functionality and performance comparison tests (and mostly for -hwasan-mapping-offset=0 case). The implementation is stripped down ASan one, picking only the relevant parts in the following assumptions: shadow scale is fixed, the shadow memory is dynamic, it is accessed via ifunc global, shadow memory address rematerialization is suppressed. Keep zero-based shadow memory for kernel (-hwasan-kernel option) and calls instreumented case (-hwasan-instrument-with-calls option), which essentially means that the generated code is not changed in these cases. Reviewers: eugenis Subscribers: srhines, llvm-commits Differential Revision: https://reviews.llvm.org/D45840 llvm-svn: 330475	2018-04-20 20:04:04 +00:00
Sean Fertile	18f17333dd	[PartialInlining] Fix Crash from holding a reference to a destructed ORE. The callback used to create an ORE for the legacy PI pass caches the allocated object in a unique_ptr in the runOnModule function, and returns a reference to that object. Under certian circumstances we can end up holding onto that reference after the OREs destruction. Rather then allowing the new and legacy passes to create ORE object in diffrent ways, create the ORE at the point of use. Differential Revision: https://reviews.llvm.org/D43219 llvm-svn: 330473	2018-04-20 19:56:26 +00:00
Krzysztof Parzyszek	5061b37e9c	[Hexagon] hexagon-autohvx was left on again llvm-svn: 330472	2018-04-20 19:45:49 +00:00
Krzysztof Parzyszek	41a24b7b13	[Hexagon] Improve HVX instruction selection (bitcast, vsplat) There was some unfortunate interaction between VSPLAT and BITCAST related to the selection of constant vectors (coming from selecting shuffles). Introduce VSPLATW that always splats a 32-bit word, and can have arbitrary result type (to avoid BITCASTs of VSPLAT). Clean up the previous selection of BITCAST/VSPLAT. llvm-svn: 330471	2018-04-20 19:38:37 +00:00
Eric Christopher	aadbabc070	Remove unused argument from emitModuleMetadata. NFCI. llvm-svn: 330470	2018-04-20 19:07:57 +00:00
Krzysztof Parzyszek	642120122c	[Hexagon] Skip fixed-stack indexes in HexagonConstExtenders Fixed slots have negative values, and TRI::stackSlot2Index and TRI::index2StackSlot do not handle negative numbers. llvm-svn: 330468	2018-04-20 19:06:46 +00:00
Craig Topper	173d59b62e	[X86][SandyBridge] Remove duplciate InstRWs from Sandy Brige scheduler model. llvm-svn: 330465	2018-04-20 18:55:40 +00:00
Gabor Buella	31fa8025ba	[X86] WaitPKG instructions Three new instructions: umonitor - Sets up a linear address range to be monitored by hardware and activates the monitor. The address range should be a writeback memory caching type. umwait - A hint that allows the processor to stop instruction execution and enter an implementation-dependent optimized state until occurrence of a class of events. tpause - Directs the processor to enter an implementation-dependent optimized state until the TSC reaches the value in EDX:EAX. Also modifying the description of the mfence instruction, as the rep prefix (0xF3) was allowed before, which would conflict with umonitor during disassembly. Before: $ echo 0xf3,0x0f,0xae,0xf0 \| llvm-mc -disassemble .text mfence After: $ echo 0xf3,0x0f,0xae,0xf0 \| llvm-mc -disassemble .text umonitor %rax Reviewers: craig.topper, zvi Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D45253 llvm-svn: 330462	2018-04-20 18:42:47 +00:00
Jessica Paquette	2e5ada5c81	[MachineOutliner] Change B instruction for tail calls to TCRETURNdi First off, this is more correct than having the B. Second off, this was making a bot upset. This fixes that. Update the test to include -verify-machineinstrs as well to prevent stuff like this slipping by non debug/assert builds in the future. llvm-svn: 330459	2018-04-20 18:03:21 +00:00
Zachary Turner	194be871b9	[LLD/PDB] Emit first section contribution for DBI Module Descriptor. Part of the DBI stream is a list of variable length structures describing each module that contributes to the final executable. One member of this structure is a section contribution entry that describes the first section contribution in the output file for the given module. We have been leaving this structure unpopulated until now, so with this patch it is now filled out correctly. Differential Revision: https://reviews.llvm.org/D45832 llvm-svn: 330457	2018-04-20 18:00:46 +00:00
Nicholas Wilson	ef90ff36da	[WebAssembly] Distinguish debug/symbol names in the Wasm structs. NFC Differential Revision: https://reviews.llvm.org/D45021 llvm-svn: 330448	2018-04-20 17:07:24 +00:00
Michael Zolotukhin	e268304122	Revert r330431. There are still stage3/stage4 miscompares :( llvm-svn: 330446	2018-04-20 16:57:10 +00:00
Florian Hahn	773872fd67	[NewGVN] Split OpPHI detection and creation. It also adds a check making sure PHIs for operands are all in the same block. Patch by Daniel Berlin <dberlin@dberlin.org> Reviewers: dberlin, davide Differential Revision: https://reviews.llvm.org/D43865 llvm-svn: 330444	2018-04-20 16:37:13 +00:00
Andrew Ng	7a2fa74ab0	[DebugInfo] Use WithColor for more debug line warnings Updated two more debug line related warnings to use WithColor. This was necessary to ensure consistent output order of the warnings on Windows for debug line tests. Differential Revision: https://reviews.llvm.org/D45871 llvm-svn: 330440	2018-04-20 15:29:47 +00:00
Sanjay Patel	3d453ad711	[DAGCombine] (float)((int) f) --> ftrunc (PR36617) This was originally committed at rL328921 and reverted at rL329920 to investigate failures in Chrome. This time I've added to the ReleaseNotes to warn users of the potential of exposing UB and let me repeat that here for more exposure: Optimization of floating-point casts is improved. This may cause surprising results for code that is relying on undefined behavior. Code sanitizers can be used to detect affected patterns such as this: int main() { float x = 4294967296.0f; x = (float)((int)x); printf("junk in the ftrunc: %f\n", x); return 0; } $ clang -O1 ftrunc.c -fsanitize=undefined ; ./a.out ftrunc.c:5:15: runtime error: 4.29497e+09 is outside the range of representable values of type 'int' junk in the ftrunc: 0.000000 Original commit message: fptosi / fptoui round towards zero, and that's the same behavior as ISD::FTRUNC, so replace a pair of casts with the equivalent node. We don't have to account for special cases (NaN, INF) because out-of-range casts are undefined. Differential Revision: https://reviews.llvm.org/D44909 llvm-svn: 330437	2018-04-20 15:07:55 +00:00
Michael Zolotukhin	a2c9af0209	Revert "Revert r330403 and r330413." Reapply the patches with a fix. Thanks Ilya and Hans for the reproducer! This reverts commit r330416. The issue was that removing predecessors invalidated uses that we stored for rewrite. The fix is to finish manipulating with CFG before we select uses for rewrite. llvm-svn: 330431	2018-04-20 13:34:32 +00:00
Simon Pilgrim	df8fa6d734	[X86][BtVer2] Cleanup some old FIXMEs from the model. NFCI. llvm-svn: 330428	2018-04-20 13:12:04 +00:00
Simon Pilgrim	2f522ef13d	[X86] Tag CLDEMOTE instruction with WriteLoad scheduling class Same as other cacheline instructions llvm-svn: 330424	2018-04-20 12:54:53 +00:00
Sander de Smalen	30f9f11d51	[AArch64][SVE] Asm: Support for contiguous LD1 (scalar+scalar) load instructions. This is patch [4/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D45690 llvm-svn: 330423	2018-04-20 12:52:01 +00:00
Jonas Devlieghere	5c709eda07	[ObjectYAML] Add ability for DWARFYAML to calculate DIE lengths This patch adds the ability for the ObjectYAML DWARFEmitter to calculate the lengths of DIEs. This is accomplished by creating a DIEFixupVisitor class which traverses the DWARF DIEs to calculate and fix up the lengths in the Compile Unit header. The DIEFixupVisitor can be extended in the future to enable more complex fix ups which will enable simplified YAML string representations. This is also very useful when using the YAML format in unit tests because you no longer need to know the length of the compile unit when writing the YAML string. Differential commandeered from Chris Bieneman (beanz) Differential revision: https://reviews.llvm.org/D30666 llvm-svn: 330421	2018-04-20 12:33:49 +00:00
Ilya Biryukov	afe822bd6d	Revert r330403 and r330413. Revert r330413: "[SSAUpdaterBulk] Use SmallVector instead of DenseMap for storing rewrites." Revert r330403 "Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." one more time." r330403 commit seems to crash clang during our integrate while doing PGO build with the following stacktrace: #2 llvm::SSAUpdaterBulk::RewriteAllUses(llvm::DominatorTree, llvm::SmallVectorImpl<llvm::PHINode>) #3 llvm::JumpThreadingPass::ThreadEdge(llvm::BasicBlock, llvm::SmallVectorImpl<llvm::BasicBlock> const&, llvm::BasicBlock) #4 llvm::JumpThreadingPass::ProcessThreadableEdges(llvm::Value, llvm::BasicBlock, llvm::jumpthreading::ConstantPreference, llvm::Instruction) #5 llvm::JumpThreadingPass::ProcessBlock(llvm::BasicBlock) The crash happens while compiling 'lib/Analysis/CallGraph.cpp'. r3340413 is reverted due to conflicting changes. llvm-svn: 330416	2018-04-20 10:52:54 +00:00
Michael Zolotukhin	9dea079315	[SSAUpdaterBulk] Use SmallVector instead of DenseMap for storing rewrites. llvm-svn: 330413	2018-04-20 10:31:06 +00:00
Florian Hahn	d4332eb3b7	[LTO] Add stats-file option to LTO/Config.h. This patch adds a StatsFile option to LTO/Config.h and updates both LLVMGold and llvm-lto2 to set it. Reviewers: MatzeB, tejohnson, espindola Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D45531 llvm-svn: 330411	2018-04-20 10:18:36 +00:00
Sander de Smalen	137efb231e	[AArch64][SVE] Fix diagnostic for SVE LD4 instructions: Diagnostic: 'index must be multiple of 3 in range [-32, 28]' Must be: 'index must be multiple of 4 in range [-32, 28]' llvm-svn: 330407	2018-04-20 09:45:50 +00:00
Sander de Smalen	367694b093	[AArch64][SVE] Added GPR64shifted and GPR64NoXZRshifted register classes. Summary: This is patch [3/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: SjoerdMeijer Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45689 llvm-svn: 330406	2018-04-20 08:54:49 +00:00
Michael Zolotukhin	79e4f7fadb	Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." one more time. Hopefully, changing set to vector removes nondeterminism detected by some bots, or the new assert will catch something. This reverts commit r330180. llvm-svn: 330403	2018-04-20 08:01:08 +00:00
Michael Zolotukhin	26339b445a	[SSAUpdaterBulk] Add an assert. llvm-svn: 330402	2018-04-20 07:59:57 +00:00
Michael Zolotukhin	0df1d48ca9	[SSAUpdaterBulk] Add * and & to auto. llvm-svn: 330400	2018-04-20 07:58:54 +00:00
Michael Zolotukhin	bc843211fd	[SSAUpdaterBulk] Use PredCache in ComputeLiveInBlocks. llvm-svn: 330399	2018-04-20 07:57:24 +00:00
Michael Zolotukhin	79cb54b2d9	[SSAUpdaterBulk] Use SmallVector instead of SmallPtrSet for uses. llvm-svn: 330398	2018-04-20 07:56:00 +00:00
Daniel Cederman	4557178061	Revert "This pass, fixing an erratum in some LEON 2 processors..." Summary: Reading Atmel's AT697E errata document this does not seem like a valid workaround. While the text only mentions SDIV, it says that the ICC flags can be wrong, and those are only generated by SDIVcc. Verification on hardware shows that simply replacing SDIV with SDIVcc does not avoid the bug with negative operands. This reverts r283727. Reviewers: lero_chris, jyknight Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45813 llvm-svn: 330397	2018-04-20 07:53:27 +00:00
Daniel Cederman	c67b3ffba7	[Sparc] Use synthetic instruction clr to zero register instead of sethi Using `clr reg`/`mov %g0, reg`/`or %g0, %g0, reg` to zero a register looks much better than `sethi 0, reg`. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: eraman, fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45810 llvm-svn: 330396	2018-04-20 07:47:12 +00:00
Sander de Smalen	149916d29a	[AArch64][AsmParser] Extend RegOp with integrated 'shift/extend'. Summary: In some cases the shift/extend needs to be explicitly parsed together with the register, rather than as a separate operand. This is needed for addressing modes where the instruction as a whole dictates the scaling/extend, rather than specific bits in the instruction. By parsing them as a single operand, we avoid the need to pass an extra operand in all CodeGen patterns (because all operands need to have an associated value), and we avoid the need to update TableGen to accept operands that have no associated bits in the instruction. An added benefit of parsing them together is that the assembler can give a sensible diagnostic if the scaling is not correct. This is patch [2/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: fhahn, SjoerdMeijer Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45688 llvm-svn: 330394	2018-04-20 07:24:20 +00:00
Nicolai Haehnle	7a87977fb2	AMDGPU: Legalize the operand of SI_INIT_M0 Summary: This fixes a case where the argument to a sendmsg intrinsic ends up in a VGPR, for whatever reason. The underlying performance issue is that a multiplication that can be an s_mul_i32 is instead needlessly generated as v_mul_u32_u24, but this is not addressed by this patch. Change-Id: I61fd4034314d5acdf6074632c30b65364dfa7328 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45826 llvm-svn: 330393	2018-04-20 07:14:25 +00:00
Daniel Cederman	793af3b9f0	[Sparc] Fix addressing mode when using 64-bit values in inline assembly Summary: If a 64-bit register is used as an operand in inline assembly together with a memory reference, the memory addressing will be wrong. The addressing will be a single reg, instead of reg+reg or reg+imm. This will generate a bad offset value or an exception in printMemOperand(). For example: ``` long long int val = 5; long long int mem; __asm__ volatile ("std %1, %0":"=m"(mem):"r"(val)); ``` becomes: ``` std %i0, [%i2+589833] ``` The problem is that SelectInlineAsmMemoryOperand() is never called for the memory references if one of the operands is a 64-bit register. By calling SelectInlineAsmMemoryOperands() in tryInlineAsm() the Sparc version of SelectInlineAsmMemoryOperand() gets called for each memory reference. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: eraman, fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45761 llvm-svn: 330392	2018-04-20 06:57:49 +00:00
Vlad Tsyrklevich	230b256783	LowerTypeTests: Propagate symver directives Summary: This change fixes https://crbug.com/834474, a build failure caused by LowerTypeTests not preserving .symver symbol versioning directives for exported functions. Emit symver information to ThinLTO summary data and then propagate symver directives for exported functions to the merged module. Emitting symver information to the summaries increases the size of intermediate build artifacts for a Chromium build by less than 0.2%. Reviewers: pcc Reviewed By: pcc Subscribers: tejohnson, mehdi_amini, eraman, llvm-commits, eugenis, kcc Differential Revision: https://reviews.llvm.org/D45798 llvm-svn: 330387	2018-04-20 01:36:48 +00:00
Amara Emerson	6aacbf4d7c	Move a dump() implementation out of line. Fixes some link issues. llvm-svn: 330384	2018-04-20 00:42:46 +00:00
Jessica Paquette	1eca23bdd8	[MachineOutliner] NFC: Move EnableLinkOnceODROutlining into MachineOutliner.cpp This moves the EnableLinkOnceODROutlining flag from TargetPassConfig.cpp into MachineOutliner.cpp. It also removes OutlineFromLinkOnceODRs from the MachineOutliner constructor. This is now handled by the moved command-line flag. llvm-svn: 330373	2018-04-19 22:17:07 +00:00
Sam Clegg	f009da2448	[WebAssembly] Enabled -triple=wasm32-unknown-unknown-wasm path using ELF directive parser. This is a temporary solution until a proper WASM implementation of MCAsmParserExtension is in place, but at least for now will unblock this path. Added test to make sure this path works with the WASM Assembler. Patch By Wouter van Oortmerssen! Differential Revision: https://reviews.llvm.org/D45386 llvm-svn: 330370	2018-04-19 22:00:53 +00:00
Stanislav Mekhanoshin	160f85794d	[AMDGPU] Use packed literals with zero either lower or hi part Differential Revision: https://reviews.llvm.org/D45790 llvm-svn: 330365	2018-04-19 21:16:50 +00:00
Jin Lin	585f2699cf	Refine the loop rotation's API Summary: The following changes addresses the following two issues. 1) The existing loop rotation pass contains both loop latch simplification and loop rotation. So one flag RotationOnly is added to be passed to the loop rotation pass. 2) The threshold value is initialized with MAX_UINT since the loop rotation utility should not have threshold limit. Reviewers: dmgreen, efriedma Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D45582 llvm-svn: 330362	2018-04-19 20:29:43 +00:00
Lang Hames	ee68ec06a1	[ORC] Fix an assertion condition from r329934. Thanks to Alexander Ivchenko for finding the issue! llvm-svn: 330359	2018-04-19 19:30:35 +00:00
Craig Topper	bc895a3afc	[X86] Enable popcnt false dependency breaking on Silvermont and Goldmont. Silvermont and Goldmont have the same issue on popcnt as Sandy Bridge, Haswell, Broadwell, and Skylake. Believe it is fixed in Goldmont Plus. llvm-svn: 330358	2018-04-19 19:25:24 +00:00
Chandler Carruth	32e62f9c5b	[PM/LoopUnswitch] Detect irreducible control flow within loops and skip unswitching non-trivial edges. Summary: This fixes the bug pointed out in review with non-trivial unswitching. This also provides a basis that should make it pretty easy to finish fleshing out a routine to scan an entire function body for irreducible control flow, but this patch remains minimal for disabling loop unswitch. Reviewers: sanjoy, fedor.sergeev Subscribers: mcrosier, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45754 llvm-svn: 330357	2018-04-19 18:44:25 +00:00
Lang Hames	9bbd653084	[ORC] Make VSO symbol resolution/finalization operations private. This forces these operations to be carried out via a MaterializationResponsibility instance, ensuring responsibility is explicitly tracked. llvm-svn: 330356	2018-04-19 18:42:49 +00:00
Simon Pilgrim	4ba057dbd1	[X86][SLM] Fix typo using SandyBridge resources. Luckily this was on instructions not supported on Silvermont.... llvm-svn: 330351	2018-04-19 18:01:52 +00:00
Craig Topper	b5f2659130	[X86] Correct the scheduling data for register forms of XCHG and XADD on Intel CPUs. The XCHG16rr/XCHG32rr/XCHG64rr instructions should be 3 uops just like XCHG8rr. I believe they're just implemented as 3 move uops with a temporary register. XADD is probably 2 moves and an add also using a temporary register. Change the latency for both from 2 cycles to 3 cycles. Only 2 of the uops are serialized in their execution, the move into the temporary and the move out of the temporary. The move from one GPR to the other should be able to go in parallel with this if there are ALU resources available. llvm-svn: 330349	2018-04-19 18:00:17 +00:00
Sanjay Patel	a201787fd7	[Reassociate] fix formatting; NFC llvm-svn: 330348	2018-04-19 17:56:36 +00:00
Simon Pilgrim	5e492d29a3	[X86] Merge some MMX instregex There's a lot more but I'd prefer focussing on removing unnecessary InstRWs first. llvm-svn: 330347	2018-04-19 17:32:10 +00:00
Krzysztof Parzyszek	fbee8574ab	[if-converter] Handle BBs that terminate in ret during diamond conversion This fixes https://llvm.org/PR36825. Original patch by Valentin Churavy (D45218). Differential Revision: https://reviews.llvm.org/D45731 llvm-svn: 330345	2018-04-19 17:26:46 +00:00
Krzysztof Parzyszek	2a9a83cd3f	[Hexagon] Use legal types when lowering CONCAT_VECTORS via BUILD_VECTOR llvm-svn: 330344	2018-04-19 17:11:58 +00:00
Francis Visoiu Mistrih	1834682b97	[llvm-objdump] Print "..." instead of random data for virtual sections When disassembling with -D, skip virtual sections by printing "..." for each symbol. This patch also implements `MachOObjectFile::isSectionVirtual`. Test case comes from: ``` .zerofill __DATA,__common,_data64unsigned,472,3 ``` Differential Revision: https://reviews.llvm.org/D45824 llvm-svn: 330342	2018-04-19 17:02:57 +00:00
Mark Searles	1bc6e71f32	[AMDGPU] Do not only rely on BB number when finding bottom loop We should also check that the "bottom" basic block of a loopis a successor of the "header" basic block, otherwise we don't propagate the information correctly when the CFG is complex. This fixes an important rendering problem with Wolfsentein 2, because of one vector-memory wait was missing. Differential Revision: https://reviews.llvm.org/D43831 llvm-svn: 330337	2018-04-19 15:42:30 +00:00
Florian Hahn	b789165e6b	[NewGVN] Add ops as dependency if we cannot find a leader for ValueOp. If those operands change, we might find a leader for ValueOp, which could enable new phi-of-op creation. This fixes a case where we missed creating a phi-of-ops node. With D43865 and this patch, bootstrapping clang/llvm works with -enable-newgvn, whereas without it, the "value changed after iteration" assertion is triggered. Reviewers: dberlin, davide Reviewed By: dberlin Differential Revision: https://reviews.llvm.org/D42180 llvm-svn: 330334	2018-04-19 15:05:47 +00:00
Krzysztof Parzyszek	d92c37e090	[Hexagon] Generate code for vector bswap intrinsics llvm-svn: 330333	2018-04-19 14:46:44 +00:00
Simon Pilgrim	f21ace6cdd	[X86][BtVer2] Remove SSE4A EXTRQ/EXTRQI InstRW overrides. These are already handled identically by WriteALU. llvm-svn: 330332	2018-04-19 14:38:36 +00:00
Krzysztof Parzyszek	23bcf06a15	[Hexagon] Add/fix patterns for 32/64-bit vector compares and logical ops llvm-svn: 330330	2018-04-19 14:24:31 +00:00
Simon Dardis	5d61c8b225	[mips] Correct the definitions of the unaligned word memory operation instructions These instructions lacked the correct predicates, were not marked as loads and stores and lacked the proper instruction mapping information. In the case of microMIPS sw(l\|r)e (EVA) these instructions were using the load EVA description. Reviewers: abeserminji, smaksimovic, atanasyan Differential Revision: https://reviews.llvm.org/D45626 llvm-svn: 330326	2018-04-19 13:33:51 +00:00
Alexander Ivchenko	e8fed1546e	Lowering x86 adds/addus/subs/subus intrinsics (llvm part) This is the patch that lowers x86 intrinsics to native IR in order to enable optimizations. The patch also includes folding of previously missing saturation patterns so that IR emits the same machine instructions as the intrinsics. Patch by tkrupa Differential Revision: https://reviews.llvm.org/D44785 llvm-svn: 330322	2018-04-19 12:13:30 +00:00
Simon Pilgrim	3c06617f0e	[X86][FMA] Remove FMA reg-reg InstRW scheduler overrides. These are all already handled identically by WriteFMA. llvm-svn: 330319	2018-04-19 11:37:26 +00:00
Simon Pilgrim	33dede9075	[X86][BtVer2] Remove 128-bit F16C InstRW overrides. These are already handled identically by WriteCvtF2F. llvm-svn: 330318	2018-04-19 11:16:33 +00:00
Florian Hahn	147fc016e3	[BasicBlock] Add instructionsWithoutDebug methods to skip debug insts. Reviewers: aprantl, vsk, mattd, chandlerc Reviewed By: aprantl, vsk Differential Revision: https://reviews.llvm.org/D45657 llvm-svn: 330316	2018-04-19 09:48:07 +00:00
Simon Dardis	fdc052686c	[mips] Guard some macro expansions properly Reviewers: atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D45565 llvm-svn: 330315	2018-04-19 09:45:04 +00:00
Sander de Smalen	50d8702f26	[AArch64][AsmParser] NFC: Cleanup parsing of scalar registers. Summary: - Renamed tryParseRegister to tryParseScalarRegister, which now returns an OperandMatchResultTy. - Moved matching of certain aliases into matchRegisterNameAlias. - Changed type of most 'Reg' variables to 'unsigned'. This is patch [1/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro, samparker Reviewed By: samparker Subscribers: samparker, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D45687 llvm-svn: 330311	2018-04-19 07:35:08 +00:00
Craig Topper	f846e2d1b1	[X86] Scrub scheduling information for MUL/IMUL on Intel CPUs. This removes a bunch of unnecessary InstRW overrides. It also cleans up the missing information from the Sandy Bridge model. Other fixes to other models. llvm-svn: 330308	2018-04-19 05:34:05 +00:00
Bob Haarman	cb80a3fce0	Fix data race in X86FloatingPoint.cpp ASSERT_SORTED Summary: ASSERT_SORTED checks if a table is sorted, and uses a boolean to prevent the check from being run again if it was earlier determined that the table is in fact sorted. Unsynchronized reads and writes of that boolean triggered ThreadSanitizer's data race detection. This change rewrites the code to use std::atomic<bool> instead. Fixes PR36922. Reviewers: rnk Reviewed By: rnk Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D45742 llvm-svn: 330301	2018-04-18 23:04:09 +00:00
Craig Topper	ebf52e80c1	[X86] Correct the Defs, Uses, hasSideEffects, mayLoad, mayStore for XCHG and XADD instructions. I don't think we emit any of these from codegen except for using XCHG16ar as 2 byte NOP. llvm-svn: 330298	2018-04-18 22:07:53 +00:00
Artem Belevich	0ae8590354	[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions. The new instructions were added added for sm_70+ GPUs in CUDA-9.1. Differential Revision: https://reviews.llvm.org/D45068 llvm-svn: 330296	2018-04-18 21:51:48 +00:00
Alex Bradbury	3ff2022bb9	[RISCV] Introduce pattern for materialising immediates with 0 for lower 12 bits These immediates can be materialised with just an lui, rather than an lui+addi pair. llvm-svn: 330293	2018-04-18 20:34:23 +00:00
Craig Topper	04244cbf45	[X86] Fix the Uses/Defs,mayLoad,mayStore,hasSideEffects flags for the CMPXCHG instructions. The compiler only emits the locked version of these which use different instruction definitions. The versions fixed here are only used by the assembler/disassembler. llvm-svn: 330287	2018-04-18 20:15:00 +00:00
Alex Bradbury	099c720426	Revert "[RISCV] implement li pseudo instruction" Reverts rL330224, while issues with the C extension and missed common subexpression elimination opportunities are addressed. Neither of these issues are visible in current RISC-V backend unit tests, which clearly need expanding. llvm-svn: 330281	2018-04-18 19:02:31 +00:00
Lei Huang	192c6ccf6d	[Power9]Legalize and emit code for converting Unsigned HWord/Char to Quad-Precision Legalize and emit code for converting unsigned HWord/Char to QP: xscvsdqp xscvudqp Only covering patterns for unsigned forms cause we don't have part-word sign-extending integer loads into VSX registers. Differential Revision: https://reviews.llvm.org/D45494 llvm-svn: 330278	2018-04-18 17:41:46 +00:00
Amara Emerson	9de072f8ae	[AArch64] Add isel pattern for v8i8->v2f32 NVCASTs. rdar://39454635 llvm-svn: 330276	2018-04-18 17:10:19 +00:00
Lei Huang	198e678576	[Power9]Legalize and emit code for converting (Un)Signed Word to Quad-Precision Legalize and emit code for converting (Un)Signed Word to quad-precision via: xscvsdqp xscvudqp Differential Revision: https://reviews.llvm.org/D45389 llvm-svn: 330273	2018-04-18 16:34:22 +00:00
Alexey Bataev	242706b8d1	[DEBUG] Initial adaptation of NVPTX target for debug info emission. Summary: Patch adds initial emission of the debug info for NVPTX target. Currently, only .file and .loc directives are emitted, everything else is commented out to not break the compilation of Cuda. Reviewers: echristo, jlebar, tra, jholewinski Subscribers: mgorny, aprantl, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D41827 llvm-svn: 330271	2018-04-18 16:13:41 +00:00
Chandler Carruth	ccd3ecb95a	[x86] Switch EFLAGS copy lowering to use reg-reg form of testing for a zero register. Previously I tried this and saw LLVM unable to transform this to fold with memory operands such as spill slot rematerialization. However, it clearly works as shown in this patch. We turn these into `cmpb $0, <mem>` when useful for folding a memory operand without issue. This form has no disadvantage compared to `testb $-1, <mem>`. So overall, this is likely no worse and may be slightly smaller in some cases due to the `testb %reg, %reg` form. Differential Revision: https://reviews.llvm.org/D45475 llvm-svn: 330269	2018-04-18 15:52:50 +00:00
Aaron Smith	02caafd7e5	[support] Revert the changes made to Path.inc for the default Windows code page Path.inc/widenPath tries to decode the path using both UTF-8 and the default Windows code page. This is no longer necessary with the new InitLLVM method which ensures that the command line arguemnts are already UTF-8 on Windows. llvm-svn: 330266	2018-04-18 15:26:26 +00:00
Chandler Carruth	1f87618f8f	[x86] Fix PR37100 by teaching the EFLAGS copy lowering to rewrite uses across basic blocks in the limited cases where it is very straight forward to do so. This will also be useful for other places where we do some limited EFLAGS propagation across CFG edges and need to handle copy rewrites afterward. I think this is rapidly approaching the maximum we can and should be doing here. Everything else begins to require either heroic analysis to prove how to do PHI insertion manually, or somehow managing arbitrary PHI-ing of EFLAGS with general PHI insertion. Neither of these seem at all promising so if those cases come up, we'll almost certainly need to rewrite the parts of LLVM that produce those patterns. We do now require dominator trees in order to reliably diagnose patterns that would require PHI nodes. This is a bit unfortunate but it seems better than the completely mysterious crash we would get otherwise. Differential Revision: https://reviews.llvm.org/D45673 llvm-svn: 330264	2018-04-18 15:13:16 +00:00
Sanjay Patel	b2ab3f28d5	[SimplifyLibcalls] Realloc(null, N) -> Malloc(N) Patch by Dávid Bolvanský! Differential Revision: https://reviews.llvm.org/D45413 llvm-svn: 330259	2018-04-18 14:21:31 +00:00
David Stuttard	31f482c26b	[AMDGPU] Fix issues for backend divergence tracking Summary: A change to use divergence analysis in the AMDGPU backend was getting formal arguments incorrect (not tagged as divergent) unless they were VGPR0, VGPR1 or VGPR2 For graphics shaders it is possible to have more than these passed in as VGPR Modified the checking code to check for any VGPR registers passed in as formal arguments. Also, some intrinsics that are sources of divergence may have been lowered during instruction selection and are missed on subsequent calls to isSDNodeSourceOfDivergence - added the relevant AMDGPUISD checks as well. Finally, the FunctionLoweringInfo tracks virtual registers that are live across basic block boundaries. This is used to check for divergence of CopyFromRegister registers using the DivergenceAnalysis analysis. For multiple blocks the lazily evaluated inverted map VirtReg2Value was not cleared when the ValueMap map was. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45372 Change-Id: I112f3bd6dfe0f62e63ce9b43b893982778e4bee3 llvm-svn: 330257	2018-04-18 13:53:31 +00:00
Sam Parker	3c19051bf0	[IRCE] Only check for NSW on equality predicates After investigation discussed in D45439, it would seem that the nsw flag restriction is unnecessary in most cases. So the IsInductionVar lambda has been removed, the functionality extracted, and now only require nsw when using eq/ne predicates. Differential Revision: https://reviews.llvm.org/D45617 llvm-svn: 330256	2018-04-18 13:50:28 +00:00
Pavel Labath	8f5a456eb2	[cmake] Improve pthread_[gs]etname_np detection code Summary: Due to some android peculiarities, in some build configurations (statically linked executables targeting older releases) we could detect the presence of these functions (because they are present in libc.a, where check_library_exists searches), but then fail to build because the headers did not include the definition. This attempts to remedy that by upgrading the check_library_exists to check_symbol_exists, which will check that the function is declared too. I am hoping that a more thorough check will make the messy #ifdef we have accumulated in the code obsolete, so I optimistically try to remove them. Reviewers: zturner, kparzysz, danalbert Subscribers: srhines, mgorny, krytarowski, llvm-commits Differential Revision: https://reviews.llvm.org/D45359 llvm-svn: 330251	2018-04-18 13:13:27 +00:00
Florian Hahn	ac27758895	[LoopUnroll] Only peel if a predicate becomes known in the loop body. If a predicate does not become known after peeling, peeling is unlikely to be beneficial. Reviewers: mcrosier, efriedma, mkazantsev, junbuml Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D44983 llvm-svn: 330250	2018-04-18 12:29:24 +00:00
Pavel Labath	3fb39c79ed	[CodeGen/Dwarf] Make debug_names compatible with split-dwarf Summary: Previously we crashed for the combination of the two features because we tried to reference the dwo CU from the main object file. The fix consists of two items: - reference the skeleton CU from the name index (the consumer is expected to use the skeleton CU to find the real data). - use the main object file string pool for the strings in the index Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45566 llvm-svn: 330249	2018-04-18 12:11:59 +00:00
Bjorn Pettersson	bc4f19b6bd	[DebugInfo] Sink related dbg users when sinking in InstCombine Summary: When sinking an instruction in InstCombine we now also sink the DbgInfoIntrinsics that are using the sunken value. Example) When sinking the load in this input bb.X: %0 = load i64, i64* %start, align 4, !dbg !31 tail call void @llvm.dbg.value(metadata i64 %0, ...) br i1 %cond, label %for.end, label %for.body.lr.ph for.body.lr.ph: br label %for.body we now also move the dbg.value, like this bb.X: br i1 %cond, label %for.end, label %for.body.lr.ph for.body.lr.ph: %0 = load i64, i64* %start, align 4, !dbg !31 tail call void @llvm.dbg.value(metadata i64 %0, ...) br label %for.body In the past we haven't moved the dbg.value so we got bb.X: tail call void @llvm.dbg.value(metadata i64 %0, ...) br i1 %cond, label %for.end, label %for.body.lr.ph for.body.lr.ph: %0 = load i64, i64* %start, align 4, !dbg !31 br label %for.body So in the past we got a debug-use before the def of %0. And that dbg.value was also on the path jumping to %for.end, for which %0 never was defined. CodeGenPrepare normally comes to rescue later (when not moving the dbg.value), since it moves dbg.value instrinsics quite brutally, without really analysing if it is correct to move the intrinsic (see PR31878). So at the moment this patch isn't expected to have much impact, besides that it is moving the dbg.value already in opt, making the IR look more sane directly. This can be seen as a preparation to (hopefully) make it possible to turn off CodeGenPrepare::placeDbgValues later as a solution to PR31878. I also adjusted test/DebugInfo/X86/sdagsplit-1.ll to make the IR in the test case up-to-date with this behavior in InstCombine. Reviewers: rnk, vsk, aprantl Reviewed By: vsk, aprantl Subscribers: mattd, JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D45425 llvm-svn: 330243	2018-04-18 08:08:04 +00:00
Craig Topper	dfccafe18a	[X86][Broadwell] Remove some unnecessary InstRW overrides and add some FIXMEs. llvm-svn: 330241	2018-04-18 06:41:25 +00:00
Craig Topper	513e11bb70	[X86] Give CMOV 2 cycle latency on SLM. llvm-svn: 330239	2018-04-18 06:04:30 +00:00
Craig Topper	8704612481	[X86] Don't crash on bad operand modifiers in inline assembly Summary: Previously if a modifer was placed on a non-GPR register class we would hit an assert or crash. Reviewers: echristo Reviewed By: echristo Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D45751 llvm-svn: 330238	2018-04-18 05:15:24 +00:00
Sanjay Patel	aea15131db	[InstCombine] peek through bitcasted vector/array pointer GEP operand The bitcast may be interfering with other combines or vectorization as shown in PR16739: https://bugs.llvm.org/show_bug.cgi?id=16739 Most pointer-related optimizations are probably able to look through this bitcast, but removing the bitcast shrinks the IR, so it's at least a size savings. Differential Revision: https://reviews.llvm.org/D44833 llvm-svn: 330237	2018-04-18 00:36:40 +00:00
Bob Haarman	37a9269cc7	Fix lock order inversion between ManagedStatic and Statistic Summary: Statistic and ManagedStatic both use mutexes. There was a lock order inversion where, during initialization, Statistic's mutex would be held while taking ManagedStatic's, and in llvm_shutdown, ManagedStatic's mutex would be held while taking Statistic's mutex. This change causes Statistic's initialization code to avoid holding its mutex while calling ManagedStatic's methods, avoiding the inversion. Reviewers: dsanders, rtereshin Reviewed By: dsanders Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45398 llvm-svn: 330236	2018-04-17 23:37:18 +00:00
Stanislav Mekhanoshin	8b20b7dc2b	[AMDGPU] Enabled v2.16 literals for VOP3P Literal encoding needs op_sel_hi to select low 16 bit in this case. Differential Revision: https://reviews.llvm.org/D45745 llvm-svn: 330230	2018-04-17 23:09:05 +00:00
Vedant Kumar	b0585893cc	[Mem2Reg] Create merged debug locations for inserted phis Track the debug locations of the incoming values to newly-created phis, and apply merged debug locations to the phis. A merged location will be on line 0, but will have the correct scope set. This improves crash reporting when an inlined instruction with a merged location triggers a machine exception. A debugger will be able to narrow down the crash to the correct inlined scope, instead of simply pointing to the outer scope of the caller. Taken together with a change allows generating merged line-0 locations for instructions which aren't calls, this results in a 0.5% increase in the uncompressed size of the .debug_line section of a stage2+Release build of clang (-O3 -g). rdar://33858697 Differential Revision: https://reviews.llvm.org/D45397 llvm-svn: 330227	2018-04-17 22:03:08 +00:00
Vedant Kumar	4b29172d09	[Mem2Reg] Make RenamePassData a struct, NFC llvm-svn: 330226	2018-04-17 22:03:07 +00:00
Alex Bradbury	480b7bc906	[RISCV] implement li pseudo instruction The implementation follows the MIPS backend and expands the pseudo instruction directly during asm parsing. As the result, only real MC instructions are emitted to the MCStreamer. Additionally, PseudoLI instructions are emitted during codegen. The actual expansion to real instructions is performed during MI to MC lowering and is similar to the expansion performed by the GNU Assembler. Differential Revision: https://reviews.llvm.org/D41949 Patch by Mario Werner. llvm-svn: 330224	2018-04-17 21:56:40 +00:00
Stanislav Mekhanoshin	0bee630814	LoadStoreVectorizer crashes due to unsized type When we skip bitcasts while looking for GEP in LoadSoreVectorizer we should also verify that the type is sized otherwise we assert Differential Revision: https://reviews.llvm.org/D45709 llvm-svn: 330221	2018-04-17 21:40:04 +00:00
Keith Wyss	3d86823f3d	[XRay] Typed event logging intrinsic Summary: Add an LLVM intrinsic for type discriminated event logging with XRay. Similar to the existing intrinsic for custom events, but also accepts a type tag argument to allow plugins to be aware of different types and semantically interpret logged events they know about without choking on those they don't. Relies on a symbol defined in compiler-rt patch D43668. I may wait to submit before I can see demo everything working together including a still to come clang patch. Reviewers: dberris, pelikan, eizan, rSerge, timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45633 llvm-svn: 330219	2018-04-17 21:30:29 +00:00
Heejin Ahn	2b8158f441	[WebAssembly] Add an assertion for an invalid CFG Summary: It was not easy to provide a test case for D45648 (rL330079) because the bug didn't manifest itself in the set of currently valid IRs. Added an assertion to check this faster, thanks to @dblaikie's suggestion. Reviewers: dblaikie Subscribers: jfb, dschuff, sbc100, jgravelle-google, llvm-commits, dblaikie Differential Revision: https://reviews.llvm.org/D45711 llvm-svn: 330217	2018-04-17 21:19:21 +00:00
Rui Ueyama	e6ac9f5ec3	Rename sys::Process::GetArgumentVector -> sys::windows::GetCommandLineArguments GetArgumentVector (or GetCommandLineArguments) is very Windows-specific. I think it doesn't make much sense to provide that function from sys::Process. I also made a change so that the function takes a BumpPtrAllocator instead of a SpecificBumpPtrAllocator. The latter is the class to call dtors, but since char * is trivially destructible, we should use the former class. Differential Revision: https://reviews.llvm.org/D45641 llvm-svn: 330216	2018-04-17 21:09:16 +00:00
Dan Gohman	4576dc06be	[WebAssembly] Teach fast-isel to gracefully recover from illegal return types. Fixes PR36564. llvm-svn: 330215	2018-04-17 20:46:42 +00:00
Zachary Turner	bee6c22414	[llvm-pdbutil] Dump first section contribution for each module. The DBI stream contains a list of module descriptors. At the beginning of each descriptor is a structure representing the first section contribution in the output file for that module. LLD currently doesn't fill out this structure at all, but link.exe does. So as a precursor to emitting this data in LLD, we first need a way to dump it so that it can be checked. This patch adds support for the dumping, and verifies via a test that LLD emits bogus information. llvm-svn: 330208	2018-04-17 20:06:43 +00:00
Craig Topper	e56a2fc5e7	[X86] Add separate scheduling class for PSADBW instruction. llvm-svn: 330204	2018-04-17 19:35:19 +00:00
Craig Topper	655e1db722	[X86] Remove unnecessary InstRW overrides. Add somes FIXMEs/TODOs. llvm-svn: 330203	2018-04-17 19:35:14 +00:00
Krzysztof Parzyszek	cc71291731	[Hexagon] Do not merge initializers for stack and non-stack expressions Stack addressing needs addressing modes that provide an offset field immediately following the frame index. An initializer from a non-stack addressing could force the stack address to use a form that does not provide an offset field. llvm-svn: 330191	2018-04-17 15:23:09 +00:00
whitequark	31dff5337f	[LLVM-C] [PR34633] Avoid calling ->dump() methods from LLVMDump. LLVMDump functions are available in Release builds too. Patch by Brenton Bostick. Differential Revision: https://reviews.llvm.org/D44600 llvm-svn: 330189	2018-04-17 14:52:43 +00:00
Momchil Velikov	e256ab847b	Fix incorrect choice of callee-saved registers save/restore points Make the shrink wrapping pass pay attention to uses/defs of the stack pointer. Differential revision: https://reviews.llvm.org/D45524 llvm-svn: 330183	2018-04-17 08:37:38 +00:00
Michael Zolotukhin	21458fdc55	Revert "Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." again." This reverts r330175. There are still stage3/stage4 miscompares. llvm-svn: 330180	2018-04-17 07:31:27 +00:00
Simon Pilgrim	86e3c26924	[X86] Add FP comparison scheduler classes Split VCMP/VMAX/VMIN instructions off to WriteFCmp and VCOMIS instructions off to WriteFCom instead of assuming they match WriteFAdd Differential Revision: https://reviews.llvm.org/D45656 llvm-svn: 330179	2018-04-17 07:22:44 +00:00
Gerolf Hoflehner	5b4a67af1b	[DAGCombiner] Fix for oss-fuzz bug llvm-svn: 330178	2018-04-17 07:22:34 +00:00
Michael Zolotukhin	a6e7bd7001	[SSAUpdaterBulk] Add debug logging. llvm-svn: 330176	2018-04-17 04:45:40 +00:00
Michael Zolotukhin	3f5fd1b129	Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." again. One more, hopefully the last, bug is fixed: when forming UsesToRewrite we should ignore phi operands coming from edges that we want to delete. This reverts r329910. llvm-svn: 330175	2018-04-17 04:45:22 +00:00
Gerolf Hoflehner	1c3a07834e	[IR] Upgrade comment token in objc retain release marker for asm call Older compiler issued '#' instead of ';' llvm-svn: 330173	2018-04-17 04:02:24 +00:00
Peter Collingbourne	2f6d00612d	COFF: Make SectionChunk::Relocs field an ArrayRef. NFCI. Differential Revision: https://reviews.llvm.org/D45714 llvm-svn: 330172	2018-04-17 01:54:34 +00:00
Roman Tereshin	7a44782c73	[DebugInfo] Follow-up bug fix on "Fixing a couple of DI duplication bugs of CloneModule" Apparently, DebugInfoFinder::processCompileUnit doesn't process all of the possible kinds of DIImportedEntit'ies, e.g. DIGlobalVariable's. Previously introduced `llvm_unreachable` is therefore incorrect. Removing it here. llvm-svn: 330167	2018-04-16 23:39:44 +00:00
Zachary Turner	d8d97de514	[PDB] Correctly use the target machine when writing DBI stream. Using Config->is64() will treat ARM64 as Amd64, which is incorrect. Furthermore, there are more esoteric architectures that could theoretically be encountered. Just set it directly to the machine type, which we already know anyway. llvm-svn: 330157	2018-04-16 20:42:06 +00:00
Mandeep Singh Grang	88a8b269b4	[RISCV] Fix assert message operator Summary: Specifying assert message with an \|\| operator makes the compiler interpret it as a bool. Changed it to &&. Reviewers: asb, apazos Reviewed By: asb Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, sabuasal, niosHD, kito-cheng, shiva0217, zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D45660 llvm-svn: 330148	2018-04-16 18:56:10 +00:00
Zachary Turner	e3fe669855	Resubmit "Fix some incorrect fields in our generated PDBs." This fixes the failing tests. They simply hadn't been updated to match the new output resulting from this patch. llvm-svn: 330145	2018-04-16 18:17:13 +00:00
Haicheng Wu	f7466f3164	[SLP] Use getExtractWithExtendCost() to compute the scalar cost of extractelement/ext pair We use getExtractWithExtendCost to calculate the cost of extractelement and s\|zext together when computing the extract cost after vectorization, but we calculate the cost of extractelement and s\|zext separately when computing the scalar cost which is larger than it should be. Differential Revision: https://reviews.llvm.org/D45469 llvm-svn: 330143	2018-04-16 18:09:49 +00:00
Lang Hames	67feadb2c7	[ORC] Add a MaterializationResponsibility class to track responsibility for materializing function definitions. MaterializationUnit instances are responsible for resolving and finalizing symbol definitions when their materialize method is called. By contract, the MaterializationUnit must materialize all definitions it is responsible for and no others. If it can not materialize all definitions (because of some error) then it must notify the associated VSO about each definition that could not be materialized. The MaterializationResponsibility class tracks this responsibility, asserting that all required symbols are resolved and finalized, and that no extraneous symbols are resolved or finalized. In the event of an error it provides a convenience method for notifying the VSO about each definition that could not be materialized. llvm-svn: 330142	2018-04-16 18:05:24 +00:00
Lang Hames	ffeacb7b1a	[ORC] Merge VSO notifyResolutionFailed and notifyFinalizationFailed in to notifyMaterializationFailed. The notifyMaterializationFailed method can determine which error to raise by looking at which queue the pending queries are in (resolution or finalization). llvm-svn: 330141	2018-04-16 18:05:22 +00:00
Krzysztof Parzyszek	c546434e08	[Hexagon] Turn off flag enabling auto-vectorization It was turned on for testing and was accidentally left on in the commit. llvm-svn: 330139	2018-04-16 17:35:30 +00:00
Lei Huang	42ab1d3d03	[NFC] Move verificaiton check for f128 conversion into LowerINT_TO_FP() Move veriication check for legal conversions to f128 into LowerINT_TO_FP() and fix some indentations to match other sections of the code for readability. llvm-svn: 330138	2018-04-16 17:30:24 +00:00
Sanjay Patel	f4c4fc77cd	[InstCombine] simplify code in SimplifyAssociativeOrCommutative; NFCI llvm-svn: 330137	2018-04-16 17:15:13 +00:00
Craig Topper	f864250517	[Attributes] Fix a bug in AttributeList::get so it can handle a mix of FunctionIndex and ReturnIndex/arg indices at the same time The code uses the index of the last element in the sorted array to determine the maximum size needed for the vector. But if the last index is a FunctionIndex(~0), attrIdxToArrayIdx will return 0 and the vector will have size 1. If there are any indices before FunctionIndex, those values would return a value larger than 0 from attrIdxToArrayIdx. So in this case we need to look in front of the FunctionIndex to get the true size needed. Differential Revision: https://reviews.llvm.org/D45632 llvm-svn: 330136	2018-04-16 17:05:01 +00:00
Zachary Turner	52c80e3860	Revert "Fix some incorrect fields in our generated PDBs." There are a couple of failing tests which slipped under my radar so I'm reverting this while I attempt to fix. llvm-svn: 330133	2018-04-16 16:55:41 +00:00
Brock Wyma	94ece8fbc9	[CodeView] Initial support for emitting S_THUNK32 symbols for compiler... When emitting CodeView debug information, compiler-generated thunk routines should be emitted using S_THUNK32 symbols instead of S_GPROC32_ID symbols so Visual Studio can properly step into the user code. This initial support only handles standard thunk ordinals. Differential Revision: https://reviews.llvm.org/D43838 llvm-svn: 330132	2018-04-16 16:53:57 +00:00
Zachary Turner	1b06cc7817	Fix some incorrect fields in our generated PDBs. Most of these are pretty trivial and obvious. Setting the toolchain version to 14.11 is perhaps a little questionable, but we've been bitten in the past where one of our version fields sidn't match MSVC's, and I definitely don't want to go through that diagnosis again as it was pretty time consuming and hard to track down. I found all of these by using llvm-pdbutil export to dump the dbi and pdb streams to a file, then using fc followed by llvm-pdbutil explain to explain the mismatched bytes. There are still some more, these are just the low hanging fruit. Differential Revision: https://reviews.llvm.org/D45276 llvm-svn: 330130	2018-04-16 16:27:49 +00:00
Sanjay Patel	d93b8a0740	[InstCombine] simplify getBinOpsForFactorization(); NFC llvm-svn: 330129	2018-04-16 15:19:24 +00:00
Sanjay Patel	1170daa277	[InstCombine] simplify fneg+fadd folds; NFC Two cleanups: 1. As noted in D45453, we had tests that don't need FMF that were misplaced in the 'fast-math.ll' test file. 2. This removes the final uses of dyn_castFNegVal, so that can be deleted. We use 'match' now. llvm-svn: 330126	2018-04-16 14:13:57 +00:00
Sanjay Patel	77e990d887	[InstCombine] fix formatting; NFC llvm-svn: 330124	2018-04-16 13:21:15 +00:00
Dmitry Preobrazhensky	4c45e6ff0e	[AMDGPU][MC][VI][GFX9] Added support of SDWA/DPP for v_cndmask_b32 See bug 36356: https://bugs.llvm.org/show_bug.cgi?id=36356 Differential Revision: https://reviews.llvm.org/D45446 Reviewers: artem.tamazov, arsenm, timcorringham llvm-svn: 330123	2018-04-16 12:41:38 +00:00
Sander de Smalen	7a210db81e	[AArch64][SVE] Asm: Support for structured LD4 (scalar+imm) load instructions. Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: rengolin Subscribers: tschuett, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D45624 llvm-svn: 330120	2018-04-16 10:46:18 +00:00
Sander de Smalen	d239eb3ce3	[AArch64][SVE] Asm: Support for structured LD3 (scalar+imm) load instructions. Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45623 llvm-svn: 330116	2018-04-16 10:10:48 +00:00
Stefan Maksimovic	86d638ecdf	[mips] Restrict certain trap instructions for micromipsr6 Instructions removed from micromipsr6: teqi, tgei, tgeiu, tlti, tltiu, tnei Differential Revision: https://reviews.llvm.org/D45318 llvm-svn: 330114	2018-04-16 09:22:20 +00:00
Puyan Lotfi	14b6637edc	[MIR-Canon] Adding ISA-Agnostic COPY Folding. Transforms the following: %vreg1234:gpr32 = COPY %42 %vreg1235:gpr32 = COPY %vreg1234 %vreg1236:gpr32 = COPY %vreg1235 $w0 = COPY %vreg1236 into: $w0 = COPY %42 Assuming %42 is also a gpr32 llvm-svn: 330113	2018-04-16 09:03:03 +00:00
Puyan Lotfi	6ea89b4041	[NFC][MIR-Canon] clang-format cleanup of Mir Canonicalizer Pass. llvm-svn: 330111	2018-04-16 08:12:15 +00:00

... 2 3 4 5 6 ...

112786 Commits