llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Bradbury	ce9049952f	[RISCV][NFCI] Handle redundant splitf64+buildpairf64 pairs during instruction selection Although we can't write a tablegen pattern to remove redundant splitf64+buildf64 pairs due to the multiple return values, we can handle it with some C++ selection code. This is simpler than removing them after instruction selection through RISCVDAGToDAGISel::PostprocessISelDAG, as was done previously. llvm-svn: 343712	2018-10-03 20:12:10 +00:00
Craig Topper	703fbde3cb	[X86] Add CMOV pseudos for VR128X and VR256X register classes. Use them when AVX512VL is enabled. This allows the phi nodes to be generated with the correct register class when expanded. llvm-svn: 343710	2018-10-03 19:48:26 +00:00
Craig Topper	4b62c2dbda	[X86] Don't break CMOV pseudo instructions down by type. Just by register class. The register class is all that's important for the pseudo instructions. We can use patterns to handle the different types. llvm-svn: 343709	2018-10-03 19:48:23 +00:00
Simon Pilgrim	aabd99c27a	[X86] PUSH/POP 'mem-mem' instructions are not RMW - these are 2 different addresses This patch adds a 'WriteCopy' [WriteLoad, WriteStore] schedule sequence instead to better model the behaviour Found by @andreadb during llvm-mca testing on btver2 which was crashing on "zero uop" WriteRMW only instructions llvm-svn: 343708	2018-10-03 19:02:38 +00:00
Matthew Voss	f8ab35a4f4	Emit template type and value parameter DIEs for template variables. Summary: Ensure the TemplateParam attribute of the DIGlobalVariable node is translated into the proper DIEs. Resolves https://bugs.llvm.org/show_bug.cgi?id=22119 Reviewers: dblaikie, probinson, aprantl, JDevlieghere, clayborg, whitequark, deadalnix Reviewed By: dblaikie Subscribers: llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D52057 llvm-svn: 343706	2018-10-03 18:44:53 +00:00
Simon Pilgrim	b80d27a916	[X86] Move Atomic binops to use WriteALURMW schedule class These were being tagged as <WriteALULd, WriteRMW> instead of properly using the RMW sequence llvm-svn: 343705	2018-10-03 18:38:28 +00:00
Simon Pilgrim	0b451a2983	[X86][Btver2] Fix MMX PSHUFB schedule Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343701	2018-10-03 18:18:50 +00:00
Simon Pilgrim	a400612aed	[X86] Move Atomic CMPXCHG to WriteCMPXCHGRMW schedule class llvm-svn: 343700	2018-10-03 18:05:01 +00:00
Simon Pilgrim	2c59475c06	[X86] Add SkylakeClient uops counter - same as the other Intel models. llvm-svn: 343697	2018-10-03 16:45:26 +00:00
Daniel Sanders	10dedc00d0	Correct implementation of -verify-machineinstrs such that it's still overridable for EXPENSIVE_CHECKS -verify-machineinstrs was implemented as a simple bool. As a result, the 'VerifyMachineCode == cl::BOU_UNSET' used by EXPENSIVE_CHECKS to make it on by default but possible to disable didn't work as intended. Changed -verify-machineinstrs to a boolOrDefault to correct this. llvm-svn: 343696	2018-10-03 16:29:24 +00:00
Sanjay Patel	306f14ceb8	[InstCombine] clean up foldVectorBinop(); NFC 1. Fix include ordering. 2. Improve variable name (width is bitwidth not number-of-elements). 3. Add local Opcode variable to reduce code duplication. llvm-svn: 343694	2018-10-03 15:46:03 +00:00
Daniel Sanders	fb9b99b26e	[globalisel][combines] Don't sink G_TRUNC down to use if that use is a G_PHI This fixes a problem where the register allocator fails to eliminate a PHI because there's a non-PHI in the middle of the PHI instructions at the start of a BB. This G_TRUNC can be better placed but this at least fixes the correctness issue quickly. I'll follow up with a patch to the verifier to catch this kind of bug in future. llvm-svn: 343693	2018-10-03 15:43:39 +00:00
Sanjay Patel	79dceb2903	[InstCombine] name change: foldShuffledBinop -> foldVectorBinop; NFC This function will deal with more than shuffles with D50992, and I have another potential per-element fold that could live here. llvm-svn: 343692	2018-10-03 15:20:58 +00:00
Andrea Di Biagio	207e0217f9	[llvm-mca] Add support for move elimination in class RegisterFile. This patch teaches class RegisterFile how to analyze register writes from instructions that are move elimination candidates. In particular, it teaches it how to check if a move can be effectively eliminated by the underlying PRF, and (if necessary) how to perform move elimination. The long term goal is to allow processor models to describe instructions that are valid move elimination candidates. The idea is to let register file definitions in tablegen declare if/when moves can be eliminated. This patch is a non functional change. The logic that performs move elimination is currently disabled. A future patch will add support for move elimination in the processor models, and enable this new code path. llvm-svn: 343691	2018-10-03 15:02:44 +00:00
Simon Pilgrim	92d02027c2	[llvm-exegesis] Avoid yaml parser from calling sscanf for obvious non-matches (PR39102) deserializeMCOperand - ensure that we at least match the first character of the sscanf pattern before calling This reduces llvm-exegesis uops analysis of the instructions supported from btver2 from 5m13s to 2m1s on debug builds. llvm-svn: 343690	2018-10-03 14:51:09 +00:00
Nirav Dave	925b64be64	[X86] Correctly use SSE registers if no-x87 is selected. Fix use of SSE1 registers for f32 ops in no-x87 mode. Notably, allow use of SSE instructions for f32 operations in 64-bit mode (but not 32-bit which is disallowed by callign convention). Also avoid translating memset/memcopy/memmove into SSE registers without X87 for 32-bit mode. This fixes PR38738. Reviewers: nickdesaulniers, craig.topper Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D52555 llvm-svn: 343689	2018-10-03 14:13:30 +00:00
Alex Bradbury	d33ffe9bb1	[RISCV][NFC] Refactor RISCVDAGToDAGISel::Select Introduce and use a switch on the opcode. llvm-svn: 343688	2018-10-03 13:13:13 +00:00
James Henderson	99031b79a6	[ThinLTO]Expose cache entry expiration time option in llvm-lto and fix a test Two cases in a ThinLTO test were passing for the wrong reasons, since rL340374. The tests were supposed to be testing that files were being pruned due to the cache size, but they were in fact being pruned because they were older than the default expiration period of 1 week. This change fixes the tests by explicitly setting the expiration time to the maximum value. This required the option to be exposed in llvm-lto. By assigning all files in the cache a similar time, it is possible to see that the newest files are still being kept, and that we aren't passing for the wrong reason again. In the event that the entry expiration were to expire for them, then the test would start failing, because these files would be removed too. Reviewed by: rnk, inglorion Differential Revision: https://reviews.llvm.org/D51992 llvm-svn: 343687	2018-10-03 13:00:20 +00:00
Jonas Paulsson	fb3a97bec0	[RA CopyHints] Fix compile-time regression This patch makes sure that a register is only hinted once to RA. In extreme cases the same register can otherwise be hinted numerous times and cause a compile time slowdown. Review: Simon Pilgrim https://reviews.llvm.org/D52826 llvm-svn: 343686	2018-10-03 12:51:19 +00:00
Clement Courbet	5a768ddd44	[llvm-exegesis][NFC] Revert rL343682 "Fix unused variable warning". That was not the proper fix: the variable is used in debug mode. llvm-svn: 343685	2018-10-03 12:48:50 +00:00
Clement Courbet	8a5a6be47a	[llvm-exegesis] Fix rL343680 in release mode. llvm-svn: 343684	2018-10-03 12:35:35 +00:00
Clement Courbet	af50a5b85f	[llvm-exegesis][NFC] Fix unused variable warning. llvm-svn: 343682	2018-10-03 12:27:43 +00:00
Clement Courbet	d5a39553ff	[llvm-exegesis] Resolve variant classes in analysis. Summary: See PR38884. Reviewers: gchatelet Subscribers: tschuett, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D52825 llvm-svn: 343680	2018-10-03 11:50:25 +00:00
Alex Bradbury	d934032e48	[RISCV] Gate float<->int and double<->int conversion patterns on IsRV32 The patterns as defined are correct only when XLen==32. This is another preparatory patch for a set of patches that flesh out RV64 codegen. llvm-svn: 343679	2018-10-03 11:35:22 +00:00
Alex Bradbury	d464ed8c2e	[RISCV] Remove XLenVT==i32 assumptions from RISCVInstrInfo td 1. brcond operates on an condition. 2. atomic_fence and the pseudo AMO instructions should all take xlen immediates This allows the same definitions and patterns to work for RV64 (XLenVT==i64). llvm-svn: 343678	2018-10-03 11:14:26 +00:00
Alex Bradbury	a9ac5994b1	[RISCV] Gate simm32 materialisation pattern and SW pattern on IsRV32 These patterns are not correct for RV64. llvm-svn: 343677	2018-10-03 11:04:59 +00:00
Florian Hahn	11a1423348	[LoopInterchange] Remove unused variable PreserveLCSSA (NFC). llvm-svn: 343676	2018-10-03 11:01:23 +00:00
Alex Bradbury	efceb59801	[RISCV] Remove RV64 test lines from umulo-128-legalisation-lowering.ll The generated code is incorrect anyway, and this test adds noise to the upcoming set of patches that flesh out RV64 support. llvm-svn: 343675	2018-10-03 10:59:42 +00:00
Jonas Toth	602e3a640f	[CodeGen] NFC fix pedantic warning from extra semicolon llvm-svn: 343674	2018-10-03 10:59:19 +00:00
Tim Renouf	a37679d67b	[AMDGPU] Fix for negative offsets in buffer/tbuffer intrinsics Summary: The new buffer/tbuffer intrinsics handle an out-of-range immediate offset by moving/adding offset&-4096 to a vgpr, leaving an in-range immediate offset, with a chance of the move/add being CSEd for similar loads/stores. However it turns out that a negative offset in a vgpr is illegal, even if adding the immediate offset makes it legal again. Therefore, this commit disables the offset&-4096 thing if the offset is negative. Differential Revision: https://reviews.llvm.org/D52683 Change-Id: Ie02f0a74f240a138dc2a29d17cfbd9e350e4ed13 llvm-svn: 343672	2018-10-03 10:29:43 +00:00
Simon Pilgrim	c68cc4efbe	[X86][Btver2] Most RMW instructions don't require an additional uop Remove uop on WriteRMW and move it into the few instructions that need it. Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343671	2018-10-03 10:28:43 +00:00
Simon Pilgrim	d11015861c	[X86] ALU/ADC RMW instructions should use the WriteRMW sequence class I was expecting this to be a nfc but Silvermont seems to be setup a little differently: // A folded store needs a cycle on MEC_RSV for the store data, but it does not need an extra port cycle to recompute the address. def : WriteRes<WriteRMW, [SLM_MEC_RSV]>; So moving from WriteStore to WriteRMW reduces predicted port pressure, confirmed by @craig.topper that this is correct. Differential Revision: https://reviews.llvm.org/D52740 llvm-svn: 343670	2018-10-03 10:01:13 +00:00
Aditya Kumar	a27014b851	Improve static analysis of cold basic blocks Differential Revision: https://reviews.llvm.org/D52704 Reviewers: sebpop, tejohnson, brzycki, SirishP Reviewed By: sebpop llvm-svn: 343663	2018-10-03 06:21:05 +00:00
Aditya Kumar	9e20ade72a	Add support for new pass manager Modified the testcases to use both pass managers Use single commandline flag for both pass managers. Differential Revision: https://reviews.llvm.org/D52708 Reviewers: sebpop, tejohnson, brzycki, SirishP Reviewed By: tejohnson, brzycki llvm-svn: 343662	2018-10-03 05:55:20 +00:00
Fangrui Song	3d76d36059	[AMDGPU] Rename pass "isel" to "amdgpu-isel" Summary: The AMDGPU target specific pass "isel" is a misleading name. Reviewers: tstellar, echristo, javed.absar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D52759 llvm-svn: 343659	2018-10-03 03:38:22 +00:00
Daniel Sanders	bad3936109	[globalisel] Fix one more missing Verifier pass from gisel-commandline-option.ll llvm-svn: 343658	2018-10-03 02:52:54 +00:00
Matt Arsenault	635d479322	AMDGPU: Always run AMDGPUAlwaysInline Even if calls are enabled, it still needs to be run for forcing inline of functions that use LDS. llvm-svn: 343657	2018-10-03 02:47:25 +00:00
Matt Arsenault	0f83d66ae7	Add atomicrmw operation to error messages llvm-svn: 343656	2018-10-03 02:37:15 +00:00
Daniel Sanders	34eac35a60	Add the missing new files from r343654 llvm-svn: 343655	2018-10-03 02:21:30 +00:00
Daniel Sanders	c973ad1878	Re-commit: [globalisel] Add a combiner helpers for extending loads and use them in a pre-legalize combiner for AArch64 Summary: Depends on D45541 Reviewers: ab, aditya_nandakumar, bogner, rtereshin, volkan, rovka, javed.absar, aemerson Subscribers: aemerson, rengolin, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45543 The previous commit failed portions of the test-suite on GreenDragon due to duplicate COPY instructions and iterator invalidation. Both issues have now been fixed. To assist with this, a helper (cloneVirtualRegister) has been added to MachineRegisterInfo that can be used to get another register that has the same type and class/bank as an existing one. llvm-svn: 343654	2018-10-03 02:12:17 +00:00
Thomas Lively	9075cd607d	[WebAssembly] any_true and all_true intrinsics and instructions Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52755 llvm-svn: 343649	2018-10-03 00:19:39 +00:00
Stanislav Mekhanoshin	1821513e2f	[AMDGPU] Assert in getOpSize() there are no sub-dword subregs Differential Revision: https://reviews.llvm.org/D52769 llvm-svn: 343648	2018-10-03 00:00:41 +00:00
Matt Arsenault	b02ba99e91	IR: Move AtomicRMW string names into class This will be used to improve error messages in a future commit. llvm-svn: 343647	2018-10-02 23:44:11 +00:00
Sanjay Patel	abcacf9753	[InstCombine] add icmp+logic tests with commuted ops; NFC The transform in question is located in foldICmpAndConstConst(), but as shown here, it doesn't work if operands are commuted. llvm-svn: 343646	2018-10-02 22:53:37 +00:00
Reid Kleckner	9c0baa524c	Relax dbg-declare-inalloca.ll test more We don't need to match the precise type index number here. It's not important. The type name is what matters to make this test useful. llvm-svn: 343642	2018-10-02 22:28:10 +00:00
Sam Clegg	b2486f118d	[WebAssembly] Stop generating helper functions in WebAssemblyLowerEmscriptenEHSjLj Previously we were creating weakly defined helper function in each translation unit: - setThrew - setTempRet0 Instead we now assume these will be provided at link time. In emscripten they are provided in compiler-rt: https://github.com/kripken/emscripten/pull/7203 Additionally we previously created three global variable which are also now required to exist at link time instead. - __THREW__ - _threwValue - __tempRet0 Differential Revision: https://reviews.llvm.org/D49208 llvm-svn: 343640	2018-10-02 22:12:15 +00:00
Fangrui Song	e5652fc682	[CodeView] Try fixing DebugInfo/X86/dbg-declare-inalloca.ll llvm-svn: 343639	2018-10-02 22:03:31 +00:00
Daniel Sanders	f430d941e9	[globalisel] Attempt to fix llvm-clang-x86_64-expensive-checks-win The behaviour of this bot indicates that -verify-machineinstrs has been forced on and is therefore inserting the verifier on builds that don't expect it. Explicitly specify whether it's enabled or disabled for each test. llvm-svn: 343633	2018-10-02 20:51:27 +00:00
Aaron Smith	da0602c154	[CodeView] Only add the Scoped flag for an enum type when it has an immediate function scope to match MSVC Reviewers: rnk, zturner, llvm-commits Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D52706 llvm-svn: 343627	2018-10-02 20:28:15 +00:00
Aaron Smith	802b033d78	[CodeView] Emit function options for subprogram and member functions Summary: Use the newly added DebugInfo (DI) Trivial flag, which indicates if a C++ record is trivial or not, to determine Codeview::FunctionOptions. Clang and MSVC generate slightly different Codeview for C++ records. For example, here is the C++ code for a class with a defaulted ctor, class C { public: C() = default; }; Clang will produce a LF for the defaulted ctor while MSVC does not. For more details, refer to FIXMEs in the test cases in "function-options.ll" included with this set of changes. Reviewers: zturner, rnk, llvm-commits, aleksandr.urakov Reviewed By: rnk Subscribers: Hui, JDevlieghere Differential Revision: https://reviews.llvm.org/D45123 llvm-svn: 343626	2018-10-02 20:21:05 +00:00
Matt Davis	42425ccf50	[llvm-mca] Remove unecessary forward decls. NFC. This patch also removes an unecessary include. llvm-svn: 343621	2018-10-02 19:42:46 +00:00
Matt Morehouse	4b1ec17fb0	Revert "X86, AArch64, ARM: Do not attach debug location to spill/reload instructions" This reverts r343520 due to breakage of HWASan tests on Android. llvm-svn: 343616	2018-10-02 18:35:44 +00:00
Matt Davis	21d41dffe1	[llvm-mca] Constify the 'notify' routines. NFC. Also fixed up some whitespace formatting in DispatchStage.cpp. llvm-svn: 343615	2018-10-02 18:26:33 +00:00
Craig Topper	49225d0915	[X86][Disassembler] Add bizarro versions of the MOVSXD instruction that sign extend from a GR32 to GR32 or GR16. The 0x63 opcodes in 64-bit mode have a fixed source size of 32-bits, but the destination size is controlled by REX.W and the 0x66 opsize prefix. This instruction is normally used with a REX.W prefix which provides desired behavior. The other encodings are interpretted as valid by the processor, but aren't useful. This patch makes us recognize them for the disassembler to match objdump. llvm-svn: 343614	2018-10-02 18:16:19 +00:00
Daniel Sanders	74de21d06f	[globalisel][verifier] Run the MachineVerifier from IRTranslator onwards -verify-machineinstrs inserts the MachineVerifier after every MachineInstr-based pass. However, GlobalISel creates MachineInstr-based passes earlier than DAGISel and the corresponding verifiers are not being added. This patch fixes that. If GlobalISel triggers the fallback path then the MIR can be left in a bad state that is going to be cleared by ResetMachineFunctions. In this situation verifying between GlobalISel passes will prevent the fallback path from recovering from this. As a result, we bail out of verifying a function if the FailedISel attribute is present. llvm-svn: 343613	2018-10-02 17:56:58 +00:00
Reid Kleckner	d5e4ec74e3	[codeview] Fix 32-bit x86 variable locations in realigned stack frames Add the .cv_fpo_stackalign directive so that we can define $T0, or the VFRAME virtual register, with it. This was overlooked in the initial implementation because unlike MSVC, we push CSRs before allocating stack space, so this value is only needed to describe local variable locations. Variables that the compiler now addresses via ESP are instead described as being stored at offsets from VFRAME, which for us is ESP after alignment in the prologue. This adds tests that show that we use the VFRAME register properly in our S_DEFRANGE records, and that we emit the correct FPO data to define it. Fixes PR38857 llvm-svn: 343603	2018-10-02 16:43:52 +00:00
Simon Pilgrim	860cb5c071	[X86][Btver2] Fix BLENDV and AESDEC schedules Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343597	2018-10-02 15:13:18 +00:00
Krzysztof Parzyszek	528aff3372	[Hexagon] Fix extracting subvectors of non-HVX vNi1 Patch by Brendon Cahoon. llvm-svn: 343596	2018-10-02 15:05:43 +00:00
Sanjay Patel	e2cd6384b7	[InstCombine] add tests with undef elements; NFC See discussion in D52747. llvm-svn: 343595	2018-10-02 15:00:56 +00:00
Diogo N. Sampaio	eb9ca5ab18	[ARM] Emmit data symbol for constant pool data The ARM elf emitter would omit printing data symbol when constant data. This patch overrides the emitFill method as to enforce that the symbol is correctly printed. Differential revision: https://reviews.llvm.org/D52737 llvm-svn: 343594	2018-10-02 14:55:48 +00:00
Roman Lebedev	ea2046bea9	[NFC][CodeGen][X86] fma.ll, lwp-intrinsics.ll: actually spell --check-prefixes correctly :/ llvm-svn: 343588	2018-10-02 13:34:50 +00:00
Sanjay Patel	6dbecb4162	[InstCombine] add more insert/extract vector tests with FP types; NFC These are candidates for the same fold that was implemented in D52439, but FP types require bitcasting (and that changes the extra uses profitability calculation). llvm-svn: 343587	2018-10-02 13:34:05 +00:00
Simon Pilgrim	201bbe3993	[X86] Remove unnecessary BT(C/R/S)m(i/r) scheduler overrides Some SchedAlias remain due to some badly setup RMW tags - but at least the overrides are all removed llvm-svn: 343586	2018-10-02 13:11:59 +00:00
Roman Lebedev	5412be4b7a	[NFC][CodeGen][X86] lwp-intrinsics.ll: fix check prefixes llvm-svn: 343585	2018-10-02 13:11:08 +00:00
Roman Lebedev	8b253f0b54	[NFC][CodeGen][X86] fma.ll: fix check prefixes for -mcpu=bdver2 llvm-svn: 343584	2018-10-02 13:10:55 +00:00
Simon Pilgrim	271bcb9397	[X86] Add APInt constant assembly printer helper llvm-svn: 343577	2018-10-02 11:32:33 +00:00
Oliver Stannard	c41902807e	[AArch64][v8.5A] Add Memory Tagging instructions This adds new instructions to manipluate tagged pointers, and to load and store the tags associated with memory. Patch by Pablo Barrio, David Spickett and Oliver Stannard! Differential revision: https://reviews.llvm.org/D52490 llvm-svn: 343572	2018-10-02 10:04:39 +00:00
Oliver Stannard	2a5fcba94b	[AArch64][v8.5A] Add Memory Tagging system registers This adds new system registers introduced by the Memory Tagging extension. Patch by Pablo Barrio! Differential revision: https://reviews.llvm.org/D52488 llvm-svn: 343571	2018-10-02 09:54:35 +00:00
Oliver Stannard	4493f421ac	[AArch64][v8.5A] Add MTE system instructions The Memory Tagging Extension adds system instructions for data cache maintenance, implemented as new operands to the DC instruction. Patch by Pablo Barrio! Differential revision: https://reviews.llvm.org/D52487 llvm-svn: 343570	2018-10-02 09:48:43 +00:00
David Green	1e44c3b62c	[InstCombine] Fold ~A - Min/Max(~A, O) -> Max/Min(A, ~O) - A This is an attempt to get out of a local-minimum that instcombine currently gets stuck in. We essentially combine two optimisations at once, ~a - ~b = b-a and min(~a, ~b) = ~max(a, b), only doing the transform if the result is at least neutral. This involves using IsFreeToInvert, which has been expanded a little to include selects that can be easily inverted. This is trying to fix PR35875, using the ideas from Sanjay. It is a large improvement to one of our rgb to cmy kernels. Differential Revision: https://reviews.llvm.org/D52177 llvm-svn: 343569	2018-10-02 09:48:34 +00:00
Oliver Stannard	85de54090e	[AArch64][v8.5A] Add MTE as an optional AArch64 extension This adds the memory tagging extension, which is an optional extension introduced in v8.5A. The new instructions and registers will be added by subsequent patches. Patch by Pablo Barrio! Differential revision: https://reviews.llvm.org/D52486 llvm-svn: 343563	2018-10-02 09:36:28 +00:00
Simon Pilgrim	ad23f270db	[X86] Standardize floating point assembly comments Consistently try to use APFloat::toString for floating point constant comments to get rid of differences between Constant / ConstantDataSequential values - it should help stop some of the linux-windows buildbot failures matching NaN/INF etc. as well. Differential Revision: https://reviews.llvm.org/D52702 llvm-svn: 343562	2018-10-02 09:08:51 +00:00
David Green	c066a92657	[InstCombine] Tests for ~A - Min/Max(~A, O) -> Max/Min(A, ~O) - A. NFC llvm-svn: 343561	2018-10-02 09:06:49 +00:00
Matt Arsenault	ab41193312	AMDGPU: Expand atomicrmw nand in IR llvm-svn: 343559	2018-10-02 03:50:56 +00:00
Thomas Lively	6f77811a21	[WebAssembly] Restore slashes in SIMD conversion names Summary: Depends on D52372 and D52442. Reviewers: aheejin, dschuff, aardappel Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52512 llvm-svn: 343558	2018-10-02 01:52:21 +00:00
Owen Rodley	31fddbac8f	[MCA] Remove SM.hasNext() call in FetchStage::execute. Summary: This is redundant, as FetchStage::getNextInstruction already checks this and returns llvm::ErrorSuccess() as appropriate. NFC. Reviewers: andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D52642 llvm-svn: 343555	2018-10-02 00:40:08 +00:00
Fangrui Song	99d4f74d01	[AArch64][DAGCombiner]: change -stop-after=isel to instruction-select "isel" is registered by AMDGPU. The test will break if the AMDGPU target is not built. llvm-svn: 343553	2018-10-02 00:22:51 +00:00
Craig Topper	d616d33a96	[SimplifyCFG] Use Value::hasNUses instead of 'getNumUses() =='. NFCI getNumUses is linear in the number of uses. Since we're looking for a specific use count, we can use hasNUses which will stop as soon as it determines there are more than N uses instead of walking all of them. llvm-svn: 343550	2018-10-01 23:09:52 +00:00
Matt Davis	8e2c75900e	[llvm-mca] Rename the 'Subtract' method to 'subtract' llvm-svn: 343549	2018-10-01 23:01:45 +00:00
Craig Topper	90c0a0621c	[SimplifyCFG] Update comments that refer to CondBB to say ThenBB instead. NFC There is no variable in this function named CondBB, but there is one named ThenBB and I believe the comments are all refering to it. llvm-svn: 343548	2018-10-01 22:56:11 +00:00
Zachary Turner	a67765ac8d	[PDB] Add support for more kinds of PDB Sym Tags. DIA SDK is returning several new sym tag types, so we update the enumeration and printing code to support these. llvm-svn: 343547	2018-10-01 22:39:19 +00:00
Daniel Sanders	33f42f97af	Revert: r343521 and r343541: [globalisel] Add a combiner helpers for extending loads and use them in a pre-legalize combiner for AArch64 There's a strange assertion on two of the Green Dragon bots that goes away when this is reverted. The assertion is in RegBankAlloc and if it is this commit then -verify-machine-instrs should have caught it earlier in the pipeline. llvm-svn: 343546	2018-10-01 22:32:08 +00:00
Reid Kleckner	8d7c421a70	[codeview] Simplify S_DEFRANGE emission code, NFC These assembler directives are still pretty unreadable and it would be nice to clean them up at some point. llvm-svn: 343544	2018-10-01 22:25:49 +00:00
Reid Kleckner	9ea2c01264	[codeview] Emit S_FRAMEPROC and use S_DEFRANGE_FRAMEPOINTER_REL Summary: Before this change, LLVM would always describe locals on the stack as being relative to some specific register, RSP, ESP, EBP, ESI, etc. Variables in stack memory are pretty common, so there is a special S_DEFRANGE_FRAMEPOINTER_REL symbol for them. This change uses it to reduce the size of our debug info. On top of the size savings, there are cases on 32-bit x86 where local variables are addressed from ESP, but ESP changes across the function. Unlike in DWARF, there is no FPO data to describe the stack adjustments made to push arguments onto the stack and pop them off after the call, which makes it hard for the debugger to find the local variables in frames further up the stack. To handle this, CodeView has a special VFRAME register, which corresponds to the $T0 variable set by our FPO data in 32-bit. Offsets to local variables are instead relative to this value. This is part of PR38857. Reviewers: hans, zturner, javed.absar Subscribers: aprantl, hiraditya, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D52217 llvm-svn: 343543	2018-10-01 21:59:45 +00:00
Reid Kleckner	7a6966ec27	Fix the Windows build in GlobalISel Clang-cl was complaining about some sort of constexpr narrowing bug: C:\src\llvm-project\llvm\lib\CodeGen\GlobalISel\CombinerHelper.cpp(136,31): error: non-constant-expression cannot be narrowed from type 'llvm::TargetOpcode::(anonymous enum at C:\src\llvm-project\llvm\include\llvm/CodeGen/TargetOpcodes.h:22:1)' to 'unsigned int' in initializer list [-Wc++11-narrowing] unsigned(MI.getOpcode()) == unsigned(TargetOpcode::G_LOAD) ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ C:\src\llvm-project\llvm\lib\CodeGen\GlobalISel\CombinerHelper.cpp(136,31): note: insert an explicit cast to silence this issue unsigned(MI.getOpcode()) == unsigned(TargetOpcode::G_LOAD) ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ static_cast<unsigned int>( llvm-svn: 343541	2018-10-01 21:39:39 +00:00
Craig Topper	42cd8cd862	Recommit r343499 "[X86] Enable load folding in the test shrinking code" Original message: This patch adds load folding support to the test shrinking code. This was noticed missing in the review for D52669 llvm-svn: 343540	2018-10-01 21:35:28 +00:00
Craig Topper	f06a57fc89	Recommit r343498 "[X86] Improve test instruction shrinking when the sign flag is used and the output of the and is truncated." This includes a fix to prevent i16 compares with i32/i64 ands from being shrunk if bit 15 of the and is set and the sign bit is used. Original commit message: Currently we skip looking through truncates if the sign flag is used. But that's overly restrictive. It's safe to look through the truncate as long as we ensure one of the 3 things when we shrink. Either the MSB of the mask at the shrunken size isn't set. If the mask bit is set then either the shrunk size needs to be equal to the compare size or the sign There are still missed opportunities to shrink a load and fold it in here. This will be fixed in a future patch. llvm-svn: 343539	2018-10-01 21:35:26 +00:00
Sanjay Patel	de5e8b93f4	[InstCombine] add inverse test for vector trunc canonical form; NFC llvm-svn: 343529	2018-10-01 20:25:49 +00:00
Sanjay Patel	746eb09127	[InstCombine] regenerate test checks; NFC These files used an old version of the script. We regex more now. llvm-svn: 343527	2018-10-01 20:22:28 +00:00
Stefan Pintilie	5d32a86f44	[PowerPC] Folding XForm to DForm loads requires alignment for some DForm loads. Going from XForm Load to DSForm Load requires that the immediate be 4 byte aligned. If we are not aligned we must leave the load as LDX (XForm). This bug is causing a compile-time failure in the benchmark h264ref. Differential Revision: https://reviews.llvm.org/D51988 llvm-svn: 343525	2018-10-01 20:16:27 +00:00
Eric Christopher	dcf1d97c5c	Temporarily revert "[GVNHoist] Re-enable GVNHoist by default" This reverts commit r342387 as it's showing significant performance regressions in a number of benchmarks. Followed up with the committer and original thread with an example and will get performance numbers before recommitting. llvm-svn: 343522	2018-10-01 18:57:08 +00:00
Daniel Sanders	9659bfda5a	[globalisel] Add a combiner helpers for extending loads and use them in a pre-legalize combiner for AArch64 Summary: Depends on D45541 Reviewers: ab, aditya_nandakumar, bogner, rtereshin, volkan, rovka, javed.absar, aemerson Subscribers: aemerson, rengolin, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45543 llvm-svn: 343521	2018-10-01 18:56:47 +00:00
Matthias Braun	3e081703c3	X86, AArch64, ARM: Do not attach debug location to spill/reload instructions Spill/reload instructions are artificially generated by the compiler and have no relation to the original source code. So the best thing to do is not attach any debug location to them (instead of just taking the next debug location we find on following instructions). Differential Revision: https://reviews.llvm.org/D52125 llvm-svn: 343520	2018-10-01 18:56:39 +00:00
Craig Topper	1346b5b7cf	[X86] Add more test shrinking with truncate and sign bit usage tests. NFC llvm-svn: 343519	2018-10-01 18:52:19 +00:00
Craig Topper	e072934d28	Revert r343499 and r343498. X86 test improvements There's a subtle bug in the handling of truncate from i32/i64 to i32 without minsize. I'll be adding more test cases and trying to find a fix. llvm-svn: 343516	2018-10-01 18:40:44 +00:00
Krzysztof Parzyszek	6d569a2cc4	[Hexagon] Remove incorrect pattern for swiz The pattern had a couple of problems: - It was checking for loads of bytes in the reverse order to what it should have been looking for. - It would replace loads of bytes with a load of a word without making sure that the alignment was correct. Thanks to Eli Friedman for pointing it out. llvm-svn: 343514	2018-10-01 18:24:40 +00:00
Stanislav Mekhanoshin	ae8bd6d9b5	[AMDGPU] Fixed SIInstrInfo::getOpSize to handle subregs Currently it returns incorrect operand size for a target independet node such as COPY if operand is a register with subreg. Instead of correct subreg size it returns a size of the whole superreg. Differential Revision: https://reviews.llvm.org/D52736 llvm-svn: 343508	2018-10-01 18:00:02 +00:00
Zachary Turner	a5e3e02602	[PDB] Add support for dumping Typedef records. These work a little differently because they are actually in the globals stream and are treated as symbol records, even though DIA presents them as types. So this also adds the necessary infrastructure to cache records that live somewhere other than the TPI stream as well. llvm-svn: 343507	2018-10-01 17:55:38 +00:00
Zachary Turner	5c1873b213	[PDB] Add support for parsing VFTable Shape records. This allows them to be returned from the native API. llvm-svn: 343506	2018-10-01 17:55:16 +00:00
Matthias Braun	7159daa68e	MIRParser: Check that instructions only reference DILocation metadata llvm-svn: 343505	2018-10-01 17:50:52 +00:00
Wouter van Oortmerssen	0c83c3ff38	[WebAssembly] Fixed AsmParser not allowing instructions with / Summary: The AsmParser Lexer regards these as a seperate token. Here we expand the instruction name with them if they are adjacent (no whitespace). Tested: the basic-assembly.s test case has one case with a / in it. The currently are also instructions with : in them, which we intend to rename rather than fix them here. Reviewers: tlively, dschuff Subscribers: sbc100, jgravelle-google, aheejin, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52442 llvm-svn: 343501	2018-10-01 17:20:31 +00:00
Craig Topper	aa84e1bba2	[X86] Enable load folding in the test shrinking code This patch adds load folding support to the test shrinking code. This was noticed missing in the review for D52669 Differential Revision: https://reviews.llvm.org/D52699 llvm-svn: 343499	2018-10-01 17:10:50 +00:00
Craig Topper	2b587ad071	[X86] Improve test instruction shrinking when the sign flag is used and the output of the and is truncated Currently we skip looking through truncates if the sign flag is used. But that's overly restrictive. It's safe to look through the truncate as long as we ensure one of the 3 things when we shrink. Either the MSB of the mask at the shrunken size isn't set. If the mask bit is set then either the shrunk size needs to be equal to the compare size or the sign flag needs to be unused. There are still missed opportunities to shrink a load and fold it in here. This will be fixed in a future patch. Differential Revision: https://reviews.llvm.org/D52669 llvm-svn: 343498	2018-10-01 17:10:45 +00:00
Simon Pilgrim	e0d2019052	[X86][Btver2] Fix BT(C\|R\|S)mr & BT(C\|R\|S)mi schedule latency + uop counts Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343494	2018-10-01 16:31:30 +00:00
Matthias Braun	004fe6bf83	DAGCombiner: StoreMerging: Fix bad index calculating when adjusting mismatching vector types This fixes a case of bad index calculation when merging mismatching vector types. This changes the existing code to just use the existing extract_{subvector\|element} and a bitcast (instead of bitcast first and then newly created extract_xxx) so we don't need to adjust any indices in the first place. rdar://44584718 Differential Revision: https://reviews.llvm.org/D52681 llvm-svn: 343493	2018-10-01 16:25:50 +00:00
Sanjay Patel	5187efcfab	[x86] add tests for 256- and 512-bit vector types for scalar-to-vector transform; NFC llvm-svn: 343491	2018-10-01 16:17:18 +00:00
Simon Pilgrim	683e35527b	[X86] Create schedule classes for BT(C\|R\|S)mi and BT(C\|R\|S)mr instructions llvm-svn: 343490	2018-10-01 16:12:44 +00:00
Evandro Menezes	55b9a5395b	[AArch64] Refactor cheap cost model Refactor the order in `TII::isAsCheapAsAMove()` to ease future development and maintenance. Practically NFC. llvm-svn: 343489	2018-10-01 16:11:19 +00:00
Simon Pilgrim	4334912c1c	[X86] Remove unnecessary BTmi/BTmr scheduler overrides llvm-svn: 343487	2018-10-01 15:01:00 +00:00
Jesper Antonsson	c954b86391	[InstCombine] Handle vector compares in foldGEPIcmp(), take 2 Summary: This is a continuation of the fix for PR34627 "InstCombine assertion at vector gep/icmp folding". (I just realized bugpoint had fuzzed the original test for me, so I had fixed another trigger of the same assert in adjacent code in InstCombine.) This patch avoids optimizing an icmp (to look only at the base pointers) when the resulting icmp would have a different type. The patch adds a testcase and also cleans up and shrinks the pre-existing test for the adjacent assert trigger. Reviewers: lebedev.ri, majnemer, spatel Reviewed By: lebedev.ri Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52494 llvm-svn: 343486	2018-10-01 14:59:25 +00:00
Simon Atanasyan	1ea206be73	[mips] Generate tests expectations using update_llc_test_checks. NFC Generate tests expectations using update_llc_test_checks and reduce number of "check prefixes" used in the tests. llvm-svn: 343485	2018-10-01 14:43:07 +00:00
Simon Pilgrim	6ddc4e821c	[X86][Btver2] Fix BTmr schedule uop counts Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343484	2018-10-01 14:42:16 +00:00
Sanjay Patel	31b07198f1	[InstCombine] try to convert vector insert+extract to trunc; 2nd try This was originally committed at rL343407, but reverted at rL343458 because it crashed trying to handle a case where the destination type is FP. This version of the patch adds a check for that possibility. Tests added at rL343480. Original commit message: This transform is requested for the backend in: https://bugs.llvm.org/show_bug.cgi?id=39016 ...but I figured it was worth doing in IR too, and it's probably easier to implement here, so that's this patch. In the simplest case, we are just truncating a scalar value. If the extract index doesn't correspond to the LSBs of the scalar, then we have to shift-right before the truncate. Endian-ness makes this tricky, but hopefully the ASCII-art helps visualize the transform. Differential Revision: https://reviews.llvm.org/D52439 llvm-svn: 343482	2018-10-01 14:40:00 +00:00
Sanjay Patel	22ae8dabb5	[InstCombine] add more insert-extract tests for D52439; NFC The first attempt at this transform: rL343407 ...was reverted: rL343458 ...because it did not handle the case where we bitcast to FP. The patch was already limited to avoid the case where we bitcast from FP, but we might want to transform that too. llvm-svn: 343480	2018-10-01 14:29:09 +00:00
Simon Pilgrim	43737a3df4	[X86] Create schedule classes for BTmi and BTmr instructions llvm-svn: 343478	2018-10-01 14:23:37 +00:00
Haojian Wu	9240494782	Move llvm util dependencies from clang-tools-extra to add_lit_target. Summary: Address fixme in r301762. And would simplify the cmake file in clang-tools-extra. Reviewers: sammccall Subscribers: mgorny, llvm-commits, cfe-commits Differential Revision: https://reviews.llvm.org/D52713 llvm-svn: 343473	2018-10-01 14:00:51 +00:00
Robert Widmann	abda7ee8e7	[LLVM-C] Add an accessor for the kind of a Metadata Node Summary: Allows for retrieving the type of a metadata node. Has the added benefit of ensuring that the C and C++ kind APIs stay in sync as a failure to add a corresponding LLVMMetadataKind will result in the switch in the accessor being semantically malformed. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52693 llvm-svn: 343469	2018-10-01 13:15:09 +00:00
Simon Pilgrim	a982236e59	[X86][Btver2] Fix masked load schedule JFPU01 resource usage should match JFPX Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343468	2018-10-01 13:12:05 +00:00
Guillaume Chatelet	415b2fbef5	[llvm-exegesis][NFC] Move random functions from CodeTemplate to SnippetGenerator. Summary: Just moving methods around. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52720 llvm-svn: 343461	2018-10-01 12:19:10 +00:00
Sam McCall	79c995c0cc	[Support] Listing a directory containing dangling symlinks is not an error. Summary: Reporting this as an error required stat()ing every file, as well as seeming semantically questionable. Reviewers: vsk, bkramer Subscribers: mgrang, kristina, llvm-commits, liaoyuke Differential Revision: https://reviews.llvm.org/D52648 llvm-svn: 343460	2018-10-01 12:17:05 +00:00
Hans Wennborg	a60aa91374	Revert r343407 "[InstCombine] try to convert vector insert+extract to trunc" This caused Chromium builds to fail with "Illegal Trunc" assertion. See https://crbug.com/890723 for repro. > This transform is requested for the backend in: > https://bugs.llvm.org/show_bug.cgi?id=39016 > ...but I figured it was worth doing in IR too, and it's probably > easier to implement here, so that's this patch. > > In the simplest case, we are just truncating a scalar value. If the > extract index doesn't correspond to the LSBs of the scalar, then we > have to shift-right before the truncate. Endian-ness makes this tricky, > but hopefully the ASCII-art helps visualize the transform. > > Differential Revision: https://reviews.llvm.org/D52439 llvm-svn: 343458	2018-10-01 12:07:45 +00:00
Guillaume Chatelet	c6268f3ba2	[llvm-exegesis][NFC] Make randomizeUnsetVariables a free function. Summary: This is prelimineary to moving random functions to SnippetGenerator. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52718 llvm-svn: 343456	2018-10-01 11:46:06 +00:00
Alexander Timofeev	b048fa3344	[AMDGPU] Divergence driven instruction selection. Shift operations. Summary: This change enables VOP3 shifts to be explicitly selected dependent on the divergence. Differential Revision: https://reviews.llvm.org/D52559 Reviewers: rampitec llvm-svn: 343455	2018-10-01 11:06:35 +00:00
Puyan Lotfi	06e65cae4a	[NFC] Adding "REQUIRES: zlib" to a llvm-objcopy test for bots without zlib. M test/tools/llvm-objcopy/compress-and-decompress-debug-sections-error.test llvm-svn: 343454	2018-10-01 10:50:23 +00:00
Andrea Di Biagio	24ea163007	[X86][BtVer2] Teach how to identify zero-idiom VPERM2F128rr instructions. This patch adds another variant class to identify zero-idiom VPERM2F128rr instructions. On Jaguar, a VPERM wih bit 3 and 7 of the mask set, is a zero-idiom. Differential Revision: https://reviews.llvm.org/D52663 llvm-svn: 343452	2018-10-01 10:35:13 +00:00
Puyan Lotfi	af048648d3	[llvm-objcopy] Adding support for decompressing zlib compressed dwarf sections. Summary: I had added support for compressing dwarf sections in a prior commit, this one adds support for decompressing. Usage is: llvm-objcopy --decompress-debug-sections input.o output.o Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D51841 llvm-svn: 343451	2018-10-01 10:29:41 +00:00
Florian Hahn	8600fee52e	Recommit r343308: [LoopInterchange] Turn into a loop pass. llvm-svn: 343450	2018-10-01 09:59:48 +00:00
Clement Courbet	a933fb237e	[X86][Sched] Update scheduling information for VZEROALL on HWS, BDW, SKX, SNB. Summary: While looking at PR35606, I found out that the scheduling info is incorrect. One can check that it's really a P5+P6 and not a 2*P56 with: echo -e 'vzeroall\nvandps %xmm1, %xmm2, %xmm3' \| ./bin/llvm-exegesis -mode=uops -snippets-file=- (vandps executes on P5 only) Reviewers: craig.topper, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52541 llvm-svn: 343447	2018-10-01 08:37:48 +00:00
Clement Courbet	dac60b9837	[X86][Sched] Add pfm uop counter definitions for SNB,BDW,SKX. llvm-svn: 343446	2018-10-01 08:37:37 +00:00
Carlos Alberto Enciso	81d8ef2196	[DebugInfo][Dexter] Incorrect DBG_VALUE after MCP dead copy instruction removal. When MachineCopyPropagation eliminates a dead 'copy', its associated debug information becomes invalid. as the recorded register has been removed. It causes the debugger to display wrong variable value. Differential Revision: https://reviews.llvm.org/D52614 llvm-svn: 343445	2018-10-01 08:14:44 +00:00
Clement Courbet	ce4caff0de	[CodeGen][NFC] Add tests for heterogeneous types in MergeConsecutiveStores Reviewers: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52643 llvm-svn: 343444	2018-10-01 07:16:22 +00:00
Craig Topper	67d9dbdbdd	[X86] Stop X86DomainReassignment from creating copies between GR8/GR16 physical registers and k-registers. We can only copy between a k-register and a GR32/GR64 register. This patch detects that the copy will be illegal and prevents the domain reassignment from happening for that closure. This probably isn't the best fix, and we should probably figure out how to handle this correctly. Fixes PR38803. llvm-svn: 343443	2018-10-01 07:08:41 +00:00
Lang Hames	deb3640d95	[ORC] Pass Symbols to ExecutionSession::lookup by value, potentially saving a copy. llvm-svn: 343442	2018-10-01 04:59:10 +00:00
Lang Hames	47d0a37704	[ORC] Add convenience methods for creating DynamicLibraryFallbackGenerators for libraries on disk, and for the current process. Avoids more boilerplate during JIT construction. llvm-svn: 343430	2018-10-01 00:59:28 +00:00
Lang Hames	d89c273a2e	[ORC] Add a method to JITTargetMachineBuilder to get the default data layout for the target machine. This simplifies usage during setup of concurrent JIT stacks where the client needs a DataLayout, but not a TargetMachine (TargetMachines are created on the fly by the compile threads later). llvm-svn: 343429	2018-10-01 00:59:26 +00:00
Craig Topper	1d1dca6a6f	[X86] Change an llvm_unreachable to a report_fatal_error so the optimizer will stop making us reach the other report_fatal_error in this function. There's a conditional report_fatal_error just above this llvm_unreachable. The optimizer when seeing the unreachable removes the conditional and just makes any other error trigger the existing report_fatal_error. llvm-svn: 343428	2018-09-30 23:43:30 +00:00
Lang Hames	71d781c434	[ORC] Add an 'intern' method to ExecutionEngine for interning symbol names. This cuts down on boilerplate by reducing 'ES.getSymbolStringPool().intern(...)' to 'ES.intern(...)'. llvm-svn: 343427	2018-09-30 23:18:24 +00:00
Fangrui Song	3507c6e884	Use the container form llvm::sort(C, ...) There are a few leftovers in rL343163 which span two lines. This commit changes these llvm::sort(C.begin(), C.end, ...) to llvm::sort(C, ...) llvm-svn: 343426	2018-09-30 22:31:29 +00:00
Simon Pilgrim	f21083870d	[X86] Fix scheduler class for BTmi instructions This wasn't treated as a folded load instruction llvm-svn: 343424	2018-09-30 20:19:16 +00:00
Lang Hames	d435ce4343	[ORC] Extract and tidy up JITTargetMachineBuilder, add unit test. (1) Adds comments for the API. (2) Removes the setArch method: This is redundant: the setArchStr method on the triple should be used instead. (3) Turns EmulatedTLS on by default. This matches EngineBuilder's behavior. llvm-svn: 343423	2018-09-30 19:12:23 +00:00
Simon Pilgrim	b1108399bd	[LLVM-MCA][X86] Add missing VCMPESTR/VCMPESTR tests llvm-svn: 343421	2018-09-30 18:19:00 +00:00
Craig Topper	99ad2a5723	[X86] Copy memrefs when folding a load for division instruction selection. llvm-svn: 343419	2018-09-30 17:47:18 +00:00
Bjorn Pettersson	c2fc53ac90	[PHIElimination] Lower a PHI node with only undef uses as IMPLICIT_DEF Summary: The lowering of PHI nodes used to detect if all inputs originated from IMPLICIT_DEF's. If so the PHI node was replaced by an IMPLICIT_DEF. Now we also consider undef uses when checking the inputs. So if all inputs are implicitly defined or undef we lower the PHI to an IMPLICIT_DEF. This makes PHIElimination::LowerPHINode more consistent as it checks both implicit and undef properties at later stages. Reviewers: MatzeB, tstellar Reviewed By: MatzeB Subscribers: jvesely, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D52558 llvm-svn: 343417	2018-09-30 17:26:58 +00:00
Bjorn Pettersson	4af7f57bdf	[PHIElimination] Update the regression test for PR16508 Summary: When PR16508 was solved (in rL185363) a regression test was added as test/CodeGen/PowerPC/2013-07-01-PHIElimBug.ll. I discovered that the test case no longer reproduced the scenario from PR16508. This problem could have been amended by adding an extra RUN line with "-O1" (or possibly "-O0"), but instead I added a mir-reproducer test/CodeGen/PowerPC/2013-07-01-PHIElimBug.mir to get a reproducer that is less sensitive to changes in earlier passes (including O-level). While being at it I also corrected a code comment in PHIElimination::EliminatePHINodes that has been incorrect since the related bugfix from rL185363. Reviewers: MatzeB, hfinkel Reviewed By: MatzeB Subscribers: nemanjai, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D52553 llvm-svn: 343416	2018-09-30 17:23:21 +00:00
Simon Pilgrim	20623f2343	[LLVM-MCA][X86] Add some AVX512 tests These are going to be necessary to check I don't mess up when I start cleaning up all the remaining vector integer overrides llvm-svn: 343414	2018-09-30 17:01:59 +00:00
Simon Pilgrim	4f5693ac8d	[X86][Btver2] Fix PCmpIStrI/PCmpIStrM schedules Missing JFPU0 pipe and double JFPU1 pipe (to match JVALU1) resources Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343413	2018-09-30 16:38:38 +00:00
Zachary Turner	518cb2d560	[PDB] Add native support for dumping array types. llvm-svn: 343412	2018-09-30 16:19:18 +00:00
Simon Pilgrim	9cec221a1c	[X86][BtVer2] Add the ability to add additional uops for folded instructions Some instructions take an extra load uop - but not consistently..... llvm-svn: 343410	2018-09-30 15:58:56 +00:00
Sanjay Patel	1e0f1f645a	[InstCombine] try to convert vector insert+extract to trunc This transform is requested for the backend in: https://bugs.llvm.org/show_bug.cgi?id=39016 ...but I figured it was worth doing in IR too, and it's probably easier to implement here, so that's this patch. In the simplest case, we are just truncating a scalar value. If the extract index doesn't correspond to the LSBs of the scalar, then we have to shift-right before the truncate. Endian-ness makes this tricky, but hopefully the ASCII-art helps visualize the transform. Differential Revision: https://reviews.llvm.org/D52439 llvm-svn: 343407	2018-09-30 14:34:01 +00:00
Sanjay Patel	26c119a9c2	[InstCombine] allow lengthening of insertelement to eliminate shuffles As noted in post-commit comments for D52548, the limitation on increasing vector length can be applied by opcode. As a first step, this patch only allows insertelement to be widened because that has no logical downsides for IR and has little risk of pessimizing codegen. This may cause PR39132 to go into hiding during a full compile, but that bug is not fixed. llvm-svn: 343406	2018-09-30 13:50:42 +00:00
Simon Pilgrim	818cfc40ff	[DAG] Don't perform SINT_TO_FP<->UINT_TO_FP custom conversion after legalization The SINT_TO_FP<->UINT_TO_FP combines for non-negative integers should only occur for legal ops once LegalOperations = true No test case to hand, noticed when investigating PR38226 + PR38970 llvm-svn: 343405	2018-09-30 12:46:42 +00:00
Roman Lebedev	0496477c5d	[NFC][CodeGen][X86][AArch64] Add 64-bit constant bit field extract pattern tests llvm-svn: 343404	2018-09-30 12:42:08 +00:00
Simon Pilgrim	84e280ae42	[X86] Regenerate MMX coalescing test Exposes another extractelement(bitcast(scalartovector())) pattern llvm-svn: 343403	2018-09-30 09:42:04 +00:00
Zachary Turner	9be3b6a18b	[PDB] Fix this test for real. I was able to test this fix on an actual Windows machine so this should get the bot green again. llvm-svn: 343400	2018-09-30 03:57:49 +00:00
Craig Topper	1709829fed	[X86] Disable BMI BEXTR in X86DAGToDAGISel::matchBEXTRFromAnd unless we're on compiling for a CPU with single uop BEXTR Summary: This function turns (X >> C1) & C2 into a BMI BEXTR or TBM BEXTRI instruction. For BMI BEXTR we have to materialize an immediate into a register to feed to the BEXTR instruction. The BMI BEXTR instruction is 2 uops on Intel CPUs. It looks like on SKL its one port 0/6 uop and one port 1/5 uop. Despite what Agner's tables say. I know one of the uops is a regular shift uop so it would have to go through the port 0/6 shifter unit. So that's the same or worse execution wise than the shift+and which is one 0/6 uop and one 0/1/5/6 uop. The move immediate into register is an additional 0/1/5/6 uop. For now I've limited this transform to AMD CPUs which have a single uop BEXTR. If may also might make sense if we can fold a load or if the and immediate is larger than 32-bits and can't be encoded as a sign extended 32-bit value or if LICM or CSE can hoist the move immediate and share it. But we'd need to look more carefully at that. In the regression I looked at it doesn't look load folding or large immediates were occurring so the regression isn't caused by the loss of those. So we could try to be smarter here if we find a compelling case. Reviewers: RKSimon, spatel, lebedev.ri, andreadb Reviewed By: RKSimon Subscribers: llvm-commits, andreadb, RKSimon Differential Revision: https://reviews.llvm.org/D52570 llvm-svn: 343399	2018-09-30 03:01:46 +00:00
Zachary Turner	6e6d545d24	Only dump the types we need in the test. We added support for dumping pointers but pointers to arrays won't correctly dump until we add support for dumping arrays. Instead of trying to dump everything, which this test isn't even interested in, just dump enums and typedefs. llvm-svn: 343398	2018-09-30 00:51:54 +00:00
Zachary Turner	a1e79e326a	Fix some tests on Windows. I don't actually have a Windows machine at the present moment, so hopefully this fixes it. llvm-svn: 343397	2018-09-30 00:22:21 +00:00
Lang Hames	98440293fb	[ORC] Add partitioning support to CompileOnDemandLayer2. CompileOnDemandLayer2 now supports user-supplied partition functions (the original CompileOnDemandLayer already supported these). Partition functions are called with the list of requested global values (i.e. global values that currently have queries waiting on them) and have an opportunity to select extra global values to materialize at the same time. Also adds testing infrastructure for the new feature to lli. llvm-svn: 343396	2018-09-29 23:49:57 +00:00
Lang Hames	c3053e41bf	[ORC] Clear SymbolToDefinitionMap when materializing a MaterializationUnit. The map is inaccessible at this point, so we may as well reclaim the memory early. llvm-svn: 343395	2018-09-29 23:49:56 +00:00
Lang Hames	d18d69f25a	Add a comment to clarify the contract for LLVMGetErrorMessage in the c-bindings for Error. llvm-svn: 343394	2018-09-29 23:49:54 +00:00
Zachary Turner	6ca6a03c51	[PDB] Better native API support for pointers. We didn't properly detect when a pointer was a member pointer, and when that was the case we were not properly returning class parent info. This caused member pointers to render incorrectly in pretty mode. However, we didn't even have pretty tests for pointers in native mode, so those are also added now to ensure this. llvm-svn: 343393	2018-09-29 23:28:19 +00:00
David Bolvansky	09fd8172df	[DAGCombiner][NFC] Tests for X div/rem Y single bit fold llvm-svn: 343392	2018-09-29 21:00:37 +00:00
Simon Pilgrim	c4e7c347cd	[X86][AVX2] Cleanup shuffle combining tests - add common prefixes llvm-svn: 343391	2018-09-29 20:34:16 +00:00
Simon Pilgrim	a2efe82b81	[X86] SimplifyDemandedVectorEltsForTargetNode - remove identity target shuffles before simplifying inputs By removing demanded target shuffles that simplify to zero/undef/identity before simplifying its inputs we improve chances of further simplification, as only the immediate parent user of the combined is added back to the work list - this still doesn't help us if its passed through other ops though (bitcasts....). llvm-svn: 343390	2018-09-29 18:15:26 +00:00
Craig Topper	845789e823	[X86] Add fast-isel test cases for unaligned load/store intrinsics recently added to clang This adds tests for: _mm_loadu_si16 _mm_loadu_si32 _mm_loadu_si16 _mm_storeu_si64 _mm_storeu_si32 _mm_storeu_si16 llvm-svn: 343389	2018-09-29 18:03:52 +00:00
Simon Pilgrim	a93407fadf	[X86][SSE] LowerScalarImmediateShift - remove 32-bit vXi64 special case handling. This is all handled generally by getTargetConstantBitsFromNode now llvm-svn: 343387	2018-09-29 17:36:22 +00:00
Simon Pilgrim	b5737007cd	Fix signed/unsigned mismatch warning. NFCI. llvm-svn: 343385	2018-09-29 17:11:19 +00:00
Simon Pilgrim	d633e290c8	[X86] getTargetConstantBitsFromNode - add support for rearranging constant bits via shuffles Exposed an issue that recursive calls to getTargetConstantBitsFromNode don't handle changes to EltSizeInBits yet. llvm-svn: 343384	2018-09-29 17:01:55 +00:00
Simon Pilgrim	ae34ae12ef	[X86][SSE] LowerScalarImmediateShift - use getTargetConstantBitsFromNode to get immediate data Don't just attempt to find a splat build vector. First step towards getting rid of all the 32-bit special case code. llvm-svn: 343383	2018-09-29 16:40:35 +00:00
Sanjay Patel	54d31ef87e	[InstCombine] fix formatting in vector evaluators; NFC We need to alter the functionality as shown in D52548. llvm-svn: 343379	2018-09-29 15:05:24 +00:00
Sanjay Patel	20c64510cb	[InstCombine] add test for vector widening of insertelements; NFC The test shows a potential overreach with the fix from D52548. llvm-svn: 343378	2018-09-29 15:01:45 +00:00
Simon Pilgrim	a731940c60	[X86] getTargetConstantBitsFromNode - fix self-move assertions from gcc builds due to rL343375 llvm-svn: 343377	2018-09-29 14:51:09 +00:00
Simon Pilgrim	43e4e648ef	[X86] Regenerate fma comments. llvm-svn: 343376	2018-09-29 14:31:00 +00:00
Simon Pilgrim	22d51014af	[X86] getTargetConstantBitsFromNode - add support for peeking through ISD::EXTRACT_SUBVECTOR llvm-svn: 343375	2018-09-29 14:17:32 +00:00
Simon Pilgrim	aa77033a6b	[X86][SSE] Fixed issue with v2i64 variable shifts on 32-bit targets The shift amount might have peeked through a extract_subvector, altering the number of vector elements in the 'Amt' variable - so we were incorrectly calculating the ratio when peeking through bitcasts, resulting in incorrectly detecting splats. llvm-svn: 343373	2018-09-29 13:25:22 +00:00
Heejin Ahn	5e174e7474	Fix comment indentation in addLandingPad rL343018 messed up the comment indentation while moving it. llvm-svn: 343371	2018-09-29 09:22:25 +00:00
Vitaly Buka	0509070811	[cxx2a] Fix warning triggered by r343285 llvm-svn: 343369	2018-09-29 02:17:12 +00:00
Lang Hames	2afc22ed9e	[ORC] Make MaterializationResponsibility::getRequestedSymbols() const. This makes it available for use in IRTransformLayer2::TransformFunction instances (since a const MaterializationResponsibility& parameter was added in r343365). llvm-svn: 343367	2018-09-28 22:03:17 +00:00
Lang Hames	3e709d5f78	[ORC] Add more utilities to aid debugging output. (1) A const accessor for the LLVMContext held by a ThreadSafeContext. (2) A const accessor for the ThreadSafeModules held by an IRMaterializationUnit. (3) A const MaterializationResponsibility reference to IRTransformLayer2's transform function. This makes IRTransformLayer2 useful for JIT debugging (since it can inspect JIT state through the responsibility argument) as well as program transformations. llvm-svn: 343365	2018-09-28 21:49:53 +00:00
Thomas Lively	d47b5c7bed	[ValueTracking] Allow select patterns to work on FP vectors Summary: This CL allows constant vectors of floats to be recognized as non-NaN and non-zero in select patterns. This change makes `matchSelectPattern` more powerful generally, but was motivated specifically because I wanted fminnan and fmaxnan to be created for vector versions of the scalar patterns they are created for. Tested with check-all on all targets. A testcase in the WebAssembly backend that tests the non-nan codepath is in an upcoming CL. Reviewers: aheejin, dschuff Subscribers: sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52324 llvm-svn: 343364	2018-09-28 21:36:43 +00:00
Robert Widmann	e63a12ccbe	[LLVM-C] Add an accessor for the "value type" of a global Summary: Before this, there was no reasonable way to retrieve the type of a global value (most notably, a function) that was created with the C API. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52659 llvm-svn: 343363	2018-09-28 20:54:29 +00:00
Heejin Ahn	ec3d65b870	[WebAssembly] Fix memory leak on WasmEHFuncInfo Summary: WasmEHFuncInfo objects were not being properly deleted. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52582 llvm-svn: 343362	2018-09-28 20:54:04 +00:00
Eli Friedman	5ab09a684f	[ARM] Fix correctness checks in promoteToConstantPool. Correctly check for relocations in the constant to promote. And don't allow promoting a constant multiple times. This partially fixes https://bugs.llvm.org//show_bug.cgi?id=32780 ; it's not a complete fix because we also need to prevent ARMConstantIslands from cloning the constant. (-arm-promote-constant is currently off by default, and it stays off with this patch. I'll look into turning it on again when all the known issues are fixed.) Differential Revision: https://reviews.llvm.org/D51472 llvm-svn: 343361	2018-09-28 20:27:31 +00:00
Eli Friedman	bb993be56b	[ARM] Use preferred alignment for constants in promoteToConstantPool. This mostly affects IR generated by non-clang frontends because clang generally sets the alignment of globals explicitly. Fixes https://bugs.llvm.org//show_bug.cgi?id=32394 . (-arm-promote-constant is currently off by default, and it stays off with this patch. I'll look into turning it on again when all the known issues are fixed.) Differential Revision: https://reviews.llvm.org/D51469 llvm-svn: 343359	2018-09-28 20:21:51 +00:00
Lang Hames	b62f73420b	[ORC] Narrow a cast: the block guarded by the condition only handles GlobalVariables, not all GlobalValues. llvm-svn: 343358	2018-09-28 20:16:16 +00:00
Craig Topper	98aa643420	[X86] Add test cases for failures to use narrow test with immediate instructions when a truncate is beteen the CMP and the AND and the sign flag is used. The code in X86ISelDAGToDAG only looks through truncates if the sign flag isn't used, but that is overly restrictive. A future patch will improve this. llvm-svn: 343355	2018-09-28 19:06:28 +00:00
Evandro Menezes	fc1852ff1c	[AArch64] Split zero cycle feature more granularly Split the `zcz` feature into specific ones got GP and FP registers, `zcz-gp` and `zcz-fp`, respectively, while retaining the original feature option to mean both. Differential revision: https://reviews.llvm.org/D52621 llvm-svn: 343354	2018-09-28 19:05:09 +00:00
George Karpenkov	86714886d9	GraphWriter: Provide an API for writing a graph into a specified file Always generating a temporary file is not always suitable, especially for tests Differential Revision: https://reviews.llvm.org/D52636 llvm-svn: 343351	2018-09-28 18:49:01 +00:00
David Bolvansky	8e90bad63d	[DAGCombiner] [NFC] Improve X div/rem 1 fold Reviewers: spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52661 llvm-svn: 343349	2018-09-28 18:40:30 +00:00
Chris Matthews	2f897090e4	make lit builtins a package cat.py is not being installed when lit is installed from source. So tests that use the internal shell fail when using cat. llvm-svn: 343347	2018-09-28 17:55:18 +00:00
Andrea Di Biagio	6e218d0a57	[llvm-mca] Add a test for zero-idiom VPERM2F128rr. NFC We don't correctly model the latency and resource usage information for zero-idiom VPERM2F128rr on Jaguar. This is demonstrated by the incorrect numbers in the resource pressure view, and the timeline view. A follow up patch will fix this problem. llvm-svn: 343346	2018-09-28 17:47:09 +00:00
whitequark	58aed9fb2e	[bindings/go] Add Go bindings to the Token type Summary: This type is necessary for implementing coroutines. Reviewers: whitequark Reviewed By: whitequark Subscribers: modocache, llvm-commits Differential Revision: https://reviews.llvm.org/D47684 llvm-svn: 343345	2018-09-28 17:39:59 +00:00
Luke Cheeseman	10981cc884	Revert r343317 - asan buildbots are breaking and I need to investigate the issue llvm-svn: 343341	2018-09-28 17:01:50 +00:00
whitequark	e42a8ecf66	[bindings/go] Add Go bindings for inline assembly Reviewers: harlanhaskins, whitequark, pcc Reviewed By: pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46437 llvm-svn: 343339	2018-09-28 16:48:47 +00:00
whitequark	29b2980159	Revert "[LLVM-C] Add bindings for addCoroutinePassesToExtensionPoints" This reverts commit c4baf7c2f06ff5459c4f5998ce980346e72bff97. Broke the bots, and should really be in Transforms/Coroutines instead. llvm-svn: 343337	2018-09-28 16:45:18 +00:00
whitequark	937afbc365	[LLVM-C] Add bindings for addCoroutinePassesToExtensionPoints Summary: This patch adds bindings to C and Go for addCoroutinePassesToExtensionPoints, which is used to add coroutine passes to the correct locations in PassManagerBuilder. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: mehdi_amini, modocache, llvm-commits Differential Revision: https://reviews.llvm.org/D51642 llvm-svn: 343336	2018-09-28 16:38:11 +00:00
Robert Widmann	d22ee9461f	[LLVM-C] Fix broken build bots Summary: Fix broken bots caused by the merge of D51522. Reviewers: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52657 llvm-svn: 343334	2018-09-28 16:02:26 +00:00
Greg Bedwell	3a109e9ab1	[utils] Cope with the binary having a .exe extension in update_mca_test_checks.py llvm-svn: 343333	2018-09-28 15:39:18 +00:00
Greg Bedwell	becbbe0383	[utils] Stricter checking from update_mca_test_checks.py If any prefixes have been specified on the RUN lines that do not end up ever actually getting printed, raise an Error. This is either an indication that the run lines just need cleaning up, or that something is more fundamentally wrong with the test. Also raise an Error if there are any blocks which cannot be checked because they are not uniquely covered by a prefix. Fixed up a couple of tests where the extra checking flagged up issues. Differential Revision: https://reviews.llvm.org/D48276 llvm-svn: 343332	2018-09-28 15:39:09 +00:00
Greg Bedwell	2f528f8c1e	[utils] Allow better identification of matching blocks in update_mca_test_checks.py Insert empty blocks to cause the positions of matching blocks to match across lists where possible so that later stages of the algorithm can actually identify them as being identical. Regenerated all tests with this change. Differential Revision: https://reviews.llvm.org/D52560 llvm-svn: 343331	2018-09-28 15:38:56 +00:00

... 2 3 4 5 6 ...

170097 Commits