llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	88aa627c0b	[X86][SSE] Added support for lowering to ADDSUBPS/ADDSUBPD with commuted inputs We could already recognise shuffle(FSUB, FADD) -> ADDSUB, this allows us to recognise shuffle(FADD, FSUB) -> ADDSUB by commuting the shuffle mask prior to matching. llvm-svn: 254259	2015-11-29 16:41:04 +00:00
Igor Breger	e293e83f5d	AVX512:Implemented encoding for the vmovq.s instruction. Differential Revision: http://reviews.llvm.org/D14810 llvm-svn: 254248	2015-11-29 07:41:26 +00:00
Renato Golin	5dbc8a5283	Revert "[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM." This reverts commit r254201 and r254202, as it broke test-suite, self-hosting and sanitizer tests on ARM buildbots. llvm-svn: 254234	2015-11-28 17:23:46 +00:00
Jonas Paulsson	f12b925bb1	[Stack realignment] Handling of aligned allocas. This patch implements dynamic realignment of stack objects for targets with a non-realigned stack pointer. Behaviour in FunctionLoweringInfo is changed so that for a target that has StackRealignable set to false, over-aligned static allocas are considered to be variable-sized objects and are handled with DYNAMIC_STACKALLOC nodes. It would be good to group aligned allocas into a single big alloca as an optimization, but this is yet todo. SystemZ benefits from this, due to its stack frame layout. New tests SystemZ/alloca-03.ll for aligned allocas, and SystemZ/alloca-04.ll for "no-realign-stack" attribute on functions. Review and help from Ulrich Weigand and Hal Finkel. llvm-svn: 254227	2015-11-28 11:02:32 +00:00
Artyom Skrobov	f01a59f9fb	Follow-up fix for r254201 llvm-svn: 254202	2015-11-27 16:20:34 +00:00
Artyom Skrobov	b955b90509	[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM. Summary: Since this build attribute corresponds to a whole module, and different functions in a module may differ in the optimizations enabled for them, this attribute is emitted after all functions, and only in the case that the optimization goals for all functions match. Reviewers: logan, hans Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14934 llvm-svn: 254201	2015-11-27 15:30:51 +00:00
Oliver Stannard	b25914e03f	[AArch64] Add ARMv8.2-A FP16 scalar instructions ARMv8.2-A adds 16-bit floating point versions of all existing VFP floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. Most of these instructions are the same as the 32- and 64-bit versions, but with the type field (bits 23-22) set to 0b11. Previously the top bit of the size field was always 0, so the instruction classes only provided a 1-bit size field, which I have widened to 2 bits. Differential Revision: http://reviews.llvm.org/D15014 llvm-svn: 254198	2015-11-27 13:04:48 +00:00
Craig Topper	e38c57a4b8	[X86] Pair a NoVLX with HasAVX512 to match the others and remove a unique predicate check in the isel tables. NFC llvm-svn: 254191	2015-11-27 05:44:02 +00:00
Craig Topper	a47576f297	[X86] Now that X86VPermt2 is used in all the avx512_perm_t_sizes just hardcode it into the patterns instead of passing as an argument. NFC llvm-svn: 254177	2015-11-26 20:21:29 +00:00
Craig Topper	05858f52fe	[X86] Merge X86VPermt2Fp and X86VPermt2Int back together by weakening them just enough. The SDTCisSameSizeAs introduced in r254138 helps here. llvm-svn: 254176	2015-11-26 20:02:01 +00:00
Craig Topper	0009656335	[X86] Split ISD node for Vfpclass and Vfpclasss so that we can write strong type constraints for each that don't cause ambiguous isel. llvm-svn: 254172	2015-11-26 19:41:34 +00:00
Craig Topper	ff2f14731a	[X86] Revert part of r254167 to recover bots. llvm-svn: 254169	2015-11-26 19:13:05 +00:00
Krzysztof Parzyszek	08ff8883fd	[Hexagon] Lowering of V60/HVX vector types llvm-svn: 254168	2015-11-26 18:38:27 +00:00
Craig Topper	9d1deb4b72	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254167	2015-11-26 18:31:19 +00:00
Krzysztof Parzyszek	4eb6d4d1f2	[Hexagon] Hexagon V60 HVX intrinsic definitions Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 254165	2015-11-26 16:54:33 +00:00
Daniel Sanders	daa4b6fbd9	[mips][ias] Range check uimm5 operands and fix several bugs this revealed. Summary: The bugs were: * append, prepend, and balign were not tested * balign takes a uimm2 not a uimm5. * drotr32 was correctly implemented with a uimm5 but the tests expected '52' to be valid. * li/la were implemented with a uimm5 instead of simm32. simm32 isn't completely correct either but I'll fix that when I get to simm32. A notable omission are some of the shift instructions. Several of these have been implemented using a single uimm6 instruction (rather than two uimm5 instructions and a CodeGen-only uimm6 pseudo). These will be updated in the uimm6 patch. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14712 llvm-svn: 254164	2015-11-26 16:35:41 +00:00
Oliver Stannard	64c167db7a	[AArch64] Add ARMv8.2-A new AT instruction variants ARMv8.2-A adds new variants of the "at" (address translate) system instruction, which take the PSTATE.PAN bit (added in ARMv8.1-A). These are a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15018 llvm-svn: 254159	2015-11-26 15:34:44 +00:00
Martell Malone	d12292480a	ARM: address WOA unsigned division overflow crash Building on r253865 the crash is not limited to signed overflows. Disable custom handling of unsigned 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit unsigned integer overflow. llvm-svn: 254158	2015-11-26 15:34:03 +00:00
Oliver Stannard	911ea20f07	[AArch64] Add ARMv8.2-A UAO PSTATE bit ARMv8.2-A adds a new PSTATE bit, PSTATE.UAO, which allows the LDTR/STTR instructions to behave the same as LDR/STR with respect to execute-only pages at higher privilege levels. New variants of the MSR/MRS instructions are added to allow reading and writing this bit. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15020 llvm-svn: 254157	2015-11-26 15:32:30 +00:00
Oliver Stannard	1a81cc9f43	[AArch64] Add ARMv8.2-A persistent memory instruction ARMv8.2-A adds the "dc cvap" instruction, which is a system instruction that cleans caches to the point of persistence (for systems that have persistent memory). It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15016 llvm-svn: 254156	2015-11-26 15:28:47 +00:00
Oliver Stannard	48b43741d0	[AArch64] Add ARMv8.2-A ID_A64MMFR2_EL1 register ARMv8.2-A adds a new ID register, ID_A64MMFR2_EL1, which behaves in the same way as ID_A64MMFR0_EL1 and ID_A64MMFR1_EL1. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15017 llvm-svn: 254155	2015-11-26 15:26:10 +00:00
Oliver Stannard	7cc0c4e675	[AArch64] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15013 llvm-svn: 254154	2015-11-26 15:23:32 +00:00
Craig Topper	a3ac738725	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254142	2015-11-26 07:58:20 +00:00
Vyacheslav Klochkov	ed865dfcc5	X86-FMA3: Improved/enabled the memory folding optimization for scalar loads generated for _mm_load_s{s,d}() intrinsics and used in scalar FMAs generated for FMA intrinsics _mm_f{madd,msub,nmadd,nmsub}_s{s,d}(). Reviewer: David Kreitzer Differential Revision: http://reviews.llvm.org/D14762 llvm-svn: 254140	2015-11-26 07:45:30 +00:00
Craig Topper	4c175cdc8e	[X86] Strengthen the type constraints on X86psadbw and X86dbpsadbw to reduce some of the type checks in the isel matching tables. llvm-svn: 254139	2015-11-26 07:02:21 +00:00
Krzysztof Parzyszek	195dc8d0db	[Hexagon] HVX vector register classes and more isel patterns llvm-svn: 254132	2015-11-26 04:33:11 +00:00
Tom Stellard	48f29f21ee	AMDGPU: Add llvm.amdgcn.dispatch.ptr intrinsic Summary: This returns a pointer to the dispatch packet, which can be used to load information about the kernel dispatch. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14898 llvm-svn: 254116	2015-11-26 00:43:29 +00:00
Dan Gohman	a774d719a0	[WebAssembly] Fix inline asm support for i64 operands. llvm-svn: 254106	2015-11-25 22:28:50 +00:00
Dan Gohman	d9b4218831	[WebAssembly] Fold setne and seteq comparisons into selects. llvm-svn: 254104	2015-11-25 22:13:48 +00:00
Krzysztof Parzyszek	70a134d29f	[Hexagon] Treat transfers of FP immediates as pseudo instructions This is a temporary fix to address ICE on 2005-10-21-longlonggtu.ll. The proper fix will be to use A2_tfrsi, but it will need more work to teach all users of A2_tfrsi to also expect a floating-point operand. llvm-svn: 254099	2015-11-25 21:40:03 +00:00
Dan Gohman	5941bde03c	[WebAssembly] Add some comments. NFC. llvm-svn: 254096	2015-11-25 21:32:06 +00:00
Marek Olsak	7ed6b2f414	AMDGPU/SI: select S_ABS_I32 when possible (v2) v2: added more tests, moved the SALU->VALU conversion to a separate function It looks like it's not possible to get subregisters in the S_ABS lowering code, and I don't feel like guessing without testing what the correct code would look like. llvm-svn: 254095	2015-11-25 21:22:45 +00:00
Dan Gohman	80e34e0a18	[WebAssembly] Fix WebAssembly register numbering for registers added late. If virtual registers are created late, mappings to WebAssembly registers need to be added explicitly. This patch adds a function to do so and teaches WebAssemblyPeephole to use it. This fixes an out-of-bounds access on the WARegs vector. llvm-svn: 254094	2015-11-25 21:13:02 +00:00
Matt Arsenault	49affb8462	AMDGPU: Check feature attributes in SIMachineFunctionInfo llvm-svn: 254091	2015-11-25 20:55:12 +00:00
Krzysztof Parzyszek	207c13f254	Add hexagonv55 and hexagonv60 as recognized CPUs, make v60 the default llvm-svn: 254089	2015-11-25 20:30:59 +00:00
Matt Arsenault	61001bbc03	AMDGPU: Make v2i64/v2f64 legal types. They can be loaded and stored, so count them as legal. This is mostly to fix a number of common cases for load/store merging. llvm-svn: 254086	2015-11-25 19:58:34 +00:00
Artyom Skrobov	314ee04268	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Dan Gohman	fb3e0594e4	[WebAssembly] Use a physical register to describe ARGUMENT liveness. Instead of trying to move ARGUMENT instructions back up to the top after they've been scheduled or sunk down, use a fake physical register to create a liveness constraint that prevents ARGUMENT instructions from moving down in the first place. This is still not entirely ideal, however it is more robust than letting them move and moving them back. llvm-svn: 254084	2015-11-25 19:36:19 +00:00
Dan Gohman	9c54d3b4c6	[WebAssembly] Clean up several FIXME comments. llvm-svn: 254079	2015-11-25 18:13:18 +00:00
Dan Gohman	81719f8555	[WebAssembly] Support for register stackifying with load and store instructions. llvm-svn: 254076	2015-11-25 16:55:01 +00:00
Dan Gohman	2c8fe6a428	[WebAssembly] Codegen support for ISD::ExternalSymbol llvm-svn: 254075	2015-11-25 16:44:29 +00:00
Dan Gohman	fd4a88c376	[WebAssembly] Add 'final' to some classes. NFC. llvm-svn: 254073	2015-11-25 16:29:24 +00:00
Dan Gohman	04c0401f28	[WebAssembly] Whitespace consistency. NFC. llvm-svn: 254071	2015-11-25 16:26:14 +00:00
Sanjay Patel	25150784ae	fix typo; NFC llvm-svn: 254069	2015-11-25 15:33:36 +00:00
Hal Finkel	005f840959	[PowerPC] Don't generate mfocrf on the e500mc The e500mc does not actually support the mfocrf instruction; update the processor definitions to reflect that fact. Patch by Tom Rix (with some test-case cleanup by me). llvm-svn: 254064	2015-11-25 10:14:31 +00:00
Elena Demikhovsky	f07df9fcac	AVX-512: Fixed a bug in VPERMT2* intrinsic. The order of operands (from intrinsic to DAG node) was wrong. I added a stricter type specification for instruction selection. Differential Revision: http://reviews.llvm.org/D14942 llvm-svn: 254059	2015-11-25 08:17:56 +00:00
Hans Wennborg	e412b71f95	Revert r253528: "[X86] Enable shrink-wrapping by default." This caused PR25607 and also caused Chromium to crash on start-up. (Also had to update test/CodeGen/X86/avx-splat.ll, which was committed after shrink wrapping was enabled.) llvm-svn: 254044	2015-11-25 00:05:13 +00:00
Kaelyn Takata	d0955312d9	Fix an asan error where NumElements > 32 for at least one case in test/CodeGen/X86/avg.ll. llvm-svn: 254043	2015-11-25 00:03:29 +00:00
Simon Pilgrim	1b4fecb098	[X86][FMA] Optimize FNEG(FMA) Patterns X86 needs to use its own FMA opcodes, preventing the standard FNEG(FMA) pattern table recognition method used by other platforms. This patch adds support for lowering FNEG(FMA(X,Y,Z)) into a single suitably negated FMA instruction. Fix for PR24364 Differential Revision: http://reviews.llvm.org/D14906 llvm-svn: 254016	2015-11-24 20:31:46 +00:00
Cong Hou	db6220f84d	[X86] Fix several issues related to X86's psadbw instruction. This patch fixes the following issues: 1. Fix the return type of X86psadbw: it should not be the same type as the inputs. For vNi8 inputs the output should be vMi64, where M = N/8. 2. Fix the return type of int_x86_avx512_psad_bw_512 accordingly. 3. Fix the definition of PSADBW, VPSADBW, and VPSADBWY accordingly. 4. Adjust the return type when building a DAG node of X86ISD::PSADBW type. 5. Update related tests. Differential revision: http://reviews.llvm.org/D14897 llvm-svn: 254010	2015-11-24 19:51:26 +00:00


35140 Commits