llvm-project

Commit Graph

Author	SHA1	Message	Date
Justin Bogner	fde9f2e51d	SDAG: Use ReplaceNode here, not ReplaceUses This was a typo in an earlier commit - there's no point in keeping the old node around here. Noticed by Meador Inge. Thanks! llvm-svn: 269245	2016-05-11 22:21:50 +00:00
Justin Bogner	31d7da3b5f	SDAG: Add a helper to replace and remove a node during ISel It's very common to want to replace a node and then remove it since it's dead, especially as we port backends from the SDNode *Select API to the void Select one. This helper makes this sequence a bit less verbose. llvm-svn: 269236	2016-05-11 21:13:17 +00:00
Simon Pilgrim	6ce35dd9ea	[X86][AVX512] Fixed VPERMILPD/VPERMILPS shuffle comments. Fixed incorrect operands indices used to access src registers llvm-svn: 269221	2016-05-11 18:53:44 +00:00
Justin Bogner	c200ad7e3b	SDAG: Minor cleanup in X86 Don't bother returning a result we don't use here. I've also renamed this from selectGather to tryGather to better indicate that it may not do anything. llvm-svn: 269215	2016-05-11 17:46:03 +00:00
Simon Pilgrim	3016d9e9e1	[X86][SSE] Avoid repeatedly calling MCInst::getNumOperands(). NFCI. llvm-svn: 269209	2016-05-11 17:36:32 +00:00
Simon Pilgrim	41c05c019e	[X86][AVX512] Updated shuffle comments instruction macros to split writemask instructions. NFC This will make it easier to support the different writemask cases in shuffle comments llvm-svn: 269174	2016-05-11 11:55:12 +00:00
Justin Bogner	593741d354	SDAG: Implement Select instead of SelectImpl in X86 This is part of the work to have Select return void instead of an SDNode *, which is in turn part of llvm.org/pr26808. llvm-svn: 269144	2016-05-10 23:55:37 +00:00
Quentin Colombet	220f7da488	[X86] Properly check that EAX is dead when copying EFLAGS. This fixes a bug introduced in r267623, where we got smarter and avoided to save EAX before using it. However, we failed to check if any of the subregister of EAX were alive and thus, missed cases where we have to save EAX before using it. The problem may happen on every X86/i386/... platform. This fixes llvm.org/PR27624 llvm-svn: 269115	2016-05-10 20:49:46 +00:00
Jonas Paulsson	8e5b0c65cc	[foldMemoryOperand()] Pass LiveIntervals to enable liveness check. SystemZ (and probably other targets as well) can fold a memory operand by changing the opcode into a new instruction that as a side-effect also clobbers the CC-reg. In order to do this, liveness of that reg must first be checked. When LIS is passed, getRegUnit() can be called on it and the right LiveRange is computed on demand. Reviewed by Matthias Braun. http://reviews.llvm.org/D19861 llvm-svn: 269026	2016-05-10 08:09:37 +00:00
Craig Topper	3e0c038a84	[X86][AVX512] Strengthen the assertions from r269001. We need VLX to use the 128/256-bit move opcodes for extended registers. llvm-svn: 269019	2016-05-10 05:28:04 +00:00
Craig Topper	9f8e50cdb4	[X86] Add ZMM registers to the X86_INTR calling convention preserved mask when AVX512 is enabled. llvm-svn: 269018	2016-05-10 05:28:02 +00:00
Craig Topper	3fef1de785	[X86] Update X86_INTR calling convention to save ZMM registers instead of YMM registers when AVX512 is enabled. llvm-svn: 269017	2016-05-10 05:27:56 +00:00
Matthias Braun	31d19d43c7	CodeGen: Move TargetPassConfig from Passes.h to an own header; NFC Many files include Passes.h but only a fraction needs to know about the TargetPassConfig class. Move it into an own header. Also rename Passes.cpp to TargetPassConfig.cpp while we are at it. llvm-svn: 269011	2016-05-10 03:21:59 +00:00
Quentin Colombet	ee5f36bd54	[X86][AVX512] Use the proper load/store for AVX512 registers. When loading or storing AVX512 registers we were not using the AVX512 variant of the load and store for VR128 and VR256 like registers. Thus, we ended up with the wrong encoding and actually were dropping the high bits of the instruction. The result was that we load or store the wrong register. The effect is visible only when we emit the object file directly and disassemble it. Then, the output of the disassembler does not match the assembly input. This is related to llvm.org/PR27481. llvm-svn: 269001	2016-05-10 01:09:14 +00:00
Quentin Colombet	739614839f	[X86] Fix the AllRegs AVX calling convention. We used to list registers that were not in the AVX space. In other words, we were pushing registers that the ISA cannot encode (YMM16-YMM31). This is part of llvm.org/PR27481. llvm-svn: 268983	2016-05-09 22:37:05 +00:00
Quentin Colombet	b47b9b2de7	[X86] Strengthen the setting of inline asm constraints for fp regclasses. This is similar to r268953, but for floating point and vector register classes. Explanations: The setting of the inline asm constraints was implicitly relying on the order of the register classes in the file generated by tablegen. Since, we do not have any control on that order, make sure we do not depend on it anymore. llvm-svn: 268973	2016-05-09 21:24:31 +00:00
Simon Pilgrim	eec3a95f95	[X86][SSE] Improve cost model for i64 vector comparisons on pre-SSE42 targets As discussed on PR24888, until SSE42 we don't have access to PCMPGTQ for v2i64 comparisons, but the cost models don't reflect this, resulting in over-optimistic vectorizaton. This patch adds SSE2 'base level' costs that match what a typical target is capable of and only reduces the v2i64 costs at SSE42. Technically SSE41 provides a PCMPEQQ v2i64 equality test, but as getCmpSelInstrCost doesn't give us a way to discriminate between comparison test types we can't easily make use of this, otherwise we could split the cost of integer equality and greater-than tests to give better costings of each. Differential Revision: http://reviews.llvm.org/D20057 llvm-svn: 268972	2016-05-09 21:14:38 +00:00
Quentin Colombet	3126db6fd7	[X86] Drop the 64-bit alignment for LOW32_ADDR_ACCESS register class. The only 64-bit register in that register class is RIP and it will not get spilled in the current ABIs. llvm-svn: 268963	2016-05-09 19:50:30 +00:00
Quentin Colombet	86098ab10b	Reapply [X86] Add a new LOW32_ADDR_ACCESS_RBP register class. This reapplies commit r268796, with a fix for the setting of the inline asm constraints. I.e., "mark" LOW32_ADDR_ACCESS_RBP as a GR variant, so that the regular processing of the GR operands (setting of the subregisters) happens. Original commit log: [X86] Add a new LOW32_ADDR_ACCESS_RBP register class. ABIs like NaCl uses 32-bit addresses but have 64-bit frame. The new register class reflects those constraints when choosing a register class for a address access. llvm-svn: 268955	2016-05-09 19:01:46 +00:00
Quentin Colombet	bb15ce3d1f	[X86] Strengthen the setting of inline asm constraints. The setting of the inline asm constraints was implicitly relying on the order of the register classes in the file generated by tablegen. Since, we do not have any control on that order, make sure we do not depend on it anymore. llvm-svn: 268953	2016-05-09 19:01:35 +00:00
Simon Pilgrim	af742d51ad	[X86][SSE] Added TODO comment to add support for AVX512 mask registers to shuffle comments This came up in discussion on D19198 llvm-svn: 268915	2016-05-09 13:30:16 +00:00
Craig Topper	a5d0bf5c36	[X86] Strengthen some type contraints for floating point round and extend. llvm-svn: 268892	2016-05-09 05:34:14 +00:00
Craig Topper	a58abd1cc6	[AVX512] Fix up types for arguments of int_x86_avx512_mask_cvtsd2ss_round and int_x86_avx512_mask_cvtss2sd_round. Only the argument being converted should be a different type. The other 2 argument should have the same type as the result. llvm-svn: 268891	2016-05-09 05:34:12 +00:00
Craig Topper	707c89c00d	[AVX512] Add non-temporal store patterns for v16i32/v32i16/v64i8. llvm-svn: 268889	2016-05-08 23:43:17 +00:00
Craig Topper	c41320d700	[AVX512] Add missing patterns for non-temporal stores of 128/256-bit vXi8/vXi16/vXi32 when VLX is enabled. The equivalent AVX1/2 patterns are disabled by VLX. This caused regular stores to be emitted instead. llvm-svn: 268886	2016-05-08 23:08:45 +00:00
Craig Topper	906f397137	[AVX512] Change predicates on some vXi16/vXi8 AVX store patterns so they stay enabled unless VLX and BWI instructions are supported." Without this we could fail instruction selection if VLX was enabled, but BWI wasn't. llvm-svn: 268885	2016-05-08 23:08:40 +00:00
Craig Topper	e5ce84a33c	[AVX512] Add VLX 128/256-bit SET0 operations that encode to 128/256-bit EVEX encoded VPXORD so all 32 registers can be used. llvm-svn: 268884	2016-05-08 21:33:53 +00:00
Craig Topper	9d9251b86f	[X86] Remove extra patterns that check for BUILD_VECTOR of all 0s. These are always canonicalized to v4i32/v8i32/v16i32 except for in SSE1 only when only v4f32 is supported. llvm-svn: 268880	2016-05-08 20:10:20 +00:00
David Majnemer	eac58d8f68	[X86] Promote several single precision FP libcalls on Windows A number of libcalls don't exist in any particular lib but are, instead, defined in math.h as inline functions (even in C mode!). Don't rely on their existence when lowering @llvm.{cos,sin,floor,..}.f32, promote them instead. N.B. We had logic to handle FREM but were missing out on a number of others. This change generalizes the FREM handling. llvm-svn: 268875	2016-05-08 08:15:50 +00:00
Craig Topper	d681e23336	[X86] Lower 256-bit vector all-zero constants to v8i32 even with AVX1 only. Either way a 256-bit VXORPS will be used. llvm-svn: 268873	2016-05-08 07:10:54 +00:00
Craig Topper	3d6722910c	[X86] Add patterns for 256-bit non-temporal stores when only AVX1 is supported. While there, add a predicate to the SSE2 patterns to avoid an ordering dependency. llvm-svn: 268872	2016-05-08 07:10:50 +00:00
Craig Topper	d788498411	[X86] No need to avoid selecting AVX_SET0 for 256-bit integer types when only AVX1 is supported. AVX_SET0 just expands to 256-bit VXORPS which is legal in AVX1. llvm-svn: 268871	2016-05-08 07:10:47 +00:00
Craig Topper	6502975cf5	[X86] Fix InstAliases to not allow FARCALL32i/FARCALL16i/FARJMP32i/FARJMP16i in 64-bit mode. llvm-svn: 268863	2016-05-07 19:25:56 +00:00
Simon Pilgrim	96e5307d4e	[X86] Pulled out duplicate mask width calculation. NFCI. llvm-svn: 268861	2016-05-07 18:04:24 +00:00
Sanjay Patel	c2751e7050	[x86, BMI] add TLI hook for 'andn' and use it to simplify comparisons For the sake of minimalism, this patch is x86 only, but I think that at least PPC, ARM, AArch64, and Sparc probably want to do this too. We might want to generalize the hook and pattern recognition for a target like PPC that has a full assortment of negated logic ops (orc, nand). Note that http://reviews.llvm.org/D18842 will cause this transform to trigger more often. For reference, this relates to: https://llvm.org/bugs/show_bug.cgi?id=27105 https://llvm.org/bugs/show_bug.cgi?id=27202 https://llvm.org/bugs/show_bug.cgi?id=27203 https://llvm.org/bugs/show_bug.cgi?id=27328 Differential Revision: http://reviews.llvm.org/D19087 llvm-svn: 268858	2016-05-07 15:03:40 +00:00
Ahmed Bougacha	04a8fc2e37	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. This re-applies r268760, reverted in r268794. Fixes http://llvm.org/PR27670 The original imp-defs assertion was way overzealous: forward all implicit operands, except imp-defs of the new super-reg def (r268787 for GR64, but also possible for GR16->GR32), or imp-uses of the new super-reg use. While there, mark the source use as Undef, and add an imp-use of the old source reg: that should cover any case of dead super-regs. At the stage the pass runs, flags are unlikely to matter anyway; still, let's be as correct as possible. Also add MIR tests for the various interesting cases. Original commit message: Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268831	2016-05-07 01:11:17 +00:00
Ahmed Bougacha	068ac4af39	[X86] Register and initialize the FixupBW pass. That lets us use it in MIR tests. llvm-svn: 268830	2016-05-07 01:11:10 +00:00
Quentin Colombet	a09f050dc1	Revert "[X86] Add a new LOW32_ADDR_ACCESS_RBP register class." This reverts commit r268796. I believe it breaks test/CodeGen/X86/asm-mismatched-types.ll with: Cannot emit physreg copy instruction llvm-svn: 268799	2016-05-06 21:21:50 +00:00
Quentin Colombet	2728074e3c	[X86] Add a new LOW32_ADDR_ACCESS_RBP register class. ABIs like NaCl uses 32-bit addresses but have 64-bit frame. The new register class reflects those constraints when choosing a register class for a address access. llvm-svn: 268796	2016-05-06 21:10:53 +00:00
Quentin Colombet	377fc2aa3d	[X86] Rename the X32_ADDR_ACCESS register class into LOW32_ADDR_ACCESS. This register class may be used by any ABIs that uses x86_64 ISA while using 32-bit addresses, not just in X32 cases. Make sure the name reflects that. llvm-svn: 268795	2016-05-06 21:10:43 +00:00
Nico Weber	9b32b4fbee	Revert r268760, it caused PR27670. llvm-svn: 268794	2016-05-06 21:07:02 +00:00
Ahmed Bougacha	505984b466	[X86] Accept imp-defs of GR64 super-registers in FixupBW MOVrr. Testcase will follow shortly. llvm-svn: 268787	2016-05-06 20:03:03 +00:00
Quentin Colombet	a065ac45ee	[X86] Get rid of X32_NOREX_ADDR_ACCESS register class. According to H.J. Lu <hjl.tools@gmail.com>, this register class is never used. llvm-svn: 268771	2016-05-06 18:22:48 +00:00
Ahmed Bougacha	258426ca7a	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268760	2016-05-06 17:42:57 +00:00
Ahmed Bougacha	04200a7c86	[X86] Remove \brief in FixupBW. NFC. llvm-svn: 268754	2016-05-06 17:28:47 +00:00
Ahmed Bougacha	cfd9e55e90	[X86] Simplify FixupBW sub_8bit_hi-related logic. NFC. Instead of passing around sizes and asking for subregs, we can check the subreg indices we care about: sub_8bit_hi and sub_8bit. Differential Revision: http://reviews.llvm.org/D20006 llvm-svn: 268753	2016-05-06 17:28:42 +00:00
Justin Bogner	b012699741	SDAG: Rename Select->SelectImpl and repurpose Select as returning void This is a step towards removing the rampant undefined behaviour in SelectionDAG, which is a part of llvm.org/PR26808. We rename SelectionDAGISel::Select to SelectImpl and update targets to match, and then change Select to return void and consolidate the sketchy behaviour we're trying to get away from there. Next, we'll update backends to implement `void Select(...)` instead of SelectImpl and eventually drop the base Select implementation. llvm-svn: 268693	2016-05-05 23:19:08 +00:00
Hans Wennborg	501e739d8a	X86CallFrameOptimization: make adjustCallSequence's return type void It always returned the same value (true). No functionality change. llvm-svn: 268645	2016-05-05 16:39:31 +00:00
Marcin Koscielnicki	0275fac2c9	[X86] Extend some Linux special cases to cover kFreeBSD. Both Linux and kFreeBSD use glibc, so follow similiar code paths. Add isTargetGlibc to check for this, and use it instead of isTargetLinux in a few places. Fixes PR22248 for kFreeBSD. Differential Revision: http://reviews.llvm.org/D19104 llvm-svn: 268624	2016-05-05 11:35:51 +00:00
David Majnemer	911d0e3c21	[X86] Use the right type when folding xor (truncate (shift)) -> setcc The result type of setcc is dependent on whether or not AVX512 is present. We had an X86-specific DAG-combine which assumed that the result type should be i8 when it could be i1. This meant that we would generate illegal setccs which LowerSETCC did not like. Instead, use an appropriate type and zero extend to i8. Also, there were some scenarios where the fold should have fired but didn't because we were overly cautious about the types. This meant that we generated: shrl $31, %edi andl $1, %edi kmovw %edi, %k0 kxnorw %k0, %k0, %k1 kshiftrw $15, %k1, %k1 kxorw %k1, %k0, %k0 kmovw %k0, %eax instead of: testl %edi, %edi setns %al This fixes PR27638. llvm-svn: 268609	2016-05-05 06:00:56 +00:00

1 2 3 4 5 ...

13124 Commits