llvm-project

Commit Graph

Author	SHA1	Message	Date
Tanya Lattner	266792a55a	Prevent ARM DAG Combiner from doing an AND or OR combine on an illegal vector type (vectors of size 3). Also included test cases. llvm-svn: 129074	2011-04-07 15:24:20 +00:00
Johnny Chen	313ec7953a	Sanity check MSRi for invalid mask values and reject it as invalid. rdar://problem/9246844 llvm-svn: 129050	2011-04-07 01:37:34 +00:00
Eli Friedman	c5f22a7815	PR9634: Don't unconditionally tell the AliasSetTracker that the PreheaderLoad is equivalent to any other relevant value; it isn't true in general. If it is equivalent, the LoopPromoter will tell the AST the equivalence. Also, delete the PreheaderLoad if it is unused. Chris, since you were the last one to make major changes here, can you check that this is sane? llvm-svn: 129049	2011-04-07 01:35:06 +00:00
Johnny Chen	c0e86fb965	The ARM disassembler was not recognizing USADA8 instruction. Need to add checking for register values for USAD8 and USADA8. rdar://problem/9247060 llvm-svn: 129047	2011-04-07 01:05:52 +00:00
Evan Cheng	a7c7b54dde	Change -arm-divmod-libcall to a target neutral option. llvm-svn: 129045	2011-04-07 00:58:44 +00:00
Johnny Chen	d4cced54b3	Should also check SMLAD for invalid register values. rdar://problem/9246650 llvm-svn: 129042	2011-04-07 00:50:25 +00:00
Owen Anderson	bdff1c997a	Teach the ARM peephole optimizer that RSB, RSC, ADC, and SBC can be used for folded comparisons, just like ADD and SUB. llvm-svn: 129038	2011-04-06 23:35:59 +00:00
Johnny Chen	bd9a4f8d07	A8.6.393 The ARM disassembler should reject invalid (type, align) encodings as invalid instructions. So, instead of: Opcode=1641 Name=VST2b32_UPD Format=ARM_FORMAT_NLdSt(30) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 1: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| 0: 0: 1: 1\| 0: 0: 0: 0\| 1: 0: 0: 1\| 1: 0: 1: 1\| 0: 0: 1: 1\| ------------------------------------------------------------------------------------------------- vst2.32 {d0, d2}, [r3, :256], r3 we now have: Opcode=1641 Name=VST2b32_UPD Format=ARM_FORMAT_NLdSt(30) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 1: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| 0: 0: 1: 1\| 0: 0: 0: 0\| 1: 0: 0: 1\| 1: 0: 1: 1\| 0: 0: 1: 1\| ------------------------------------------------------------------------------------------------- mc-input.txt:1:1: warning: invalid instruction encoding 0xb3 0x9 0x3 0xf4 ^ llvm-svn: 129033	2011-04-06 22:14:48 +00:00
Johnny Chen	2ac486e387	A8.6.92 MCR (Encoding A1): if coproc == '101x' then SEE "Advanced SIMD and VFP" Since these "Advanced SIMD and VFP" instructions have more specfic encoding bits specified, if coproc == 10 or 11, we should reject the insn as invalid. rdar://problem/9239922 rdar://problem/9239596 llvm-svn: 129027	2011-04-06 20:49:02 +00:00
Johnny Chen	8bca174f48	Fix a bug in the disassembly of VGETLNs8 where the lane index was wrong. Also set the encoding bits (for A8.6.303, A8.6.328, A8.6.329) Inst{3-0} = 0b0000, in class NVLaneOp. rdar://problem/9240648 llvm-svn: 129015	2011-04-06 18:27:46 +00:00
Nadav Rotem	cc771acd77	This testcase passed even without the fix. Added the target info to make the test fail (without the fix). Thanks Dan. llvm-svn: 128999	2011-04-06 11:18:29 +00:00
Johnny Chen	0ec0e98a6a	Add a missing opcode (SMLSLDX) to BadRegsMulFrm() function. Add more complete sanity check for LdStFrm instructions where if IBit (Inst{25}) is 1, Inst{4} should be 0. Otherwise, we should reject the insn as invalid. rdar://problem/9239347 rdar://problem/9239467 llvm-svn: 128977	2011-04-06 01:18:32 +00:00
Johnny Chen	f6e327c6a3	Fix a typo in the handling of PKHTB opcode, plus add sanity check for illegal register encodings for DisassembleArithMiscFrm(). rdar://problem/9238659 llvm-svn: 128958	2011-04-05 23:28:00 +00:00
Johnny Chen	c3656d29f6	A7.3 register encoding Qd -> bit[12] == 0 Qn -> bit[16] == 0 Qm -> bit[0] == 0 If one of these bits is 1, the instruction is UNDEFINED. rdar://problem/9238399 rdar://problem/9238445 llvm-svn: 128949	2011-04-05 22:57:07 +00:00
Johnny Chen	9da60e016b	ARM disassembler was erroneously accepting an invalid RSC instruction. Added checks for regs which should not be 15. rdar://problem/9237734 llvm-svn: 128945	2011-04-05 22:18:07 +00:00
Chris Lattner	57ee5a5db7	remove postdom frontiers, because it is dead. Forward dom frontiers are still used by RegionInfo :( llvm-svn: 128943	2011-04-05 21:57:17 +00:00
Johnny Chen	25883487a1	ARM disassembler was erroneously accepting an invalid LSL instruction. For register-controlled shifts, we should check that the encoding constraint Inst{7} = 0 and Inst{4} = 1 is satisfied. rdar://problem/9237693 llvm-svn: 128941	2011-04-05 21:49:44 +00:00
Jakob Stoklund Olesen	1ec41e2bd9	These tests no longer require linear scan because reserved register coalescing is now universal. llvm-svn: 128936	2011-04-05 21:40:41 +00:00
Jakob Stoklund Olesen	6aa0fbf4c0	Run LiveDebugVariables in RegAllocBasic and RegAllocGreedy. llvm-svn: 128935	2011-04-05 21:40:37 +00:00
Johnny Chen	e9c644d4a0	The r128085 checkin modified the operand ordering for MRC/MRC2 instructions. Modify DisassembleCoprocessor() of ARMDisassemblerCore.cpp to react to the change. rdar://problem/9236873 llvm-svn: 128922	2011-04-05 20:32:23 +00:00
Jakob Stoklund Olesen	e20fec7732	Fix one more batch of X86 tests to be register allocation dependent. llvm-svn: 128919	2011-04-05 20:20:30 +00:00
Jakob Stoklund Olesen	18fd84c79a	When dead code elimination removes all but one use, try to fold the single def into the remaining use. Rematerialization can leave single-use loads behind that we might as well fold whenever possible. llvm-svn: 128918	2011-04-05 20:20:26 +00:00
Johnny Chen	151582492d	ARM disassembler should flag (rGPRRegClassID, r13\|r15) as an error. llvm-svn: 128913	2011-04-05 19:42:11 +00:00
Johnny Chen	56c15c64b0	LDRD now prints out two dst registers. llvm-svn: 128909	2011-04-05 18:53:14 +00:00
Johnny Chen	293875ef55	Fix test-llvm failures. llvm-svn: 128906	2011-04-05 18:41:40 +00:00
Johnny Chen	33d3a9fadc	Constants with multiple encodings (ARM): An alternative syntax is available for a modified immediate constant that permits the programmer to specify the encoding directly. In this syntax, #<const> is instead written as #<byte>,#<rot>, where: <byte> is the numeric value of abcdefgh, in the range 0-255 <rot> is twice the numeric value of rotation, an even number in the range 0-30. llvm-svn: 128897	2011-04-05 18:02:46 +00:00
Johnny Chen	268d63f307	Check for invalid register encodings for UMAAL and friends where: if dLo == 15 \|\| dHi == 15 \|\| n == 15 \|\| m == 15 then UNPREDICTABLE; if dHi == dLo then UNPREDICTABLE; rdar://problem/9230202 llvm-svn: 128895	2011-04-05 17:43:10 +00:00
Stuart Hastings	345094777f	ARM doesn't support byval yet. XFAIL this test until it does. llvm-svn: 128891	2011-04-05 17:16:21 +00:00
Jakob Stoklund Olesen	76ad3debab	Ensure all defs referring to a virtual register are marked dead by addRegisterDead(). There can be multiple defs for a single virtual register when they are defining sub-registers. The missing <dead> flag was stopping the inline spiller from eliminating dead code after rematerialization. llvm-svn: 128888	2011-04-05 16:53:50 +00:00
Rafael Espindola	7dd4d6e2e8	Print visibility info for external variables. llvm-svn: 128887	2011-04-05 15:51:32 +00:00
Nadav Rotem	a069c6ce05	InstCombine optimizes gep(bitcast(x)) even when the bitcasts casts away address space info. We crash with an assert in this case. This change checks that the address space of the bitcasted pointer is the same as the gep ptr. llvm-svn: 128884	2011-04-05 14:29:52 +00:00
Eric Christopher	f392a69ff7	Fix up testcase for previous commit. llvm-svn: 128870	2011-04-05 00:56:01 +00:00
Jakob Stoklund Olesen	bd09d45489	Fix register-dependent X86 tests. llvm-svn: 128867	2011-04-05 00:32:44 +00:00
Johnny Chen	9b3ccba636	Fix SRS/SRSW encoding bits. rdar://problem/9230801 ARM disassembler discrepancy: erroneously accepting SRS Plus add invalid-RFEorLDMIA-arm.txt test which should have been checked in with http://llvm.org/viewvc/llvm-project?view=rev&revision=128859. llvm-svn: 128864	2011-04-05 00:16:18 +00:00
Jakob Stoklund Olesen	2e85396509	Allow coalescing with reserved physregs in certain cases: When a virtual register has a single value that is defined as a copy of a reserved register, permit that copy to be joined. These virtual register are usually copies of the stack pointer: %vreg75<def> = COPY %ESP; GR32:%vreg75 MOV32mr %vreg75, 1, %noreg, 0, %noreg, %vreg74<kill> MOV32mi %vreg75, 1, %noreg, 8, %noreg, 0 MOV32mi %vreg75<kill>, 1, %noreg, 4, %noreg, 0 CALLpcrel32 ... Coalescing these virtual registers early decreases register pressure. Previously, they were coalesced by RALinScan::attemptTrivialCoalescing after register allocation was completed. The lower register pressure causes the mcinst-lowering-cmp0.ll test case to fail because it depends on linear scan spilling a particular register. I am deleting 2008-08-05-SpillerBug.ll because it is counting the number of instructions emitted, and its revision history shows the 'correct' count being edited many times. llvm-svn: 128845	2011-04-04 21:00:03 +00:00
Johnny Chen	8372006296	Fix incorrect alignment for NEON VST2b32_UPD. rdar://problem/9225433 llvm-svn: 128841	2011-04-04 20:35:31 +00:00
Jakob Stoklund Olesen	8296e30627	Disable the PowerPC/Atomics-64 test. The code inserted by PPCTargetLowering::EmitInstrWithCustomInserter for ppc64 is wrong, and I don't know how to fix it. It seems to be using the correct register classes for pointers, but it inserts all 32-bit instructions. llvm-svn: 128835	2011-04-04 17:57:26 +00:00
Bruno Cardoso Lopes	bda3632bcd	- Implement asm parsing support for LDRSBT, LDRHT, LDRSHT and STRHT also fix the encoding of the later. - Add a new encoding bit to describe the index mode used in AM3. - Teach printAddrMode3Operand to check by the addressing mode which index mode to print. - Testcases. llvm-svn: 128832	2011-04-04 17:18:19 +00:00
Jakob Stoklund Olesen	218661346a	Fix PowerPC tests to be register allocator independent. llvm-svn: 128827	2011-04-04 17:07:03 +00:00
Joerg Sonnenberger	fc4789da4a	Add support for the VIA PadLock instructions. llvm-svn: 128826	2011-04-04 16:58:13 +00:00
Jay Foad	11522097be	Remove some support for ReturnInsts with multiple operands, and for returning a scalar value in a function whose return type is a single- element structure or array. llvm-svn: 128810	2011-04-04 07:44:02 +00:00
Eli Friedman	17bf4922c9	PR9446: RecursivelyDeleteTriviallyDeadInstructions can delete the instruction after the given instruction; make sure to handle that case correctly. (It's difficult to trigger; the included testcase involves a dead block, but I don't think that's a requirement.) While I'm here, get rid of the unnecessary warning about SimplifyInstructionsInBlock, since it should work correctly as far as I know. llvm-svn: 128782	2011-04-02 22:45:17 +00:00
Che-Liang Chiou	e34b271718	ptx: support setp's 4-operand format llvm-svn: 128767	2011-04-02 08:51:39 +00:00
Cameron Zwarich	6fe5c29430	Do some peephole optimizations to remove pointless VMOVs from Neon to integer registers that arise from argument shuffling with the soft float ABI. These instructions are particularly slow on Cortex A8. This fixes one half of <rdar://problem/8674845>. llvm-svn: 128759	2011-04-02 02:40:43 +00:00
Johnny Chen	8904cc49db	Fixed a bug in disassembly of STR_POST, where the immediate is the second operand in am2offset; instead of the second operand in addrmode_imm12. rdar://problem/9225289 llvm-svn: 128757	2011-04-02 02:24:54 +00:00
Johnny Chen	387b36eaae	Fixed MOVr for "should be" encoding bits for Inst{19-16} = 0b0000. rdar://problem/9224276 llvm-svn: 128749	2011-04-01 23:30:25 +00:00
Johnny Chen	6615fa1de0	MOVs should have Inst{19-16} as 0b0000, otherwise, the instruction is UNPREDICTABLE. rdar://problem/9224120 llvm-svn: 128748	2011-04-01 23:15:50 +00:00
Johnny Chen	1e1010f56f	Fix the instruction table entries for AI1_adde_sube_s_irs multiclass definition so that all the instruction have: let Inst{31-27} = 0b1110; // non-predicated Before, the ARM decoder was confusing: > 0x40 0xf3 0xb8 0x80 as: Opcode=16 Name=ADCSSrs Format=ARM_FORMAT_DPSOREGFRM(5) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 0: 0: 0\| 0: 0: 0: 0\| 1: 0: 1: 1\| 1: 0: 0: 0\| 1: 1: 1: 1\| 0: 0: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| ------------------------------------------------------------------------------------------------- adcs pc, r8, r0, asr #6 since the cond field for ADCSSrs is a wild card, and so is ADCrs, with the ADCSSrs having Inst{20} as '1'. Now, the AR decoder behaves correctly: > 0x40 0xf3 0xb8 0x80 > END Executing command: /Volumes/data/lldb/llvm/Debug+Asserts/bin/llvm-mc -disassemble -triple=arm-apple-darwin -debug-only=arm-disassembler mc-input.txt Opcode=19 Name=ADCrs Format=ARM_FORMAT_DPSOREGFRM(5) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 0: 0: 0\| 0: 0: 0: 0\| 1: 0: 1: 1\| 1: 0: 0: 0\| 1: 1: 1: 1\| 0: 0: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| ------------------------------------------------------------------------------------------------- adcshi pc, r8, r0, asr #6 > rdar://problem/9223094 llvm-svn: 128746	2011-04-01 22:32:51 +00:00
Jim Grosbach	360c369967	LDRD/STRD instructions should print both Rt and Rt2 in the asm string. llvm-svn: 128736	2011-04-01 20:26:57 +00:00
Johnny Chen	3dfb80afbf	Fix a LDRT/LDRBT decoding bug where for Encoding A2, if Inst{4} != 0, we should reject the instruction as invalid. llvm-svn: 128734	2011-04-01 20:21:38 +00:00
Benjamin Kramer	d121765e64	InstCombine: Turn icmp + sext into bitwise/integer ops when the input has only one unknown bit. int test1(unsigned x) { return (x&8) ? 0 : -1; } int test3(unsigned x) { return (x&8) ? -1 : 0; } before (x86_64): _test1: andl $8, %edi cmpl $1, %edi sbbl %eax, %eax ret _test3: andl $8, %edi cmpl $1, %edi sbbl %eax, %eax notl %eax ret after: _test1: shrl $3, %edi andl $1, %edi leal -1(%rdi), %eax ret _test3: shll $28, %edi movl %edi, %eax sarl $31, %eax ret llvm-svn: 128732	2011-04-01 20:09:10 +00:00
Johnny Chen	fe6fba3fe6	Fix LDRi12 immediate operand, which was changed to be the second operand in $addrmode_imm12 => (ops GPR:$base, i32imm:$offsimm). rdar://problem/9219356 llvm-svn: 128722	2011-04-01 18:26:38 +00:00
Akira Hatanaka	93f898f643	Add code for analyzing FP branches. Clean up branch Analysis functions. llvm-svn: 128718	2011-04-01 17:39:08 +00:00
Evan Cheng	a6a992a662	Add test case. llvm-svn: 128707	2011-04-01 06:27:25 +00:00
Evan Cheng	0f86d6de50	FileCheck'ify test. llvm-svn: 128706	2011-04-01 03:36:33 +00:00
Jakob Stoklund Olesen	100f53fd25	Fix Thumb and Thumb2 tests to be register allocator independent. llvm-svn: 128690	2011-03-31 23:31:50 +00:00
Bruno Cardoso Lopes	ab8305063b	Apply again changes to support ARM memory asm parsing. I removed all LDR/STR changes and left them to a future patch. Passing all checks now. - Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and fix the encoding wherever is possible. - Add a new encoding bit to describe the index mode used and teach printAddrMode2Operand to check by the addressing mode which index mode to print. - Testcases llvm-svn: 128689	2011-03-31 23:26:08 +00:00
Jakob Stoklund Olesen	0709342652	Provide a legal pointer register class when targeting thumb1. The LocalStackSlotAllocation pass was creating illegal registers. llvm-svn: 128687	2011-03-31 23:02:15 +00:00
Jakob Stoklund Olesen	903baeac27	Fix SystemZ tests llvm-svn: 128686	2011-03-31 23:02:12 +00:00
Nadav Rotem	d74b72b8a9	Instcombile optimization: extractelement(cast) -> cast(extractelement) llvm-svn: 128683	2011-03-31 22:57:29 +00:00
Jakob Stoklund Olesen	0888bcf542	Fix ARM tests to be register allocator independent. llvm-svn: 128680	2011-03-31 22:14:03 +00:00
Benjamin Kramer	5291054ef1	InstCombine: APFloat can't perform arithmetic on PPC double doubles, don't even try. Thanks Eli! llvm-svn: 128676	2011-03-31 21:35:49 +00:00
Johnny Chen	9cd9c4e5c9	Add a test case for a malformed LDC/LDC2 instructions with PUDW = 0b0000, which amounts to an UNDEFINED instruction. llvm-svn: 128668	2011-03-31 20:54:30 +00:00
Evan Cheng	38bf5adcea	Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 llvm-svn: 128665	2011-03-31 19:38:48 +00:00
Johnny Chen	7b203f9cae	Fix single word and unsigned byte data transfer instruction encodings so that Inst{4} = 0. rdar://problem/9213022 llvm-svn: 128662	2011-03-31 19:28:35 +00:00
Jakob Stoklund Olesen	f4c9754d5c	Fix Mips, Sparc, and XCore tests that were dependent on register allocation. Add an extra run with -regalloc=basic to keep them honest. llvm-svn: 128654	2011-03-31 18:42:43 +00:00
Akira Hatanaka	a535270d91	Added support for FP conditional move instructions and fixed bugs in handling of FP comparisons. llvm-svn: 128650	2011-03-31 18:26:17 +00:00
Jakob Stoklund Olesen	e6e6750670	Don't completely eliminate identity copies that also modify super register liveness. Turn them into noop KILL instructions instead. This lets the scavenger know when super-registers are killed and defined. llvm-svn: 128645	2011-03-31 17:55:25 +00:00
Johnny Chen	13baa0e650	Add BLXi to the instruction table for disassembly purpose. A8.6.23 BLX (immediate) rdar://problem/9212921 llvm-svn: 128644	2011-03-31 17:53:50 +00:00
Jakob Stoklund Olesen	9a78835414	Mark all uses as <undef> when joining a copy. This way, shrinkToUses() will ignore the instruction that is about to be deleted, and we avoid leaving invalid live ranges that SplitKit doesn't like. Fix a misunderstanding in MachineVerifier about <def,undef> operands. The <undef> flag is valid on def operands where it has the same meaning as <undef> on a use operand. It only applies to sub-register defines which also read the full register. llvm-svn: 128642	2011-03-31 17:23:25 +00:00
Daniel Dunbar	22c5560bc8	Remove stray empty test file. llvm-svn: 128640	2011-03-31 17:01:56 +00:00
Bruno Cardoso Lopes	c2452a6f1d	Revert r128632 again, until I figure out what break the tests llvm-svn: 128635	2011-03-31 15:54:36 +00:00
Richard Osborne	9a827b30ab	Add XCore intrinsics for initializing / starting / synchronizing threads. llvm-svn: 128633	2011-03-31 15:13:13 +00:00
Bruno Cardoso Lopes	4c0aebfb91	Reapply r128585 without generating a lib depedency cycle. An updated log: - Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and {STR,LDC}{2}_{PRE,POST} fixing the encoding wherever is possible. - Move all instructions which use am2offset without a pattern to use addrmode2. - Add a new encoding bit to describe the index mode used and teach printAddrMode2Operand to check by the addressing mode which index mode to print. - Testcases llvm-svn: 128632	2011-03-31 14:52:28 +00:00
Benjamin Kramer	be209ab8a2	InstCombine: Fix transform to use the swapped predicate. Thanks Frits! llvm-svn: 128628	2011-03-31 10:46:03 +00:00
Benjamin Kramer	d159d94644	InstCombine: fold fcmp (fneg x), (fneg y) -> fcmp x, y llvm-svn: 128627	2011-03-31 10:12:22 +00:00
Benjamin Kramer	a8c5d0872d	InstCombine: fold fcmp pred (fneg x), C -> fcmp swap(pred) x, -C llvm-svn: 128626	2011-03-31 10:12:15 +00:00
Benjamin Kramer	cbb18e91a8	InstCombine: Shrink "fcmp (fpext x), C" to "fcmp x, C" if C can be losslessly converted to the type of x. Fixes PR9592. llvm-svn: 128625	2011-03-31 10:12:07 +00:00
Benjamin Kramer	2ccfbc8b71	InstCombine: fold fcmp (fpext x), (fpext y) -> fcmp x, y. llvm-svn: 128624	2011-03-31 10:11:58 +00:00
Duncan Sands	7c2b338a7e	Will not compile without the spec! llvm-svn: 128623	2011-03-31 10:03:32 +00:00
Bill Wendling	01cbbd8555	Testcase for r128619 (PR9571). llvm-svn: 128620	2011-03-31 08:13:57 +00:00
Jakob Stoklund Olesen	ae044c06bf	Pick a conservative register class when creating a small live range for remat. The rematerialized instruction may require a more constrained register class than the register being spilled. In the test case, the spilled register has been inflated to the DPR register class, but we are rematerializing a load of the ssub_0 sub-register which only exists for DPR_VFP2 registers. The register class is reinflated after spilling, so the conservative choice is only temporary. llvm-svn: 128610	2011-03-31 03:54:44 +00:00
Matt Beaumont-Gay	73906b05ca	Revert "- Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and" This revision introduced a dependency cycle, as nlewycky mentioned by email. llvm-svn: 128597	2011-03-31 00:39:16 +00:00
Evan Cheng	ee9d45dd55	Don't try to create zero-sized stack objects. llvm-svn: 128586	2011-03-30 23:44:13 +00:00
Bruno Cardoso Lopes	280264b889	- Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and {STR,LDC}{2}_PRE. - Fixed the encoding in some places. - Some of those instructions were using am2offset and now use addrmode2. Codegen isn't affected, instructions which use SelectAddrMode2Offset were not touched. - Teach printAddrMode2Operand to check by the addressing mode which index mode to print. - This is a work in progress, more work to come. The idea is to change places which use am2offset to use addrmode2 instead, as to unify assembly parser. - Add testcases for assembly parser llvm-svn: 128585	2011-03-30 23:32:32 +00:00
Cameron Zwarich	53dd03d537	Add a ARM-specific SD node for VBSL so that forms with a constant first operand can be recognized. This fixes <rdar://problem/9183078>. llvm-svn: 128584	2011-03-30 23:01:21 +00:00
Bill Wendling	5034159c5f	* The DSE code that tested for overlapping needed to take into account the fact that one of the numbers is signed while the other is unsigned. This could lead to a wrong result when the signed was promoted to an unsigned int. * Add the data layout line to the testcase so that it will test the appropriate thing. Patch by David Terei! llvm-svn: 128577	2011-03-30 21:37:19 +00:00
Benjamin Kramer	af0ed953c5	Avoid turning a floating point division with a constant power of two into a denormal multiplication. Some platforms may treat denormals as zero, on other platforms multiplication with a subnormal is slower than dividing by a normal. llvm-svn: 128555	2011-03-30 17:02:54 +00:00
Benjamin Kramer	8564e0de96	InstCombine: If the divisor of an fdiv has an exact inverse, turn it into an fmul. Fixes PR9587. llvm-svn: 128546	2011-03-30 15:42:35 +00:00
Johnny Chen	0ae2501fd2	Add a test case for thumb stc2 instruction. llvm-svn: 128517	2011-03-30 01:02:06 +00:00
Evan Cheng	18381b4257	Add intrinsics @llvm.arm.neon.vmulls and @llvm.arm.neon.vmullu.* back. Frontends was lowering them to sext / uxt + mul instructions. Unfortunately the optimization passes may hoist the extensions out of the loop and separate them. When that happens, the long multiplication instructions can be broken into several scalar instructions, causing significant performance issue. Note the vmla and vmls intrinsics are not added back. Frontend will codegen them as intrinsics vmull* + add / sub. Also note the isel optimizations for catching mul + sext / zext are not changed either. First part of rdar://8832507, rdar://9203134 llvm-svn: 128502	2011-03-29 23:06:19 +00:00
Benjamin Kramer	272f2b0044	InstCombine: Add a few missing combines for ANDs and ORs of sign bit tests. On x86 we now compile "if (a < 0 && b < 0)" into testl %edi, %esi js IF.THEN llvm-svn: 128496	2011-03-29 22:06:41 +00:00
Kevin Enderby	df4935cc90	Adding a test for "-inf" as well. llvm-svn: 128495	2011-03-29 21:54:10 +00:00
Johnny Chen	a0f0b5d9f0	Add a test case for MSRi. llvm-svn: 128494	2011-03-29 21:52:02 +00:00
Cameron Zwarich	143f9aea2b	Add Neon SINT_TO_FP and UINT_TO_FP lowering from v4i16 to v4f32. Fixes <rdar://problem/8875309> and <rdar://problem/9057191>. llvm-svn: 128492	2011-03-29 21:41:55 +00:00
Kevin Enderby	5bbe957155	Added support symbolic floating point constants in the MC assembler for Infinity and Nans with the same strings as GAS supports. rdar://8673024 llvm-svn: 128488	2011-03-29 21:11:52 +00:00
Johnny Chen	dcb29ae8ee	Add a thumb test file for printf (iOS 4.3). llvm-svn: 128487	2011-03-29 21:09:30 +00:00
Johnny Chen	4bc2baeb28	A8.6.188 STC, STC2 The STC_OPTION and STC2_OPTION instructions should have their coprocessor option enclosed in {}. rdar://problem/9200661 llvm-svn: 128478	2011-03-29 19:49:38 +00:00
Johnny Chen	7927569f05	Rename invalid-VLDMSDB-arm.txt to be invalid-VLDMSDB_UPD-arm.txt. llvm-svn: 128477	2011-03-29 19:10:06 +00:00
Johnny Chen	ec6f76ed38	Add and modify some tests. llvm-svn: 128476	2011-03-29 19:08:52 +00:00
Owen Anderson	d6c5a741b5	Get rid of the non-writeback versions VLDMDB and VSTMDB, which don't actually exist. llvm-svn: 128461	2011-03-29 16:45:53 +00:00
Cameron Zwarich	ff811cc475	Do some simple copy propagation through integer loads and stores when promoting vector types. This helps a lot with inlined functions when using the ARM soft float ABI. Fixes <rdar://problem/9184212>. llvm-svn: 128453	2011-03-29 05:19:52 +00:00
Rafael Espindola	6b2fac21ca	Reduce test case. llvm-svn: 128445	2011-03-29 02:18:54 +00:00
Evan Cheng	e2086e740f	Optimizing (zext A + zext B) * C, to (VMULL A, C) + (VMULL B, C) during isel lowering to fold the zero-extend's and take advantage of no-stall back to back vmul + vmla: vmull q0, d4, d6 vmlal q0, d5, d6 is faster than vaddl q0, d4, d5 vmovl q1, d6 vmul q0, q0, q1 This allows us to vmull + vmlal for: f = vmull_u8( vget_high_u8(s), c); f = vmlal_u8(f, vget_low_u8(s), c); rdar://9197392 llvm-svn: 128444	2011-03-29 01:56:09 +00:00
Bill Wendling	96f962fdff	In some cases, the "fail BB dominator" may be null after the BB was split (and becomes reachable when before it wasn't). Check to make sure that it's not null before trying to use it. llvm-svn: 128434	2011-03-28 23:02:18 +00:00
Daniel Dunbar	4ee0d03274	MC: Add support for disabling "temporary label" behavior. Useful for debugging on Darwin. llvm-svn: 128430	2011-03-28 22:49:15 +00:00
Johnny Chen	f9cd139369	Fix ARM disassembly for PLD/PLDW/PLI which suffers from code rot and add some test cases. Add comments to ThumbDisassemblerCore.h for recent change made for t2PLD disassembly. llvm-svn: 128417	2011-03-28 18:41:58 +00:00
Nick Lewycky	8544228d5a	Teach the transformation that moves binary operators around selects to preserve the subclass optional data. llvm-svn: 128388	2011-03-27 19:51:23 +00:00
Frits van Bommel	0bb2ad2cf7	Constant folding support for calls to umul.with.overflow(), basically identical to the smul.with.overflow() code. llvm-svn: 128379	2011-03-27 14:26:13 +00:00
Nick Lewycky	83167df787	Add a small missed optimization: turn X == C ? X : Y into X == C ? C : Y. This removes one use of X which helps it pass the many hasOneUse() checks. In my analysis, this turns up very often where X = A >>exact B and that can't be simplified unless X has one use (except by increasing the lifetime of A which is generally a performance loss). llvm-svn: 128373	2011-03-27 07:30:57 +00:00
Cameron Zwarich	d4174ee43e	Fix a typo and add a test. llvm-svn: 128331	2011-03-26 04:58:50 +00:00
Jakob Stoklund Olesen	9a624fa993	Collect and coalesce DBG_VALUE instructions before emitting the function. Correctly terminate the range of register DBG_VALUEs when the register is clobbered or when the basic block ends. The code is now ready to deal with variables that are sometimes in a register and sometimes on the stack. We just need to teach emitDebugLoc to say 'stack slot'. llvm-svn: 128327	2011-03-26 02:19:36 +00:00
Johnny Chen	923f3dac01	Fixed the t2PLD and friends disassembly and add two test cases. llvm-svn: 128322	2011-03-26 01:32:48 +00:00
Eric Christopher	d553096688	Fix the bfi handling for or (and a mask) (and b mask). We need the two masks to match inversely for the code as is to work. For the example given we actually want: bfi r0, r2, #1, #1 not #0, however, given the way the pattern is written it's not possible at the moment. Fixes rdar://9177502 llvm-svn: 128320	2011-03-26 01:21:03 +00:00
Bill Wendling	db40b5c899	PR9561: A store with a negative offset (via GEP) could erroniously say that it completely overlaps a previous store, thus mistakenly deleting that store. Check for this condition. llvm-svn: 128319	2011-03-26 01:20:37 +00:00
Johnny Chen	1572bf40b4	Add test for A8.6.246 UMULL to both arm-tests.txt amd thumb-tests.txt. llvm-svn: 128306	2011-03-25 23:02:58 +00:00
Johnny Chen	6e31bf1f6f	Add two test cases t2SMLABT and t2SMMULR for DisassembleThumb2Mul(). llvm-svn: 128305	2011-03-25 22:43:28 +00:00
Johnny Chen	49316e40ba	Fix DisassembleThumb2DPReg()'s handling of RegClass. Cannot hardcode GPRRegClassID. Also add some test cases. rdar://problem/9189829 llvm-svn: 128304	2011-03-25 22:19:07 +00:00
Johnny Chen	aaf2c69400	DisassembleThumb2LdSt() did not handle t2LDRs correctly with respect to RegClass. Add two test cases. rdar://problem/9182892 llvm-svn: 128299	2011-03-25 19:35:37 +00:00
Johnny Chen	4fd2194638	A8.6.226 TBB, TBH: Add two test cases. llvm-svn: 128295	2011-03-25 18:40:21 +00:00
Johnny Chen	b35548f44d	Modify DisassembleThumb2LdStEx() to be more robust/correct in light of recent change to t2LDREX/t2STREX instructions. Add two test cases. llvm-svn: 128293	2011-03-25 18:29:49 +00:00
Daniel Dunbar	6f4c9425eb	MC: Improve some diagnostics on uses of '.' pseudo-symbol. llvm-svn: 128289	2011-03-25 17:47:17 +00:00
Johnny Chen	aa84d41dfc	Instruction formats of SWP/SWPB were changed from LdStExFrm to MiscFrm. Modify the disassembler to handle that. rdar://problem/9184053 llvm-svn: 128285	2011-03-25 17:31:16 +00:00
Jakob Stoklund Olesen	1886a4c823	Emit less labels for debug info and stop emitting .loc directives for DBG_VALUEs. The .dot directives don't need labels, that is a leftover from when we created line number info manually. Instructions following a DBG_VALUE can share its label since the DBG_VALUE doesn't produce any code. llvm-svn: 128284	2011-03-25 17:20:59 +00:00
Johnny Chen	757ca69770	Also need to handle invalid imod values for CPS2p. rdar://problem/9186136 llvm-svn: 128283	2011-03-25 17:03:12 +00:00
Johnny Chen	a52143bff3	Modify the wrong logic in the assert of DisassembleThumb2LdStDual() (the register classes were changed), modify the comment to be up-to-date, and add a test case for A8.6.66 LDRD (immediate) Encoding T1. llvm-svn: 128252	2011-03-25 01:09:48 +00:00
Johnny Chen	72f4a95144	delegate the disassembly of t2ADR to the more generic t2ADDri12/t2SUBri12 instructions, and add a test case for that. llvm-svn: 128249	2011-03-25 00:17:42 +00:00
Johnny Chen	ceef55466a	The opcode names ("tLDM", "tLDM_UPD") used for conflict resolution have been stale since the change to ("tLDMIA", "tLDMIA_UPD"). Update the conflict resolution code and add test cases for that. llvm-svn: 128247	2011-03-24 23:42:31 +00:00
Johnny Chen	73193f2475	The ARM disassembler was confused with the 16-bit tSTMIA instruction. According to A8.6.189 STM/STMIA/STMEA (Encoding T1), there's only tSTMIA_UPD available. Ignore tSTMIA for the decoder emitter and add a test case for that. llvm-svn: 128246	2011-03-24 23:21:14 +00:00
Devang Patel	71536de752	Move test in x86 specific area. llvm-svn: 128245	2011-03-24 22:39:09 +00:00
Johnny Chen	9302df0ad9	Handle the added VBICivi NEON instructions, too. llvm-svn: 128243	2011-03-24 22:04:39 +00:00
Eric Christopher	3a213a50fe	Testcase for llvm-gcc commit r128230. llvm-svn: 128242	2011-03-24 21:59:03 +00:00
Johnny Chen	6469ca0c33	T2 Load/Store Multiple: These instructions were changed to not embed the addressing mode within the MC instructions We also need to update the corresponding assert stmt. Also add a test case. llvm-svn: 128240	2011-03-24 21:36:56 +00:00
Benjamin Kramer	dd9eb21c3f	Plug a leak in the arm disassembler and put the tests back. llvm-svn: 128238	2011-03-24 21:14:28 +00:00
Bruno Cardoso Lopes	f170f8bff6	Add asm parsing support w/ testcases for strex/ldrex family of instructions llvm-svn: 128236	2011-03-24 21:04:58 +00:00
Johnny Chen	471f5aa233	Remove these two test files as they cause llvm-i686-linux-vg_leak build to fail 'test-llvm'. These two are test cases which should result in 'invalid instruction encoding' from running llvm-mc -disassemble. llvm-svn: 128235	2011-03-24 20:56:23 +00:00
Johnny Chen	8bbc12824a	ADR was added with the wrong encoding for inst{24-21}, and the ARM decoder was fooled. Set the encoding bits to {0,?,?,0}, not 0. Plus delegate the disassembly of ADR to the more generic ADDri/SUBri instructions, and add a test case for that. llvm-svn: 128234	2011-03-24 20:42:48 +00:00
Devang Patel	e01b75cb89	Keep track of directory namd and fIx regression caused by Rafael's patch r119613. A better approach would be to move source id handling inside MC. llvm-svn: 128233	2011-03-24 20:30:50 +00:00
Johnny Chen	c5207f7167	The r118201 added support for VORR (immediate). Update ARMDisassemblerCore.cpp to disassemble the VORRivi instructions properly within the DisassembleN1RegModImmFrm() function. Add a test case. llvm-svn: 128226	2011-03-24 18:40:38 +00:00
Johnny Chen	1dd041083d	Add comments to the handling of opcode CPS3p to reject invalid instruction encoding, a test case of invalid CPS3p encoding and one for invalid VLDMSDB due to regs out of range. llvm-svn: 128220	2011-03-24 17:04:22 +00:00
NAKAMURA Takumi	521eb7c11e	Target/X86: [PR8777][PR8778] Tweak alloca/chkstk for Windows targets. FIXME: Some cleanups would be needed. llvm-svn: 128206	2011-03-24 07:07:00 +00:00
Cameron Zwarich	4649f17db1	Do early taildup of ret in CodeGenPrepare for potential tail calls that have a void return type. This fixes PR9487. llvm-svn: 128197	2011-03-24 04:52:10 +00:00
Johnny Chen	0f5d52d658	Load/Store Multiple: These instructions were changed to not embed the addressing mode within the MC instructions We also need to update the corresponding assert stmt. Also add two test cases. llvm-svn: 128191	2011-03-24 01:40:42 +00:00
Johnny Chen	1de8cc6f95	STRT and STRBT was incorrectly tagged as IndexModeNone during the refactorings (r119821). We now tag them as IndexModePost. llvm-svn: 128189	2011-03-24 01:07:26 +00:00
Johnny Chen	f949d8e13d	The r128103 fix to cope with the removal of addressing modes from the MC instructions were incomplete. The assert stmt needs to be updated and the operand index incrment is wrong. Fix the bad logic and add some sanity checking to detect bad instruction encoding; and add a test case. llvm-svn: 128186	2011-03-24 00:28:38 +00:00
Devang Patel	abc77347a7	Enable GlobalMerge on darwin. llvm-svn: 128183	2011-03-23 23:34:19 +00:00
Andrew Trick	4ab9a16569	Revert r128175. I'm backing this out for the second time. It was supposed to be fixed by r128164, but the mingw self-host must be defeating the fix. llvm-svn: 128181	2011-03-23 23:11:02 +00:00
Evan Cheng	425489d397	Cmp peephole optimization isn't always safe for signed arithmetics. int tries = INT_MAX; while (tries > 0) { tries--; } The check should be: subs r4, #1 cmp r4, #0 bgt LBB0_1 The subs can set the overflow V bit when r4 is INT_MAX+1 (which loop canonicalization apparently does in this case). cmp #0 would have cleared it while not changing the N and Z bits. Since BGT is dependent on the V bit, i.e. (N == V) && !Z, it is not safe to eliminate the cmp #0. rdar://9172742 llvm-svn: 128179	2011-03-23 22:52:04 +00:00
Eli Friedman	4c192305bf	PR9535: add support for splitting and scalarizing vector ISD::FP_ROUND. Also cleaning up some duplicated code while I'm here. llvm-svn: 128176	2011-03-23 22:18:48 +00:00
Andrew Trick	4046a0de91	Reapply Eli's r127852 now that the pre-RA scheduler can spill EFLAGS. (target-specific branchless method for double-width relational comparisons on x86) llvm-svn: 128175	2011-03-23 22:16:02 +00:00
Anders Carlsson	c4f0ab397c	Revert r128140 for now. llvm-svn: 128149	2011-03-23 15:51:12 +00:00
Cameron Zwarich	10ebc189ee	Fix PR9464 by correcting some math that just happened to be right in most cases that were hit in practice. llvm-svn: 128146	2011-03-23 05:25:55 +00:00
Anders Carlsson	9ed8d93f55	A global variable with internal linkage where all uses are in one function and whose address is never taken is a non-escaping local object and can't alias anything else. llvm-svn: 128140	2011-03-23 02:19:48 +00:00
Johnny Chen	122a6304ef	Add disassembly test cases for: A8.6.292 VCMPE llvm-svn: 128120	2011-03-22 23:08:56 +00:00
Devang Patel	6050de9689	Remove the test. llvm-svn: 128119	2011-03-22 23:07:03 +00:00
Jakob Stoklund Olesen	ec0ac3ca40	Reapply r128045 and r128051 with fixes. This will extend the ranges of debug info variables in registers until they are clobbered. Fix 1: Don't mistake DBG_VALUE instructions referring to incoming arguments on the stack with DBG_VALUE instructions referring to variables in the frame pointer. This fixes the gdb test-suite failure. Fix 2: Don't trace through copies to physical registers setting up call arguments. These registers are call clobbered, and the source register is more likely to be a callee-saved register that can be extended through the call instruction. llvm-svn: 128114	2011-03-22 22:33:08 +00:00
Johnny Chen	30350cdbdf	LDRT and LDRBT was incorrectly tagged as IndexModeNone during the refactorings (r119821). We now tag them as IndexModePost. This fixed http://llvm.org/bugs/show_bug.cgi?id=9530. llvm-svn: 128113	2011-03-22 22:28:49 +00:00
Devang Patel	bbc187c946	Try to appease buildbot gods. llvm-svn: 128112	2011-03-22 22:13:17 +00:00
Johnny Chen	0cf62f5045	Add one more test case for VFP Load/Store Multiple (vpop). llvm-svn: 128106	2011-03-22 20:21:08 +00:00
Johnny Chen	230268261b	A8.6.399 VSTM: VFP Load/Store Multiple Instructions used to embed the IA/DB addressing mode within the MC instruction; that has been changed so that now, for example, VSTMDDB_UPD and VSTMDIA_UPD are two instructions. Update the ARMDisassemblerCore.cpp's DisassembleVFPLdStMulFrm() to reflect the change. Also add a test case. llvm-svn: 128103	2011-03-22 20:00:10 +00:00
Andrew Trick	b0f98bb5e9	Revert r128045 and r128051, debug info enhancements. Temporarily reverting these to see if we can get llvm-objdump to link. Hopefully this is not the problem. llvm-svn: 128097	2011-03-22 19:18:42 +00:00
Che-Liang Chiou	7413080cea	ptx: add analyze/insert/remove branch llvm-svn: 128084	2011-03-22 14:12:00 +00:00
Jakob Stoklund Olesen	9c057ee440	Dont emit 'DBG_VALUE %noreg, ...' to terminate user variable ranges. These ranges get completely jumbled by the post-ra scheduler, and it is not really reasonable to expect it to make sense of them. Instead, teach DwarfDebug to notice when user variables in registers are clobbered, and terminate the ranges there. llvm-svn: 128045	2011-03-22 00:21:41 +00:00
Dan Gohman	c1783b31a4	Fix fast-isel address mode folding to avoid folding instructions outside of the current basic block. This fixes PR9500, rdar://9156159. llvm-svn: 128041	2011-03-22 00:04:35 +00:00
Devang Patel	dddce99f02	Try again to make this test darwin only. llvm-svn: 128036	2011-03-21 23:11:08 +00:00
Devang Patel	e351f20061	Force x86_64. llvm-svn: 128027	2011-03-21 21:37:52 +00:00
Devang Patel	d39242369a	Enable this test only for Darwin. llvm-svn: 128017	2011-03-21 20:32:56 +00:00
Rafael Espindola	1557fd6d39	Write the section table and the section data in the same order that gun as does. This makes it a lot easier to compare the output of both as the addresses are now a lot closer. llvm-svn: 127972	2011-03-20 18:44:20 +00:00
Anders Carlsson	ee6bc70d2f	Add an optimization to GlobalOpt that eliminates calls to __cxa_atexit, if the function passed is empty. llvm-svn: 127970	2011-03-20 17:59:11 +00:00
Daniel Dunbar	76c90c65e2	Disable test in a way that keeps lit happy. llvm-svn: 127962	2011-03-20 00:04:51 +00:00
Daniel Dunbar	327cd36f74	Revert r127953, "SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR", it broke a lot of things. llvm-svn: 127954	2011-03-19 21:47:14 +00:00
Evan Cheng	824a711305	SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR to have single return block (at least getting there) for optimizations. This is general goodness but it would prevent some tailcall optimizations. One specific case is code like this: int f1(void); int f2(void); int f3(void); int f4(void); int f5(void); int f6(void); int foo(int x) { switch(x) { case 1: return f1(); case 2: return f2(); case 3: return f3(); case 4: return f4(); case 5: return f5(); case 6: return f6(); } } => LBB0_2: ## %sw.bb callq _f1 popq %rbp ret LBB0_3: ## %sw.bb1 callq _f2 popq %rbp ret LBB0_4: ## %sw.bb3 callq _f3 popq %rbp ret This patch teaches codegenprep to duplicate returns when the return value is a phi and where the phi operands are produced by tail calls followed by an unconditional branch: sw.bb7: ; preds = %entry %call8 = tail call i32 @f5() nounwind br label %return sw.bb9: ; preds = %entry %call10 = tail call i32 @f6() nounwind br label %return return: %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ] ret i32 %retval.0 This allows codegen to generate better code like this: LBB0_2: ## %sw.bb jmp _f1 ## TAILCALL LBB0_3: ## %sw.bb1 jmp _f2 ## TAILCALL LBB0_4: ## %sw.bb3 jmp _f3 ## TAILCALL rdar://9147433 llvm-svn: 127953	2011-03-19 17:17:39 +00:00
Nadav Rotem	e7a101ccab	Add support for legalizing UINT_TO_FP of vectors on platforms which do not have native support for this operation (such as X86). The legalized code uses two vector INT_TO_FP operations and is faster than scalarizing. llvm-svn: 127951	2011-03-19 13:09:10 +00:00
Stuart Hastings	142f836d0e	Disable test to unbreak Linux. Radar 9156771. llvm-svn: 127945	2011-03-19 03:56:38 +00:00
Devang Patel	9cd6796104	Test case for r127940. llvm-svn: 127941	2011-03-19 01:40:43 +00:00
Johnny Chen	0c5f670fe7	Fixed an assert by the ARM disassembler for LDRD_PRE/POST. The relevant instruction table entries were changed sometime ago to no longer take <Rt2> as an operand. Modify ARMDisassemblerCore.cpp to accomodate the change and add a test case. llvm-svn: 127935	2011-03-19 01:16:20 +00:00
Andrew Trick	e7537a0187	FileCheckize a test. (one-by-one until valgrind is happy) llvm-svn: 127925	2011-03-19 00:41:39 +00:00
Owen Anderson	1d2f5cebe4	Add support to the ARM asm parser for the register-shifted-register forms of basic instructions like ADD. More work left to be done to support other instances of shifter ops in the ISA. llvm-svn: 127917	2011-03-18 22:50:18 +00:00
Evan Cheng	dc1d626a3d	Match a few more obvious patterns to revsh. rdar://9147637. llvm-svn: 127913	2011-03-18 21:52:42 +00:00
Eli Friedman	59721e3238	Revert r127852; it's apparently causing an ICE on mingw. llvm-svn: 127909	2011-03-18 21:12:29 +00:00
Justin Holewinski	0984dcc077	PTX: Fix various codegen issues - Emit mad instead of mad.rn for shader model 1.0 - Emit explicit mov.u32 instructions for reading global variables - (most PTX instructions cannot take global variable immediates) llvm-svn: 127895	2011-03-18 19:24:28 +00:00
Andrew Trick	1c4b42d00f	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Joerg Sonnenberger	3fbfcc0e1e	Support explicit argument forms for the X86 string instructions. For now, only the default segments are supported. llvm-svn: 127875	2011-03-18 11:59:40 +00:00
Che-Liang Chiou	b1df0fe1cc	ptx: fix parameter order that is reversed llvm-svn: 127874	2011-03-18 11:23:56 +00:00
Che-Liang Chiou	ff9d938e33	ptx: add unconditional and conditional branch llvm-svn: 127873	2011-03-18 11:08:52 +00:00
Eli Friedman	1a916a3c0c	Add a target-specific branchless method for double-width relational comparisons on x86. Essentially, the way this works is that SUB+SBB sets the relevant flags the same way a double-width CMP would. This is a substantial improvement over the generic lowering in LLVM. The output is also shorter than the gcc-generated output; I haven't done any detailed benchmarking, though. llvm-svn: 127852	2011-03-18 02:34:11 +00:00
Eli Friedman	c17c9a78aa	FileCheck-ize and update test. llvm-svn: 127845	2011-03-18 01:10:31 +00:00
Johnny Chen	e387f8a5e9	The disassembler for Thumb was wrongly adding 4 to the computed imm32 offset. Remove the offending logic and update the test cases. llvm-svn: 127843	2011-03-18 00:38:03 +00:00
Devang Patel	aad34d882d	Try to not lose variable's debug info during instcombine. This is done by lowering dbg.declare intrinsic into dbg.value intrinsic. Radar 9143931. llvm-svn: 127834	2011-03-17 22:18:16 +00:00
Johnny Chen	221a014ea3	It used to be that t_addrmode_s4 was used for both: o A8.6.195 STR (register) -- Encoding T1 o A8.6.193 STR (immediate, Thumb) -- Encoding T1 It has been changed so that now they use different addressing modes and thus different MC representation (Operand Infos). Modify the disassembler to reflect the change, and add relevant tests. llvm-svn: 127833	2011-03-17 22:04:05 +00:00
Benjamin Kramer	cfcea12fe2	BuildUDIV: If the divisor is even we can simplify the fixup of the multiplied value by introducing an early shift. This allows us to compile "unsigned foo(unsigned x) { return x/28; }" into shrl $2, %edi imulq $613566757, %rdi, %rax shrq $32, %rax ret instead of movl %edi, %eax imulq $613566757, %rax, %rcx shrq $32, %rcx subl %ecx, %eax shrl %eax addl %ecx, %eax shrl $4, %eax on x86_64 llvm-svn: 127829	2011-03-17 20:39:14 +00:00
Stuart Hastings	ec54bd755f	Reapply: Add type output to llvm-dis annotations. Patch by Yuri! llvm-svn: 127824	2011-03-17 19:50:04 +00:00
Richard Osborne	6120962d7d	Add XCore intrinsic for setpsc. llvm-svn: 127821	2011-03-17 18:42:05 +00:00
Daniel Dunbar	f1d62cfc8f	MC/Mach-O: Fix regression introduced in r126127, this assignment shouldn't have been removed. llvm-svn: 127812	2011-03-17 16:25:24 +00:00
NAKAMURA Takumi	bf9ff6f63b	test/CodeGen/X86/h-registers-1.ll: Add explicit -mtriple=x86_64-linux. It does not need to be checked on x86_64-win32 (aka Win64). llvm-svn: 127800	2011-03-17 04:24:40 +00:00
Joerg Sonnenberger	07de07eeea	Fix handling of @IDNTPOFF relocations, they need to get STT_TLS. While here, add VK_ARM_TPOFF and VK_ARM_GOTTPOFF, too. llvm-svn: 127780	2011-03-17 00:35:10 +00:00
NAKAMURA Takumi	5b6198dfb9	test/CodeGen/X86/constant-pool-remat-0.ll: FileCheck-ize and add explicit -mtriple=x86_64-linux. llvm-svn: 127775	2011-03-16 23:01:31 +00:00
Cameron Zwarich	ac106273d4	The x86-64 ABI says that a bool is only guaranteed to be sign-extended to a byte rather than an int. Thankfully, this only causes LLVM to miss optimizations, not generate incorrect code. This just fixes the zext at the return. We still insert an i32 ZextAssert when reading a function's arguments, but it is followed by a truncate and another i8 ZextAssert so it is not optimized. llvm-svn: 127766	2011-03-16 22:20:18 +00:00
Cameron Zwarich	40a9200357	Rename a test to be more inclusive. llvm-svn: 127765	2011-03-16 22:20:12 +00:00
Daniel Dunbar	fd95b016fb	Revert r127757, "Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break.", which broke Clang self-host of a Debug+Asserts compiler, on OS X. llvm-svn: 127763	2011-03-16 22:16:39 +00:00
Richard Osborne	c871eff3f5	Add XCore intrinsics for setclk, setrdy. llvm-svn: 127761	2011-03-16 21:56:00 +00:00
Renato Golin	a3aeafeb35	Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break. llvm-svn: 127757	2011-03-16 21:05:52 +00:00
Cameron Zwarich	49e354bcb6	Add a test for i1 zeroext arguments on x86-64. We currently generate code that conforms to the ABI, but DAGCombine could in theory recognize the sequence of zext asserts and truncates and generate incorrect code. llvm-svn: 127754	2011-03-16 20:15:44 +00:00
Richard Osborne	d4346f2388	Add checkevent intrinsic to check if any resources owned by the current thread can event. llvm-svn: 127741	2011-03-16 18:34:00 +00:00
NAKAMURA Takumi	d60e4101e6	test/CodeGen/X86: FileCheck-ize and add actions for x86_64-linux and x86_64-win32. llvm-svn: 127734	2011-03-16 13:53:07 +00:00
NAKAMURA Takumi	0b9e2b0257	test/CodeGen/X86: Add a pattern for Win64. llvm-svn: 127733	2011-03-16 13:52:51 +00:00
NAKAMURA Takumi	c10801e8a5	test/CodeGen/X86: FileCheck-ize and add explicit -mtriple=x86_64-linux. They are useless to Win64 target. llvm-svn: 127732	2011-03-16 13:52:38 +00:00
NAKAMURA Takumi	662892df27	test/CodeGen/X86/byval*.ll: Win64 has not supported byval yet. llvm-svn: 127731	2011-03-16 13:52:20 +00:00
NAKAMURA Takumi	406f02c9ea	test/CodeGen/X86/dyn-stackalloc.ll: FileCheck-ize. llvm-svn: 127730	2011-03-16 13:52:08 +00:00
Cameron Zwarich	0454253d7a	Only convert allocas to scalars if it is profitable. The profitability metric I chose is having a non-memcpy/memset use and being larger than any native integer type. Originally I chose having an access of a size smaller than the total size of the alloca, but this caused some minor issues on the spirit benchmark where SRoA runs again after some inlining. This fixes <rdar://problem/8613163>. llvm-svn: 127718	2011-03-16 00:13:44 +00:00
Cameron Zwarich	7b0f3c6a1a	Add native integer type TargetData to some existing tests. llvm-svn: 127717	2011-03-16 00:13:40 +00:00
Johnny Chen	a4c3154fca	There were two issues fixed: 1. The ARM Darwin *r9 call instructions were pseudo-ized recently. Modify the ARMDisassemblerCore.cpp file to accomodate the change. 2. The disassembler was unnecessarily adding 8 to the sign-extended imm24: imm32 = SignExtend(imm24:'00', 32); // A8.6.23 BL, BLX (immediate) // Encoding A1 It has no business doing such. Removed the offending logic. Add test cases to arm-tests.txt. llvm-svn: 127707	2011-03-15 22:27:33 +00:00
Bill Wendling	ebecb33307	Some minor cleanups based on feedback. llvm-svn: 127694	2011-03-15 20:47:26 +00:00
Evan Cheng	42401d6af2	Do not form thumb2 ldrd / strd if the offset is by multiple of 4. rdar://9133587 llvm-svn: 127683	2011-03-15 18:41:52 +00:00
Richard Osborne	5f1a26ea39	On the XCore the scavenging slot should be closest to the SP. llvm-svn: 127680	2011-03-15 15:10:11 +00:00
Richard Osborne	3a68eb150b	Add XCore intrinsics for getps, setps, setsr and clrsr. llvm-svn: 127678	2011-03-15 13:45:47 +00:00
Justin Holewinski	94751fbf32	PTX: Set PTX 2.0 as the minimum supported version - Remove PTX 1.4 code generation - Change type of intrinsics to .v4.i32 instead of .v4.i16 - Add and/or/xor integer instructions llvm-svn: 127677	2011-03-15 13:24:15 +00:00
Cameron Zwarich	0b8cdfb6ec	Do not add PHIs with no users when creating LCSSA form. Patch by Andrew Clinton. llvm-svn: 127674	2011-03-15 07:41:25 +00:00
Evan Cheng	e4b8ac9fef	Add a peephole optimization to optimize pairs of bitcasts. e.g. v2 = bitcast v1 ... v3 = bitcast v2 ... = v3 => v2 = bitcast v1 ... = v1 if v1 and v3 are of in the same register class. bitcast between i32 and fp (and others) are often not nops since they are in different register classes. These bitcast instructions are often left because they are in different basic blocks and cannot be eliminated by dag combine. rdar://9104514 llvm-svn: 127668	2011-03-15 05:13:13 +00:00
Eli Friedman	c4414c6e92	PR9450: Make switch optimization in SimplifyCFG not dependent on the ordering of pointers in an std::map. llvm-svn: 127650	2011-03-15 02:23:35 +00:00
Evan Cheng	c5c2cfa381	sext(undef) = 0, because the top bits will all be the same. zext(undef) = 0, because the top bits will be zero. llvm-svn: 127649	2011-03-15 02:22:10 +00:00
Bill Wendling	928de16793	Testcase for r127630. llvm-svn: 127648	2011-03-15 01:49:08 +00:00
Sean Callanan	f2f4837de3	Basic sanity checks to ensure that 2- and 3-byte VEX prefixes are working for triadic AVX instructions. This concludes the patch set to enable AVX support for the X86 disassebler. llvm-svn: 127647	2011-03-15 01:32:46 +00:00
Johnny Chen	7a2873dfbe	Fixed an ARM disassembler bug where it does not handle STRi12 correctly because an extra register operand was erroneously added. Remove an incorrect assert which triggers the bug. rdar://problem/9131529 llvm-svn: 127642	2011-03-15 01:13:17 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Jim Grosbach	3af6fe66b9	Clean up ARM tail calls a bit. They're pseudo-instructions for normal branches. Also more cleanly separate the ARM vs. Thumb functionality. Previously, the encoding would be incorrect for some Thumb instructions (the indirect calls). llvm-svn: 127637	2011-03-15 00:30:40 +00:00
Eric Christopher	2139d3148f	If we don't know how long a string is we can't fold an _chk version to the normal version. Fixes rdar://9123638 llvm-svn: 127636	2011-03-15 00:25:41 +00:00
Bill Wendling	e1fd78f2bc	Generate a VTBL instruction instead of a series of loads and stores when we can. As Nate pointed out, VTBL isn't super performant, but it has to be better than this: _shuf: @ BB#0: @ %entry push {r4, r7, lr} add r7, sp, #4 sub sp, #12 mov r4, sp bic r4, r4, #7 mov sp, r4 mov r2, sp vmov d16, r0, r1 orr r0, r2, #6 orr r3, r2, #7 vst1.8 {d16[0]}, [r3] vst1.8 {d16[5]}, [r0] subs r4, r7, #4 orr r0, r2, #5 vst1.8 {d16[4]}, [r0] orr r0, r2, #4 vst1.8 {d16[4]}, [r0] orr r0, r2, #3 vst1.8 {d16[0]}, [r0] orr r0, r2, #2 vst1.8 {d16[2]}, [r0] orr r0, r2, #1 vst1.8 {d16[1]}, [r0] vst1.8 {d16[3]}, [r2] vldr.64 d16, [sp] vmov r0, r1, d16 mov sp, r4 pop {r4, r7, pc} The "illegal" testcase in vext.ll is no longer illegal. <rdar://problem/9078775> llvm-svn: 127630	2011-03-14 23:02:38 +00:00
Eric Christopher	d3cc9fdd8e	Fix this test up a bit. llvm-svn: 127621	2011-03-14 21:05:21 +00:00
Evan Cheng	d2f3b01797	Minor optimization. sign-ext/anyext of undef is still undef. llvm-svn: 127598	2011-03-14 18:15:55 +00:00
Justin Holewinski	fbc8d301bf	PTX: Emit global arrays with proper sizes - Emit all arrays as type .b8 and proper sizes in bytes to conform to the output of nvcc llvm-svn: 127584	2011-03-14 15:40:11 +00:00
Justin Holewinski	8509380f83	PTX: Add support for sqrt/sin/cos intrinsics llvm-svn: 127578	2011-03-14 14:09:33 +00:00
Che-Liang Chiou	a19f075974	ptx: add set.p instruction and related changes to predicate execution llvm-svn: 127577	2011-03-14 11:26:01 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Eric Christopher	c313d94068	Saving files before committing is overrated. Add a RUN line to this test. llvm-svn: 127520	2011-03-12 01:36:23 +00:00
Eric Christopher	174d872702	Sometimes isPredicable lies to us and tells us we don't need the operands. Go ahead and add them on when we might want to use them and let later passes remove them. Fixes rdar://9118569 llvm-svn: 127518	2011-03-12 01:09:29 +00:00
Jim Grosbach	6d371ce37e	Properly pseudo-ize the ARM LDMIA_RET instruction. This has the nice side- effect that we get proper instruction printing using the "pop" mnemonic for it. llvm-svn: 127502	2011-03-11 22:51:41 +00:00
Cameron Zwarich	338d362200	Roll r127459 back in: Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127498	2011-03-11 21:52:04 +00:00
Cameron Zwarich	4d7d728594	Fix the GCC test suite issue exposed by r127477, which was caused by stack protector insertion not working correctly with unreachable code. Since that revision was rolled out, this test doesn't actual fail before this fix. llvm-svn: 127497	2011-03-11 21:51:56 +00:00
Daniel Dunbar	94ccb27b43	Revert r127459, "Optimize trivial branches in CodeGenPrepare, which often get created from the", it broke some GCC test suite tests. llvm-svn: 127477	2011-03-11 19:30:30 +00:00
Benjamin Kramer	391a946fa9	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Benjamin Kramer	51897bcd3e	InstCombine: Fix a thinko where transform an icmp under the assumption that it's a zero comparison when it's not. Fixes PR9454. llvm-svn: 127464	2011-03-11 11:37:40 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Cameron Zwarich	cc27b3acc4	Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127459	2011-03-11 04:54:27 +00:00
Eric Christopher	cf56a5034f	Change the x86 32-bit scheduler to register pressure and fix up the corresponding testcases back to the previous versions. Fixes some performance regressions only seen on 32-bit. llvm-svn: 127441	2011-03-11 01:05:58 +00:00
Evan Cheng	adb9c03e41	Avoid replacing the value of a directly stored load with the stored value if the load is indexed. rdar://9117613. llvm-svn: 127440	2011-03-11 00:48:56 +00:00
Jim Grosbach	62a7b473af	Properly pseudo-ize MOVCCr and MOVCCs. llvm-svn: 127434	2011-03-10 23:56:09 +00:00
Dan Gohman	154ed49784	Fix reassociate to postpone certain instruction deletions until after it has finished all of its reassociations, because its habit of unlinking operands and holding them in a datastructure while working means that it's not easy to determine when an instruction is really dead until after all its regular work is done. rdar://9096268. llvm-svn: 127424	2011-03-10 19:51:54 +00:00
Benjamin Kramer	b49b964b98	InstCombine: Turn umul_with_overflow into mul nuw if we can prove that it cannot overflow. This happens a lot in clang-compiled C++ code because it adds overflow checks to operator new[]: unsigned foo(unsigned n) { return new unsigned[n]; } We can optimize away the overflow check on 64 bit targets because (uint64_t)n4 cannot overflow. llvm-svn: 127418	2011-03-10 18:40:14 +00:00
Justin Holewinski	72ff7e4fa9	PTX: Add preliminary support for floating-point divide and multiply-and-add llvm-svn: 127410	2011-03-10 16:57:18 +00:00

... 3 4 5 6 7 ...

12884 Commits