llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	6699a60b0e	Test commit. llvm-svn: 155626	2012-04-26 08:24:07 +00:00
Evan Cheng	9f7ad310b5	If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume the feature set of v7a. This comes about if the user specifies something like -arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as uxtab in this case. rdar://11318438 llvm-svn: 155601	2012-04-26 01:13:36 +00:00
Richard Barton	ba5b0cc82e	Unify internal representation of ARM instructions with a register right-shifted by #32 . These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation. llvm-svn: 155565	2012-04-25 18:00:18 +00:00
Craig Topper	3ec7c2aa84	Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning. llvm-svn: 155538	2012-04-25 06:56:34 +00:00
Jim Grosbach	5117ef7453	ARM: improved assembler diagnostics for missing CPU features. When an instruction match is found, but the subtarget features it requires are not available (missing floating point unit, or thumb vs arm mode, for example), issue a diagnostic that identifies what the feature mismatch is. rdar://11257547 llvm-svn: 155499	2012-04-24 22:40:08 +00:00
Jim Grosbach	1e75fc1fe1	ARM: Nuke remnant bogus code. r154362 was supposed to delete this bit, but obviously didn't. rdar://11305594 llvm-svn: 155465	2012-04-24 18:39:47 +00:00
Richard Barton	e9600009e9	Refactor Thumb ITState handling in ARM Disassembler to more efficiently use its vector llvm-svn: 155439	2012-04-24 11:13:20 +00:00
Jim Grosbach	671ad2a572	Tidy up. 80 columns, whitespace, et al. llvm-svn: 155399	2012-04-23 22:04:10 +00:00
Preston Gurd	9a0914753a	This patch fixes a problem which arose when using the Post-RA scheduler on X86 Atom. Some of our tests failed because the tail merging part of the BranchFolding pass was creating new basic blocks which did not contain live-in information. When the anti-dependency code in the Post-RA scheduler ran, it would sometimes rename the register containing the function return value because the fact that the return value was live-in to the subsequent block had been lost. To fix this, it is necessary to run the RegisterScavenging code in the BranchFolding pass. This patch makes sure that the register scavenging code is invoked in the X86 subtarget only when post-RA scheduling is being done. Post RA scheduling in the X86 subtarget is only done for Atom. This patch adds a new function to the TargetRegisterClass to control whether or not live-ins should be preserved during branch folding. This is necessary in order for the anti-dependency optimizations done during the PostRASchedulerList pass to work properly when doing Post-RA scheduling for the X86 in general and for the Intel Atom in particular. The patch adds and invokes the new function trackLivenessAfterRegAlloc() instead of using the existing requiresRegisterScavenging(). It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of requiresRegisterScavenging(). It changes all the targets that implemented requiresRegisterScavenging() to also implement trackLivenessAfterRegAlloc(). It adds an assertion in the Post RA scheduler to make sure that post RA liveness information is available when it is needed. It changes the X86 break-anti-dependencies test to use -mcpu=atom, in order to avoid running into the added assertion. Finally, this patch restores the use of anti-dependency checking (which was turned off temporarily for the 3.1 release) for Intel Atom in the Post RA scheduler. Patch by Andy Zhang! Thanks to Jakob and Anton for their reviews. llvm-svn: 155395	2012-04-23 21:39:35 +00:00
Jim Grosbach	41e94d79be	ARM: VSLI two-operand assembly aliases are tblgen'erated. llvm-svn: 155393	2012-04-23 21:22:04 +00:00
Jim Grosbach	3dada484c3	ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. llvm-svn: 155392	2012-04-23 21:00:49 +00:00
Jim Grosbach	e5012fbad3	ARM: vqdmulh two-operand aliases are tblgen'erated now. llvm-svn: 155387	2012-04-23 20:37:20 +00:00
Benjamin Kramer	8877d68db7	ARM: Initialize the HasRAS bit. Found by valgrind. llvm-svn: 155313	2012-04-22 11:52:41 +00:00
Jim Grosbach	c931d451cd	ARM: tblgen'erate more NEON two-operand aliases. VMUL and VEXT. llvm-svn: 155258	2012-04-20 23:46:33 +00:00
Jim Grosbach	b4e849b924	ARM: tblgen'erate more NEON two-operand aliases. llvm-svn: 155254	2012-04-20 23:30:14 +00:00
Jim Grosbach	2937df45a8	ARM: Update NEON assembly two-operand aliases. Use the new TwoOperandAliasConstraint to handle lots of the two-operand aliases for NEON instructions. There's still more to go, but this is a good chunk of them. llvm-svn: 155210	2012-04-20 18:12:54 +00:00
Craig Topper	c7242e054d	Convert more uses of XXXRegisterClass to &XXXRegClass. No functional change since they are equivalent. llvm-svn: 155188	2012-04-20 07:30:17 +00:00
Jim Grosbach	9cc324d31a	ARM some VFP tblgen'erated two-operand aliases. llvm-svn: 155178	2012-04-20 00:15:00 +00:00
Jim Grosbach	6b46134862	ARM let TableGen handle a few two-operand aliases. No need for these explicit aliases anymore. Nuke 'em. llvm-svn: 155173	2012-04-19 23:59:26 +00:00
Silviu Baranga	ca45af9a75	Added support for disassembling unpredictable swp/swpb ARM instructions. llvm-svn: 155004	2012-04-18 14:18:57 +00:00
Silviu Baranga	d5c6a63a50	Fix the behavior of the disassembler when decoding unpredictable mrs instructions on ARM. Now the disassembler emits warnings instead of errors. llvm-svn: 155002	2012-04-18 14:09:07 +00:00
Silviu Baranga	41f1fcd80e	Added support for unpredictable mcrr/mcrr2/mrrc/mrrc2 ARM instruction in the disassembler. Since the unpredictability conditions are complex, C++ code was added to handle them. llvm-svn: 155001	2012-04-18 13:12:50 +00:00
Silviu Baranga	a2944116dc	Fixed decoding for the ARM cdp2 instruction. The restriction on the coprocessor number was removed for this instruction. llvm-svn: 155000	2012-04-18 13:02:55 +00:00
Silviu Baranga	9da1918c84	Add support for unpredictable cases of the cmp, tst, teq and cmnz ARM instructions in the disassembler. llvm-svn: 154999	2012-04-18 12:48:43 +00:00
Chad Rosier	41675546eb	Typo. llvm-svn: 154953	2012-04-17 21:48:36 +00:00
Jay Foad	08a0598cd4	Remove unused CCIfSubtarget. llvm-svn: 154921	2012-04-17 11:29:05 +00:00
James Molloy	a9bcf20d22	Fix bad EXTRACT_SUBREG in instruction selection for extending-loads on NEON. llvm-svn: 154915	2012-04-17 08:18:00 +00:00
Kevin Enderby	29ae538647	Fix ARM disassembly of VLD2 (single 2-element structure to all lanes) instructions with writebacks. And add a test case for all opcodes handled by DecodeVLD2DupInstruction() in ARMDisassembler.cpp. llvm-svn: 154884	2012-04-17 00:49:27 +00:00
Jim Grosbach	2bf5f73977	ARM two-operand forms for vhadd and vhsub instructions. rdar://11252521 llvm-svn: 154875	2012-04-16 23:00:25 +00:00
Jim Grosbach	003607f474	ARM handle :lower16: and :upper16: after a '#' prefix. rdar://11252521 llvm-svn: 154862	2012-04-16 21:18:46 +00:00
Jim Grosbach	6068d0014a	ARM assembly two-operand forms for VRSHL. rdar://11252521 llvm-svn: 154840	2012-04-16 18:03:16 +00:00
Jim Grosbach	cd1c000a9f	ARM two-operand aliases for VRHADD instructions. rdar://11252521 llvm-svn: 154832	2012-04-16 17:14:11 +00:00
Benjamin Kramer	673824b4a1	Wire up support for diagnostic ranges in the ARMAsmParser. As an example, attach range info to the "invalid instruction" message: $ clang -arch arm -c asm.c asm.c:2:11: error: invalid instruction __asm__("foo r0"); ^ <inline asm>:1:2: note: instantiated into assembly here foo r0 ^~~ llvm-svn: 154765	2012-04-15 17:04:27 +00:00
Evan Cheng	267a4ada52	On Darwin targets, only use vfma etc. if the source uses the fma() intrinsic explicitly. llvm-svn: 154689	2012-04-13 18:59:28 +00:00
Kevin Enderby	c407cc7a40	For ARM disassembly only print 32 unsigned bits for the address of branch targets so if the branch target has the high bit set it does not get printed as: beq 0xffffffff8008c404 llvm-svn: 154685	2012-04-13 18:46:37 +00:00
Kevin Enderby	40d4e47003	Fix a few more places in the ARM disassembler so that branches get symbolic operands added when using the C disassembler API. llvm-svn: 154628	2012-04-12 23:13:34 +00:00
Jim Grosbach	4324f426ce	ARM 'adr' fixups don't need the interworking addend tweaking. They reference the PC directly, so things work properly that way. rdar://11231229 llvm-svn: 154576	2012-04-12 01:19:35 +00:00
Kevin Enderby	72f18bbcff	Fixed a case of ARM disassembly getting an assert on a bad encoding of a VST instruction. llvm-svn: 154544	2012-04-11 22:40:17 +00:00
Jim Grosbach	6e536de1a1	ARM 'vuzp.32 Dd, Dm' is a pseudo-instruction. While there is an encoding for it in VUZP, the result of that is undefined, so we should avoid it. Define the instruction as a pseudo for VTRN.32 instead, as the ARM ARM indicates. rdar://11222366 llvm-svn: 154511	2012-04-11 17:40:18 +00:00
Jim Grosbach	4640c8169f	ARM 'vzip.32 Dd, Dm' is a pseudo-instruction. While there is an encoding for it in VZIP, the result of that is undefined, so we should avoid it. Define the instruction as a pseudo for VTRN.32 instead, as the ARM ARM indicates. rdar://11221911 llvm-svn: 154505	2012-04-11 16:53:25 +00:00
Evan Cheng	5efc442290	Add more fused mul+add/sub patterns. rdar://10139676 llvm-svn: 154484	2012-04-11 06:59:47 +00:00
Evan Cheng	48346c1cd9	Clean up ARM fused multiply + add/sub support some more: rename some isel predicates. Also remove NEON2 since it's not really useful and it is confusing. If NEON + VFP4 implies NEON2 but NEON2 doesn't imply NEON + VFP4, what does it really mean? rdar://10139676 llvm-svn: 154480	2012-04-11 05:33:07 +00:00
Evan Cheng	67a09fc397	Match (fneg (fma) to vfnma. rdar://10139676 llvm-svn: 154469	2012-04-11 01:21:25 +00:00
Kevin Enderby	d2980cd041	Fix ARM disassembly of VLD instructions with writebacks. And add a test case for all opcodes handled by DecodeVLDInstruction() in ARMDisassembler.cpp. llvm-svn: 154459	2012-04-11 00:25:40 +00:00
Jim Grosbach	ad66de155b	ARM add missing Thumb1 two-operand aliases for shift-by-immediate. rdar://11222742 llvm-svn: 154457	2012-04-11 00:15:16 +00:00
Evan Cheng	aca6c822e6	Fix a number of problems with ARM fused multiply add/subtract instructions. 1. The new instruction itinerary entries are not properly described. 2. The asm parser can't handle vfms and vfnms. 3. There were no assembler, disassembler test cases. 4. HasNEON2 has the wrong assembler predicate. rdar://10139676 llvm-svn: 154456	2012-04-11 00:13:00 +00:00
Evan Cheng	d0007f3c83	Handle llvm.fma.* intrinsics. rdar://10914096 llvm-svn: 154439	2012-04-10 21:40:28 +00:00
Jim Grosbach	df5a244797	ARM fix cc_out operand handling for t2SUBrr instructions. We were incorrectly conflating some add variants which don't have a cc_out operand with the mirroring sub encodings, which do. Part of the awesome non-orthogonality legacy of thumb1. Similarly, handling of add/sub of an immediate was sometimes incorrectly removing the cc_out operand for add/sub register variants. rdar://11216577 llvm-svn: 154411	2012-04-10 17:31:55 +00:00
Evan Cheng	f8bad08001	Fix a long standing tail call optimization bug. When a libcall is emitted legalizer always use the DAG entry node. This is wrong when the libcall is emitted as a tail call since it effectively folds the return node. If the return node's input chain is not the entry (i.e. call, load, or store) use that as the tail call input chain. PR12419 rdar://9770785 rdar://11195178 llvm-svn: 154370	2012-04-10 01:51:00 +00:00
Jim Grosbach	8f99bc3aed	ARM LDR/LDRT has the same encoding collision as STR/STRT. Generalized logic of r154141. llvm-svn: 154362	2012-04-10 00:13:07 +00:00
Chad Rosier	e0e38f61a5	When performing a truncating store, it's possible to rearrange the data in-register, such that we can use a single vector store rather than a series of scalar stores. For func_4_8 the generated code vldr d16, LCPI0_0 vmov d17, r0, r1 vadd.i16 d16, d17, d16 vmov.u16 r0, d16[3] strb r0, [r2, #3] vmov.u16 r0, d16[2] strb r0, [r2, #2] vmov.u16 r0, d16[1] strb r0, [r2, #1] vmov.u16 r0, d16[0] strb r0, [r2] bx lr becomes vldr d16, LCPI0_0 vmov d17, r0, r1 vadd.i16 d16, d17, d16 vuzp.8 d16, d17 vst1.32 {d16[0]}, [r2, :32] bx lr I'm not fond of how this combine pessimizes 2012-03-13-DAGCombineBug.ll, but I couldn't think of a way to judiciously apply this combine. This ldrh r0, [r0, #4] strh r0, [r1] becomes vldr d16, [r0] vmov.u16 r0, d16[2] vmov.32 d16[0], r0 vuzp.16 d16, d17 vst1.32 {d16[0]}, [r1, :32] PR11158 rdar://10703339 llvm-svn: 154340	2012-04-09 20:32:02 +00:00
Chad Rosier	99cbde9e82	Update comments and remove unnecessary isVolatile() check. llvm-svn: 154336	2012-04-09 19:38:15 +00:00
Bob Wilson	6f9be7e2c6	Fix Thumb __builtin_longjmp with integrated assembler. <rdar://problem/11203543> The tLDRr instruction with the last register operand set to the zero register prints in assembly as if no register was specified, and the assembler encodes it as a tLDRi instruction with a zero immediate. With the integrated assembler, that zero register gets emitted as "r0", so we get "ldr rx, [ry, r0]" which is broken. Emit the instruction as tLDRi with a zero immediate. I don't know if there's a good way to write a testcase for this. Suggestions welcome. Opportunities for follow-up work: 1) The asm printer should complain if a non-optional register operand is set to the zero register, instead of silently dropping it. 2) The integrated assembler should complain in the same situation, instead of silently emitting the operand as "r0". llvm-svn: 154261	2012-04-07 16:51:59 +00:00
Jim Grosbach	0c509fa6bf	Tidy up. 80 columns. llvm-svn: 154226	2012-04-06 23:43:50 +00:00
Jakob Stoklund Olesen	baa3566091	ARMPat is equivalent to Requires<[IsARM]>. llvm-svn: 154210	2012-04-06 21:21:59 +00:00
Jakob Stoklund Olesen	b4bd3880ba	Eliminate iOS-specific tail call instructions. After register masks were introduced to represent the call clobbers, it is no longer necessary to have duplicate instructions for iOS. llvm-svn: 154209	2012-04-06 21:17:42 +00:00
Chandler Carruth	8a102c21e3	There is no portable std::abs overload for int64_t, use the llvm::abs64 which exists for this purpose. llvm-svn: 154199	2012-04-06 20:10:52 +00:00
Jakob Stoklund Olesen	967b86a0a2	Allow negative immediates in ARM and Thumb2 compares. ARM and Thumb2 mode can use cmn instructions to compare against negative immediates. Thumb1 mode can't. llvm-svn: 154183	2012-04-06 17:45:04 +00:00
Jakob Stoklund Olesen	6a2e99a46a	Deduplicate ARM call-related instructions. We had special instructions for iOS because r9 is call-clobbered, but that is represented dynamically by the register mask operands now, so there is no need for the pseudo-instructions. llvm-svn: 154144	2012-04-06 00:04:58 +00:00
Jim Grosbach	d6a1a1dc2f	ARM: Don't form a t2LDRi8 or t2STRi8 with an offset of zero. The load/store optimizer splits LDRD/STRD into two instructions when the register pairing doesn't work out. For negative offsets in Thumb2, it uses t2STRi8 to do that. That's fine, except for the case when the offset is in the range [-4,-1]. In that case, we'll also form a second t2STRi8 with the original offset plus 4, resulting in a t2STRi8 with a non-negative offset, which ends up as if it were an STRT, which is completely bogus. Similarly for loads. No testcase, unfortunately, as any I've been able to construct is both large and extremely fragile. rdar://11193937 llvm-svn: 154141	2012-04-05 23:51:24 +00:00
Jim Grosbach	930f2f66e7	ARM assembly aliases for add negative immediates using sub. 'add r2, #-1024' should just use 'sub r2, #1024' rather than erroring out. Thumb1 aliases for adding a negative immediate to the stack pointer, also. rdar://11192734 llvm-svn: 154123	2012-04-05 20:57:13 +00:00
Silviu Baranga	af3c79f0ac	Added support for unpredictable ADC/SBC instructions on ARM, and also fixed some corner cases involving the PC register as an operand for these instructions. llvm-svn: 154101	2012-04-05 16:19:29 +00:00
Silviu Baranga	d365397daa	Added support for handling unpredictable arithmetic instructions on ARM. llvm-svn: 154100	2012-04-05 16:13:15 +00:00
Jim Grosbach	15c6884a4b	ARM assembly aliases for two-operand V[R]SHR instructions. rdar://11189467 llvm-svn: 154087	2012-04-05 07:23:53 +00:00
Jim Grosbach	3d00eecc53	ARM assembly parsing for 'msr' plain 'cpsr' operand. Plain 'cpsr' is an alias for 'cpsr_fc'. rdar://11153753 llvm-svn: 154080	2012-04-05 03:17:53 +00:00
Jakob Stoklund Olesen	0a5b72f0e4	Implement ARMBaseInstrInfo::commuteInstruction() for MOVCCr. A MOVCCr instruction can be commuted by inverting the condition. This can help reduce register pressure and remove unnecessary copies in some cases. <rdar://problem/11182914> llvm-svn: 154033	2012-04-04 18:23:42 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Dylan Noblesmith	7a3973d3e0	ARMDisassembler: drop bogus dependency on ARMCodeGen And indirectly, a dependency on most of the core LLVM optimization libraries. llvm-svn: 153957	2012-04-03 15:48:14 +00:00
Benjamin Kramer	1c0541b031	Move getOpcodeName from the various target InstPrinters into the superclass MCInstPrinter. All implementations used the same code. llvm-svn: 153866	2012-04-02 08:32:38 +00:00
Craig Topper	dab9e35ad0	Remove getInstructionName from MCInstPrinter implementations in favor of using the instruction name table from MCInstrInfo. Reduces static data in the InstPrinter implementations. llvm-svn: 153863	2012-04-02 07:01:04 +00:00
Craig Topper	54bfde79db	Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo. llvm-svn: 153860	2012-04-02 06:09:36 +00:00
Jakob Stoklund Olesen	d915503486	Add a 2 byte safety margin in offset computations. ARMConstantIslandPass still has bugs where jump table compression can cause constant pool entries to go out of range. Add a safety margin of 2 bytes when placing constant islands, but use the real max displacement for verification. <rdar://problem/11156595> llvm-svn: 153789	2012-03-31 00:06:44 +00:00
Jakob Stoklund Olesen	24bb3d59d7	Add more debugging output to ARMConstantIslandPass. llvm-svn: 153788	2012-03-31 00:06:42 +00:00
Jim Grosbach	913cc3072d	ARM fix encoding fixup resolution for ldrd and friends. The 8-bit payload is not contiguous in the opcode. Move the upper nibble over 4 bits into the correct place. rdar://11158641 llvm-svn: 153780	2012-03-30 21:54:22 +00:00
Jim Grosbach	fdaab531b7	ARM assembler should prefer non-aliased encoding of cmp. When an immediate is both a value [t2_]so_imm and a [t2_]so_imm_neg, we want to use the non-negated form to make sure we prefer the normal encoding, not the aliased encoding via the negation of, e.g., 'cmp.w'. llvm-svn: 153770	2012-03-30 19:59:02 +00:00
Jim Grosbach	daa04130ed	ARM encoding for VSWP got the second operand incorrect. Make the non-tied register operand names line up with what the base class encoding handler expects. rdar://11157236 llvm-svn: 153766	2012-03-30 18:53:01 +00:00
Jim Grosbach	74005ae691	ARM can only use narrow encoding for low regs. llvm-svn: 153765	2012-03-30 18:39:43 +00:00
Jim Grosbach	def5e34812	ARM integrated assembler should encoding choice for add/sub imm. For 'adds r2, r2, #56' outside of an IT block, the 16-bit encoding T2 can be used for this syntax. Prefer the narrow encoding when possible. rdar://11156277 llvm-svn: 153759	2012-03-30 17:20:40 +00:00
Jim Grosbach	199ab90946	ARM assembly parsing needs to be paranoid about negative immediates. Make sure to treat immediates as unsigned when doing relative comparisons. rdar://11153621 llvm-svn: 153753	2012-03-30 16:31:31 +00:00
James Molloy	fb5cd6085f	Ensure conditional BL instructions for ARM are given the fixup fixup_arm_condbranch. Patch by Tim Northover! llvm-svn: 153737	2012-03-30 09:15:32 +00:00
Evan Cheng	a40d40602c	ARM target should allow codegenprep to duplicate ret instructions to enable tailcall opt. rdar://11140249 llvm-svn: 153717	2012-03-30 01:24:39 +00:00
Jakob Stoklund Olesen	d8af9a5ee1	Invalidate liveness in ARMConstantIslandPass. This pass splits basic blocks to insert constant islands, and it doesn't recompute the live-in lists. No later passes depend on accurate liveness information. This fixes PR12410 where the machine code verifier was complaining. llvm-svn: 153700	2012-03-29 23:14:26 +00:00
Jakob Stoklund Olesen	2f2897372a	Prefer even-odd D-register pairs. We are sometimes allocating from the DPair register class which contains odd-even pairs in addition to the Q registers. Place the Q registers first in the DPair allocation order as they can be copied with a single instruction. The odd-even pairs should only be allocated as a last resort. llvm-svn: 153699	2012-03-29 22:54:32 +00:00
Lang Hames	591cdaf2ee	Try using vmov.i32 to materialize FP32 constants that can't be materialized by vmov.f32. llvm-svn: 153696	2012-03-29 21:56:11 +00:00
Jim Grosbach	0b0298302c	ARM assembly 'cmp lr, #0' should not encode using 'cmn'. The CMP->CMN alias was matching for an immediate of zero when it should only match for negative values. rdar://11129224 llvm-svn: 153689	2012-03-29 21:19:52 +00:00
Jakob Stoklund Olesen	caa6bd273f	Handle register copies for the new ARM register classes. ARM recently gained DPair, DTriple, and DQuad register classes. Update copyPhysReg() to handle copies in these register classes. No test case, it is difficult to make the register allocator emit the odd copies reliably. The missing DPair copy caused a failure on partialsums in the nightly test suite. <rdar://problem/11147997> llvm-svn: 153686	2012-03-29 21:10:40 +00:00
Jakob Stoklund Olesen	b6a7a89289	Don't kill the base register when expanding strd. When an strd instruction doesn't get the registers it wants, it can be expanded into two str instructions. Make sure the first str doesn't kill the base register in the case where the base and data registers are identical: t2STRi12 %R0<kill>, %R0, 4, pred:14, pred:%noreg t2STRi12 %R2<kill>, %R0, 8, pred:14, pred:%noreg <rdar://problem/11101911> llvm-svn: 153611	2012-03-28 23:07:03 +00:00
Jakob Stoklund Olesen	cdee326ab6	Preserve implicit defs in ARMLoadStoreOptimizer. When a number of sub-register VLDRS instructions are combined into a VLDM, preserve any super-register implicit defs. This is required to keep the register scavenger and machine code verifier happy. Enable machine code verification after ARMLoadStoreOptimizer. ARM/2012-01-26-CopyPropKills.ll was failing because of this. llvm-svn: 153610	2012-03-28 22:50:56 +00:00
Jakob Stoklund Olesen	9e512120b7	Spill DPair registers, not just QPR. The arm_neon intrinsics can create virtual registers from the DPair register class which allows both even-odd and odd-even D-register pairs. This fixes PR12389. llvm-svn: 153603	2012-03-28 21:20:32 +00:00
Jakob Stoklund Olesen	8cb97523c6	Revert r153516: "Invalidate liveness in Thumb2ITBlockPass." Revert r153519: "ARMLoadStoreOptimizer invalidates register liveness." These patches caused miscompilations in povray by turning off branch folding's updating of live-in lists. It turns out that the late scheduler depends on the live-in lists, even if it doesn't need correct kill flags. <rdar://problem/11139228> llvm-svn: 153593	2012-03-28 20:11:44 +00:00
Richard Barton	7ce39497b4	Fixup VST1.32 with writeback instruction. Also re-factor non-writeback version. llvm-svn: 153573	2012-03-28 10:18:11 +00:00
Jakob Stoklund Olesen	4acbcb3171	ARMLoadStoreOptimizer invalidates register liveness. This pass tries to update kill flags, but there are still many bugs. Passes after the load/store optimizer don't need accurate liveness, so don't even try. <rdar://problem/11101911> llvm-svn: 153519	2012-03-27 17:33:52 +00:00
Jakob Stoklund Olesen	14459cdc49	Invalidate liveness in Thumb2ITBlockPass. llvm-svn: 153516	2012-03-27 17:06:06 +00:00
Craig Topper	1fcf5bcae1	Prune some includes llvm-svn: 153502	2012-03-27 07:54:11 +00:00
Craig Topper	f6e7e12f75	Remove unnecessary llvm:: qualifications llvm-svn: 153500	2012-03-27 07:21:54 +00:00
Evan Cheng	a2b48d985b	ARM has a peephole optimization which looks for a def / use pair. The def produces a 32-bit immediate which is consumed by the use. It tries to fold the immediate by breaking it into two parts and folding them into the immediate fields of two uses. e.g movw r2, #40885 movt r3, #46540 add r0, r0, r3 => add.w r0, r0, #3019898880 add.w r0, r0, #30146560 ; However, this transformation is incorrect if the user produces a flag. e.g. movw r2, #40885 movt r3, #46540 adds r0, r0, r3 => add.w r0, r0, #3019898880 adds.w r0, r0, #30146560 Note the adds.w may not set the carry flag even if the original sequence would. rdar://11116189 llvm-svn: 153484	2012-03-26 23:31:00 +00:00
Craig Topper	6e80c28017	Prune some includes and forward declarations. llvm-svn: 153429	2012-03-26 06:58:25 +00:00
Craig Topper	5fa0caafc0	Prune includes and replace uses of ARMRegisterInfo.h with ARMBaseRegisterInfo.h llvm-svn: 153422	2012-03-26 00:45:15 +00:00
Craig Topper	07720d8dcd	Replace uses of ARMBaseInstrInfo and ARMTargetMachine with the Base versions. llvm-svn: 153421	2012-03-25 23:49:58 +00:00
Craig Topper	d4a964cd70	Prune some includes and forward declarations. llvm-svn: 153415	2012-03-25 18:10:17 +00:00
Jim Grosbach	190e7b6e18	ARM tidy up ARMConstantIsland.cpp. No functional change, just tidy up the code and nomenclature a bit. llvm-svn: 153347	2012-03-23 23:07:03 +00:00
Silviu Baranga	4afd7d2316	Added soft fail checks for the disassembler when decoding some corner cases of the STRD, STRH, LDRD, LDRH, LDRSH and LDRSB instructions on ARM. llvm-svn: 153252	2012-03-22 14:14:49 +00:00
Silviu Baranga	d213f2111a	Added soft fail cases for the disassembler when decoding LDRSBT, LDRHT or LDRSHT instruction on ARM llvm-svn: 153251	2012-03-22 13:24:43 +00:00
Silviu Baranga	a6ea32afdd	Added soft fail cases for the disassembler when decoding MUL instructions on ARM. llvm-svn: 153250	2012-03-22 13:14:39 +00:00
Kevin Enderby	7e7d5eefb2	Fix ARM disassembly of VST1 and VST2 instructions with writeback. And add a test case for all opcodes handled by DecodeVSTInstruction() in ARMDisassembler.cpp. llvm-svn: 153218	2012-03-21 20:54:32 +00:00
Evan Cheng	759b1d169f	Change conditional instructions definitions, e.g. ANDCC, ARMPseudoExpand and t2PseudoExpand. llvm-svn: 153135	2012-03-20 21:28:05 +00:00
Matt Beaumont-Gay	dc873d5e6a	remove unused variable llvm-svn: 153116	2012-03-20 19:52:05 +00:00
Bob Wilson	b60f8f875c	Require a base pointer for stack realignment when SP may vary dynamically. ARMBaseRegisterInfo::canRealignStack was checking for variable-sized objects but not for stack adjustments around calls. Use hasReservedCallFrame() to check for both. The hasBasePointer function was already correctly checking both conditions, so the effect of this was that a base pointer would be used without checking whether the base pointer register could be reserved. I don't have a small testcase for this. <rdar://problem/11075906> llvm-svn: 153110	2012-03-20 19:28:25 +00:00
Bob Wilson	ca690320fb	Remove some redundant checks. ARMFrameLowering::hasReservedCallFrame is already checking for variable sized objects, so there's no point in checking it twice. llvm-svn: 153109	2012-03-20 19:28:22 +00:00
Kevin Enderby	816ca27ef6	Fix assembling ARM vst2 instructions with double-spaced registers. llvm-svn: 153099	2012-03-20 17:41:51 +00:00
Jim Grosbach	997614f597	ARM non-scattered MachO relocations for movw/movt. Needed when building -mdynamic-no-pic code. rdar://10459256 llvm-svn: 153097	2012-03-20 17:25:45 +00:00
Silviu Baranga	32a49333ec	The ARM instructions that have an unpredictable behavior when the pc register operand is given now fail with soft fail. Modified the regression tests to reflect this. llvm-svn: 153089	2012-03-20 15:54:56 +00:00
Richard Barton	7caea33dfa	Test Commit - add a newline llvm-svn: 153083	2012-03-20 10:50:35 +00:00
Jim Grosbach	c4aa60ffe9	ARM branch relaxation for unconditional t1 branches. rdar://11059157 llvm-svn: 153055	2012-03-19 21:32:32 +00:00
Jim Grosbach	67e76babd3	ARM assembly, accept optional '#' on lane index number. rdar://11057160 llvm-svn: 153053	2012-03-19 20:39:53 +00:00
Anton Korobeynikov	3edd854d64	Perform mul combine when multiplying with negative constants. Patch by Weiming Zhao! This fixes PR12212 llvm-svn: 153049	2012-03-19 19:19:50 +00:00
Craig Topper	188ed9d56e	Reorder includes to match coding standards. Fix an issue or two exposed by that. llvm-svn: 152978	2012-03-17 07:33:42 +00:00
Bill Wendling	23f8c4a50c	Check if we can handle the arguments of a call (and therefore the call) in fast-isel before emitting code. If the program bails after code was emitted, then it could lead to the stack being adjusted more than once (two CALLSEQ_BEGINs emitted) but being adjusted back only once after the call. This leads to general badness and gnashing of teeth. <rdar://problem/11050630> llvm-svn: 152959	2012-03-16 23:11:07 +00:00
Jim Grosbach	c40b0f72bb	ARM fix silly typo in optional operand alias. rdar://11065671 llvm-svn: 152954	2012-03-16 22:18:29 +00:00
Jim Grosbach	db7db7d3a3	ARM divided syntax fmrx/fmxr mnemonics. llvm-svn: 152946	2012-03-16 21:06:13 +00:00
Jim Grosbach	905686a82a	ARM ldm/stm register lists can be out of order. It's not a good style idea, as the registers will be laid down in memory in numerical order, not the order they're in the list, but it's legal. vldm/vstm are stricter. rdar://11064740 llvm-svn: 152943	2012-03-16 20:48:38 +00:00
Jim Grosbach	7cb9a13b02	ARM optional operand on MRC/MCR assembly instructions. rdar://11058464 llvm-svn: 152883	2012-03-16 00:45:58 +00:00
Jim Grosbach	24d90e2ddc	ARM vmrs system registers mvfr0 and mvfr1 handling. rdar://11058464 llvm-svn: 152881	2012-03-16 00:27:18 +00:00
Jim Grosbach	6d9766b355	Remove inadvertent commit. llvm-svn: 152870	2012-03-15 23:00:30 +00:00
Chad Rosier	26d05887d9	[fast-isel] Address Eli's comments for r152847. Specifically, add a test case and still allow immediate encoding, just not with cmn. rdar://11038907 llvm-svn: 152869	2012-03-15 22:54:20 +00:00
Chad Rosier	01cecbffd6	[fast-isel] Don't try to encode LONG_MIN using cmn instructions. rdar://11038907 llvm-svn: 152847	2012-03-15 21:40:23 +00:00
Jim Grosbach	d28888dd77	ARM case-insensitive checking for APSR_nzcv. rdar://11056591 llvm-svn: 152846	2012-03-15 21:34:14 +00:00
Jim Grosbach	d74560b170	ARM aliases for pre-unified syntax fcmpz[sd] mnemonics. rdar://11056647 llvm-svn: 152834	2012-03-15 20:48:18 +00:00
Lang Hames	c35ee8b54a	Use vmov.f32 to materialize f32 consts on ARM. This relaxes constraints on register allocation by allowing all 32 D-registers to be used. Patch by Cameron Zwarich. llvm-svn: 152824	2012-03-15 18:49:02 +00:00
Kristof Beyls	327d2f9da5	Fix VCVT decoding (between floating-point and fixed-point, Floating-point). Patch by Richard Barton. llvm-svn: 152814	2012-03-15 17:50:29 +00:00
Bob Wilson	274d6f1777	Switch to unified syntax for VFP instructions in inline assembly. <rdar://problem/11024696> llvm-svn: 152548	2012-03-12 06:15:36 +00:00
Craig Topper	bef78fc2ee	Convert more static tables of registers used by calling convention to uint16_t to reduce space. llvm-svn: 152538	2012-03-11 07:57:25 +00:00
Craig Topper	ca658c2264	Use uint16_t to store registers and opcode in static tables in the target specific backends. llvm-svn: 152537	2012-03-11 07:16:55 +00:00
Craig Topper	5a4bcc749a	Use uint16_t to store instruction implicit uses and defs. Reduces static data. llvm-svn: 152301	2012-03-08 08:22:45 +00:00
Jim Grosbach	11e8c0d6b5	ARM don't use MCRelaxAll, as it's not safe on ARM. The ARM code generator makes aggressive assumptions about the encodings being selected for branches which MCRelaxAll invalidates. rdar://11006355 llvm-svn: 152268	2012-03-08 00:07:52 +00:00
Chad Rosier	377f1f2d39	[fast-isel] ARMEmitCmp generates FMSTAT, which transfers the floating-point condition flags to CPSR. This allows us to simplify SelectCmp. Patch by Zonr Chang <zonr.xchg@gmail.com>. llvm-svn: 152243	2012-03-07 20:59:26 +00:00
Jim Grosbach	eadd8ee49c	ARM pre-v6 assembly parsing for umull/smull. llvm-svn: 152188	2012-03-07 01:09:17 +00:00
Jim Grosbach	8db462042c	ARM pre-v6 alias for 'nop' to 'mov r0, r0' llvm-svn: 152185	2012-03-07 00:52:41 +00:00
Jim Grosbach	eed9992b26	Tidy up. Remove dead code that slipped into previous commit. llvm-svn: 152184	2012-03-07 00:52:39 +00:00
Jim Grosbach	ed428bc1ce	ARM more NEON VLD/VST composite physical register refactoring. Register pair, all lanes subscripting. llvm-svn: 152157	2012-03-06 23:10:38 +00:00
Jim Grosbach	13a292cc74	ARM refactor more NEON VLD/VST instructions to use composite physregs Register pair VLD1/VLD2 all-lanes instructions. Kill off more of the pseudos as a result. llvm-svn: 152150	2012-03-06 22:01:44 +00:00
Jim Grosbach	63ee881cd6	Tidy up. Kill some dead code. llvm-svn: 152131	2012-03-06 18:59:19 +00:00
Jakob Stoklund Olesen	579e701fd9	Allow the same types in DPair as in QPR. llvm-svn: 152129	2012-03-06 18:44:11 +00:00
Kevin Enderby	520eb3ba8a	Fix a bug in the ARM disassembly of the neon VLD2 all lanes instruction. llvm-svn: 152127	2012-03-06 18:33:12 +00:00
Jakob Stoklund Olesen	d9b427ee65	Add <imp-def> operands when reloading into physregs. When an instruction only writes sub-registers, it is still necessary to add an <imp-def> operand for the super-register. When reloading into a virtual register, rewriting will add the operand, but when loading directly into a physical register, the <imp-def> operand is still necessary. llvm-svn: 152095	2012-03-06 02:48:17 +00:00
Lang Hames	718cfbe05a	Split fpscr into two registers: FPSCR and FPSCR_NZCV. The fpscr register contains both flags (set by FP operations/comparisons) and control bits. The control bits (FPSCR) should be reserved, since they're always available and needn't be defined before use. The flag bits (FPSCR_NZCV) should like to be unreserved so they can be hoisted by MachineCSE. This fixes PR12165. llvm-svn: 152076	2012-03-06 00:19:55 +00:00
Jim Grosbach	8dc347fc27	ARM vpush/vpop assembler mnemonics accept an optional size suffix. rdar://10988114 llvm-svn: 152068	2012-03-05 23:16:31 +00:00
Jim Grosbach	e5307f9019	ARM Refactor VLD/VST spaced pair instructions. Use the new composite physical registers. llvm-svn: 152063	2012-03-05 21:43:40 +00:00
Jim Grosbach	c71bf4739a	ARM Remove a bit of dead code. llvm-svn: 152061	2012-03-05 21:09:58 +00:00
Jim Grosbach	c988e0c521	ARM refactor away a bunch of VLD/VST pseudo instructions. With the new composite physical registers to represent arbitrary pairs of DPR registers, we don't need the pseudo-registers anymore. Get rid of a bunch of them that use DPR register pairs and just use the real instructions directly instead. llvm-svn: 152045	2012-03-05 19:33:30 +00:00
Jim Grosbach	fd93a59557	Make MCRegisterInfo available to the MCInstPrinter. Used to allow context sensitive printing of super-register or sub-register references. llvm-svn: 152043	2012-03-05 19:33:20 +00:00
Sebastian Pop	957a6583f1	updated patch for the ARM fused multiply add/sub In this update: - I assumed neon2 does not imply vfpv4, but neon and vfpv4 imply neon2. - I kept setting .fpu=neon-vfpv4 code attribute because that is what the assembler understands. Patch by Ana Pazos <apazos@codeaurora.org> llvm-svn: 152036	2012-03-05 17:39:52 +00:00
Craig Topper	4b02a29eba	Convert more GenRegisterInfo tables from unsigned to uint16_t to reduce static data size. llvm-svn: 152016	2012-03-05 05:37:41 +00:00
Jakob Stoklund Olesen	f729ceae04	Use <def,undef> operands when spilling NEON bundles. MachineOperands that define part of a virtual register must have an <undef> flag if they are not intended as read-modify-write operands. The old trick of adding an <imp-def> operand doesn't work any longer. Fixes PR12177. llvm-svn: 152008	2012-03-04 18:40:30 +00:00
Craig Topper	b35eacb0f0	Use uint16_t instead of unsigned to store registers in reg classes. Reduces static data size. llvm-svn: 151998	2012-03-04 10:16:38 +00:00
Craig Topper	420525ce3b	Use uint16_t to store registers in callee saved register tables to reduce size of static data. llvm-svn: 151996	2012-03-04 03:33:22 +00:00
Evan Cheng	d12af5dc69	Neuter the optimization I implemented with r107852 and r108258 which turn some floating point equality comparisons into integer ones with -ffast-math. The issue is the optimization causes +0.0 != -0.0. Now the optimization is only done when one side is known to be 0.0. The other side's sign bit is masked off for the comparison. rdar://10964603 llvm-svn: 151861	2012-03-01 23:27:13 +00:00
Jakob Stoklund Olesen	693225f04a	Handle regmasks in Thumb1RegisterInfo::saveScavengerRegister(). This function could have r12 live across a function call when compiling thumb1 code. The test case for this is not included because it is very long. It must provoke emergency spilling near a function call. The behavior is provoked by MultiSource/Applications/JM/lencod, and it triggers an assertion in the scavenger. <rdar://problem/10963642> llvm-svn: 151855	2012-03-01 22:57:32 +00:00
Jim Grosbach	6990e5f08c	ARM use the right opcode for FP<->Integer move in fast-isel. rdar://10965031 llvm-svn: 151850	2012-03-01 22:47:09 +00:00
Kevin Enderby	f0269b4270	Change ARMInstPrinter::printPredicateOperand() so it will not abort if it runs into the undefined 15 condition code value. llvm-svn: 151844	2012-03-01 22:13:02 +00:00
Derek Schuff	56b662ce0f	Make MemoryObject accessor members const again llvm-svn: 151687	2012-02-29 01:09:06 +00:00
Jim Grosbach	617f84ddbd	ARM implement TargetInstrInfo::getNoopForMachoTarget() Without this hook, functions w/ a completely empty body (including no epilogue) will cause an MCEmitter assertion failure. For example, define internal fastcc void @empty_function() { unreachable } rdar://10947471 llvm-svn: 151673	2012-02-28 23:53:30 +00:00
Jim Grosbach	a0ec8896ac	ARM vbit/vbif/vbsl assembly optional size suffix. These instructions accept but do not require a size suffix. rdar://10947225 llvm-svn: 151646	2012-02-28 19:11:07 +00:00
Evan Cheng	65f9d19c4f	Re-commit r151623 with fix. Only issue special no-return calls if it's a direct call. llvm-svn: 151645	2012-02-28 18:51:51 +00:00
Daniel Dunbar	ee7b899343	Revert r151623 "Some ARM implementations, e.g. A-series, do return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Evan Cheng	87c7b09d8d	Some ARM implementations, e.g. A-series, do return stack prediction. That is, the processor keeps a return address stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt the RAS and cause 100% return misprediction, so LLVM should use an unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Jakob Stoklund Olesen	92c15b2b2c	Enable ARM base pointer when calling functions with large arguments. When an outgoing call takes more than 2k of arguments on the stack, we don't allocate that call frame in the prolog, but adjust the stack pointer immediately before the call instead. This causes problems with the emergency spill slot because PEI can't track stack pointer adjustments on the second pass, and if the outgoing arguments are too big, SP can't be used to reach the emergency spill slot at all. Work around these problems by ensuring there is a base or frame pointer that can be used to access the emergency spill slot. <rdar://problem/10917166> llvm-svn: 151604	2012-02-28 01:15:01 +00:00
Jim Grosbach	7b811d30d9	ARM BL/BLX instruction fixups should use relocations. We rely on the linker to resolve calls to the appropriate BL/BLX instruction to make interworking function correctly. It uses the symbol in the relocation to do that, so we need to be careful about being too clever. To enable this for ARM mode, split the BL/BLX fixup kind off from the unconditional-branch fixups. rdar://10927209 llvm-svn: 151571	2012-02-27 21:36:23 +00:00
Kevin Enderby	1489b523c3	Fix the symbolic operand added for the C disassembler API for the ARM bl thumb instruction. The PC adjustment is +4 in Thumb mode and +8 in ARM mode. llvm-svn: 151530	2012-02-27 18:15:15 +00:00
Benjamin Kramer	9fceb90175	Remove unused cl::opt, make another opt static. llvm-svn: 151398	2012-02-24 22:09:25 +00:00
Jim Grosbach	09b602d85c	Thumb2 asm aliases for wide bitwise w/ immediate instructions. llvm-svn: 151384	2012-02-24 19:06:05 +00:00
Jia Liu	13830229fd	comment fix llvm-svn: 151339	2012-02-24 02:15:21 +00:00
Jakob Stoklund Olesen	fa7a53746c	Switch ARM target to register masks. I'll let the buildbots determine the compile time improvements from this change, but 464.h264ref has 5% faster codegen at -O2. This patch does cause some assembly changes. Branch folding can make different decisions about calls with dead return values. CriticalAntiDepBreaker may choose different registers because its liveness tracking is affected. MachineCopyPropagation may sometimes leave a dead copy behind. llvm-svn: 151331	2012-02-24 01:19:29 +00:00
Jim Grosbach	3a21e2c33e	Make sure the regs are low regs for tMUL size reduction. llvm-svn: 151318	2012-02-24 00:53:11 +00:00
Jim Grosbach	c01104dfbf	Thumb2 size reduction fix for tied operands of tMUL. The tied source operand of tMUL is the second source operand, not the first like every other two-address thumb instruction. Special case it in the size reduction pass to make sure we create the tMUL instruction properly. llvm-svn: 151315	2012-02-24 00:33:36 +00:00
Dan Gohman	d4a77c4682	When emitting a cmp with 0 for a lowered select, mask out the high bits of the value carrying the boolean condition, as their contents are undefined. This fixes rdar://10887484. llvm-svn: 151310	2012-02-24 00:09:36 +00:00
Kevin Enderby	6fbcd8d439	Updated the llvm-mc disassembler C API to support the X86 target. rdar://10873652 As part of this I updated the llvm-mc disassembler C API to always call the SymbolLookUp call back even if there is no getOpInfo call back. If there is a getOpInfo call back that is tried first and then if that gets no information then the SymbolLookUp is called. I also made the code more robust by memset(3)'ing to zero the LLVMOpInfo1 struct before setting SymbolicOp.Value for the call to getOpInfo. And also don't use any values from the LLVMOpInfo1 struct if getOpInfo returns 0. And also don't use any of the ReferenceType or ReferenceName values from SymbolLookUp if it returns NULL. rdar://10873563 and rdar://10873683 For the X86 target also fixed bugs so the annotations get printed. Also fixed a few places in the ARM target that were not producing symbolic operands for some instructions. rdar://10878166 llvm-svn: 151267	2012-02-23 18:18:17 +00:00
Duncan Sands	a354d58f8d	Remove unused variable. llvm-svn: 151251	2012-02-23 11:01:22 +00:00
Evan Cheng	f258a15bdf	Canonicalize (srl (bswap x), 16) to (rotr (bswap x), 16) if the high 16 bits of x are zero. This optimizes rev + lsr 16 to rev16. rdar://10750814 llvm-svn: 151230	2012-02-23 02:58:19 +00:00
Evan Cheng	e87681cf34	Optimize a couple of common patterns involving conditional moves where the false value is zero. Instead of a cmov + op, issue a conditional op instead. e.g. cmp r9, r4 mov r4, #0 moveq r4, #1 orr lr, lr, r4 should be: cmp r9, r4 orreq lr, lr, #1 That is, optimize (or x, (cmov 0, y, cond)) to (or.cond x, y). Similarly extend this to xor as well as (and x, (cmov -1, y, cond)) => (and.cond x, y). It's possible to extend this to ADD and SUB but I don't think they are common. rdar://8659097 llvm-svn: 151224	2012-02-23 01:19:06 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Craig Topper	760b134ffa	Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified. llvm-svn: 151134	2012-02-22 05:59:10 +00:00
Jakob Stoklund Olesen	5f37f1c39d	Clarify ARM calling conventions. llvm-svn: 151113	2012-02-22 01:07:19 +00:00
Jakob Stoklund Olesen	6909faaf35	Calls don't really change the stack pointer. Even if a call instruction has %SP<imp-def> operands, it doesn't change the value of the stack pointer. llvm-svn: 151104	2012-02-21 23:47:43 +00:00
Evan Cheng	0460ae8d80	Proper support for a bastardized darwin-eabi hybrid ABI. llvm-svn: 151083	2012-02-21 20:46:00 +00:00
James Molloy	547d4c0662	Improve generated code for extending loads and some trunc stores on ARM. Teach TargetSelectionDAG about lengthening loads for vector types and set v4i8 as legal. Allow FP_TO_UINT for v4i16 from v4i32. llvm-svn: 150956	2012-02-20 09:24:05 +00:00
Ahmed Charles	636a3d618c	Remove dead code. Improve llvm_unreachable text. Simplify some control flow. llvm-svn: 150918	2012-02-19 11:37:01 +00:00
Jia Liu	608dc6e257	comment fix ARM.h llvm-svn: 150904	2012-02-19 02:04:03 +00:00
Jia Liu	b22310fda6	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Jakob Stoklund Olesen	4fad5b2b9e	Handle regmask operands in ARMInstrInfo. llvm-svn: 150833	2012-02-17 19:23:15 +00:00
Jakob Stoklund Olesen	96732a438d	Fix ARMBaseInstrInfo::getInstrLatency for calls. Calls always clobber CPSR. llvm-svn: 150831	2012-02-17 19:07:59 +00:00
Chad Rosier	fcd29ae390	[fast-isel] Add support for returning non-legal types with no sign- or zero- extend flag. llvm-svn: 150774	2012-02-17 01:21:28 +00:00
Lang Hames	5bade3dc6e	Re-enable 150652 and 150654 - Make FPSCR non-reserved, and make MachineCSE bail on reserved registers. This should be safe as of r150786. llvm-svn: 150769	2012-02-17 00:27:16 +00:00
Chad Rosier	a0d3c75015	Remove unnecessary assignment to temporary, ResultReg. llvm-svn: 150737	2012-02-16 22:45:33 +00:00
Lang Hames	55a2a96153	Oop - r150653 + r150654 broke one of my test cases. Backing out for now... llvm-svn: 150655	2012-02-16 02:32:10 +00:00
Lang Hames	11ca986b17	FPSCR shouldn't be reserved. llvm-svn: 150654	2012-02-16 02:28:14 +00:00
Chad Rosier	0bc5132457	Add braces to if clause to make symmetric with associate else clause. llvm-svn: 150591	2012-02-15 17:36:21 +00:00
Bill Wendling	dfb45f4d68	Strip the pointer casts from the constants here. The c'tor list is stored as a list of 'void ()*'s, so all of the functions are bitcast to that. However, the dyn_cast doesn't automagically look through bitcasts. Do that for it. <rdar://problem/10813350> llvm-svn: 150572	2012-02-15 09:14:08 +00:00
Chad Rosier	dccc4794e6	Use a temporary variable, rather than a series of redundant calls. llvm-svn: 150536	2012-02-15 00:23:55 +00:00
Chad Rosier	5b9c3974d2	Remove unnecessary assignment to temporary, ResultReg. llvm-svn: 150520	2012-02-14 22:29:48 +00:00
Lang Hames	876f24f706	Third time's the charm...? llvm-svn: 150447	2012-02-14 00:34:30 +00:00
Lang Hames	185455df7e	Unswap swap operands, partially reducing confusion. llvm-svn: 150444	2012-02-14 00:17:12 +00:00
Bill Wendling	05d6f2ff1e	Don't reserve the R0 and R1 registers here. We don't use these registers, and marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437	2012-02-13 23:47:16 +00:00
Lang Hames	aef4ca78c5	Make operands for VSWP read-modify-write. llvm-svn: 150433	2012-02-13 23:37:19 +00:00
Benjamin Kramer	428704eb52	Make the EDis tables const. llvm-svn: 150304	2012-02-11 14:51:07 +00:00
Jim Grosbach	1c9dd2974f	Revert r150222, as the clang driver now handles this properly. Now that the clang driver passes the CPU and feature information to the backend when processing assembly files (150273), this isn't necessary. llvm-svn: 150274	2012-02-10 20:38:46 +00:00
Jason W Kim	c7f4841769	Make valgrind happy. llvm-svn: 150251	2012-02-10 16:07:59 +00:00
Jim Grosbach	ffc02c5ffc	ARM on darwin, v6 implies the presence of VFP for the assembler. rdar://10838899 llvm-svn: 150222	2012-02-10 02:21:49 +00:00
James Molloy	d9ba4fd48f	Teach the MC and disassembler about SoftFail, and hook it up to UNPREDICTABLE on ARM. Wire this to tBLX in order to provide test coverage. llvm-svn: 150169	2012-02-09 10:56:31 +00:00
Andrew Trick	1fa5bcbe2a	Codegen pass definition cleanup. No functionality. Moving toward a uniform style of pass definition to allow easier target configuration. Globally declare Pass ID. Globally declare pass initializer. Use INITIALIZE_PASS consistently. Add a call to the initializer from CodeGen.cpp. Remove redundant "createPass" functions and "getPassName" methods. While cleaning up declarations, cleaned up comments (sorry for large diff). llvm-svn: 150100	2012-02-08 21:23:13 +00:00
Chad Rosier	0ee8c513f7	[fast-isel] Add support for SUBs with non-legal types. llvm-svn: 150047	2012-02-08 02:45:44 +00:00
Chad Rosier	bd471255a9	[fast-isel] Add support for ORs with non-legal types. llvm-svn: 150045	2012-02-08 02:29:21 +00:00
Chad Rosier	ded4c99f2e	[fast-isel] Add support for indirect branches. llvm-svn: 150014	2012-02-07 23:56:08 +00:00
Evan Cheng	45d8f8a08c	Do not fold ADD / SUB into load / store (to form pre-indexed, post-indexed load / store) if the ADD / SUB has a live definition of CPSR. Bug reported by David Meyer. Alas, no test case. llvm-svn: 149970	2012-02-07 07:09:28 +00:00
Craig Topper	e55c556a24	Convert assert(0) to llvm_unreachable llvm-svn: 149961	2012-02-07 02:50:20 +00:00
Chad Rosier	685b20c114	[fast-isel] Add support for ADDs with non-legal types. llvm-svn: 149934	2012-02-06 23:50:07 +00:00
Derek Schuff	8b2dcad4b5	Enable streaming of bitcode This CL delays reading of function bodies from initial parse until materialization, allowing overlap of compilation with bitcode download. llvm-svn: 149918	2012-02-06 22:30:29 +00:00
Evan Cheng	613d6d3b43	DefinesPredicate should only look for def operands. Patch by Ludwig Meier. llvm-svn: 149846	2012-02-05 19:55:04 +00:00
Duncan Sands	ae22c60f90	Persuade GCC that there is nothing worth warning about here (there isn't). llvm-svn: 149834	2012-02-05 14:20:11 +00:00
Andrew Trick	f8ea108c05	TargetPassConfig: confine the MC configuration to TargetMachine. Passes prior to instruction selection are now split into separate configurable stages. Header dependencies are simplified. The bulk of this diff is simply removal of the silly DisableVerify flags. Sorry for the target header churn. Attempting to stabilize them. llvm-svn: 149754	2012-02-04 02:56:59 +00:00
Chad Rosier	b84a4b4c64	[fast-isel] Add support for URem. llvm-svn: 149716	2012-02-03 21:23:45 +00:00
Chad Rosier	e023d5d7f3	[fast-isel] Rename isZExt to isSigned. No functional change intended. llvm-svn: 149714	2012-02-03 21:14:11 +00:00
Chad Rosier	aaa55a88b6	[fast-isel] Add support for UDIV. llvm-svn: 149712	2012-02-03 21:07:27 +00:00
Chad Rosier	41f0e78b6c	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	a8a8ac5d47	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
Andrew Trick	ccb673659a	Added TargetPassConfig. The first little step toward configuring codegen passes. Allows command line overrides to be centralized in LLVMTargetMachine.cpp. LLVMTargetMachine can intercept common passes and give precedence to command line overrides. Allows adding "internal" target configuration options without touching TargetOptions. Encapsulates the PassManager. Provides a good point to initialize all CodeGen passes so that Pass ID's can be used in APIs. Allows modifying the target configuration hooks without rebuilding the world. llvm-svn: 149672	2012-02-03 05:12:41 +00:00
Jakob Stoklund Olesen	caed1c9370	Add pseudo-registers for pairs, triples, and quads of D registers. NEON loads and stores accept single and double spaced pairs, triples, and quads of D registers. This patch adds new register classes to accurately model those constraints: Dn, Dn+1 Dn, Dn+2 ---------------------- DPair DPairSpc DTriple DTripleSpc DQuad DQuadSpc Also extend the existing QQ and QQQQ register classes to contain all Q pairs and quads instead of just the aligned ones. These new register classes will make it possible to accurately model constraints on NEON loads and stores, and we can get rid of all the NEON pseudo-instructions. The late scheduler will be able to accurately model instruction dependencies from the explicit operands. This more than doubles the number of ARM registers, but the backend passes are quite good at handling this. The llc -O0 compile time only regresses by 1.5%. Future work on register mask operands will recover this regression. llvm-svn: 149640	2012-02-02 22:45:32 +00:00
Jakob Stoklund Olesen	c7024a48db	Move ARM subreg index compositions to the SubRegIndex itself. llvm-svn: 149557	2012-02-01 23:16:43 +00:00
Jim Grosbach	a2147ce313	Tidy up. One more return type mismatch fix. llvm-svn: 149452	2012-01-31 23:51:09 +00:00
Jim Grosbach	44091c2f10	Refactor loop for better readability. Excellent suggestion from Ben Kramer. llvm-svn: 149417	2012-01-31 20:56:55 +00:00
Jim Grosbach	b4d3a6af97	Add explanatory comment. llvm-svn: 149416	2012-01-31 20:34:53 +00:00
Anton Korobeynikov	d0c46550fe	Cleanups for EABI standard functions llvm-svn: 149195	2012-01-29 09:11:50 +00:00
Anton Korobeynikov	1b42e64280	Use base AAPCS for varargs functions even for AAPCS-VFP CC llvm-svn: 149194	2012-01-29 09:06:09 +00:00
Bob Wilson	de0c335560	Add a note about a potential optimization for clz/ctz patterns for ARM (and other targets). llvm-svn: 149182	2012-01-28 18:30:07 +00:00
James Molloy	b47489d4ef	Ensure .AliasedSymbol() is called on all uses of getSymbol(). Affects ARM and MIPS ELF backends. Fixes PR11877 llvm-svn: 149180	2012-01-28 15:58:32 +00:00
Jim Grosbach	20275a8577	Better user diagnostics for more ARM MachO relocation errors. llvm-svn: 149102	2012-01-27 00:37:12 +00:00
Jim Grosbach	5e5eabb5ab	Keep source information, if available, around for ARM Fixups. Adjust an example MachObjectWriter diagnostic to use the information to issue a better message. Before: LLVM ERROR: unknown ARM fixup kind! After: x.s:6:5: error: unsupported relocation on symbol beq bar ^ rdar://9800182 llvm-svn: 149093	2012-01-26 23:20:15 +00:00
Jim Grosbach	c8f2b7877b	Tidy up. Fix mismatched return types for error handling. llvm-svn: 149062	2012-01-26 15:56:45 +00:00
James Molloy	6685c08e5f	Add support for the R_ARM_TARGET1 relocation, which should be given to relocations applied to all C++ constructors and destructors. This enables the linker to match concrete relocation types (absolute or relative) with whatever library or C++ support code is being linked against. llvm-svn: 149057	2012-01-26 09:25:43 +00:00
Anton Korobeynikov	7722a2d4e3	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00
Jim Grosbach	82f76d1275	ARM assembly parsing and validation of IT instruction. "Although a Thumb2 instruction, the IT mnemonic shall be permitted in ARM mode, and the condition verified to match the condition code(s) on the following instruction(s)." PR11853 llvm-svn: 148969	2012-01-25 19:52:01 +00:00
Jim Grosbach	086cbfac7d	NEON VLD4(all lanes) assembly parsing and encoding. llvm-svn: 148884	2012-01-25 00:01:08 +00:00
Jim Grosbach	ccb6d55dae	Tidy up. Rename VLD4DUP patterns for consistency. llvm-svn: 148883	2012-01-24 23:47:07 +00:00
Jim Grosbach	b78403ce48	NEON VLD3(all lanes) assembly parsing and encoding. llvm-svn: 148882	2012-01-24 23:47:04 +00:00
Jim Grosbach	8e2722cdb0	NEON VST4(one lane) assembly parsing and encoding. llvm-svn: 148836	2012-01-24 18:53:13 +00:00
Owen Anderson	d845d9d9e9	Widen the instruction encoder that TblGen emits to 64 bits, which should accommodate every target I can think of offhand. llvm-svn: 148833	2012-01-24 18:37:29 +00:00
Jim Grosbach	14952a0e32	NEON VLD4(one lane) assembly parsing and encoding. llvm-svn: 148832	2012-01-24 18:37:25 +00:00
Jim Grosbach	3cfef8d467	NEON Two-operand assembly aliases for VSRA. llvm-svn: 148821	2012-01-24 17:55:36 +00:00
Jim Grosbach	7ae12cc546	NEON Two-operand assembly aliases for VSLI. llvm-svn: 148819	2012-01-24 17:49:15 +00:00
Jim Grosbach	7b6f0f67aa	NEON Two-operand assembly aliases for VSRI. llvm-svn: 148818	2012-01-24 17:46:58 +00:00
Jim Grosbach	681db34eae	NEON add correct predicates for some asm aliases. llvm-svn: 148815	2012-01-24 17:23:29 +00:00
Anton Korobeynikov	3cad0c21ed	Use correct register class for am2offset register operands. This pacifies machine verifier llvm-svn: 148782	2012-01-24 04:58:56 +00:00
Jim Grosbach	da70eac268	NEON VST4(multiple 4 element structures) assembly parsing. llvm-svn: 148764	2012-01-24 00:58:13 +00:00
Jim Grosbach	ed561fc850	NEON VLD4(multiple 4 element structures) assembly parsing. llvm-svn: 148762	2012-01-24 00:43:17 +00:00
Jim Grosbach	1e946a4f91	Tidy up. Remove some vertical space for readability. llvm-svn: 148761	2012-01-24 00:43:12 +00:00
Chandler Carruth	ed975232bc	Revert r148686 (and r148694, a fix to it) due to a serious layering violation -- MC cannot depend on CodeGen. Specifically, the MCTargetDesc component of each target is actually a subcomponent of the MC library. As such, it cannot depend on the target-independent code generator, because MC itself cannot depend on the target-independent code generator. This change moved a flag from the ARM MCTargetDesc file ARMMCAsmInfo.cpp to the CodeGen layer in ARMException.cpp, leaving behind an 'extern' to refer back to it. That layering order isn't viable given the constraints outlined above. Commandline flags are designed to be static specifically to avoid these types of bugs. Fixing this is likely going to require some non-trivial refactoring. llvm-svn: 148759	2012-01-24 00:30:17 +00:00
Jim Grosbach	17bacab475	Fix typo. llvm-svn: 148757	2012-01-24 00:12:39 +00:00
Jim Grosbach	d3d36d9315	NEON VST3(single element from one lane) assembly parsing. llvm-svn: 148755	2012-01-24 00:07:41 +00:00
Jim Grosbach	1a74724fc9	NEON VST3(multiple 3-element structures) assembly parsing. llvm-svn: 148748	2012-01-23 23:45:44 +00:00
Jim Grosbach	ac2af3ffab	NEON VLD3(multiple 3-element structures) assembly parsing. llvm-svn: 148745	2012-01-23 23:20:46 +00:00
Anton Korobeynikov	820417af07	Add missed mayStore flag to STREXD / t2STREXD llvm-svn: 148742	2012-01-23 22:57:52 +00:00
Jim Grosbach	a8b444b08b	NEON VLD3 lane-indexed assembly parsing and encoding. llvm-svn: 148734	2012-01-23 21:53:26 +00:00
Jim Grosbach	d28ef9ac46	Simplify some NEON assembly pseudo definitions. Let the generic token alias definitions handle the data subtype suffixes. We don't need explicit versions for each. llvm-svn: 148718	2012-01-23 19:39:08 +00:00
NAKAMURA Takumi	28ea8f523b	ARMAsmPrinter.cpp: Try to fix up r148686. EnableARMEHABI was also here. llvm-svn: 148694	2012-01-23 09:14:42 +00:00
Evgeniy Stepanov	482cdc4ebd	An option to selectively enable parts of ARM EHABI support. This change adds a new value to the --arm-enable-ehabi option that disables emitting unwinding descriptors. This mode gives a working backtrace() without the (currently broken) exception support. llvm-svn: 148686	2012-01-23 07:57:39 +00:00
Anton Korobeynikov	5482b9f535	Add fused multiply+add instructions from VFPv4. Patch by Ana Pazos! llvm-svn: 148658	2012-01-22 12:07:33 +00:00
Jim Grosbach	78dcaed8ca	Thumb2 'add rd, pc, imm' alternate form for 'adr' instruction. llvm-svn: 148601	2012-01-21 00:07:56 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Bob Wilson	6c7aaec077	ARM vector any_extends need to be selected to vmovl. <rdar://problem/10723651> We have patterns for vector sext and zext operations but were missing anyext. Without those patterns, codegen will fail when the selection DAG has any_extend nodes. llvm-svn: 148568	2012-01-20 20:59:56 +00:00
Jim Grosbach	90f5780fc1	VST2 four-register w/ update pseudos for fixed/register update. rdar://10724489 llvm-svn: 148560	2012-01-20 19:16:00 +00:00
Jim Grosbach	a9d36fbca7	NEON use vmov.i32 to splat some f32 values into vectors. For bit patterns that aren't representable using the 8-bit floating point representation for vmov.f32, but are representable via vmov.i32, treat the .f32 syntax as an alias. Most importantly, this covers the case 'vmov.f32 Vd, #0.0'. rdar://10616677 llvm-svn: 148556	2012-01-20 18:09:51 +00:00
Benjamin Kramer	116e99a469	Silence warnings about mixing enums. llvm-svn: 148495	2012-01-19 21:11:13 +00:00
Evgeniy Stepanov	4c7eb477b5	Emit ARM EHABI unwinding instructions for 3 more Thumb instructions. llvm-svn: 148473	2012-01-19 12:53:06 +00:00
Jim Grosbach	235c8d2d94	ARM assembly diagnostic caret in better position for FPImm. llvm-svn: 148459	2012-01-19 02:47:30 +00:00
Jim Grosbach	44e5c39c29	Thumb2 relaxation for tADR to t2ADR. llvm-svn: 148456	2012-01-19 02:09:38 +00:00
Jim Grosbach	b008df40d3	Add comment and fix range check in condition. llvm-svn: 148455	2012-01-19 01:50:30 +00:00
Evan Cheng	2879467d4e	- Slight change to finalizeBundle() interface. LastMI is not exclusive (pointing to instruction right after the last instruction in the bundle). - Add a finalizeBundle() variant that doesn't specify LastMI. Instead, the code will find the last instruction in the bundle by following the 'InsideBundle' marker. This is useful in case bundles are formed early (i.e. during MI scheduling) but finalized later (i.e. after register allocator has finished rewriting virtual registers with physical registers). llvm-svn: 148444	2012-01-19 00:46:06 +00:00
Evan Cheng	1eb2bb2295	Rename Finalizebundle to finalizeBundle to conform to coding guideline. llvm-svn: 148440	2012-01-19 00:06:10 +00:00
Jakob Stoklund Olesen	f1fb1d2375	Ignore register mask operands when lowering instructions to MC. This is similar to implicit register operands. MC doesn't understand register liveness and call clobbers. llvm-svn: 148437	2012-01-18 23:52:19 +00:00
Jim Grosbach	94298a906a	Thumb2 alternate syntax for LDR(literal) and friends. Explicit pc-relative syntax. For example, "ldrb r2, [pc, #-22]". rdar://10250964 llvm-svn: 148432	2012-01-18 22:46:46 +00:00
Jim Grosbach	cbd3f27354	Replace FIXME with explanatory comment. llvm-svn: 148427	2012-01-18 22:04:42 +00:00
Jim Grosbach	cb80eb2e75	Thumb2 relaxation for LDR(literal). If the fixup is out of range for the Thumb1 instruction, relax it to the Thumb2 encoding instead. rdar://10711829 llvm-svn: 148424	2012-01-18 21:54:16 +00:00
Jim Grosbach	9ab3d8be4e	Rename pattern for clarity. llvm-svn: 148422	2012-01-18 21:54:09 +00:00
Jim Grosbach	aba3de99c0	Tidy up. MCAsmBackend naming conventions. llvm-svn: 148400	2012-01-18 18:52:16 +00:00
Jim Grosbach	adcc938c46	Thumb2 load/store fixups don't set the thumb bit. Load/store instructions w/ a fixup to be relative a function marked as thumb don't use the low bit to specify thumb vs. non-thumb like interworking branches do, so don't set it when dealing with those fixups. rdar://10348687. llvm-svn: 148366	2012-01-18 00:40:25 +00:00
Jim Grosbach	3b50c9ec7f	Move some ARM specific MCAssembler bits into the ARMAsmBackend. llvm-svn: 148364	2012-01-18 00:23:57 +00:00
Jakob Stoklund Olesen	f43b599550	Add a CoveredBySubRegs property to Register descriptions. When set, this bit indicates that a register is completely defined by the value of its sub-registers. Use the CoveredBySubRegs property to infer which super-registers are call-preserved given a list of callee-saved registers. For example, the ARM registers D8-D15 are callee-saved. This now automatically implies that Q4-Q7 are call-preserved. Conversely, Win64 callees save XMM6-XMM15, but the corresponding YMM6-YMM15 registers are not call-preserved because they are not fully defined by their sub-registers. llvm-svn: 148363	2012-01-18 00:16:39 +00:00
Jakob Stoklund Olesen	fdbb12b235	Implement ARMBaseRegisterInfo::getCallPreservedMask(). Move ARM callee-saved lists into ARMCallingConv.td. llvm-svn: 148357	2012-01-17 23:09:00 +00:00
David Blaikie	486df738c3	Removing unused default switch cases in switches over enums that already account for all enumeration values explicitly. (This time I believe I've checked all the -Wreturn-type warnings from GCC & added the couple of llvm_unreachables necessary to silence them. If I've missed any, I'll happily fix them as soon as I know about them) llvm-svn: 148262	2012-01-16 23:24:27 +00:00
David Blaikie	5d8e42755c	Refactor variables unused under non-assert builds (& remove two entirely unused variables). llvm-svn: 148230	2012-01-16 05:17:39 +00:00
Benjamin Kramer	339ced4e34	Return an ArrayRef from ShuffleVectorSDNode::getMask and push it through CodeGen. llvm-svn: 148218	2012-01-15 13:16:05 +00:00
Evan Cheng	6bb95253eb	After r147827 and r147902, it's now possible for unallocatable registers to be live across BBs before register allocation. This miscompiled 197.parser when a cmp + b are optimized to a cbnz instruction even though the CPSR def is live-in a successor. cbnz r6, LBB89_12 ... LBB89_12: ble LBB89_1 The fix consists of two parts. 1) Teach LiveVariables that some unallocatable registers might be liveouts so don't mark their last use as kill if they are. 2) ARM constantpool island pass shouldn't form cbz / cbnz if the conditional branch does not kill CPSR. rdar://10676853 llvm-svn: 148168	2012-01-14 01:53:46 +00:00
Jakob Stoklund Olesen	35545421c8	Use RegisterTuples to generate pseudo-registers. The QQ and QQQQ registers are not 'real', they are pseudo-registers used to model some vld and vst instructions. This makes the call clobber lists longer, but I intend to get rid of those soon. llvm-svn: 148151	2012-01-13 22:55:42 +00:00
Eric Christopher	d284c1d80d	Fix assert. llvm-svn: 147966	2012-01-11 20:55:27 +00:00
Andrew Trick	642f0f6a40	ARM Ld/St Optimizer fix. Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits. Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12 llvm-svn: 147922	2012-01-11 03:56:08 +00:00
Jakob Stoklund Olesen	20f1dd5faf	Consider unknown alignment caused by OptimizeThumb2Instructions(). This function runs after all constant islands have been placed, and may shrink some instructions to their 2-byte forms. This can actually cause some constant pool entries to move out of range because of growing alignment padding. Treat instructions that may be shrunk the same as inline asm - they erode the known alignment bits. Also reinstate an old assertion in verify(). It is correct now that basic block offsets include alignments. Add a single large test case that will hopefully exercise many parts of the constant island pass. <rdar://problem/10670199> llvm-svn: 147885	2012-01-10 22:32:14 +00:00
Jim Grosbach	74ac7d50a1	ARM updating VST2 pseudo-lowering fixed vs. register update. rdar://10663487 llvm-svn: 147876	2012-01-10 21:11:12 +00:00
Richard Smith	ad5b42c02f	Move default case for covered enum outside of switch. llvm-svn: 147870	2012-01-10 19:43:09 +00:00
Richard Smith	3f1035410f	Fix a -Wreturn-type warning in g++. llvm-svn: 147867	2012-01-10 19:10:22 +00:00
David Blaikie	edbb58c577	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Jakob Stoklund Olesen	f09a316542	Accurately model hardware alignment rounding. On Thumb, the displacement computation hardware uses the address of the current instruction rounded down to a multiple of 4. Include this rounding in the UserOffset we compute for each instruction. When inline asm is present, the instruction alignment may not be known. Constrain the maximum displacement instead in that case. This makes it possible for CreateNewWater() and OffsetIsInRange() to agree about the valid displacements. When they disagree, infinite looping happens. As always, test cases for this stuff are insane. <rdar://problem/10660175> llvm-svn: 147825	2012-01-10 01:34:59 +00:00
Jakob Stoklund Olesen	1a80e3a26b	Catch runaway ARMConstantIslandPass even in -Asserts builds. The pass is prone to looping, and it is better to crash than loop forever, even in a -Asserts build. <rdar://problem/10660175> llvm-svn: 147806	2012-01-09 22:16:24 +00:00
Evan Cheng	4882e488f7	Don't forget to transfer implicit uses of return instruction. llvm-svn: 147752	2012-01-08 20:41:16 +00:00
Jakob Stoklund Olesen	083dbdca7f	Match SelectionDAG logic for enabling movt. Darwin doesn't do static, and ELF targets only support static. llvm-svn: 147740	2012-01-07 20:49:15 +00:00
Benjamin Kramer	6898db6269	Remove VectorExtras. This unused helper was written for a type of API that is discouraged now. llvm-svn: 147738	2012-01-07 19:42:13 +00:00
Jakob Stoklund Olesen	8cdce7e690	Use getRegForValue() to materialize the address of ARM globals. This enables basic local CSE, giving us 20% smaller code for consumer-typeset in -O0 builds. <rdar://problem/10658692> llvm-svn: 147720	2012-01-07 04:07:22 +00:00
Evan Cheng	501e3095e8	Copy implicit defs (e.g. r0) when changing tBX_RET to tPOP_RET. This bug is exposed with an upcoming change that would delete the copy to return register because there is no use! It's amazing anything works. llvm-svn: 147715	2012-01-07 02:55:54 +00:00
Jakob Stoklund Olesen	68f034ee1a	Use movw+movt in ARMFastISel::ARMMaterializeGV. This eliminates a lot of constant pool entries for -O0 builds of code with many global variable accesses. This speeds up -O0 codegen of consumer-typeset by 2x because the constant island pass no longer has to look at thousands of constant pool entries. <rdar://problem/10629774> llvm-svn: 147712	2012-01-07 01:47:05 +00:00
Jakob Stoklund Olesen	68a922c0e9	Enable aligned NEON spilling by default. Experiments show this to be a small speedup for modern ARM cores. llvm-svn: 147689	2012-01-06 22:19:37 +00:00
Jakob Stoklund Olesen	690511137c	Abort AdjustBBOffsetsAfter early when possible. llvm-svn: 147685	2012-01-06 21:40:15 +00:00
Jakob Stoklund Olesen	d110e2a83f	Reapply r146997, "Heed spill slot alignment on ARM." Now that canRealignStack() understands frozen reserved registers, it is safe to use it for aligned spill instructions. It will only return true if the registers reserved at the beginning of register allocation allow for dynamic stack realignment. <rdar://problem/10625436> llvm-svn: 147579	2012-01-05 00:26:57 +00:00
Jakob Stoklund Olesen	9cb477db25	Avoid reserving an ARM base pointer during register allocation. Once register allocation has started the reserved registers are frozen. Fix the ARM canRealignStack() hook to respect the frozen register state. Now the hook returns false if register allocation was started with frame pointer elimination enabled. It also returns false if register allocation started without a reserved base pointer, and stack realignment would require a base pointer. This bug was breaking oggenc on armv6. No test case, an upcoming patch will use this functionality to realign the stack for spill slots when possible. llvm-svn: 147578	2012-01-05 00:26:52 +00:00
Evan Cheng	801d98b3f0	Fix more places which should be checking for iOS, not darwin. llvm-svn: 147513	2012-01-04 01:55:04 +00:00
Jakob Stoklund Olesen	1b7f2a7638	Revert r146997, "Heed spill slot alignment on ARM." This patch caused a miscompilation of oggenc because a frame pointer was suddenly needed halfway through register allocation. <rdar://problem/10625436> llvm-svn: 147487	2012-01-03 22:34:35 +00:00
Matt Beaumont-Gay	b982d8eb65	Fix malformed assert. If anybody has strong feelings about 'default: assert(0 && "blah")' vs 'default: llvm_unreachable("blah")', feel free to regularize the instances of each in this file. llvm-svn: 147459	2012-01-03 19:03:59 +00:00
Jakob Stoklund Olesen	103318e9ea	Fix Comments. llvm-svn: 147238	2011-12-24 04:17:01 +00:00
Jakob Stoklund Olesen	0965585cb1	Experimental support for aligned NEON spills. ARM targets with NEON units have access to aligned vector loads and stores that are potentially faster than unaligned operations. Add support for spilling the callee-saved NEON registers to an aligned stack area using 16-byte aligned NEON loads and stores. This feature is off by default, controlled by an -align-neon-spills command line option. llvm-svn: 147211	2011-12-23 00:36:18 +00:00
Bob Wilson	1a74de9504	Add variants of the dispatchsetup pseudo for Thumb and !VFP. <rdar://10620138> My change r146949 added register clobbers to the eh_sjlj_dispatchsetup pseudo instruction, but on Thumb1 some of those registers cannot be used. This caused massive failures on the testsuite when compiling for Thumb1. While fixing that, I noticed that the eh_sjlj_setjmp instruction has a "nofp" variant, and I realized that dispatchsetup needs the same thing, so I have added that as well. llvm-svn: 147204	2011-12-22 23:39:48 +00:00
Jim Grosbach	ea2319112f	ARM VFP assembly parsing and encoding for VCVT(float <--> fixed point). rdar://10558523 llvm-svn: 147189	2011-12-22 22:19:05 +00:00
Bob Wilson	268d2599e0	Add missing usesCustomInserter flag on Int_eh_sjlj_setjmp_nofp. Noticed by inspection; I don't have a testcase for this. llvm-svn: 147188	2011-12-22 22:12:44 +00:00
Jim Grosbach	c4d8d2f155	Tidy up. Use predicate function a bit more liberally. llvm-svn: 147184	2011-12-22 22:02:35 +00:00
Rafael Espindola	6ca42c5be3	Fix incorrect relocation generation. Patch by Kristof Beyls. Fixes PR11214. llvm-svn: 147180	2011-12-22 21:36:43 +00:00
Jim Grosbach	f0d25117c6	ARM VFP add encoding of the bitcount to fixed-point<-->floating point. insns. The value from the operands isn't right yet, but we weren't encoding it at all previously. The parser needs to twiddle the values when building the instruction. Partial for: rdar://10558523 llvm-svn: 147170	2011-12-22 19:55:21 +00:00
Jim Grosbach	b65dd04923	Remove some bogus comments. llvm-svn: 147169	2011-12-22 19:45:01 +00:00
Jim Grosbach	489ed5929e	ARM pre-UAL aliases. fcmp[sd]. llvm-svn: 147158	2011-12-22 19:20:45 +00:00
Jim Grosbach	12ccf45bbb	ARM assembler should accept shift-by-zero for any shifted-immediate operand. Just treat it as-if the shift wasn't there at all. 'as' compatibility. rdar://10604767 llvm-svn: 147153	2011-12-22 18:04:04 +00:00
Jim Grosbach	21488b8839	ARM assembly parser canonicalize on 'lsl' for shift-by-zero form. llvm-svn: 147152	2011-12-22 17:37:00 +00:00
Jim Grosbach	3794d82af5	Tidy up. Trailing whitespace. llvm-svn: 147151	2011-12-22 17:17:10 +00:00
Jim Grosbach	62bffd8827	Nuke invalid comment from copy/paste. llvm-svn: 147150	2011-12-22 17:04:50 +00:00
Rafael Espindola	84d00f11cd	Make the virtual methods in ARMELFObjectWriter public. llvm-svn: 147132	2011-12-22 02:58:12 +00:00
Rafael Espindola	2da9777cef	Hopefully fix the cmake build. llvm-svn: 147121	2011-12-22 01:11:01 +00:00
Rafael Espindola	4449b21294	Fix name in comments. llvm-svn: 147119	2011-12-22 01:06:53 +00:00
Richard Smith	32a756b7ce	Unbreak cmake build after r147115. llvm-svn: 147117	2011-12-22 01:03:35 +00:00
Rafael Espindola	a0124055b1	Move the ARM specific parts of the ELF writer to Target/ARM. llvm-svn: 147115	2011-12-22 00:37:50 +00:00
Jim Grosbach	2b80dad572	ARM NEON mnemonic alias for vrecpeq. llvm-svn: 147109	2011-12-21 23:52:37 +00:00
Jim Grosbach	7869d8c01e	ARM VFP optional data type on VMOV GPR<-->SPR. llvm-svn: 147104	2011-12-21 23:24:15 +00:00
Jim Grosbach	260b4b336a	ARM NEON optional data type on VSWP instructions. llvm-svn: 147103	2011-12-21 23:09:28 +00:00
Jim Grosbach	a50e24fcb3	ARM NEON mnemonic aliases for vzipq and vswpq. llvm-svn: 147102	2011-12-21 23:04:33 +00:00
Jim Grosbach	1152cc0cad	ARM asm parser should be more lenient w/ .thumb_func directive. Rather than require the symbol to be explicitly an argument of the directive, allow it to look ahead and grab the symbol from the next non-whitespace line. rdar://10611140 llvm-svn: 147100	2011-12-21 22:30:16 +00:00
Jim Grosbach	8c59bbc1ed	Thumb2 assembly parsing of 'mov rd, rn, rrx'. Maps to the RRX instruction. Missed this case earlier. rdar://10615373 llvm-svn: 147096	2011-12-21 21:04:19 +00:00
Jim Grosbach	b3ef713e44	Thumb2 assembly parsing of 'mov(register shifted register)' aliases. These map to the ASR, LSR, LSL, ROR instruction definitions. rdar://10615373 llvm-svn: 147094	2011-12-21 20:54:00 +00:00
Jakob Stoklund Olesen	3588a43e3a	Move common code into an MRI function. llvm-svn: 147071	2011-12-21 19:50:05 +00:00
Jim Grosbach	c80a264386	ARM NEON assembly parsing for VLD2 to all lanes instructions. llvm-svn: 147069	2011-12-21 19:40:55 +00:00
Chad Rosier	7248bda595	Fix a couple of copy-n-paste bugs. Noticed by George Russell! llvm-svn: 147064	2011-12-21 18:56:22 +00:00
Rafael Espindola	1ad4095d6b	Reduce the exposure of Triple::OSType in the ELF object writer. This will avoid including ADT/Triple.h in many places when the target specific bits are moved. llvm-svn: 147059	2011-12-21 17:00:36 +00:00
Evan Cheng	dc8a1aaea6	Fix a couple of copy-n-paste bugs. Noticed by George Russell. llvm-svn: 147032	2011-12-21 03:04:10 +00:00
Jim Grosbach	7de7ab83fa	ARM assembly parsing allows constant expressions for lane indices. llvm-svn: 147028	2011-12-21 01:19:23 +00:00
Jim Grosbach	c5af54ec89	ARM NEON VLD2 assembly parsing for structure to all lanes, non-writeback. llvm-svn: 147025	2011-12-21 00:38:54 +00:00
Jim Grosbach	cd22e4a81e	ARM .req register name aliases are case insensitive, just like regnames. llvm-svn: 147009	2011-12-20 23:11:00 +00:00
Jim Grosbach	4eda145c7f	Move comment to appropriate place. llvm-svn: 147000	2011-12-20 22:26:38 +00:00
Jakob Stoklund Olesen	b95c102c2f	Heed spill slot alignment on ARM. Use the spill slot alignment as well as the local variable alignment to determine when the stack needs to be realigned. This works now that the ARM target can always realign the stack by using a base pointer. Still respect the ARMBaseRegisterInfo::canRealignStack() function vetoing a realigned stack. Don't use aligned spill code in that case. llvm-svn: 146997	2011-12-20 22:15:04 +00:00
Jim Grosbach	2c59052984	ARM assembly parsing and encoding for VST2 single-element, double spaced. llvm-svn: 146990	2011-12-20 20:46:29 +00:00
Jim Grosbach	75e2ab5db2	ARM assembly parsing and encoding for VLD2 single-element, double spaced. llvm-svn: 146983	2011-12-20 19:21:26 +00:00
Evan Cheng	68132d8093	ARM target code clean up. Check for iOS, not Darwin where it makes sense. llvm-svn: 146981	2011-12-20 18:26:50 +00:00
Jason W Kim	135d244b56	First steps in ARM AsmParser support for .eabi_attribute and .arch (Both used for Linux gnueabi) No behavioral change yet (no tests needed so far) llvm-svn: 146977	2011-12-20 17:38:12 +00:00
Chandler Carruth	e805b16e3d	Fix up the CMake build for the new files added in r146960, they're likely to stay either way that discussion ends up resolving itself. llvm-svn: 146966	2011-12-20 08:42:11 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Bob Wilson	75f12cc3fe	Mark ARM eh_sjlj_dispatchsetup as clobbering all registers. Radar 10567930. We used to rely on the *eh_sjlj_setjmp instructions to mark that a function with setjmp/longjmp exception handling clobbers all the registers. But with the recent reorganization of ARM EH, those eh_sjlj_setjmp instructions are expanded away earlier, before PEI can see them to determine what registers to save and restore. Mark the dispatchsetup instruction in the same way, since that instruction cannot be expanded early. This also more accurately reflects when the registers are clobbered. llvm-svn: 146949	2011-12-20 01:29:27 +00:00
Jim Grosbach	e2ca9e5b5f	ARM assembly shifts by zero should be plain 'mov' instructions. "mov r1, r2, lsl #0" should assemble as "mov r1, r2" even though it's not strictly legal UAL syntax. It's a common extension and the friendly thing to do. rdar://10604663 llvm-svn: 146937	2011-12-20 00:59:38 +00:00
Jim Grosbach	045b6c71a6	ARM NEON assembly aliases for VMOV<-->VMVN for i32 immediates. e.g., "vmov.i32 d4, #-118" can be assembled as "vmvn.i32 d4, #117" rdar://10603913 llvm-svn: 146925	2011-12-19 23:51:07 +00:00
Jim Grosbach	8648c10184	ARM assembly parsing and encoding support for LDRD(label). rdar://9932658 llvm-svn: 146921	2011-12-19 23:06:24 +00:00
Jim Grosbach	64f4de29e0	ARM NEON two-operand aliases for VPADD. rdar://10602276 llvm-svn: 146895	2011-12-19 19:51:03 +00:00
Jim Grosbach	e16acacc3a	ARM VFP pre-UAL mnemonic aliases for fmul[sd]. llvm-svn: 146892	2011-12-19 19:43:50 +00:00
Jim Grosbach	92a939ae73	ARM VFP pre-UAL mnemonic aliases for fcpy[sd] and fdiv[sd]. llvm-svn: 146887	2011-12-19 19:02:41 +00:00
Jim Grosbach	9ae4fc035b	ARM NEON implied destination aliases for VMAX/VMIN. llvm-svn: 146885	2011-12-19 18:57:38 +00:00
Jim Grosbach	cef98cddbe	ARM NEON relax parse time diagnostics for alignment specifiers. There's more variation that we need to handle. Error checking will need to be on operand predicates. llvm-svn: 146884	2011-12-19 18:31:43 +00:00
Jim Grosbach	a7d2421603	Tidy up. llvm-svn: 146882	2011-12-19 18:11:17 +00:00
Jakob Stoklund Olesen	24159e346d	Remove a register class that can just as well be synthesized. Add the new TableGen register class synthesizer feature to the release notes. llvm-svn: 146875	2011-12-19 16:53:40 +00:00
Jakob Stoklund Olesen	c7b437ae34	Emit a getMatchingSuperRegClass() implementation for every target. Use information computed while inferring new register classes to emit accurate, table-driven implementations of getMatchingSuperRegClass(). Delete the old manual, error-prone implementations in the targets. llvm-svn: 146873	2011-12-19 16:53:34 +00:00
Evan Cheng	903231bc58	Fix a CPSR liveness tracking bug introduced when I converted IT block to bundle. llvm-svn: 146805	2011-12-17 01:25:34 +00:00
Jakob Stoklund Olesen	465cdf3ba4	Preserve more memory operands in ARMExpandPseudo. I don't think this affects anything but verbose assembly. llvm-svn: 146787	2011-12-17 00:07:02 +00:00
Jakob Stoklund Olesen	9790187b6c	Fix off-by-one error in bucket sort. The bad sorting caused a misaligned basic block when building 176.vpr in ARM mode. <rdar://problem/10594653> llvm-svn: 146767	2011-12-16 23:00:05 +00:00
Jakob Stoklund Olesen	5af144809e	Don't adjust for alignment padding in OffsetIsInRange. This adjustment is already included in the block offsets computed by BasicBlockInfo, and adjusting again here can cause the pass to loop. When CreateNewWater splits a basic block, OffsetIsInRange would reject the new CPE on the next pass because of the too conservative alignment adjustment. This caused the block to be split again, and so on. llvm-svn: 146751	2011-12-16 19:10:00 +00:00
Jakob Stoklund Olesen	2a05f691ab	Note ARM constant island alignment in the release notes. The command line option should be removed, but not until the feature has gotten a lot of testing. The ARMConstantIslandPass tends to have subtle bugs that only show up after a while. llvm-svn: 146739	2011-12-16 16:07:41 +00:00
Jim Grosbach	4a29971f02	ARM NEON aliases for vmovq.f* llvm-svn: 146714	2011-12-16 00:12:22 +00:00
Jim Grosbach	66886253a7	Thumb2 ADR assembly parsing w/o the .w suffix. llvm-svn: 146710	2011-12-15 23:52:17 +00:00
Eli Friedman	c9bf1b1bff	Make check a bit more strict so we don't call ARM_AM::getFP32Imm with a value that isn't a 32-bit value. (This is just to be safe; I don't think this actually causes any issues in practice.) llvm-svn: 146700	2011-12-15 22:56:53 +00:00
Jim Grosbach	a47294e24d	ARM NEON VCLE is an alias for VCGE w/ the source operands reversed. llvm-svn: 146699	2011-12-15 22:56:33 +00:00
Jim Grosbach	4a5c887370	ARM NEON VTBL/VTBX assembly parsing and encoding. llvm-svn: 146691	2011-12-15 22:27:11 +00:00
Jakob Stoklund Olesen	cba8e8c3e0	Enable proper constant island alignment by default. The code size increase is tiny (< 0.05%) because so little code uses 16-byte constant pool entries. llvm-svn: 146690	2011-12-15 22:14:45 +00:00
Jim Grosbach	c2f16a3499	Silence warning. llvm-svn: 146686	2011-12-15 21:54:55 +00:00
Jim Grosbach	2f50e92f40	ARM NEON two-register double spaced register list parsing support. llvm-svn: 146685	2011-12-15 21:44:33 +00:00
Jakob Stoklund Olesen	9efd7ebf0a	Consider CPE alignment in CreateNewWater(). An aligned constant pool entry may require extra alignment padding where the new water is created. Take that into account when computing offset. Also consider the alignment of other constant pool entries when splitting a basic block. Alignment padding may make it necessary to move the split point higher. llvm-svn: 146609	2011-12-14 23:48:54 +00:00
Jim Grosbach	da51104282	ARM NEON better assembly operand range checking for lane indices of VLD/VST. llvm-svn: 146608	2011-12-14 23:35:06 +00:00
Jim Grosbach	a8aa30b620	ARM NEON VLD2/VST2 lane indexed assembly parsing and encoding. llvm-svn: 146605	2011-12-14 23:25:46 +00:00
Jim Grosbach	bb18fb4f52	ARM NEON fix alignment encoding for VST2 w/ writeback. Add tests for w/ writeback instruction parsing and encoding. llvm-svn: 146594	2011-12-14 21:49:24 +00:00
Jim Grosbach	8e987f5e25	Nuke old code. Missed in last commit. llvm-svn: 146590	2011-12-14 21:41:32 +00:00
Jim Grosbach	88ac761aa4	ARM NEON refactor VST2 w/ writeback instructions. In addition to improving the representation, this adds support for assembly parsing of these instructions. llvm-svn: 146588	2011-12-14 21:32:11 +00:00
Jim Grosbach	b7ec06c5c9	ARM NEON improve factoring a bit. No functional change. llvm-svn: 146585	2011-12-14 20:59:15 +00:00
Evan Cheng	da103bf9ec	Model ARM predicated write as read-mod-write. e.g. r0 = mov #0 r0 = moveq #1 Then the second instruction has an implicit data dependency on the first instruction. Sadly I have yet to come up with a small test case that demonstrates the post-ra scheduler taking advantage of this. llvm-svn: 146583	2011-12-14 20:00:08 +00:00
Jim Grosbach	8d24618975	ARM NEON VST2 assembly parsing and encoding. Work in progress. Parsing for non-writeback, single spaced register lists works now. The rest have the representations better factored, but still need more to be able to parse properly. llvm-svn: 146579	2011-12-14 19:35:22 +00:00
Jakob Stoklund Olesen	e5585e8fed	Fix speling and 80-col. llvm-svn: 146575	2011-12-14 18:49:13 +00:00
Jim Grosbach	4288b9786f	Fix copy/pasto that skipped the 'modify' step. llvm-svn: 146571	2011-12-14 18:12:37 +00:00
Jim Grosbach	1bb6e066f6	ARM/Thumb2 mov vs. mvn alias goes both ways. llvm-svn: 146570	2011-12-14 17:56:51 +00:00
Chad Rosier	ded6160473	VFP2 is required for FP loads. Noticed by inspection. llvm-svn: 146569	2011-12-14 17:55:03 +00:00
Chad Rosier	fce28914ea	Tidy up. llvm-svn: 146568	2011-12-14 17:32:02 +00:00
Jim Grosbach	a342667fd0	ARM/Thumb2 'cmp rn, #imm' alias to cmn. When 'cmp rn #imm' doesn't match due to the immediate not being representable, but 'cmn rn, #-imm' does match, use the latter in place of the former, as it's equivalent. rdar://10552389 llvm-svn: 146567	2011-12-14 17:30:24 +00:00
Chad Rosier	a26979be29	Fix 80-column violation and extraneous brackets. llvm-svn: 146566	2011-12-14 17:26:05 +00:00
Jim Grosbach	ab5830e51b	ARM assembler support for the target-specific .req directive. rdar://10549683 llvm-svn: 146543	2011-12-14 02:16:11 +00:00
Evan Cheng	7fae11b231	- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function to finalize MI bundles (i.e. add BUNDLE instruction and computing register def and use lists of the BUNDLE instruction) and a pass to unpack bundles. - Teach more of MachineBasic and MachineInstr methods to be bundle aware. - Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to prevent IT blocks from being broken apart. llvm-svn: 146542	2011-12-14 02:11:42 +00:00
Jim Grosbach	485e5622f4	Thumb2 assembler aliases for "mov(shifted register)" rdar://10549767 llvm-svn: 146520	2011-12-13 22:45:11 +00:00
Jim Grosbach	18bf363078	ARM LDM/STM system instruction variants. rdar://10550269 llvm-svn: 146519	2011-12-13 21:48:29 +00:00
Jim Grosbach	6eb142a616	Thumb2 pre/post indexed stores can be from any non-PC GPR. rdar://10549786 llvm-svn: 146518	2011-12-13 21:10:25 +00:00
Jim Grosbach	5ac89675a0	Thumb2 tweak for ccout handling in RSB parsing. llvm-svn: 146516	2011-12-13 21:06:41 +00:00
Jim Grosbach	1f1a3598c2	ARM thumb2 parsing of "rsb rd, rn, #0". rdar://10549741 llvm-svn: 146515	2011-12-13 20:50:38 +00:00
Jim Grosbach	4b0844e191	ARM NEON two-operand aliases for VQDMULH. llvm-svn: 146514	2011-12-13 20:40:37 +00:00
Jim Grosbach	561e4e18cf	ARM pre-UAL NEG mnemonic for convenience when porting old code. llvm-svn: 146511	2011-12-13 20:23:22 +00:00
Jim Grosbach	2a2348e6c2	ARM add some more pre-UAL VFP mnemonics for convenience when porting old code. llvm-svn: 146508	2011-12-13 20:13:48 +00:00
Jim Grosbach	9227f39c53	ARM add more 'gas' compatibility aliases for NEON instructions. llvm-svn: 146507	2011-12-13 20:08:32 +00:00
Chad Rosier	563de603f7	[fast-isel] Unaligned loads of floats are not supported. Therefore, convert to a regular load and then move the result from a GPR to a FPR. llvm-svn: 146502	2011-12-13 19:22:14 +00:00
Chandler Carruth	637cc6a8aa	Initial CodeGen support for CTTZ/CTLZ where a zero input produces an undefined result. This adds new ISD nodes for the new semantics, selecting them when the LLVM intrinsic indicates that the undef behavior is desired. The new nodes expand trivially to the old nodes, so targets don't actually need to do anything to support these new nodes besides indicating that they should be expanded. I've done this for all the operand types that I could figure out for all the targets. Owners of various targets, please review and let me know if any of these are incorrect. Note that the expand behavior is conservatively correct, and exactly matches LLVM's current behavior with these operations. Ideally this patch will not change behavior in any way. For example the regtest suite finds the exact same instruction sequences coming out of the code generator. That's why there are no new tests here -- all of this is being exercised by the existing test suite. Thanks to Duncan Sands for reviewing the various bits of this patch and helping me get the wrinkles ironed out with expanding for each target. Also thanks to Chris for clarifying through all the discussions that this is indeed the approach he was looking for. That said, there are likely still rough spots. Further review much appreciated. llvm-svn: 146466	2011-12-13 01:56:10 +00:00
Jakob Stoklund Olesen	bfa576fe8e	Account for CPE alignment when searching for new water. Constant pool entries with different alignment may cause more alignment padding to be inserted. Compute the amount of padding needed, and try to pick the location that requires the least amount of padding. Also take the extra padding into account when the water is above the use. llvm-svn: 146458	2011-12-13 00:44:30 +00:00
Daniel Dunbar	8889bb08b8	LLVMBuild: Introduce a common section which currently has a list of the subdirectories to traverse into. - Originally I wanted to avoid this and just autoscan, but this has one key flaw in that new subdirectories can not automatically trigger a rerun of the llvm-build tool. This is particularly a pain when switching back and forth between trees where one has added a subdirectory, as the dependencies will tend to be wrong. This will also eliminate FIXME implicitly. llvm-svn: 146436	2011-12-12 22:45:54 +00:00
Bob Wilson	fadc2c83e5	Implement 'e' and 'f' modifiers for Neon inline asm. <rdar://problem/10551006> These modifiers simply select either the low or high D subregister of a Neon Q register. I've also removed the unimplemented 'p' modifier, which turns out to be a bit different than the comment here suggests and as far as I can tell was only intended for internal use in Apple's version of gcc. llvm-svn: 146417	2011-12-12 21:45:15 +00:00
Daniel Dunbar	27a7489a03	LLVMBuild: Remove trailing newline, which irked me. llvm-svn: 146409	2011-12-12 19:48:00 +00:00
Jakob Stoklund Olesen	91a7bcbb9b	Add a postOffset() alignment argument. This computes the offset of the layout successor block, considering its alignment as well. llvm-svn: 146401	2011-12-12 19:25:54 +00:00
Jakob Stoklund Olesen	0863de458d	Fix typo. llvm-svn: 146400	2011-12-12 19:25:51 +00:00
Jakob Stoklund Olesen	17c27a8898	Also set the proper alignment on inner islands and the function itself. Downgrade the alignment of the initial constant island when constant pool entries are moved elsewhere. This is all gated by -arm-align-constant-islands. llvm-svn: 146391	2011-12-12 18:45:45 +00:00
Jakob Stoklund Olesen	2a75997858	Make MF a class member instead of passing it around everywhere. Also add an MCP member pointing to the machine constant pool. No functional change intended. llvm-svn: 146382	2011-12-12 18:16:53 +00:00
Jakob Stoklund Olesen	b5f52aad22	Add a -arm-align-constant-islands flag, default off. Order constant pool entries by descending alignment in the initial island to ensure packing and correct alignment. When the command line flag is set, also align the basic block containing the constant pool entries. This is only a partial implementation of constant island alignment. More to come. llvm-svn: 146375	2011-12-12 16:49:37 +00:00
Stepan Dyatkovskiy	4683740967	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Third attempt: simplified checks in test for armv7-apple-darwin11. llvm-svn: 146341	2011-12-11 14:35:48 +00:00
Chad Rosier	6641294e3b	Revert r146322 to appease buildbots. Original commit message: Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146328	2011-12-10 19:55:03 +00:00
Stepan Dyatkovskiy	df0b779e9f	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146322	2011-12-10 08:42:24 +00:00
Jakob Stoklund Olesen	146ac7b609	Try to align the point where a large basic block is split. The split point is picked such that the newly created water has the same alignment as the function. This makes the island suitable for constant pool entries with potentially higher alignment. This also fixes an issue where the basic block was split one instruction too late, causing nonconvergence of the algorithm. <rdar://problem/10550705> There is still an issue with correctly packing differently aligned entries in the island. llvm-svn: 146314	2011-12-10 02:55:10 +00:00
Jakob Stoklund Olesen	b3734522fa	More debug output formatting. llvm-svn: 146313	2011-12-10 02:55:06 +00:00
Jim Grosbach	54337b8617	ARM add some more pre-UAL VFP mnemonics for convenience when porting old code. llvm-svn: 146300	2011-12-10 00:01:02 +00:00
Eli Friedman	4e36a934dc	Splats can contain undef's; make sure to handle them correctly. PR11526. llvm-svn: 146299	2011-12-09 23:54:42 +00:00
Jim Grosbach	8be2f6577e	ARM add some pre-UAL VFP mnemonics for convenience when porting old code. llvm-svn: 146296	2011-12-09 23:34:09 +00:00
Jim Grosbach	ef70e9b704	ARM allows '$imm' syntax, not just '#imm' for assembly. Backwards compatibility with 'gas'. #imm is the preferred and documented syntax, but lots of existing code uses the '$' prefix, so we should support it if we can. llvm-svn: 146285	2011-12-09 22:25:03 +00:00
Jim Grosbach	6192b6570d	ARM assembly aliases for BIC<-->AND (immediate). When the immediate operand of an AND or BIC instruction isn't representable in the immediate field of the instruction, but the bitwise negation of the immediate is, assemble the instruction as the inverse operation instead with the inverted immediate as the operand. rdar://10550057 llvm-svn: 146283	2011-12-09 22:02:17 +00:00
Jim Grosbach	ea1b353e67	ARM NEON data type aliases for VBIC(register). llvm-svn: 146281	2011-12-09 21:46:04 +00:00
Jim Grosbach	d146a02c79	ARM assembly parsing and encoding for VLD2 with writeback. Refactor the instructions into fixed writeback and register-stride writeback variants to simplify the offset operand (no more optional register operand using reg0). This is a simpler representation and allows the assembly parser to more easily handle these instructions. Add tests for the instruction variants now supported. llvm-svn: 146278	2011-12-09 21:28:25 +00:00
Jakob Stoklund Olesen	f85723626c	Use a helper overload for a common pattern. llvm-svn: 146270	2011-12-09 19:44:39 +00:00
Jim Grosbach	8a4009dab2	Tidy up. Better base class factoring. llvm-svn: 146267	2011-12-09 19:07:20 +00:00
Jim Grosbach	b076e6f3d5	Tidy up. Better base class factoring. llvm-svn: 146266	2011-12-09 18:54:11 +00:00
Jakob Stoklund Olesen	5f5fa12413	Tweak debugging output. llvm-svn: 146264	2011-12-09 18:20:35 +00:00
Jim Grosbach	8cc83fa1b7	ARM convenience aliases for VSQRT. llvm-svn: 146201	2011-12-08 22:51:25 +00:00
Jim Grosbach	db731be7b8	ARM 64-bit VEXT assembly uses a .64 suffix, not .32, amazingly enough. llvm-svn: 146194	2011-12-08 22:19:04 +00:00
Jim Grosbach	ba7d6ed05d	ARM VSHR implied destination operand form aliases. llvm-svn: 146192	2011-12-08 22:06:06 +00:00
Jim Grosbach	98bc797b4d	ARM asm parser, just issue a warning for a duplicate reg in a list. For better 'gas' compatibility. llvm-svn: 146185	2011-12-08 21:34:20 +00:00
Jim Grosbach	ab9c8bb45b	ARM VSUB implied destination operand form aliases. llvm-svn: 146182	2011-12-08 20:56:26 +00:00
Jim Grosbach	66c9ad7642	ARM VQADD implied destination operand form aliases. llvm-svn: 146179	2011-12-08 20:49:43 +00:00
Jim Grosbach	e9ee1092e1	ARM a few more VMUL implied destination operand form aliases. llvm-svn: 146177	2011-12-08 20:42:35 +00:00
Jim Grosbach	4edc7360c7	ARM assembler support for register name aliases. rdar://10550084 llvm-svn: 146170	2011-12-08 19:27:38 +00:00
Daniel Dunbar	c09e4593b2	Revert r146143, "Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2).", it is failing tests. llvm-svn: 146157	2011-12-08 17:32:18 +00:00
Stepan Dyatkovskiy	a4bcf27dae	Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). llvm-svn: 146143	2011-12-08 07:55:03 +00:00
Jim Grosbach	00326406d4	ARM NEON two-operand aliases for VSHL(immediate). llvm-svn: 146125	2011-12-08 01:30:04 +00:00
Jakob Stoklund Olesen	14e024dff7	Drop the HasInlineAsm flag. It is not used any more. We are tracking inline assembly misalignments directly through the BBInfo.Unalign and KnownBits fields. A simple conservative size estimate is not good enough since it can cause alignment padding to be underestimated. llvm-svn: 146124	2011-12-08 01:22:39 +00:00
Jim Grosbach	f10a635eb4	ARM NEON two-operand aliases for VSHL(register). llvm-svn: 146123	2011-12-08 01:12:35 +00:00
Jakob Stoklund Olesen	bd97f5d753	Simplify offset verification. llvm-svn: 146121	2011-12-08 01:10:05 +00:00

6520 Commits