llvm-project

Commit Graph

Author	SHA1	Message	Date
Jim Grosbach	05eccf0e44	One Printer to rule them all, One Printer to find them, One Printer to lower them all and in the back end bind them. (Remove option to use the old non-MC asm printer.) llvm-svn: 115038	2010-09-29 15:23:40 +00:00
Gabor Greif	d36e3e8850	improve heuristics to find the 'and' corresponding to 'tst' to also catch opportunities on thumb2 added some doxygen on the way llvm-svn: 115033	2010-09-29 10:12:08 +00:00
Chris Lattner	a63292a3ca	implement rdar://8456378 and PR7557 - support for the fstsw, an instruction that requires a WHOLE NEW wonderful kind of alias. llvm-svn: 115015	2010-09-29 01:50:45 +00:00
Chris Lattner	b44fd24fc1	change the protocol TargetAsmPArser::MatchInstruction method to take an MCStreamer to emit into instead of an MCInst to fill in. This allows the matcher extra flexibility and is more convenient. llvm-svn: 115014	2010-09-29 01:42:58 +00:00
Eric Christopher	3a7e8cd6bd	Rework comparison handling to set a register on true/false. This avoids problems with phi-nodes in blocks that have hard and not virtual registers. Accordingly update branch handling to compensate. llvm-svn: 115013	2010-09-29 01:14:47 +00:00
Eric Christopher	edd4b600f3	Remove unnecessary set ahead of time. llvm-svn: 115011	2010-09-29 00:50:57 +00:00
Evan Cheng	2259d67a33	Separate itinerary classes for mvn from mov; for tst / teq from cmp / cmn. llvm-svn: 115010	2010-09-29 00:49:25 +00:00
Eric Christopher	2c8e7f421c	Remove assert, add comment. llvm-svn: 115009	2010-09-29 00:49:09 +00:00
Evan Cheng	c35d7bbe43	Assign bitwise binary instructions different itinerary classes from ALU instructions such as add / sub. llvm-svn: 115008	2010-09-29 00:27:46 +00:00
Evan Cheng	0097dd0d5a	Add support to model pipeline bypass / forwarding. llvm-svn: 115005	2010-09-28 23:50:49 +00:00
Eric Christopher	a86a6d2fed	32-bit constant ints only for now. llvm-svn: 115001	2010-09-28 22:47:54 +00:00
Oscar Fuentes	b4b12535e8	Removed a bunch of unnecessary target_link_libraries. llvm-svn: 114999	2010-09-28 22:39:14 +00:00
Owen Anderson	a3181e2d79	Add a subtarget hook for reporting the misprediction penalty. Use this to provide more precise cost modeling for if-conversion. Now if only we had a way to estimate the misprediction probability. Adjsut CodeGen/ARM/ifcvt10.ll. The pipeline on Cortex-A8 is long enough that it is still profitable to predicate an ldm, but the shorter pipeline on Cortex-A9 makes it unprofitable. llvm-svn: 114995	2010-09-28 21:57:50 +00:00
Eric Christopher	953b1afd5f	Integer materialization needed the same thinko change. llvm-svn: 114994	2010-09-28 21:55:34 +00:00
Nick Lewycky	7d483d352b	Resolve this GCC warning: ARMTargetMachine.cpp:53: error: control reaches end of non-void function llvm-svn: 114992	2010-09-28 21:40:26 +00:00
Anton Korobeynikov	81bdc93bbb	User proper libcall names & condcodes while compiling for ARM EABI. Patch by Evzen Muller! llvm-svn: 114991	2010-09-28 21:39:26 +00:00
Owen Anderson	88af7d00fc	Part one of switching to using a more sane heuristic for determining if-conversion profitability. Rather than having arbitrary cutoffs, actually try to cost model the conversion. For now, the constants are tuned to more or less match our existing behavior, but these will be changed to reflect realistic values as this work proceeds. llvm-svn: 114973	2010-09-28 18:32:13 +00:00
Jim Grosbach	45c83d496f	Factor out dbg_value comment printing and teach MC asm printing to use it. This should make the arm-linux self-host buildbot happy again. llvm-svn: 114964	2010-09-28 17:05:56 +00:00
Oscar Fuentes	3da4255d07	Add ARM Disassembler to the CMake build. llvm-svn: 114949	2010-09-28 11:48:19 +00:00
Eric Christopher	bf86fd3c47	80-col fixups. llvm-svn: 114943	2010-09-28 04:18:29 +00:00
Bob Wilson	3dc97324c1	Add a command line option "-arm-strict-align" to disallow unaligned memory accesses for ARM targets that would otherwise allow it. Radar 8465431. llvm-svn: 114941	2010-09-28 04:09:35 +00:00
Eric Christopher	7990df1ae2	Rework builtin handling and call setup. The builtin handling now takes a libcall operand, sets up the arguments correctly and handles stack adjustments. llvm-svn: 114934	2010-09-28 01:21:42 +00:00
Eric Christopher	e68635acdb	Fix typo. llvm-svn: 114931	2010-09-28 00:35:33 +00:00
Eric Christopher	6f98bfd870	Fix fp constant loads to have a destination register. llvm-svn: 114930	2010-09-28 00:35:09 +00:00
Jim Grosbach	175d6411c8	Enable the MC-ized ARM asm printer. Passing all local tests, so it's time to enable it for real. Leaving the CL option in place to it's easy to disable it again if (when) testers find something I've missed. llvm-svn: 114915	2010-09-27 22:28:11 +00:00
Jim Grosbach	9e9ed98305	ARM-mode eh.sjlj.longjmp MC lowering llvm-svn: 114896	2010-09-27 21:47:04 +00:00
Jim Grosbach	11fed543c9	Enable the MC-ized ARM asm printer. Passing all local tests, so it's time to enable it for real. Leaving the CL option in place to it's easy to disable it again if (when) testers find something I've missed. llvm-svn: 114892	2010-09-27 21:28:44 +00:00
Daniel Dunbar	6b2aaf1a36	Hard to imagine there are still people using inferior compilers. llvm-svn: 114862	2010-09-27 20:12:58 +00:00
Rafael Espindola	69aa15155f	Odd additional stub framework for the ARM MC ELF emission. llc now recognizes the "intent" to support MC/obj emission for ARM, but given that they are all stubs, it asserts on --filetype=obj --march=arm Patch by Jason Kim. llvm-svn: 114856	2010-09-27 18:31:37 +00:00
Eric Christopher	0720611e3a	Insert missing coherency in comment. Add a quick check for hardware divide support also. llvm-svn: 114813	2010-09-27 06:08:12 +00:00
Eric Christopher	29ab6d1f82	Mass rename for Jim. llvm-svn: 114812	2010-09-27 06:02:23 +00:00
Evan Cheng	48cc21620f	Fix IIC_iEXTAr itinerary class of Cortex-A9. llvm-svn: 114784	2010-09-25 01:09:28 +00:00
Evan Cheng	8f9a2244fc	Remove a unused instruction itinerary class. llvm-svn: 114782	2010-09-25 01:06:02 +00:00
Evan Cheng	62d626ce86	Fix zero and sign extension instructions scheduling itineraries. llvm-svn: 114780	2010-09-25 00:49:35 +00:00
Evan Cheng	e37da03e60	More pseudo instruction scheduling itinerary fixes. llvm-svn: 114768	2010-09-24 22:41:41 +00:00
Evan Cheng	1d35ad62cc	Fix scheduling itinerary for pseudo mov immediate instructions which expand into two real instructions. llvm-svn: 114766	2010-09-24 22:03:46 +00:00
Jim Grosbach	4a6ab13fb9	Add ARM explicit MCInst lowering for the Thumb eh.sjlj.setjmp sequence. llvm-svn: 114758	2010-09-24 20:47:58 +00:00
Evan Cheng	dbcc4b4d4d	Enable code placement optimization pass for ARM. llvm-svn: 114746	2010-09-24 19:07:23 +00:00
Evan Cheng	40a4222996	Fix a potential null dereference bug. llvm-svn: 114723	2010-09-24 05:18:35 +00:00
Owen Anderson	2c5df619c4	Revert r114703 and r114702, removing the isConditionalMove flag from instructions. After further reflection, this isn't going to achieve the purpose I intended it for. Back to the drawing board! llvm-svn: 114710	2010-09-23 23:45:25 +00:00
Bob Wilson	7fbbe9a43a	Set alignment operand for NEON VST instructions. llvm-svn: 114709	2010-09-23 23:42:37 +00:00
Jim Grosbach	c0aed7179a	ARM-mode eh.sjlj.setjmp pseudo MC-inst lowering expansion llvm-svn: 114707	2010-09-23 23:33:56 +00:00
Jim Grosbach	2f3728f576	#+4 --> #4 for consistency with other asm output llvm-svn: 114706	2010-09-23 23:32:38 +00:00
Jim Grosbach	07f07290d8	Fix formatting of output .s code llvm-svn: 114705	2010-09-23 23:03:26 +00:00
Owen Anderson	bd57e0ce3d	Add isConditionalMove bits to X86 and ARM instructions. llvm-svn: 114703	2010-09-23 22:57:01 +00:00
Bob Wilson	9eeb890172	Set alignment operand for NEON VLD instructions. llvm-svn: 114696	2010-09-23 21:43:54 +00:00
Jim Grosbach	7d34837676	never mind. I can't read, apparently llvm-svn: 114689	2010-09-23 19:42:17 +00:00
Evan Cheng	1596f7f6f3	Fix r114632. Return if the only terminator is an unconditional branch after the redundant ones are deleted. llvm-svn: 114688	2010-09-23 19:42:03 +00:00
Jim Grosbach	836341a17a	Fix opcode value for the 'trap' instruction, keeping the type suffix on the constant. Hopefully the non-Darwin bots will like it... llvm-svn: 114687	2010-09-23 19:32:40 +00:00
Jim Grosbach	3d50a3e237	explicit 'unsigned long' on constant value. Hopefully make bots happier. llvm-svn: 114686	2010-09-23 19:08:04 +00:00
Benjamin Kramer	e38495dbc0	Unbreak build. Jim, please review. llvm-svn: 114684	2010-09-23 18:57:26 +00:00
Jim Grosbach	8503054410	Clean up the 'trap' instruction printing a bit. Non-Darwin assemblers don't (yet) recognize the 'trap' mnemonic, so we use .short/.long to emit the opcode directly. On Darwin, however, we do want the mnemonic for more readable assembly code and better disassembly. Adjust the .td file to use the 'trap' mnemonic and handle using the binutils workaround in the assembly printer. Also tweak the formatting of the opcode values to make them consistent between the MC printer and the old printer. llvm-svn: 114679	2010-09-23 18:05:37 +00:00
Jim Grosbach	ea20e257b2	nuke unused var llvm-svn: 114676	2010-09-23 17:58:00 +00:00
Evan Cheng	66c8cd2b32	If there are multiple unconditional branches terminating a block, eliminate all but the first one. Those will never be executed. There was logic to do this but it was faulty. llvm-svn: 114632	2010-09-23 06:54:40 +00:00
Jim Grosbach	85dcd3d0f4	Add support for ELF PLT references for ARM MC asm printing. Adding a new VariantKind to the MCSymbolExpr seems like overkill, but I'm not sure there's a more straightforward way to get the printing difference captured. (i.e., x86 uses @PLT, ARM uses (PLT)). llvm-svn: 114613	2010-09-22 23:27:36 +00:00
Jim Grosbach	a9424d4f2f	Enable a few additional asserts in MC instruction lowering. llvm-svn: 114601	2010-09-22 23:01:28 +00:00
Bob Wilson	463a05342a	Change VDUPLANE DAG combiner to just return the result instead of calling CombineTo to avoid putting the result on the worklist. I don't think it makes much difference for now, but it might help someday as we add more DAG combine optimizations. llvm-svn: 114595	2010-09-22 22:27:30 +00:00
Bob Wilson	2280674fa9	Combine both VMOVDRR(VMOVRRD) and VMOVRRD(VMOVDRR), instead of just doing one of those. Refactor to share code for handling BUILD_VECTOR(VMOVRRD). I don't have a testcase that exercises this, but it seems like an obvious good thing to do. llvm-svn: 114589	2010-09-22 22:09:21 +00:00
Jim Grosbach	1f57cc4a59	add FIXME llvm-svn: 114578	2010-09-22 20:55:15 +00:00
Jim Grosbach	003fd5b65e	Remove a few commented out bits llvm-svn: 114576	2010-09-22 20:32:34 +00:00
Jim Grosbach	e12c8ba05b	Add PrintSpecial() handling for in ARM MC instruction printer. llvm-svn: 114563	2010-09-22 18:37:14 +00:00
Jim Grosbach	284eebc1ae	Add MC instruction printer support for ARM and Thumb1 jump tables. llvm-svn: 114555	2010-09-22 17:39:48 +00:00
Jim Grosbach	1573b29ea7	Add MC instruction printer support for TB[BH] style thumb2 jump tables. llvm-svn: 114553	2010-09-22 17:15:35 +00:00
Jim Grosbach	754e1efffc	Clean up comment. llvm-svn: 114550	2010-09-22 16:45:13 +00:00
Evan Cheng	d757c88bba	OptimizeCompareInstr should avoid iterating pass the beginning of the MBB when the 'and' instruction is after the comparison. llvm-svn: 114506	2010-09-21 23:49:07 +00:00
Jim Grosbach	d64f9b8381	Add start of support for MC instruction printer of ARM jump tables. Filling in the rest of it is next up. llvm-svn: 114500	2010-09-21 23:28:16 +00:00
Owen Anderson	61158f98ab	Enable target-specific mul-lowering on ARM, even at -Os. Remove a test that this makes irrelevant, but add a new test for the new, improved functionality. llvm-svn: 114494	2010-09-21 22:51:46 +00:00
Chris Lattner	0e023ea02a	fix a long standing wart: all the ComplexPattern's were being passed the root of the match, even though only a few patterns actually needed this (one in X86, several in ARM [which should be refactored anyway], and some in CellSPU that I don't feel like detangling). Instead of requiring all ComplexPatterns to take the dead root, have targets opt into getting the root by putting SDNPWantRoot on the ComplexPattern. llvm-svn: 114471	2010-09-21 20:31:19 +00:00
Chris Lattner	886250c8f0	convert a couple more places to use the new getStore() llvm-svn: 114463	2010-09-21 18:51:21 +00:00
Bob Wilson	5549d496dd	Define the TargetLowering::getTgtMemIntrinsic hook for ARM so that NEON load and store intrinsics are represented with MemIntrinsicSDNodes. llvm-svn: 114454	2010-09-21 17:56:22 +00:00
Jim Grosbach	cbac342e1a	Fix errant printing of [v]ldm instructions that aren't a pop llvm-svn: 114445	2010-09-21 16:45:31 +00:00
Gabor Greif	1a25ae88ff	Fix buglet when the TST instruction directly uses the AND result. I am unable to write a test for this case, help is solicited, though... What I did is to tickle the code in the debugger and verify that we do the right thing. llvm-svn: 114430	2010-09-21 13:30:57 +00:00
Gabor Greif	adbbb93d3d	Move the search for the appropriate AND instruction into OptimizeCompareInstr. This necessitates the passing of CmpValue around, so widen the virtual functions to accomodate. No functionality changes. llvm-svn: 114428	2010-09-21 12:01:15 +00:00
Chris Lattner	7727d05dbb	convert the targets off the non-MachinePointerInfo of getLoad. llvm-svn: 114410	2010-09-21 06:44:06 +00:00
Chris Lattner	2510de2bea	reimplement memcpy/memmove/memset lowering to use MachinePointerInfo instead of srcvalue/offset pairs. This corrects SV info for mem operations whose size is > 32-bits. llvm-svn: 114401	2010-09-21 05:40:29 +00:00
Chris Lattner	e3d864b857	convert targets to the new MF.getMachineMemOperand interface. llvm-svn: 114391	2010-09-21 04:39:43 +00:00
Jim Grosbach	94dfd6fc4f	Simplify ARM callee-saved register handling by removing the distinction between the high and low registers for prologue/epilogue code. This was a Darwin-only thing that wasn't providing a realistic benefit anymore. Combining the save areas simplifies the compiler code and results in better ARM/Thumb2 codegen. For example, previously we would generate code like: push {r4, r5, r6, r7, lr} add r7, sp, #12 stmdb sp!, {r8, r10, r11} With this change, we combine the register saves and generate: push {r4, r5, r6, r7, r8, r10, r11, lr} add r7, sp, #12 rdar://8445635 llvm-svn: 114340	2010-09-20 19:32:20 +00:00
Michael J. Spencer	abf60e3421	Fix build. llvm-svn: 114292	2010-09-18 17:54:37 +00:00
Eric Christopher	a6ba082cb6	Thumb opcodes for thumb calls. llvm-svn: 114263	2010-09-18 02:32:38 +00:00
Eric Christopher	aef6499bf1	Add addrmode5 fp load support. Swap float/thumb operand adding to handle thumb with floating point. llvm-svn: 114256	2010-09-18 01:59:37 +00:00
Eric Christopher	30f2300ed2	Floating point stores have a 3rd addressing mode type. llvm-svn: 114254	2010-09-18 01:23:38 +00:00
Jim Grosbach	af5d63583e	factor out a simple helper function to create a label for PC-relative instructions (PICADD, PICLDR, et.al.) llvm-svn: 114243	2010-09-18 00:05:05 +00:00
Jim Grosbach	8a5a6a6c1e	PC-relative pseudo instructions are lowered and printed directly. Any encounter with one in the generic printing code is an error. llvm-svn: 114242	2010-09-18 00:04:53 +00:00
Benjamin Kramer	de636ca9a8	Fix vmov.f64 disassembly on targets where sizeof(long) != 8. llvm-svn: 114240	2010-09-17 23:48:07 +00:00
Jim Grosbach	3d97920829	Add MC-inst handling for tPICADD llvm-svn: 114237	2010-09-17 23:41:53 +00:00
Bob Wilson	cb6db98897	Add target-specific DAG combiner for BUILD_VECTOR and VMOVRRD. An i64 value should be in GPRs when it's going to be used as a scalar, and we use VMOVRRD to make that happen, but if the value is converted back to a vector we need to fold to a simple bit_convert. Radar 8407927. llvm-svn: 114233	2010-09-17 22:59:05 +00:00
Jim Grosbach	7a6c37d3e7	Teach the (non-MC) instruction printer to use the cannonical names for push/pop, and shift instructions on ARM. Update the tests to match. llvm-svn: 114230	2010-09-17 22:36:38 +00:00
Eric Christopher	2ccc1aa696	Rework arm fast isel branch and compare code. llvm-svn: 114226	2010-09-17 22:28:18 +00:00
Jim Grosbach	132a0ce787	Hook up verbose asm comment printing for SOImm operands in MC printer llvm-svn: 114215	2010-09-17 21:33:25 +00:00
Jim Grosbach	4e51d0bebb	trailing whitespace llvm-svn: 114212	2010-09-17 21:25:10 +00:00
Jim Grosbach	1287f4f3b8	Add skeleton infrastructure for the ARMMCCodeEmitter class. Patch by Jason Kim! llvm-svn: 114195	2010-09-17 18:46:17 +00:00
Jim Grosbach	0d35df1cfe	handle the upper16/lower16 target operand flags on symbol references for MC instruction lowering. llvm-svn: 114191	2010-09-17 18:25:25 +00:00
Jim Grosbach	a7d430b51c	expand PICLDR MC lowering to handle other PICLDR and PICSTR versions. llvm-svn: 114183	2010-09-17 16:25:52 +00:00
Jim Grosbach	218e22da8b	MC-ization of the PICLDR pseudo. Next up, adding the other variants (PICLDRB, et. al.) and PICSTR* llvm-svn: 114098	2010-09-16 17:43:25 +00:00
Jim Grosbach	ee1934a2da	Make sure to promote single precision floats to double before extracting them from the APFloat. llvm-svn: 114096	2010-09-16 17:37:30 +00:00
Bob Wilson	a625b0110b	Remove support for "dregpair" operand modifier, now that it is no longer being used for anything. llvm-svn: 114067	2010-09-16 04:55:00 +00:00
Bob Wilson	450c6cfaff	When expanding ARM pseudo registers, copy the existing predicate operands instead of using default predicates on the expanded instructions. llvm-svn: 114066	2010-09-16 04:25:37 +00:00
Jim Grosbach	298d0fd1c8	store MC FP immediates as a double instead of as an APFloat, thus avoiding an unnecessary dtor for MCOperand. llvm-svn: 114064	2010-09-16 03:45:21 +00:00
Bob Wilson	62c454847d	Add missing break. llvm-svn: 114048	2010-09-16 00:31:32 +00:00
Bob Wilson	6b853c3ce3	Change VLDMQ and VSTMQ to be pseudo instructions. They are expanded after register allocation to VLDMD and VSTMD respectively. This avoids using the dregpair operand modifier. llvm-svn: 114047	2010-09-16 00:31:02 +00:00
Jim Grosbach	cb54b960b8	Add support for the 'lane' modifier on vdup operands llvm-svn: 114030	2010-09-15 22:13:23 +00:00
Jakob Stoklund Olesen	44857a38fa	Remember VLDMQ. llvm-svn: 114026	2010-09-15 21:40:11 +00:00
Jakob Stoklund Olesen	b929c7173d	Add missing break. llvm-svn: 114025	2010-09-15 21:40:09 +00:00
Jim Grosbach	27ab5fbd2b	Teach the MC disassembler to handle vmov.f32 and vmov.f64 immediate to register moves. Previously, the immediate was printed as the encoded integer value, which is incorrect. llvm-svn: 114021	2010-09-15 21:04:54 +00:00
Jim Grosbach	40e85fbf17	move getRegisterNumbering() to out of ARMBaseRegisterInfo into the helper functions in ARMBaseInfo.h so it can be used in the MC library as well. For anything bigger than this, we may want a means to have a small support library for shared helper functions like this. Cross that bridge when we come to it. llvm-svn: 114016	2010-09-15 20:26:25 +00:00
Jim Grosbach	9569567255	simplify getRegisterNumbering(). Remove the unused isSPVFP argument and merge the common cases. llvm-svn: 114013	2010-09-15 19:52:17 +00:00
Jim Grosbach	789ca9a1e9	Refactor uses of getRegisterNumbering() to not need the isSPVFP argument. Check if the register is a member of the SPR register class directly instead. llvm-svn: 114012	2010-09-15 19:44:57 +00:00
Jim Grosbach	29fe94e75e	Reduce dependencies in the ARM MC instruction printer. llvm-svn: 114009	2010-09-15 19:27:50 +00:00
Jim Grosbach	2b48b5557a	Fix spelling typo. llvm-svn: 114008	2010-09-15 19:26:50 +00:00
Jim Grosbach	91fbd8f86e	Factor out basic enums and hleper functions from ARM.h for cleaner sharing between the compiler back end and the MC libraries. llvm-svn: 114007	2010-09-15 19:26:06 +00:00
Jim Grosbach	7bbf3fd0d6	Add support for floating point immediates to MC instruction printing. ARM VFP instructions use it for loading some constants, so implement that handling. Not thrilled with adding a member to MCOperand, but not sure there's much of a better option that's not pretty fragile (like putting a double in the union instead and just assuming that's good enough). Suggestions welcome... llvm-svn: 113996	2010-09-15 18:47:08 +00:00
Jakob Stoklund Olesen	33005d1327	Recognize VST1q64Pseudo and VSTMQ as stack slot stores. Recognize VLD1q64Pseudo as a stack slot load. Reject these if they are loading or storing a subregister. The API (and VirtRegRewriter) doesn't know how to deal with that. llvm-svn: 113985	2010-09-15 17:27:09 +00:00
Bob Wilson	660d7ecf32	Reapply Gabor's 113839, 113840, and 113876 with a fix for a problem encountered while building llvm-gcc for arm. This is probably the same issue that the ppc buildbot hit. llvm::prior works on a MachineBasicBlock::iterator, not a plain MachineInstr. llvm-svn: 113983	2010-09-15 17:12:08 +00:00
Gabor Greif	9ae4b271f2	the darwin9-powerpc buildbot keeps consistently crashing, backing out following to get it back to green, so I can investigate in peace: svn merge -c -113840 llvm/test/CodeGen/ARM/arm-and-tst-peephole.ll svn merge -c -113876 -c -113839 llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp llvm-svn: 113980	2010-09-15 16:53:07 +00:00
Jakob Stoklund Olesen	11f5be3b86	Move ARM is{LoadFrom,StoreTo}StackSlot closer to their siblings so they won't be forgotten in the future. Coalesce identical cases in switch. No functional changes intended. llvm-svn: 113979	2010-09-15 16:36:26 +00:00
Bob Wilson	2c00b5098a	Spelling fix. llvm-svn: 113978	2010-09-15 16:28:21 +00:00
Bob Wilson	b1e9d4bff1	Use VLD1/VST1 pseudo instructions for loadRegFromStackSlot and storeRegToStackSlot. llvm-svn: 113918	2010-09-15 01:48:05 +00:00
Jim Grosbach	c7cf42d80b	Reapply r113875 with additional cleanups. "The register specified for a dregpair is the corresponding Q register, so to get the pair, we need to look up the sub-regs based on the qreg. Create a lookup function since we don't have access to TargetRegisterInfo here to be able to use getSubReg(ARM::dsub_[01])." Additionaly, fix the NEON VLD1* and VST1* instruction patterns not to use the dregpair modifier for the 2xdreg versions. Explicitly specifying the two registers as operands is more correct and more consistent with the other instruction patterns. This enables further cleanup of special case code in the disassembler as a nice side-effect. llvm-svn: 113903	2010-09-14 23:54:06 +00:00
Eric Christopher	8b9126694d	Emit libcalls for SDIV, this requires some call infrastructure that needs to be shared a bit more widely around. llvm-svn: 113886	2010-09-14 23:03:37 +00:00
Jim Grosbach	c07b2afe5e	revert 113875 momentarilly. Need to fix the MC disassembler to handle the change. llvm-svn: 113878	2010-09-14 22:38:39 +00:00
Jim Grosbach	29cad6c2fc	trailing whitespace cleanup llvm-svn: 113877	2010-09-14 22:27:15 +00:00
Gabor Greif	b54e9387ab	an attempt to salvage the darwin9-powerpc buildbot, which could be miscompiling this line llvm-svn: 113876	2010-09-14 22:25:16 +00:00
Jim Grosbach	b523be2bb3	The register specified for a dregpair is the corresponding Q register, so to get the pair, we need to look up the sub-regs based on the qreg. Create a lookup function since we don't have access to TargetRegisterInfo here to be able to use getSubReg(ARM::dsub_[01]). llvm-svn: 113875	2010-09-14 22:20:33 +00:00
Gabor Greif	22f6922505	set isCompare for another three Thumb1 instructions llvm-svn: 113867	2010-09-14 22:00:50 +00:00
Jim Grosbach	a244f70113	Add predicate and 's' bit operands to PICADD instruction lowering. llvm-svn: 113860	2010-09-14 21:28:17 +00:00
Bob Wilson	62e9a052b9	Avoid warnings. llvm-svn: 113857	2010-09-14 21:12:05 +00:00
Jim Grosbach	7ae94222cd	fix comment typo llvm-svn: 113856	2010-09-14 21:05:34 +00:00
Bob Wilson	dd29db5635	Make NEON ld/st pseudo instruction classes take the instruction itinerary as an argument, so that we can distinguish instructions with the same register classes but different numbers of registers (e.g., vld3 and vld4). Fix some of the non-pseudo NEON ld/st instruction itineraries to reflect the number of registers loaded or stored, not just the opcode name. llvm-svn: 113854	2010-09-14 20:59:49 +00:00
Gabor Greif	2afac8e9bd	set comparable for a bunch of Thumb instructions llvm-svn: 113849	2010-09-14 20:47:43 +00:00
Jim Grosbach	cf98cbaef1	Don't ignore the CPSR implicit def when lowering a MachineInstruction to an MCInst. llvm-svn: 113847	2010-09-14 20:41:27 +00:00
Jim Grosbach	bc7eeaf233	Clarify comment llvm-svn: 113846	2010-09-14 20:35:46 +00:00
Gabor Greif	d0cef1e2ef	Eliminate a 'tst' that immediately follows an 'and' by morphing the 'and' to its recording form 'andS'. This is basically a test commit into this area, to see whether the bots like me. Several generalizations can be applied and various avenues of code simplification are open. I'll introduce those as I go. I am aware of stylistic input from Bill Wendling, about where put the analysis complexity, but I am positive that we can move things around easily and will find a satisfactory solution. llvm-svn: 113839	2010-09-14 09:23:22 +00:00
Eric Christopher	726838a3e5	Fix QOpcode assignment to Opc. llvm-svn: 113837	2010-09-14 08:31:25 +00:00
Michael J. Spencer	93c9b2ea93	Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." This reverts commit r113632 Conflicts: cmake/modules/AddLLVM.cmake llvm-svn: 113819	2010-09-13 23:59:48 +00:00
Bob Wilson	c597fd3b4a	Convert some VTBL and VTBX instructions to use pseudo instructions prior to register allocation. Remove the NEONPreAllocPass, which is no longer needed. Yeah!! llvm-svn: 113818	2010-09-13 23:55:10 +00:00
Bob Wilson	d5c57a5ed4	Switch all the NEON vld-lane and vst-lane instructions over to the new pseudo-instruction approach. Change ARMExpandPseudoInsts to use a table to record all the NEON load/store information. llvm-svn: 113812	2010-09-13 23:01:35 +00:00
Jim Grosbach	7aeff13cae	trailing whitespace llvm-svn: 113768	2010-09-13 18:25:42 +00:00
Chris Lattner	a2a9d16b78	fix the asmparser so that the target is responsible for skipping to the end of the line on a parser error, allowing skipping to happen for syntactic errors but not for semantic errors. Before we would miss emitting a diagnostic about the second line, because we skipped it due to the semantic error on the first line: foo %eax bar %al This fixes rdar://8414033 - llvm-mc ignores lines after an invalid instruction mnemonic errors llvm-svn: 113688	2010-09-11 16:18:25 +00:00
Bill Wendling	27dddd1fd1	Rename ConvertToSetZeroFlag to something more general. llvm-svn: 113670	2010-09-11 00:13:50 +00:00
Bill Wendling	d0a5f4e238	No need to recompute the SrcReg and CmpValue. llvm-svn: 113666	2010-09-10 23:46:12 +00:00
Bill Wendling	041230014c	Move some of the decision logic for converting an instruction into one that sets the 'zero' bit down into the back-end. There are other cases where this logic isn't sufficient, so they should be handled separately. llvm-svn: 113665	2010-09-10 23:34:19 +00:00
Eric Christopher	72497e5d90	Start sketching out ARM fast-isel calls. llvm-svn: 113662	2010-09-10 23:18:12 +00:00
Eric Christopher	cc766a20d3	For consistency. llvm-svn: 113659	2010-09-10 23:10:30 +00:00
Eric Christopher	cc1367851b	Newline at end of file. llvm-svn: 113654	2010-09-10 22:46:03 +00:00
Eric Christopher	1c06917f15	Split out some of the calling convention bits so that they can be used for fast-isel. llvm-svn: 113652	2010-09-10 22:42:06 +00:00
Bill Wendling	aee679bf35	Modify the comparison optimizations in the peephole optimizer to update the iterator when an optimization took place. This allows us to do more insane things with the code than just remove an instruction or two. llvm-svn: 113640	2010-09-10 21:55:43 +00:00
Jim Grosbach	1f77ee5691	Add a missing case to duplicateCPV() for LSDA constants. Add a FIXME. rdar://8302157 llvm-svn: 113637	2010-09-10 21:38:22 +00:00
Michael J. Spencer	dc38d36ccb	CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally. llvm-svn: 113632	2010-09-10 21:14:25 +00:00
Bob Wilson	ed19768cec	Calculate the number of VLDM/VSTM registers by subtracting the number of fixed operands from the total number of operands (including the variadic ones). llvm-svn: 113597	2010-09-10 18:25:35 +00:00
Bill Wendling	ac0ad0f634	Reword since this may not be a bug but intended behavior. llvm-svn: 113584	2010-09-10 10:31:11 +00:00
Bob Wilson	8617234658	Fix merging base-updates for VLDM/VSTM: Before I switched these instructions to use AddrMode4, there was a count of the registers stored in one of the operands. I changed that to just count the operands but forgot to adjust for the size of D registers. This was noticed by Evan as a performance problem but it is a potential correctness bug as well, since it is possible that this could merge a base update with a non-matching immediate. llvm-svn: 113576	2010-09-10 05:15:04 +00:00
Evan Cheng	bf4070756f	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Eric Christopher	712bd0a604	Fix build error. llvm-svn: 113566	2010-09-10 00:35:09 +00:00
Eric Christopher	860fc9370f	Update comments, reorganize some code, rename variables to be more clear. No functional change. llvm-svn: 113565	2010-09-10 00:34:35 +00:00
Eric Christopher	22fd29a94a	64-bit fp loads can come straight out of the constant pool, not as bad as I'd thought. llvm-svn: 113561	2010-09-09 23:50:00 +00:00
Eric Christopher	4bd7047324	SIToFP and FPToSI conversions work only on fp-reg to fp-reg. Move some data around and implement a couple of move routines to do this. llvm-svn: 113546	2010-09-09 21:44:45 +00:00
Eric Christopher	2cbe0fd956	New "move to fp reg" routine. Use it. llvm-svn: 113537	2010-09-09 20:49:25 +00:00
Eric Christopher	82b05d7206	"Strike that, reverse it." -- Mr. Wonka. Truncate when truncating, extend when extending. llvm-svn: 113536	2010-09-09 20:36:19 +00:00
Eric Christopher	5903c0be2a	Add FPTrunc, fix some bugs where I forgot to update the value map. llvm-svn: 113533	2010-09-09 20:26:31 +00:00
Eric Christopher	6e3eeba4d9	Basic FP->Int, Int->FP conversions. llvm-svn: 113523	2010-09-09 18:54:59 +00:00
Evan Cheng	367a5df8cf	For each instruction itinerary class, specify the number of micro-ops each instruction in the class would be decoded to. Or zero if the number of uOPs must be determined dynamically. This will be used to determine the cost-effectiveness of predicating a micro-coded instruction. llvm-svn: 113513	2010-09-09 18:18:55 +00:00
Bob Wilson	4adbaf1843	Fix NEON VLD pseudo instruction itineraries that were incorrectly copied from the VST pseudos. The VLD/VST scheduling still needs work (see pr6722), but at least we shouldn't confuse the loads with the stores. llvm-svn: 113473	2010-09-09 05:40:26 +00:00
Eric Christopher	2ff757d422	Nuke whitespace and fix some indenting. llvm-svn: 113463	2010-09-09 01:06:51 +00:00
Eric Christopher	bd3d121641	Handle 64-bit floating point binops as well. llvm-svn: 113461	2010-09-09 01:02:03 +00:00
Eric Christopher	24dc27f73a	Basic 32-bit FP operations. llvm-svn: 113459	2010-09-09 00:53:57 +00:00
Bob Wilson	84971c850a	For double-spaced VLD3/VLD4 instructions, copy the explicit super-register use operand from the pseudo instruction to the new instruction as an implicit use. This will preserve any other flags (e.g., kill) on the operand. llvm-svn: 113456	2010-09-09 00:38:32 +00:00
Eric Christopher	f14b9bf98d	Handle float->double extension. llvm-svn: 113455	2010-09-09 00:26:48 +00:00
Eric Christopher	3cf63f1edd	Rewrite TargetMaterializeConstant splitting it out into two functions for integer and fp constants. Implement todo to use vfp3 instructions to materialize easy constants if we can. llvm-svn: 113453	2010-09-09 00:19:41 +00:00
Bob Wilson	4ccd5ce6ea	Simplify copying over operands from pseudo NEON load/store instructions. For VLD3/VLD4 with double-spaced registers, add the implicit use of the super register for both the instruction loading the even registers and the instruction loading the odd registers. llvm-svn: 113452	2010-09-09 00:15:32 +00:00
Bob Wilson	359f8ba337	Clean up a comment. llvm-svn: 113442	2010-09-08 23:39:54 +00:00
Eric Christopher	c3e9c404aa	Very basic compare support. llvm-svn: 113440	2010-09-08 23:13:45 +00:00
Eric Christopher	5838af54bf	Delete dead code. llvm-svn: 113436	2010-09-08 22:58:35 +00:00
Evan Cheng	722cd122dc	Fix LDM_RET schedule itinery. llvm-svn: 113435	2010-09-08 22:57:08 +00:00
Eric Christopher	6489df7c8c	Make the loads/stores match the type we really want to store. llvm-svn: 113417	2010-09-08 21:49:50 +00:00
Jim Grosbach	504d23bd05	Re-enable usage of the ARM base pointer. r113394 fixed the known failures. Re-running some nightly testers w/ it enabled to verify. llvm-svn: 113399	2010-09-08 20:12:02 +00:00
Jim Grosbach	21c9471706	Fix errant fall-throughs causing the base pointer to be used when the frame pointer was intended. rdar://8401980 llvm-svn: 113394	2010-09-08 19:55:28 +00:00
Eric Christopher	f5dd1929a2	Rewrite TargetMaterializeConstant. llvm-svn: 113387	2010-09-08 18:56:34 +00:00
Jim Grosbach	7dfca6fb51	Be more careful about when to do dynamic stack realignment. Since we have an option to disable base pointer usage, pay attention to it when deciding if we can realign (if no base pointer and VLAs, we can't). llvm-svn: 113366	2010-09-08 17:22:12 +00:00
Jim Grosbach	53aa5e31e1	Add missing assert llvm-svn: 113365	2010-09-08 17:05:45 +00:00
Chris Lattner	91689c1d0f	change the MC "ParseInstruction" interface to make it the implementation's job to check for and lex the EndOfStatement marker. llvm-svn: 113347	2010-09-08 05:10:46 +00:00
NAKAMURA Takumi	7a23aa081a	ARM/Disassembler: Fix definitions incompatible(unsigned and uint32_t) to Cygwin-1.5, following up to r113255. llvm-svn: 113345	2010-09-08 04:48:17 +00:00
Jim Grosbach	535d3b4e09	remove trailing whitespace llvm-svn: 113338	2010-09-08 03:54:02 +00:00
Jim Grosbach	19cb2f4c67	remove obsolete comment llvm-svn: 113337	2010-09-08 03:51:44 +00:00
Jim Grosbach	261df12f64	disable for the moment while tracking down a few Thumb2-O0 failure that look related. (attempt deux, complete w/ test update this time) llvm-svn: 113333	2010-09-08 02:00:34 +00:00
Jim Grosbach	b2c950187e	woops. need to update a test along with this. llvm-svn: 113332	2010-09-08 01:49:09 +00:00
Jim Grosbach	7cda56ea6a	disable temporarily while sorting out a few test failures in Thumb2-O0 tests. llvm-svn: 113331	2010-09-08 01:47:49 +00:00
Jim Grosbach	136d035e45	correct spill code to properly determine if dynamic stack realignment is present in the function and thus whether aligned load/store instructions can be used. llvm-svn: 113323	2010-09-08 00:26:59 +00:00
Jim Grosbach	abcbe2474d	VFP/NEON load/store multiple instructions are addrmode4, not 5. llvm-svn: 113322	2010-09-08 00:25:50 +00:00
Jim Grosbach	88628e9738	To shrink a t2LDM instruction to the 16-bit wide tLDM instruction, the base register must be one of the destination registers for the load. Otherwise, the tLDM instruction will write-back to the base register, which isn't what's desired (otherwise, we'd have a t2LDM_UPD instead). rdar://8394087 llvm-svn: 113297	2010-09-07 22:30:53 +00:00
Jim Grosbach	9877af3b46	grammar tweak llvm-svn: 113289	2010-09-07 21:30:25 +00:00
Chris Lattner	091012d5d5	hopefully fix a problem building on cygwin-1.5 llvm-svn: 113255	2010-09-07 19:50:53 +00:00
Chris Lattner	339cc7bfef	in the case where an instruction only has one implementation of a mneumonic, report operand errors with better location info. For example, we now report: t.s:6:14: error: invalid operand for instruction cwtl $1 ^ but we fail for common cases like: t.s:11:4: error: invalid operand for instruction addl $1, $1 ^ because we don't know if this is supposed to be the reg/imm or imm/reg form. llvm-svn: 113178	2010-09-06 22:11:18 +00:00
Chris Lattner	a22a368e7c	change MatchInstructionImpl to return an enum instead of bool. llvm-svn: 113165	2010-09-06 19:22:17 +00:00
Chris Lattner	3e4582ada5	have AsmMatcherEmitter.cpp produce the hunk of code that gets included into the middle of the class, and rework how the different sections of the generated file are conditionally included for simplicity. llvm-svn: 113163	2010-09-06 19:11:01 +00:00
Chris Lattner	f43cb302ca	remove some dead code. t2addrmode_imm8s4 is never used in a pattern, so there is no need to define a matching function. llvm-svn: 113122	2010-09-05 22:51:11 +00:00
Chris Lattner	e40007a71b	cleanups. llvm-svn: 113119	2010-09-05 21:18:45 +00:00
Chris Lattner	65b48b5dfc	zap dead code. llvm-svn: 113073	2010-09-04 18:12:00 +00:00
Jim Grosbach	03f4be86ba	Re-apply r112883: "For ARM stack frames that utilize variable sized objects and have either large local stack areas or require dynamic stack realignment, allocate a base register via which to access the local frame. This allows efficient access to frame indices not accessible via the FP (either due to being out of range or due to dynamic realignment) or the SP (due to variable sized object allocation). In particular, this greatly improves efficiency of access to spill slots in Thumb functions which contain VLAs." r112986 fixed a latent bug exposed by the above. llvm-svn: 112989	2010-09-03 18:37:12 +00:00
Jim Grosbach	21a2a2579f	Check the local frame alignment for determining whether dynamic stack alignment should be performed. Otherwise dynamic realignment may trigger when the register allocator has already used the frame pointer as a general purpose register. That is, we need to make sure that the list of reserved registers doesn't change after register allocation. llvm-svn: 112986	2010-09-03 18:28:19 +00:00
Bob Wilson	35fafca587	Finish converting the rest of the NEON VLD instructions to use pseudo- instructions prior to regalloc. Since it's getting a little close to the 2.8 branch deadline, I'll have to leave the rest of the instructions handled by the NEONPreAllocPass for now, but I didn't want to leave half of the VLD instructions converted and the other half not. llvm-svn: 112983	2010-09-03 18:16:02 +00:00
Daniel Dunbar	2ac3386ef3	Revert "For ARM stack frames that utilize variable sized objects and have either", it is breaking oggenc with Clang for ARMv6. This reverts commit 8d6e29cfda270be483abf638850311670829ee65. llvm-svn: 112962	2010-09-03 15:26:42 +00:00
Bob Wilson	f65c9ef720	Replace NEON vabdl, vaba, and vabal intrinsics with combinations of the vabd intrinsic and add and/or zext operations. In the case of vaba, this also avoids the need for a DAG combine pattern to combine vabd with add. Update tests. Auto-upgrade the old intrinsics. llvm-svn: 112941	2010-09-03 01:35:08 +00:00
Eric Christopher	6aaed72949	Simple branch instruction support. llvm-svn: 112923	2010-09-03 00:35:47 +00:00
Eric Christopher	c3e118ef3d	Add basic support for materializing constants (including fp) and stores. llvm-svn: 112912	2010-09-02 23:43:26 +00:00
Jim Grosbach	7fd9aea67c	For ARM stack frames that utilize variable sized objects and have either large local stack areas or require dynamic stack realignment, allocate a base register via which to access the local frame. This allows efficient access to frame indices not accessible via the FP (either due to being out of range or due to dynamic realignment) or the SP (due to variable sized object allocation). In particular, this greatly improves efficiency of access to spill slots in Thumb functions which contain VLAs. rdar://7352504 rdar://8374540 rdar://8355680 llvm-svn: 112883	2010-09-02 22:29:01 +00:00
Jim Grosbach	b2a9025bad	trailing whitespace llvm-svn: 112852	2010-09-02 19:52:39 +00:00
Jim Grosbach	66c681a644	Now that register allocation properly considers reserved regs, simplify the ARM register class allocation order functions to take advantage of that. llvm-svn: 112841	2010-09-02 18:14:29 +00:00
Bob Wilson	5a1df805e5	Fill in a missing comment. llvm-svn: 112826	2010-09-02 16:17:29 +00:00
Bob Wilson	75a6408f88	Convert VLD1 and VLD2 instructions to use pseudo-instructions until after regalloc. llvm-svn: 112825	2010-09-02 16:00:54 +00:00
Eric Christopher	2020d69800	Clang's -ccc-host-triple was ignoring the arch specifier on my triple, I don't need to implement this quite yet - and not for ConstantInt anyhow. llvm-svn: 112798	2010-09-02 02:30:46 +00:00
Eric Christopher	92db201e23	This should be TargetMaterializeConstant instead. llvm-svn: 112795	2010-09-02 01:48:11 +00:00
Eric Christopher	6a0333c1ed	One definition of isThumb is plenty, thanks. llvm-svn: 112793	2010-09-02 01:39:14 +00:00
Jim Grosbach	8ee5cd99ef	Remove trailing whitespace llvm-svn: 112790	2010-09-02 01:02:06 +00:00
Eric Christopher	74487fcbe7	Rework arm fast-isel load and store handling. Move offset computation into the "address selection" routine and handle constant materialization for stores. llvm-svn: 112788	2010-09-02 00:53:56 +00:00
Jim Grosbach	6f2067659d	trivial cleanup llvm-svn: 112779	2010-09-02 00:02:26 +00:00
Jim Grosbach	dffc9d328d	Simplify the tGPR register class now that the register allocators know not to try to allocate reserved registers. llvm-svn: 112774	2010-09-01 23:50:23 +00:00
Bob Wilson	38ab35a911	Remove NEON vmull, vmlal, and vmlsl intrinsics, replacing them with multiply, add, and subtract operations with zero-extended or sign-extended vectors. Update tests. Add auto-upgrade support for the old intrinsics. llvm-svn: 112773	2010-09-01 23:50:19 +00:00
Eric Christopher	fde5a3d494	Some basic store support. llvm-svn: 112752	2010-09-01 22:16:27 +00:00
Eric Christopher	3ce9c4a65f	Add some more load types in. llvm-svn: 112721	2010-09-01 18:01:32 +00:00
Chris Lattner	94f834348f	zap dead code. llvm-svn: 112712	2010-09-01 16:04:34 +00:00
Chris Lattner	39eccb4754	temporarily revert r112664, it is causing a decoding conflict, and the testcases should be merged. llvm-svn: 112711	2010-09-01 16:00:50 +00:00
Bill Wendling	6789f8b6ae	We have a chance for an optimization. Consider this code: int x(int t) { if (t & 256) return -26; return 0; } We generate this: tst.w r0, #256 mvn r0, #25 it eq moveq r0, #0 while gcc generates this: ands r0, r0, #256 it ne mvnne r0, #25 bx lr Scandalous really! During ISel time, we can look for this particular pattern. One where we have a "MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND instruction to 0. Something like this (greatly simplified): %r0 = ISD::AND ... ARMISD::CMPZ %r0, 0 @ sets [CPSR] %r0 = ARMISD::MOVCC 0, -26 @ reads [CPSR] All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR] when it's zero. The zero value will all ready be in the %r0 register and we only need to change it if the AND wasn't zero. Easy! llvm-svn: 112664	2010-08-31 22:41:22 +00:00
Bill Wendling	d657d82597	And ANDS pattern to match the t2ANDS pattern. llvm-svn: 112654	2010-08-31 22:05:37 +00:00
Jim Grosbach	9ce9210e47	SP relative offsets need to be adjusted by the local allocation size when determining if they're likely to be in range of the SP when resolving frame references. llvm-svn: 112624	2010-08-31 18:52:31 +00:00
Jim Grosbach	6f6b590b99	this assert should just be a condition, since this function is just asking if the offset is legally encodable, not actually trying to do the encoding. llvm-svn: 112622	2010-08-31 18:49:31 +00:00
Bill Wendling	b70dc8777e	- Cleanup some whitespaces. - Convert {0,1} and friends into 0b01, which is identical and more consistent. llvm-svn: 112593	2010-08-31 07:50:46 +00:00
Eric Christopher	901176a755	Rewrite slightly so we can expand for floating point types easier. llvm-svn: 112568	2010-08-31 01:28:42 +00:00
Eric Christopher	bbd1098989	If we have an unhandled type then assert, we shouldn't get here for things we can't handle. llvm-svn: 112559	2010-08-30 23:48:26 +00:00
Anton Korobeynikov	48043d0173	Expand MOVi32imm in ARM mode after regalloc. This provides scheduling opportunities (extra instruction can go in between MOVT / MOVW pair removing the stall). llvm-svn: 112546	2010-08-30 22:50:36 +00:00
Bill Wendling	87bb14c566	Use the existing T2I_bin_s_irs pattern instead of creating T2I_bin_sw_irs, which is meant to do exactly the same thing. Thanks to Jim Grosbach for pointing this out! :-) llvm-svn: 112538	2010-08-30 22:05:23 +00:00
Jakob Stoklund Olesen	4d30f90e35	Remember to clear the shadow kill flag at the same time as clearing the real kill flag. This could cause duplicate kill flags when the same register was used twice in a continuous sequence of STRs. There is no small test case. <rdar://problem/8218046> llvm-svn: 112534	2010-08-30 21:52:40 +00:00
Bob Wilson	4cd8a126c3	Remove NEON vmovn intrinsic, replacing it with vector truncate operations. Auto-upgrade the old intrinsic and update tests. llvm-svn: 112507	2010-08-30 20:02:30 +00:00
Jim Grosbach	fef37287a8	Make ARM add rN, sp, #imm instructions rematerializable. That's how the address of locals is calculated, so this should help relieve register pressure a bit. Recalculating the local address is almost always going to be better than spilling. llvm-svn: 112503	2010-08-30 19:49:58 +00:00
Bob Wilson	e2f8bdac14	When expanding NEON VST pseudo instructions, if the original super-register operand is killed, add it to the expanded instruction as an implicit kill operand instead of marking the individual subregs with kill flags. This should work better in general and also handles the case for VST3 where one of the subregs was not referenced in the expanded instruction and so was not marked killed. llvm-svn: 112494	2010-08-30 18:10:48 +00:00
Bill Wendling	f8dfa461fa	Create Thumb2sI_cpsr and T2sI_cpsr. These new classes indicate that CPSR is the optional modified register (instead of reg0). Along with r112461 it will make sure that the optional define of CPSR is marked as "def" and will thus mark the instructions using these classes (t2ANDS*) as setting the 's' flag. llvm-svn: 112462	2010-08-30 01:47:35 +00:00
Bill Wendling	8fc2b590b9	Fix whitespaces. No functionality changes. llvm-svn: 112421	2010-08-29 11:31:07 +00:00
Bob Wilson	d0c054886c	Remove NEON vaddl, vaddw, vsubl, and vsubw intrinsics. Instead, use llvm IR add/sub operations with one or both operands sign- or zero-extended. Auto-upgrade the old intrinsics. llvm-svn: 112416	2010-08-29 05:57:34 +00:00
Bill Wendling	df9ec17d53	- Add a parameter to T2I_bin_irs for those patterns which set the S bit. - Create T2I_bin_sw_irs to be like T2I_bin_w_irs, but that it sets the S bit. llvm-svn: 112399	2010-08-29 03:55:31 +00:00
Bill Wendling	b0dc465c04	Name ANDflag to ANDS, which is less stupid. llvm-svn: 112395	2010-08-29 03:06:09 +00:00
Bill Wendling	ac64ed0923	File missing from last commit. llvm-svn: 112394	2010-08-29 03:02:28 +00:00
Bill Wendling	0a65116cce	Create an ARMISD::AND node. This node is exactly like the "ARM::AND" node, but it sets the CPSR register. llvm-svn: 112393	2010-08-29 03:02:11 +00:00
Bob Wilson	950882be07	Use pseudo instructions for VST1 and VST2. llvm-svn: 112357	2010-08-28 05:12:57 +00:00
Bob Wilson	8ee9394750	We don't need to custom-select VLDMQ and VSTMQ anymore. llvm-svn: 112336	2010-08-28 00:20:11 +00:00
Bob Wilson	ca5af12920	When merging Thumb2 loads/stores, do not give up when the offset is one of the special values that for ARM would be used with IB or DA modes. Fall through and consider materializing a new base address is it would be profitable. llvm-svn: 112329	2010-08-27 23:57:52 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Bob Wilson	af371b49a8	Unsigned value cannot be < 0. llvm-svn: 112300	2010-08-27 21:44:35 +00:00
Jim Grosbach	6a77066913	Simplify eliminateFrameIndex() interface back down now that PEI doesn't need to try to re-use scavenged frame index reference registers. rdar://8277890 llvm-svn: 112241	2010-08-26 23:32:16 +00:00
Jim Grosbach	e82d5b4aaf	tidy up a bit. no functional change. llvm-svn: 112228	2010-08-26 21:56:30 +00:00
Jim Grosbach	17da935964	Turn off the scavenging based frame reg reuse briefly to measure whether it's still having a significant effect. It shouldn't be now that the pre-RA virtual base reg stuff is in. Assuming that's valididated by the nightly testers, we can simplify a lot of the PEI frame index code. llvm-svn: 112220	2010-08-26 21:29:54 +00:00
Bob Wilson	97919e9c59	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Bill Wendling	a9c03f4fae	Reapply r112176 without removing the other CMN patterns (that was unintentional). llvm-svn: 112206	2010-08-26 18:33:51 +00:00
Jim Grosbach	074d22e1ac	Restrict the register to tGPR to make sure the str instruction will be encodable as a 16-bit wide instruction. llvm-svn: 112195	2010-08-26 17:02:47 +00:00
Dan Gohman	10b20b2b81	Revert r112176; it broke test/CodeGen/Thumb2/thumb2-cmn.ll. llvm-svn: 112191	2010-08-26 15:50:25 +00:00
Bill Wendling	a9a0599b39	There seems to be a (potential) hardware bug with the CMN instruction and comparison with 0. These two pieces of code should give identical results: rsbs r1, r1, 0 cmp r0, r1 mov r0, #0 it ls mov r0, #1 and: cmn r0, r1 mov r0, #0 it ls mov r0, #1 However, the CMN gives the opposite result when r1 is 0. This is because the carry flag is set in the CMP case but not in the CMN case. In short, the CMP instruction doesn't perform a truncate of the (logical) NOT of 0 plus the value of r0 and the carry bit (because the "carry bit" parameter to AddWithCarry is defined as 1 in this case, the carry flag will always be set when r0 >= 0). The CMN instruction doesn't perform a NOT of 0 so there is never a "carry" when this AddWithCarry is performed (because the "carry bit" parameter to AddWithCarry is defined as 0). The AddWithCarry in the CMP case seems to be relying upon the identity: ~x + 1 = -x However when x is 0 and unsigned, this doesn't hold: x = 0 ~x = 0xFFFF FFFF ~x + 1 = 0x1 0000 0000 (-x = 0) != (0x1 0000 0000 = ~x + 1) Therefore, we should disable all versions of CMN, especially when comparing against zero, until we can limit when the CMN instruction is used (when we know that the RHS is not 0) or when we have a hardware fix for this. (See the ARM docs for the "AddWithCarry" pseudo-code.) This is related to <rdar://problem/7569620>. llvm-svn: 112176	2010-08-26 09:07:33 +00:00
Bob Wilson	4cec44975e	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Jim Grosbach	08da771ec3	Enable pre-RA virtual frame base register allocation. rdar://8277890 llvm-svn: 112127	2010-08-26 00:58:06 +00:00
Bob Wilson	4629f423f8	Revert svn 107892 (with changes to work with trunk). It caused a crash if a VLD result was not used (Radar 8355607). It should also fix pr7988, but I haven't verified that yet. llvm-svn: 112118	2010-08-26 00:13:36 +00:00
Bob Wilson	9392b0e960	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Jim Grosbach	0a84487fa7	Don't override the var from the enclosing scope. When doing copy/paste/modify, it's apparently rather important to remember the 'modify' bit... llvm-svn: 112075	2010-08-25 19:11:34 +00:00
Daniel Dunbar	a54a1b0edf	ARM/Thumb2: Fix a misselect in getARMCmp, when attempting to adjust a signed comparison that would overflow. - The other under/overflow cases can't actually happen because the immediates which would trigger them are legal (so we don't enter this code), but adjusted the style to make it clear the transform is always valid. llvm-svn: 112053	2010-08-25 16:58:05 +00:00
Eric Christopher	7a0d8c69cb	Do type checks before we bother to do everything else. llvm-svn: 112039	2010-08-25 08:43:57 +00:00
Eric Christopher	761e7fb605	Reorganize load mechanisms. Handle types in a little less fixed way. Fix some todos. No functional change. llvm-svn: 112031	2010-08-25 07:23:49 +00:00
Eric Christopher	15b182f4d4	Fix predicate and add a comment. llvm-svn: 111981	2010-08-24 22:34:11 +00:00
Eric Christopher	236ec8f3b5	Rework braindead conditionals I put in yesterday. llvm-svn: 111974	2010-08-24 22:07:27 +00:00
Eric Christopher	6c99ebf5b0	Fix thumb2 mode loads to have the correct operand ordering. Add a todo to fix this in the port. llvm-svn: 111973	2010-08-24 22:03:02 +00:00
Jim Grosbach	2eedb7949e	Add ARM heuristic for when to allocate a virtual base register for stack access. rdar://8277890&7352504 llvm-svn: 111968	2010-08-24 21:19:33 +00:00
Jim Grosbach	b77d67f318	Move enabling the local stack allocation pass into the target where it belongs. For now it's still a command line option, but the interface to the generic code doesn't need to know that. llvm-svn: 111942	2010-08-24 19:05:43 +00:00
Jim Grosbach	35b7c033d4	add ARM cmd line option to force always using virtual base regs when possible. Intended to help ease reproducing problems by increasing base register usage after heuristics for only using the when needed are in place. llvm-svn: 111930	2010-08-24 18:04:52 +00:00
Bill Wendling	2c64ba63a1	Add comments for what the condition code symbols mean. llvm-svn: 111889	2010-08-24 01:11:30 +00:00
Eric Christopher	46d3a56e5d	Update comment. llvm-svn: 111887	2010-08-24 01:10:52 +00:00
Eric Christopher	c0c00ca33f	Fix the opcode and the operands for the load instruction. llvm-svn: 111885	2010-08-24 01:10:04 +00:00
Eric Christopher	eb47692c22	Add register class hack that needs to go away, but makes it more obvious that it needs to go away. Use loadRegFromStackSlot where possible. Also, remember to update the value map. llvm-svn: 111883	2010-08-24 00:50:47 +00:00
Eric Christopher	9d4e471cc2	Add some more debugging code, make it more obvious that RegOffset is getting an address for an object and select some default values. llvm-svn: 111871	2010-08-24 00:07:24 +00:00
Eric Christopher	e3107d6283	Don't need the extra register here. llvm-svn: 111864	2010-08-23 23:28:04 +00:00
Eric Christopher	414501c511	Add some more "get address into register" code and a more TODOs/FIXMEs. llvm-svn: 111860	2010-08-23 23:14:31 +00:00
Eric Christopher	8d03b8a8ce	Add an ARMFunctionInfo member and use it. llvm-svn: 111854	2010-08-23 22:32:45 +00:00
Eric Christopher	00202ee329	Start getting ARM loads/address computation going. llvm-svn: 111850	2010-08-23 21:44:12 +00:00
Bob Wilson	9a511c07e4	Replace the arm.neon.vmovls and vmovlu intrinsics with vector sign-extend and zero-extend operations. llvm-svn: 111614	2010-08-20 04:54:02 +00:00
Eric Christopher	985d9e4ea8	Fix loop conditionals (MO.isDef() asserts that it's a reg) and move some constraints around. llvm-svn: 111594	2010-08-20 00:36:24 +00:00
Eric Christopher	d8e8a2945e	Add a couple of random comments. llvm-svn: 111592	2010-08-20 00:20:31 +00:00
Jim Grosbach	56e56323c8	Better handling of offsets on frame index references. rdar://8277890 llvm-svn: 111585	2010-08-19 23:52:25 +00:00
Jim Grosbach	8c58bd30dc	Add Thumb1 support for virtual frame indices. rdar://8277890 llvm-svn: 111533	2010-08-19 17:52:13 +00:00
Eric Christopher	a5d60c62b1	Silence warning. llvm-svn: 111518	2010-08-19 15:35:27 +00:00
Eric Christopher	0d274a0258	Add an AddOptionalDefs method and use it. llvm-svn: 111489	2010-08-19 00:37:05 +00:00
Bill Wendling	768d3b510c	Add the "isCompare" attribute to the defm instead of each individual instr. llvm-svn: 111481	2010-08-19 00:05:48 +00:00
Eric Christopher	8a70781cac	Remove extra header. llvm-svn: 111456	2010-08-18 23:38:16 +00:00
Jim Grosbach	dbfc2ce95d	Enable ARM base register reuse to local stack slot allocation. Whenever a new frame index reference to an object in the local block is seen, check if it's near enough to any previously allocaated base register to re-use. rdar://8277890 llvm-svn: 111443	2010-08-18 22:44:49 +00:00
Bill Wendling	ad2aa57774	Minor simplification. Gets rid of a needless temporary. llvm-svn: 111430	2010-08-18 21:32:07 +00:00
Jim Grosbach	e0e9b3013f	Add hook for re-using virtual base registers for local stack slot access. Nothing fancy, just ask the target if any currently available base reg is in range for the instruction under consideration and use the first one that is. Placeholder ARM implementation simply returns false for now. ongoing saga of rdar://8277890 llvm-svn: 111374	2010-08-18 17:57:37 +00:00
Bob Wilson	fb7eaff759	Expand ZERO_EXTEND operations for NEON vector types. Testcase from Nick Lewycky. llvm-svn: 111341	2010-08-18 01:45:52 +00:00
Jim Grosbach	3cf08661f4	Add materialization of virtual base registers for frame indices allocated into the local block. Resolve references to those indices to a new base register. For simplification and testing purposes, a new virtual base register is allocated for each frame index being resolved. The result is truly horrible, but correct, code that's good for exercising the new code paths. Next up is adding thumb1 support, which should be very simple. Following that will be adding base register re-use and implementing a reasonable ARM heuristic for when a virtual base register should be generated at all. llvm-svn: 111315	2010-08-17 22:41:55 +00:00
Jakob Stoklund Olesen	e2cbaf6ed7	Don't call tablegen'ed Predicate_* functions in the ARM target. llvm-svn: 111277	2010-08-17 20:39:04 +00:00
Jim Grosbach	62800a990b	80 column cleanup. llvm-svn: 111266	2010-08-17 18:39:16 +00:00
Jim Grosbach	c252ee2375	Add hook to examine an instruction referencing a frame index to determine whether to allocate a virtual frame base register to resolve the frame index reference in it. Implement a simple version for ARM to aid debugging. In LocalStackSlotAllocation, scan the function for frame index references to local frame indices and ask the target whether to allocate virtual frame base registers for any it encounters. Purely infrastructural for debug output. Next step is to actually allocate base registers, then add intelligent re-use of them. rdar://8277890 llvm-svn: 111262	2010-08-17 18:13:53 +00:00
Jim Grosbach	8995a1018c	explicitly handle no-op cases for clarity. Fixes clang warning. llvm-svn: 111260	2010-08-17 18:00:41 +00:00
Bob Wilson	942b10f511	Change ARM PKHTB and PKHBT instructions to use a shift_imm operand to avoid printing "lsl #0". This fixes the remaining parts of pr7792. Make corresponding changes for encoding/decoding these instructions. llvm-svn: 111251	2010-08-17 17:23:19 +00:00
Chris Lattner	72a364c107	fix emacs language spec's, patch by Edmund Grimley-Evans! llvm-svn: 111241	2010-08-17 16:20:04 +00:00
Bob Wilson	411dfad981	Allow more cases of undef shuffle indices and add tests for them. llvm-svn: 111226	2010-08-17 05:54:34 +00:00
Eric Christopher	09f757d4bc	Copy over some overridden MI wrappers for ARM fast-isel. This is where we're adding predicates and optional defs to the MachineInstrs. llvm-svn: 111222	2010-08-17 01:25:29 +00:00
Eric Christopher	663f49900d	Make arm fast-isel possible to enable via command line. llvm-svn: 111219	2010-08-17 00:46:57 +00:00

... 4 5 6 7 8 ...

3428 Commits