llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Jakob Stoklund Olesen	a648c6a757	Teach VirtRegRewriter to handle spilling in instructions that have multiple definitions of the virtual register. This happens when spilling the registers produced by REG_SEQUENCE: %reg1047:5<def>, %reg1047:6<def>, %reg1047:7<def> = VLD3d8 %reg1033, 0, pred:14, pred:%reg0 The rewriter would spill the register multiple times, dead store elimination tried to keep up, but ended up cutting the branch it was sitting on. llvm-svn: 104321	2010-05-21 16:36:13 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Dan Gohman	ee2fea3cd7	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Dan Gohman	20fab456da	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Jakob Stoklund Olesen	e11cdf8cc8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Bob Wilson	f070b1b571	Testcase to go with 104141. llvm-svn: 104142	2010-05-19 18:58:37 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cmp + zext into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Jakob Stoklund Olesen	430b6e40ab	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	1e4f55200d	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Evan Cheng	f2c9a96f3c	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEFs, replace it with an IMPLICIT_DEF rather than deleting it, or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Evan Cheng	29c463862e	Be careful with reg_sequence coalescing not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Jakob Stoklund Olesen	176a9c4272	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Jakob Stoklund Olesen	132668102e	Keep track of the last place a live virtreg was used. This allows us to add accurate kill markers, something the scavenger likes. Add some more tests from ARM that needed this. llvm-svn: 103521	2010-05-11 23:24:45 +00:00
Evan Cheng	2fa5a7e7e4	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Duncan Sands	ebf838274f	Correct some bogus target triples. llvm-svn: 103265	2010-05-07 17:03:48 +00:00
Jim Grosbach	245b169212	fix copy/paste oops. llvm-svn: 103122	2010-05-05 21:07:46 +00:00
Jim Grosbach	44d7f49887	Add tests for ARMV7M divide instruction use llvm-svn: 103120	2010-05-05 20:47:15 +00:00
Jim Grosbach	e36cd72e38	remove unneeded underscores. llvm-svn: 103114	2010-05-05 19:55:58 +00:00
Jim Grosbach	5ced648ba8	Convert to filecheck llvm-svn: 103113	2010-05-05 19:41:11 +00:00
Dan Gohman	0553acff5e	Fix tests to use fadd, fsub, and fmul, instead of add, sub, and mul, when the type is floating-point. llvm-svn: 102969	2010-05-03 22:36:46 +00:00
Dan Gohman	2ad68de4aa	Fix a bug which prevented tail merging of return instructions in beneficial cases. See the changes in test/CodeGen/X86/tail-opts.ll and test/CodeGen/ARM/ifcvt2.ll for details. The fix is to change HashEndOfMBB to hash at most one instruction, instead of trying to apply heuristics about when it will be profitable to consider more than one instruction. The regular tail-merging heuristics are already prepared to handle the same cases, and they're more precise. Also, make test/CodeGen/ARM/ifcvt5.ll and test/CodeGen/Thumb2/thumb2-branch.ll slightly more complex so that they continue to test what they're intended to test. And, this eliminates the problem in test/CodeGen/Thumb2/2009-10-15-ITBlockBranch.ll, the testcase from PR5204. Update it accordingly. llvm-svn: 102907	2010-05-03 14:35:47 +00:00
Duncan Sands	211427bda9	Remove the -enable-sjlj-eh option, which doesn't do anything. Remove the -enable-eh option which is only used by the JIT, and replace it with -jit-enable-eh. llvm-svn: 102865	2010-05-02 15:36:26 +00:00
Jim Grosbach	825cb299cd	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Bob Wilson	92a4685dd2	Fix tests for Neon load/store intrinsics to match the i8* types expected by the intrinsics. The reason for those i8* types is that the intrinsics are overloaded on the vector type and we don't have a way to declare an intrinsic where one argument is an overloaded vector type and another argument is a pointer to the vector element type. The bitcasts added here will match what the frontend will typically generate when these intrinsics are used. llvm-svn: 101840	2010-04-20 00:17:16 +00:00
Nick Lewycky	fbe8d2803d	Fix declarations in a few more tests. llvm-svn: 101676	2010-04-17 21:29:25 +00:00
Dan Gohman	4fee6f3bdd	Start function numbering at 0. llvm-svn: 101638	2010-04-17 16:29:15 +00:00
Jakob Stoklund Olesen	b642a27525	Fix PR6847. RegScavenger should ignore DebugValues. llvm-svn: 101392	2010-04-15 20:28:39 +00:00
Chris Lattner	f9b2e3c68a	add a simple dag combine to replace trivial shl+lshr with and. This happens with the store->load narrowing stuff. llvm-svn: 101348	2010-04-15 05:28:43 +00:00
Bob Wilson	c05b887c84	Don't custom lower bit converts to ARM VMOVDRRD or VMOVDRR when the operand does not have a legal type. The legalizer does not know how to handle those nodes. Radar 7854640. llvm-svn: 101282	2010-04-14 20:45:23 +00:00
Bob Wilson	699bdf7adf	Handle a v2f64 formal parameter that is split between registers and memory such that the entire second half is in memory. Radar 7855014. llvm-svn: 101181	2010-04-13 22:03:22 +00:00
Bob Wilson	030591320d	Add a testcase for svn r100568. llvm-svn: 100876	2010-04-09 18:29:29 +00:00
Dale Johannesen	f118f9788b	Split big test into multiple directories to cater to those who don't build all targets. llvm-svn: 100688	2010-04-07 20:43:35 +00:00
Jim Grosbach	71fcb4fedd	switch the flag for using NEON for SP floating point to a subtarget 'feature'. Re-commit. This time complete with testsuite updates. llvm-svn: 99570	2010-03-25 23:47:34 +00:00
Bob Wilson	162242b63b	pr6652: Use LDM to restore PC to the return address on ARMv4. Patch by John Tytgat! llvm-svn: 99096	2010-03-20 22:20:40 +00:00
Johnny Chen	8f3004cff2	Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98745	2010-03-17 17:52:21 +00:00
Bob Wilson	1b4e8cc69c	--- Reverse-merging r98637 into '.': U test/CodeGen/ARM/tls2.ll U test/CodeGen/ARM/arm-negative-stride.ll U test/CodeGen/ARM/2009-10-30.ll U test/CodeGen/ARM/globals.ll U test/CodeGen/ARM/str_pre-2.ll U test/CodeGen/ARM/ldrd.ll U test/CodeGen/ARM/2009-10-27-double-align.ll U test/CodeGen/Thumb2/thumb2-strb.ll U test/CodeGen/Thumb2/ldr-str-imm12.ll U test/CodeGen/Thumb2/thumb2-strh.ll U test/CodeGen/Thumb2/thumb2-ldr.ll U test/CodeGen/Thumb2/thumb2-str_pre.ll U test/CodeGen/Thumb2/thumb2-str.ll U test/CodeGen/Thumb2/thumb2-ldrh.ll U utils/TableGen/TableGen.cpp U utils/TableGen/DisassemblerEmitter.cpp D utils/TableGen/RISCDisassemblerEmitter.h D utils/TableGen/RISCDisassemblerEmitter.cpp U Makefile.rules U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/Makefile U lib/Target/ARM/AsmPrinter/ARMInstPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMAsmPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMInstPrinter.h D lib/Target/ARM/Disassembler U lib/Target/ARM/ARMInstrFormats.td U lib/Target/ARM/ARMAddressingModes.h U lib/Target/ARM/Thumb2ITBlockPass.cpp llvm-svn: 98640	2010-03-16 16:59:47 +00:00
Johnny Chen	3d9327bd06	Initial ARM/Thumb disassembler check-in. It consists of a tablegen backend (RISCDisassemblerEmitter) which emits the decoder functions for ARM and Thumb, and the disassembler core which invokes the decoder function and builds up the MCInst based on the decoded Opcode. Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98637	2010-03-16 16:36:54 +00:00
Bob Wilson	298a83ecfe	Stop using the old pre-UAL syntax for LDM/STM instruction suffixes. This does not move entirely to UAL syntax, since the default "increment after" suffix is empty but we still use "IA" for that. llvm-svn: 98635	2010-03-16 16:19:07 +00:00
Anton Korobeynikov	79a7c7823d	Fix typo llvm-svn: 98506	2010-03-14 18:42:52 +00:00
Anton Korobeynikov	846a117892	Feature test for half precision FP. llvm-svn: 98504	2010-03-14 18:42:43 +00:00
Chris Lattner	9efbbcbe45	fix AsmPrinter::GetBlockAddressSymbol to always return a unique label instead of trying to form one based on the BB name (which causes collisions if the name is empty). This fixes PR6608 llvm-svn: 98495	2010-03-14 17:53:23 +00:00
Evan Cheng	80ad113731	Enable machine cse pass. llvm-svn: 98132	2010-03-10 03:07:41 +00:00
Anton Korobeynikov	bf16a17fc1	Initial bits of ARMv4-only support. Patch by John Tytgat! llvm-svn: 97886	2010-03-06 19:39:36 +00:00
Bob Wilson	749ba9a7d5	pr6478: The frame pointer spill frame index is only defined when there is a frame pointer. llvm-svn: 97755	2010-03-04 21:42:36 +00:00
Bob Wilson	cf6e29a818	pr6480: Don't try producing ld/st-multiple instructions when the address is an undef value. This is only going to come up for bugpoint-reduced tests -- correct programs will not access memory at undefined addresses -- so it's not worth the effort of doing anything more aggressive. llvm-svn: 97745	2010-03-04 21:04:38 +00:00
Bob Wilson	ba8ac74fd9	Check for comparisons of +/- zero when optimizing less-than-or-equal and greater-than-or-equal SELECT_CCs to NEON vmin/vmax instructions. This is only allowed when UnsafeFPMath is set or when at least one of the operands is known to be nonzero. llvm-svn: 97065	2010-02-24 22:15:53 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Bob Wilson	70aa8d0745	Fix pr6111: Avoid using the LR register for the target address of an indirect branch in ARM v4 code, since it gets clobbered by the return address before it is used. Instead of adding a new register class containing all the GPRs except LR, just use the existing tGPR class. llvm-svn: 96360	2010-02-16 17:24:15 +00:00
Dan Gohman	45774ce0ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Bob Wilson	0f52d0c074	Delete dead PHI machine instructions. These can be created due to type legalization even when the IR-level optimizer has removed dead phis, such as when the high half of an i64 value is unused on a 32-bit target. I had to adjust a few test cases that had dead phis. This is a partial fix for Radar 7627077. llvm-svn: 95816	2010-02-10 22:58:57 +00:00
Chris Lattner	ae67ca33ed	convert to filecheck. llvm-svn: 95608	2010-02-08 23:47:34 +00:00
Evan Cheng	ea5c6be766	Run codegen dce pass for all targets at all optimization levels. Previously it's only run for x86 with fastisel. I've found it being very effective in eliminating some obvious dead code as result of formal parameter lowering especially when tail call optimization eliminated the need for some of the loads from fixed frame objects. It also shrinks a number of the tests. A couple of tests no longer make sense and are now eliminated. llvm-svn: 95493	2010-02-06 09:07:11 +00:00
Anton Korobeynikov	25df248382	Fix a gross typo: ARMv6+ may or may not support unaligned memory operations. Even if they are supported by the core, they can be disabled (this is just a configuration bit inside some register). Allow unaligned memops on darwin and conservatively disallow them otherwise. llvm-svn: 94889	2010-01-30 14:08:12 +00:00
Chris Lattner	b657c4cdc3	emit jump table and alias ".set" directives through MCStreamer as assignments. .set x, a-b is the same as: x = a-b llvm-svn: 94596	2010-01-26 21:53:08 +00:00
Rafael Espindola	dcb03f0f6b	Emit .comm alignment in bytes but .align in powers of 2 for ARM ELF. Original patch by Sandeep Patel and updated by me. llvm-svn: 94582	2010-01-26 20:21:43 +00:00
Rafael Espindola	4cb52db485	Update test for darwin. llvm-svn: 94421	2010-01-25 15:32:10 +00:00
Rafael Espindola	a1141dd6ab	Fix PR6134. We are not emitting alignments on Darwin for "bar". Not sure what is the correct way to do it. llvm-svn: 94400	2010-01-25 02:27:39 +00:00
Dan Gohman	045f81981a	Revert LoopStrengthReduce.cpp to pre-r94061 for now. llvm-svn: 94123	2010-01-22 00:46:49 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework that allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Evan Cheng	4668a3b935	Test case for r93758. llvm-svn: 93824	2010-01-19 00:35:20 +00:00
Bob Wilson	9349437c65	The Neon "vtst" instruction takes a suffix that is the element size alone -- adding an "i" to the suffix, indicating that the elements are integers, is accepted but not part of the standard syntax. This helps us pass a few more of the Neon tests from gcc. llvm-svn: 93677	2010-01-17 06:35:17 +00:00
Bob Wilson	298cdac99c	Run the pre-register allocation tail duplication pass by default. Remove the -pre-regalloc-taildup command-line option, and add a new -disable-early-taildup option. llvm-svn: 93597	2010-01-16 00:29:50 +00:00
Chris Lattner	25d8ed3773	remove uses of deprecated functions, this generates slightly different BlockAddress labels, but nothing semantically important. Add a FIXME that BlockAddress codegen is broken if the LLVM BB has an empty name (e.g. strip was run). llvm-svn: 93303	2010-01-13 07:30:49 +00:00
Dan Gohman	fb4193625a	Delete useless trailing semicolons. llvm-svn: 92740	2010-01-05 17:55:26 +00:00
Chris Lattner	5967840a5f	Make this more likely to generate a libcall. llvm-svn: 92387	2010-01-01 03:26:51 +00:00
Bob Wilson	3152b0471b	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Evan Cheng	0c2544fd6b	- Support inline asm 'w' constraint for 128-bit vector types. - Also support the 'q' NEON registers asm code. llvm-svn: 90894	2009-12-08 23:06:22 +00:00
Bob Wilson	0bbd3077ce	Recognize canonical forms of vector shuffles where the same vector is used for both source operands. In the canonical form, the 2nd operand is changed to an undef and the shuffle mask is adjusted to only reference elements from the 1st operand. Radar 7434842. llvm-svn: 90417	2009-12-03 06:40:55 +00:00
Evan Cheng	1d31fc9123	Fix PR5614: parts of a physical register def may be killed the rest. llvm-svn: 90180	2009-12-01 00:44:45 +00:00
Anton Korobeynikov	2522908653	Materialize global addresses via movt/movw pair, this is always better than doing the same via constpool: 1. Load from constpool costs 3 cycles on A9, movt/movw pair - just 2. 2. Load from constpool might stall up to 300 cycles due to cache miss. 3. Movt/movw does not use load/store unit. 4. Less constpool entries => better compiler performance. This is only enabled on ELF systems, since darwin does not have needed relocations (yet). llvm-svn: 89720	2009-11-24 00:44:37 +00:00
Jim Grosbach	dbb4140f37	move fconst[sd] to UAL. <rdar://7414913> llvm-svn: 89700	2009-11-23 21:08:25 +00:00
Edward O'Callaghan	21d7e8aeb1	Convert ARM tests to FileCheck for PR5307. llvm-svn: 89593	2009-11-22 14:23:33 +00:00
Edward O'Callaghan	8966897524	Forgot to alter RUN line when converting to FileCheck. llvm-svn: 89588	2009-11-22 13:09:48 +00:00
Edward O'Callaghan	7150767800	Fix for bad FileCheck converts in revision 89584. llvm-svn: 89586	2009-11-22 12:50:05 +00:00
Edward O'Callaghan	15dd46215e	Convert a few tests to FileCheck for PR5307. llvm-svn: 89584	2009-11-22 11:45:44 +00:00
Jim Grosbach	e09e95b35c	Revert 89562. We're being sneakier than I was giving us credit for, and this isn't necessary. llvm-svn: 89568	2009-11-21 23:34:09 +00:00
Jim Grosbach	43fd822249	Darwin requires a frame pointer for all non-leaf functions to support correct backtraces. llvm-svn: 89562	2009-11-21 21:40:08 +00:00
Evan Cheng	bdb43a9d99	Remat VLDRD from constpool. Clean up some instruction property specifications. llvm-svn: 89478	2009-11-20 19:57:15 +00:00
Evan Cheng	81a2851bcb	Fix codegen of conditional move of immediates. We were not making use of the immediate forms of cmov instructions at all. llvm-svn: 89423	2009-11-20 00:54:03 +00:00
Bob Wilson	6456fb94f5	Fix buildbots. llvm-svn: 89274	2009-11-18 23:30:38 +00:00
Bob Wilson	108aadf972	Tail duplication still needs to iterate. Duplicating new instructions onto the tail of a block may make that block a new candidate for duplication. llvm-svn: 89264	2009-11-18 22:52:37 +00:00
Anton Korobeynikov	a2873f4d59	Forgot to commit test fixes llvm-svn: 89138	2009-11-17 20:38:36 +00:00
Jim Grosbach	01c1cae34d	Detect need for autoalignment of the stack earlier to catch spills more conservatively. eliminateFrameIndex() machinery adjust to handle addr mode 6 (vld1/vst1) used for spills. Fix tests to expect aligned Q-reg spilling llvm-svn: 88874	2009-11-15 21:45:34 +00:00
Evan Cheng	6ad7da96fe	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
Evan Cheng	e3b312fec9	Add radar number. llvm-svn: 88739	2009-11-14 02:11:32 +00:00
Evan Cheng	d2c10508cd	Fix PR5412: Fix an inverted check and another missing sub-register check. llvm-svn: 88738	2009-11-14 02:09:09 +00:00
Evan Cheng	78fa302e7d	Fix PR5411. Bug in UpdateKills. A reg def partially defines its super-registers. llvm-svn: 88719	2009-11-13 23:16:41 +00:00
Evan Cheng	d190b8216f	Fix PR5410: LiveVariables lost subreg def: D0<def,dead> = ... ... = S0<use, kill> S0<def> = ... ... D0<def> = The first D0 def is correctly marked dead, however, livevariables should have added an implicit def of S0 or we end up with a use without a def. llvm-svn: 88690	2009-11-13 20:36:40 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Anton Korobeynikov	226467d6a6	It turns out that the testcase in question uncovered a subreg-handling bug. Add an assert in asmprinter to catch such cases and xfail the tests. PR is to be filed. llvm-svn: 86375	2009-11-07 15:20:32 +00:00
Anton Korobeynikov	9df3acf486	Honour subreg machine operands during asmprinting llvm-svn: 86303	2009-11-06 23:45:15 +00:00
Bob Wilson	d95ccd6c4d	Print VMOV (immediate) operands as hexadecimal values. Apple's assembler will not accept negative values for these. LLVM's default operand printing sign extends values, so that valid unsigned values appear as negative immediates. Print all VMOV immediate operands as hex values to resolve this. Radar 7372576. llvm-svn: 86301	2009-11-06 23:33:28 +00:00
Evan Cheng	408aa56fb5	Remove ARMPCLabelIndex from ARMISelLowering. Use ARMFunctionInfo::createConstPoolEntryUId() instead. llvm-svn: 86294	2009-11-06 22:24:13 +00:00
Dan Gohman	98693a3ac2	Update these tests for the new label names. llvm-svn: 86192	2009-11-05 23:31:40 +00:00
Bob Wilson	90d0b82e12	Attempt again to fix buildbot failures: make expected output less specific and compile with -mtriple to specify *-apple-darwin targets. llvm-svn: 86081	2009-11-05 00:30:35 +00:00
Bob Wilson	e8ca96cf24	Fix broken test. llvm-svn: 86045	2009-11-04 20:04:11 +00:00
Bob Wilson	16f60b9216	Add test for ARM indirectbr codegen. llvm-svn: 86042	2009-11-04 19:25:34 +00:00
Evan Cheng	0410bced1c	fconsts / fconstd immediate should be preceded by #. llvm-svn: 85952	2009-11-03 21:59:33 +00:00
Evan Cheng	f42b5af549	Re-apply 85799. It turns out my code isn't buggy. llvm-svn: 85947	2009-11-03 21:40:02 +00:00
Evan Cheng	8d681f0471	Fix PR5367. QPR_8 is the super regclass of DPR_8 and SPR_8. llvm-svn: 85871	2009-11-03 05:52:54 +00:00
Anton Korobeynikov	fbe0256b23	Revert r85049, it is causing PR5367 llvm-svn: 85847	2009-11-03 00:24:48 +00:00
Evan Cheng	a8a58efc03	Revert 85799 for now. It might be breaking llvm-gcc driver. llvm-svn: 85827	2009-11-02 21:49:14 +00:00
Evan Cheng	2729543984	Initialize the machine LICM CSE map upon the first time an instruction is hoisted to the loop preheader. Add instructions which are already in the preheader block that may be common expressions of those that are hoisted out. This does get a few more instructions CSE'ed. llvm-svn: 85799	2009-11-02 08:09:49 +00:00
Evan Cheng	fb2d385221	Remove an irrelevant and poorly reduced test case. llvm-svn: 85794	2009-11-02 07:11:54 +00:00
Anton Korobeynikov	4d23754b14	Handle splats of undefs properly. This includes the testcase for PR5364 as well. llvm-svn: 85767	2009-11-02 00:12:06 +00:00
Anton Korobeynikov	8cce1eb6aa	64-bit FP loads & stores operate on both NEON and VFP pipelines. llvm-svn: 85765	2009-11-02 00:11:06 +00:00
Jim Grosbach	5cba8de2c8	vml[as].f32 cause stalls in following advanced SIMD instructions. Avoid using them for scalar floating point operations for now. llvm-svn: 85697	2009-10-31 22:57:36 +00:00
Jim Grosbach	0de95af62d	Update test to be more explicit about what instruction sequences are expected for each operation. llvm-svn: 85689	2009-10-31 21:52:58 +00:00
Jim Grosbach	8fe6fd702d	Expand 64-bit logical shift right inline llvm-svn: 85687	2009-10-31 21:42:19 +00:00
Jim Grosbach	624fcb286e	Expand 64-bit arithmetic shift right inline llvm-svn: 85685	2009-10-31 21:00:56 +00:00
Jim Grosbach	5d994048dd	Expand 64 bit left shift inline rather than using the libcall. For now, this is unconditional. Making it still use the libcall when optimizing for size would be a good adjustment. llvm-svn: 85675	2009-10-31 19:38:01 +00:00
Benjamin Kramer	7e06083a3a	Add missing colons for FileCheck. llvm-svn: 85674	2009-10-31 19:22:24 +00:00
Jim Grosbach	bf1cb1343f	Convert to FileCheck llvm-svn: 85673	2009-10-31 19:06:53 +00:00
Rafael Espindola	ab7c709f43	This fixes functions like void f (int a1, int a2, int a3, int a4, int a5,...) In ARMTargetLowering::LowerFormalArguments if the function has 4 or more regular arguments we used to set VarArgsFrameIndex using an offset of 0, which is only correct if the function has exactly 4 regular arguments. llvm-svn: 85590	2009-10-30 14:33:14 +00:00
Evan Cheng	4a609f3cef	Use fconsts and fconstd to materialize small fp constants. llvm-svn: 85362	2009-10-28 01:44:26 +00:00
Rafael Espindola	d92a3633e1	Add missing testcase. llvm-svn: 85266	2009-10-27 17:59:03 +00:00
Bob Wilson	d169e6c281	Fix the rest of the ARM failures by converting them to FileCheck. llvm-svn: 85208	2009-10-27 06:16:45 +00:00
Bob Wilson	04580c8307	Fix some more failures by converting to FileCheck. llvm-svn: 85207	2009-10-27 05:50:28 +00:00
Bob Wilson	e8d20795a3	Convert to FileCheck, fixing failure due to tab change in the process. llvm-svn: 85204	2009-10-27 05:30:47 +00:00
Evan Cheng	b9f3520660	Update tests. llvm-svn: 85050	2009-10-25 07:53:48 +00:00
Bob Wilson	9d763cc3f8	Revert 84843. Evan, this was breaking some of the if-conversion tests. llvm-svn: 84868	2009-10-22 16:52:21 +00:00
Evan Cheng	3615b9bef3	Move if-conversion before post-regalloc scheduling so the predicated instructions get scheduled properly. llvm-svn: 84843	2009-10-22 06:48:32 +00:00
Evan Cheng	0f55e9ce2e	Don't generate sbfx / ubfx with negative lsb field. Patch by David Conrad. llvm-svn: 84813	2009-10-22 00:40:00 +00:00
Evan Cheng	786b15fe12	Match more patterns to movt. llvm-svn: 84751	2009-10-21 08:15:52 +00:00
Anton Korobeynikov	a6faf60831	Fix the fneg(bitconvert(x)) => bitconvert(x ^ sign) transform, which is invalid for vector types. llvm-svn: 84683	2009-10-20 21:37:45 +00:00
Chris Lattner	0ed889521b	convert to filecheck syntax and make a lot more aggressive. llvm-svn: 84517	2009-10-19 18:27:56 +00:00
Chris Lattner	7ea0c35ba0	rename test llvm-svn: 84515	2009-10-19 18:18:07 +00:00
Evan Cheng	03da4dba59	Enable post-alloc scheduling for all ARM variants except for Thumb1. llvm-svn: 84249	2009-10-16 06:11:08 +00:00
Bob Wilson	3b51560ae4	Revise ARM inline assembly memory operands to require the memory address to be in a register. The previous use of ARM address mode 2 was completely arbitrary and inappropriate for Thumb. Radar 7137468. llvm-svn: 84022	2009-10-13 20:50:28 +00:00
Sandeep Patel	423e42b371	Add ARMv6T2 SBFX/UBFX instructions. Approved by Anton Korobeynikov. llvm-svn: 84009	2009-10-13 18:59:48 +00:00
Benjamin Kramer	258c7fa33a	Eliminate some redundant llvm-as calls. llvm-svn: 83837	2009-10-12 09:31:55 +00:00
Dan Gohman	50998f4584	Update this test; the code is the same but it gets counted as one fewer remat. llvm-svn: 83690	2009-10-09 23:31:04 +00:00
Bob Wilson	35b6173a17	Merge a bunch of NEON tests into larger files so they run faster. llvm-svn: 83667	2009-10-09 20:20:54 +00:00
Bob Wilson	6dd3b9ad58	Convert some ARM tests with lots of greps to use FileCheck. llvm-svn: 83651	2009-10-09 17:20:46 +00:00
Bob Wilson	e9b19f76cb	Commit one last NEON test to use FileCheck. That's all of them now! llvm-svn: 83617	2009-10-09 05:31:56 +00:00
Bob Wilson	24b84fecf2	Convert more NEON tests to use FileCheck. llvm-svn: 83616	2009-10-09 05:14:48 +00:00
Bob Wilson	84e7967fae	Add codegen support for NEON vst4lane intrinsics with 128-bit vectors. llvm-svn: 83600	2009-10-09 00:01:36 +00:00
Bob Wilson	c409030838	Add codegen support for NEON vst3lane intrinsics with 128-bit vectors. llvm-svn: 83598	2009-10-08 23:51:31 +00:00
Bob Wilson	b851eb356a	Add codegen support for NEON vst2lane intrinsics with 128-bit vectors. llvm-svn: 83596	2009-10-08 23:38:24 +00:00
Bob Wilson	1fd98d67e3	Convert more NEON tests to use FileCheck. llvm-svn: 83595	2009-10-08 23:33:03 +00:00
Bob Wilson	38ba47225a	Add codegen support for NEON vld4lane intrinsics with 128-bit vectors. Also fix some copy-and-paste errors in previous changes. llvm-svn: 83590	2009-10-08 22:53:57 +00:00
Bob Wilson	f448255063	Convert more NEON tests to use FileCheck. llvm-svn: 83587	2009-10-08 22:33:53 +00:00
Bob Wilson	cf54e934f8	Add codegen support for NEON vld3lane intrinsics with 128-bit vectors. llvm-svn: 83585	2009-10-08 22:27:33 +00:00
Anton Korobeynikov	222b86cd54	Use lower16 / upper16 imm modifiers to asmprint 32-bit imms split via movt/movw pair. llvm-svn: 83572	2009-10-08 20:43:22 +00:00
Bob Wilson	c2728f44a9	Add codegen support for NEON vld2lane intrinsics with 128-bit vectors. llvm-svn: 83568	2009-10-08 18:56:10 +00:00
Bob Wilson	7d94eb4722	Convert more NEON tests to use FileCheck. llvm-svn: 83528	2009-10-08 06:02:10 +00:00
Bob Wilson	b6b0ab6117	Add codegen support for NEON vst4 intrinsics with <1 x i64> vectors. llvm-svn: 83526	2009-10-08 05:18:18 +00:00
Bob Wilson	71387b4b2f	Add codegen support for NEON vst3 intrinsics with <1 x i64> vectors. llvm-svn: 83518	2009-10-08 00:28:28 +00:00
Bob Wilson	d4f5670096	Add codegen support for NEON vst2 intrinsics with <1 x i64> vectors. llvm-svn: 83513	2009-10-08 00:21:01 +00:00
Bob Wilson	32cc4ec304	Add codegen support for NEON vld4 intrinsics with <1 x i64> vectors. llvm-svn: 83508	2009-10-07 23:54:04 +00:00
Bob Wilson	d1de3b82ff	Convert more NEON tests to use FileCheck. llvm-svn: 83507	2009-10-07 23:47:21 +00:00
Bob Wilson	5ef3c6d9f4	Add codegen support for NEON vld3 intrinsics with <1 x i64> vectors. llvm-svn: 83506	2009-10-07 23:39:57 +00:00
Bob Wilson	763be1a248	Add codegen support for NEON vld2 intrinsics with <1 x i64> vectors. llvm-svn: 83502	2009-10-07 22:57:01 +00:00
Bob Wilson	6d850f294d	Convert more NEON tests to use FileCheck. llvm-svn: 83497	2009-10-07 22:30:19 +00:00
Bob Wilson	70f004d9e6	Convert test to FileCheck. llvm-svn: 83487	2009-10-07 20:51:42 +00:00
Bob Wilson	e7ef4a9a6b	Add codegen support for NEON vst4 intrinsics with 128-bit vectors. llvm-svn: 83486	2009-10-07 20:49:18 +00:00
Bob Wilson	23464866ad	Add codegen support for NEON vst3 intrinsics with 128-bit vectors. llvm-svn: 83484	2009-10-07 20:30:08 +00:00
Bob Wilson	3dcb5377ef	Add codegen support for NEON vst2 intrinsics with 128-bit vectors. llvm-svn: 83482	2009-10-07 18:47:39 +00:00
Bob Wilson	ab3a9474d6	Add codegen support for NEON vld4 intrinsics with 128-bit vectors. llvm-svn: 83479	2009-10-07 18:09:32 +00:00
Bob Wilson	6bbefc2f67	Add codegen support for NEON vld3 intrinsics with 128-bit vectors. llvm-svn: 83471	2009-10-07 17:24:55 +00:00
Bob Wilson	aa47a8d71a	Add tests for vld2 of 128-bit vectors. llvm-svn: 83468	2009-10-07 17:19:13 +00:00
Bob Wilson	3251776d1d	Update NEON struct names to match llvm-gcc changes. (This is not required for correctness but might help with sanity.) llvm-svn: 83415	2009-10-06 21:16:19 +00:00
Evan Cheng	4ad726b4be	Fix tests. llvm-svn: 83241	2009-10-02 06:53:57 +00:00
Evan Cheng	2dcee28a61	Move load / store multiple before post-alloc scheduling. llvm-svn: 83236	2009-10-02 04:57:15 +00:00
David Goodwin	1cc6dd97da	Remove neonfp attribute and instead set default based on CPU string. Add -arm-use-neon-fp to override the default. llvm-svn: 83218	2009-10-01 22:19:57 +00:00
David Goodwin	9a051a5922	Restore the -post-RA-scheduler flag as an override for the target specification. Remove -mattr for setting PostRAScheduler enable and instead use CPU string. llvm-svn: 83215	2009-10-01 21:46:35 +00:00
David Goodwin	17199b56b0	Remove -post-RA-schedule flag and add a TargetSubtarget method to enable post-register-allocation scheduling. By default it is off. For ARM, enable/disable with -mattr=+/-postrasched. Enable by default for cortex-a8. llvm-svn: 83122	2009-09-30 00:10:16 +00:00
David Goodwin	bef958c716	Post-RA regressions. llvm-svn: 83075	2009-09-29 17:10:26 +00:00
Evan Cheng	139c3dba53	Fix PR4687. Pre ARMv5te does not support ldrd / strd. Patch by John Tytgat. llvm-svn: 83058	2009-09-29 07:07:30 +00:00
Evan Cheng	e0c5313493	Coalescer should not delete extract_subreg, insert_subreg, and subreg_to_reg of physical registers. This is especially critical for the latter two since they start the live interval of a super-register. e.g. %D0<def> = INSERT_SUBREG %D0<undef>, %S0<kill>, 1 If this instruction is eliminated, the register scavenger will not be happy as D0 is not defined previously. This fixes PR5055. llvm-svn: 82968	2009-09-28 05:28:43 +00:00
Anton Korobeynikov	7c2b1e71c1	Use movt/movw pair to materialize 32 bit constants on ARMv6T2+. This should be better than single load from constpool. llvm-svn: 82948	2009-09-27 23:52:58 +00:00
Evan Cheng	cf2a9c9962	Remove this test. llvm-svn: 82869	2009-09-26 18:51:37 +00:00
Daniel Dunbar	ccde96e96b	"Update" tests for -disable-if-conversion removal. I think branch.ll should just be removed, but I XFAIL'd it for now. llvm-svn: 82847	2009-09-26 05:29:36 +00:00
Evan Cheng	d080f7bf26	Convert test to filecheck. llvm-svn: 82835	2009-09-26 02:41:17 +00:00
Evan Cheng	3872b3c13e	Flip -disable-post-RA-scheduler to -post-RA-scheduler. llvm-svn: 82803	2009-09-25 21:38:11 +00:00
Dan Gohman	48b185d6f7	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Bob Wilson	d60367c198	pr4926: ARM requires the stack pointer to be aligned, even for leaf functions. For the AAPCS ABI, SP must always be 4-byte aligned, and at any "public interface" it must be 8-byte aligned. For the older ARM APCS ABI, the stack alignment is just always 4 bytes. For X86, we currently align SP at entry to a function (e.g., to 16 bytes for Darwin), but no stack alignment is needed at other times, such as for a leaf function. After discussing this with Dan, I decided to go with the approach of adding a new "TransientStackAlignment" field to TargetFrameInfo. This value specifies the stack alignment that must be maintained even in between calls. It defaults to 1 except for ARM, where it is 4. (Some other targets may also want to set this if they have similar stack requirements. It's not currently required for PPC because it sets targetHandlesStackFrameRounding and handles the alignment in target-specific code.) The existing StackAlignment value specifies the alignment upon entry to a function, which is how we've been using it anyway. llvm-svn: 82767	2009-09-25 14:41:49 +00:00
Bob Wilson	6cd4aee5e9	Convert to FileCheck. llvm-svn: 82710	2009-09-24 20:23:02 +00:00
Evan Cheng	26ea28eb5f	Fix PR5024 with a big hammer: disable the double-def assertion in the scavenger. LiveVariables add implicit kills to correctly track partial register kills. This works well enough and is fairly accurate. But coalescer can make it impossible to maintain these markers. e.g. BL <ga:sss1>, %R0<kill,undef>, %S0<kill>, %R0<imp-def>, %R1<imp-def,dead>, %R2<imp-def,dead>, %R3<imp-def,dead>, %R12<imp-def,dead>, %LR<imp-def,dead>, %D0<imp-def>, ... ... %reg1031<def> = FLDS <cp#1>, 0, 14, %reg0, Mem:LD4[ConstantPool] ... %S0<def> = FCPYS %reg1031<kill>, 14, %reg0, %D0<imp-use,kill> When reg1031 and S0 are coalesced, the copy (FCPYS) will be eliminated and the implicit-kill of D0 is lost. In this case it's possible to move the marker to the FLDS. But in many cases, this is not possible. Suppose %reg1031<def> = FOO <cp#1>, %D0<imp-def> ... %S0<def> = FCPYS %reg1031<kill>, 14, %reg0, %D0<imp-use,kill> When FCPYS goes away, the definition of S0 is the "FOO" instruction. However, transferring the D0 implicit-kill to FOO doesn't work since it is the def of D0 itself. We need to fix this at another time by introducing a "kill" pseudo instruction to track liveness. Disabling the assertion is not ideal, but machine verifier is doing that job now. It's important to know double-def is not a miscomputation since it means a register should be free but it's not tracked as free. It's a performance issue instead. llvm-svn: 82677	2009-09-24 02:27:09 +00:00
Evan Cheng	262f86ed90	Fix PR5024. LiveVariables physical register defs should commit only after all of the defs are processed. Also fix an implicit_def propagation bug: an implicit_def of a physical register should be applied to uses of the sub-registers. llvm-svn: 82616	2009-09-23 06:28:31 +00:00
Evan Cheng	08d1e41c10	Fix PR5024. LiveVariables::FindLastPartialDef should return a set of sub-registers that were defined by the last partial def, not just a single sub-register. llvm-svn: 82535	2009-09-22 08:34:46 +00:00
Evan Cheng	0dfed43a5b	Fix a pasto. Also simplify for Bill's benefit. llvm-svn: 82505	2009-09-22 01:48:19 +00:00
Evan Cheng	255f416470	Clean up spill weight computation. Also some changes to give loop induction variable increment / decrement slightly higher priority. This has major impact on some micro-benchmarks. On MultiSource/Applications and spec tests, it's a minor win. It also reduces 256.bzip instruction count by 8%, 55 on 164.gzip on i386 / Darwin. llvm-svn: 82485	2009-09-21 21:12:25 +00:00
Evan Cheng	fccbd0afc6	Fix PR4986. "r1024 = insert_subreg r1024, undef, 2" cannot be turned in an implicit_def. Instead, it's an identity copy so it should be eliminated. Also make sure to update livevariable kill information. llvm-svn: 82436	2009-09-21 04:32:32 +00:00
Bob Wilson	0bf35c25fe	Convert more tests to FileCheck. llvm-svn: 81915	2009-09-15 20:58:02 +00:00
Sandeep Patel	f3369c22a7	Fix superreg use in ARMAsmPrinter. Approved by Anton Korobeynikov. llvm-svn: 81878	2009-09-15 17:53:11 +00:00
Anton Korobeynikov	6c89da7027	Define proper subreg sets for arm - this should fix bunch of subtle problems with subreg - superreg mapping and also fix PR4965. llvm-svn: 81657	2009-09-13 00:59:43 +00:00
Dan Gohman	b165c11021	Remove an unnecessary -f. llvm-svn: 81546	2009-09-11 18:41:06 +00:00
Dan Gohman	a080159a7c	Convert more tests to avoid llvm-as. llvm-svn: 81545	2009-09-11 18:36:27 +00:00
Bob Wilson	39f51320ca	Don't swap the operands of a subtraction when trying to create a post-decrement load/store. llvm-svn: 81464	2009-09-10 22:09:31 +00:00
Bob Wilson	a2e8333eed	Fix pr4939: Change FPCCToARMCC to translate SETOLE to ARMCC::LS. See the bug report for details. llvm-svn: 81397	2009-09-09 23:14:54 +00:00
Dan Gohman	c8054d90fb	Eliminate more uses of llvm-as and llvm-dis. llvm-svn: 81293	2009-09-09 00:09:15 +00:00
Anton Korobeynikov	7697d37777	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Anton Korobeynikov	59e2b8e894	Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and makes the code faster. llvm-svn: 81220	2009-09-08 15:22:32 +00:00
Daniel Dunbar	30e30587eb	Remove stale greps. llvm-svn: 80986	2009-09-04 05:07:52 +00:00
Bob Wilson	36d8c75eca	Convert tests to FileCheck. llvm-svn: 80983	2009-09-04 04:07:19 +00:00
Bob Wilson	e072f8eedb	Convert a test to FileCheck. llvm-svn: 80975	2009-09-04 00:32:31 +00:00
Evan Cheng	1b38952c99	References to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Anton Korobeynikov	f0da41c3e4	More missed vdup patterns llvm-svn: 80838	2009-09-02 21:21:28 +00:00
Bob Wilson	d7797754d4	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	39dc89b458	Fix incorrect declarations of intrinsics in this test. llvm-svn: 80705	2009-09-01 18:50:43 +00:00
Bob Wilson	ff69320427	Add test for vld{234}_lane instructions. llvm-svn: 80658	2009-09-01 04:27:10 +00:00
Bob Wilson	33b408a10f	Fix pr4843: When an instruction has multiple destination registers that are tied to different source registers, the TwoAddressInstructionPass needs to be smarter. Change it to check before replacing a source register whether that source register is tied to a different destination register, and if so, defer handling it until a subsequent iteration. llvm-svn: 80654	2009-09-01 04:18:40 +00:00
Jim Grosbach	f09e8d5497	SJLJ is arm/darwin only for now. force the triple for the test llvm-svn: 80651	2009-09-01 02:34:49 +00:00
Jim Grosbach	20eac92d88	Clean up LSDA name generation and use for SJLJ exception handling. This makes an egregious hack somewhat more palatable. Bringing the LSDA forward and making it a GV available for reference would be even better, but is beyond the scope of what I'm looking to solve at this point. Objective C++ code could generate function names that broke the previous scheme. This fixes that. llvm-svn: 80649	2009-09-01 01:57:56 +00:00
David Goodwin	c8985204d9	Don't mark a register live at an undef use. llvm-svn: 80621	2009-08-31 20:47:02 +00:00
Anton Korobeynikov	3681144bd8	Add missed pattern llvm-svn: 80502	2009-08-30 19:06:39 +00:00
Anton Korobeynikov	eab572a8ff	EXTRACT_VECTOR_ELEMENT can have result type different from element type. Remove the assertion and generalize the code for ARM NEON stuff. llvm-svn: 80498	2009-08-30 17:14:54 +00:00
Anton Korobeynikov	ece642a54c	Do not assert on too wide splats we don't support. llvm-svn: 80409	2009-08-29 00:08:18 +00:00
Anton Korobeynikov	cd41d07f29	Add missed extract_element pattern llvm-svn: 80408	2009-08-28 23:41:26 +00:00
Evan Cheng	43b9ca6f42	Let Darwin linker auto-synthesize stubs and lazy-pointers. This deletes a bunch of nasty code in ARM asm printer. llvm-svn: 80404	2009-08-28 23:18:09 +00:00
Evan Cheng	6da267de23	v4, v5 does not support sxtb / sxth. llvm-svn: 80322	2009-08-28 00:31:43 +00:00
Anton Korobeynikov	205cac837f	scalar_to_vector is fully legal now (implemented as subreg accesses) llvm-svn: 80249	2009-08-27 16:04:47 +00:00
Anton Korobeynikov	d0b0262edf	Ok, sometimes it's profitable to turn scalar_to_vector stuff into subreg access. Add a testcase. llvm-svn: 80246	2009-08-27 14:51:42 +00:00
Evan Cheng	7a37b1a2ca	Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which cannot fold any immediate offset. llvm-svn: 80191	2009-08-27 01:23:50 +00:00
Bob Wilson	faebdee4dd	Convert some more Neon tests to FileCheck. llvm-svn: 80120	2009-08-26 18:11:50 +00:00
Anton Korobeynikov	0f756b27ae	Expand scalar_to_vector - we don't have any isel logic for it now llvm-svn: 80107	2009-08-26 16:26:09 +00:00
David Goodwin	ae6bc8214a	Fixup register kills after scheduling. llvm-svn: 80002	2009-08-25 17:03:05 +00:00
Dan Gohman	0d4bbf2c4a	Remove obsolete -f flags. llvm-svn: 79992	2009-08-25 15:38:29 +00:00
Dale Johannesen	fbc9a2e33b	Split test into 3. llvm-svn: 79926	2009-08-24 17:51:19 +00:00
Eli Friedman	682d8c1881	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Bob Wilson	616335f6c1	Use CHECK-NEXT to make sure we're only getting one copy of each shuffle instruction. llvm-svn: 79702	2009-08-22 00:13:23 +00:00
Bob Wilson	a70623102e	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Bob Wilson	f73af72d30	Add some tests for vext.16 and vext.32. llvm-svn: 79638	2009-08-21 16:35:24 +00:00
Bob Wilson	51c7aa04ec	Remove Neon intrinsics for VZIP, VUZP, and VTRN. We will represent these as vector shuffles. Temporarily remove the tests for these operations until the new implementation is working. llvm-svn: 79579	2009-08-21 00:01:42 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bill Wendling	bae6b2cca3	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	d3fade656f	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	52d4e64711	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Jakob Stoklund Olesen	ffa73acfec	Refine EarlyClobber assert in register scavenger. It is legal for an inline asm operand to use an earlyclobber register if the use operand is tied to the earlyclobber operand. The issue is discussed here: http://gcc.gnu.org/ml/gcc/1999-04n/msg00431.html We should perhaps let only the machine code verifier worry about these finer details. EarlyClobber operands are not really interesting to the scavenger. This fixes PR4528 for the third time. llvm-svn: 79122	2009-08-15 18:16:58 +00:00
Jakob Stoklund Olesen	4af3c864bc	Don't setCalleeSavedInfoValid() until spills are inserted. In a naked function, the flag is never set and getPristineRegs() returns an empty list. That means naked functions are able to clobber callee saved registers, but that is the whole point of naked functions. This fixes PR4716. llvm-svn: 79096	2009-08-15 13:10:46 +00:00
Bob Wilson	4b35448360	Generate Neon VTBL and VTBX instructions from the corresponding intrinsics. llvm-svn: 78835	2009-08-12 20:51:55 +00:00
Chris Lattner	0c533d909a	now that these are in file-check format, we can merge them together into one bigger test (which runs faster) llvm-svn: 78672	2009-08-11 15:54:17 +00:00
Bob Wilson	8f5c447bfa	Convert more Neon tests to use FileCheck. llvm-svn: 78648	2009-08-11 05:51:19 +00:00
Bob Wilson	12842f9865	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Bob Wilson	741a9c7bf6	Use new EVT::vAny type to combine Neon intrinsics for VPADD. llvm-svn: 78632	2009-08-11 01:15:26 +00:00


735 Commits