llvm-project

Commit Graph

Author	SHA1	Message	Date
Bruno Cardoso Lopes	006c9371a1	Fix 80-column and style llvm-svn: 139061	2011-09-03 00:46:40 +00:00
Bruno Cardoso Lopes	dbb40015ff	Tidy up some SSE/AVX convert intrinsics. Also add an AVX version of OptForSize pattern llvm-svn: 139060	2011-09-03 00:46:38 +00:00
Owen Anderson	40d756eacc	Fix a truly heinous bug in DAGCombine related to AssertZext. If we have a chain of zext -> assert_zext -> zext -> use, the first zext would get simplified away because of the later zext, and then the later zext would get simplified away because of the assert. The solution is to teach SimplifyDemandedBits that assert_zext demands all of the high bits of its input, rather than only those demanded by its users. No testcase because the only example I have manifests as llvm-gcc miscompiling LLVM, and I haven't found a smaller case that reproduces this problem. Fixes <rdar://problem/10063365>. llvm-svn: 139059	2011-09-03 00:26:49 +00:00
Jakob Stoklund Olesen	1f72dd40c7	Pseudo CMOV instructions don't clobber EFLAGS. The explanation about a 0 argument being materialized as xor is no longer valid. Rematerialization will check if EFLAGS is live before clobbering it. The code produced by X86TargetLowering::EmitLoweredSelect does not clobber EFLAGS. This causes one less testb instruction to be generated in the cmov.ll test case. llvm-svn: 139057	2011-09-02 23:52:55 +00:00
Jakob Stoklund Olesen	f08354d183	Check for EFLAGS live-out before clobbering it. It is only allowed to clobber EFLAGS at the end of a block if it isn't live-in to any successor. llvm-svn: 139056	2011-09-02 23:52:52 +00:00
Jakob Stoklund Olesen	d0c8a31c8b	Use existing function. llvm-svn: 139055	2011-09-02 23:52:49 +00:00
Jim Grosbach	34842ceb97	Thumb2 parsing and encoding for BXJ. llvm-svn: 139053	2011-09-02 23:43:09 +00:00
Jim Grosbach	a0d34d3b5e	Thumb2 parsing and encoding of B instruction. Tweak handling of IT blocks a bit to enable this. The differentiation between B and Bcc needs special sauce. llvm-svn: 139049	2011-09-02 23:22:08 +00:00
Jakob Stoklund Olesen	38019e3188	Remove unused variables. llvm-svn: 139047	2011-09-02 22:41:25 +00:00
Eli Friedman	f3dd6da7a8	Don't fast-isel for atomic load/store; some cases require extra handling missing from fast-isel. llvm-svn: 139044	2011-09-02 22:33:24 +00:00
Jim Grosbach	a216debb37	Thumb2 parsing and encoding for ASR. For other shift and rotate instructions, too. Tests for those forthcoming as I work my way through the ISA. llvm-svn: 139040	2011-09-02 21:28:54 +00:00
Andrew Trick	bbb226a827	Comment and clarifying assert. llvm-svn: 139036	2011-09-02 21:20:46 +00:00
Bill Wendling	73e6333ce1	No need to get fancy inserting a PHI node when the values are stored in stack slots. This fixes a bug where the number of nodes coming into the PHI node may not equal the number of predecessors. E.g., two or more landingpad instructions may require a PHI before reaching the eh.exception and eh.selector instructions. llvm-svn: 139035	2011-09-02 21:17:08 +00:00
Kevin Enderby	5b03f72292	Change X86 disassembly to print immediates values as signed by default. Special case those instructions that the immediate is not sign-extend. radr://8795217 llvm-svn: 139028	2011-09-02 20:01:23 +00:00
Jim Grosbach	05dec8b122	Tidy up. Formatting. llvm-svn: 139024	2011-09-02 18:46:15 +00:00
Bill Wendling	a336e70573	Update comments to reflect reality. llvm-svn: 139023	2011-09-02 18:43:33 +00:00
Jim Grosbach	0a57375a28	Tidy up. 80 columns. llvm-svn: 139022	2011-09-02 18:43:25 +00:00
Jim Grosbach	370e923434	Thumb2 parsing and encoding for AND (register). llvm-svn: 139021	2011-09-02 18:41:35 +00:00
Jakob Stoklund Olesen	97fe09ad2e	Simplify by using isFullCopy(). llvm-svn: 139019	2011-09-02 18:18:29 +00:00
Bill Wendling	4e1d018935	Revert r138826 until PR10834 can be fixed. llvm-svn: 139018	2011-09-02 18:15:04 +00:00
Jim Grosbach	2761155203	Thumb2 parsing and encoding for ADD (register). llvm-svn: 139017	2011-09-02 18:14:46 +00:00
Duncan Sands	5c04c62765	Darwin wants ctors/dtors to be ordered the other way round to linux. llvm-svn: 139015	2011-09-02 18:07:19 +00:00
Andrew Trick	31b941a60d	Enable SCEV-based unrolling by default. This changes loop unrolling to use the same mechanism for trip count computation as indvars. This is a stronger check that tends to unroll more loops. A very common side-effect is that many single iteration loops will be removed sooner. The real goal was simply to remove dependence on canonical IVs. x86 is break even. ARM performance changes to expect (+ is good): External/SPEC/CFP2000/183.equake/183.equake +13% SingleSource/Benchmarks/Dhrystone/fldry +21% MultiSource/Applications/spiff/spiff +3% SingleSource/Benchmarks/Stanford/Puzzle -14% The Puzzle regression is actually an improvement in loop optimization that defeats GVN: rdar://problem/10065079. llvm-svn: 139009	2011-09-02 17:26:28 +00:00
Jakub Staszak	59a1de1c9d	Return undef value (instead of arbitrary) for wrong or undef index in ConstantVector. llvm-svn: 139007	2011-09-02 17:01:40 +00:00
Jakub Staszak	057d423e4b	ConstantVector returns arbitrary value for the wrong index. This fixes PR10813. llvm-svn: 139006	2011-09-02 15:43:43 +00:00
Jakub Staszak	7470fb01d0	Compare type size instead of type _store_ size to make sure that BitCastInst will be valid. This fixes PR10820. llvm-svn: 139005	2011-09-02 14:57:37 +00:00
Kalle Raiskila	f5769c1070	Pass signed (not unsigned) 10 bit field to SPU 'ori' instruction. llvm-svn: 139004	2011-09-02 10:05:01 +00:00
Bill Wendling	5b49bb6bf5	Perform the upgrading of the old EH to the new EH in a more sane manner. Perform the upgrading in steps. * First, create a map of the invokes to the EH intrinsics. * Next, take that mapping and determine if the invoke's unwind destination has a single predecessor. If not, then create a new empty block to hold the new landingpad instruction. * Create a landingpad instruction into the uwnind destination. Fill it with the values from the old selector. Map the old intrinsic calls to the new landingpad values (there may be multiple landingpad instructions per instrinic call pairs). * Go through the old intrinsic calls, create a PHI node when necessary, and then replace their values with the new values from the landingpad instructions. * Delete all dead instructions. * ??? * Profit! llvm-svn: 138990	2011-09-02 01:30:08 +00:00
Owen Anderson	ed96b58bd2	Merge the ARM disassembler header into the implementation file, since it is not externally exposed. llvm-svn: 138982	2011-09-01 23:35:51 +00:00
Owen Anderson	03aadae01f	Fix 80 columns violations. llvm-svn: 138980	2011-09-01 23:23:50 +00:00
Dan Gohman	3767be9aee	Revert r131152, r129796, r129761. This code is currently considered to be unreliable on platforms which require memcpy calls, and it is complicating broader legalize cleanups. It is hoped that these cleanups will make memcpy byval easier to implement in the future. llvm-svn: 138977	2011-09-01 23:07:08 +00:00
Benjamin Kramer	6397051ece	Don't drop alignment info on local common symbols. - On COFF the .lcomm directive has an alignment argument. - On ELF we fall back to .local + .comm Based on a patch by NAKAMURA Takumi. Fixes PR9337, PR9483 and PR10128. llvm-svn: 138976	2011-09-01 23:04:27 +00:00
Eli Friedman	d7776ed030	Null-initialize to shut up -Wuninitialized warnings. llvm-svn: 138974	2011-09-01 22:27:41 +00:00
James Molloy	5ada2a7e27	Fix apparent build error caused by r138948 on certain versions of GCC with -Werror. Sorry for the inconvenience. llvm-svn: 138973	2011-09-01 22:01:14 +00:00
Bill Wendling	a3ba6d3b80	Reduce indentation. No functionality change. llvm-svn: 138968	2011-09-01 21:29:49 +00:00
Bill Wendling	bf8280ff27	Change worklist driven deletion to be an iterative process. Duncan noticed this! llvm-svn: 138967	2011-09-01 21:28:33 +00:00
Eli Friedman	71f5c2f158	Fix an issue with the IR sink pass found by inspection. (I'm not sure anyone is actually using this, but might as well fix it since I found the issue.) llvm-svn: 138965	2011-09-01 21:21:24 +00:00
Nick Lewycky	2e2d75fb97	Fix the build for us -Werror users. Remove broken emacs mode major notation marking a C++ file as C. No functionality change. llvm-svn: 138963	2011-09-01 21:09:04 +00:00
Eli Friedman	b053ac7e0c	Make isSafeToSpeculativelyExecute() return the right answer for some new instructions. Found by inspection; not sure what practical impact, if any, this has. llvm-svn: 138962	2011-09-01 21:03:03 +00:00
Jakob Stoklund Olesen	5dc87d0f4d	Permit remat of partial register defs when it is safe. An instruction may define part of a register where the other bits are undefined. In that case, it is safe to rematerialize the instruction. For example: %vreg2:ssub_0<def> = VLDRS <cp#0>, 0, pred:14, pred:%noreg, %vreg2<imp-def> The extra <imp-def> operand indicates that the instruction does not read the other parts of the virtual register, so a remat is safe. This patch simply allows multiple def operands for the virtual register. It is MI->readsVirtualRegister() that determines if we depend on a previous value so remat is impossible. llvm-svn: 138953	2011-09-01 18:27:51 +00:00
Jim Grosbach	f6d5d60f99	ARM 'rscs' mnemonic is carry-setting 'rsc', not 'rs' with a 'cs' condition code. llvm-svn: 138952	2011-09-01 18:22:13 +00:00
Bruno Cardoso Lopes	f61d1c072e	Fix vbroadcast matching logic to early unmatch if the node doesn't have only one use. Fix PR10825. llvm-svn: 138951	2011-09-01 18:15:06 +00:00
James Molloy	db4ce60328	Fix up r137380 based on post-commit review by Jim Grosbach. llvm-svn: 138948	2011-09-01 18:02:14 +00:00
Owen Anderson	35d240f9e8	t2Bcc is allowed to have a predicate without a preceding IT instruction. llvm-svn: 138946	2011-09-01 17:47:45 +00:00
Jakob Stoklund Olesen	e417273fce	Revert r138794, "Do not try to rematerialize a value from a partial definition." The problem is fixed for all register allocators by r138944, so this patch is no longer necessary. <rdar://problem/10032939> llvm-svn: 138945	2011-09-01 17:25:18 +00:00
Jakob Stoklund Olesen	6357fa2f06	Prevent remat of partial register redefinitions. An instruction that redefines only part of a larger register can never be rematerialized since the virtual register value depends on the old value in other parts of the register. This was fixed for the inline spiller in r138794. This patch fixes the problem for all register allocators, and includes a small test case. <rdar://problem/10032939> llvm-svn: 138944	2011-09-01 17:18:50 +00:00
Evan Cheng	90da66bb69	Teach MachineLICM reg pressure tracking code to deal with MVT::untyped. Sorry, I can't come up with a small test case. rdar://10043690 llvm-svn: 138934	2011-09-01 01:45:00 +00:00
Bill Wendling	a617c32745	Resubmit with fix. Properly remove the instructions except for landingpad, which should be removed only when its invokes are. llvm-svn: 138932	2011-09-01 01:28:11 +00:00
Bill Wendling	9f7cf20e60	Submitted this too early. llvm-svn: 138931	2011-09-01 01:18:33 +00:00
Bill Wendling	2d1f11f743	Don't DCE the landingpad instruction. The landingpad instruction can be removed only when its invokes are removed. llvm-svn: 138930	2011-09-01 01:16:58 +00:00
Andrew Trick	832a6a1909	PreRA scheduler should avoid cloning compares. Added canClobberReachingPhysRegUse() to handle a particular pattern in which a two-address instruction could be forced to interfere with EFLAGS, causing a compare to be unnecessarilly cloned. Fixes rdar://problem/5875261 llvm-svn: 138924	2011-09-01 00:54:31 +00:00
Bill Wendling	b9a899995d	Don't forget to add the landingpad and resume instructions to the InstructionList. This was found via a nightly build of 483.xalancbmk. llvm-svn: 138923	2011-09-01 00:50:20 +00:00
Jim Grosbach	1d3c137839	Thumb2 assembly parsing and encoding for ADD(immediate). llvm-svn: 138922	2011-09-01 00:28:52 +00:00
Chad Rosier	17847ae757	Fixup for functions that return a bool. llvm-svn: 138918	2011-08-31 23:49:05 +00:00
Jim Grosbach	e1995f2566	Static relocation model Thumb jump table interworking. Make sure the low bit of the PC is set when loading an address directly for jump tables in static relocation model. llvm-svn: 138912	2011-08-31 22:23:09 +00:00
Owen Anderson	4af0aa98d5	The asm parser currently selects the wrong encoding for non-conditional Thumb2 branches. However, this exposed a number of situations where the decoder was too permissive in allowing invalid instructions to decode successful. Specify additional fixed bits to close those gaps. llvm-svn: 138910	2011-08-31 22:00:41 +00:00
David Greene	7df940d660	Fix Size Typing Stores sizes as uint64_t to avoid possible truncation. llvm-svn: 138901	2011-08-31 21:34:20 +00:00
Jim Grosbach	99bc84662f	Thumb2 t2Bcc should encode as t2B when condition is 'always'. llvm-svn: 138898	2011-08-31 21:17:31 +00:00
Bruno Cardoso Lopes	a0d85139e5	Move more code around and duplicate AVX patterns: MOVHPS and MOVLPS llvm-svn: 138897	2011-08-31 21:15:32 +00:00
Bruno Cardoso Lopes	21a180367b	Move MOVAPS,MOVUPS patterns close to the instructions definition llvm-svn: 138896	2011-08-31 21:15:29 +00:00
Bruno Cardoso Lopes	941001312a	Remove "_Int" forms of MOVUPSmr and MOVAPSmr llvm-svn: 138895	2011-08-31 21:15:22 +00:00
Bill Wendling	770d0f0700	Make sure we aren't deleting the landingpad instruction. The landingpad instruction is required in the landing pad block. Because we're not deleting terminating instructions, the invoke may still jump to here (see Transforms/SCCP/2004-11-16-DeadInvoke.ll). Remove all uses of the landingpad instruction, but keep it around until code-gen can remove the basic block. llvm-svn: 138890	2011-08-31 20:55:20 +00:00
Owen Anderson	a455a0b1e7	Fix encoding for tBcc with immediate offset operand. llvm-svn: 138889	2011-08-31 20:26:14 +00:00
Eli Friedman	ae1acddb95	Misc cleanup; addresses Duncan's comments on r138877. llvm-svn: 138887	2011-08-31 20:13:26 +00:00
Owen Anderson	939cd21248	When performing instruction selection for LDR_PRE_IMM/LDRB_PRE_IMM, we still need to preserve the sign of the index. This fixes miscompilations of Quicksort in the nightly testsuite, and hopefully others as well. <rdar://problem/10046188> llvm-svn: 138885	2011-08-31 20:00:11 +00:00
Jim Grosbach	cfa9421e16	Remove FIXME. Thumb2 MOV instruction will use separate custom tricks. When we want encoding T3 (the wide encoding), we can explicitly check for that and twiddle the CanAcceptCarrySet accordingly. For now, just correctly handle encodings T1 and T2 when in Thumb2 mode. llvm-svn: 138879	2011-08-31 18:39:39 +00:00
Eli Friedman	e839ecb70b	Fill in type legalization for MERGE_VALUES in all the various cases. Patch by Micah Villmow. (No testcase because the issue only showed up in an out-of-tree backend.) llvm-svn: 138877	2011-08-31 18:36:04 +00:00
Owen Anderson	5c160fd243	Fix roundtripping of Thumb BL/BLX instructions with immediate offsets instead of labels. llvm-svn: 138874	2011-08-31 18:30:20 +00:00
Jim Grosbach	c61fc8f301	tBcc is OK to be predicated in Thumb2 outside of IT blocks (obviously). llvm-svn: 138873	2011-08-31 18:29:05 +00:00
Eli Friedman	7c3bdede25	Generic expansion for atomic load/store into cmpxchg/atomicrmw xchg; implements 64-bit atomic load/store for ARM. llvm-svn: 138872	2011-08-31 18:26:09 +00:00
Jim Grosbach	af8c3cc710	Thumb2 parsing and encoding for ADC(register). Also add instruction aliases for non-.w versions of SBC since they're the same. llvm-svn: 138871	2011-08-31 18:23:08 +00:00
Devang Patel	df060bc3c2	After r138010, subroutine type does not have context info. Update type verifier accordingly. This fixes ptype.exp gdb testsuite regressions. llvm-svn: 138869	2011-08-31 18:04:31 +00:00
Eli Friedman	1ccecbb9d3	64-bit atomic cmpxchg for ARM. llvm-svn: 138868	2011-08-31 17:52:22 +00:00
Akira Hatanaka	1fcf140ae3	Fix typo. Patch by Liu. llvm-svn: 138866	2011-08-31 17:49:04 +00:00
David Greene	cdef71f4f3	Compress Repeated Byte Output Emit a repeated sequence of bytes using .zero. This saves an enormous amount of asm file space for certain programs. llvm-svn: 138864	2011-08-31 17:30:56 +00:00
Jim Grosbach	6d606fbe14	Tweak Thumb1 ADD encoding selection a bit. When the destination register of an add immediate instruction is explicitly specified, encoding T1 is preferred, else encoding T2 is preferred. llvm-svn: 138862	2011-08-31 17:07:33 +00:00
Jakob Stoklund Olesen	cd893390f5	Put VMOVS widening under a command line option, off by default. It appears that our use of the imp-use and imp-def flags with sub-registers is not yet robust enough to support this. The failing test case is complicated, I am working on a reduction. <rdar://problem/10044201> llvm-svn: 138861	2011-08-31 17:00:02 +00:00
Rafael Espindola	6e31dfea35	Spelling and grammar fixes to problems found by Duncan. llvm-svn: 138858	2011-08-31 16:43:33 +00:00
Eli Friedman	635d9692b6	Make sure we don't crash when -miphoneos-version-min is specified on x86. Hopefully this will fix gcc testsuite failures. llvm-svn: 138856	2011-08-31 16:19:51 +00:00
Eric Christopher	72d1d5e193	Rework this conditional a bit. Patch by Sanjoy Das llvm-svn: 138853	2011-08-31 04:17:21 +00:00
Bruno Cardoso Lopes	9fc6b8be03	- Move all MOVSS and MOVSD patterns close to their definitions - Duplicate some store patterns to their AVX forms! - Catched a bug while restricting the patterns subtarget, fix it and update a testcase to check it properly llvm-svn: 138851	2011-08-31 03:04:20 +00:00
Bruno Cardoso Lopes	aa1daa63da	Remove unnecessary AVX checks llvm-svn: 138850	2011-08-31 03:04:14 +00:00
Bruno Cardoso Lopes	db520db514	Teach more places to use VMOVAPS,VMOVUPS instead of MOVAPS,MOVUPS, whenever AVX is enabled. llvm-svn: 138849	2011-08-31 03:04:09 +00:00
Evan Cheng	cb1e5bae4c	Fix (movhps load) lowering / pattern to match more cases. rdar://10050549 llvm-svn: 138848	2011-08-31 02:05:24 +00:00
Eli Friedman	2c7bb52f56	Some minor cleanups for r138845. llvm-svn: 138846	2011-08-31 00:41:05 +00:00
Eli Friedman	c3f9c4a852	Some 64-bit atomic operations on ARM. 64-bit cmpxchg coming next. llvm-svn: 138845	2011-08-31 00:31:29 +00:00
Rafael Espindola	a45c20b049	Remove the old tail duplication pass. It is not used and is unable to update ssa, so it has to be run really early in the pipeline. Any replacement should probably use the SSAUpdater. llvm-svn: 138841	2011-08-30 23:03:45 +00:00
Owen Anderson	2fa06a7226	Fix issues with disassembly of IT instructions involving condition codes other the EQ/NE. Discovered by roundtrip testing. llvm-svn: 138840	2011-08-30 22:58:27 +00:00
Owen Anderson	fdf3cd7f2b	Fix encoding of CBZ/CBNZ Thumb2 instructions with immediate offsets rather than labels. llvm-svn: 138837	2011-08-30 22:15:17 +00:00
Benjamin Kramer	58298f028c	Teach macho-dump to dump the uleb128s referred to by linkedit_data segments. llvm-svn: 138836	2011-08-30 22:10:58 +00:00
Owen Anderson	d16fb43b1f	Fix encoding of PC-relative Thumb1 LDR's when using immediate offsets instead of labels. llvm-svn: 138835	2011-08-30 22:10:03 +00:00
Owen Anderson	543c89fb15	Fix encoding of Thumb1 B instructions with immediate offsets, which is necessary for round-tripping. llvm-svn: 138834	2011-08-30 22:03:20 +00:00
Owen Anderson	1732c2ebf6	Clean up whitespace. llvm-svn: 138833	2011-08-30 21:58:18 +00:00
Bill Wendling	6470e07e20	Fix off-by-one error Benjamin noticed. llvm-svn: 138832	2011-08-30 21:23:24 +00:00
Owen Anderson	e316e5b2ad	Speculatively revert r138809 in an attempt to fix DragonEgg. llvm-svn: 138829	2011-08-30 21:11:06 +00:00
Bill Wendling	7a9c3033a4	Enable compact unwind info by default. This only applies to Darwin when CFI is disabled. llvm-svn: 138826	2011-08-30 20:54:11 +00:00
Jeffrey Yasskin	065c35726f	Fix C++0x narrowing errors when char is unsigned. In the case of EDInstInfo, this would actually cause a bug when -1 became 255 and was then compared >=0 in llvm-mc/Disassembler.cpp. llvm-svn: 138825	2011-08-30 20:53:29 +00:00
Rafael Espindola	94d3253626	Adds support for variable sized allocas. For a variable sized alloca, code is inserted to first check if the current stacklet has enough space. If so, space is allocated by simply decrementing the stack pointer. Otherwise a runtime routine (__morestack_allocate_stack_space in libgcc) is called which allocates the required memory from the heap. Patch by Sanjoy Das. llvm-svn: 138818	2011-08-30 19:47:04 +00:00
Rafael Espindola	3353017668	Adds a SelectionDAG node X86SegAlloca which will be custom lowered from DYNAMIC_STACKALLOC. Two new pseudo instructions (SEG_ALLOCA_32 and SEG_ALLOCA_64) which will match X86SegAlloca (based on word size) are also added. They will be custom emitted to inject the actual stack handling code. Patch by Sanjoy Das. llvm-svn: 138814	2011-08-30 19:43:21 +00:00
Rafael Espindola	c21742112b	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. This new prologues generated will call a libgcc routine (__morestack) to allocate more stack space from the heap when required Patch by Sanjoy Das. llvm-svn: 138812	2011-08-30 19:39:58 +00:00

1 2 3 4 5 ...

49493 Commits