llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	75afc7afe8	Remove dead code. llvm-svn: 148384	2012-01-18 10:10:28 +00:00
Nadav Rotem	3b8f0cc9fa	Fix a bug in the type-legalization of vector integers. When we bitcast one vector type to another, we must not bitcast the result if one type is widened while the other is promoted. llvm-svn: 148383	2012-01-18 08:33:18 +00:00
Pete Cooper	c52eeed310	Fix ISD::REG_SEQUENCE to accept physical registers and change TwoAddressInstructionPass to insert copies for any physical reg operands of the REG_SEQUENCE llvm-svn: 148377	2012-01-18 04:16:16 +00:00
Jim Grosbach	adcc938c46	Thumb2 load/store fixups don't set the thumb bit. Load/store instructions w/ a fixup to be relative a function marked as thumb don't use the low bit to specify thumb vs. non-thumb like interworking branches do, so don't set it when dealing with those fixups. rdar://10348687. llvm-svn: 148366	2012-01-18 00:40:25 +00:00
Jim Grosbach	3b50c9ec7f	Move some ARM specific MCAssmebler bits into the ARMAsmBackend. llvm-svn: 148364	2012-01-18 00:23:57 +00:00
Jakob Stoklund Olesen	f43b599550	Add a CoveredBySubRegs property to Register descriptions. When set, this bit indicates that a register is completely defined by the value of its sub-registers. Use the CoveredBySubRegs property to infer which super-registers are call-preserved given a list of callee-saved registers. For example, the ARM registers D8-D15 are callee-saved. This now automatically implies that Q4-Q7 are call-preserved. Conversely, Win64 callees save XMM6-XMM15, but the corresponding YMM6-YMM15 registers are not call-preserved because they are not fully defined by their sub-registers. llvm-svn: 148363	2012-01-18 00:16:39 +00:00
Jakob Stoklund Olesen	fdbb12b235	Implement ARMBaseRegisterInfo::getCallPreservedMask(). Move ARM callee-saved lists into ARMCallingConv.td. llvm-svn: 148357	2012-01-17 23:09:00 +00:00
Jim Grosbach	3fa6dcfebb	Fix MCJIT memory leak of owned TargetMachine. The JIT is expected to take ownership of the TM that's passed in. The MCJIT wasn't freeing it, resulting in leaks. llvm-svn: 148356	2012-01-17 23:08:46 +00:00
Jakob Stoklund Olesen	d51a710bde	Move X86 callee saved register lists to the X86CallConv .td file. Add a trivial implementation of the getCallPreservedMask() hook. llvm-svn: 148347	2012-01-17 22:47:01 +00:00
Jakub Staszak	173bce3d2b	Move includes to the .cpp file. llvm-svn: 148342	2012-01-17 22:16:31 +00:00
Jim Grosbach	4045507fea	MC tweak symbol difference resolution for non-local symbols. When the non-local symbol in the expression is in the same fragment as the second symbol, the assembler can still evaluate the expression without needing a relocation. For example, on ARM: _foo: ldr lr, (_foo - 4) rdar://10348687 llvm-svn: 148341	2012-01-17 22:14:39 +00:00
Devang Patel	c9ed518792	Intel syntax: Fix parser match class to check memory operand size. llvm-svn: 148338	2012-01-17 21:48:03 +00:00
Nadav Rotem	fb6ddee0e9	Transform: (EXTRACT_VECTOR_ELT( VECTOR_SHUFFLE )) -> EXTRACT_VECTOR_ELT. llvm-svn: 148337	2012-01-17 21:44:01 +00:00
Devang Patel	a7143b6a2b	Intel syntax: Parse "BYTE PTR [RDX + RCX]" llvm-svn: 148334	2012-01-17 21:25:10 +00:00
Dan Gohman	e7a243fea5	Add a new ObjC ARC optimization pass to eliminate unneeded autorelease push+pop pairs. llvm-svn: 148330	2012-01-17 20:52:24 +00:00
Dan Gohman	b9936296d3	Add a new PassManagerBuilder customization point, EP_ModuleOptimizerEarly, to allow passes to be added before the main ModulePass optimizers. llvm-svn: 148329	2012-01-17 20:51:32 +00:00
Devang Patel	2ed6718616	Untabify. llvm-svn: 148322	2012-01-17 19:09:22 +00:00
Devang Patel	8b39be79ad	Intel syntax: Do not unncessarily create plus expression for memory operand displacement. llvm-svn: 148321	2012-01-17 19:08:07 +00:00
Devang Patel	41b9ddeb7a	Intel syntax: Robustify memory operand parsing. llvm-svn: 148312	2012-01-17 18:00:18 +00:00
Manuel Klimek	85d26f9807	Removes template magic to build up containers. Instead, we now put the attributes of the container into members. llvm-svn: 148302	2012-01-17 09:34:07 +00:00
Nadav Rotem	86c3807b99	Fix warning. llvm-svn: 148301	2012-01-17 09:31:09 +00:00
Nadav Rotem	86e5390dbf	Fix 11769. In CanXFormVExtractWithShuffleIntoLoad we assumed that EXTRACT_VECTOR_ELT can be later handled by the DAGCombiner. However, in some cases on AVX, the EXTRACT_VECTOR_ELT is legalized to EXTRACT_SUBVECTOR + EXTRACT_VECTOR_ELT, which currently is not handled by the DAGCombiner. In this patch I added a check that we only extract from the XMM part. llvm-svn: 148298	2012-01-17 09:13:19 +00:00
Craig Topper	02cb0fb136	Teach DAG combiner to turn a BUILD_VECTOR of UNDEFs into an UNDEF of vector type. llvm-svn: 148297	2012-01-17 09:09:48 +00:00
Craig Topper	9cafcd8baa	Remove unnecessary AVX check from an assert. hasSSE2 is enough. llvm-svn: 148295	2012-01-17 08:23:44 +00:00
David Blaikie	a5708dc3a3	Provide better messages in llvm_unreachable. llvm-svn: 148293	2012-01-17 07:00:13 +00:00
Andrew Trick	7ccdc5c192	misched: Inital interface and implementation for ScheduleTopDownLive and ShuffleInstructions. llvm-svn: 148291	2012-01-17 06:55:07 +00:00
Andrew Trick	e1c034fefe	Renamed MachineScheduler to ScheduleTopDownLive. Responding to code review. llvm-svn: 148290	2012-01-17 06:55:03 +00:00
Andrew Trick	8093eac51d	Moving options declarations around. More short term hackery until we have a way to configure passes that work on LiveIntervals. llvm-svn: 148289	2012-01-17 06:54:59 +00:00
Andrew Trick	12728f04ca	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
Craig Topper	37b10ef250	Fix a crasher when PerformShiftCombine receives a BUILD_VECTOR of all UNDEF. Probably could use better handling in DAG combine or getNode. Fixes PR11772. llvm-svn: 148285	2012-01-17 04:44:50 +00:00
David Blaikie	b48ed1a4cb	Remove unreachable code. (replace with llvm_unreachable to help GCC where necessary) llvm-svn: 148284	2012-01-17 04:43:56 +00:00
Rafael Espindola	cbda0e255d	Add 148175 back. I am unable to reproduce any non determinism in a dragonegg or clang bootstrap. I will keep an eye on the bots. Original message: Only emit the Leh_func_endN symbol when needed. llvm-svn: 148283	2012-01-17 04:19:20 +00:00
Pete Cooper	e3d305a206	Changed flag operand of ISD::FP_ROUND to TargetConstant as it should not get checked for legalisation llvm-svn: 148275	2012-01-17 01:54:07 +00:00
Lang Hames	818e1ffd74	Fix typo in comment. llvm-svn: 148268	2012-01-17 00:39:29 +00:00
Jim Grosbach	06594e1018	Tidy up. llvm-svn: 148265	2012-01-16 23:50:58 +00:00
Jim Grosbach	0ddb3a4963	ExecutionEngine interface to re-map addresses for engines that support it. llvm-svn: 148264	2012-01-16 23:50:55 +00:00
Jim Grosbach	9df6cc8f4f	MCJIT handle a few more simple x86 relocations for MachO. llvm-svn: 148263	2012-01-16 23:50:49 +00:00
David Blaikie	486df738c3	Removing unused default switch cases in switches over enums that already account for all enumeration values explicitly. (This time I believe I've checked all the -Wreturn-type warnings from GCC & added the couple of llvm_unreachables necessary to silence them. If I've missed any, I'll happily fix them as soon as I know about them) llvm-svn: 148262	2012-01-16 23:24:27 +00:00
Hal Finkel	b1691ccaaa	Cleanup PPC RLWINM8 vs RLWINM No test case: output assembly will be identical. llvm-svn: 148261	2012-01-16 23:22:50 +00:00
Hal Finkel	8606e3c7e3	AggressiveAntiDepBreaker needs to skip debug values because a debug value does not have a corresponding SUnit llvm-svn: 148260	2012-01-16 22:53:41 +00:00
Jakob Stoklund Olesen	86ae07f049	Extract method for detecting constant unallocatable physregs. It is safe to move uses of such registers. llvm-svn: 148259	2012-01-16 22:34:08 +00:00
Jim Grosbach	eff0a40d7e	MCJIT support for non-function sections. Move to a by-section allocation and relocation scheme. This allows better support for sections which do not contain externally visible symbols. Flesh out the relocation address vs. local storage address separation a bit more as well. Remote process JITs use this to tell the relocation resolution code where the code will live when it executes. The startFunctionBody/endFunctionBody interfaces to the JIT and the memory manager are deprecated. They'll stick around for as long as the old JIT does, but the MCJIT doesn't use them anymore. llvm-svn: 148258	2012-01-16 22:26:39 +00:00
Stepan Dyatkovskiy	2931a59ec5	Fixed comment in loop-unswitch. llvm-svn: 148252	2012-01-16 20:48:04 +00:00
Jakob Stoklund Olesen	6de6d3e4ec	Give better scavenger errors by invoking the verifier. llvm-svn: 148251	2012-01-16 20:38:31 +00:00
Jakob Stoklund Olesen	374ed322f2	Add a new kind of MachineOperand: MO_RegisterMask. Register masks will be used as a compact representation of large clobber lists. Currently, an x86 call instruction has some 40 operands representing call-clobbered registers. That's more than 1kB of useless operands per call site. A register mask operand references a bit mask of call-preserved registers, everything else is clobbered. The bit mask will typically come from TargetRegisterInfo::getCallPreservedMask(). By abandoning ImplicitDefs for call-clobbered registers, it also becomes possible to share call instruction descriptions between calling conventions, and we can get rid of the WINCALL* instructions. This patch introduces the new operand kind. Future patches will add RegMask support to target-independent passes before finally the fixed clobber lists can be removed from call instruction descriptions. llvm-svn: 148250	2012-01-16 19:22:00 +00:00
Eli Friedman	206ca569aa	Make sure the non-SSE lowering for fences correctly clobbers EFLAGS. PR11768. llvm-svn: 148240	2012-01-16 16:42:21 +00:00
Eli Friedman	75e3db4c7a	Get rid of unused codegen-only instruction. llvm-svn: 148239	2012-01-16 16:29:35 +00:00
Craig Topper	db8890aedd	Give priority to AVX over SSE for 128-bit floating point unpck instructions. llvm-svn: 148233	2012-01-16 09:56:42 +00:00
Eli Bendersky	1b0cd0f1b1	A fix for the previous commit: "integer constant is too large for ‘long’ type" error on some 32-bit bots llvm-svn: 148232	2012-01-16 09:31:10 +00:00
Eli Bendersky	4c647587b1	Adding a basic ELF dynamic loader and MC-JIT for ELF. Functionality is currently basic and will be enhanced with future patches. Patch developed by Andy Kaylor and Daniel Malea. Reviewed on llvm-commits. llvm-svn: 148231	2012-01-16 08:56:09 +00:00
David Blaikie	5d8e42755c	Refactor variables unused under non-assert builds (& remove two entirely unused variables). llvm-svn: 148230	2012-01-16 05:17:39 +00:00
Pete Cooper	e85b95d754	Changed intrinsic ID operand to a target constant as its not used in any arithmetic so should not be checked in legalisation llvm-svn: 148228	2012-01-16 04:08:12 +00:00
Nadav Rotem	57935243bd	[AVX] Optimize x86 VSELECT instructions using SimplifyDemandedBits. We know that the blend instructions only use the MSB, so if the mask is sign-extended then we can convert it into a SHL instruction. This is a common pattern because the type-legalizer sign-extends the i1 type which is used by the LLVM-IR for the condition. Added a new optimization in SimplifyDemandedBits for SIGN_EXTEND_INREG -> SHL. llvm-svn: 148225	2012-01-15 19:27:55 +00:00
Benjamin Kramer	339ced4e34	Return an ArrayRef from ShuffleVectorSDNode::getMask and push it through CodeGen. llvm-svn: 148218	2012-01-15 13:16:05 +00:00
Benjamin Kramer	5a377e28da	DAGCombiner: Deduplicate code. llvm-svn: 148217	2012-01-15 11:50:43 +00:00
Stepan Dyatkovskiy	7ec12e431a	Cosmetic patch for r148215. llvm-svn: 148216	2012-01-15 09:45:11 +00:00
Stepan Dyatkovskiy	cb2adbacf8	Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to std::map, since we need to keep a valid pointer to properties of current loop. Message for r148132: LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148215	2012-01-15 09:44:07 +00:00
Chandler Carruth	da22f30e72	Remove SetWorkingDirectory from the Process interface. Nothing in LLVM or Clang is using this, and it would be hard to use it correctly given the thread hostility of the function. Also, it never checked the return which is rather dangerous with chdir. If someone was in fact using this, please let me know, as well as what the usecase actually is so that I can add it back and make it more correct and secure to use. (That said, it's never going to be "safe" per-se, but we could at least document the risks...) llvm-svn: 148211	2012-01-15 08:41:35 +00:00
David Blaikie	fdcd669bc6	Remove dead code. llvm-svn: 148206	2012-01-15 01:09:13 +00:00
Craig Topper	201c1a3505	Truncate of undef is just undef of smaller size. llvm-svn: 148205	2012-01-15 01:05:11 +00:00
Craig Topper	c10e1abaf3	Fix the memop type on a couple 256-bit AVX instructions that were using f128mem instead of f256mem. llvm-svn: 148196	2012-01-14 18:29:57 +00:00
Craig Topper	d78429f850	Add a bunch of AVX instructions to the folding tables. Also fixed the alignment on 256-bit AVX2 instructions. llvm-svn: 148194	2012-01-14 18:14:53 +00:00
Duncan Sands	90212bde1f	Speculatively revert commit 148175 (rafael), to see if this fixes non-determinism in the 32 bit dragonegg buildbot. Original commit message: Only emit the Leh_func_endN symbol when needed. llvm-svn: 148191	2012-01-14 17:16:48 +00:00
Andrew Trick	23ef0d6c40	Fix a corner case hit by redundant phi elimination running after LSR. Fixes PR11761: bad IR w/ redundant Phi elim llvm-svn: 148177	2012-01-14 03:17:23 +00:00
Rafael Espindola	dfde7631fa	Only emit the Leh_func_endN symbol when needed. llvm-svn: 148175	2012-01-14 02:36:51 +00:00
Andrew Trick	59ac4fb706	misched: Initial code for building an MI level scheduling DAG llvm-svn: 148174	2012-01-14 02:17:18 +00:00
Andrew Trick	dbee9d8900	Move physreg dependency generation into aptly named addPhysRegDeps. llvm-svn: 148173	2012-01-14 02:17:15 +00:00
Andrew Trick	1d028a364d	misched: Added ScheduleDAGInstrs::IsPostRA llvm-svn: 148172	2012-01-14 02:17:12 +00:00
Andrew Trick	7e120f4e66	misched: Invoke the DAG builder on each sequence of schedulable instructions. llvm-svn: 148171	2012-01-14 02:17:09 +00:00
Andrew Trick	6344087e17	Move things around to make the file navigable, even though it will probably be split up later. llvm-svn: 148170	2012-01-14 02:17:06 +00:00
Evan Cheng	6bb95253eb	After r147827 and r147902, it's now possible for unallocatable registers to be live across BBs before register allocation. This miscompiled 197.parser when a cmp + b are optimized to a cbnz instruction even though the CPSR def is live-in a successor. cbnz r6, LBB89_12 ... LBB89_12: ble LBB89_1 The fix consists of two parts. 1) Teach LiveVariables that some unallocatable registers might be liveouts so don't mark their last use as kill if they are. 2) ARM constantpool island pass shouldn't form cbz / cbnz if the conditional branch does not kill CPSR. rdar://10676853 llvm-svn: 148168	2012-01-14 01:53:46 +00:00
Chad Rosier	71a185c5c6	Fix pasto from r146196. llvm-svn: 148167	2012-01-14 01:50:21 +00:00
Dan Gohman	4cf362acc1	Fix an unused variable warning that Chad noticed. llvm-svn: 148164	2012-01-14 00:47:44 +00:00
Rafael Espindola	a693128778	Remove previous commit while I debug the bot failures. llvm-svn: 148156	2012-01-13 23:28:50 +00:00
Jakob Stoklund Olesen	35545421c8	Use RegisterTuples to generate pseudo-registers. The QQ and QQQQ registers are not 'real', they are pseudo-registers used to model some vld and vst instructions. This makes the call clobber lists longer, but I intend to get rid of those soon. llvm-svn: 148151	2012-01-13 22:55:42 +00:00
Rafael Espindola	cef42c30a7	Remove label that is not used anymore. llvm-svn: 148150	2012-01-13 22:41:58 +00:00
Eli Friedman	d476fdc392	Speculatively revert r148132+r148133 to try and fix a buildbot failure. llvm-svn: 148149	2012-01-13 22:34:39 +00:00
Andrew Trick	f35c84032d	Remove pointless mode line in .cpp file. llvm-svn: 148143	2012-01-13 22:04:16 +00:00
Devang Patel	7066d28043	Revert r148131, it was committed before it was ready. llvm-svn: 148134	2012-01-13 19:28:58 +00:00
Stepan Dyatkovskiy	0a920fa210	Cosmetic patch for r148132. llvm-svn: 148133	2012-01-13 19:27:22 +00:00
Stepan Dyatkovskiy	cbcbdb237f	LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148132	2012-01-13 19:13:54 +00:00
Devang Patel	7ecdc6d4f5	Refactor. llvm-svn: 148131	2012-01-13 19:12:18 +00:00
Craig Topper	e52d86a740	Convert SHUFPD with the same register for both sources to PSHUFD if it would prevent a register copy. Similar to SHUFPS, but requires the mask to be converted. llvm-svn: 148112	2012-01-13 09:21:41 +00:00
Craig Topper	b1c2ebf6ee	use v8i32 as optimal mem type over v8f32 if AVX2 is enabled. Similar to SSE2 vs SSE1. llvm-svn: 148109	2012-01-13 08:32:21 +00:00
Craig Topper	cb7e13d7c0	Make X86 instruction selection use 256-bit VPXOR for build_vector of all ones if AVX2 is enabled. This gives the ExeDepsFix pass a chance to choose FP vs int as appropriate. Also use v8i32 as the type for getZeroVector if AVX2 is enabled. This is consistent with SSE2 using prefering v4i32. llvm-svn: 148108	2012-01-13 08:12:35 +00:00
Craig Topper	9f14d9f939	Add patterns for v16i16 and v32i8 immAllZerosV to select VPXOR to match v4i64 and v8i32. llvm-svn: 148106	2012-01-13 06:59:47 +00:00
Andrew Trick	e77e84e4b7	Added the MachineSchedulerPass skeleton. llvm-svn: 148105	2012-01-13 06:30:30 +00:00
Andrew Trick	4d4fef238a	wrong filename llvm-svn: 148103	2012-01-13 06:30:22 +00:00
Andrew Trick	b1be1aa8f8	80-col violation llvm-svn: 148102	2012-01-13 06:30:19 +00:00
Craig Topper	a4c5a47b97	Use 8i32 constant pool entry for converting AVX2_SETALLONES. Possibly fixes PR11750. llvm-svn: 148101	2012-01-13 06:12:41 +00:00
Craig Topper	2aa07f832e	Fix typo in PerformAddCombine that caused any vector type to be checked for horizontal add/sub if AVX2 is enabled. This caused an assert to fail for non 128/256-bit vectors when done before type legalizing. Fixes PR11749. llvm-svn: 148096	2012-01-13 05:04:25 +00:00
Jakob Stoklund Olesen	dd8fbf572e	Delete CodeInit and CodeRecTy from TableGen. The code type was always identical to a string anyway. Now it is simply a synonym. The code literal syntax [{...}] is still valid. llvm-svn: 148092	2012-01-13 03:38:34 +00:00
Jakob Stoklund Olesen	9d1c5eeb32	Use uniqued StringInit pointers for lookups. This avoids a gazillion StringMap and dynamic_cast calls, making TableGen run 3x faster. llvm-svn: 148091	2012-01-13 03:16:35 +00:00
Evan Cheng	fa8326334b	DAGCombine's logic for forming pre- and post- indexed loads / stores were being overly conservative. It was concerned about cases where it would prohibit folding simple [r, c] addressing modes. e.g. ldr r0, [r2] ldr r1, [r2, #4] => ldr r0, [r2], #4 ldr r1, [r2] Change the logic to look for such cases which allows it to form indexed memory ops more aggressively. rdar://10674430 llvm-svn: 148086	2012-01-13 01:37:24 +00:00
Bill Wendling	9c8456f7ef	Fix off-by-one error. llvm-svn: 148077	2012-01-13 00:41:53 +00:00
Dan Gohman	728db4997a	Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that the optimizer doesn't eliminate objc_retainBlock calls which are needed for their side effect of copying blocks onto the heap. This implements rdar://10361249. llvm-svn: 148076	2012-01-13 00:39:07 +00:00
Pete Cooper	9bcb72136e	Added MVT::v2f16 llvm-svn: 148067	2012-01-12 23:14:13 +00:00
Bill Wendling	49c4dfb534	Revert accidental commit. llvm-svn: 148065	2012-01-12 23:06:28 +00:00
Bill Wendling	ee5eaebc58	Fix the code that was WRONG. The registers are placed into the saved registers list in the reverse order, which is why the original loop was written to loop backwards. llvm-svn: 148064	2012-01-12 23:05:03 +00:00
Pete Cooper	99415fea87	Added FPOW, FEXP, FLOG to PromoteNode so that custom actions can be set to Promote for those operations. Sorry, no test case yet llvm-svn: 148050	2012-01-12 21:46:18 +00:00
Elena Demikhovsky	060f6ccdb8	Fixed a bug in LowerVECTOR_SHUFFLE caused assertion failure lc: X86ISelLowering.cpp:6480: llvm::SDValue llvm::X86TargetLowering::LowerVECTOR_SHUFFLE(llvm::SDValue, llvm::SelectionDAG&) const: Assertion `V1.getOpcode() != ISD::UNDEF&& "Op 1 of shuffle should not be undef"' failed. Added a test. llvm-svn: 148044	2012-01-12 20:33:10 +00:00
Evan Cheng	5c03a6b8f5	When hoisting common code, watch out for uses which are marked "kill". If the killed registers are needed below the insertion point, then unset the kill marker. Sorry I'm not able to find a reduced test case. rdar://10660944 llvm-svn: 148043	2012-01-12 20:31:24 +00:00
Rafael Espindola	00e861ed57	Support segmented stacks on 64-bit FreeBSD. This patch uses tcb_spare field in the tcb structure to store info. Patch by Jyun-Yan You. llvm-svn: 148041	2012-01-12 20:24:30 +00:00
Rafael Espindola	10745d3381	Support segmented stacks on win32. Uses the pvArbitrary slot of the TIB, which is reserved for applications. We only support frames with a static size. llvm-svn: 148040	2012-01-12 20:22:08 +00:00
Evan Cheng	09cc429cb1	Allow targets to select source order pre-RA scheduler. llvm-svn: 148033	2012-01-12 18:27:52 +00:00
Devang Patel	4a6e778aae	Rename X86ATTAsmParser -> X86AsmParser We are using one parser to parse att as well as intel style syntax. llvm-svn: 148032	2012-01-12 18:03:40 +00:00
Jakob Stoklund Olesen	994fed689f	Make SplitAnalysis::UseSlots private. llvm-svn: 148031	2012-01-12 17:53:44 +00:00
Benjamin Kramer	9ece950ddb	After Jakob's r147938 exception handling on i386 was completely broken. Restore the (obviously wrong) behavior from before r147938 without relying on undefined behavior. Add a fat FIXME note. This should fix nightly tester failures. llvm-svn: 148030	2012-01-12 17:37:18 +00:00
Nadav Rotem	0a0a829bea	Fix a bug in the AVX 256-bit shuffle code in cases where the splat element is on the boundary of two 128-bit vectors. The attached testcase was stuck in an endless loop. llvm-svn: 148027	2012-01-12 15:31:55 +00:00
Benjamin Kramer	5b3aa60b44	X86: Generalize the x << (y & const) optimization to also catch masks with more set bits set than 31 or 63. llvm-svn: 148024	2012-01-12 12:41:34 +00:00
Devang Patel	fc6be102ae	Add predicate method check match memory operand size, if available. In att style asm syntax memory operand size is derived from suffix attached with mnemonic. In intel style asm syntax it is part of memory operand hence predicate method check is required to select appropriate instruction. llvm-svn: 148006	2012-01-12 01:51:42 +00:00
Bill Wendling	58c7569854	A DenseMap of a std::map isn't a very good idea because the "grow()" method will need to make a deep copy of each of the std::maps. Use a std::map of the std::map instead. This improves the compile time of sqlite3 by ~2%. llvm-svn: 148003	2012-01-12 01:41:03 +00:00
Devang Patel	46831de240	Add intel style operand parser skeleton. This is a work in progress. llvm-svn: 148002	2012-01-12 01:36:43 +00:00
Chandler Carruth	eb21da060b	Switch all of the uses of my InsertDAGNode helper to follow the exact same pattern. We already had this pattern is a few places, but others tried to make a rough approximation of an actual DAG structure. As not everywhere went to this trouble, nothing could rely on this being done. In fact, I've checked all references to these node Ids, and the ones that are using the topo-sort properties are actually satisfied with a strict-weak-ordering. The requirement appears to be that Use >= Def. I've added a big blurb of comments to this bit of the transform to clarify why the order is so important for the next reader of the code. I'm starting with this change as it is very small, and trivially reverted if something breaks or the >= above really does need to be >. If that proves the case, we can hide the problem by reverting this patch, but the problem exists elsewhere as well, and so a more comprehensive solution will be needed. llvm-svn: 148001	2012-01-12 01:34:44 +00:00
Bill Wendling	4ec081a4d2	Revert r147978. A DenseMap's iterators may become invalidated here. llvm-svn: 147980	2012-01-11 23:43:34 +00:00
Jakob Stoklund Olesen	20f19eb9ab	Make data structures private. llvm-svn: 147979	2012-01-11 23:19:08 +00:00
Bill Wendling	f0275df9e3	Use a DenseMap. This appears to improve sqlite3's compile time by ~2%. llvm-svn: 147978	2012-01-11 22:57:32 +00:00
Jakob Stoklund Olesen	73edbf1682	Sink spillInterferences into RABasic. This helper method is too simplistic for RAGreedy. llvm-svn: 147976	2012-01-11 22:52:14 +00:00
Jakob Stoklund Olesen	06ec420347	Cleanup. llvm-svn: 147975	2012-01-11 22:52:11 +00:00
Jakob Stoklund Olesen	a818d804a1	Move RegAllocBase into its own cpp file separate from RABasic. No functional change. llvm-svn: 147972	2012-01-11 22:28:30 +00:00
Eli Friedman	b31c627be1	Re-fix the issue Bill fixed in r147899 in a slightly different way, which doesn't abuse the semantics of linker_private. We don't really want to merge any string constant with a weak_odr global. llvm-svn: 147971	2012-01-11 22:06:46 +00:00
Eric Christopher	d284c1d80d	Fix assert. llvm-svn: 147966	2012-01-11 20:55:27 +00:00
Argyrios Kyrtzidis	cd8fe08e4d	Disable the crash reporter when running lit tests. llvm-svn: 147965	2012-01-11 20:53:25 +00:00
Nadav Rotem	b5ce6ee835	On AVX, we can load v8i32 at a time. The bug happens when two uneven loads are used. When we load the v12i32 type, the GenWidenVectorLoads method generates two loads: v8i32 and v4i32 and attempts to use CONCAT_VECTORS to join them. In this fix I concat undef values to widen the smaller value. The test "widen_load-2.ll" also exposes this bug on AVX. llvm-svn: 147964	2012-01-11 20:19:17 +00:00
Rafael Espindola	d90466bcbf	Support segmented stacks on mac. This uses TLS slot 90, which actually belongs to JavaScriptCore. We only support frames with static size Patch by Brian Anderson. llvm-svn: 147960	2012-01-11 19:00:37 +00:00
Rafael Espindola	4eecacb9c8	Generate the segmented stack prologue for fastcc too. Patch by Brian Anderson. llvm-svn: 147958	2012-01-11 18:41:19 +00:00
Chandler Carruth	3212a34269	Revert r147945 which disabled an addressing mode transformation. I had hoped this would revive one of the llvm-gcc selfhost build bots, but it didn't so it doesn't appear that my transform is the culprit. If anyone else is seeing failures, please let me know! llvm-svn: 147957	2012-01-11 18:36:12 +00:00
Rafael Espindola	2b89448d60	Use unsigned comparison in segmented stack prologue. This is a comparison of two addresses, and GCC does the comparison unsigned. Patch by Brian Anderson. llvm-svn: 147954	2012-01-11 18:23:35 +00:00
Kostya Serebryany	687d078192	[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395 : don't instrument the function at all on x86_32 if it has a large asm blob llvm-svn: 147953	2012-01-11 18:15:23 +00:00
Rafael Espindola	6635ae1c17	Explicitly set the scale to 1 on some segstack prologue instrs. Patch by Brian Anderson. llvm-svn: 147952	2012-01-11 18:14:03 +00:00
Kevin Enderby	6223cf72e6	The error check for using -g with a .s file already containing dwarf .file directives was in the wrong place and getting triggered incorectly with a cpp .file directive. This change fixes that and adds a test case. llvm-svn: 147951	2012-01-11 18:04:47 +00:00
Jan Sjödin	21f83d9f36	Add XOP Intrinsics and tests llvm-svn: 147949	2012-01-11 15:20:20 +00:00
Nadav Rotem	baae7e4577	Fix a bug in the lowering of BUILD_VECTOR for AVX. SCALAR_TO_VECTOR does not zero untouched elements. Use INSERT_VECTOR_ELT instead. llvm-svn: 147948	2012-01-11 14:07:51 +00:00
Duncan Sands	0bf46b5363	Don't try to create a GEP when the pointee type is unsized (such GEPs are invalid). Fixes a crash on array1.C from the GCC testsuite when compiled with dragonegg. llvm-svn: 147946	2012-01-11 12:20:08 +00:00
Chandler Carruth	9bc48e5215	Disable the transformation I added in r147936 to see if it fixes some strange build bot failures that look like a miscompile into an infloop. I'll investigate this tomorrow, but I'd both like to know whether my patch is the culprit, and get the bots back to green. llvm-svn: 147945	2012-01-11 12:17:47 +00:00
Chandler Carruth	3eacfb83fa	Hoist a really redundant code pattern into a helper function, and delete lots of lines of code. No functionality changed. llvm-svn: 147942	2012-01-11 11:04:36 +00:00
Chandler Carruth	b0049f4a43	Simplify the AND-rooted mask+shift checking code to match that of the SRL-rooted code. llvm-svn: 147941	2012-01-11 09:35:04 +00:00
Chandler Carruth	3dbcda8478	Unify the interface of the three mask+shift transform helpers, and factor the differences that were hiding in one of them into its other caller, the SRL handling code. No change in behavior. llvm-svn: 147940	2012-01-11 09:35:02 +00:00
Chandler Carruth	aa01e6661a	Clarify and make explicit some of the requirements for transforming mask+shift pairs at the beginning of the ISD::AND case block, and then hoist the final pattern into a helper function, simplifying and reflowing it appropriately. This should have no observable behavior change, but several simplifications fell out of this such as directly computing the new mask constant, etc. llvm-svn: 147939	2012-01-11 09:35:00 +00:00
Jakob Stoklund Olesen	6039983755	Fix undefined code and reenable test case. I don't think the compact encoding code is right, but at least is has defined behavior now. llvm-svn: 147938	2012-01-11 09:08:04 +00:00
Chandler Carruth	51d3076bbf	Hoist the logic to transform shift+mask combinations into sub-register extracts and scaled addressing modes into its own helper function. No functionality changed here, just hoisting and layout fixes falling out of that hoisting. llvm-svn: 147937	2012-01-11 08:48:20 +00:00
Chandler Carruth	55b2cdee26	Teach the X86 instruction selection to do some heroic transforms to detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows: unsigned x = my_accelerator_table[input >> 11]; Here we have some lookup table that we look into using the high bits of 'input'. Each entity in the table is 4-bytes, which means this implicitly gets turned into (once lowered out of a GEP): (unsigned)((char)my_accelerator_table + ((input >> 11) << 2)); The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured. In order for all of this to work, one part of the DAG needs to be canonicalized still further* than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern. llvm-svn: 147936	2012-01-11 08:41:08 +00:00
Stepan Dyatkovskiy	8216569812	Improved compile time: 1. Size heuristics changed. Now we calculate number of unswitching branches only once per loop. 2. Some checks was moved from UnswitchIfProfitable to processCurrentLoop, since it is not changed during processCurrentLoop iteration. It allows decide to skip some loops at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
Andrew Trick	e81211f45c	Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. This interface is misleading and dangerous, but it is actually what we need for unrolling. llvm-svn: 147926	2012-01-11 06:52:55 +00:00
Rafael Espindola	647841b181	Add big endian mips support. Based on a patch by Jack Carter. llvm-svn: 147924	2012-01-11 04:04:14 +00:00
Rafael Espindola	870c4e92b9	Add the skeleton of an asm parser for mips. llvm-svn: 147923	2012-01-11 03:56:41 +00:00
Andrew Trick	642f0f6a40	ARM Ld/St Optimizer fix. Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits. Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12 llvm-svn: 147922	2012-01-11 03:56:08 +00:00
Jakob Stoklund Olesen	8b1d023a4a	Detect when a value is undefined on an edge to a landing pad. Consider this code: int h() { int x; try { x = f(); g(); } catch (...) { return x+1; } return x; } The variable x is undefined on the first edge to the landing pad, but it has the f() return value on the second edge to the landing pad. SplitAnalysis::getLastSplitPoint() would assume that the return value from f() was live into the landing pad when f() throws, which is of course impossible. Detect these cases, and treat them as if the landing pad wasn't there. This allows spill code to be inserted after the function call to f(). <rdar://problem/10664933> llvm-svn: 147912	2012-01-11 02:07:05 +00:00
Jakob Stoklund Olesen	67aec12409	Exclusively use SplitAnalysis::getLastSplitPoint(). Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result. llvm-svn: 147911	2012-01-11 02:07:00 +00:00
Evan Cheng	d9725a38d6	Avoid CSE of instructions which define physical registers across MBBs unless the physical registers are not allocatable. llvm-svn: 147902	2012-01-11 00:38:11 +00:00
Bill Wendling	c79155192d	If the global variable is removed by the linker, then don't constant merge it with other symbols. An object in the __cfstring section is suppoed to be filled with CFString objects, which have a pointer to ___CFConstantStringClassReference followed by a pointer to a __cstring. If we allow the object in the __cstring section to be merged with another global, then it could end up in any section. Because the linker is going to remove these symbols in the final executable, we shouldn't bother to merge them. <rdar://problem/10564621> llvm-svn: 147899	2012-01-11 00:13:08 +00:00
Eric Christopher	43a1182975	Don't avoid recursing for pointer types, just reference types. Expand on the comment. Fixes constvars.exp on the gdb test builder. llvm-svn: 147897	2012-01-11 00:01:29 +00:00
Lang Hames	995c63329a	Fixed order of operands in comment to match code. llvm-svn: 147890	2012-01-10 22:53:20 +00:00
Joerg Sonnenberger	96cd35cf6d	Default stack alignment for 32bit x86 should be 4 Bytes, not 8 Bytes. Add a test that checks the stack alignment of a simple function for Darwin, Linux and NetBSD for 32bit and 64bit mode. llvm-svn: 147888	2012-01-10 22:43:53 +00:00
Jakob Stoklund Olesen	20f1dd5faf	Consider unknown alignment caused by OptimizeThumb2Instructions(). This function runs after all constant islands have been placed, and may shrink some instructions to their 2-byte forms. This can actually cause some constant pool entries to move out of range because of growing alignment padding. Treat instructions that may be shrunk the same as inline asm - they erode the known alignment bits. Also reinstate an old assertion in verify(). It is correct now that basic block offsets include alignments. Add a single large test case that will hopefully exercise many parts of the constant island pass. <rdar://problem/10670199> llvm-svn: 147885	2012-01-10 22:32:14 +00:00
Evan Cheng	da46832e42	80 col violation. llvm-svn: 147884	2012-01-10 22:27:32 +00:00
Chad Rosier	1a8f0ccd8c	Add missing VEX predicates to VMOVSDto64rr/VMOVSDto64mr. This fixes a few failing test cases on our internal AVX nightly tester. rdar://10663637 llvm-svn: 147881	2012-01-10 22:14:06 +00:00
Devang Patel	227b6279b6	Let asm parser query asm syntax dialect. llvm-svn: 147880	2012-01-10 21:49:42 +00:00
Kevin Enderby	f7d77069ca	This is the matching change for the data structure name changes for the functional change in r147860 to use DW_TAG_label's instead TAG_subprogram's. This only changes names and updates comments. No functional change. llvm-svn: 147877	2012-01-10 21:12:34 +00:00
Jim Grosbach	74ac7d50a1	ARM updating VST2 pseudo-lowering fixed vs. register update. rdar://10663487 llvm-svn: 147876	2012-01-10 21:11:12 +00:00
Benjamin Kramer	233149cf06	Fix some leftover control reaches end of non-void function warnings. llvm-svn: 147874	2012-01-10 20:47:20 +00:00
Chandler Carruth	9a7510af46	Teach the triple library about the androideabi environment. Patch by Evgeniy Stepanov. llvm-svn: 147871	2012-01-10 19:46:00 +00:00
Richard Smith	ad5b42c02f	Move default case for covered enum outside of switch. llvm-svn: 147870	2012-01-10 19:43:09 +00:00
Bill Wendling	d5ab02600e	For i386, don't use the generic code. As the comment around 7746 says, it's better to use the x87 extended precision here than SSE. And the generic code doesn't know how to do that. It also regains the speed lost for the uint64_to_float.c testcase. <rdar://problem/10669858> llvm-svn: 147869	2012-01-10 19:41:30 +00:00
Richard Smith	3f1035410f	Fix a -Wreturn-type warning in g++. llvm-svn: 147867	2012-01-10 19:10:22 +00:00
Chandler Carruth	4c0ee749bb	Cleanup these asserts to follow common LLVM style and coding conventions. Also, clarify the grouping of one of the asserts to silence -Wparentheses. llvm-svn: 147863	2012-01-10 18:18:52 +00:00
Chandler Carruth	f3e8502cc1	Add 'llvm_unreachable' to passify GCC's understanding of the constraints of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, and we should assume as much. llvm-svn: 147861	2012-01-10 18:08:01 +00:00
Kevin Enderby	8d4a2204b7	Various crash reporting tools have a problem with the dwarf generated for assembly source when it generates the TAG_subprogram dwarf debug info for the labels that have nothing between them as in this bit of assembly source: % cat ZeroLength.s _func1: _func2: nop One solution would be to not emit the subsequent labels with the same address and use the next label with a different address or the end of the section for the AT_high_pc value of the TAG_subprogram. Turns out in llvm-mc it is not possible in all cases to determine of two symbols have the same value at the point we put out the TAG_subprogram dwarf debug info. So we will have llvm-mc instead of putting out TAG_subprogram's put out DW_TAG_label's. And the DW_TAG_label does not have a AT_high_pc value which avoids the problem. This commit is only the functional change to make the diffs clear as to what is really being changed. The next commit will be to clean up the names of such things like MCGenDwarfSubprogramEntry to something like MCGenDwarfLabelEntry. rdar://10666925 llvm-svn: 147860	2012-01-10 17:52:29 +00:00
Devang Patel	67bf992a8f	Add definition for intel asm variant. Right now, this just adds additional entries in match table. The parser does not use them yet. llvm-svn: 147859	2012-01-10 17:51:54 +00:00
David Blaikie	edbb58c577	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Nadav Rotem	61bdf79035	Fix a bug in the legalization of shuffle vectors. When we emulate shuffles using BUILD_VECTORS we may be using a BV of different type. Make sure to cast it back. llvm-svn: 147851	2012-01-10 14:28:46 +00:00
Benjamin Kramer	077ae1d760	Add definitions for AMD's bobcat (aka btver1) llvm-svn: 147846	2012-01-10 11:50:02 +00:00
Craig Topper	430f3f1bd6	Fix a crash in AVX2 when trying to broadcast a double into a 128-bit vector. There is no vbroadcastsd xmm, but we do need to support 64-bit integers broadcasted into xmm. Also factor the AVX check into the isVectorBroadcast function. This makes more sense since the AVX2 check was already inside. llvm-svn: 147844	2012-01-10 08:23:59 +00:00
Craig Topper	b0c0f72ae6	Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is the final piece to remove the AVX hack that disabled SSE. llvm-svn: 147843	2012-01-10 06:54:16 +00:00
Craig Topper	d97bbd7b60	Remove hasSSEorAVX functions and change all callers to use just hasSSE. AVX is now an SSE level and no longer disables SSE checks. llvm-svn: 147842	2012-01-10 06:37:29 +00:00
Craig Topper	eb8f9e9e5b	Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. llvm-svn: 147841	2012-01-10 06:30:56 +00:00
Evan Cheng	0be4144a68	Allow machine-cse to look across MBB boundary when cse'ing instructions that define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprising large number of cases. rdar://10660865 llvm-svn: 147827	2012-01-10 02:02:58 +00:00
Andrew Trick	d5d2db9af9	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Jakob Stoklund Olesen	f09a316542	Accurately model hardware alignment rounding. On Thumb, the displacement computation hardware uses the address of the current instruction rouned down to a multiple of 4. Include this rounding in the UserOffset we compute for each instruction. When inline asm is present, the instruction alignment may not be known. Constrain the maximum displacement instead in that case. This makes it possible for CreateNewWater() and OffsetIsInRange() to agree about the valid displacements. When they disagree, infinite looping happens. As always, test cases for this stuff are insane. <rdar://problem/10660175> llvm-svn: 147825	2012-01-10 01:34:59 +00:00
Rafael Espindola	5cb98f1062	Remove the logging streamer. llvm-svn: 147820	2012-01-10 00:40:39 +00:00
Jakob Stoklund Olesen	1a80e3a26b	Catch runaway ARMConstantIslandPass even in -Asserts builds. The pass is prone to looping, and it is better to crash than loop forever, even in a -Asserts build. <rdar://problem/10660175> llvm-svn: 147806	2012-01-09 22:16:24 +00:00
Devang Patel	29ba4f97e6	Fix asm string wrt variants. llvm-svn: 147805	2012-01-09 21:32:02 +00:00
Andrew Trick	248d410e3e	Adding IV chain generation to LSR. After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably complicated to implement and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver, otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801	2012-01-09 21:18:52 +00:00
Andrew Trick	29fe5f03d7	Adding collection of IV chains to LSR. This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797	2012-01-09 19:50:34 +00:00
Devang Patel	85d684a4d9	Split AsmParser into two components - AsmParser and AsmParserVariant AsmParser holds info specific to target parser. AsmParserVariant holds info specific to asm variants supported by the target. llvm-svn: 147787	2012-01-09 19:13:28 +00:00
Andrew Trick	4dc3eff5ae	"Minor LSR debugging stuff" llvm-svn: 147785	2012-01-09 18:58:16 +00:00
Devang Patel	fa8df4837a	Update language check. Do not ignore DW_LANG_Python. Patch by Joe Groff! llvm-svn: 147781	2012-01-09 17:49:47 +00:00
Benjamin Kramer	f7fe24f40a	Move assert to the right place. llvm-svn: 147779	2012-01-09 17:36:29 +00:00
Benjamin Kramer	f9d0cc0160	InstCombine: Teach foldLogOpOfMaskedICmpsHelper that sign bit tests are bit tests. This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777	2012-01-09 17:23:27 +00:00
Chandler Carruth	c16622daff	Don't rely on the fact that shift values are never very large, and thus this substraction will result in small negative numbers at worst which become very large positive numbers on assignment and are thus caught by the <=4 check on the next line. The >0 check clearly intended to catch these as negative numbers. Spotted by inspection, and impossible to trigger given the shift widths that can be used. llvm-svn: 147773	2012-01-09 09:47:25 +00:00
Craig Topper	f287a4509e	Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level. Predicate functions have been altered to maintain previous names and behavior. llvm-svn: 147770	2012-01-09 09:02:13 +00:00
Craig Topper	b89805c77d	Add HasAVX predicate to some of the AVX patterns. llvm-svn: 147769	2012-01-09 08:34:00 +00:00
Craig Topper	a51f7f75c2	Reorder a bunch of patterns to put the AVX version first thus giving it priority over the SSE version. Another step towards trying to remove the AVX hack that disables SSE from X86Subtarget. llvm-svn: 147768	2012-01-09 08:10:38 +00:00
Craig Topper	ef7f5bf8c9	Clean up patterns for MOVNT*. Not sure why there were floating point types on MOVNTPS and MOVNTDQ. And v4i64 was completely missing. llvm-svn: 147767	2012-01-09 06:52:46 +00:00
Craig Topper	c1f5622ad3	Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no AVX equivalent so we should use the SSE version. llvm-svn: 147766	2012-01-09 06:38:55 +00:00
Craig Topper	a081644f8a	Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations ANDPS/ORPS/XORPS/ANDNPS. This fixes a pattern ordering issue that meant that the SSE2 instructions could never be directly selected since the SSE1 patterns would always match first. This is largely moot with the ExeDepsFix pass, but I'm trying to audit for all such ordering issues. llvm-svn: 147765	2012-01-09 05:07:01 +00:00
Craig Topper	210e4f81b3	Change some places that were checking for AVX OR SSE1/2 to use hasXMM/hasXMMInt instead. Also fix one place that checked SSE3, but accidentally excluded AVX to use hasSSE3orAVX. This is a step towards removing the AVX hack from the X86Subtarget.h llvm-svn: 147764	2012-01-09 02:28:15 +00:00
Rafael Espindola	f28213ca01	Don't print an unused label before .cfi_endproc. llvm-svn: 147763	2012-01-09 00:17:29 +00:00
Craig Topper	744f6311d3	Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level. llvm-svn: 147762	2012-01-09 00:11:29 +00:00
Craig Topper	c1ab7afec8	Enable FISTTP* instructions when AVX is enabled. llvm-svn: 147758	2012-01-08 23:04:21 +00:00
Benjamin Kramer	6609f741b9	Tweak my last commit to be less conservative about uses. We still save an instruction when just the "and" part is replaced. Also change the code to match comments more closely. llvm-svn: 147753	2012-01-08 21:12:51 +00:00
Evan Cheng	4882e488f7	Don't forget to transfer implicit uses of return instruction. llvm-svn: 147752	2012-01-08 20:41:16 +00:00
Evan Cheng	520730ff23	Avoid eraseing copies from a reserved register unless the definition can be safely proven not to have been clobbered. No small test case possible. llvm-svn: 147751	2012-01-08 19:52:28 +00:00
Benjamin Kramer	da37e15345	InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test. This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set. llvm-svn: 147749	2012-01-08 18:32:24 +00:00
Victor Umansky	540651cf59	Reverted commit #147601 upon Evan's request. llvm-svn: 147748	2012-01-08 17:20:33 +00:00
Rafael Espindola	81a6274e7c	Remove MCELFStreamer.h. llvm-svn: 147745	2012-01-07 23:18:39 +00:00
Rafael Espindola	382412032c	Don't print a label before .cfi_startproc when we don't need to. This makes the produce assembly when using CFI just a bit more readable. llvm-svn: 147743	2012-01-07 22:42:19 +00:00
Jakob Stoklund Olesen	083dbdca7f	Match SelectionDAG logic for enabling movt. Darwin doesn't do static, and ELF targets only support static. llvm-svn: 147740	2012-01-07 20:49:15 +00:00
Craig Topper	f210619d08	Fix typo in the X86 backend readme. Patch from Jaeden Amero. llvm-svn: 147739	2012-01-07 20:35:21 +00:00
Benjamin Kramer	6898db6269	Remove VectorExtras. This unused helper was written for a type of API that is discouraged now. llvm-svn: 147738	2012-01-07 19:42:13 +00:00
Craig Topper	ca66bba45e	Remove unnecessary check of hasAVX(). It's already included in hasXMM(). llvm-svn: 147734	2012-01-07 18:48:43 +00:00
Craig Topper	0515cd41e4	Replace some uses of hasNUsesOfValue(0, X) with !hasAnyUseOfValue(X) llvm-svn: 147733	2012-01-07 18:31:09 +00:00
Craig Topper	43a1bd6ac7	Add some DAG combines for SUBC/SUBE. If nothing uses the carry/borrow out of subc, turn it into a sub. Turn (subc x, x) into 0 with no borrow. Turn (subc x, 0) into x with no borrow. Turn (subc -1, x) into (xor x, -1) with no borrow. Turn sube with no borrow in into subc. llvm-svn: 147728	2012-01-07 09:06:39 +00:00
Jakob Stoklund Olesen	434fb37bb4	Optimize reserved register coalescing. Reserved registers don't have proper live ranges, their LiveInterval simply has a snippet of liveness for each def. Virtual registers with a single value that is a copy of a reserved register (typically %esp) can be coalesced with the reserved register if the live range doesn't overlap any reserved register defs. When coalescing with a reserved register, don't modify the reserved register live range. Just leave it as a bunch of dead defs. This eliminates quadratic coalescer behavior in i386 functions with many function calls. PR11699 llvm-svn: 147726	2012-01-07 07:39:50 +00:00
Jakob Stoklund Olesen	a8879087b5	Use the 'regalloc' debug tag for most register allocator tracing. llvm-svn: 147725	2012-01-07 07:39:47 +00:00
Andrew Trick	06f6c05d08	Enable redundant phi elimination after LSR. This will be more important as we extend the LSR pass in ways that don't rely on the formula solver. In particular, we need it for constructing IV chains. llvm-svn: 147724	2012-01-07 07:08:17 +00:00
Jakob Stoklund Olesen	8cdce7e690	Use getRegForValue() to materialize the address of ARM globals. This enables basic local CSE, giving us 20% smaller code for consumer-typeset in -O0 builds. <rdar://problem/10658692> llvm-svn: 147720	2012-01-07 04:07:22 +00:00
Evan Cheng	6cc8d49885	Revert part of r147716. Looks like x87 instructions kill markers are all messed up so branch folding pass can't use the scavenger. :-( This doesn't breaks anything currently. It just means targets which do not carefully update kill markers cannot run post-ra scheduler (not new, it has always been the case). We should fix this at some point since it's really hacky. llvm-svn: 147719	2012-01-07 03:35:48 +00:00
Andrew Trick	732ad80dbb	LSR: Don't optimize loops if an outer loop has no preheader. LoopSimplify may not run on some outer loops, e.g. because of indirect branches. SCEVExpander simply cannot handle outer loops with no preheaders. Fixes rdar://10655343 SCEVExpander segfault. llvm-svn: 147718	2012-01-07 03:16:50 +00:00
Rafael Espindola	0708209642	Split Finish into Finish and FinishImpl to have a common place to do end of file error checking. Use that to error on an unfinished cfi_startproc. The error is not nice, but is already better than a segmentation fault. llvm-svn: 147717	2012-01-07 03:13:18 +00:00
Evan Cheng	00b1a3cd7e	Added a late machine instruction copy propagation pass. This catches opportunities that only present themselves after late optimizations such as tail duplication .e.g. ## BB#1: movl %eax, %ecx movl %ecx, %eax ret The register allocator also leaves some of them around (due to false dep between copies from phi-elimination, etc.) This required some changes in codegen passes. Post-ra scheduler and the pseudo-instruction expansion passes have been moved after branch folding and tail merging. They were before branch folding before because it did not always update block livein's. That's fixed now. The pass change makes independently since we want to properly schedule instructions after branch folding / tail duplication. rdar://10428165 rdar://10640363 llvm-svn: 147716	2012-01-07 03:02:36 +00:00
Evan Cheng	501e3095e8	Copy implicit defs (e.g. r0) when changing tBX_RET to tPOP_RET. This bug is exposed with an upcoming change will would delete the copy to return register because there is no use! It's amazing anything works. llvm-svn: 147715	2012-01-07 02:55:54 +00:00
Jakob Stoklund Olesen	68f034ee1a	Use movw+movt in ARMFastISel::ARMMaterializeGV. This eliminates a lot of constant pool entries for -O0 builds of code with many global variable accesses. This speeds up -O0 codegen of consumer-typeset by 2x because the constant island pass no longer has to look at thousands of constant pool entries. <rdar://problem/10629774> llvm-svn: 147712	2012-01-07 01:47:05 +00:00
Andrew Trick	2ec61a896b	LSR: run DeleteDeadPhis before replaceCongruentPhis. llvm-svn: 147711	2012-01-07 01:36:44 +00:00
Andrew Trick	f730f39f3f	Cleanup comments and argument types related to my previous replaceCongruentPhis checkin. llvm-svn: 147709	2012-01-07 01:29:21 +00:00
Andrew Trick	5adedf5d47	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Eric Christopher	c206d46709	Make the 'x' constraint work for AVX registers as well. Fixes rdar://10614894 llvm-svn: 147704	2012-01-07 01:02:09 +00:00
Andrew Trick	ff4e2b7d23	Missing raw_ostream.h breaks MSVC build. llvm-svn: 147703	2012-01-07 00:54:28 +00:00
Andrew Trick	881a776875	Expose isNonConstantNegative to users of ScalarEvolution. llvm-svn: 147700	2012-01-07 00:27:31 +00:00
Chad Rosier	73a3fab480	Add comment. llvm-svn: 147696	2012-01-06 23:45:47 +00:00
Eric Christopher	8ea8e4fc76	Add a comment and ensure that anyone else looking at this code doesn't start to bleed from the eyes. llvm-svn: 147695	2012-01-06 23:03:37 +00:00
Eric Christopher	090fcc1a10	Use const vector references instead of a vector copy. Spotted by Devang. llvm-svn: 147694	2012-01-06 23:03:34 +00:00
Eric Christopher	5a28a6ee2f	Use -> instead of (*iter). llvm-svn: 147693	2012-01-06 23:03:27 +00:00
Jakob Stoklund Olesen	68a922c0e9	Enable aligned NEON spilling by default. Experiments show this to be a small speedup for modern ARM cores. llvm-svn: 147689	2012-01-06 22:19:37 +00:00
Andrew Trick	9a5b242d3c	Put all IVUsers in the processed set. Allow querying IVUsers with isIVUserOrOperand. llvm-svn: 147686	2012-01-06 21:41:55 +00:00
Jakob Stoklund Olesen	690511137c	Abort AdjustBBOffsetsAfter early when possible. llvm-svn: 147685	2012-01-06 21:40:15 +00:00
Andrew Trick	b8045cbcb1	SCEVExpander: hoistStep should check strict dominance. llvm-svn: 147683	2012-01-06 21:23:43 +00:00
Andrew Trick	85460d0d32	Tracing to help investigate issues with SjLj spill code. llvm-svn: 147682	2012-01-06 21:16:27 +00:00
Chad Rosier	64dc8aa44f	Initializing to false makes better sense. Thanks, David. llvm-svn: 147679	2012-01-06 20:11:59 +00:00
Chad Rosier	a3d90a9467	Fix uninitialized variable warning. llvm-svn: 147676	2012-01-06 20:02:49 +00:00
Chad Rosier	6b64c3c683	Fix uninitialized variable warning. llvm-svn: 147675	2012-01-06 19:59:58 +00:00
Eric Christopher	667a074be0	Fix a leak I noticed while reviewing the accelerator table changes. Passes lldb testsuite. rdar://10652330 llvm-svn: 147673	2012-01-06 19:35:04 +00:00
Kostya Serebryany	3411f2ea68	[asan] cleanup: remove the SIGILL-related code (compiler part) llvm-svn: 147667	2012-01-06 18:09:21 +00:00
Eli Bendersky	d8e2572909	Fix typo in string llvm-svn: 147654	2012-01-06 07:49:17 +00:00
Eric Christopher	21bde87bf3	As part of the ongoing work in finalizing the accelerator tables, extend the debug type accelerator tables to contain the tag and a flag stating whether or not a compound type is a complete type. rdar://10652330 llvm-svn: 147651	2012-01-06 04:35:23 +00:00
Dan Gohman	5ab9c0a927	Fix SpeculativelyExecuteBB to either speculate all or none of the phis present in the bottom of the CFG triangle, as the transformation isn't ever valuable if the branch can't be eliminated. Also, unify some heuristics between SimplifyCFG's multiple if-converters, for consistency. This fixes rdar://10627242. llvm-svn: 147630	2012-01-05 23:58:56 +00:00
Eli Friedman	55fa49f32d	PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into global initializers if there's an implied extension or truncation. llvm-svn: 147625	2012-01-05 23:03:32 +00:00
Rafael Espindola	23f8d64b58	Link symbols with different visibilities according to the rules in the System V Application Binary Interface. This lets us use -fvisibility-inlines-hidden with LTO. Fixes PR11697. llvm-svn: 147624	2012-01-05 23:02:01 +00:00
Dan Gohman	5267211899	Revert r56315. When the instruction to speculate is a load, this code can incorrectly move the load across a store. This never happens in practice today, but only because the current heuristics accidentally preclude it. llvm-svn: 147623	2012-01-05 22:54:35 +00:00
Benjamin Kramer	69eab4e0af	Kill ObjectCodeEmitter and BinaryObject, they were unused and superseded by MC. llvm-svn: 147618	2012-01-05 22:31:37 +00:00
Nick Lewycky	f740db31e2	SCCCaptured is trivially false on entry to this loop and not modified inside it. Eliminate the dead test for it on each loop iteration. No functionality change. llvm-svn: 147616	2012-01-05 22:21:45 +00:00
Rafael Espindola	afcf571ef9	Remove the old ELF writer. llvm-svn: 147615	2012-01-05 22:07:43 +00:00
Danil Malyshev	7e325789af	A small re-factored JIT/MCJIT::getPointerToNamedFunction(), so it could be called with the base class. llvm-svn: 147610	2012-01-05 21:16:14 +00:00
Sebastian Pop	99ab273a77	revert r147542 after comments from Joerg Sonnenberger llvm-svn: 147608	2012-01-05 18:28:46 +00:00
Chandler Carruth	eab5029964	Remove an unused variable. llvm-svn: 147605	2012-01-05 11:25:47 +00:00
Chandler Carruth	e041a30bb9	Prevent a DAGCombine from firing where there are two uses of a combined-away node and the result of the combine isn't substantially smaller than the input, it's just canonicalized. This is the first part of a significant (7%) performance gain for Snappy's hot decompression loop. llvm-svn: 147604	2012-01-05 11:05:55 +00:00
Craig Topper	29b0737452	Mark scalar FMA4 instructions as ignoring the VEX.L bit. llvm-svn: 147602	2012-01-05 08:56:10 +00:00
Victor Umansky	9255b6d9fe	Peephole optimization of ptest-conditioned branch in X86 arch. Performs instruction combining of sequences generated by ptestz/ptestc intrinsics to ptest+jcc pair for SSE and AVX. Testing: passed 'make check' including LIT tests for all sequences being handled (both SSE and AVX) Reviewers: Evan Cheng, David Blaikie, Bruno Lopes, Elena Demikhovsky, Chad Rosier, Anton Korobeynikov llvm-svn: 147601	2012-01-05 08:46:19 +00:00
Andrew Trick	100af0adf7	Minor postra scheduler cleanup. It could result in more precise antidependence latency on ARM in exceedingly rare cases. llvm-svn: 147594	2012-01-05 02:52:11 +00:00
Bill Wendling	ac27f0c830	Replace the uint64_t -> double convertion algorithm with one that's more efficient. This small bit of ASM code is sufficient to do what the old algorithm did: movq %rax, %xmm0 punpckldq (c0), %xmm0 // c0: (uint4){ 0x43300000U, 0x45300000U, 0U, 0U } subpd (c1), %xmm0 // c1: (double2){ 0x1.0p52, 0x1.0p52 * 0x1.0p32 } #ifdef __SSE3__ haddpd %xmm0, %xmm0 #else pshufd $0x4e, %xmm0, %xmm1 addpd %xmm1, %xmm0 #endif It's arguably faster. One caveat, the 'haddpd' instruction isn't very fast on all processors. <rdar://problem/7719814> llvm-svn: 147593	2012-01-05 02:13:20 +00:00
Jakob Stoklund Olesen	d110e2a83f	Reapply r146997, "Heed spill slot alignment on ARM." Now that canRealignStack() understands frozen reserved registers, it is safe to use it for aligned spill instructions. It will only return true if the registers reserved at the beginning of register allocation allow for dynamic stack realignment. <rdar://problem/10625436> llvm-svn: 147579	2012-01-05 00:26:57 +00:00
Jakob Stoklund Olesen	9cb477db25	Avoid reserving an ARM base pointer during register allocation. Once register allocation has started the reserved registers are frozen. Fix the ARM canRealignStack() hook to respect the frozen register state. Now the hook returns false if register allocation was started with frame pointer elimination enabled. It also returns false if register allocation started without a reserved base pointer, and stack realignment would require a base pointer. This bug was breaking oggenc on armv6. No test case, an upcoming patch will use this functionality to realign the stack for spill slots when possible. llvm-svn: 147578	2012-01-05 00:26:52 +00:00
Jakob Stoklund Olesen	d19d3cab09	Freeze reserved registers before starting register allocation. The register allocators don't currently support adding reserved registers while they are running. Extend the MRI API to keep track of the set of reserved registers when register allocation started. Target hooks like hasFP() and needsStackRealignment() can look at this set to avoid reserving more registers during register allocation. llvm-svn: 147577	2012-01-05 00:26:49 +00:00
Dan Gohman	7ac046a261	Generalize isSafeToSpeculativelyExecute to work on arbitrary Values, rather than just Instructions, since it's interesting for ConstantExprs too. llvm-svn: 147560	2012-01-04 23:01:09 +00:00
Benjamin Kramer	9c48f26341	Silence warnings of a mysterious compiler that still defaults to C89. llvm-svn: 147553	2012-01-04 22:06:45 +00:00
Sebastian Pop	0f357d6c22	use getHostTriple instead of getDefaultTargetTriple in getClosestTargetForJIT Get back getHostTriple. For JIT compilation, use the host triple instead of the default target: this fixes some JIT testcases that used to fail when the compiler has been configured as a cross compiler. llvm-svn: 147542	2012-01-04 19:47:22 +00:00
Akira Hatanaka	aac3e06bf7	Enable -soft-float for MIPS. llvm-svn: 147541	2012-01-04 19:29:11 +00:00
Nick Lewycky	6d1d4bb6a1	Remove pointless asserts. llvm-svn: 147529	2012-01-04 09:42:30 +00:00
Nick Lewycky	0c48afa0ed	Teach instcombine all sorts of great stuff about shifts that have exact, nuw or nsw bits on them. llvm-svn: 147528	2012-01-04 09:28:29 +00:00
Craig Topper	f726e15f44	Allow vector shuffle normalizing to use concat vector even if the sources are commuted in the shuffle mask. llvm-svn: 147527	2012-01-04 09:23:09 +00:00
Craig Topper	279c77b677	Implement VECTOR_SHUFFLE canonicalizations during DAG combine. llvm-svn: 147525	2012-01-04 08:07:43 +00:00
Akira Hatanaka	3b775b8cc3	Rename immLUiOpnd. llvm-svn: 147519	2012-01-04 03:09:26 +00:00
Akira Hatanaka	b89a4bfe41	- Define base classes for Jump-and-link instructions and make 32-bit and 64-bit versions derive from them. - JALR64 is not needed since N64 does not emit jal. - Add template parameter to BranchLink that sets the rt field. - Fix the set of temporary registers for O32 and N64. llvm-svn: 147518	2012-01-04 03:02:47 +00:00
Akira Hatanaka	c669d7a6db	Have getRegForInlineAsmConstraint return the correct register class when target is Mips64. llvm-svn: 147516	2012-01-04 02:45:01 +00:00
Evan Cheng	801d98b3f0	Fix more places which should be checking for iOS, not darwin. llvm-svn: 147513	2012-01-04 01:55:04 +00:00
Evan Cheng	104dbb0fd1	For x86, canonicalize max (x > y) ? x : y => (x >= y) ? x : y So for something like (x - y) > 0 : (x - y) ? 0 It will be (x - y) >= 0 : (x - y) ? 0 This makes is possible to test sign-bit and eliminate a comparison against zero. e.g. subl %esi, %edi testl %edi, %edi movl $0, %eax cmovgl %edi, %eax => xorl %eax, %eax subl %esi, $edi cmovsl %eax, %edi rdar://10633221 llvm-svn: 147512	2012-01-04 01:41:39 +00:00
Chris Lattner	6b77a07f75	Turn a few more inline asm errors into "emitErrors" instead of fatal errors. Before we'd get: $ clang t.c fatal error: error in backend: Invalid operand for inline asm constraint 'i'! Now we get: $ clang t.c t.c:16:5: error: invalid operand for inline asm constraint 'i'! "movq (%4), %%mm0\n" ^ Which at least gets us the inline asm that is the problem. llvm-svn: 147502	2012-01-03 23:51:01 +00:00
Chris Lattner	e22e613128	generalize LLVMContext::emitError to take a twine instead of a StringRef. llvm-svn: 147501	2012-01-03 23:47:05 +00:00
Chad Rosier	6ca97df951	Fix 80-column violations. llvm-svn: 147495	2012-01-03 23:19:12 +00:00
Jakob Stoklund Olesen	1b7f2a7638	Revert r146997, "Heed spill slot alignment on ARM." This patch caused a miscompilation of oggenc because a frame pointer was suddenly needed halfway through register allocation. <rdar://problem/10625436> llvm-svn: 147487	2012-01-03 22:34:35 +00:00
Jakob Stoklund Olesen	4043d92872	Assert when reserved registers have been assigned. This can only happen if the set of reserved registers changes during register allocation. <rdar://problem/10625436> llvm-svn: 147486	2012-01-03 22:34:31 +00:00
Nadav Rotem	6d31bac85e	Revert 147426 because it caused pr11696. llvm-svn: 147485	2012-01-03 22:19:42 +00:00
Nadav Rotem	1e7dda13c8	Fix incorrect widening of the bitcast sdnode in case the incoming operand is integer-promoted. llvm-svn: 147484	2012-01-03 22:12:28 +00:00
Chad Rosier	493c1b3152	Enhance DAGCombine for transforming 128->256 casts into a vmovaps, rather then a vxorps + vinsertf128 pair if the original vector came from a load. rdar://10594409 llvm-svn: 147481	2012-01-03 21:05:52 +00:00
Nick Lewycky	228f5b4ba3	Conform to the style guide; remove 'else' after 'return'. Also remove an extra if-statement by turning it into an assert. No functionality change. llvm-svn: 147474	2012-01-03 20:33:00 +00:00
Owen Anderson	fcc041eabf	Remove the restriction that target intrinsics can only involve legal types. Targets can perfects well support intrinsics on illegal types, as long as they are prepared to perform custom expansion during type legalization. For example, a target where i64 is illegal might still support the i64 intrinsic operation using pairs of i32's. ARM already does some expansions like this for non-intrinsic operations. llvm-svn: 147472	2012-01-03 20:09:02 +00:00
Lang Hames	c405ac4429	Clarified assert text. llvm-svn: 147471	2012-01-03 20:05:57 +00:00
Matt Beaumont-Gay	b982d8eb65	Fix malformed assert. If anybody has strong feelings about 'default: assert(0 && "blah")' vs 'default: llvm_unreachable("blah")', feel free to regularize the instances of each in this file. llvm-svn: 147459	2012-01-03 19:03:59 +00:00
Nick Lewycky	bc26b2d162	Fix typo in ruler. No functionality change. llvm-svn: 147454	2012-01-03 18:22:43 +00:00
Devang Patel	c1215324a3	Intel style asm variant does not need '%' prefix. llvm-svn: 147453	2012-01-03 18:22:10 +00:00
Stepan Dyatkovskiy	a3e8b00f75	Type: replaced usage of ID with getTypeID(). llvm-svn: 147446	2012-01-03 14:05:04 +00:00
Elena Demikhovsky	8ec21a2801	Fixed a bug in SelectionDAG.cpp. The failure seen on win32, when i64 type is illegal. It happens on stage of conversion VECTOR_SHUFFLE to BUILD_VECTOR. The failure message is: llc: SelectionDAG.cpp:784: void VerifyNodeCommon(llvm::SDNode*): Assertion `(I->getValueType() == EltVT \|\| (EltVT.isInteger() && I->getValueType().isInteger() && EltVT.bitsLE(I->getValueType()))) && "Wrong operand type!"' failed. I added a special test that checks vector shuffle on win32. llvm-svn: 147445	2012-01-03 11:59:04 +00:00
Andrew Trick	cbcc98fb50	Fix SCEVExpander to handle loops with no preheader when LSR gives it a "phony" insertion point. Fixes rdar://10619599: "SelectionDAGBuilder shouldn't visit PHI nodes!" assert llvm-svn: 147439	2012-01-02 21:25:10 +00:00
Craig Topper	5bacb7e9e5	Miscellaneous shuffle lowering cleanup. No functional changes. Primarily converting the indexing loops to unsigned to be consistent across functions. llvm-svn: 147430	2012-01-02 09:17:37 +00:00
Craig Topper	53d559641f	Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also make it return false if there's not even a load at all. This makes the code better match the code in DAGCombiner that it tries to match. These two changes prevent some cases where vector_shuffles were making it to instruction selection and causing the older shuffle selection code to be triggered. Also needed to fix a bad pattern that this change exposed. This is the first step towards getting rid of the old shuffle selection support. No test cases yet because there's no way to tell whether a shuffle was handled in the legalize stage or at instruction selection. llvm-svn: 147428	2012-01-02 08:46:48 +00:00
Nadav Rotem	6c7a0e6c8b	Optimize the sequence blend(sign_extend(x)) to blend(shl(x)) since SSE blend instructions only look at the highest bit. llvm-svn: 147426	2012-01-02 08:05:46 +00:00
Rafael Espindola	b79934657c	Materialize functions whose basic blocks are used by global variables. Fixes PR11677. llvm-svn: 147425	2012-01-02 07:49:53 +00:00
Craig Topper	b910984458	Allow CRC32 instructions to be selected when AVX is enabled. llvm-svn: 147411	2012-01-01 19:51:58 +00:00
Craig Topper	1c064e0a89	Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers. llvm-svn: 147409	2012-01-01 19:40:22 +00:00
Benjamin Kramer	47aecca51a	X86Disassembler: Fix undefined behavior found by GCC 4.6 llvm-svn: 147404	2012-01-01 17:55:36 +00:00
Benjamin Kramer	9442cd01f6	PatternMatch: Introduce a matcher for instructions with the "exact" bit. Use it to simplify a few matchers. llvm-svn: 147403	2012-01-01 17:55:30 +00:00
Rafael Espindola	d3df940169	Revert 147399. It broke CodeGen/ARM/vext.ll. llvm-svn: 147400	2012-01-01 17:36:23 +00:00
Elena Demikhovsky	67f80c3432	Fixed a bug in SelectionDAG.cpp. The failure seen on win32, when i64 type is illegal. It happens on stage of conversion VECTOR_SHUFFLE to BUILD_VECTOR. The failure message is: llc: SelectionDAG.cpp:784: void VerifyNodeCommon(llvm::SDNode*): Assertion `(I->getValueType() == EltVT \|\| (EltVT.isInteger() && I->getValueType().isInteger() && EltVT.bitsLE(I->getValueType()))) && "Wrong operand type!"' failed. I added a special test that checks vector shuffle on win32. llvm-svn: 147399	2012-01-01 16:22:47 +00:00
Craig Topper	6e54ba7eee	Merge X86 SHUFPS and SHUFPD node types. llvm-svn: 147394	2011-12-31 23:50:21 +00:00
Craig Topper	d51092d93a	Add patterns for integer forms of SHUFPD/VSHUFPD with a memory load. llvm-svn: 147393	2011-12-31 23:24:49 +00:00
Craig Topper	0e796fee11	Fix typo in a SHUFPD and VSHUFPD pattern that prevented SHUFPD/VSHUFPD with a load from being selected. llvm-svn: 147392	2011-12-31 23:15:11 +00:00
Nick Lewycky	b59008c694	Make use of the exact bit when optimizing '(X >>exact 3) << 1' to eliminate the 'and' that would zero out the trailing bits, and to produce an exact shift ourselves. llvm-svn: 147391	2011-12-31 21:30:22 +00:00
Dylan Noblesmith	1c65a21ec4	VMCore: add assert for miscompile See PR11652. Trying to add this assert to setSubclassData() itself actually prevented the miscompile entirely, so it has to be here. This makes the source of the bug more obvious than the other asserts triggering later on did. llvm-svn: 147390	2011-12-31 13:58:58 +00:00
Bruno Cardoso Lopes	cd1d447d62	Cleanup Mips code and rename some variables. Patch by Jack Carter llvm-svn: 147383	2011-12-30 21:09:41 +00:00
Bruno Cardoso Lopes	d5b2834fb7	Improve Mips JIT. Implement encoder methods getJumpTargetOpValue and getBranchTargetOpValue for jmptarget and brtarget Mips tablegen operand types in the code emitter for old-style JIT. Rename the pc relative relocation for branches - new name is Mips::reloc_mips_pc16. Patch by Sasa Stankovic llvm-svn: 147382	2011-12-30 21:04:30 +00:00
Craig Topper	a5d1fc2cc7	Make FMA4 imply AVX so that YMM registers would be available. Necessitates removing from Bulldozer CPU types since it would enable AVX code generation implicitly. Also make SSE4A imply SSE3. Without some level of SSE implied, XMM registers wouldn't be legal. llvm-svn: 147369	2011-12-30 07:16:00 +00:00
Craig Topper	2ba766ae84	Add disassembler support for VPERMIL2PD and VPERMIL2PS. llvm-svn: 147368	2011-12-30 06:23:39 +00:00
Craig Topper	03a0beda88	Add FMA4 instructions to disassembler. llvm-svn: 147367	2011-12-30 05:20:36 +00:00
Craig Topper	cd93de93fa	Separate the concept of having memory access in operand 4 from the concept of having the W bit set for XOP instructons. Removes ORing W-bits in the encoder and will similarly simplify the disassembler implementation. llvm-svn: 147366	2011-12-30 04:48:54 +00:00
Craig Topper	c0f9bcb5d5	Combine FMA4 SS/SD patterns with the instruction definitions. llvm-svn: 147365	2011-12-30 03:33:59 +00:00
Craig Topper	51fe43fcd9	Combine FMA4 PS/PD patterns with the instruction definitions. llvm-svn: 147364	2011-12-30 03:17:15 +00:00
Craig Topper	6c08930c5e	Change FMA4 memory forms to use memopv* instead of alignedloadv*. No need to force alignment on these instructions. Add a couple testcases for memory forms. llvm-svn: 147361	2011-12-30 02:18:36 +00:00
Craig Topper	2ca79b9d4b	Fix load size for FMA4 SS/SD instructions. They need to use f32 and f64 size, but with the special handling to be compatible with the intrinsic expecting a vector. Similar handling is already used elsewhere. llvm-svn: 147360	2011-12-30 01:49:53 +00:00
Hal Finkel	692d1fb355	Cleanup stack/frame register define/kill states. This fixes two bugs: 1. The ST*UX instructions that store and update the stack pointer did not set define/kill on R1. This became a problem when I activated post-RA scheduling (and had incorrectly adjusted the Frames-large test). 2. eliminateFrameIndex did not kill its scavenged temporary register, and this could cause the scavenger to exhaust all available registers (and its emergency spill slot) when there were a lot of CR values to spill. The 2010-02-12-saveCR test has been adjusted to check for this. llvm-svn: 147359	2011-12-30 00:34:00 +00:00
Rafael Espindola	4ea99816ef	Implement cfi_restore. Patch by Brian Anderson! llvm-svn: 147356	2011-12-29 21:43:03 +00:00
Rafael Espindola	03dbffd8ce	Rename Remember and Restore to RememberState and RestoreState for consistency. llvm-svn: 147354	2011-12-29 21:09:08 +00:00
Craig Topper	d773607eee	Fix execution domains for PS/PD FMA3 instructions. Add SS/SD forms o FMA3 instructions. llvm-svn: 147353	2011-12-29 20:43:40 +00:00
Rafael Espindola	ef4aa35164	Implement .cfi_escape. Patch by Brian Anderson! llvm-svn: 147352	2011-12-29 20:24:47 +00:00
Craig Topper	8cab06a214	Expose FMA3 instructions to the disassembler. llvm-svn: 147351	2011-12-29 20:03:14 +00:00
Craig Topper	e1bd05128e	Make FMA3 imply AVX needs to be enabled. Particularly because 256-bit types aren't valid unless AVX is enabled. llvm-svn: 147349	2011-12-29 19:46:19 +00:00
Craig Topper	dd286a5201	Change XOP detection to use the correct CPUID bit instead of using the FMA4 bit. llvm-svn: 147348	2011-12-29 19:25:56 +00:00
Craig Topper	a060afb5ba	Add FeaturePOPCNT to all CPU types that lost it was removed from SSE42/SSE4A in r147339. llvm-svn: 147347	2011-12-29 18:47:31 +00:00
Craig Topper	97f05c5768	Mark non-VEX forms of PCLMUL instructions as requiring SSE2 to be enabled along with CLMUL. That's required for the XMM registers to be valid for integer data. Doesn't change any behavior since the CLMUL instructions don't have patterns yet. llvm-svn: 147345	2011-12-29 18:08:36 +00:00
Craig Topper	1559123c77	Mark non-VEX forms of AES instructions as requiring SSE2 to be enabled along with AES. Since that's required for the XMM registers to be valid for integer data. Doesn't change any behavior though since you can't use an intrinsic with an illegal type anyway. Just makes it consistent with the VEX forms. llvm-svn: 147344	2011-12-29 18:00:08 +00:00
Craig Topper	9e61291bf5	Remove the separate explicit AES instruction patterns. They are equivalent to the patterns specified by the instructions. Also remove unnecessary bitconverts from the AES patterns. llvm-svn: 147342	2011-12-29 17:41:56 +00:00
Craig Topper	7bd3305f3e	Make SSE42 and SSE4A not imply POPCNT. POPCNT should be able to be disabled on its own without disabling SSE4.2 or SSE4A. llvm-svn: 147339	2011-12-29 15:51:45 +00:00
Craig Topper	0fdf720ded	Make LowerBUILD_VECTOR keep node vector types consistent when creating MOVL for v16i16 and v32i8. llvm-svn: 147337	2011-12-29 03:34:54 +00:00
Craig Topper	862c9b65be	Remove some elses after returns. llvm-svn: 147336	2011-12-29 03:20:51 +00:00
Craig Topper	274e20a499	Remove trailing spaces. Fix an assert to use && instead of \|\| before string. Add same assert on similar code path. llvm-svn: 147335	2011-12-29 03:09:33 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Eli Friedman	3a01ddb7e9	Fix type-checking for load transformation which is not legal on floating-point types. PR11674. llvm-svn: 147323	2011-12-28 21:24:44 +00:00
Nadav Rotem	3c3dd6e588	PR11662. Promotion of the mask operand needs to be done using PromoteTargetBoolean, and not padded with garbage. llvm-svn: 147309	2011-12-28 13:08:20 +00:00
Elena Demikhovsky	b3515a8d4b	Fixed a bug in LowerVECTOR_SHUFFLE and LowerBUILD_VECTOR. Matching MOVLP mask for AVX (265-bit vectors) was wrong. The failure was detected by conformance tests. llvm-svn: 147308	2011-12-28 08:14:01 +00:00
Nick Lewycky	8640fdf0b7	Demystify this comment. llvm-svn: 147307	2011-12-28 06:57:32 +00:00
Benjamin Kramer	46236ee5cf	Switch StringMap from an array of structures to a structure of arrays. - -25% memory usage of the main table on x86_64 (was wasted in struct padding). - no significant performance change. llvm-svn: 147294	2011-12-27 20:35:07 +00:00
Nick Lewycky	398255e70c	Use false not zero, as a bool. llvm-svn: 147292	2011-12-27 18:27:22 +00:00
Nick Lewycky	a8e84fb56b	Turn cos(-x) into cos(x). Patch by Alexander Malyshev! llvm-svn: 147291	2011-12-27 18:25:50 +00:00
Benjamin Kramer	b668401b2e	Clean up some Release build warnings. llvm-svn: 147289	2011-12-27 11:41:05 +00:00
Craig Topper	df34d152bd	Add handling of x86_avx2_pmovmskb to computeMaskedBitsForTargetNode for consistency. Add comments and an assert for BMI instructions to PerformXorCombine since the enabling of the combine is conditional on it, but the function itself isn't. llvm-svn: 147287	2011-12-27 06:27:23 +00:00
Nick Lewycky	c554a9b58e	Teach simplifycfg to recompute branch weights when merging some branches, and to discard weights when appropriate. Still more to do (and a new TODO), but it's a start! llvm-svn: 147286	2011-12-27 04:31:52 +00:00
Nick Lewycky	4c131387c3	Using Inst->setMetadata(..., NULL) should be safe to remove metadata even when there is non of that type to remove. This fixes a crasher in the particular case where the instruction has metadata but no metadata storage in the context (this is only possible if the instruction has !dbg but no other metadata info). llvm-svn: 147285	2011-12-27 01:17:40 +00:00
Rafael Espindola	2b14b80b60	Fix warning. llvm-svn: 147284	2011-12-26 23:12:42 +00:00
Eli Friedman	e96286cdf2	Make sure DAGCombiner doesn't introduce multiple loads from the same memory location. PR10747, part 2. llvm-svn: 147283	2011-12-26 22:49:32 +00:00
Nick Lewycky	8d302df4a4	Update the branch weight metadata when reversing the order of a branch. llvm-svn: 147280	2011-12-26 20:54:14 +00:00
Nick Lewycky	e87d54c817	Sort includes, canonicalize whitespace, fix typos. No functionality change. llvm-svn: 147279	2011-12-26 20:37:40 +00:00
Nadav Rotem	c1faeac410	Fix a typo in the widening of vectors in PromoteIntRes. Patch by Shemer Anat. llvm-svn: 147272	2011-12-25 20:01:38 +00:00
Venkatraman Govindaraju	1fc8263b4d	Sparc: Implement emitFrameIndexDebugValue and getDebugValue Location hooks. llvm-svn: 147269	2011-12-25 18:50:24 +00:00
Rafael Espindola	2d3dac3e87	Remove unused variables. llvm-svn: 147261	2011-12-25 01:20:19 +00:00
Benjamin Kramer	b16bd77bd2	InstCombine: Add a combine that turns (2^n)-1 ^ x back into (2^n)-1 - x iff x is smaller than 2^n and it fuses with a following add. This was intended to undo the sub canonicalization in cases where it's not profitable, but it also finds some cases on it's own. llvm-svn: 147256	2011-12-24 17:31:53 +00:00
Benjamin Kramer	4ee5747fdd	ComputeMaskedBits: Make knownzero computation more aggressive for ctlz with undef zero. unsigned foo(unsigned x) { return 31 - __builtin_clz(x); } now compiles into a single "bsrl" instruction on x86. llvm-svn: 147255	2011-12-24 17:31:46 +00:00
Benjamin Kramer	010337c838	InstCombine: Canonicalize (2^n)-1 - x into (2^n)-1 ^ x iff x is known to be smaller than 2^n. This has the obvious advantage of being commutable and is always a win on x86 because const - x wastes a register there. On less weird architectures this may lead to a regression because other arithmetic doesn't fuse with it anymore. I'll address that problem in a followup. llvm-svn: 147254	2011-12-24 17:31:38 +00:00
Rafael Espindola	a56ab0ede7	Section relative fixups are a coff concept, not a x86 one. Replace the x86 specific reloc_coff_secrel32 with a generic FK_SecRel_4. llvm-svn: 147252	2011-12-24 14:47:52 +00:00
Chandler Carruth	a3d54fe0ae	Use standard promotion for i8 CTTZ nodes and i8 CTLZ nodes when the LZCNT instructions are available. Force promotion to i32 to get a smaller encoding since the fix-ups necessary are just as complex for either promoted type We can't do standard promotion for CTLZ when lowering through BSR because it results in poor code surrounding the 'xor' at the end of this instruction. Essentially, if we promote the entire CTLZ node to i32, we end up doing the xor on a 32-bit CTLZ implementation, and then subtracting appropriately to get back to an i8 value. Instead, our custom logic just uses the knowledge of the incoming size to compute a perfect xor. I'd love to know of a way to fix this, but so far I'm drawing a blank. I suspect the legalizer could be more clever and/or it could collude with the DAG combiner, but how... ;] llvm-svn: 147251	2011-12-24 12:12:34 +00:00
Chandler Carruth	38ce24455d	Add systematic testing for cttz as well, and fix the bug I spotted by inspection earlier. llvm-svn: 147250	2011-12-24 11:46:10 +00:00
Benjamin Kramer	767bbe48c1	Chandler fixed this. llvm-svn: 147247	2011-12-24 11:23:32 +00:00
Chandler Carruth	c9fcde2347	Expand more when we have a nice 'tzcnt' instruction, to avoid generating 'bsf' instructions here. This one is actually debatable to my eyes. It's not clear that any chip implementing 'tzcnt' would have a slow 'bsf' for any reason, and unless EFLAGS or a zero input matters, 'tzcnt' is just a longer encoding. Still, this restores the old behavior with 'tzcnt' enabled for now. llvm-svn: 147246	2011-12-24 11:11:38 +00:00
Chandler Carruth	7e9453e916	Switch the lowering of CTLZ_ZERO_UNDEF from a .td pattern back to the X86ISelLowering C++ code. Because this is lowered via an xor wrapped around a bsr, we want the dagcombine which runs after isel lowering to have a chance to clean things up. In particular, it is very common to see code which looks like: (sizeof(x)8 - 1) ^ __builtin_clz(x) Which is trying to compute the most significant bit of 'x'. That's actually the value computed directly by the 'bsr' instruction, but if we match it too late, we'll get completely redundant xor instructions. The more naive code for the above (subtracting rather than using an xor) still isn't handled correctly due to the dagcombine getting confused. Also, while here fix an issue spotted by inspection: we should have been expanding the zero-undef variants to the normal variants when there is an 'lzcnt' instruction. Do so, and test for this. We don't want to generate unnecessary 'bsr' instructions. These two changes fix some regressions in encoding and decoding benchmarks. However, there is still a lot* to be improve on in this type of code. llvm-svn: 147244	2011-12-24 10:55:54 +00:00
Jakob Stoklund Olesen	103318e9ea	Fix Comments. llvm-svn: 147238	2011-12-24 04:17:01 +00:00
Akira Hatanaka	1cf7576707	Add MachineMemOperands to instructions generated in storeRegToStackSlot or loadRegFromStackSlot. llvm-svn: 147235	2011-12-24 03:11:18 +00:00
Akira Hatanaka	6f54a46133	Detect unaligned loads/stores that have been added for Mips64 support. llvm-svn: 147234	2011-12-24 03:07:37 +00:00
Akira Hatanaka	695d113adc	If target ABI is N64, LEA should be daddiu. llvm-svn: 147232	2011-12-24 02:59:27 +00:00
Rafael Espindola	908d2ed14e	Move x86 specific bits of the COFF writer to lib/Target/X86. llvm-svn: 147231	2011-12-24 02:14:02 +00:00
Rafael Espindola	b120ea2b92	Define trivial destructor inline. llvm-svn: 147230	2011-12-24 01:53:13 +00:00
Rafael Espindola	a2da8aa505	Make GetRelocType pure virtual. llvm-svn: 147229	2011-12-24 01:36:25 +00:00
Nick Lewycky	d9d1de4f69	Fix typo "infinte". llvm-svn: 147226	2011-12-23 23:49:25 +00:00
Mon P Wang	5d44a4332a	When not destroying the source, the linker is not remapping the types. Added support to CloneFunctionInto to allow remapping for this case. llvm-svn: 147217	2011-12-23 02:18:32 +00:00
Jakob Stoklund Olesen	0965585cb1	Experimental support for aligned NEON spills. ARM targets with NEON units have access to aligned vector loads and stores that are potentially faster than unaligned operations. Add support for spilling the callee-saved NEON registers to an aligned stack area using 16-byte aligned NEON loads and store. This feature is off by default, controlled by an -align-neon-spills command line option. llvm-svn: 147211	2011-12-23 00:36:18 +00:00
Bob Wilson	1a74de9504	Add variants of the dispatchsetup pseudo for Thumb and !VFP. <rdar://10620138> My change r146949 added register clobbers to the eh_sjlj_dispatchsetup pseudo instruction, but on Thumb1 some of those registers cannot be used. This caused massive failures on the testsuite when compiling for Thumb1. While fixing that, I noticed that the eh_sjlj_setjmp instruction has a "nofp" variant, and I realized that dispatchsetup needs the same thing, so I have added that as well. llvm-svn: 147204	2011-12-22 23:39:48 +00:00
Dylan Noblesmith	f3b1760496	TableGen: add a comment llvm-svn: 147199	2011-12-22 23:16:09 +00:00
Dylan Noblesmith	345b7430a9	try to fix MSVC build llvm-svn: 147198	2011-12-22 23:08:39 +00:00
Dylan Noblesmith	9e5b178ecc	drop unneeded config.h includes llvm-svn: 147197	2011-12-22 23:04:07 +00:00
Chad Rosier	00bbedff03	Fix 80-column violations. llvm-svn: 147192	2011-12-22 22:35:21 +00:00
Rafael Espindola	e61724aa00	Move all the dependencies on X86FixupKinds.h to a single method in preparation to moving it to lib/Target/X86. llvm-svn: 147190	2011-12-22 22:21:47 +00:00
Jim Grosbach	ea2319112f	ARM VFP assembly parsing and encoding for VCVT(float <--> fixed point). rdar://10558523 llvm-svn: 147189	2011-12-22 22:19:05 +00:00
Bob Wilson	268d2599e0	Add missing usesCustomInserter flag on Int_eh_sjlj_setjmp_nofp. Noticed by inspection; I don't have a testcase for this. llvm-svn: 147188	2011-12-22 22:12:44 +00:00
Jim Grosbach	c4d8d2f155	Tidy up. Use predicate function a bit more liberally. llvm-svn: 147184	2011-12-22 22:02:35 +00:00
Rafael Espindola	6ca42c5be3	Fix incorrect relocation generation. Patch by Kristof Beyls. Fixes PR11214. llvm-svn: 147180	2011-12-22 21:36:43 +00:00
Chad Rosier	3ba90a1655	Add the actual code for r147175. llvm-svn: 147176	2011-12-22 21:10:46 +00:00
Jim Grosbach	f0d25117c6	ARM VFP add encoding of the bitcount to fixed-point<-->floating point. insns. The value from the operands isn't right yet, but we weren't encoding it at all previously. The parser needs to twiddle the values when building the instruction. Partial for: rdar://10558523 llvm-svn: 147170	2011-12-22 19:55:21 +00:00
Jim Grosbach	b65dd04923	Remove some bogus comments. llvm-svn: 147169	2011-12-22 19:45:01 +00:00
Jim Grosbach	489ed5929e	ARM pre-UAL aliases. fcmp[sd]. llvm-svn: 147158	2011-12-22 19:20:45 +00:00
Rafael Espindola	250096233b	Fix an incomplete refactoring of the ppc backend. Thanks to rdivacky for reporting it. It does need some some tests... llvm-svn: 147154	2011-12-22 18:38:06 +00:00
Jim Grosbach	12ccf45bbb	ARM assembler should accept shift-by-zero for any shifted-immediate operand. Just treat it as-if the shift wasn't there at all. 'as' compatibility. rdar://10604767 llvm-svn: 147153	2011-12-22 18:04:04 +00:00
Jim Grosbach	21488b8839	ARM assembly parser canonicallize on 'lsl' for shift-by-zero form. llvm-svn: 147152	2011-12-22 17:37:00 +00:00
Jim Grosbach	3794d82af5	Tidy up. Trailing whitespace. llvm-svn: 147151	2011-12-22 17:17:10 +00:00
Jim Grosbach	62bffd8827	Nuke invalid comment from copy/paste. llvm-svn: 147150	2011-12-22 17:04:50 +00:00
Benjamin Kramer	f1fd6e394d	Give string constants generated by IRBuilder private linkage. Fixes PR11640. llvm-svn: 147144	2011-12-22 14:22:14 +00:00
Chandler Carruth	b024aa021d	Make the unreachable probability much much heavier. The previous probability wouldn't be considered "hot" in some weird loop structures or other compounding probability patterns. This makes it much harder to confuse, but isn't really a principled fix. I'd actually like it if we could model a zero probability, as it would make this much easier to reason about. Suggestions for how to do this better are welcome. llvm-svn: 147142	2011-12-22 09:26:37 +00:00
Rafael Espindola	29abd977de	Kill the monstrosity that was ELFObjectWriter.h. llvm-svn: 147136	2011-12-22 03:38:00 +00:00
Rafael Espindola	34a68afc05	Misc cleanups. llvm-svn: 147135	2011-12-22 03:24:43 +00:00
Eli Friedman	2aae94fa70	Fix APInt::rotl and APInt::rotr so that they work correctly. Found while writing some code that tried to use them. llvm-svn: 147134	2011-12-22 03:15:35 +00:00
Rafael Espindola	1dc45d8df4	Move the Mips only bits of the ELF writer to lib/Target/Mips. llvm-svn: 147133	2011-12-22 03:03:17 +00:00
Rafael Espindola	84d00f11cd	Make the virtual methods in ARMELFObjectWriter public. llvm-svn: 147132	2011-12-22 02:58:12 +00:00
Chad Rosier	1b7e2baf47	Speculatively revert r146578 to determine if it is the cause of a number of performance regressions (both execution-time and compile-time) on our nightly testers. Original commit message: Fix for bug #11429: Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 147131	2011-12-22 02:40:57 +00:00
Rafael Espindola	cc369ac0a2	Move the MBlaze ELF writer bits to lib/Target/MBlaze. llvm-svn: 147129	2011-12-22 02:28:24 +00:00
Pete Cooper	1c3b1efa58	Hoisted some loop invariant smallvector lookups out of a MachineLICM loop llvm-svn: 147127	2011-12-22 02:13:25 +00:00
Rafael Espindola	428b9ee036	Fix cmake. llvm-svn: 147126	2011-12-22 02:06:17 +00:00
Pete Cooper	1eed5b51e8	Changed MachineLICM to use a worklist list MachineCSE instead of recursion. Fixes <rdar://problem/10584116> llvm-svn: 147125	2011-12-22 02:05:40 +00:00
Rafael Espindola	38a400df3b	Move PPC bits to lib/Target/PowerPC. llvm-svn: 147124	2011-12-22 01:57:09 +00:00
Rafael Espindola	2da9777cef	Hopefully fix the cmake build. llvm-svn: 147121	2011-12-22 01:11:01 +00:00
Rafael Espindola	4449b21294	Fix name in comments. llvm-svn: 147119	2011-12-22 01:06:53 +00:00
Akira Hatanaka	e2eed9649e	Local dynamic TLS model for direct object output. Create the correct TLS MIPS ELF relocations. Patch by Jack Carter. llvm-svn: 147118	2011-12-22 01:05:17 +00:00
Richard Smith	32a756b7ce	Unbreak cmake build after r147115. llvm-svn: 147117	2011-12-22 01:03:35 +00:00
Rafael Espindola	a0124055b1	Move the ARM specific parts of the ELF writer to Target/ARM. llvm-svn: 147115	2011-12-22 00:37:50 +00:00
Rafael Espindola	6faa1533fb	getEFlags is const. llvm-svn: 147114	2011-12-22 00:21:50 +00:00
Jim Grosbach	2b80dad572	ARM NEON mnemonic aliase for vrecpeq. llvm-svn: 147109	2011-12-21 23:52:37 +00:00
Jim Grosbach	7869d8c01e	ARM VFP optional data type on VMOV GPR<-->SPR. llvm-svn: 147104	2011-12-21 23:24:15 +00:00
Jim Grosbach	260b4b336a	ARM NEON optional data type on VSWP instructions. llvm-svn: 147103	2011-12-21 23:09:28 +00:00
Jim Grosbach	a50e24fcb3	ARM NEON mnemonic aliases for vzipq and vswpq. llvm-svn: 147102	2011-12-21 23:04:33 +00:00
Jakub Staszak	9061616f9e	Revert patch from 147090. There is not point to make code less readable if we don't get any serious benefit there. llvm-svn: 147101	2011-12-21 23:02:08 +00:00
Jim Grosbach	1152cc0cad	ARM asm parser should be more lenient w/ .thumb_func directive. Rather than require the symbol to be explicitly an argument of the directive, allow it to look ahead and grab the symbol from the next non-whitespace line. rdar://10611140 llvm-svn: 147100	2011-12-21 22:30:16 +00:00
Dan Gohman	51c81685a8	Fix a copy+pasto. No testcase, because the symptoms of dereferencing an invalid iterator aren't reproducible. rdar://10614085. llvm-svn: 147098	2011-12-21 21:43:50 +00:00
Jim Grosbach	8c59bbc1ed	Thumb2 assembly parsing of 'mov rd, rn, rrx'. Maps to the RRX instruction. Missed this case earlier. rdar://10615373 llvm-svn: 147096	2011-12-21 21:04:19 +00:00
Chad Rosier	3172488cc0	Fix 80-column violations. llvm-svn: 147095	2011-12-21 20:59:09 +00:00
Jim Grosbach	b3ef713e44	Thumb2 assembly parsing of 'mov(register shifted register)' aliases. These map to the ASR, LSR, LSL, ROR instruction definitions. rdar://10615373 llvm-svn: 147094	2011-12-21 20:54:00 +00:00
Nick Lewycky	c186d07bbe	Continue counting intrinsics as instructions (except when they aren't, such as debug info) and for being vector operations. Fixes regression from r147037. llvm-svn: 147093	2011-12-21 20:26:03 +00:00
Nick Lewycky	281e2747e0	Fix typo and spacing, no functionality change. llvm-svn: 147092	2011-12-21 20:21:55 +00:00
Jakub Staszak	df5133455f	- Change a few operator[] to lookup which is cheaper. - Add some constantness. llvm-svn: 147090	2011-12-21 20:18:54 +00:00
Lang Hames	e49fbd0755	Oops - LiveIntervalUnion.cpp file does use std::find. Moving STL header include to LiveIntervalUnion.cpp file. llvm-svn: 147089	2011-12-21 20:16:11 +00:00
Lang Hames	93176d72e7	Remove disused STL header include. llvm-svn: 147088	2011-12-21 20:12:54 +00:00
Rafael Espindola	f61ff34252	Switch from WriteEFlags to getEFlags in preparation for moving it to Target/. llvm-svn: 147087	2011-12-21 20:09:46 +00:00
Jakob Stoklund Olesen	3588a43e3a	Move common code into an MRI function. llvm-svn: 147071	2011-12-21 19:50:05 +00:00
Jim Grosbach	c80a264386	ARM NEON assmebly parsing for VLD2 to all lanes instructions. llvm-svn: 147069	2011-12-21 19:40:55 +00:00
Chad Rosier	3ede414127	No case stmt for BUILD_VECTOR in PerformDAGCombine(), so I assume this isn't necessary. Please chime in if I'm mistaken. llvm-svn: 147065	2011-12-21 19:14:52 +00:00
Chad Rosier	7248bda595	Fix a couple of copy-n-paste bugs. Noticed by George Russell! llvm-svn: 147064	2011-12-21 18:56:22 +00:00
Manuel Klimek	25eb0ac418	Changes the JSON parser to use the SourceMgr. Diagnostics are now emitted via the SourceMgr and we use MemoryBuffer for buffer management. Switched the code to make use of the trailing '0' that MemoryBuffer guarantees where it makes sense. llvm-svn: 147063	2011-12-21 18:16:39 +00:00
Rafael Espindola	b264d33854	Move the X86 specific bits of the ELF writer to the Target/X86 directory. Other targets will follow shortly. llvm-svn: 147060	2011-12-21 17:30:17 +00:00
Rafael Espindola	1ad4095d6b	Reduce the exposure of Triple::OSType in the ELF object writer. This will avoid including ADT/Triple.h in many places when the target specific bits are moved. llvm-svn: 147059	2011-12-21 17:00:36 +00:00
Rafael Espindola	9e252bf038	Small refactoring so that RelocNeedsGOT can stay in the target independent side when the target specific bits are moved to the Target directory. llvm-svn: 147053	2011-12-21 14:26:29 +00:00
Manuel Klimek	b761ff3e24	Removes unused field TheError from LLLexer. llvm-svn: 147049	2011-12-21 10:02:45 +00:00
Craig Topper	b8b1b4c1de	Remove mode specific disassembler classes and just call X86GenericDisassembler constructor with appropriate argument in the creation functions. This removes a few tables that needed to be anchored. llvm-svn: 147046	2011-12-21 08:06:52 +00:00
Craig Topper	f30188418b	Fix typo in a couple comments llvm-svn: 147045	2011-12-21 06:30:53 +00:00
Nick Lewycky	da22fc6a1d	A call to a function marked 'noinline' is not an inline candidate. The sole call site of an intrinsic is also not an inline candidate. While here, make it more obvious that this code ignores all intrinsics. Noticed by inspection! llvm-svn: 147037	2011-12-21 06:06:30 +00:00
Nick Lewycky	b4039f633c	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
Evan Cheng	dc8a1aaea6	Fix a couple of copy-n-paste bugs. Noticed by George Russell. llvm-svn: 147032	2011-12-21 03:04:10 +00:00
Jim Grosbach	7de7ab83fa	ARM assembly parsing allows constant expressions for lane indices. llvm-svn: 147028	2011-12-21 01:19:23 +00:00
Jim Grosbach	c5af54ec89	ARM NEON VLD2 assembly parsing for structure to all lanes, non-writeback. llvm-svn: 147025	2011-12-21 00:38:54 +00:00
Akira Hatanaka	964c891e61	Fix bug in zero-store peephole pattern reported in pr11615. The patch and test case were originally written by Mans Rullgard. llvm-svn: 147024	2011-12-21 00:31:10 +00:00
Akira Hatanaka	1d8efaba7e	Expand 64-bit CTLZ nodes if target architecture does not support it. Add test case for DCLO and DCLZ. llvm-svn: 147022	2011-12-21 00:20:27 +00:00
Akira Hatanaka	410ce9cb44	Expand 64-bit CTPOP and CTTZ. llvm-svn: 147021	2011-12-21 00:14:05 +00:00
Akira Hatanaka	91c052c4d8	Expand 64-bit atomic load and store. llvm-svn: 147019	2011-12-21 00:02:58 +00:00
Akira Hatanaka	4706ac9715	Add definition of DSBH (Double Swap Bytes within Halfwords) and DSHD (Double Swap Halfwords within Doublewords). Add a pattern which replaces 64-bit bswap with a DSBH and DSHD pair. llvm-svn: 147017	2011-12-20 23:56:43 +00:00
Akira Hatanaka	43c1ff4db3	Add definition of WSBH (Word Swap Bytes within Halfwords), which is an instruction supported by mips32r2, and add a pattern which replaces bswap with a ROTR and WSBH pair. WSBW is removed since it is not an instruction the current architectures support. llvm-svn: 147015	2011-12-20 23:47:44 +00:00
Akira Hatanaka	79aed157e7	64-bit uint-fp conversion nodes are expanded. llvm-svn: 147014	2011-12-20 23:40:56 +00:00
Akira Hatanaka	2bb8d068f5	Enable custom lowering DYNAMIC_STACKALLOC nodes. llvm-svn: 147013	2011-12-20 23:35:46 +00:00
Akira Hatanaka	8e2c02e2d6	Set the correct stack pointer register that should be saved or restored. llvm-svn: 147012	2011-12-20 23:28:36 +00:00
Chris Lattner	eaf9b7629a	Fix a nasty bug in the type remapping stuff that I added that is breaking kc++ on the build bot in some cases. The basic issue happens when a source module contains both a "%foo" type and a "%foo.42" type. It will see the later one, check to see if the destination module contains a "%foo" type, and it will return true... because both the source and destination modules are in the same LLVMContext. We don't want to map source types to other source types, so don't do the remapping if the mapped type came from the source module. Unfortunately, I've been unable to reduce a decent testcase for this, kc++ is pretty great that way. llvm-svn: 147010	2011-12-20 23:14:57 +00:00
Jim Grosbach	cd22e4a81e	ARM .req register name aliases are case insensitive, just like regnames. llvm-svn: 147009	2011-12-20 23:11:00 +00:00
Akira Hatanaka	cb2a85bc22	Add function MipsDAGToDAGISel::SelectMULT and factor out code that generates nodes needed for multiplication. Add code for selecting 64-bit MULHS and MULHU nodes. llvm-svn: 147008	2011-12-20 23:10:57 +00:00
Akira Hatanaka	2c8d1734f8	Fix indentation. llvm-svn: 147007	2011-12-20 22:58:01 +00:00
Akira Hatanaka	cf10f08825	64-bit data directive. llvm-svn: 147005	2011-12-20 22:52:19 +00:00
Akira Hatanaka	494fdf1499	32-to-64-bit sext_inreg pattern. llvm-svn: 147004	2011-12-20 22:40:40 +00:00
Akira Hatanaka	8756816e6f	Add 64-bit extload patterns. llvm-svn: 147003	2011-12-20 22:36:08 +00:00
Akira Hatanaka	0cee2045c9	Add patterns for matching extloads with 64-bit address. The patterns are enabled only when the target ABI is N64. llvm-svn: 147001	2011-12-20 22:33:53 +00:00
Jim Grosbach	4eda145c7f	Move comment to appropriate place. llvm-svn: 147000	2011-12-20 22:26:38 +00:00
Akira Hatanaka	dac1d48d8d	Add code in MipsDAGToDAGISel for selecting constant +0.0. MIPS64 can generate constant +0.0 with a single DMTC1 instruction. llvm-svn: 146999	2011-12-20 22:25:50 +00:00
Jakob Stoklund Olesen	b95c102c2f	Heed spill slot alignment on ARM. Use the spill slot alignment as well as the local variable alignment to determine when the stack needs to be realigned. This works now that the ARM target can always realign the stack by using a base pointer. Still respect the ARMBaseRegisterInfo::canRealignStack() function vetoing a realigned stack. Don't use aligned spill code in that case. llvm-svn: 146997	2011-12-20 22:15:04 +00:00
Akira Hatanaka	14468c6cb6	Revert part of r146995 that was accidentally commmitted. llvm-svn: 146996	2011-12-20 22:09:36 +00:00
Akira Hatanaka	4e210691c0	32-to-64-bit sign extension pattern. llvm-svn: 146995	2011-12-20 22:06:20 +00:00
Akira Hatanaka	9b9bd1cc15	Add a pattern for matching zero-store with 64-bit address. The pattern is enabled only when the target ABI is N64. llvm-svn: 146992	2011-12-20 21:50:49 +00:00
Jim Grosbach	2c59052984	ARM assembly parsing and encoding for VST2 single-element, double spaced. llvm-svn: 146990	2011-12-20 20:46:29 +00:00
Lang Hames	6cee53d06e	Fix assert condition. llvm-svn: 146987	2011-12-20 20:23:40 +00:00
Jakub Staszak	96f8c551e3	Add some constantness to BranchProbabilityInfo and BlockFrequnencyInfo. llvm-svn: 146986	2011-12-20 20:03:10 +00:00
Devang Patel	9224540efc	Add support to add named metadata operand. Patch by Andrew Wilkins! llvm-svn: 146984	2011-12-20 19:29:36 +00:00
Jim Grosbach	75e2ab5db2	ARM assembly parsing and encoding for VLD2 single-element, double spaced. llvm-svn: 146983	2011-12-20 19:21:26 +00:00
Evan Cheng	68132d8093	ARM target code clean up. Check for iOS, not Darwin where it makes sense. llvm-svn: 146981	2011-12-20 18:26:50 +00:00
Jason W Kim	135d244b56	First steps in ARM AsmParser support for .eabi_attribute and .arch (Both used for Linux gnueabi) No behavioral change yet (no tests need so far) llvm-svn: 146977	2011-12-20 17:38:12 +00:00
Elena Demikhovsky	ec7e6e0946	This is the second fix related to VZEXT_MOVL node. The failure that I see in the current version is: LLVM ERROR: Cannot select: 0x18b8f70: v4i64 = X86ISD::VZEXT_MOVL 0x18beee0 [ID=14] 0x18beee0: v4i64 = insert_subvector 0x18b8c70, 0x18b9170, 0x18b9570 [ID=13] 0x18b8c70: v4i64 = insert_subvector 0x18b9870, 0x18bf4e0, 0x18b9970 [ID=12] 0x18b9870: v4i64 = undef [ID=4] 0x18bf4e0: v2i64 = bitcast 0x18bf3e0 [ID=10] 0x18bf3e0: v4i32 = BUILD_VECTOR 0x18b9770, 0x18b9770, 0x18b9770, 0x18b9770 [ID=8] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9970: i32 = Constant<0> [ID=3] 0x18b9170: v2i64 = undef [ORD=1] [ID=1] 0x18b9570: i32 = Constant<2> [ID=5] llvm-svn: 146975	2011-12-20 13:34:28 +00:00
Chandler Carruth	24680c24d8	Begin teaching the X86 target how to efficiently codegen patterns that use the zero-undefined variants of CTTZ and CTLZ. These are just simple patterns for now, there is more to be done to make real world code using these constructs be optimized and codegen'ed properly on X86. The existing tests are spiffed up to check that we no longer generate unnecessary cmov instructions, and that we generate the very important 'xor' to transform bsr which counts the index of the most significant one bit to the number of leading (most significant) zero bits. Also they now check that when the variant with defined zero result is used, the cmov is still produced. llvm-svn: 146974	2011-12-20 11:19:37 +00:00
Manuel Klimek	fe198ced31	Fixes a potential compilation error. Pulling the template implementation into the header to guarantee that it's visible to all possible instantiations. llvm-svn: 146973	2011-12-20 11:04:23 +00:00
Manuel Klimek	47151c37b6	Pulls the implementation of skip() into JSONParser. This is the first step towards migrating more of the parser implementation into the parser class. llvm-svn: 146971	2011-12-20 10:42:52 +00:00
Manuel Klimek	f8d73192cc	Addressing style issues in JSON parser. llvm-svn: 146968	2011-12-20 09:26:26 +00:00
Chandler Carruth	e805b16e3d	Fix up the CMake build for the new files added in r146960, they're likely to stay either way that discussion ends up resolving itself. llvm-svn: 146966	2011-12-20 08:42:11 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Andrew Trick	b9aa26f8ea	LSR: Fix another corner case in expansion of postinc users. Fixes PR11571: Instruction does not dominate all uses llvm-svn: 146950	2011-12-20 01:42:24 +00:00
Bob Wilson	75f12cc3fe	Mark ARM eh_sjlj_dispatchsetup as clobbering all registers. Radar 10567930. We used to rely on the *eh_sjlj_setjmp instructions to mark that a function with setjmp/longjmp exception handling clobbers all the registers. But with the recent reorganization of ARM EH, those eh_sjlj_setjmp instructions are expanded away earlier, before PEI can see them to determine what registers to save and restore. Mark the dispatchsetup instruction in the same way, since that instruction cannot be expanded early. This also more accurately reflects when the registers are clobbered. llvm-svn: 146949	2011-12-20 01:29:27 +00:00
Jim Grosbach	e2ca9e5b5f	ARM assembly shifts by zero should be plain 'mov' instructions. "mov r1, r2, lsl #0" should assemble as "mov r1, r2" even though it's not strictly legal UAL syntax. It's a common extension and the friendly thing to do. rdar://10604663 llvm-svn: 146937	2011-12-20 00:59:38 +00:00
Chris Lattner	9eb3f00406	Now that PR11464 is fixed, reapply the patch to fix PR11464, merging types by name when we can. We still don't guarantee type name linkage but we do it when obviously the right thing to do. This makes LTO type names easier to read, for example. llvm-svn: 146932	2011-12-20 00:12:26 +00:00
Chris Lattner	5e3bd9727a	fix PR11464 by preventing the linker from mapping two different struct types from the source module onto the same opaque destination type. An opaque type can only be resolved to one thing or another after all. llvm-svn: 146929	2011-12-20 00:03:52 +00:00
Dan Gohman	94580ab375	Add basic generic CodeGen support for half. llvm-svn: 146927	2011-12-20 00:02:33 +00:00
Jim Grosbach	045b6c71a6	ARM NEON assembly aliases for VMOV<-->VMVN for i32 immediates. e.g., "vmov.i32 d4, #-118" can be assembled as "vmvn.i32 d4, #117" rdar://10603913 llvm-svn: 146925	2011-12-19 23:51:07 +00:00
Jim Grosbach	8648c10184	ARM assembly parsing and encoding support for LDRD(label). rdar://9932658 llvm-svn: 146921	2011-12-19 23:06:24 +00:00
Evan Cheng	4266a79351	Add a if-conversion optimization that allows 'true' side of a diamond to be unpredicated. That is, turn subeq r0, r1, #1 addne r0, r1, #1 into sub r0, r1, #1 addne r0, r1, #1 For targets where conditional instructions are always executed, this may be beneficial. It may remove pseudo anti-dependency in out-of-order execution CPUs. e.g. op r1, ... str r1, [r10] ; end-of-life of r1 as div result cmp r0, #65 movne r1, #44 ; raw dependency on previous r1 moveq r1, #12 If movne is unpredicated, then op r1, ... str r1, [r10] cmp r0, #65 mov r1, #44 ; r1 written unconditionally moveq r1, #12 Both mov and moveq are no longer depdendent on the first instruction. This gives the out-of-order execution engine more freedom to reorder them. This has passed entire LLVM test suite. But it has not been enabled for any ARM variant pending more performance evaluation. rdar://8951196 llvm-svn: 146914	2011-12-19 22:01:30 +00:00
Akira Hatanaka	db47e0c49d	Add patterns for matching immediates whose lower 16-bit is cleared. These patterns emit a single LUi instruction instead of a pair of LUi and ORi. llvm-svn: 146900	2011-12-19 20:21:18 +00:00
Eli Friedman	5bb6826fdc	Attempt to fix PR11607 by shuffling around which class defines which methods. llvm-svn: 146897	2011-12-19 20:06:03 +00:00
Akira Hatanaka	9e1d369e3c	Tidy up. Simplify logic. No functional change intended. llvm-svn: 146896	2011-12-19 19:52:25 +00:00
Jim Grosbach	64f4de29e0	ARM NEON two-operand aliases for VPADD. rdar://10602276 llvm-svn: 146895	2011-12-19 19:51:03 +00:00
Akira Hatanaka	2a232d81f6	Remove definitions of double word shift plus 32 instructions. Assembler or direct-object emitter should emit the appropriate shift instruction depending on the shift amount. llvm-svn: 146893	2011-12-19 19:44:09 +00:00
Jim Grosbach	e16acacc3a	ARM VFP pre-UAL mnemonic aliases for fmul[sd]. llvm-svn: 146892	2011-12-19 19:43:50 +00:00
Akira Hatanaka	c4db30e358	Remove unused predicate. llvm-svn: 146889	2011-12-19 19:32:20 +00:00
Akira Hatanaka	3c9f336361	Remove the restriction on the first operand of the add node in SelectAddr. This change reduces the number of instructions generated. For example, (load (add (sub $n0, $n1), (MipsLo got(s)))) results in the following sequence of instructions: 1. sub $n2, $n0, $n1 2. lw got(s)($n2) Previously, three instructions were needed. 1. sub $n2, $n0, $n1 2. addiu $n3, $n2, got(s) 3. lw 0($n3) llvm-svn: 146888	2011-12-19 19:28:37 +00:00
Jim Grosbach	92a939ae73	ARM VFP pre-UAL mnemonic aliases for fcpy[sd] and fdiv[sd]. llvm-svn: 146887	2011-12-19 19:02:41 +00:00
Jim Grosbach	9ae4fc035b	ARM NEON implied destination aliases for VMAX/VMIN. llvm-svn: 146885	2011-12-19 18:57:38 +00:00
Jim Grosbach	cef98cddbe	ARM NEON relax parse time diagnostics for alignment specifiers. There's more variation that we need to handle. Error checking will need to be on operand predicates. llvm-svn: 146884	2011-12-19 18:31:43 +00:00
Jim Grosbach	a7d2421603	Tidy up. llvm-svn: 146882	2011-12-19 18:11:17 +00:00
Jakob Stoklund Olesen	24159e346d	Remove a register class that can just as well be synthesized. Add the new TableGen register class synthesizer feature to the release notes. llvm-svn: 146875	2011-12-19 16:53:40 +00:00
Jakob Stoklund Olesen	8f9c6c4ad0	Handle sub-register operands in recomputeRegClass(). Now that getMatchingSuperRegClass() returns accurate results, it can be used to compute constraints imposed by instructions using a sub-register of a virtual register. This means we can recompute the register class of any virtual register by combining the constraints from all its uses. llvm-svn: 146874	2011-12-19 16:53:37 +00:00
Jakob Stoklund Olesen	c7b437ae34	Emit a getMatchingSuperRegClass() implementation for every target. Use information computed while inferring new register classes to emit accurate, table-driven implementations of getMatchingSuperRegClass(). Delete the old manual, error-prone implementations in the targets. llvm-svn: 146873	2011-12-19 16:53:34 +00:00
Jakub Staszak	1b1d523d9e	- Use getExitingBlock instead of getExitingBlocks. - Remove trailing spaces. llvm-svn: 146854	2011-12-18 21:52:30 +00:00
Benjamin Kramer	1b54835a10	Another variadics tweak. llvm-svn: 146852	2011-12-18 20:51:31 +00:00
Joerg Sonnenberger	d6cb7649d8	Allow inlining of functions with returns_twice calls, if they have the attribute themselve. llvm-svn: 146851	2011-12-18 20:35:43 +00:00
Benjamin Kramer	530b820500	Use the fancy new VariadicFunction template instead of a plain variadic function. Some compilers were complaining about passing StringRef to it. llvm-svn: 146850	2011-12-18 19:59:20 +00:00
Benjamin Kramer	32481916eb	Hexagon: Remove unused variables. llvm-svn: 146846	2011-12-18 12:00:09 +00:00
Chad Rosier	5e5bee4c52	Revert 146728 as it's causing failures on some of the external bots as well as internal nightly testers. Original commit message: By popular demand, link up types by name if they are isomorphic and one is an autorenamed version of the other. This makes the IR easier to read, because we don't end up with random renamed versions of the types after LTO'ing a large app. llvm-svn: 146838	2011-12-17 22:19:53 +00:00
Kevin Enderby	8b3deabd2d	Revert r146822 at Pete Cooper's request as it broke clang self hosting. Hope I did this correctly :) llvm-svn: 146834	2011-12-17 19:48:52 +00:00
Craig Topper	a913dde0ef	Remove an unused X86ISD node type. llvm-svn: 146833	2011-12-17 19:16:44 +00:00
Benjamin Kramer	792edd3c75	X86: Factor the bswap asm matching to be slightly less horrible to read. llvm-svn: 146831	2011-12-17 14:36:05 +00:00
Pete Cooper	eadf124d2b	SimplifyCFG now predicts some conditional branches to true or false depending on previous branch on same comparison operands. For example, if (a == b) { if (a > b) // this is false Fixes some of the issues on <rdar://problem/10554090> llvm-svn: 146822	2011-12-17 06:32:38 +00:00
Evan Cheng	903231bc58	Fix a CPSR liveness tracking bug introduced when I converted IT block to bundle. llvm-svn: 146805	2011-12-17 01:25:34 +00:00
Pete Cooper	ebf98c1304	Refactor code used in InstCombine::FoldAndOfICmps to new file. This will be used by SimplifyCfg in a later commit. llvm-svn: 146803	2011-12-17 01:20:32 +00:00
Rafael Espindola	d3df3d3527	Add back the MC bits of 126425. Original patch by Nathan Jeffords. I added the asm parsing and testcase. llvm-svn: 146801	2011-12-17 01:14:52 +00:00
Lang Hames	da07b3ad42	Make sure that the lower bits on the VSELECT condition are properly set. llvm-svn: 146800	2011-12-17 01:08:46 +00:00
Jakob Stoklund Olesen	465cdf3ba4	Preserve more memory operands in ARMExpandPseudo. I don't think this affects anything but verbose assembly. llvm-svn: 146787	2011-12-17 00:07:02 +00:00
Dan Gohman	518cda42b9	The powers that be have decided that LLVM IR should now support 16-bit "half precision" floating-point with a first-class type. This patch adds basic IR support (but not codegen support). llvm-svn: 146786	2011-12-17 00:04:22 +00:00
Eric Christopher	27886c6c1e	When recursing for the original size of a type, stop if we are at a pointer or a reference type - we actually just want the size of the pointer then for that. Fixes rdar://10335756 llvm-svn: 146785	2011-12-16 23:42:45 +00:00
Eric Christopher	da011dd0e3	Resolve part of a fixme and add a new one. llvm-svn: 146784	2011-12-16 23:42:42 +00:00
Eric Christopher	03faed3eac	Add a fixme here. llvm-svn: 146783	2011-12-16 23:42:38 +00:00
Eric Christopher	365d083585	Extraneous whitespace and 80-col. llvm-svn: 146780	2011-12-16 23:42:31 +00:00
Jakob Stoklund Olesen	9790187b6c	Fix off-by-one error in bucket sort. The bad sorting caused a misaligned basic block when building 176.vpr in ARM mode. <rdar://problem/10594653> llvm-svn: 146767	2011-12-16 23:00:05 +00:00
Dylan Noblesmith	1c419ff50d	APInt: update asserts for base-36 Hexatridecimal was added in r139695. And fix the unittest that now triggers the assert. llvm-svn: 146754	2011-12-16 20:36:31 +00:00
Jakob Stoklund Olesen	5af144809e	Don't adjust for alignment padding in OffsetIsInRange. This adjustment is already included in the block offsets computed by BasicBlockInfo, and adjusting again here can cause the pass to loop. When CreateNewWater splits a basic block, OffsetIsInRange would reject the new CPE on the next pass because of the too conservative alignment adjustment. This caused the block to be split again, and so on. llvm-svn: 146751	2011-12-16 19:10:00 +00:00
Benjamin Kramer	9ca2e7293b	Hexagon: Fix a nasty order-of-initialization bug. Reenable the tests. llvm-svn: 146750	2011-12-16 19:08:59 +00:00
Devang Patel	78847f0bbe	In DICompositeType, referenced to derived type is either metadata or null. llvm-svn: 146744	2011-12-16 17:51:31 +00:00
Jakob Stoklund Olesen	2a05f691ab	Note ARM constant island alignment in the release notes. The command line option should be removed, but not until the feature has gotten a lot of testing. The ARMConstantIslandPass tends to have subtle bugs that only show up after a while. llvm-svn: 146739	2011-12-16 16:07:41 +00:00
Manuel Klimek	2c899a181c	Adds a JSON parser and a benchmark (json-bench) to catch performance regressions. llvm-svn: 146735	2011-12-16 13:09:10 +00:00
Chris Lattner	3fdf98c60f	By popular demand, link up types by name if they are isomorphic and one is an autorenamed version of the other. This makes the IR easier to read, because we don't end up with random renamed versions of the types after LTO'ing a large app. llvm-svn: 146728	2011-12-16 08:36:07 +00:00
Craig Topper	a4d411cb1b	Don't try to match 'unpackl/h v, v' for 32xi8 and 16xi16 when only AVX1 is supported. Fix 'unpackh v, v' for 256-bit types to understand 128-bit lanes. llvm-svn: 146726	2011-12-16 08:06:31 +00:00
NAKAMURA Takumi	93d990bd61	Target/Hexagon: Fix CMake build. llvm-svn: 146724	2011-12-16 06:21:02 +00:00
Andrew Trick	ca3417e932	Avoid a confusing assert for silly options: -unroll-runtime -unroll-count=1. No need for an explicit test case for an unsupported combination of options. llvm-svn: 146721	2011-12-16 02:03:48 +00:00
Jim Grosbach	4a29971f02	ARM NEON aliases for vmovq.f* llvm-svn: 146714	2011-12-16 00:12:22 +00:00
Jim Grosbach	66886253a7	Thumb2 ADR assembly parsing w/o the .w suffix. llvm-svn: 146710	2011-12-15 23:52:17 +00:00
Eli Friedman	64944090ff	Make sure we correctly note the existence of an i8 immediate for vblendvps and friends, so we compute fixups correctly. PR11586. llvm-svn: 146709	2011-12-15 23:46:18 +00:00
Nick Lewycky	c9e935c7e2	Move parts of lib/Target that use CodeGen into lib/CodeGen. llvm-svn: 146702	2011-12-15 22:58:58 +00:00
Eli Friedman	c9bf1b1bff	Make check a bit more strict so we don't call ARM_AM::getFP32Imm with a value that isn't a 32-bit value. (This is just to be safe; I don't think this actually causes any issues in practice.) llvm-svn: 146700	2011-12-15 22:56:53 +00:00
Jim Grosbach	a47294e24d	ARM NEON VCLE is an alias for VCGE w/ the source operands reversed. llvm-svn: 146699	2011-12-15 22:56:33 +00:00
Kostya Serebryany	7a9eb49a47	[asan] add the name of the module to the description of a global variable. This improves the readability of global-buffer-overflow reports. llvm-svn: 146698	2011-12-15 22:55:55 +00:00
Tony Linthicum	b3705e0b9e	Add MCTargetDesc library to Hexagon target llvm-svn: 146692	2011-12-15 22:29:08 +00:00
Jim Grosbach	4a5c887370	ARM NEON VTBL/VTBX assembly parsing and encoding. llvm-svn: 146691	2011-12-15 22:27:11 +00:00
Jakob Stoklund Olesen	cba8e8c3e0	Enable proper constant island alignment by default. The code size increase is tiny (< 0.05%) because so little code uses 16-byte constant pool entries. llvm-svn: 146690	2011-12-15 22:14:45 +00:00
Chad Rosier	41dbf59e12	Add missing zmovl AVX patterns which were causing crashes. Patch by Elena Demikhovsky <elena.demikhovsky@intel.com>! llvm-svn: 146689	2011-12-15 22:11:31 +00:00
Kostya Serebryany	cd1aba8b4d	[asan] fix a bug (issue 19) where dlclose and the following mmap caused a false positive. compiler part. llvm-svn: 146688	2011-12-15 21:59:03 +00:00
Jim Grosbach	c2f16a3499	Silence warning. llvm-svn: 146686	2011-12-15 21:54:55 +00:00
Jim Grosbach	2f50e92f40	ARM NEON two-register double spaced register list parsing support. llvm-svn: 146685	2011-12-15 21:44:33 +00:00
Chad Rosier	75ed9dcbc6	Fix assert in LowerBUILD_VECTOR for v16i16 type on AVX. Patch by Elena Demikhovsky <elena.demikhovsky@intel.com>! llvm-svn: 146684	2011-12-15 21:34:44 +00:00
Lang Hames	c44b5e469b	Fix VSELECT operand order. Was previously backwards, causing bogus vector shift results - <rdar://problem/10559581>. llvm-svn: 146671	2011-12-15 18:57:27 +00:00

... 9 10 11 12 13 ...

52778 Commits