llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	8c139a5125	Clear kill flags before propagating a copy. The live range of the source register may be extended when a redundant copy is eliminated. Make sure any kill flags between the two copies are cleared. This fixes PR11765. llvm-svn: 149069	2012-01-26 17:52:15 +00:00
Jim Grosbach	c8f2b7877b	Tidy up. Fix mismatched return types for error handling. llvm-svn: 149062	2012-01-26 15:56:45 +00:00
James Molloy	6685c08e5f	Add support for the R_ARM_TARGET1 relocation, which should be given to relocations applied to all C++ constructors and destructors. This enables the linker to match concrete relocation types (absolute or relative) with whatever library or C++ support code is being linked against. llvm-svn: 149057	2012-01-26 09:25:43 +00:00
Victor Umansky	5f29b0e57b	Fix for the following bug in AVX codegen for double-to-int conversions: - "fptosi" and "fptoui" IR instructions are defined with round-to-zero rounding mode. - Currently for AVX mode for <4xdouble> and <8xdouble> the "VCVTPD2DQ.128" and "VCVTPD2DQ.256" instructions are selected (for "fp_to_sint" DAG node operation) by AVX codegen. However they use round-to-nearest-even rounding mode. - Consequently, the conversion produces incorrect numbers. The fix is to replace selection of VCVTPD2DQ instructions with VCVTTPD2DQ instructions. The latter use truncate (i.e. round-to-zero) rounding mode. As "fp_to_sint" DAG node operation is used only for lowering of "fptosi" and "fptoui" IR instructions, the fix in X86InstrSSE.td definition file doesn't have an impact on other LLVM flows. The patch includes changes in the .td file, LIT test for the changes and a fix in a legacy LIT test (which produced asm code conflicting with LLVM IR spec). llvm-svn: 149056	2012-01-26 08:51:39 +00:00
Craig Topper	86e44bc829	Add HasXOP predicate check covering a bunch of XOP intrinsic patterns. llvm-svn: 149054	2012-01-26 07:51:55 +00:00
Craig Topper	1c0e22f57a	Fix AVX vs SSE patterns ordering issue for VPCMPESTRM and VPCMPISTRM. llvm-svn: 149053	2012-01-26 07:31:30 +00:00
Craig Topper	b91760eff8	Remove some more patterns by custom lowering intrinsics to target specific nodes. llvm-svn: 149052	2012-01-26 07:18:03 +00:00
Chris Lattner	34d9a17778	unbreak test/Bitcode/shuffle.ll. llvm-svn: 149033	2012-01-26 03:10:45 +00:00
Chris Lattner	8722fe5a24	simplify by using ShuffleVectorInst::getMaskValue. llvm-svn: 149029	2012-01-26 02:54:54 +00:00
Chris Lattner	cf12970bd0	eliminate the Constant::getVectorElements method. There are better (and more robust) ways to do what it was doing now. Also, add static methods for decoding a ShuffleVector mask. llvm-svn: 149028	2012-01-26 02:51:13 +00:00
Chris Lattner	fa77500d96	Continue improving support for ConstantDataAggregate, and use the new methods recently added to (sometimes greatly!) simplify code. llvm-svn: 149024	2012-01-26 02:32:04 +00:00
Chris Lattner	f14a67f5d3	Add a ConstantDataVector::getSplatValue() method, for parity with ConstantVector. Fix some outright bugs in the implementation of ConstantArray and Constant struct, which would cause us to not make one big UndefValue when asking for an array/struct with all undef elements. Enhance Constant::isAllOnesValue to work with ConstantDataVector. llvm-svn: 149021	2012-01-26 02:31:22 +00:00
Chris Lattner	8326bd8e10	some general cleanup, using new methods and tidying up old code. llvm-svn: 149006	2012-01-26 00:42:34 +00:00
Chris Lattner	3dbad40341	fix pasto in the new (and still unused) ShuffleVectorInst::getShuffleMask method. llvm-svn: 149005	2012-01-26 00:41:50 +00:00
Chris Lattner	154aabc0c4	add StructType helpers too. llvm-svn: 149000	2012-01-26 00:06:44 +00:00
Chris Lattner	40a279e1c5	Ok, break down and add some cast<>'ing helper methods to the Type class to reduce the number of cast<>'s we have. This allows someone to use things like Ty->getVectorNumElements() instead of cast<VectorType>(Ty)->getNumElements() when you know that a type is a vector. It would be a great general cleanup to move the codebase to use these, I will do so in the code I'm touching. llvm-svn: 148999	2012-01-26 00:01:10 +00:00
Chris Lattner	1dcb654311	add some helper methods to ShuffleVectorInst and enhance its "isValidOperands" and "getMaskValue" methods to allow ConstantDataSequential. llvm-svn: 148998	2012-01-25 23:49:49 +00:00
Jakob Stoklund Olesen	4864a81aa3	Improve sub-register def handling in ProcessImplicitDefs. This boils down to using MachineOperand::readsReg() more. This fixes PR11829 where a use ended up after the first def when lowering REG_SEQUENCE instructions involving IMPLICIT_DEFs. llvm-svn: 148996	2012-01-25 23:36:27 +00:00
Anton Korobeynikov	7722a2d4e3	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00
Lang Hames	f1508b78f9	Don't add live ranges for aliases of physregs that are live in to the function. They don't appear to be used, and are inconsistent with handling of other physreg intervals (i.e. intervals that are not live-in) where ranges are not inserted for aliases. llvm-svn: 148986	2012-01-25 22:11:06 +00:00
Jim Grosbach	65e2465550	Tidy up. s/Low Level Virtual Machine/LLVM/. LLVM isn't an acronym anymore. llvm-svn: 148985	2012-01-25 22:00:23 +00:00
Lang Hames	19feb5f241	Always break upon finding a vreg operand (in Release as well as +Asserts). Remove assertion which can no longer trigger. llvm-svn: 148984	2012-01-25 21:53:23 +00:00
Jim Grosbach	82f76d1275	ARM assembly parsing and validation of IT instruction. "Although a Thumb2 instruction, the IT mnemonic shall be permitted in ARM mode, and the condition verified to match the condition code(s) on the following instruction(s)." PR11853 llvm-svn: 148969	2012-01-25 19:52:01 +00:00
Nick Lewycky	0e496cddf0	Use precomputed BB size instead of BB->size(). llvm-svn: 148964	2012-01-25 18:54:13 +00:00
Chris Lattner	33633a90a0	fix a bug I introduced in r148929, this is not a splat! Thanks to Eli for noticing. llvm-svn: 148947	2012-01-25 09:56:22 +00:00
Nick Lewycky	3c3feaf40c	Gracefully degrade precision in branch probability numbers. llvm-svn: 148946	2012-01-25 09:43:14 +00:00
Nick Lewycky	70d50ee8fb	Support pointer comparisons against constants, when looking at the inline-cost savings from a pointer argument becoming an alloca. Sometimes callees will even compare a pointer to null and then branch to an otherwise unreachable block! Detect these cases and compute the number of saved instructions, instead of bailing out and reporting no savings. llvm-svn: 148941	2012-01-25 08:27:40 +00:00
Chris Lattner	6705883ad8	use Constant::getAggregateElement to simplify a bunch of code. llvm-svn: 148934	2012-01-25 06:48:06 +00:00
Craig Topper	7834900950	Custom lower PSIGN and PSHUFB intrinsics to their corresponding target specific nodes so we can remove the isel patterns. llvm-svn: 148933	2012-01-25 06:43:11 +00:00
Chris Lattner	7e683d10a8	constify some methods and add a new Constant::getAggregateElement helper method for the common operation of extracting an element out of a constant aggregate. llvm-svn: 148931	2012-01-25 06:16:32 +00:00
Chris Lattner	47a86bdbe2	use ConstantVector::getSplat in a few places. llvm-svn: 148929	2012-01-25 06:02:56 +00:00
Craig Topper	ce4f9c5668	Custom lower phadd and phsub intrinsics to target specific nodes. Remove the patterns that are no longer necessary. llvm-svn: 148927	2012-01-25 05:37:32 +00:00
Chris Lattner	e9eed29b5b	reapply r148901 with a crucial fix. "Introduce a new ConstantVector::getSplat constructor function to simplify a really common case." llvm-svn: 148924	2012-01-25 05:19:54 +00:00
Craig Topper	5bcf070e68	Remove AVX 256-bit unaligned load intrinsics. 128-bit versions had been removed a while ago. llvm-svn: 148922	2012-01-25 04:42:03 +00:00
Akira Hatanaka	012f041bce	Mark 64-bit register RA_64 unused too. llvm-svn: 148918	2012-01-25 04:19:22 +00:00
Akira Hatanaka	01d3c42f90	Modify MipsFrameLowering::emitPrologue and emitEpilogue. - Use MipsAnalyzeImmediate to expand immediates that do not fit in 16-bit. - Change the types of variables so that they are sufficiently large to handle 64-bit pointers. - Emit instructions to set register $28 in a function prologue after instructions which store callee-saved registers have been emitted. llvm-svn: 148917	2012-01-25 04:12:04 +00:00
Akira Hatanaka	d1d4b3efcf	Modify MipsRegisterInfo::eliminateFrameIndex to use MipsAnalyzeImmediate to expand offsets that do not fit in the 16-bit immediate field of load and store instructions. Also change the types of variables so that they are sufficiently large to handle 64-bit pointers. llvm-svn: 148916	2012-01-25 03:55:10 +00:00
Craig Topper	3ad5bc019a	Merge intrinsic pattern and no pattern versions of VCVTSD2SI instruction definitions. Matches non-AVX version of same instructions. llvm-svn: 148914	2012-01-25 03:52:09 +00:00
NAKAMURA Takumi	6c421ea484	MipsAnalyzeImmediate.h: Fix to add DataTypes.h for msvc. inttypes.h is not supplied in msvc. llvm-svn: 148912	2012-01-25 03:34:41 +00:00
Nick Lewycky	ff50962534	Fix assert("msg"). Fix unused-variable warnings complaining about VT used only in asserts. llvm-svn: 148910	2012-01-25 03:20:12 +00:00
NAKAMURA Takumi	96a21dcea3	Target/Mips: Unbreak CMake build. llvm-svn: 148909	2012-01-25 03:15:46 +00:00
Akira Hatanaka	86d5fadd57	Lower 64-bit immediates using MipsAnalyzeImmediate that has just been added. Add a test case to show fewer instructions are needed to load an immediate with the new way of loading immediates. llvm-svn: 148908	2012-01-25 03:01:35 +00:00
Argyrios Kyrtzidis	939b7a0b7c	Revert r148901 because it crashes llvm tests. Original log: Introduce a new ConstantVector::getSplat constructor function to simplify a really common case. llvm-svn: 148906	2012-01-25 02:42:41 +00:00
Chris Lattner	9fe7dd872b	Introduce a new ConstantVector::getSplat constructor function to simplify a really common case. llvm-svn: 148901	2012-01-25 01:53:58 +00:00
Akira Hatanaka	ff36fd3de3	Add class MipsAnalyzeImmediate which comes up with an instruction sequence to load an immediate. llvm-svn: 148900	2012-01-25 01:43:36 +00:00
Chris Lattner	8a3df5495a	Remove the Type::getNumElements() method, which is only called in 4 places, did something extremely surprising, and shadowed actually useful implementations that had completely different behavior. llvm-svn: 148898	2012-01-25 01:32:59 +00:00
Chris Lattner	9be59599b3	Use the right method to get the # elements in a CDS. llvm-svn: 148897	2012-01-25 01:27:20 +00:00
Jim Grosbach	086cbfac7d	NEON VLD4(all lanes) assembly parsing and encoding. llvm-svn: 148884	2012-01-25 00:01:08 +00:00
Jim Grosbach	ccb6d55dae	Tidy up. Rename VLD4DUP patterns for consistency. llvm-svn: 148883	2012-01-24 23:47:07 +00:00
Jim Grosbach	b78403ce48	NEON VLD3(all lanes) assembly parsing and encoding. llvm-svn: 148882	2012-01-24 23:47:04 +00:00
Jakob Stoklund Olesen	1b8e437ab6	Set correct <def,undef> flags when lowering REG_SEQUENCE. A REG_SEQUENCE instruction is lowered into a sequence of partial defs: %vreg7:ssub_0<def,undef> = COPY %vreg20:ssub_0 %vreg7:ssub_1<def> = COPY %vreg2 %vreg7:ssub_2<def> = COPY %vreg2 %vreg7:ssub_3<def> = COPY %vreg2 The first def needs an <undef> flag to indicate it is the beginning of the live range, while the other defs are read-modify-write. Previously, we depended on LiveIntervalAnalysis to notice and fix the missing <def,undef>, but that solution was never robust, it was causing problems with ProcessImplicitDefs and the lowering of chained REG_SEQUENCE instructions. This fixes PR11841. llvm-svn: 148879	2012-01-24 23:28:42 +00:00
Jakob Stoklund Olesen	66ef9ad33f	Use the standard MachineFunction::print() after SlotIndexes. llvm-svn: 148878	2012-01-24 23:28:38 +00:00
Akira Hatanaka	d7970f9e4b	Sign-extend 32-bit integer arguments when they are passed in 64-bit registers, which is what N32/64 does. llvm-svn: 148875	2012-01-24 23:18:43 +00:00
Akira Hatanaka	7e6c195c11	Pass CCState by reference. llvm-svn: 148871	2012-01-24 22:07:36 +00:00
Akira Hatanaka	77dbd786c8	Pattern for f32 to i64 conversion. llvm-svn: 148869	2012-01-24 22:05:25 +00:00
Jim Grosbach	35bc8f9159	ARM Darwin symbol ref differences w/o subsection-via-symbols. When not using subsections via symbols, the assembler can resolve symbol differences (including pcrel references) to non-local labels at assembly time, not just those in the same atom. llvm-svn: 148865	2012-01-24 21:45:25 +00:00
Devang Patel	a410ed3ced	Intel Syntax: Extend special hand coded logic, to recognize special instructions, for intel syntax. llvm-svn: 148864	2012-01-24 21:43:36 +00:00
Akira Hatanaka	9f7ec1538f	64-bit sign extension in register instructions. llvm-svn: 148862	2012-01-24 21:41:09 +00:00
Matt Beaumont-Gay	88297ef665	Sink assert-only variables into the asserts llvm-svn: 148849	2012-01-24 19:43:30 +00:00
Kostya Serebryany	c11d1dd133	[asan] enable asan only for the functions that have Attribute::AddressSafety llvm-svn: 148846	2012-01-24 19:34:43 +00:00
Jim Grosbach	8e2722cdb0	NEON VST4(one lane) assembly parsing and encoding. llvm-svn: 148836	2012-01-24 18:53:13 +00:00
Owen Anderson	d845d9d9e9	Widen the instruction encoder that TblGen emits to 64 bits, which should accommodate every target I can think of offhand. llvm-svn: 148833	2012-01-24 18:37:29 +00:00
Jim Grosbach	14952a0e32	NEON VLD4(one lane) assembly parsing and encoding. llvm-svn: 148832	2012-01-24 18:37:25 +00:00
Jakob Stoklund Olesen	5b9deabae8	Fix old doxygen comment. llvm-svn: 148825	2012-01-24 18:09:18 +00:00
Jim Grosbach	3cfef8d467	NEON Two-operand assembly aliases for VSRA. llvm-svn: 148821	2012-01-24 17:55:36 +00:00
Jim Grosbach	7ae12cc546	NEON Two-operand assembly aliases for VSLI. llvm-svn: 148819	2012-01-24 17:49:15 +00:00
Jim Grosbach	7b6f0f67aa	NEON Two-operand assembly aliases for VSRI. llvm-svn: 148818	2012-01-24 17:46:58 +00:00
Jim Grosbach	681db34eae	NEON add correct predicates for some asm aliases. llvm-svn: 148815	2012-01-24 17:23:29 +00:00
Chris Lattner	a0d01ff567	basic instcombine support for CDS. llvm-svn: 148806	2012-01-24 14:31:22 +00:00
Chris Lattner	139822fc83	C++, CBE, and TLOF support for ConstantDataSequential llvm-svn: 148805	2012-01-24 14:17:05 +00:00
Chris Lattner	2068393140	Rearrange argument order of ::get methods so that LLVMContext comes first, add a ConstantDataArray::getString method that corresponds to the (to be removed) StringRef version of ConstantArray::get, but is dramatically more efficient. llvm-svn: 148804	2012-01-24 14:04:40 +00:00
Elena Demikhovsky	0b0c5d8c4c	ZERO_EXTEND operation is optimized for AVX. v8i16 -> v8i32, v4i32 -> v4i64 - used vpunpck* instructions. llvm-svn: 148803	2012-01-24 13:54:13 +00:00
Chris Lattner	00245f420a	add more support for ConstantDataSequential llvm-svn: 148802	2012-01-24 13:41:11 +00:00
Evgeniy Stepanov	33a7e2f2a1	An option to selectively enable part of ARM EHABI support. This change adds a new option --arm-enable-ehabi-descriptors that enables emitting unwinding descriptors. This provides a mode with a working backtrace() without the (currently broken) exception support. llvm-svn: 148800	2012-01-24 13:05:33 +00:00
Benjamin Kramer	8aefffca4c	Bit pack DIE structures better. 16 bits are sufficient to store attributes, tags and forms. llvm-svn: 148799	2012-01-24 12:08:28 +00:00
Eric Christopher	3e8ccc2000	Remove generation of DW_AT_sibling. Nothing as far as I can tell uses it. Saves about 1.5% on debug info size. rdar://10278198 llvm-svn: 148794	2012-01-24 09:43:28 +00:00
Chris Lattner	5d4497bf4a	Add AsmPrinter (aka MCLowering) support for ConstantDataSequential, and clean up some other misc stuff. Unlike ConstantArray, we will prefer to emit .fill directives for "String" arrays that all have the same value, since they are denser than emitting a .ascii llvm-svn: 148793	2012-01-24 09:31:43 +00:00
Chris Lattner	5dd4d87ce0	Add various "string" methods to ConstantDataSequential, which have the same semantics as ConstantArray's but much more efficient because they don't have to return std::string's. The ConstantArray methods will eventually be removed. llvm-svn: 148792	2012-01-24 09:01:07 +00:00
Chris Lattner	f7eb543380	teach valuetracking about ConstantDataSequential llvm-svn: 148790	2012-01-24 07:54:10 +00:00
Chris Lattner	e166a8548f	switch SCEV to use the new ConstantFoldLoadThroughGEPIndices function instead of its own hard coded thing, allowing it to handle ConstantDataSequential and fixing some obscure bugs (e.g. it would previously crash on a CAZ of vector type). llvm-svn: 148788	2012-01-24 05:49:24 +00:00
Chris Lattner	f488b35826	Split the interesting bits of ConstantFoldLoadThroughGEPConstantExpr out into a new ConstantFoldLoadThroughGEPIndices (more useful) function and rewrite it to be simpler, more efficient, and to handle the new ConstantDataSequential type. Enhance ConstantFoldLoadFromConstPtr to handle ConstantDataSequential. llvm-svn: 148786	2012-01-24 05:43:50 +00:00
Chris Lattner	030af79b14	Add some accessor methods to CAZ and UndefValue that help simplify clients. Make some CDS methods public. llvm-svn: 148785	2012-01-24 05:42:11 +00:00
Anton Korobeynikov	3cad0c21ed	Use correct register class for am2offset register operands. This pacifies machine verifier llvm-svn: 148782	2012-01-24 04:58:56 +00:00
Jakob Stoklund Olesen	c46534a0cd	Preserve <def,undef> flags in CoalesceExtSubRegs. This won't have an effect until EliminateRegSequences() starts setting the undef flags. llvm-svn: 148779	2012-01-24 04:44:01 +00:00
Chris Lattner	e4f3f102c2	implement the ConstantDataSequential accessor methods. No need for 'getOperand' :) llvm-svn: 148778	2012-01-24 04:43:41 +00:00
Craig Topper	0d8e67aebd	Add comments near load pattern fragments indicating that all integer vector loads are promoted to v2i64 or v4i64 so that no one tries to reintroduce pattern fragments for other types. llvm-svn: 148771	2012-01-24 03:03:17 +00:00
Jim Grosbach	da70eac268	NEON VST4(multiple 4 element structures) assembly parsing. llvm-svn: 148764	2012-01-24 00:58:13 +00:00
Jim Grosbach	ed561fc850	NEON VLD4(multiple 4 element structures) assembly parsing. llvm-svn: 148762	2012-01-24 00:43:17 +00:00
Jim Grosbach	1e946a4f91	Tidy up. Remove some vertical space for readability. llvm-svn: 148761	2012-01-24 00:43:12 +00:00
Chandler Carruth	ed975232bc	Revert r148686 (and r148694, a fix to it) due to a serious layering violation -- MC cannot depend on CodeGen. Specifically, the MCTargetDesc component of each target is actually a subcomponent of the MC library. As such, it cannot depend on the target-independent code generator, because MC itself cannot depend on the target-independent code generator. This change moved a flag from the ARM MCTargetDesc file ARMMCAsmInfo.cpp to the CodeGen layer in ARMException.cpp, leaving behind an 'extern' to refer back to it. That layering order isn't viable given the constraints outlined above. Commandline flags are designed to be static specifically to avoid these types of bugs. Fixing this is likely going to require some non-trivial refactoring. llvm-svn: 148759	2012-01-24 00:30:17 +00:00
Jim Grosbach	17bacab475	Fix typo. llvm-svn: 148757	2012-01-24 00:12:39 +00:00
Jim Grosbach	d3d36d9315	NEON VST3(single element from one lane) assembly parsing. llvm-svn: 148755	2012-01-24 00:07:41 +00:00
Devang Patel	eba7d3dba9	Fix typo. llvm-svn: 148751	2012-01-23 23:56:33 +00:00
Jim Grosbach	1a74724fc9	NEON VST3(multiple 3-element structures) assembly parsing. llvm-svn: 148748	2012-01-23 23:45:44 +00:00
Jim Grosbach	ac2af3ffab	NEON VLD3(multiple 3-element structures) assembly parsing. llvm-svn: 148745	2012-01-23 23:20:46 +00:00
Anton Korobeynikov	820417af07	Add missed mayStore flag to STREXD / t2STREXD llvm-svn: 148742	2012-01-23 22:57:52 +00:00
Chris Lattner	3756b91313	start the implementation of a new ConstantDataVector and ConstantDataArray classes, per PR1324. Not all of their helper functions are implemented, nothing creates them, and the rest of the compiler doesn't handle them yet. llvm-svn: 148741	2012-01-23 22:57:10 +00:00
Bill Wendling	11eeeff24f	Remove extraneous ';'s. llvm-svn: 148740	2012-01-23 22:55:02 +00:00
David Blaikie	d3303ded75	Remove dead default. llvm-svn: 148738	2012-01-23 22:37:11 +00:00
Devang Patel	cf893a437e	Intel syntax: Robustify parsing of memory operand's displacement expression. llvm-svn: 148737	2012-01-23 22:35:25 +00:00
Jim Grosbach	a8b444b08b	NEON VLD3 lane-indexed assembly parsing and encoding. llvm-svn: 148734	2012-01-23 21:53:26 +00:00
Rafael Espindola	3c47e37387	Add support for .cfi_signal_frame. Fixes pr11762. llvm-svn: 148733	2012-01-23 21:51:52 +00:00
Lang Hames	2f6377cafe	copyImplicitOps is redundant here - the loop above already copies these ops. llvm-svn: 148725	2012-01-23 21:15:01 +00:00
Jakob Stoklund Olesen	20948fab69	Fix PR11829. PostRA LICM was too aggressive. This fixes a typo in r148589. llvm-svn: 148724	2012-01-23 21:01:15 +00:00
Jakob Stoklund Olesen	9082353e3b	Simplify debug output. llvm-svn: 148723	2012-01-23 21:01:11 +00:00
Devang Patel	e660fdd953	Intel syntax: Parse memory operand with empty base reg, e.g. DWORD PTR [4*RDI] llvm-svn: 148721	2012-01-23 20:20:06 +00:00
Jim Grosbach	d28ef9ac46	Simplify some NEON assembly pseudo definitions. Let the generic token alias definitions handle the data subtype suffixes. We don't need explicit versions for each. llvm-svn: 148718	2012-01-23 19:39:08 +00:00
Matt Beaumont-Gay	54db64e2e4	Silence warnings in -asserts build llvm-svn: 148715	2012-01-23 18:46:04 +00:00
Devang Patel	880bc1644b	Intel syntax: Parse segment registers. llvm-svn: 148712	2012-01-23 18:31:58 +00:00
Chris Lattner	c7f9fd4da8	convert CAZ, UndefValue, and CPN to use DenseMap's again, this time without using OwningPtr. OwningPtr would barf when the densemap had to reallocate, which doesn't appear to happen on the regression test suite, but obviously happens in real life :) llvm-svn: 148700	2012-01-23 15:20:12 +00:00
Chris Lattner	962c272f95	revert r148691 and 148693 llvm-svn: 148698	2012-01-23 15:09:44 +00:00
Alexander Potapenko	c94cf8faf6	Implemented AddressSanitizer::getPassName() llvm-svn: 148697	2012-01-23 11:22:43 +00:00
NAKAMURA Takumi	28ea8f523b	ARMAsmPrinter.cpp: Try to fix up r148686. EnableARMEHABI was also here. llvm-svn: 148694	2012-01-23 09:14:42 +00:00
Chris Lattner	4494e1ae25	switch UndefValue and ConstantPointerNull over to DenseMap's for uniquing. llvm-svn: 148693	2012-01-23 08:52:32 +00:00
Chris Lattner	1910c9c3a0	Replace a use of ConstantUniqueMap for CAZ constants with a simple DenseMap. Now that the type system rewrite has landed, there is no need for its complexity and std::map'ness. llvm-svn: 148691	2012-01-23 08:42:38 +00:00
Craig Topper	edd1d0acfc	Custom lower PCMPEQ/PCMPGT intrinsics to target specific nodes and remove the intrinsic patterns. llvm-svn: 148687	2012-01-23 08:18:28 +00:00
Evgeniy Stepanov	482cdc4ebd	An option to selectively enable parts of ARM EHABI support. This change adds a new value to the --arm-enable-ehabi option that disables emitting unwinding descriptors. This mode gives a working backtrace() without the (currently broken) exception support. llvm-svn: 148686	2012-01-23 07:57:39 +00:00
Craig Topper	6b90c5d03e	Update more places to use target specific nodes for vector shifts instead of intrinsics. llvm-svn: 148685	2012-01-23 06:46:22 +00:00
Craig Topper	5e80db4e4f	Custom lower vector shift intrinsics to target specific nodes and remove the patterns that are no longer needed. llvm-svn: 148684	2012-01-23 06:16:53 +00:00
Rafael Espindola	624e30894a	Avoid using an invalidated iterator. llvm-svn: 148681	2012-01-23 05:07:16 +00:00
Rafael Espindola	abf456e320	The iteration order over a std::set<Module*> depends on the addresses of the modules. Avoid that to make the order the linker sees the modules deterministic. llvm-svn: 148676	2012-01-23 03:41:53 +00:00
Craig Topper	20c98df340	Remove pattern fragments for v32i8, v16i16, v8i32, v16i8, v8i16, and v4i32 loads. All integer vector loads are promoted to v2i64 or v4i64 so these pattern fragments can never match. Fix or remove patterns that used these fragments. llvm-svn: 148672	2012-01-23 00:06:44 +00:00
Nick Lewycky	c31ceda7d9	Make Value::isDereferenceablePointer() handle unreachable code blocks. (This returns false in the event the computation feeding into the pointer is unreachable, which maybe ought to be true -- but this is at least consistent with undef->isDereferenceablePointer().) Fixes PR11825! llvm-svn: 148671	2012-01-23 00:05:17 +00:00
Craig Topper	0b7ad76bd0	Combine X86 CMPPD and CMPPS node types. Simplifies selection code and pattern matching. llvm-svn: 148670	2012-01-22 23:36:02 +00:00
Craig Topper	bd4884371b	Merge PCMPEQB/PCMPEQW/PCMPEQD/PCMPEQQ and PCMPGTB/PCMPGTW/PCMPGTD/PCMPGTQ X86 ISD node types into only two node types. Simplifying opcode selection and pattern matching. llvm-svn: 148667	2012-01-22 22:42:16 +00:00
Nicolas Geoffray	e197d943f3	Use Attributes::None instead of 0 after r148553 change on Attributes from unsigned to their own class. llvm-svn: 148665	2012-01-22 20:05:26 +00:00
Craig Topper	094626414d	Add target specific ISD node types for SSE/AVX vector shuffle instructions and change all the code that used to create intrinsic nodes to create the new nodes instead. llvm-svn: 148664	2012-01-22 19:15:14 +00:00
Anton Korobeynikov	0251f20ad1	Add an option to disable buggy copy propagation pass llvm-svn: 148662	2012-01-22 14:08:34 +00:00
Anton Korobeynikov	5482b9f535	Add fused multiply+add instructions from VFPv4. Patch by Ana Pazos! llvm-svn: 148658	2012-01-22 12:07:33 +00:00
Eli Bendersky	bff72ba77e	Remove trailing spaces llvm-svn: 148654	2012-01-22 09:02:48 +00:00
Eli Bendersky	c3c80f0986	Basic runtime dynamic loading capabilities added to ELFObjectFile, implemented in a subclass named DyldELFObject. This class supports rebasing the object file it represents by re-mapping section addresses to the actual memory addresses the object was placed in. This is required for MC-JIT implementation on ELF with debugging support. Patch reviewed on llvm-commits. Developed together with Ashok Thirumurthi and Andrew Kaylor. llvm-svn: 148653	2012-01-22 09:01:03 +00:00
Eli Bendersky	058d647adf	Split the lib/ExecutionEngine/RuntimeDyld/RuntimeDyldImpl.h header to smaller logical headers. ELF and MachO implementations of RuntimeDyldImpl go into their own header files now. Reviewed on llvm-commits llvm-svn: 148652	2012-01-22 07:05:02 +00:00
Craig Topper	a4ed5246d8	Make code a little less verbose. llvm-svn: 148651	2012-01-22 03:07:48 +00:00
Craig Topper	cb3433cd58	Remove unused X86 ISD node type defines. llvm-svn: 148644	2012-01-22 01:15:56 +00:00
Craig Topper	123adfa0f3	Move some vector shift patterns into their instruction definitions. llvm-svn: 148643	2012-01-22 00:41:20 +00:00
Craig Topper	dcaa5fbd08	Add memory patterns for some of the fp<->integer conversion instructions. Fold some patterns into instruction definitions. llvm-svn: 148641	2012-01-21 18:37:15 +00:00
Benjamin Kramer	5cff13a3fb	Remove unused variables. llvm-svn: 148635	2012-01-21 10:42:44 +00:00
Craig Topper	39bc1e4d25	Fix PR11819 introduced by r148537. I'd commit the test case, but the generated code is terrible as it gets fully scalarized. Expect a future commit to fix that. llvm-svn: 148632	2012-01-21 08:49:33 +00:00
Evan Cheng	64a2beca52	Fix an obvious typo. llvm-svn: 148622	2012-01-21 03:31:03 +00:00
Jakob Stoklund Olesen	8e3bb315d8	Handle register masks in LiveVariables. A register mask operand kills any live physreg that isn't preserved. Unlike an implicit-def operand, the clobbered physregs are never live afterwards. This means LiveVariables has to track a much smaller number of live physregs, and it should spend much less time in addRegisterDead(). llvm-svn: 148609	2012-01-21 00:58:53 +00:00
Jim Grosbach	8c592f13e3	RuntimeDyld alignment adjustment from MachO file. The MachO file stores section alignment as log2(alignment-in-bytes). The allocation routines want the raw alignment-in-bytes value, so adjust for that. llvm-svn: 148604	2012-01-21 00:21:53 +00:00
Jim Grosbach	78dcaed8ca	Thumb2 'add rd, pc, imm' alternate form for 'adr' instruction. llvm-svn: 148601	2012-01-21 00:07:56 +00:00
Jakob Stoklund Olesen	52ee45d64a	Delete an unused member variable. llvm-svn: 148594	2012-01-20 22:48:59 +00:00
Jim Grosbach	1dc4a77a23	Fix inverted condition. llvm-svn: 148593	2012-01-20 22:44:03 +00:00
Devang Patel	ce6a2ca8c8	Intel syntax: Robustify register parsing. llvm-svn: 148591	2012-01-20 22:32:05 +00:00
Jakob Stoklund Olesen	6b17ef58a1	Support register masks in MachineLICM. Only PostRA LICM is affected. llvm-svn: 148589	2012-01-20 22:27:12 +00:00
Jakob Stoklund Olesen	58614f2f5a	Handle register masks in DeadMachineInstructionElim. Don't track live physregs that are clobbered by a register mask operand. llvm-svn: 148588	2012-01-20 22:27:09 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Andrew Trick	b9c822ab0b	Handle a corner case with IV chain collection with bailout instead of assert. Fixes PR11783: bad cast to AddRecExpr. llvm-svn: 148572	2012-01-20 21:23:40 +00:00
Devang Patel	d0930fff85	Intel syntax: Parse ... PTR [-8] llvm-svn: 148570	2012-01-20 21:21:01 +00:00
Devang Patel	f36613cb45	Intel syntax: For now, disable ambiguous JMP64pcrel32 for intel syntax. llvm-svn: 148569	2012-01-20 21:14:06 +00:00
Bob Wilson	6c7aaec077	ARM vector any_extends need to be selected to vmovl. <rdar://problem/10723651> We have patterns for vector sext and zext operations but were missing anyext. Without those patterns, codegen will fail when the selection DAG has any_extend nodes. llvm-svn: 148568	2012-01-20 20:59:56 +00:00
Jim Grosbach	91f5a3f253	TblGen diagnostic for mismatched template instantiation. Providing a template argument to a non-templatized class was crashing tblgen. Add a diagnostic. For example, $ cat bug.td class A; def B : A<0> { } $ llvm-tblgen bug.td bug.td:3:11: error: template argument provided to non-template class def B : A<0> { ^ llvm-svn: 148565	2012-01-20 20:02:39 +00:00
Jim Grosbach	90f5780fc1	VST2 four-register w/ update pseudos for fixed/register update. rdar://10724489 llvm-svn: 148560	2012-01-20 19:16:00 +00:00
Jim Grosbach	a9d36fbca7	NEON use vmov.i32 to splat some f32 values into vectors. For bit patterns that aren't representable using the 8-bit floating point representation for vmov.f32, but are representable via vmov.i32, treat the .f32 syntax as an alias. Most importantly, this covers the case 'vmov.f32 Vd, #0.0'. rdar://10616677 llvm-svn: 148556	2012-01-20 18:09:51 +00:00
Kostya Serebryany	a5054ad2f3	Extend Attributes to 64 bits Problem: LLVM needs more function attributes than currently available (32 bits). One such proposed attribute is "address_safety", which shows that a function is being checked for address safety (by AddressSanitizer, SAFECode, etc). Solution: - extend the Attributes from 32 bits to 64-bits - wrap the object into a class so that unsigned is never erroneously used instead - change "unsigned" to "Attributes" throughout the code, including one place in clang. - the class has no "operator uint64 ()", but it has "uint64_t Raw() " to support packing/unpacking. - the class has "safe operator bool()" to support the common idiom: if (Attributes attr = getAttrs()) useAttrs(attr); - The CTOR from uint64_t is marked explicit, so I had to add a few explicit CTOR calls - Add the new attribute "address_safety". Doing it in the same commit to check that attributes beyond first 32 bits actually work. - Some of the functions from the Attribute namespace are worth moving inside the class, but I'd prefer to have it as a separate commit. Tested: "make check" on Linux (32-bit and 64-bit) and Mac (10.6) built/run spec CPU 2006 on Linux with clang -O2. This change will break clang build in lib/CodeGen/CGCall.cpp. The following patch will fix it. llvm-svn: 148553	2012-01-20 17:56:17 +00:00
Benjamin Kramer	d3309a3434	Add missing breaks to switch. Found by the clang static analyzer. llvm-svn: 148543	2012-01-20 14:42:37 +00:00
Benjamin Kramer	084b9f4134	Remove a bunch of unused variable assignments. Found by the clang static analyzer. llvm-svn: 148541	2012-01-20 14:42:32 +00:00
Benjamin Kramer	fe4848b55d	Remove obviously invalid early exit that prevented analyzing ConstantAggregateZeros. Found by the clang static analyzer. llvm-svn: 148540	2012-01-20 14:42:25 +00:00
Craig Topper	a409479023	Improve 256-bit shuffle splitting to allow 2 sources in each 128-bit lane. As long as only a single lane of the source is used in the lane in the destination. This makes the splitting match much closer to what happens with 256-bit shuffles when AVX is disabled and only 128-bit XMM is allowed. llvm-svn: 148537	2012-01-20 09:29:03 +00:00
Nick Lewycky	e8415fea4b	Fix CountCodeReductionForAlloca to more accurately represent what SROA can and can't handle. Also don't produce non-zero results for things which won't be transformed by SROA at all just because we saw the loads/stores before we saw the use of the address. llvm-svn: 148536	2012-01-20 08:35:20 +00:00
Andrew Trick	c908b43d9f	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no circumstance may SCEVExpander insert below this point. - LSR is responsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Craig Topper	3469212c82	Add support for selecting 256-bit PALIGNR. llvm-svn: 148532	2012-01-20 05:53:00 +00:00
Bill Wendling	9249261858	When lowering the 'resume' instruction, look to see if we can eliminate the 'insertvalue' instructions that recreate the structure returned by the 'landingpad' instruction. Because the 'insertvalue' instruction isn't supported by FastISel, this can save a bit of time during -O0 compilation. llvm-svn: 148520	2012-01-20 00:53:28 +00:00
Eli Friedman	32c7c25dcb	Support MSVC x86-32 sret convention. PR11688. Patch by Joe Groff. llvm-svn: 148513	2012-01-20 00:05:46 +00:00
Benjamin Kramer	116e99a469	Silence warnings about mixing enums. llvm-svn: 148495	2012-01-19 21:11:13 +00:00
Owen Anderson	4b53e188c1	Add a dump() implementation for sub-instruction MCOperands. llvm-svn: 148493	2012-01-19 19:32:20 +00:00
Dan Gohman	8ee108bf98	Set the "tail" flag on pattern-matched objc_storeStrong calls. rdar://10531041. llvm-svn: 148490	2012-01-19 19:14:36 +00:00
Devang Patel	f83dcfd052	Post process 'and', 'sub' instructions and select better encoding, if available. llvm-svn: 148489	2012-01-19 18:40:55 +00:00
Nick Lewycky	219e6bcb71	Actually, this code handles wrapped sets just fine. Noticed by inspection. llvm-svn: 148487	2012-01-19 18:19:42 +00:00
Devang Patel	2529dd9e00	Intel syntax: There is no need to create unary expr for simple negative displacement. llvm-svn: 148486	2012-01-19 18:15:51 +00:00
Devang Patel	4a62ff9bcb	Post process 'xor', 'or' and 'cmp' instructions and select better encoding, if available. llvm-svn: 148485	2012-01-19 17:53:25 +00:00
Evgeniy Stepanov	4c7eb477b5	Emit ARM EHABI unwinding instructions for 3 more Thumb instructions. llvm-svn: 148473	2012-01-19 12:53:06 +00:00
Craig Topper	a875b7ccc7	Folding table additions and fixes for AVX. llvm-svn: 148467	2012-01-19 08:50:38 +00:00
Craig Topper	80576e8d1f	Merge 128-bit and 256-bit SHUFPS/SHUFPD handling. llvm-svn: 148466	2012-01-19 08:19:12 +00:00
Evan Cheng	c2679b2958	More bundle related API additions. llvm-svn: 148465	2012-01-19 07:47:03 +00:00
Evan Cheng	d42aba53e6	Rewriter should definitely rewrite instructions inside bundles. llvm-svn: 148464	2012-01-19 07:46:36 +00:00
Evan Cheng	6ca2272183	Enhance finalizeBundle to return end of bundle iterator because it makes sense. llvm-svn: 148462	2012-01-19 06:13:10 +00:00
Jim Grosbach	235c8d2d94	ARM assembly diagnostic caret in better position for FPImm. llvm-svn: 148459	2012-01-19 02:47:30 +00:00
Jim Grosbach	44e5c39c29	Thumb2 relaxation for tADR to t2ADR. llvm-svn: 148456	2012-01-19 02:09:38 +00:00
Jim Grosbach	b008df40d3	Add comment and fix range check in condition. llvm-svn: 148455	2012-01-19 01:50:30 +00:00
Evan Cheng	2879467d4e	- Slight change to finalizeBundle() interface. LastMI is now exclusive (pointing to the instruction right after the last instruction in the bundle). - Add a finalizeBundle() variant that doesn't specify LastMI. Instead, the code will find the last instruction in the bundle by following the 'InsideBundle' marker. This is useful in case bundles are formed early (i.e. during MI scheduling) but finalized later (i.e. after register allocator has finished rewriting virtual registers with physical registers). llvm-svn: 148444	2012-01-19 00:46:06 +00:00
Nick Lewycky	ecc0084f72	Add a TargetOption for disabling tail calls. llvm-svn: 148442	2012-01-19 00:34:10 +00:00
Evan Cheng	1eb2bb2295	Rename Finalizebundle to finalizeBundle to conform to coding guideline. llvm-svn: 148440	2012-01-19 00:06:10 +00:00
Jakob Stoklund Olesen	ff482f733b	Add experimental -x86-use-regmask command line option. It adds register mask operands to x86 call instructions. Once all the backend passes support register mask operands, this will be permanently enabled. llvm-svn: 148438	2012-01-18 23:52:22 +00:00
Jakob Stoklund Olesen	f1fb1d2375	Ignore register mask operands when lowering instructions to MC. This is similar to implicit register operands. MC doesn't understand register liveness and call clobbers. llvm-svn: 148437	2012-01-18 23:52:19 +00:00
Jakob Stoklund Olesen	9349351d72	Add a RegisterMaskSDNode class. This SelectionDAG node will be attached to call nodes by LowerCall(), and eventually becomes a MO_RegisterMask MachineOperand on the MachineInstr representing the call instruction. LowerCall() will attach a register mask that depends on the calling convention. llvm-svn: 148436	2012-01-18 23:52:12 +00:00
Rafael Espindola	f5e78fa8d1	Add support for the gnueabihf environment. Patch by Sylvestre Ledru. llvm-svn: 148434	2012-01-18 23:35:29 +00:00
Jim Grosbach	94298a906a	Thumb2 alternate syntax for LDR(literal) and friends. Explicit pc-relative syntax. For example, "ldrb r2, [pc, #-22]". rdar://10250964 llvm-svn: 148432	2012-01-18 22:46:46 +00:00
Devang Patel	de47cced25	Process instructions after match to select alternative encoding which may be more desirable. llvm-svn: 148431	2012-01-18 22:42:29 +00:00
Jim Grosbach	cbd3f27354	Replace FIXME with explanatory comment. llvm-svn: 148427	2012-01-18 22:04:42 +00:00
Jim Grosbach	cb80eb2e75	Thumb2 relaxation for LDR(literal). If the fixup is out of range for the Thumb1 instruction, relax it to the Thumb2 encoding instead. rdar://10711829 llvm-svn: 148424	2012-01-18 21:54:16 +00:00
Jim Grosbach	d4dbd09d85	MCAssembler tweak for determining when a symbol difference is resolved. If the two fragments are in the same Atom, then the difference expression is resolvable at compile time. Previously we were checking that they were in the same fragment, but that breaks down in the presence of instruction relaxation which has multiple fragments in the same atom. rdar://10711829 llvm-svn: 148423	2012-01-18 21:54:12 +00:00
Jim Grosbach	9ab3d8be4e	Rename pattern for clarity. llvm-svn: 148422	2012-01-18 21:54:09 +00:00
Dan Gohman	8f12faeb14	Add a depth limit to avoid runaway recursion. llvm-svn: 148419	2012-01-18 21:24:45 +00:00
Dan Gohman	82041c2e60	Use llvm.global_ctors to locate global constructors instead of recognizing them by name. llvm-svn: 148416	2012-01-18 21:19:38 +00:00
Jakub Staszak	632a355a01	Remove trailing spaces and unneeded includes. llvm-svn: 148415	2012-01-18 21:16:33 +00:00
Lang Hames	1997de0100	Fixed macro condition. llvm-svn: 148408	2012-01-18 19:48:31 +00:00
Jim Grosbach	e2d298168c	Tidy up. 80 columns. llvm-svn: 148401	2012-01-18 18:52:20 +00:00
Jim Grosbach	aba3de99c0	Tidy up. MCAsmBackend naming conventions. llvm-svn: 148400	2012-01-18 18:52:16 +00:00
Bill Wendling	75afc7afe8	Remove dead code. llvm-svn: 148384	2012-01-18 10:10:28 +00:00
Nadav Rotem	3b8f0cc9fa	Fix a bug in the type-legalization of vector integers. When we bitcast one vector type to another, we must not bitcast the result if one type is widened while the other is promoted. llvm-svn: 148383	2012-01-18 08:33:18 +00:00
Pete Cooper	c52eeed310	Fix ISD::REG_SEQUENCE to accept physical registers and change TwoAddressInstructionPass to insert copies for any physical reg operands of the REG_SEQUENCE llvm-svn: 148377	2012-01-18 04:16:16 +00:00
Jim Grosbach	adcc938c46	Thumb2 load/store fixups don't set the thumb bit. Load/store instructions w/ a fixup to be relative a function marked as thumb don't use the low bit to specify thumb vs. non-thumb like interworking branches do, so don't set it when dealing with those fixups. rdar://10348687. llvm-svn: 148366	2012-01-18 00:40:25 +00:00
Jim Grosbach	3b50c9ec7f	Move some ARM specific MCAssembler bits into the ARMAsmBackend. llvm-svn: 148364	2012-01-18 00:23:57 +00:00
Jakob Stoklund Olesen	f43b599550	Add a CoveredBySubRegs property to Register descriptions. When set, this bit indicates that a register is completely defined by the value of its sub-registers. Use the CoveredBySubRegs property to infer which super-registers are call-preserved given a list of callee-saved registers. For example, the ARM registers D8-D15 are callee-saved. This now automatically implies that Q4-Q7 are call-preserved. Conversely, Win64 callees save XMM6-XMM15, but the corresponding YMM6-YMM15 registers are not call-preserved because they are not fully defined by their sub-registers. llvm-svn: 148363	2012-01-18 00:16:39 +00:00
Jakob Stoklund Olesen	fdbb12b235	Implement ARMBaseRegisterInfo::getCallPreservedMask(). Move ARM callee-saved lists into ARMCallingConv.td. llvm-svn: 148357	2012-01-17 23:09:00 +00:00
Jim Grosbach	3fa6dcfebb	Fix MCJIT memory leak of owned TargetMachine. The JIT is expected to take ownership of the TM that's passed in. The MCJIT wasn't freeing it, resulting in leaks. llvm-svn: 148356	2012-01-17 23:08:46 +00:00
Jakob Stoklund Olesen	d51a710bde	Move X86 callee saved register lists to the X86CallConv .td file. Add a trivial implementation of the getCallPreservedMask() hook. llvm-svn: 148347	2012-01-17 22:47:01 +00:00
Jakub Staszak	173bce3d2b	Move includes to the .cpp file. llvm-svn: 148342	2012-01-17 22:16:31 +00:00
Jim Grosbach	4045507fea	MC tweak symbol difference resolution for non-local symbols. When the non-local symbol in the expression is in the same fragment as the second symbol, the assembler can still evaluate the expression without needing a relocation. For example, on ARM: _foo: ldr lr, (_foo - 4) rdar://10348687 llvm-svn: 148341	2012-01-17 22:14:39 +00:00
Devang Patel	c9ed518792	Intel syntax: Fix parser match class to check memory operand size. llvm-svn: 148338	2012-01-17 21:48:03 +00:00
Nadav Rotem	fb6ddee0e9	Transform: (EXTRACT_VECTOR_ELT( VECTOR_SHUFFLE )) -> EXTRACT_VECTOR_ELT. llvm-svn: 148337	2012-01-17 21:44:01 +00:00
Devang Patel	a7143b6a2b	Intel syntax: Parse "BYTE PTR [RDX + RCX]" llvm-svn: 148334	2012-01-17 21:25:10 +00:00
Dan Gohman	e7a243fea5	Add a new ObjC ARC optimization pass to eliminate unneeded autorelease push+pop pairs. llvm-svn: 148330	2012-01-17 20:52:24 +00:00
Dan Gohman	b9936296d3	Add a new PassManagerBuilder customization point, EP_ModuleOptimizerEarly, to allow passes to be added before the main ModulePass optimizers. llvm-svn: 148329	2012-01-17 20:51:32 +00:00
Devang Patel	2ed6718616	Untabify. llvm-svn: 148322	2012-01-17 19:09:22 +00:00
Devang Patel	8b39be79ad	Intel syntax: Do not unnecessarily create plus expression for memory operand displacement. llvm-svn: 148321	2012-01-17 19:08:07 +00:00
Devang Patel	41b9ddeb7a	Intel syntax: Robustify memory operand parsing. llvm-svn: 148312	2012-01-17 18:00:18 +00:00
Manuel Klimek	85d26f9807	Removes template magic to build up containers. Instead, we now put the attributes of the container into members. llvm-svn: 148302	2012-01-17 09:34:07 +00:00
Nadav Rotem	86c3807b99	Fix warning. llvm-svn: 148301	2012-01-17 09:31:09 +00:00
Nadav Rotem	86e5390dbf	Fix 11769. In CanXFormVExtractWithShuffleIntoLoad we assumed that EXTRACT_VECTOR_ELT can be later handled by the DAGCombiner. However, in some cases on AVX, the EXTRACT_VECTOR_ELT is legalized to EXTRACT_SUBVECTOR + EXTRACT_VECTOR_ELT, which currently is not handled by the DAGCombiner. In this patch I added a check that we only extract from the XMM part. llvm-svn: 148298	2012-01-17 09:13:19 +00:00
Craig Topper	02cb0fb136	Teach DAG combiner to turn a BUILD_VECTOR of UNDEFs into an UNDEF of vector type. llvm-svn: 148297	2012-01-17 09:09:48 +00:00
Craig Topper	9cafcd8baa	Remove unnecessary AVX check from an assert. hasSSE2 is enough. llvm-svn: 148295	2012-01-17 08:23:44 +00:00
David Blaikie	a5708dc3a3	Provide better messages in llvm_unreachable. llvm-svn: 148293	2012-01-17 07:00:13 +00:00
Andrew Trick	7ccdc5c192	misched: Initial interface and implementation for ScheduleTopDownLive and ShuffleInstructions. llvm-svn: 148291	2012-01-17 06:55:07 +00:00
Andrew Trick	e1c034fefe	Renamed MachineScheduler to ScheduleTopDownLive. Responding to code review. llvm-svn: 148290	2012-01-17 06:55:03 +00:00
Andrew Trick	8093eac51d	Moving options declarations around. More short term hackery until we have a way to configure passes that work on LiveIntervals. llvm-svn: 148289	2012-01-17 06:54:59 +00:00
Andrew Trick	12728f04ca	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
Craig Topper	37b10ef250	Fix a crasher when PerformShiftCombine receives a BUILD_VECTOR of all UNDEF. Probably could use better handling in DAG combine or getNode. Fixes PR11772. llvm-svn: 148285	2012-01-17 04:44:50 +00:00
David Blaikie	b48ed1a4cb	Remove unreachable code. (replace with llvm_unreachable to help GCC where necessary) llvm-svn: 148284	2012-01-17 04:43:56 +00:00
Rafael Espindola	cbda0e255d	Add 148175 back. I am unable to reproduce any non determinism in a dragonegg or clang bootstrap. I will keep an eye on the bots. Original message: Only emit the Leh_func_endN symbol when needed. llvm-svn: 148283	2012-01-17 04:19:20 +00:00
Pete Cooper	e3d305a206	Changed flag operand of ISD::FP_ROUND to TargetConstant as it should not get checked for legalisation llvm-svn: 148275	2012-01-17 01:54:07 +00:00
Lang Hames	818e1ffd74	Fix typo in comment. llvm-svn: 148268	2012-01-17 00:39:29 +00:00
Jim Grosbach	06594e1018	Tidy up. llvm-svn: 148265	2012-01-16 23:50:58 +00:00
Jim Grosbach	0ddb3a4963	ExecutionEngine interface to re-map addresses for engines that support it. llvm-svn: 148264	2012-01-16 23:50:55 +00:00
Jim Grosbach	9df6cc8f4f	MCJIT handle a few more simple x86 relocations for MachO. llvm-svn: 148263	2012-01-16 23:50:49 +00:00
David Blaikie	486df738c3	Removing unused default switch cases in switches over enums that already account for all enumeration values explicitly. (This time I believe I've checked all the -Wreturn-type warnings from GCC & added the couple of llvm_unreachables necessary to silence them. If I've missed any, I'll happily fix them as soon as I know about them) llvm-svn: 148262	2012-01-16 23:24:27 +00:00
Hal Finkel	b1691ccaaa	Cleanup PPC RLWINM8 vs RLWINM No test case: output assembly will be identical. llvm-svn: 148261	2012-01-16 23:22:50 +00:00
Hal Finkel	8606e3c7e3	AggressiveAntiDepBreaker needs to skip debug values because a debug value does not have a corresponding SUnit llvm-svn: 148260	2012-01-16 22:53:41 +00:00
Jakob Stoklund Olesen	86ae07f049	Extract method for detecting constant unallocatable physregs. It is safe to move uses of such registers. llvm-svn: 148259	2012-01-16 22:34:08 +00:00
Jim Grosbach	eff0a40d7e	MCJIT support for non-function sections. Move to a by-section allocation and relocation scheme. This allows better support for sections which do not contain externally visible symbols. Flesh out the relocation address vs. local storage address separation a bit more as well. Remote process JITs use this to tell the relocation resolution code where the code will live when it executes. The startFunctionBody/endFunctionBody interfaces to the JIT and the memory manager are deprecated. They'll stick around for as long as the old JIT does, but the MCJIT doesn't use them anymore. llvm-svn: 148258	2012-01-16 22:26:39 +00:00
Stepan Dyatkovskiy	2931a59ec5	Fixed comment in loop-unswitch. llvm-svn: 148252	2012-01-16 20:48:04 +00:00
Jakob Stoklund Olesen	6de6d3e4ec	Give better scavenger errors by invoking the verifier. llvm-svn: 148251	2012-01-16 20:38:31 +00:00
Jakob Stoklund Olesen	374ed322f2	Add a new kind of MachineOperand: MO_RegisterMask. Register masks will be used as a compact representation of large clobber lists. Currently, an x86 call instruction has some 40 operands representing call-clobbered registers. That's more than 1kB of useless operands per call site. A register mask operand references a bit mask of call-preserved registers, everything else is clobbered. The bit mask will typically come from TargetRegisterInfo::getCallPreservedMask(). By abandoning ImplicitDefs for call-clobbered registers, it also becomes possible to share call instruction descriptions between calling conventions, and we can get rid of the WINCALL* instructions. This patch introduces the new operand kind. Future patches will add RegMask support to target-independent passes before finally the fixed clobber lists can be removed from call instruction descriptions. llvm-svn: 148250	2012-01-16 19:22:00 +00:00
Eli Friedman	206ca569aa	Make sure the non-SSE lowering for fences correctly clobbers EFLAGS. PR11768. llvm-svn: 148240	2012-01-16 16:42:21 +00:00
Eli Friedman	75e3db4c7a	Get rid of unused codegen-only instruction. llvm-svn: 148239	2012-01-16 16:29:35 +00:00
Craig Topper	db8890aedd	Give priority to AVX over SSE for 128-bit floating point unpck instructions. llvm-svn: 148233	2012-01-16 09:56:42 +00:00
Eli Bendersky	1b0cd0f1b1	A fix for the previous commit: "integer constant is too large for ‘long’ type" error on some 32-bit bots llvm-svn: 148232	2012-01-16 09:31:10 +00:00
Eli Bendersky	4c647587b1	Adding a basic ELF dynamic loader and MC-JIT for ELF. Functionality is currently basic and will be enhanced with future patches. Patch developed by Andy Kaylor and Daniel Malea. Reviewed on llvm-commits. llvm-svn: 148231	2012-01-16 08:56:09 +00:00
David Blaikie	5d8e42755c	Refactor variables unused under non-assert builds (& remove two entirely unused variables). llvm-svn: 148230	2012-01-16 05:17:39 +00:00
Pete Cooper	e85b95d754	Changed intrinsic ID operand to a target constant as it's not used in any arithmetic so should not be checked in legalisation llvm-svn: 148228	2012-01-16 04:08:12 +00:00
Nadav Rotem	57935243bd	[AVX] Optimize x86 VSELECT instructions using SimplifyDemandedBits. We know that the blend instructions only use the MSB, so if the mask is sign-extended then we can convert it into a SHL instruction. This is a common pattern because the type-legalizer sign-extends the i1 type which is used by the LLVM-IR for the condition. Added a new optimization in SimplifyDemandedBits for SIGN_EXTEND_INREG -> SHL. llvm-svn: 148225	2012-01-15 19:27:55 +00:00
Benjamin Kramer	339ced4e34	Return an ArrayRef from ShuffleVectorSDNode::getMask and push it through CodeGen. llvm-svn: 148218	2012-01-15 13:16:05 +00:00
Benjamin Kramer	5a377e28da	DAGCombiner: Deduplicate code. llvm-svn: 148217	2012-01-15 11:50:43 +00:00
Stepan Dyatkovskiy	7ec12e431a	Cosmetic patch for r148215. llvm-svn: 148216	2012-01-15 09:45:11 +00:00
Stepan Dyatkovskiy	cb2adbacf8	Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to std::map, since we need to keep a valid pointer to properties of current loop. Message for r148132: LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148215	2012-01-15 09:44:07 +00:00
Chandler Carruth	da22f30e72	Remove SetWorkingDirectory from the Process interface. Nothing in LLVM or Clang is using this, and it would be hard to use it correctly given the thread hostility of the function. Also, it never checked the return which is rather dangerous with chdir. If someone was in fact using this, please let me know, as well as what the usecase actually is so that I can add it back and make it more correct and secure to use. (That said, it's never going to be "safe" per-se, but we could at least document the risks...) llvm-svn: 148211	2012-01-15 08:41:35 +00:00
David Blaikie	fdcd669bc6	Remove dead code. llvm-svn: 148206	2012-01-15 01:09:13 +00:00
Craig Topper	201c1a3505	Truncate of undef is just undef of smaller size. llvm-svn: 148205	2012-01-15 01:05:11 +00:00
Craig Topper	c10e1abaf3	Fix the memop type on a couple 256-bit AVX instructions that were using f128mem instead of f256mem. llvm-svn: 148196	2012-01-14 18:29:57 +00:00
Craig Topper	d78429f850	Add a bunch of AVX instructions to the folding tables. Also fixed the alignment on 256-bit AVX2 instructions. llvm-svn: 148194	2012-01-14 18:14:53 +00:00
Duncan Sands	90212bde1f	Speculatively revert commit 148175 (rafael), to see if this fixes non-determinism in the 32 bit dragonegg buildbot. Original commit message: Only emit the Leh_func_endN symbol when needed. llvm-svn: 148191	2012-01-14 17:16:48 +00:00
Andrew Trick	23ef0d6c40	Fix a corner case hit by redundant phi elimination running after LSR. Fixes PR11761: bad IR w/ redundant Phi elim llvm-svn: 148177	2012-01-14 03:17:23 +00:00
Rafael Espindola	dfde7631fa	Only emit the Leh_func_endN symbol when needed. llvm-svn: 148175	2012-01-14 02:36:51 +00:00
Andrew Trick	59ac4fb706	misched: Initial code for building an MI level scheduling DAG llvm-svn: 148174	2012-01-14 02:17:18 +00:00
Andrew Trick	dbee9d8900	Move physreg dependency generation into aptly named addPhysRegDeps. llvm-svn: 148173	2012-01-14 02:17:15 +00:00
Andrew Trick	1d028a364d	misched: Added ScheduleDAGInstrs::IsPostRA llvm-svn: 148172	2012-01-14 02:17:12 +00:00
Andrew Trick	7e120f4e66	misched: Invoke the DAG builder on each sequence of schedulable instructions. llvm-svn: 148171	2012-01-14 02:17:09 +00:00
Andrew Trick	6344087e17	Move things around to make the file navigable, even though it will probably be split up later. llvm-svn: 148170	2012-01-14 02:17:06 +00:00
Evan Cheng	6bb95253eb	After r147827 and r147902, it's now possible for unallocatable registers to be live across BBs before register allocation. This miscompiled 197.parser when a cmp + b are optimized to a cbnz instruction even though the CPSR def is live-in a successor. cbnz r6, LBB89_12 ... LBB89_12: ble LBB89_1 The fix consists of two parts. 1) Teach LiveVariables that some unallocatable registers might be liveouts so don't mark their last use as kill if they are. 2) ARM constantpool island pass shouldn't form cbz / cbnz if the conditional branch does not kill CPSR. rdar://10676853 llvm-svn: 148168	2012-01-14 01:53:46 +00:00
Chad Rosier	71a185c5c6	Fix pasto from r146196. llvm-svn: 148167	2012-01-14 01:50:21 +00:00
Dan Gohman	4cf362acc1	Fix an unused variable warning that Chad noticed. llvm-svn: 148164	2012-01-14 00:47:44 +00:00
Rafael Espindola	a693128778	Remove previous commit while I debug the bot failures. llvm-svn: 148156	2012-01-13 23:28:50 +00:00
Jakob Stoklund Olesen	35545421c8	Use RegisterTuples to generate pseudo-registers. The QQ and QQQQ registers are not 'real', they are pseudo-registers used to model some vld and vst instructions. This makes the call clobber lists longer, but I intend to get rid of those soon. llvm-svn: 148151	2012-01-13 22:55:42 +00:00
Rafael Espindola	cef42c30a7	Remove label that is not used anymore. llvm-svn: 148150	2012-01-13 22:41:58 +00:00
Eli Friedman	d476fdc392	Speculatively revert r148132+r148133 to try and fix a buildbot failure. llvm-svn: 148149	2012-01-13 22:34:39 +00:00
Andrew Trick	f35c84032d	Remove pointless mode line in .cpp file. llvm-svn: 148143	2012-01-13 22:04:16 +00:00
Devang Patel	7066d28043	Revert r148131, it was committed before it was ready. llvm-svn: 148134	2012-01-13 19:28:58 +00:00
Stepan Dyatkovskiy	0a920fa210	Cosmetic patch for r148132. llvm-svn: 148133	2012-01-13 19:27:22 +00:00
Stepan Dyatkovskiy	cbcbdb237f	LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148132	2012-01-13 19:13:54 +00:00
Devang Patel	7ecdc6d4f5	Refactor. llvm-svn: 148131	2012-01-13 19:12:18 +00:00
Craig Topper	e52d86a740	Convert SHUFPD with the same register for both sources to PSHUFD if it would prevent a register copy. Similar to SHUFPS, but requires the mask to be converted. llvm-svn: 148112	2012-01-13 09:21:41 +00:00
Craig Topper	b1c2ebf6ee	use v8i32 as optimal mem type over v8f32 if AVX2 is enabled. Similar to SSE2 vs SSE1. llvm-svn: 148109	2012-01-13 08:32:21 +00:00
Craig Topper	cb7e13d7c0	Make X86 instruction selection use 256-bit VPXOR for build_vector of all ones if AVX2 is enabled. This gives the ExeDepsFix pass a chance to choose FP vs int as appropriate. Also use v8i32 as the type for getZeroVector if AVX2 is enabled. This is consistent with SSE2 preferring v4i32. llvm-svn: 148108	2012-01-13 08:12:35 +00:00
Craig Topper	9f14d9f939	Add patterns for v16i16 and v32i8 immAllZerosV to select VPXOR to match v4i64 and v8i32. llvm-svn: 148106	2012-01-13 06:59:47 +00:00
Andrew Trick	e77e84e4b7	Added the MachineSchedulerPass skeleton. llvm-svn: 148105	2012-01-13 06:30:30 +00:00
Andrew Trick	4d4fef238a	wrong filename llvm-svn: 148103	2012-01-13 06:30:22 +00:00
Andrew Trick	b1be1aa8f8	80-col violation llvm-svn: 148102	2012-01-13 06:30:19 +00:00
Craig Topper	a4c5a47b97	Use 8i32 constant pool entry for converting AVX2_SETALLONES. Possibly fixes PR11750. llvm-svn: 148101	2012-01-13 06:12:41 +00:00
Craig Topper	2aa07f832e	Fix typo in PerformAddCombine that caused any vector type to be checked for horizontal add/sub if AVX2 is enabled. This caused an assert to fail for non 128/256-bit vectors when done before type legalizing. Fixes PR11749. llvm-svn: 148096	2012-01-13 05:04:25 +00:00
Jakob Stoklund Olesen	dd8fbf572e	Delete CodeInit and CodeRecTy from TableGen. The code type was always identical to a string anyway. Now it is simply a synonym. The code literal syntax [{...}] is still valid. llvm-svn: 148092	2012-01-13 03:38:34 +00:00
Jakob Stoklund Olesen	9d1c5eeb32	Use uniqued StringInit pointers for lookups. This avoids a gazillion StringMap and dynamic_cast calls, making TableGen run 3x faster. llvm-svn: 148091	2012-01-13 03:16:35 +00:00
Evan Cheng	fa8326334b	DAGCombine's logic for forming pre- and post- indexed loads / stores were being overly conservative. It was concerned about cases where it would prohibit folding simple [r, c] addressing modes. e.g. ldr r0, [r2] ldr r1, [r2, #4] => ldr r0, [r2], #4 ldr r1, [r2] Change the logic to look for such cases which allows it to form indexed memory ops more aggressively. rdar://10674430 llvm-svn: 148086	2012-01-13 01:37:24 +00:00
Bill Wendling	9c8456f7ef	Fix off-by-one error. llvm-svn: 148077	2012-01-13 00:41:53 +00:00
Dan Gohman	728db4997a	Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that the optimizer doesn't eliminate objc_retainBlock calls which are needed for their side effect of copying blocks onto the heap. This implements rdar://10361249. llvm-svn: 148076	2012-01-13 00:39:07 +00:00
Pete Cooper	9bcb72136e	Added MVT::v2f16 llvm-svn: 148067	2012-01-12 23:14:13 +00:00
Bill Wendling	49c4dfb534	Revert accidental commit. llvm-svn: 148065	2012-01-12 23:06:28 +00:00
Bill Wendling	ee5eaebc58	Fix the code that was WRONG. The registers are placed into the saved registers list in the reverse order, which is why the original loop was written to loop backwards. llvm-svn: 148064	2012-01-12 23:05:03 +00:00
Pete Cooper	99415fea87	Added FPOW, FEXP, FLOG to PromoteNode so that custom actions can be set to Promote for those operations. Sorry, no test case yet llvm-svn: 148050	2012-01-12 21:46:18 +00:00
Elena Demikhovsky	060f6ccdb8	Fixed a bug in LowerVECTOR_SHUFFLE caused assertion failure lc: X86ISelLowering.cpp:6480: llvm::SDValue llvm::X86TargetLowering::LowerVECTOR_SHUFFLE(llvm::SDValue, llvm::SelectionDAG&) const: Assertion `V1.getOpcode() != ISD::UNDEF&& "Op 1 of shuffle should not be undef"' failed. Added a test. llvm-svn: 148044	2012-01-12 20:33:10 +00:00
Evan Cheng	5c03a6b8f5	When hoisting common code, watch out for uses which are marked "kill". If the killed registers are needed below the insertion point, then unset the kill marker. Sorry I'm not able to find a reduced test case. rdar://10660944 llvm-svn: 148043	2012-01-12 20:31:24 +00:00
Rafael Espindola	00e861ed57	Support segmented stacks on 64-bit FreeBSD. This patch uses tcb_spare field in the tcb structure to store info. Patch by Jyun-Yan You. llvm-svn: 148041	2012-01-12 20:24:30 +00:00
Rafael Espindola	10745d3381	Support segmented stacks on win32. Uses the pvArbitrary slot of the TIB, which is reserved for applications. We only support frames with a static size. llvm-svn: 148040	2012-01-12 20:22:08 +00:00
Evan Cheng	09cc429cb1	Allow targets to select source order pre-RA scheduler. llvm-svn: 148033	2012-01-12 18:27:52 +00:00
Devang Patel	4a6e778aae	Rename X86ATTAsmParser -> X86AsmParser We are using one parser to parse att as well as intel style syntax. llvm-svn: 148032	2012-01-12 18:03:40 +00:00
Jakob Stoklund Olesen	994fed689f	Make SplitAnalysis::UseSlots private. llvm-svn: 148031	2012-01-12 17:53:44 +00:00
Benjamin Kramer	9ece950ddb	After Jakob's r147938 exception handling on i386 was completely broken. Restore the (obviously wrong) behavior from before r147938 without relying on undefined behavior. Add a fat FIXME note. This should fix nightly tester failures. llvm-svn: 148030	2012-01-12 17:37:18 +00:00
Nadav Rotem	0a0a829bea	Fix a bug in the AVX 256-bit shuffle code in cases where the splat element is on the boundary of two 128-bit vectors. The attached testcase was stuck in an endless loop. llvm-svn: 148027	2012-01-12 15:31:55 +00:00
Benjamin Kramer	5b3aa60b44	X86: Generalize the x << (y & const) optimization to also catch masks with more bits set than 31 or 63. llvm-svn: 148024	2012-01-12 12:41:34 +00:00
Devang Patel	fc6be102ae	Add predicate method check match memory operand size, if available. In att style asm syntax memory operand size is derived from suffix attached with mnemonic. In intel style asm syntax it is part of memory operand hence predicate method check is required to select appropriate instruction. llvm-svn: 148006	2012-01-12 01:51:42 +00:00
Bill Wendling	58c7569854	A DenseMap of a std::map isn't a very good idea because the "grow()" method will need to make a deep copy of each of the std::maps. Use a std::map of the std::map instead. This improves the compile time of sqlite3 by ~2%. llvm-svn: 148003	2012-01-12 01:41:03 +00:00
Devang Patel	46831de240	Add intel style operand parser skeleton. This is a work in progress. llvm-svn: 148002	2012-01-12 01:36:43 +00:00
Chandler Carruth	eb21da060b	Switch all of the uses of my InsertDAGNode helper to follow the exact same pattern. We already had this pattern in a few places, but others tried to make a rough approximation of an actual DAG structure. As not everywhere went to this trouble, nothing could rely on this being done. In fact, I've checked all references to these node Ids, and the ones that are using the topo-sort properties are actually satisfied with a strict-weak-ordering. The requirement appears to be that Use >= Def. I've added a big blurb of comments to this bit of the transform to clarify why the order is so important for the next reader of the code. I'm starting with this change as it is very small, and trivially reverted if something breaks or the >= above really does need to be >. If that proves the case, we can hide the problem by reverting this patch, but the problem exists elsewhere as well, and so a more comprehensive solution will be needed. llvm-svn: 148001	2012-01-12 01:34:44 +00:00
Bill Wendling	4ec081a4d2	Revert r147978. A DenseMap's iterators may become invalidated here. llvm-svn: 147980	2012-01-11 23:43:34 +00:00
Jakob Stoklund Olesen	20f19eb9ab	Make data structures private. llvm-svn: 147979	2012-01-11 23:19:08 +00:00
Bill Wendling	f0275df9e3	Use a DenseMap. This appears to improve sqlite3's compile time by ~2%. llvm-svn: 147978	2012-01-11 22:57:32 +00:00
Jakob Stoklund Olesen	73edbf1682	Sink spillInterferences into RABasic. This helper method is too simplistic for RAGreedy. llvm-svn: 147976	2012-01-11 22:52:14 +00:00
Jakob Stoklund Olesen	06ec420347	Cleanup. llvm-svn: 147975	2012-01-11 22:52:11 +00:00
Jakob Stoklund Olesen	a818d804a1	Move RegAllocBase into its own cpp file separate from RABasic. No functional change. llvm-svn: 147972	2012-01-11 22:28:30 +00:00
Eli Friedman	b31c627be1	Re-fix the issue Bill fixed in r147899 in a slightly different way, which doesn't abuse the semantics of linker_private. We don't really want to merge any string constant with a weak_odr global. llvm-svn: 147971	2012-01-11 22:06:46 +00:00
Eric Christopher	d284c1d80d	Fix assert. llvm-svn: 147966	2012-01-11 20:55:27 +00:00
Argyrios Kyrtzidis	cd8fe08e4d	Disable the crash reporter when running lit tests. llvm-svn: 147965	2012-01-11 20:53:25 +00:00
Nadav Rotem	b5ce6ee835	On AVX, we can load v8i32 at a time. The bug happens when two uneven loads are used. When we load the v12i32 type, the GenWidenVectorLoads method generates two loads: v8i32 and v4i32 and attempts to use CONCAT_VECTORS to join them. In this fix I concat undef values to widen the smaller value. The test "widen_load-2.ll" also exposes this bug on AVX. llvm-svn: 147964	2012-01-11 20:19:17 +00:00
Rafael Espindola	d90466bcbf	Support segmented stacks on mac. This uses TLS slot 90, which actually belongs to JavaScriptCore. We only support frames with static size. Patch by Brian Anderson. llvm-svn: 147960	2012-01-11 19:00:37 +00:00
Rafael Espindola	4eecacb9c8	Generate the segmented stack prologue for fastcc too. Patch by Brian Anderson. llvm-svn: 147958	2012-01-11 18:41:19 +00:00
Chandler Carruth	3212a34269	Revert r147945 which disabled an addressing mode transformation. I had hoped this would revive one of the llvm-gcc selfhost build bots, but it didn't so it doesn't appear that my transform is the culprit. If anyone else is seeing failures, please let me know! llvm-svn: 147957	2012-01-11 18:36:12 +00:00
Rafael Espindola	2b89448d60	Use unsigned comparison in segmented stack prologue. This is a comparison of two addresses, and GCC does the comparison unsigned. Patch by Brian Anderson. llvm-svn: 147954	2012-01-11 18:23:35 +00:00
Kostya Serebryany	687d078192	[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395 : don't instrument the function at all on x86_32 if it has a large asm blob llvm-svn: 147953	2012-01-11 18:15:23 +00:00
Rafael Espindola	6635ae1c17	Explicitly set the scale to 1 on some segstack prologue instrs. Patch by Brian Anderson. llvm-svn: 147952	2012-01-11 18:14:03 +00:00
Kevin Enderby	6223cf72e6	The error check for using -g with a .s file already containing dwarf .file directives was in the wrong place and getting triggered incorrectly with a cpp .file directive. This change fixes that and adds a test case. llvm-svn: 147951	2012-01-11 18:04:47 +00:00
Jan Sjödin	21f83d9f36	Add XOP Intrinsics and tests llvm-svn: 147949	2012-01-11 15:20:20 +00:00
Nadav Rotem	baae7e4577	Fix a bug in the lowering of BUILD_VECTOR for AVX. SCALAR_TO_VECTOR does not zero untouched elements. Use INSERT_VECTOR_ELT instead. llvm-svn: 147948	2012-01-11 14:07:51 +00:00
Duncan Sands	0bf46b5363	Don't try to create a GEP when the pointee type is unsized (such GEPs are invalid). Fixes a crash on array1.C from the GCC testsuite when compiled with dragonegg. llvm-svn: 147946	2012-01-11 12:20:08 +00:00
Chandler Carruth	9bc48e5215	Disable the transformation I added in r147936 to see if it fixes some strange build bot failures that look like a miscompile into an infloop. I'll investigate this tomorrow, but I'd both like to know whether my patch is the culprit, and get the bots back to green. llvm-svn: 147945	2012-01-11 12:17:47 +00:00
Chandler Carruth	3eacfb83fa	Hoist a really redundant code pattern into a helper function, and delete lots of lines of code. No functionality changed. llvm-svn: 147942	2012-01-11 11:04:36 +00:00
Chandler Carruth	b0049f4a43	Simplify the AND-rooted mask+shift checking code to match that of the SRL-rooted code. llvm-svn: 147941	2012-01-11 09:35:04 +00:00
Chandler Carruth	3dbcda8478	Unify the interface of the three mask+shift transform helpers, and factor the differences that were hiding in one of them into its other caller, the SRL handling code. No change in behavior. llvm-svn: 147940	2012-01-11 09:35:02 +00:00
Chandler Carruth	aa01e6661a	Clarify and make explicit some of the requirements for transforming mask+shift pairs at the beginning of the ISD::AND case block, and then hoist the final pattern into a helper function, simplifying and reflowing it appropriately. This should have no observable behavior change, but several simplifications fell out of this such as directly computing the new mask constant, etc. llvm-svn: 147939	2012-01-11 09:35:00 +00:00
Jakob Stoklund Olesen	6039983755	Fix undefined code and reenable test case. I don't think the compact encoding code is right, but at least it has defined behavior now. llvm-svn: 147938	2012-01-11 09:08:04 +00:00
Chandler Carruth	51d3076bbf	Hoist the logic to transform shift+mask combinations into sub-register extracts and scaled addressing modes into its own helper function. No functionality changed here, just hoisting and layout fixes falling out of that hoisting. llvm-svn: 147937	2012-01-11 08:48:20 +00:00
Chandler Carruth	55b2cdee26	Teach the X86 instruction selection to do some heroic transforms to detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows: unsigned x = my_accelerator_table[input >> 11]; Here we have some lookup table that we look into using the high bits of 'input'. Each entity in the table is 4 bytes, which means this implicitly gets turned into (once lowered out of a GEP): *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2)); The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured. In order for all of this to work, one part of the DAG needs to be canonicalized still further than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern. llvm-svn: 147936	2012-01-11 08:41:08 +00:00
Stepan Dyatkovskiy	8216569812	Improved compile time: 1. Size heuristics changed. Now we calculate the number of unswitching branches only once per loop. 2. Some checks were moved from UnswitchIfProfitable to processCurrentLoop, since they do not change during processCurrentLoop iteration. This allows deciding to skip some loops at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
Andrew Trick	e81211f45c	Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. This interface is misleading and dangerous, but it is actually what we need for unrolling. llvm-svn: 147926	2012-01-11 06:52:55 +00:00
Rafael Espindola	647841b181	Add big endian mips support. Based on a patch by Jack Carter. llvm-svn: 147924	2012-01-11 04:04:14 +00:00
Rafael Espindola	870c4e92b9	Add the skeleton of an asm parser for mips. llvm-svn: 147923	2012-01-11 03:56:41 +00:00
Andrew Trick	642f0f6a40	ARM Ld/St Optimizer fix. Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits. Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12 llvm-svn: 147922	2012-01-11 03:56:08 +00:00
Jakob Stoklund Olesen	8b1d023a4a	Detect when a value is undefined on an edge to a landing pad. Consider this code: int h() { int x; try { x = f(); g(); } catch (...) { return x+1; } return x; } The variable x is undefined on the first edge to the landing pad, but it has the f() return value on the second edge to the landing pad. SplitAnalysis::getLastSplitPoint() would assume that the return value from f() was live into the landing pad when f() throws, which is of course impossible. Detect these cases, and treat them as if the landing pad wasn't there. This allows spill code to be inserted after the function call to f(). <rdar://problem/10664933> llvm-svn: 147912	2012-01-11 02:07:05 +00:00
Jakob Stoklund Olesen	67aec12409	Exclusively use SplitAnalysis::getLastSplitPoint(). Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result. llvm-svn: 147911	2012-01-11 02:07:00 +00:00
Evan Cheng	d9725a38d6	Avoid CSE of instructions which define physical registers across MBBs unless the physical registers are not allocatable. llvm-svn: 147902	2012-01-11 00:38:11 +00:00

52778 Commits