llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	55a2a96153	Oop - r150653 + r150654 broke one of my test cases. Backing out for now... llvm-svn: 150655	2012-02-16 02:32:10 +00:00
Lang Hames	11ca986b17	FPSCR shouldn't be reserved. llvm-svn: 150654	2012-02-16 02:28:14 +00:00
Jakob Stoklund Olesen	8a450cb2fa	Enable register mask operands for x86 calls. Call instructions no longer have a list of 43 call-clobbered registers. Instead, they get a single register mask operand with a bit vector of call-preserved registers. This saves a lot of memory, 42 x 32 bytes = 1344 bytes per call instruction, and it speeds up building call instructions because those 43 imp-def operands no longer need to be added to use-def lists. (And removed and shifted and re-added for every explicit call operand). Passes like LiveVariables, LiveIntervals, RAGreedy, PEI, and BranchFolding are significantly faster because they can deal with call clobbers in bulk. Overall, clang -O2 is between 0% and 8% faster, uniformly distributed depending on call density in the compiled code. Debug builds using clang -O0 are 0% - 3% faster. I have verified that this patch doesn't change the assembly generated for the LLVM nightly test suite when building with -disable-copyprop and -disable-branch-fold. Branch folding behaves slightly differently in a few cases because call instructions have different hash values now. Copy propagation flushes its data structures when it crosses a register mask operand. This causes it to leave a few dead copies behind, on the order of 20 instruction across the entire nightly test suite, including SPEC. Fixing this properly would require the pass to use different data structures. llvm-svn: 150638	2012-02-16 00:02:50 +00:00
Sirish Pande	30804c24ca	Optimize redundant sign extends and negation of predicates. llvm-svn: 150606	2012-02-15 18:52:27 +00:00
Eric Christopher	53da633f93	Revert "Replacing HexagonOptimizeSZExtends with HexagonPeephole." This reverts commit 1656806a944bbd23e98c6e578810fe02495ab741. llvm-svn: 150605	2012-02-15 18:34:25 +00:00
Eric Christopher	d9811eb7be	Revert "Optimize redundant sign extends and negation of predicates" as it's breaking the build. This reverts commit 11241abca5e2a313412fed594bb9d9fa2a2057fb. llvm-svn: 150604	2012-02-15 18:32:25 +00:00
Sirish Pande	99571325f1	Replacing HexagonOptimizeSZExtends with HexagonPeephole. llvm-svn: 150603	2012-02-15 18:31:35 +00:00
Sirish Pande	4736aee81e	Optimize redundant sign extends and negation of predicates llvm-svn: 150601	2012-02-15 18:22:18 +00:00
Chad Rosier	0bc5132457	Add braces to if clause to make symmetric with associate else clause. llvm-svn: 150591	2012-02-15 17:36:21 +00:00
Bill Wendling	dfb45f4d68	Strip the pointer casts from the constants here. The c'tor list is stored as a list of 'void ()*'s, so all of the functions are bitcast to that. However, the dyn_cast doesn't automagically look through bitcasts. Do that for it. <rdar://problem/10813350> llvm-svn: 150572	2012-02-15 09:14:08 +00:00
Andrew Trick	c9ce9d2315	Added TargetPassConfig::disablePass/substitutePass as a general mechanism to override specific passes. llvm-svn: 150562	2012-02-15 03:21:47 +00:00
Chad Rosier	f0687634c3	Use a temporary variable, rather then a series of redundant calls. llvm-svn: 150538	2012-02-15 00:36:26 +00:00
Pete Cooper	c21ebf5c41	Stop custom lowering forr x86 DEC64m from happening if the load in the lowered sequence has more than 1 user llvm-svn: 150537	2012-02-15 00:33:37 +00:00
Chad Rosier	dccc4794e6	Use a temporary variable, rather then a series of redundant calls. llvm-svn: 150536	2012-02-15 00:23:55 +00:00
Chad Rosier	5b9c3974d2	Remove unnecessary assignment to temporary, ResultReg. llvm-svn: 150520	2012-02-14 22:29:48 +00:00
Craig Topper	cfad98f745	Move old movl vector_shuffle patterns. Not needed anymore since vector_shuffles shouldn't reach isel. llvm-svn: 150462	2012-02-14 08:14:53 +00:00
Lang Hames	876f24f706	Third time's the charm...? llvm-svn: 150447	2012-02-14 00:34:30 +00:00
Lang Hames	185455df7e	Unswap swap operands, partially reducing confusion. llvm-svn: 150444	2012-02-14 00:17:12 +00:00
Bill Wendling	05d6f2ff1e	Don't reserve the R0 and R1 registers here. We don't use these registers, and marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437	2012-02-13 23:47:16 +00:00
Lang Hames	aef4ca78c5	Make operands for VSWP read-modify-write. llvm-svn: 150433	2012-02-13 23:37:19 +00:00
Craig Topper	8b19d78808	Still more vector_shuffle pattern removal. llvm-svn: 150365	2012-02-13 07:23:41 +00:00
Ahmed Charles	32e983e4fc	Fix various issues (or do cleanups) found by enabling certain MSVC warnings. - Use unsigned literals when the desired result is unsigned. This mostly allows unsigned/signed mismatch warnings to be less noisy even if they aren't on by default. - Remove misplaced llvm_unreachable. - Add static to a declaration of a function on MSVC x86 only. - Change some instances of calling a static function through a variable to simply calling that function while removing the unused variable. llvm-svn: 150364	2012-02-13 06:30:56 +00:00
Craig Topper	74650add0e	Remove more vector_shuffle patterns for unpack. These should be target specific nodes when they get to isel. llvm-svn: 150363	2012-02-13 05:48:49 +00:00
Craig Topper	6d471c9e49	Recommit r150328. Previous test failures should be fixed by r150360. llvm-svn: 150362	2012-02-13 05:10:10 +00:00
Craig Topper	87119fa37f	Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only have one use. Matches DAGCombiner and prevents vector_shuffles from reaching isel. llvm-svn: 150360	2012-02-13 04:30:38 +00:00
NAKAMURA Takumi	0826c17d00	Revert r150328, "Remove more vector_shuffle patterns." It caused 3 failures on pre-penryn and non-x86(generic) hosts. llvm-svn: 150357	2012-02-13 00:10:15 +00:00
Pete Cooper	71be57bb32	Fixed bug when custom lowering DEC64m on x86. If the DEC node had more than one user, it was doing this lowering but leaving the original DEC node around and so decrementing twice. Fixes PR11964. llvm-svn: 150356	2012-02-13 00:10:03 +00:00
Craig Topper	e24c94af81	Remove more vector_shuffle patterns. llvm-svn: 150328	2012-02-12 08:14:35 +00:00
Nick Lewycky	4b273cb7ea	Remove redundant getAnalysis<> calls in GlobalOpt. Add a few Itanium ABI calls to TargetLibraryInfo and use one of them in GlobalOpt. llvm-svn: 150323	2012-02-12 02:15:20 +00:00
Craig Topper	d40d9eb2b3	Remove more vector_shuffle patterns. llvm-svn: 150321	2012-02-12 01:07:34 +00:00
Craig Topper	330ca97700	Remove more vector_shuffle patterns. llvm-svn: 150314	2012-02-11 23:31:01 +00:00
Anton Korobeynikov	c6b4017ce2	Add support for implicit TLS model used with MS VC runtime. Patch by Kai Nacke! llvm-svn: 150307	2012-02-11 17:26:53 +00:00
Benjamin Kramer	915e3d9568	Don't mix declarations and code. llvm-svn: 150305	2012-02-11 16:01:02 +00:00
Benjamin Kramer	428704eb52	Make the EDis tables const. llvm-svn: 150304	2012-02-11 14:51:07 +00:00
Benjamin Kramer	478e8de8ef	Reuse the enum names from X86Desc in the X86Disassembler. This requires some gymnastics to make it available for C code. Remove the names from the disassembler tables, making them relocation free. llvm-svn: 150303	2012-02-11 14:50:54 +00:00
Craig Topper	981c6cf7b3	Remove some patterns for matching vector_shuffle instructions since vector_shuffles should be custom lowered before isel. llvm-svn: 150299	2012-02-11 07:43:35 +00:00
Craig Topper	11826a6e10	Fix shuffle lowering code to stop creating temporary DAG nodes to do shuffle mask checks on. This seemed to be confusing things such that vector_shuffle ops to got through to iselection. This is another step towards removing the vector_shuffle handling patterns from isel. llvm-svn: 150296	2012-02-11 06:24:48 +00:00
Jim Grosbach	1c9dd2974f	Revert r150222, as the clang driver now handles this properly. Now that the clang driver passes the CPU and feature information to the backend when processing assembly files (150273), this isn't necessary. llvm-svn: 150274	2012-02-10 20:38:46 +00:00
Jason W Kim	c7f4841769	Make valgrind happy. llvm-svn: 150251	2012-02-10 16:07:59 +00:00
Andrew Trick	f08915ca8e	unnecessary include llvm-svn: 150228	2012-02-10 04:10:44 +00:00
Andrew Trick	f4ff234384	PTX no longer needs to provide its own backend. llvm-svn: 150227	2012-02-10 04:10:40 +00:00
Andrew Trick	d3f8fe81f4	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
Jim Grosbach	ffc02c5ffc	ARM on darwin, v6 implies the presence of VFP for the assembler. rdar://10838899 llvm-svn: 150222	2012-02-10 02:21:49 +00:00
Sirish Pande	545983ea46	Test for commit access. llvm-svn: 150178	2012-02-09 15:20:33 +00:00
James Molloy	d9ba4fd48f	Teach the MC and disassembler about SoftFail, and hook it up to UNPREDICTABLE on ARM. Wire this to tBLX in order to provide test coverage. llvm-svn: 150169	2012-02-09 10:56:31 +00:00
Craig Topper	a0cd970b81	More tweaks to get the size of the X86 disassembler tables down. llvm-svn: 150167	2012-02-09 08:58:07 +00:00
Craig Topper	487e744f66	Flatten some of the arrays in the X86 disassembler tables to reduce space needed to store pointers on 64-bit hosts and reduce relocations needed at startup. Part of PR11953. llvm-svn: 150161	2012-02-09 07:45:30 +00:00
Jakob Stoklund Olesen	4519fd0b21	Handle register masks when searching for EFLAGS clobbers. Calls clobber the flags, but when using register masks there is no EFLAGS<imp-def> operand. llvm-svn: 150117	2012-02-09 00:17:22 +00:00
Andrew Trick	1fa5bcbe2a	Codegen pass definition cleanup. No functionality. Moving toward a uniform style of pass definition to allow easier target configuration. Globally declare Pass ID. Globally declare pass initializer. Use INITIALIZE_PASS consistently. Add a call to the initializer from CodeGen.cpp. Remove redundant "createPass" functions and "getPassName" methods. While cleaning up declarations, cleaned up comments (sorry for large diff). llvm-svn: 150100	2012-02-08 21:23:13 +00:00
Andrew Trick	3ed444a16a	Move pass configuration out of pass constructors: StackSlotColoring. llvm-svn: 150097	2012-02-08 21:22:57 +00:00
Andrew Trick	df7e3769b5	Move pass configuration out of pass constructors: PostRAScheduler. llvm-svn: 150096	2012-02-08 21:22:53 +00:00
Andrew Trick	58648e4e98	Move pass configuration out of pass constructors: BranchFolderPass llvm-svn: 150095	2012-02-08 21:22:48 +00:00
Andrew Trick	dd37d52f95	Added TargetPassConfig::setOpt llvm-svn: 150093	2012-02-08 21:22:39 +00:00
Andrew Trick	c044917a8a	Move pass configuration out of pass constructors: TailDuplicate::PreRegAlloc llvm-svn: 150091	2012-02-08 21:22:30 +00:00
Brendon Cahoon	6f35837048	Use TSFlag bit to describe instruction properties. Creating the isPredicated TSFlag enables the code to use the property defined in the instruction format instead of using a large switch statement. llvm-svn: 150078	2012-02-08 18:25:47 +00:00
Elena Demikhovsky	1adc1d53dd	Fixed a bug in printing "cmp" pseudo ops. > This IR code > %res = call <8 x float> @llvm.x86.avx.cmp.ps.256(<8 x float> %a0, <8 x float> %a1, i8 14) > fails with assertion: > > llc: X86ATTInstPrinter.cpp:62: void llvm::X86ATTInstPrinter::printSSECC(const llvm::MCInst, unsigned int, llvm::raw_ostream&): Assertion `0 && "Invalid ssecc argument!"' failed. > 0 llc 0x0000000001355803 > 1 llc 0x0000000001355dc9 > 2 libpthread.so.0 0x00007f79a30575d0 > 3 libc.so.6 0x00007f79a23a1945 gsignal + 53 > 4 libc.so.6 0x00007f79a23a2f21 abort + 385 > 5 libc.so.6 0x00007f79a239a810 __assert_fail + 240 > 6 llc 0x00000000011858d5 llvm::X86ATTInstPrinter::printSSECC(llvm::MCInst const, unsigned int, llvm::raw_ostream&) + 119 I added the full testing for all possible pseudo-ops of cmp. I extended X86AsmPrinter.cpp and X86IntelInstPrinter.cpp. You'l also see lines alignments (unrelated to this fix) in X86IselLowering.cpp from my previous check-in. llvm-svn: 150068	2012-02-08 08:37:26 +00:00
Craig Topper	172b9243cd	Remove a couple unneeded intrinsic patterns llvm-svn: 150067	2012-02-08 08:29:30 +00:00
Craig Topper	5405571fe0	Remove GCC builtins for vpermilp* intrinsics as clang no longer needs them. Custom lower the intrinsics to the vpermilp target specific node and remove intrinsic patterns. llvm-svn: 150060	2012-02-08 06:36:57 +00:00
Chad Rosier	0ee8c513f7	[fast-isel] Add support for SUBs with non-legal types. llvm-svn: 150047	2012-02-08 02:45:44 +00:00
Chad Rosier	bd471255a9	[fast-isel] Add support for ORs with non-legal types. llvm-svn: 150045	2012-02-08 02:29:21 +00:00
Chad Rosier	ded4c99f2e	[fast-isel] Add support for indirect branches. llvm-svn: 150014	2012-02-07 23:56:08 +00:00
Evan Cheng	1b81fddd65	Use LEA to adjust stack ptr for Atom. Patch by Andy Zhang. llvm-svn: 150008	2012-02-07 22:50:41 +00:00
Evan Cheng	45d8f8a08c	Do not fold ADD / SUB into load / store (to form pre-indexed, post-indexed load / store) if the ADD / SUB has a live definition of CPSR. Bug reported by David Meyer. Alas, no test case. llvm-svn: 149970	2012-02-07 07:09:28 +00:00
Craig Topper	b27fd77c3f	Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD. llvm-svn: 149968	2012-02-07 06:28:42 +00:00
Craig Topper	e55c556a24	Convert assert(0) to llvm_unreachable llvm-svn: 149961	2012-02-07 02:50:20 +00:00
Chad Rosier	685b20c114	[fast-isel] Add support for ADDs with non-legal types. llvm-svn: 149934	2012-02-06 23:50:07 +00:00
Andrew Trick	34914910f5	Add TargetPassConfig to the PassManager for use inside passes llvm-svn: 149926	2012-02-06 22:51:15 +00:00
Derek Schuff	8b2dcad4b5	Enable streaming of bitcode This CL delays reading of function bodies from initial parse until materialization, allowing overlap of compilation with bitcode download. llvm-svn: 149918	2012-02-06 22:30:29 +00:00
Chris Lattner	8213c8af29	Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 149912	2012-02-06 21:56:39 +00:00
Bill Wendling	d5d95b0b51	[unwind removal] We no longer have 'unwind' instructions being generated, so remove the code that handles them. llvm-svn: 149901	2012-02-06 21:16:41 +00:00
Benjamin Kramer	2496717052	X86: Don't call malloc for 4 bits. No functionality change. llvm-svn: 149866	2012-02-06 12:06:18 +00:00
Benjamin Kramer	ae87d7b4b2	Hexagon: Remove forbidden iostream includes (it introduces static initializers) Reorder includes while at it. llvm-svn: 149863	2012-02-06 10:19:29 +00:00
Craig Topper	1f71057747	Add shuffle decoding support for 256-bit pshufd. Merge vpermilp* and pshufd decoding. llvm-svn: 149859	2012-02-06 07:17:51 +00:00
Evan Cheng	613d6d3b43	DefinesPredicate should only look for def operands. Patch by Ludwig Meier. llvm-svn: 149846	2012-02-05 19:55:04 +00:00
Duncan Sands	994c1103c8	Remove dead test: this was already checked and handled a few lines above. llvm-svn: 149841	2012-02-05 19:30:06 +00:00
Duncan Sands	ae22c60f90	Persuade GCC that there is nothing worth warning about here (there isn't). llvm-svn: 149834	2012-02-05 14:20:11 +00:00
Duncan Sands	efabc2572f	Don't initialize CV in terms of itself! Spotted by GCC. llvm-svn: 149833	2012-02-05 14:16:09 +00:00
Chandler Carruth	ebd90c58e6	Begin fleshing out more convenience predicates in llvm::Triple and convert at least one client over to use them. Subsequent patches both to LLVM and Clang will try to convert more people over to a common set of predicates. This round of predicates is focused on OS-categorization predicates. llvm-svn: 149815	2012-02-05 08:26:40 +00:00
Craig Topper	c4965bce14	Convert assert(0) to llvm_unreachable llvm-svn: 149814	2012-02-05 07:21:30 +00:00
Craig Topper	4ed7278ff4	Convert assert(0) to llvm_unreachable in X86 Target directory. llvm-svn: 149809	2012-02-05 05:38:58 +00:00
Craig Topper	83f3bdaa45	Convert some assert(0) in default of switch statements to llvm_unreachable. llvm-svn: 149808	2012-02-05 03:43:23 +00:00
Craig Topper	1d471e31ba	Add target specific node for PMULUDQ. Change patterns to use it and custom lower intrinsics to it. Use it instead of intrinsic to handle 64-bit vector multiplies. llvm-svn: 149807	2012-02-05 03:14:49 +00:00
Chris Lattner	cf9e8f6968	reapply the patches reverted in r149470 that reenable ConstantDataArray, but with a critical fix to the SelectionDAG code that optimizes copies from strings into immediate stores: the previous code was stopping reading string data at the first nul. Address this by adding a new argument to llvm::getConstantStringInfo, preserving the behavior before the patch. llvm-svn: 149800	2012-02-05 02:29:43 +00:00
Craig Topper	4daa67483d	Remove most of the intrinsics for XOP VPCMOV instruction. They all aliased to the same instruction with different types. This would be better accomplished with casts in the not yet created xopintrin.h header file. llvm-svn: 149795	2012-02-05 00:55:56 +00:00
Andrew Trick	f8ea108c05	TargetPassConfig: confine the MC configuration to TargetMachine. Passes prior to instructon selection are now split into separate configurable stages. Header dependencies are simplified. The bulk of this diff is simply removal of the silly DisableVerify flags. Sorry for the target header churn. Attempting to stabilize them. llvm-svn: 149754	2012-02-04 02:56:59 +00:00
Chad Rosier	b84a4b4c64	[fast-isel] Add support for URem. llvm-svn: 149716	2012-02-03 21:23:45 +00:00
Chad Rosier	e023d5d7f3	[fast-isel] Rename isZExt to isSigned. No functional change intended. llvm-svn: 149714	2012-02-03 21:14:11 +00:00
Chad Rosier	aaa55a88b6	[fast-isel] Add support for UDIV. llvm-svn: 149712	2012-02-03 21:07:27 +00:00
Chad Rosier	41f0e78b6c	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	a8a8ac5d47	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
Craig Topper	47e6d26911	Remove getShuffleVPERMILPImmediate function, getShuffleSHUFImmediate performs the same calculation. llvm-svn: 149683	2012-02-03 06:52:33 +00:00
Craig Topper	d5ffe0900d	Remove unnecessary qualification on 256-bit vector handling in LowerBUILD_VECTOR. Condition was already guaranteed by earlier code. llvm-svn: 149680	2012-02-03 06:32:21 +00:00
Andrew Trick	ccb673659a	Added TargetPassConfig. The first little step toward configuring codegen passes. Allows command line overrides to be centralized in LLVMTargetMachine.cpp. LLVMTargetMachine can intercept common passes and give precedence to command line overrides. Allows adding "internal" target configuration options without touching TargetOptions. Encapsulates the PassManager. Provides a good point to initialize all CodeGen passes so that Pass ID's can be used in APIs. Allows modifying the target configuration hooks without rebuilding the world. llvm-svn: 149672	2012-02-03 05:12:41 +00:00
Andrew Trick	808a7a6ce6	whitespace llvm-svn: 149671	2012-02-03 05:12:30 +00:00
Akira Hatanaka	f0b08445f6	Add a new MachineJumpTableInfo entry type, EK_GPRel64BlockAddress, which is needed to emit a 64-bit gp-relative relocation entry. Make changes necessary for emitting jump tables which have entries with directive .gpdword. This patch does not implement the parts needed for direct object emission or JIT. llvm-svn: 149668	2012-02-03 04:33:00 +00:00
Lang Hames	bb682450f9	Incorporate suggestions Chad, Jakob and Evan's suggestions on r149957. llvm-svn: 149655	2012-02-03 01:13:49 +00:00
Jakob Stoklund Olesen	5e1ac45b93	Require non-NULL register masks. It doesn't seem worthwhile to give meaning to a NULL register mask pointer. It complicates all the code using register mask operands. llvm-svn: 149646	2012-02-02 23:52:57 +00:00
Jakob Stoklund Olesen	caed1c9370	Add pseudo-registers for pairs, triples, and quads of D registers. NEON loads and stores accept single and double spaced pairs, triples, and quads of D registers. This patch adds new register classes to accurately model those constraints: Dn, Dn+1 Dn, Dn+2 ---------------------- DPair DPairSpc DTriple DTripleSpc DQuad DQuadSpc Also extend the existing QQ and QQQQ register classes to contains all Q pairs and quads instead of just the aligned ones. These new register classes will make it possible to accurately model constraints on NEON loads and stores, and we can get rid of all the NEON pseudo-instructions. The late scheduler will be able to accurately model instruction dependencies from the explicit operands. This more than doubles the number of ARM registers, but the backend passes are quite good at handling this. The llc -O0 compile time only regresses by 1.5%. Future work on register mask operands will recover this regression. llvm-svn: 149640	2012-02-02 22:45:32 +00:00
Elena Demikhovsky	6fbb4d2842	Minor change in signature of the getZeroVector() llvm-svn: 149601	2012-02-02 09:20:18 +00:00
Elena Demikhovsky	fb44980b41	Optimization for SIGN_EXTEND operation on AVX. Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32 extensions. llvm-svn: 149600	2012-02-02 09:10:43 +00:00
Francois Pichet	26f302d568	Unbreak the MSVC build. llvm-svn: 149599	2012-02-02 08:36:09 +00:00
Lang Hames	0269caafa6	Set EFLAGS correctly in EmitLoweredSelect on X86. llvm-svn: 149597	2012-02-02 07:48:37 +00:00
Akira Hatanaka	961883c1cf	Set the correct stack pointer register. llvm-svn: 149585	2012-02-02 03:17:04 +00:00
Akira Hatanaka	f029537e68	Expand EHSELECTION and EHSELECTION nodes. Set the correct exception pointer and selector registers. llvm-svn: 149584	2012-02-02 03:13:40 +00:00
Akira Hatanaka	d9fef17749	Add DWARF numbers of 64-bit registers. llvm-svn: 149583	2012-02-02 02:56:14 +00:00
Rafael Espindola	77295818f0	Fix the cmake build llvm-svn: 149561	2012-02-01 23:40:51 +00:00
Andrew Trick	8523b16ff5	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! llvm-svn: 149558	2012-02-01 23:20:51 +00:00
Jakob Stoklund Olesen	c7024a48db	Move ARM subreg index compositions to the SubRegIndex itself. llvm-svn: 149557	2012-02-01 23:16:43 +00:00
Mon P Wang	9f05206659	Avoid creating an extract element to an illegal type after LegalizeTypes has run. llvm-svn: 149548	2012-02-01 22:15:20 +00:00
Andrew Trick	d06df96a7c	VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling. Patch by Sergei Larin! llvm-svn: 149547	2012-02-01 22:13:57 +00:00
Chad Rosier	e273cb08c4	Tidy up. llvm-svn: 149521	2012-02-01 18:45:51 +00:00
Elena Demikhovsky	824eed70a6	Passing AVX 256-bit structures in Win64 was wrong. Fixed Win64 calling conventions. llvm-svn: 149494	2012-02-01 10:46:14 +00:00
Elena Demikhovsky	34cca175ab	Shortened code in shuffle masks llvm-svn: 149493	2012-02-01 10:33:05 +00:00
Elena Demikhovsky	0e48c70ba7	Optimization for "truncate" operation on AVX. Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with set of shuffles. llvm-svn: 149485	2012-02-01 07:56:44 +00:00
Stepan Dyatkovskiy	513aaa5691	SwitchInst refactoring. The purpose of refactoring is to hide operand roles from SwitchInst user (programmer). If you want to play with operands directly, probably you will need lower level methods than SwitchInst ones (TerminatorInst or may be User). After this patch we can reorganize SwitchInst operands and successors as we want. What was done: 1. Changed semantics of index inside the getCaseValue method: getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose don't mix SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous. 2. By the same reason findCaseValue(ConstantInt*) returns actual number of case value. 0 means first case, not default. If there is no case with given value, ErrorIndex will returned. 3. Added getCaseSuccessor method. I propose to avoid usage of TerminatorInst::getSuccessor if you want to resolve case successor BB. Use getCaseSuccessor instead, since internal SwitchInst organization of operands/successors is hidden and may be changed in any moment. 4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst. 4.1 "resolveSuccessorIndex" was created if you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for given case successor. 4.2 "resolveCaseIndex" converts low level successors index to case index that curresponds to the given successor. Note: There are also related compatability fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang. llvm-svn: 149481	2012-02-01 07:49:51 +00:00
Craig Topper	9cdb8bdf04	Don't create VBROADCAST nodes if any nodes use the chain result from the load. Fixes PR11900. llvm-svn: 149478	2012-02-01 06:51:58 +00:00
Argyrios Kyrtzidis	17c981a45b	Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. These are: r149348 r149351 r149352 r149354 r149356 r149357 r149361 r149362 r149364 r149365 llvm-svn: 149470	2012-02-01 04:51:17 +00:00
Jim Grosbach	a2147ce313	Tidy up. One more return type mismatch fix. llvm-svn: 149452	2012-01-31 23:51:09 +00:00
Jim Grosbach	44091c2f10	Refactor loop for better readability. Excellent suggestion from Ben Kramer. llvm-svn: 149417	2012-01-31 20:56:55 +00:00
Jim Grosbach	b4d3a6af97	Add explanatory comment. llvm-svn: 149416	2012-01-31 20:34:53 +00:00
Devang Patel	a173ee56fd	Add assembler dialect attribute in asm parser which lets target specific asm parser change dialect on the fly. llvm-svn: 149396	2012-01-31 18:14:05 +00:00
Craig Topper	b85e40f738	Remove pcmpgt/pcmpeq intrinsics as clang is not using them. llvm-svn: 149367	2012-01-31 06:52:44 +00:00
Chris Lattner	8ea967d050	with recent changes, ConstantArray is never a "string". Remove the associated methods and constant fold the clients to false. llvm-svn: 149362	2012-01-31 06:05:00 +00:00
Chris Lattner	3a565d6241	use the right accessor for ConstantDataArray. llvm-svn: 149342	2012-01-31 03:16:39 +00:00
Evan Cheng	4e7992eeba	PR11834: Use macros which are defined on Windows. Patch by Marina Yatsina. llvm-svn: 149294	2012-01-30 23:10:32 +00:00
Devang Patel	7cdb2ff6b5	Intel syntax. Adjust special code, used to recognize cmp<comparison code>{ss,sd,ps,pd}, for intel syntax. llvm-svn: 149291	2012-01-30 22:47:12 +00:00
Devang Patel	9a9bb5c5db	Intel syntax. Support .intel_syntax directive. llvm-svn: 149270	2012-01-30 20:02:42 +00:00
Benjamin Kramer	396c590818	Fix refacto. llvm-svn: 149269	2012-01-30 20:01:35 +00:00
Douglas Gregor	e577cfe172	Eliminate narrowing conversion in initializer list, to make C++11 happy llvm-svn: 149254	2012-01-30 16:57:18 +00:00
Benjamin Kramer	20af25f47b	X86: Simplify shuffle mask generation code. llvm-svn: 149248	2012-01-30 15:16:21 +00:00
Craig Topper	516cba3380	Fix pattern for memory form of PSHUFD for use with FP vectors to remove bitcast to an integer vector that normal code wouldn't have. Also remove bitcasts from code that turns splat vector loads into a shuffle as it was making the broken pattern necessary. llvm-svn: 149232	2012-01-30 07:50:31 +00:00
Craig Topper	ca29bcfc10	Move some XOP patterns into instruction definition. Replae VPCMOV intrinsic patterns with custom lowering to a target specific nodes. llvm-svn: 149216	2012-01-30 01:10:15 +00:00
Anton Korobeynikov	d0c46550fe	Cleanups for EABI standard functions llvm-svn: 149195	2012-01-29 09:11:50 +00:00
Anton Korobeynikov	1b42e64280	Use base AAPCS for varargs functions even for AAPCS-VFP CC llvm-svn: 149194	2012-01-29 09:06:09 +00:00
Bob Wilson	de0c335560	Add a note about a potential optimization for clz/ctz patterns for ARM (and other targets). llvm-svn: 149182	2012-01-28 18:30:07 +00:00
James Molloy	b47489d4ef	Ensure .AliasedSymbol() is called on all uses of getSymbol(). Affects ARM and MIPS ELF backends. Fixes PR11877 llvm-svn: 149180	2012-01-28 15:58:32 +00:00
Devang Patel	63fe5697f4	Intel Syntax: Parse mem operand with seg reg. QWORD PTR FS:[320] llvm-svn: 149142	2012-01-27 19:48:28 +00:00
Craig Topper	5639e9e8fb	Move some patterns back near their instructions and use AddedComplexity to fix priority. Merge some patterns into their instruction definition. llvm-svn: 149122	2012-01-27 07:09:40 +00:00
Jim Grosbach	8f28dbdde5	Keep source location information for X86 MCFixup's. llvm-svn: 149106	2012-01-27 00:51:27 +00:00
Jim Grosbach	20275a8577	Better user diagnostics for more ARM MachO relocation errors. llvm-svn: 149102	2012-01-27 00:37:12 +00:00
Jim Grosbach	b591277c4a	Better diagnostic for malformed .org assembly directive. Provide source line number information. llvm-svn: 149101	2012-01-27 00:37:08 +00:00
Jim Grosbach	5e5eabb5ab	Keep source information, if available, around for ARM Fixups. Adjust an example MachObjectWriter diagnostic to use the information to issue a better message. Before: LLVM ERROR: unknown ARM fixup kind! After: x.s:6:5: error: unsupported relocation on symbol beq bar ^ rdar://9800182 llvm-svn: 149093	2012-01-26 23:20:15 +00:00
Jakob Stoklund Olesen	fc9dce25f7	Handle call-clobbered ymm registers on Win64. The Win64 calling convention has xmm6-15 as callee-saved while still clobbering all ymm registers. Add a YMM_HI_6_15 pseudo-register that aliases the clobbered part of the ymm registers, and mark that as call-clobbered. This allows live xmm registers across calls. This hack wouldn't be necessary with RegisterMask operands representing the call clobbers, but they are not quite operational yet. llvm-svn: 149088	2012-01-26 22:59:28 +00:00
Jim Grosbach	c8f2b7877b	Tidy up. Fix mismatched return types for error handling. llvm-svn: 149062	2012-01-26 15:56:45 +00:00
James Molloy	6685c08e5f	Add support for the R_ARM_TARGET1 relocation, which should be given to relocations applied to all C++ constructors and destructors. This enables the linker to match concrete relocation types (absolute or relative) with whatever library or C++ support code is being linked against. llvm-svn: 149057	2012-01-26 09:25:43 +00:00
Victor Umansky	5f29b0e57b	Fix for the following bug in AVX codegen for double-to-int conversions: . "fptosi" and "fptoui" IR instructions are defined with round-to-zero rounding mode. . Currently for AVX mode for <4xdouble> and <8xdouble> the "VCVTPD2DQ.128" and "VCVTPD2DQ.256" instructions are selected (for .fp_to_sint. DAG node operation ) by AVX codegen. However they use round-to-nearest-even rounding mode. . Consequently, the conversion produces incorrect numbers. The fix is to replace selection of VCVTPD2DQ instructions with VCVTTPD2DQ instructions. The latter use truncate (i.e. round-to-zero) rounding mode. As .fp_to_sint. DAG node operation is used only for lowering of "fptosi" and "fptoui" IR instructions, the fix in X86InstrSSE.td definition file doesn.t have an impact on other LLVM flows. The patch includes changes in the .td file, LIT test for the changes and a fix in a legacy LIT test (which produced asm code conflicting with LLVN IR spec). llvm-svn: 149056	2012-01-26 08:51:39 +00:00
Craig Topper	86e44bc829	Add HasXOP predicate check covering a bunch of XOP intrinsic patterns. llvm-svn: 149054	2012-01-26 07:51:55 +00:00
Craig Topper	1c0e22f57a	Fix AVX vs SSE patterns ordering issue for VPCMPESTRM and VPCMPISTRM. llvm-svn: 149053	2012-01-26 07:31:30 +00:00
Craig Topper	b91760eff8	Remove some more patterns by custom lowering intrinsics to target specific nodes. llvm-svn: 149052	2012-01-26 07:18:03 +00:00
Anton Korobeynikov	7722a2d4e3	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00
Jim Grosbach	82f76d1275	ARM assemly parsing and validation of IT instruction. "Although a Thumb2 instruction, the IT mnemonic shall be permitted in ARM mode, and the condition verified to match the condition code(s) on the following instruction(s)." PR11853 llvm-svn: 148969	2012-01-25 19:52:01 +00:00
Chris Lattner	33633a90a0	fix a bug I introduced in r148929, this is not a splat! Thanks to Eli for noticing. llvm-svn: 148947	2012-01-25 09:56:22 +00:00
Craig Topper	7834900950	Custom lower PSIGN and PSHUFB intrinsics to their corresponding target specific nodes so we can remove the isel patterns. llvm-svn: 148933	2012-01-25 06:43:11 +00:00
Chris Lattner	47a86bdbe2	use ConstantVector::getSplat in a few places. llvm-svn: 148929	2012-01-25 06:02:56 +00:00
Craig Topper	ce4f9c5668	Custom lower phadd and phsub intrinsics to target specific nodes. Remove the patterns that are no longer necessary. llvm-svn: 148927	2012-01-25 05:37:32 +00:00
Craig Topper	5bcf070e68	Remove AVX 256-bit unaligned load intrinsics. 128-bit versions had been removed a while ago. llvm-svn: 148922	2012-01-25 04:42:03 +00:00
Akira Hatanaka	012f041bce	Mark 64-bit register RA_64 unused too. llvm-svn: 148918	2012-01-25 04:19:22 +00:00
Akira Hatanaka	01d3c42f90	Modify MipsFrameLowering::emitPrologue and emitEpilogue. - Use MipsAnalyzeImmediate to expand immediates that do not fit in 16-bit. - Change the types of variables so that they are sufficiently large to handle 64-bit pointers. - Emit instructions to set register $28 in a function prologue after instructions which store callee-saved registers have been emitted. llvm-svn: 148917	2012-01-25 04:12:04 +00:00
Akira Hatanaka	d1d4b3efcf	Modify MipsRegisterInfo::eliminateFrameIndex to use MipsAnalyzeImmediate to expand offsets that do not fit in the 16-bit immediate field of load and store instructions. Also change the types of variables so that they are sufficiently large to handle 64-bit pointers. llvm-svn: 148916	2012-01-25 03:55:10 +00:00
Craig Topper	3ad5bc019a	Merge intrinsic pattern and no pattern versions of VCVTSD2SI intruction definitions. Matches non-AVX version of same instructions. llvm-svn: 148914	2012-01-25 03:52:09 +00:00
NAKAMURA Takumi	6c421ea484	MipsAnalyzeImmediate.h: Fix to add DataTypes.h for msvc. inttypes.h is not supplied in msvc. llvm-svn: 148912	2012-01-25 03:34:41 +00:00
NAKAMURA Takumi	96a21dcea3	Target/Mips: Unbreak CMake build. llvm-svn: 148909	2012-01-25 03:15:46 +00:00
Akira Hatanaka	86d5fadd57	Lower 64-bit immediates using MipsAnalyzeImmediate that has just been added. Add a test case to show fewer instructions are needed to load an immediate with the new way of loading immediates. llvm-svn: 148908	2012-01-25 03:01:35 +00:00
Akira Hatanaka	ff36fd3de3	Add class MipsAnalyzeImmediate which comes up with an instruction sequence to load an immediate. llvm-svn: 148900	2012-01-25 01:43:36 +00:00
Jim Grosbach	086cbfac7d	NEON VLD4(all lanes) assembly parsing and encoding. llvm-svn: 148884	2012-01-25 00:01:08 +00:00
Jim Grosbach	ccb6d55dae	Tidy up. Rename VLD4DUP patterns for consistency. llvm-svn: 148883	2012-01-24 23:47:07 +00:00
Jim Grosbach	b78403ce48	NEON VLD3(all lanes) assembly parsing and encoding. llvm-svn: 148882	2012-01-24 23:47:04 +00:00
Akira Hatanaka	d7970f9e4b	Sign-extend 32-bit integer arguments when they are passed in 64-bit registers, which is what N32/64 does. llvm-svn: 148875	2012-01-24 23:18:43 +00:00
Akira Hatanaka	7e6c195c11	Pass CCState by reference. llvm-svn: 148871	2012-01-24 22:07:36 +00:00
Akira Hatanaka	77dbd786c8	Pattern for f32 to i64 conversion. llvm-svn: 148869	2012-01-24 22:05:25 +00:00
Devang Patel	a410ed3ced	Intel Syntax: Extend special hand coded logic, to recognize special instructions, for intel syntax. llvm-svn: 148864	2012-01-24 21:43:36 +00:00
Akira Hatanaka	9f7ec1538f	64-bit sign extension in register instructions. llvm-svn: 148862	2012-01-24 21:41:09 +00:00
Jim Grosbach	8e2722cdb0	NEON VST4(one lane) assembly parsing and encoding. llvm-svn: 148836	2012-01-24 18:53:13 +00:00
Owen Anderson	d845d9d9e9	Widen the instruction encoder that TblGen emits to a 64 bits, which should accomodate every target I can think of offhand. llvm-svn: 148833	2012-01-24 18:37:29 +00:00
Jim Grosbach	14952a0e32	NEON VLD4(one lane) assembly parsing and encoding. llvm-svn: 148832	2012-01-24 18:37:25 +00:00
Jim Grosbach	3cfef8d467	NEON Two-operand assembly aliases for VSRA. llvm-svn: 148821	2012-01-24 17:55:36 +00:00
Jim Grosbach	7ae12cc546	NEON Two-operand assembly aliases for VSLI. llvm-svn: 148819	2012-01-24 17:49:15 +00:00
Jim Grosbach	7b6f0f67aa	NEON Two-operand assembly aliases for VSRI. llvm-svn: 148818	2012-01-24 17:46:58 +00:00
Jim Grosbach	681db34eae	NEON add correct predicates for some asm aliases. llvm-svn: 148815	2012-01-24 17:23:29 +00:00
Chris Lattner	139822fc83	C++, CBE, and TLOF support for ConstantDataSequential llvm-svn: 148805	2012-01-24 14:17:05 +00:00
Elena Demikhovsky	0b0c5d8c4c	ZERO_EXTEND operation is optimized for AVX. v8i16 -> v8i32, v4i32 -> v4i64 - used vpunpck* instructions. llvm-svn: 148803	2012-01-24 13:54:13 +00:00
Anton Korobeynikov	3cad0c21ed	Use correct register class for am2offset register operands. This pacifies machine verifier llvm-svn: 148782	2012-01-24 04:58:56 +00:00
Craig Topper	0d8e67aebd	Add comments near load pattern fragments indicating that all integer vector loads are promoted to v2i64 or v4i64 so that no one tries to reintroduce pattern fragments for other types. llvm-svn: 148771	2012-01-24 03:03:17 +00:00
Jim Grosbach	da70eac268	NEON VST4(multiple 4 element structures) assembly parsing. llvm-svn: 148764	2012-01-24 00:58:13 +00:00
Jim Grosbach	ed561fc850	NEON VLD4(multiple 4 element structures) assembly parsing. llvm-svn: 148762	2012-01-24 00:43:17 +00:00
Jim Grosbach	1e946a4f91	Tidy up. Remove some vertical space for readability. llvm-svn: 148761	2012-01-24 00:43:12 +00:00
Chandler Carruth	ed975232bc	Revert r148686 (and r148694, a fix to it) due to a serious layering violation -- MC cannot depend on CodeGen. Specifically, the MCTargetDesc component of each target is actually a subcomponent of the MC library. As such, it cannot depend on the target-independent code generator, because MC itself cannot depend on the target-independent code generator. This change moved a flag from the ARM MCTargetDesc file ARMMCAsmInfo.cpp to the CodeGen layer in ARMException.cpp, leaving behind an 'extern' to refer back to it. That layering order isn't viable givin the constraints outlined above. Commandline flags are designed to be static specifically to avoid these types of bugs. Fixing this is likely going to require some non-trivial refactoring. llvm-svn: 148759	2012-01-24 00:30:17 +00:00
Jim Grosbach	17bacab475	Fix typo. llvm-svn: 148757	2012-01-24 00:12:39 +00:00
Jim Grosbach	d3d36d9315	NEON VST3(single element from one lane) assembly parsing. llvm-svn: 148755	2012-01-24 00:07:41 +00:00
Devang Patel	eba7d3dba9	Fix typo. llvm-svn: 148751	2012-01-23 23:56:33 +00:00
Jim Grosbach	1a74724fc9	NEON VST3(multiple 3-element structures) assembly parsing. llvm-svn: 148748	2012-01-23 23:45:44 +00:00
Jim Grosbach	ac2af3ffab	NEON VLD3(multiple 3-element structures) assembly parsing. llvm-svn: 148745	2012-01-23 23:20:46 +00:00
Anton Korobeynikov	820417af07	Add missed mayStore flag to STREXD / t2STREXD llvm-svn: 148742	2012-01-23 22:57:52 +00:00
Devang Patel	cf893a437e	Intel syntax: Robustify parsing of memory operand's displacement experssion. llvm-svn: 148737	2012-01-23 22:35:25 +00:00
Jim Grosbach	a8b444b08b	NEON VLD3 lane-indexed assembly parsing and encoding. llvm-svn: 148734	2012-01-23 21:53:26 +00:00
Devang Patel	e660fdd953	Intel syntax: Parse memory operand with empty base reg, e.g. DWORD PTR [4*RDI] llvm-svn: 148721	2012-01-23 20:20:06 +00:00
Jim Grosbach	d28ef9ac46	Simplify some NEON assembly pseudo definitions. Let the generic token alias definitions handle the data subtype suffices. We don't need explicit versions for each. llvm-svn: 148718	2012-01-23 19:39:08 +00:00
Devang Patel	880bc1644b	Intel syntax: Parse segment registers. llvm-svn: 148712	2012-01-23 18:31:58 +00:00
NAKAMURA Takumi	28ea8f523b	ARMAsmPrinter.cpp: Try to fix up r148686. EnableARMEHABI was also here. llvm-svn: 148694	2012-01-23 09:14:42 +00:00
Craig Topper	edd1d0acfc	Custom lower PCMPEQ/PCMPGT intrinsics to target specific nodes and remove the intrinsic patterns. llvm-svn: 148687	2012-01-23 08:18:28 +00:00

... 2 3 4 5 6 ...

20859 Commits