llvm-project

Commit Graph

Author	SHA1	Message	Date
Jim Grosbach	d0860d6e02	EmitAtomicCmpSwap() custom inserter needs to delete the MI passed in. EmitAtomicBinary() already does this. llvm-svn: 93479	2010-01-15 00:18:34 +00:00
Jakob Stoklund Olesen	0ca14e4498	ARM "l" constraint for inline asm means R0-R7, also for Thumb2. This is consistent with llvm-gcc's arm/constraints.md. Certain instructions (e.g. CBZ, CBNZ) require a low register, even in Thumb2 mode. llvm-svn: 93436	2010-01-14 18:19:56 +00:00
Jakob Stoklund Olesen	fcf91ee403	Fix pasto llvm-svn: 93342	2010-01-13 19:54:39 +00:00
Bill Wendling	919b7aab2e	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Evan Cheng	db4d798619	Delete the instruction just before the function terminates for consistency sake. llvm-svn: 91836	2009-12-21 19:53:39 +00:00
Rafael Espindola	b73b4fd30e	Fix libstdc++ build on ARM linux and part of PR5770. MI was not being used but it was also not being deleted, so it was kept in the garbage list. The memory itself was freed once the function code gen was done. Once in a while the codegen of another function would create an instruction on the same address. Adding it to the garbage group would work once, but when another pointer was added it would cause an assert as "Cache" was about to be pushed to Ts. For a patch that makes us detect problems like this earlier, take a look at http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20091214/092758.html With that patch we assert as soon as the new instruction is added to the garbage set. llvm-svn: 91691	2009-12-18 16:59:39 +00:00
Bob Wilson	3152b0471b	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Jim Grosbach	ea8f6e31a0	nand atomic requires opposite operand ordering llvm-svn: 91371	2009-12-15 00:12:35 +00:00
Jim Grosbach	3c4f04112a	Add ARMv6 memory and sync barrier instructions llvm-svn: 91329	2009-12-14 21:24:16 +00:00
Jim Grosbach	57ccc19617	Thumb2 atomic operations llvm-svn: 91321	2009-12-14 20:14:59 +00:00
Jim Grosbach	8f3c70e909	atomic binary operations up to 32-bits wide. llvm-svn: 91260	2009-12-14 04:22:04 +00:00
Jim Grosbach	8f9a3ac12c	Framework for atomic binary operations. The emitter for the pseudo instructions just issues an error for the moment. The front end won't yet generate these intrinsics for ARM, so this is behind the scenes until complete. llvm-svn: 91200	2009-12-12 01:40:06 +00:00
Jim Grosbach	5c4e99fca6	Rough first pass at compare_and_swap atomic builtins for ARM mode. Work in progress. llvm-svn: 91090	2009-12-11 01:42:04 +00:00
Jim Grosbach	53e8854443	Add memory barrier intrinsic support for ARM. Moving towards adding the atomic operations intrinsics. llvm-svn: 91003	2009-12-10 00:11:09 +00:00
Evan Cheng	0c2544fd6b	- Support inline asm 'w' constraint for 128-bit vector types. - Also support the 'q' NEON registers asm code. llvm-svn: 90894	2009-12-08 23:06:22 +00:00
Bob Wilson	0bbd3077ce	Recognize canonical forms of vector shuffles where the same vector is used for both source operands. In the canonical form, the 2nd operand is changed to an undef and the shuffle mask is adjusted to only reference elements from the 1st operand. Radar 7434842. llvm-svn: 90417	2009-12-03 06:40:55 +00:00
Anton Korobeynikov	2522908653	Materialize global addresses via movt/movw pair, this is always better than doing the same via constpool: 1. Load from constpool costs 3 cycles on A9, movt/movw pair - just 2. 2. Load from constpool might stall up to 300 cycles due to cache miss. 3. Movt/movw does not use load/store unit. 4. Less constpool entries => better compiler performance. This is only enabled on ELF systems, since darwin does not have needed relocations (yet). llvm-svn: 89720	2009-11-24 00:44:37 +00:00
Dan Gohman	de5dea869f	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Devang Patel	ed85e12da6	We are not using DBG_STOPPOINT anymore. llvm-svn: 89536	2009-11-21 02:46:55 +00:00
David Greene	1fbe054450	Add a bool flag to StackObjects telling whether they reference spill slots. The AsmPrinter will use this information to determine whether to print a spill/reload comment. Remove default argument values. It's too easy to pass a wrong argument value when multiple arguments have default values. Make everything explicit to trap bugs early. Update all targets to adhere to the new interfaces. llvm-svn: 87022	2009-11-12 20:49:22 +00:00
Evan Cheng	15b80e4a9f	isLegalICmpImmediate should take a signed integer; code clean up. llvm-svn: 86964	2009-11-12 07:13:11 +00:00
Evan Cheng	3d3c24a82c	Add TargetLowering::isLegalICmpImmediate. It tells LSR what immediate can be folded into target icmp instructions. llvm-svn: 86858	2009-11-11 19:05:52 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Evan Cheng	408aa56fb5	Remove ARMPCLabelIndex from ARMISelLowering. Use ARMFunctionInfo::createConstPoolEntryUId() instead. llvm-svn: 86294	2009-11-06 22:24:13 +00:00
Bob Wilson	b389f2a04d	Revert previous change to a comment. The BlockAddresses go in the constant pool so they don't get wrapped separately. llvm-svn: 85844	2009-11-03 00:02:05 +00:00
Bob Wilson	1c66e8a6b7	Put BlockAddresses into ARM constant pools. llvm-svn: 85824	2009-11-02 20:59:23 +00:00
Anton Korobeynikov	4d23754b14	Handle splats of undefs properly. This includes the testcase for PR5364 as well. llvm-svn: 85767	2009-11-02 00:12:06 +00:00
Jim Grosbach	8fe6fd702d	Expand 64-bit logical shift right inline llvm-svn: 85687	2009-10-31 21:42:19 +00:00
Jim Grosbach	624fcb286e	Expand 64-bit arithmetic shift right inline llvm-svn: 85685	2009-10-31 21:00:56 +00:00
Jim Grosbach	5d994048dd	Expand 64 bit left shift inline rather than using the libcall. For now, this is unconditional. Making it still use the libcall when optimizing for size would be a good adjustment. llvm-svn: 85675	2009-10-31 19:38:01 +00:00
Evan Cheng	cdbb70c065	It's safe to remat t2LDRpci; Add PseudoSourceValue to load / store's to enable more machine licm. More changes coming. llvm-svn: 85643	2009-10-31 03:39:36 +00:00
Bob Wilson	6b00f4b7a8	Fix a comment. llvm-svn: 85610	2009-10-30 20:13:25 +00:00
Rafael Espindola	ab7c709f43	This fixes functions like void f (int a1, int a2, int a3, int a4, int a5,...) In ARMTargetLowering::LowerFormalArguments if the function has 4 or more regular arguments we used to set VarArgsFrameIndex using an offset of 0, which is only correct if the function has exactly 4 regular arguments. llvm-svn: 85590	2009-10-30 14:33:14 +00:00
Bob Wilson	1cf0b03064	Add ARM codegen for indirect branches. clang/test/CodeGen/indirect-goto.c runs! (unoptimized) llvm-svn: 85577	2009-10-30 05:45:42 +00:00
Evan Cheng	ec6d7c945d	Give ARMISD::EH_SJLJ_LONGJMP and EH_SJLJ_SETJMP names. llvm-svn: 85381	2009-10-28 06:55:03 +00:00
Evan Cheng	4a609f3cef	Use fconsts and fconstd to materialize small fp constants. llvm-svn: 85362	2009-10-28 01:44:26 +00:00
Bob Wilson	854530a7dd	Most of the NEON shuffle instructions do not support 64-bit element types. llvm-svn: 84785	2009-10-21 21:36:27 +00:00
Evan Cheng	786b15fe12	Match more patterns to movt. llvm-svn: 84751	2009-10-21 08:15:52 +00:00
Benjamin Kramer	3301207a15	Random #include pruning. llvm-svn: 84632	2009-10-20 11:44:38 +00:00
Bob Wilson	419160bd79	Revert svn r80498 and replace it with a different solution. The only problem I can see with the original code was that I forgot that this runs after type legalization and hence the result type will always be i32. (Custom legalization of EXTRACT_VECTOR_ELT is only enabled for vector types with 8- and 16-bit elements.) Regarding the FIXME comment: any information about sign and zero-extension should be captured by separate extension operations. The DAG combiner should handle those to produce either VGETLANEu or VGETLANEs, and that seems to be working now. If there are cases that we're missing, let me know. llvm-svn: 84218	2009-10-15 23:12:05 +00:00
Bob Wilson	b62d160b3c	More Neon clean-up: avoid the need for custom-lowering vld/st-lane intrinsics by creating TargetConstants during instruction selection instead of during legalization. llvm-svn: 84042	2009-10-13 22:29:24 +00:00
Bob Wilson	1fdbe1152d	NEON VLD/VST are now fully implemented. For operations that expand to multiple instructions, the expansion is done during selection so there is no need to do anything special during legalization. llvm-svn: 84036	2009-10-13 21:55:24 +00:00
Anton Korobeynikov	75b59fb055	Add PseudoSourceValues for constpool stuff on ELF (Darwin should use something similar) and register spills. llvm-svn: 83435	2009-10-07 00:06:35 +00:00
Evan Cheng	32a47ea7b6	getFunctionAlignment should return log2 alignment. llvm-svn: 83242	2009-10-02 06:57:25 +00:00
Anton Korobeynikov	29a44df5f8	ARM does not support offset folding (yet). Disable it for now. This fixes PR5031. Unfortunately, there is no small testcase :( llvm-svn: 82643	2009-09-23 19:04:09 +00:00
Evan Cheng	9827ad39a7	Fix PR4926. When target hook EmitInstrWithCustomInserter() inserts new basic blocks and updates the CFG, it should also inform sdisel of the changes so the phi source operands will come from the right basic blocks. llvm-svn: 82311	2009-09-19 09:51:03 +00:00
Evan Cheng	270d0f986f	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. No functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Bob Wilson	5d8cfb217c	Expand vector floating-point conversions not supported by NEON. llvm-svn: 82074	2009-09-16 20:20:44 +00:00
Bob Wilson	6cc46577f4	Expand some more vector operations not supported by Neon. llvm-svn: 81969	2009-09-16 00:32:15 +00:00
Bob Wilson	4ed397c141	Neon does not support vector divide or remainder. Expand them. llvm-svn: 81966	2009-09-16 00:17:28 +00:00
Bob Wilson	194a2518e5	Expand all v2f64 arithmetic operations for Neon. Radar 7200803. (This should also fix the SingleSource/UnitTests/Vector/sumarray-dbl test.) llvm-svn: 81959	2009-09-15 23:55:57 +00:00
Bob Wilson	a2e8333eed	Fix pr4939: Change FPCCToARMCC to translate SETOLE to ARMCC::LS. See the bug report for details. llvm-svn: 81397	2009-09-09 23:14:54 +00:00
Anton Korobeynikov	7697d37777	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Evan Cheng	1b38952c99	References to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Sandeep Patel	68c5f477fa	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Bob Wilson	d7797754d4	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	da9817cddd	Generate code for vld{234}_lane intrinsics. llvm-svn: 80656	2009-09-01 04:26:28 +00:00
Jim Grosbach	20eac92d88	Clean up LSDA name generation and use for SJLJ exception handling. This makes an egregious hack somewhat more palatable. Bringing the LSDA forward and making it a GV available for reference would be even better, but is beyond the scope of what I'm looking to solve at this point. Objective C++ code could generate function names that broke the previous scheme. This fixes that. llvm-svn: 80649	2009-09-01 01:57:56 +00:00
Anton Korobeynikov	eab572a8ff	EXTRACT_VECTOR_ELEMENT can have result type different from element type. Remove the assertion and generalize the code for ARM NEON stuff. llvm-svn: 80498	2009-08-30 17:14:54 +00:00
Anton Korobeynikov	ece642a54c	Do not assert on too wide splats we don't support. llvm-svn: 80409	2009-08-29 00:08:18 +00:00
Evan Cheng	43b9ca6f42	Let Darwin linker auto-synthesize stubs and lazy-pointers. This deletes a bunch of nasty code in ARM asm printer. llvm-svn: 80404	2009-08-28 23:18:09 +00:00
Anton Korobeynikov	ba53af58f0	Hopefully the final missing part :( scalar_to_vector is fully legal now llvm-svn: 80251	2009-08-27 16:25:49 +00:00
Anton Korobeynikov	58ebae4acd	Transform float scalar_to_vector into subreg accesses. No idea whether this is profitable or not. llvm-svn: 80245	2009-08-27 14:38:44 +00:00
Bob Wilson	e0636a7aed	Remove unneeded ARM-specific DAG nodes for VLD* and VST* Neon operations. The instructions can be selected directly from the intrinsics. We will need to add some ARM-specific nodes for VLD/VST of 3 and 4 128-bit vectors, but those are not yet implemented. llvm-svn: 80117	2009-08-26 17:39:53 +00:00
Anton Korobeynikov	0f756b27ae	Expand scalar_to_vector - we don't have any isel logic for it now llvm-svn: 80107	2009-08-26 16:26:09 +00:00
Eli Friedman	682d8c1881	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Bob Wilson	a70623102e	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Anton Korobeynikov	232b19c3d5	Fix some typos and use type-based isel for VZIP/VUZP/VTRN llvm-svn: 79625	2009-08-21 12:41:42 +00:00
Anton Korobeynikov	9a232f46a8	Add lowering of ARM 4-element shuffles to multiple instructions via perfectshuffle-generated table. llvm-svn: 79624	2009-08-21 12:41:24 +00:00
Anton Korobeynikov	ce3ff1be8a	Add nodes & dummy matchers for some v{zip,uzp,trn} instructions llvm-svn: 79622	2009-08-21 12:40:50 +00:00
Anton Korobeynikov	e3046618de	Expand EXTRACT_SUBVECTOR llvm-svn: 79621	2009-08-21 12:40:35 +00:00
Anton Korobeynikov	38f284f2ae	Provide vext.{16,32} llvm-svn: 79620	2009-08-21 12:40:21 +00:00
Anton Korobeynikov	c32e99e3ed	Use masks not nodes for vector shuffle predicates. Provide set of 'legal' masks, so legalizer won't infinite cycle llvm-svn: 79619	2009-08-21 12:40:07 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bill Wendling	bae6b2cca3	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	d3fade656f	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	52d4e64711	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Evan Cheng	6ddd7bcdd1	Turn on if-conversion for thumb2. llvm-svn: 79084	2009-08-15 07:59:10 +00:00
Anton Korobeynikov	a6b3ce203a	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Evan Cheng	dc49a8d3f1	Add Thumb2 lsr hooks. llvm-svn: 79032	2009-08-14 20:09:37 +00:00
Evan Cheng	09c070f4ce	80 col violation. llvm-svn: 79026	2009-08-14 19:11:20 +00:00
Bob Wilson	6f34e278c7	Now that all the legal Neon shuffles (or at least the ones that have been implemented so far) are recognized during legalization, it is easy to fall back to the default expansion for other shuffles. llvm-svn: 78995	2009-08-14 05:16:33 +00:00
Bob Wilson	eb54d51759	Create a new ARM-specific DAG node, VDUP, to represent a splat from a scalar_to_vector. Generate these VDUP nodes during legalization instead of trying to recognize the pattern during selection. llvm-svn: 78994	2009-08-14 05:13:08 +00:00
Bob Wilson	cce31f6831	During legalization, change Neon vdup_lane operations from shuffles to target-specific VDUPLANE nodes. This allows the subreg handling for the quad-register version to be done easily with Pats in the .td file, instead of with custom code in ARMISelDAGToDAG.cpp. llvm-svn: 78993	2009-08-14 05:08:32 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Bob Wilson	3e4c012d54	Add a fixme message about canonicalizing floating-point vector types. llvm-svn: 78897	2009-08-13 06:01:30 +00:00
Bob Wilson	ef6e602bf4	Revert r78852 for now. I want to do this differently, but I don't have time to fix it tonight. llvm-svn: 78896	2009-08-13 05:58:56 +00:00
Bob Wilson	c6800b55e6	Add a comment to describe why vector shuffles are legalized to custom DAG nodes. llvm-svn: 78884	2009-08-13 02:13:04 +00:00
Bob Wilson	fcd6361ad1	Use cast<> instead of dyn_cast<> in places where the type is known. llvm-svn: 78881	2009-08-13 01:57:47 +00:00
Bob Wilson	ff2db10211	Recognize Neon VDUP shuffles during legalization instead of selection. llvm-svn: 78852	2009-08-12 22:54:19 +00:00
Bob Wilson	ea3a402ae7	Recognize Neon VREV shuffles during legalization instead of selection. llvm-svn: 78850	2009-08-12 22:31:50 +00:00
Jim Grosbach	3cfc6463c9	Add catch block handling to SjLj exception handling. llvm-svn: 78817	2009-08-12 17:38:44 +00:00
Evan Cheng	bb2af3555c	Shrink Thumb2 movcc instructions. llvm-svn: 78790	2009-08-12 05:17:19 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Jim Grosbach	f24f9d9cb6	Whitespace cleanup. Remove trailing whitespace. llvm-svn: 78666	2009-08-11 15:33:49 +00:00
Bob Wilson	12842f9865	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Jim Grosbach	693e36a3e8	SjLj based exception handling unwinding support. This patch is nasty, brutish and short. Well, it's kinda short. Definitely nasty and brutish. The front-end generates the register/unregister calls into the SjLj runtime, call-site indices and landing pad dispatch. The back end fills in the LSDA with the call-site information provided by the front end. Catch blocks are not yet implemented. Built on Darwin and verified no llvm-core "make check" regressions. llvm-svn: 78625	2009-08-11 00:09:57 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	3e77df2bcd	SimpleValueType-ify a few more methods on TargetLowering. llvm-svn: 78595	2009-08-10 20:46:15 +00:00
Owen Anderson	246617857f	Continue the SimpleValueType-ification. llvm-svn: 78593	2009-08-10 20:18:46 +00:00
Evan Cheng	b972e5633f	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Bob Wilson	0127031c20	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
Anton Korobeynikov	ef98dbe3de	Remove redundant checks: the only way to have, e.g. f32 RegVT is exactly hardfloat case. llvm-svn: 78237	2009-08-05 20:15:19 +00:00
Anton Korobeynikov	ef42862ef5	Unbreak the stuff, this is ugly, but we cannot do better for now with 'plain' C calling conv. llvm-svn: 78232	2009-08-05 19:40:16 +00:00
Anton Korobeynikov	22ef75155e	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Bob Wilson	20f79e321e	Change DAG nodes for Neon VLD2/3/4 operations to return multiple results. Get rid of yesterday's code to fix the register usage during isel. Select the new DAG nodes to machine instructions. The new pre-alloc pass to choose adjacent registers for these results is not done, so the results of this will generally not assemble yet. llvm-svn: 78136	2009-08-05 00:49:09 +00:00
Bob Wilson	f45dee3ad2	Lower Neon VLD* intrinsics to custom DAG nodes, and manually allocate the results to fixed registers. llvm-svn: 78025	2009-08-04 00:36:16 +00:00
Bob Wilson	17f8878114	Minor cleanup. No functional changes intended. llvm-svn: 78024	2009-08-04 00:25:01 +00:00
Bob Wilson	f307e0bd6d	Lower CONCAT_VECTOR during legalization instead of matching it during isel. Add a testcase. llvm-svn: 77992	2009-08-03 20:36:38 +00:00
Chris Lattner	4e7dfafc03	convert ctors/dtors section to be in TLOF instead of TAI. llvm-svn: 77842	2009-08-02 00:34:36 +00:00
Evan Cheng	6ab54fdb0a	Fix Thumb2 function call isel. Thumb1 and Thumb2 should share the same instructions for calls since BL and BLX are always 32-bit long and BX is always 16-bit long. Also, we should be using BLX to call external function stubs. llvm-svn: 77756	2009-08-01 00:16:10 +00:00
Chris Lattner	51d5b43cda	refactor section construction in TLOF to be through an explicit initialize method, which can be called when an MCContext is available. llvm-svn: 77687	2009-07-31 17:42:42 +00:00
Bob Wilson	0dbdec8042	Lower a 128-bit BUILD_VECTOR with 2 elements to a pair of INSERT_VECTOR_ELTs. llvm-svn: 77557	2009-07-30 00:31:25 +00:00
Evan Cheng	c6d70ae063	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
Evan Cheng	c8bed03349	In thumb2 mode, add pc is unpredictable. Use add + mov pc instead (that is until more optimization goes in). llvm-svn: 77364	2009-07-28 20:53:24 +00:00
Chris Lattner	a3242e93b7	the apple "ld_classic" linker doesn't support .literal16 in 32-bit mode, and "ld64" (the default linker) falls back to it in -static mode. llvm-svn: 77334	2009-07-28 17:50:28 +00:00
Chris Lattner	5e693ed07b	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though this is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes weirdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Bob Wilson	8a37bbebfd	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminates the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminates the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the latter is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predicting the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Chris Lattner	55452c2bea	fix an arm codegen bug (the same as PR4482 on ppc) where available_externally symbols were not getting stubs. While I'm at it, add a big testcase for stub generation to make sure I don't break anything. llvm-svn: 75737	2009-07-15 04:12:33 +00:00
Bob Wilson	3f17aee94b	Remove an extra space. llvm-svn: 75658	2009-07-14 18:44:34 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UNREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Bob Wilson	844d6c82a7	Fix comment typos. llvm-svn: 75479	2009-07-13 18:11:36 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Torok Edwin	6dd2730024	Start converting to new error handling API. cerr+abort -> llvm_report_error assert(0)+abort -> LLVM_UNREACHABLE (assert(0)+llvm_unreachable-> abort() included) llvm-svn: 75018	2009-07-08 18:01:40 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Evan Cheng	b24e51e2d9	Add some more Thumb2 multiplication instructions. llvm-svn: 74889	2009-07-07 01:17:28 +00:00
Tilmann Scheller	aea6059ed4	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Evan Cheng	0e8bde5910	Add thumb2 sign / zero extend with rotate instructions. llvm-svn: 74755	2009-07-03 01:43:10 +00:00
Evan Cheng	84c6cda2ef	Thumb2 pre/post indexed loads. llvm-svn: 74696	2009-07-02 07:28:31 +00:00
Evan Cheng	844f0b4562	80 col violation. llvm-svn: 74693	2009-07-02 06:44:30 +00:00
Bill Wendling	512ff7353e	Update comments to make it clear that the function alignment is the Log2 of the bytes and not bytes. llvm-svn: 74624	2009-07-01 18:50:55 +00:00
Bill Wendling	31ceb1bcba	Add an "alignment" field to the MachineFunction object. It makes more sense to have the alignment be calculated up front, and have the back-ends obey whatever alignment is decided upon. This allows for future work that would allow for precise no-op placement and the like. llvm-svn: 74564	2009-06-30 22:38:32 +00:00
David Goodwin	dbf11ba800	Rename ARMcmpNZ to ARMcmpZ and use it to represent comparisons that set only the Z flag (i.e. eq and ne). Make ARMcmpZ commutative. llvm-svn: 74423	2009-06-29 15:33:01 +00:00
David Goodwin	aa294c5593	Thumb-2 has CLZ. llvm-svn: 74322	2009-06-26 20:47:43 +00:00
Bob Wilson	2e076c4e02	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Evan Cheng	d305869ca2	Add comments. llvm-svn: 73761	2009-06-19 07:06:07 +00:00
Evan Cheng	1592035e67	Should be using Bcc (average) latency to determine if-conversion threshold, not BL. llvm-svn: 73759	2009-06-19 06:56:26 +00:00
Evan Cheng	4e712de541	Latency information for ARM v6. It's rough and not yet hooked up. Right now we are only using branch latency to determine if-conversion limits. llvm-svn: 73747	2009-06-19 01:51:50 +00:00
Evan Cheng	a0ca298f8a	Remove UseThumbBacktraces. Just check if subtarget is darwin. llvm-svn: 73734	2009-06-18 23:14:30 +00:00
Anton Korobeynikov	a8fd40b50a	Address review comments: add 3 ARM calling conventions. Dispatch C calling conv. to one of these conventions based on target triple and subtarget features. llvm-svn: 73530	2009-06-16 18:50:49 +00:00
Anton Korobeynikov	77d1943637	The attached patches implement most of the ARM AAPCS-VFP hard float ABI. The missing piece is support for putting "homogeneous aggregates" into registers. Patch by Sandeep Patel! llvm-svn: 73095	2009-06-08 22:53:56 +00:00
Bob Wilson	ccbc17b3a3	Only 64-bit targets support TImode libcalls. Disable the TImode shift libcalls for ARM. This fixes rdar://6908807. llvm-svn: 72269	2009-05-22 17:38:41 +00:00
Bob Wilson	320d54a2d8	Fix pr4202: Disable CodePlacementOpt for ARM. The ARMConstantIslandPass has to run last because it needs to know the exact size and position of every basic block. Currently CodePlacementOpt is set up to run last. It might be worthwhile to investigate reordering these passes, but for now, let's just make it work. llvm-svn: 72037	2009-05-18 20:55:32 +00:00
Jim Grosbach	06928192ae	Update the names of the exception handling sjlj intrinsics to llvm.eh.sjlj.* for better clarity as to their purpose and scope. Add a description of llvm.eh.sjlj.setjmp to ExceptionHandling.html. (llvm.eh.sjlj.longjmp documentation coming when that implementation is added). llvm-svn: 71758	2009-05-14 00:46:35 +00:00
Evan Cheng	ab0d23396a	Run code placement optimization for targets that want it (arm and x86 for now). llvm-svn: 71726	2009-05-13 21:42:09 +00:00
Jim Grosbach	aeca45dd6f	Add support for GCC compatible builtin setjmp and longjmp intrinsics. This is a supporting preliminary patch for GCC-compatible SjLJ exception handling. Note that these intrinsics are not designed to be invoked directly by the user, but rather used by the front-end as target hooks for exception handling. llvm-svn: 71610	2009-05-12 23:59:14 +00:00
Bob Wilson	0041bd3523	Change LowerCallResult method so that CCValAssign::BCvt can be used with f64 types. This is not used for anything yet. llvm-svn: 70006	2009-04-25 00:33:20 +00:00
Bob Wilson	40e784ce69	Adjust a comment to reflect what the code does. Splitting a 64-bit argument between registers and the stack may be required with the APCS ABI, but it isn't tied to using a particular version of the ARM architecture. llvm-svn: 69978	2009-04-24 17:05:01 +00:00
Bob Wilson	f134b2d212	Fix up some problems with getCopyToReg and getCopyFromReg nodes being chained and "flagged" together. I also made a few changes to handle the chain and flag values more consistently. I found these problems by inspection so I'm not aware of anything that breaks because of them (thus no testcase). llvm-svn: 69977	2009-04-24 17:00:36 +00:00
Bob Wilson	f8b85477ae	Move duplicated AddLiveIn function from X86 and ARM backends to be a method in the MachineFunction class, renaming it to addLiveIn for consistency with the same method in MachineBasicBlock. Thanks for Anton for suggesting this. llvm-svn: 69615	2009-04-20 18:36:57 +00:00
Bob Wilson	b0b10f8bf6	Move the AddLiveIn function definition closer to its uses. llvm-svn: 69382	2009-04-17 20:42:34 +00:00
Bob Wilson	deeaf70dad	Rearrange code to reduce indentation. llvm-svn: 69381	2009-04-17 20:40:45 +00:00
Bob Wilson	ea09d4aca8	Clean up formatting, remove trailing whitespace, fix comment typos and punctuation. No functional changes. llvm-svn: 69378	2009-04-17 20:35:10 +00:00
Bob Wilson	a4c2290e5f	Use CallConvLower.h and TableGen descriptions of the calling conventions for ARM. Patch by Sandeep Patel. llvm-svn: 69371	2009-04-17 19:07:39 +00:00
Bob Wilson	866c174f79	Fix PR3795: Apply Dan's suggested fix for ARMTargetLowering::isLegalAddressingMode. llvm-svn: 68619	2009-04-08 17:55:28 +00:00
Jim Grosbach	fde2110aa9	PR2985 / <rdar://problem/6584986> When compiling in Thumb mode, only the low (R0-R7) registers are available for most instructions. Breaking the low registers into a new register class handles this. Uses of R12, SP, etc, are handled explicitly where needed with copies inserted to move results into low registers where the rest of the code generator can deal with them. llvm-svn: 68545	2009-04-07 20:34:09 +00:00
Bob Wilson	cf1ec2cc68	Fix PR3862: Recognize some ARM-specific constraints for immediates in inline assembly. llvm-svn: 68218	2009-04-01 17:58:54 +00:00
Bob Wilson	dc40d5ae2c	Fix a few more indentation problems and an 80-column violation. llvm-svn: 67416	2009-03-20 23:16:43 +00:00
Bob Wilson	7117a916f5	No functional changes. Fix indentation and whitespace only. llvm-svn: 67412	2009-03-20 22:42:55 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Chris Lattner	4147f08e44	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	ce5dfb692a	ARM isLegalAddressImmediate should check if type is a simple type now that optimizer can create values of funky scalar types. llvm-svn: 66429	2009-03-09 19:15:00 +00:00
Duncan Sands	12da8ce3d2	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dale Johannesen	7647da67ea	Remove refs to non-DebugLoc versions of BuildMI from ARM. llvm-svn: 64429	2009-02-13 02:25:56 +00:00
Dan Gohman	747e55bc9a	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	62fd95d6ec	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	400dc2e2e4	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Dale Johannesen	f08a47bb70	Remove non-DebugLoc forms of CopyToReg and CopyFromReg. Adjust callers. llvm-svn: 63789	2009-02-04 23:02:30 +00:00
Dale Johannesen	021052a705	Remove non-DebugLoc versions of getLoad and getStore. Adjust the many callers of those versions. llvm-svn: 63767	2009-02-04 20:06:27 +00:00
Dale Johannesen	abf66b8343	Add some DL propagation to places that didn't have it yet. More coming. llvm-svn: 63673	2009-02-03 22:26:09 +00:00
Dale Johannesen	555a375bb6	Make LowerCallTo and LowerArguments take a DebugLoc argument. Adjust all callers and overloaded versions. llvm-svn: 63444	2009-01-30 23:10:59 +00:00
Chris Lattner	80b283c1cd	silence a warning when assertions are disabled. llvm-svn: 62976	2009-01-25 23:08:00 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Dan Gohman	02b93136e9	Const-qualify getPreIndexedAddressParts and friends. llvm-svn: 62259	2009-01-15 16:29:45 +00:00
Evan Cheng	2a03c7e977	Re-did 60519. It turns out Darwin's handling of hidden visibility symbols are a bit more complicate than I expected. Both declarations and weak definitions still need a stub indirection. However, the stubs are in data section and they contain the addresses of the actual symbols. llvm-svn: 60571	2008-12-05 01:06:39 +00:00
Bill Wendling	6949f6135b	Temporarily revert r60519. It was causing a bootstrap failure: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -DHAVE_CONFIG_H -I. -I../../../llvm-gcc.src/libgomp -I. -I../../../llvm-gcc.src/libgomp/config/posix -I../../../llvm-gcc.src/libgomp -Wall -pthread -Werror -O2 -g -O2 -MT barrier.lo -MD -MP -MF .deps/barrier.Tpo -c ../../../llvm-gcc.src/libgomp/barrier.c -fno-common -DPIC -o .libs/barrier.o checking for sys/file.h... /var/folders/zG/zGE-ZJOGFiGjv0B5cs5oYE+++TM/-Tmp-//cc34Jg5P.s:13:non-relocatable subtraction expression, "_gomp_tls_key" minus "L1$pb" /var/folders/zG/zGE-ZJOGFiGjv0B5cs5oYE+++TM/-Tmp-//cc34Jg5P.s:13:symbol: "_gomp_tls_key" can't be undefined in a subtraction expression make[4]: * [barrier.lo] Error 1 make[4]: * Waiting for unfinished jobs.... /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -DHAVE_CONFIG_H -I. -I../../../llvm-gcc.src/libgomp -I. -I../../../llvm-gcc.src/libgomp/config/posix -I../../../llvm-gcc.src/libgomp -Wall -pthread -Werror -O2 -g -O2 -MT alloc.lo -MD -MP -MF .deps/alloc.Tpo -c ../../../llvm-gcc.src/libgomp/alloc.c -o alloc.o >/dev/null 2>&1 yes checking for sys/param.h... make[3]: * [all-recursive] Error 1 make[2]: * [all] Error 2 make[1]: * [all-target-libgomp] Error 2 make[1]: * Waiting for unfinished jobs.... llvm-svn: 60527	2008-12-04 04:07:00 +00:00
Evan Cheng	011c4fa8a1	Visibility hidden GVs do not require extra load of symbol address from the GOT or non-lazy-ptr. llvm-svn: 60519	2008-12-04 01:56:50 +00:00
Duncan Sands	3d960941b1	There are no longer any places that require a MERGE_VALUES node with only one operand, so get rid of special code that only existed to handle that possibility. llvm-svn: 60349	2008-12-01 11:41:29 +00:00
Duncan Sands	6ed40141f7	Change the interface to the type legalization method ReplaceNodeResults: rather than returning a node which must have the same number of results as the original node (which means mucking around with MERGE_VALUES, and which is also easy to get wrong since SelectionDAG folding may mean you don't get the node you expect), return the results in a vector. llvm-svn: 60348	2008-12-01 11:39:25 +00:00
Evan Cheng	e3827d9061	Actually ARM / Mac OS X does have UINTTOFP_I64_F{64|32} libcalls. llvm-svn: 58725	2008-11-04 22:19:55 +00:00
Evan Cheng	297b32a367	Custom lower bit_convert i64 -> f64 into FMDRR. This is now happening with legalizetypes. llvm-svn: 58714	2008-11-04 19:57:48 +00:00
Evan Cheng	07d53b1d33	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Dale Johannesen	0e32a2c935	Add "inreg" field to CallSDNode (doesn't increase its size). Adjust various lowering functions to pass this info through from CallInst. Use it to implement sseregparm returns on X86. Remove X86_ssecall calling convention. llvm-svn: 56677	2008-09-26 19:31:26 +00:00
Dale Johannesen	7a74e71489	Make log, log2, log10, exp, exp2 use Expand by default. llvm-svn: 56471	2008-09-22 21:57:32 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	d3fe174c53	Define CallSDNode, an SDNode subclass for use with ISD::CALL. Currently it just holds the calling convention and flags for isVarArgs and isTailCall. And it has several utility methods, which eliminate magic 5+2*i and similar index computations in several places. CallSDNodes are not CSE'd. Teach UpdateNodeOperands to handle nodes that are not CSE'd gracefully. llvm-svn: 56183	2008-09-13 01:54:27 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dan Gohman	4d5b5fe812	Delete an unused variable. llvm-svn: 55915	2008-09-08 16:28:17 +00:00
Dale Johannesen	da2d80688b	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Evan Cheng	c90a11256e	Teach ARM isLegalAddressingMode to handle unknown type without crashing. This fixes pr2589. llvm-svn: 54004	2008-07-25 00:55:17 +00:00
Chris Lattner	9fc580f2d0	add support for returning i128, PR2532. llvm-svn: 53472	2008-07-11 20:53:00 +00:00
Dan Gohman	3b46030375	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Duncan Sands	93e180342a	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	739a0548c4	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Dan Gohman	5c73a886b4	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Duncan Sands	1ae6ef83ee	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	2505d86783	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Evan Cheng	86e476b7cb	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Duncan Sands	d97eea372a	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Evan Cheng	ae2c56d93e	Default ISD::PREFETCH to expand. llvm-svn: 48169	2008-03-10 19:38:10 +00:00
Dale Johannesen	4e622ec86d	Increase ISD::ParamFlags to 64 bits. Increase the ByValSize field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing on ppc32, quietly producing wrong code on x86-32.) llvm-svn: 48122	2008-03-10 02:17:22 +00:00
Evan Cheng	95cf661534	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0|1|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Anton Korobeynikov	40d67c59d5	Remove bunch of gcc 4.3-related warnings from Target llvm-svn: 47369	2008-02-20 11:22:39 +00:00
Andrew Lenharth	fedcf477b5	I cannot find a libgcc function for this builtin. Therefore expanding it to a noop (which is how it used to be treated). If someone who knows the x86 backend better than me could tell me how to get a lock prefix on an instruction, that would be nice to complete x86 support. llvm-svn: 47213	2008-02-16 14:46:26 +00:00
Duncan Sands	4c95dbd69f	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Nate Begeman	53e1b3f9d5	Change how FP immediates are handled. 1) ConstantFP is now expand by default 2) ConstantFP is not turned into TargetConstantFP during Legalize if it is legal. This allows ConstantFP to be handled like Constant, allowing for targets that can encode FP immediates as MachineOperands. As a bonus, fix up Itanium FP constants, which now correctly match, and match more constants! Hooray. llvm-svn: 47121	2008-02-14 08:57:00 +00:00
Dan Gohman	e1d9ee66ed	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Chris Lattner	f6518cf4ab	don't try to avoid inserting loads when lowering FORMAL_ARGUMENTS. DAGCombine is now quite good at zapifying them. llvm-svn: 47053	2008-02-13 07:35:30 +00:00
Dan Gohman	f990faf23b	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Nate Begeman	bcc182f50d	Remove some dead code llvm-svn: 47036	2008-02-12 22:54:40 +00:00
Dan Gohman	2d489b5081	Re-apply the memory operand changes, with a fix for the static initializer problem, a minor tweak to the way the DAGISelEmitter finds load/store nodes, and a renaming of the new PseudoSourceValue objects. llvm-svn: 46827	2008-02-06 22:27:42 +00:00
Nate Begeman	ef14d5f926	Eliminate some redundant code. llvm-svn: 46720	2008-02-04 21:44:06 +00:00
Evan Cheng	27b32b87ed	Revert 46556 and 46585. Dan please fix the PseudoSourceValue problem and re-commit. llvm-svn: 46623	2008-01-31 21:00:00 +00:00
Dan Gohman	3646fdda67	Create a new class, MemOperand, for describing memory references in the backend. Introduce a new SDNode type, MemOperandSDNode, for holding a MemOperand in the SelectionDAG IR, and add a MemOperand list to MachineInstr, and code to manage them. Remove the offset field from SrcValueSDNode; uses of SrcValueSDNode that were using it are all using MemOperandSDNode now. Also, begin updating some getLoad and getStore calls to use the PseudoSourceValue objects. Most of this was written by Florian Brander, some reorganization and updating to TOT by me. llvm-svn: 46585	2008-01-31 00:25:39 +00:00
Evan Cheng	29cfb67e28	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Dan Gohman	47a7d6fafe	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Duncan Sands	95d46ef887	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Chris Lattner	1ea55cf816	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LiveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	f3f4ad9dd6	implement a trivial readme entry. llvm-svn: 44380	2007-11-27 22:36:16 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Bill Wendling	f359fed9f9	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Lauro Ramos Venancio	f6a67bf700	[ARM] Implement __builtin_thread_pointer. llvm-svn: 43892	2007-11-08 17:20:05 +00:00
Rafael Espindola	fa0df55bdd	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Rafael Espindola	419b6d7ce4	Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold and by restructuring the X86 version. Now I just have to move this to a common place :-) llvm-svn: 43554	2007-10-31 14:39:58 +00:00
Rafael Espindola	063f177300	Make ARM and X86 memcpy expansion more similar to each other. Now both subtargets define getMaxInlineSizeThreshold and the expansion uses it. This should not change generated code. llvm-svn: 43552	2007-10-31 11:52:06 +00:00
Evan Cheng	1f2dd35898	Fix memcpy lowering when addresses are 4-byte aligned but size is not multiple of 4. llvm-svn: 43234	2007-10-22 22:11:27 +00:00
Rafael Espindola	18a831d783	split LowerMEMCPY into LowerMEMCPYCall and LowerMEMCPYInline in the ARM backend. llvm-svn: 43176	2007-10-19 14:35:17 +00:00
Chris Lattner	84f3461c49	legalizing the ret operation on f64 shouldn't introduce a new i64 bit convert needlessly. llvm-svn: 43116	2007-10-18 06:17:07 +00:00
Dan Gohman	482732af9d	Set ISD::FPOW to Expand. llvm-svn: 42881	2007-10-11 23:21:31 +00:00
Dan Gohman	a160361c85	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Dale Johannesen	3cf889f75e	Enhance APFloat to retain bits of NaNs (fixes oggenc). Use APFloat interfaces for more references, mostly of ConstantFPSDNode. llvm-svn: 41632	2007-08-31 04:03:46 +00:00
Duncan Sands	644f917358	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Gabor Greif	e16561cd5d	Here is the bulk of the sanitizing. Almost all occurrences of "bytecode" in the sources have been eliminated. llvm-svn: 37913	2007-07-05 17:07:56 +00:00


511 Commits