llvm-project

Commit Graph

Author	SHA1	Message	Date
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Benjamin Kramer	21d75078b5	Remove now unused Context variables. llvm-svn: 77495	2009-07-29 19:14:17 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Jakob Stoklund Olesen	1ae0736830	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Owen Anderson	f945a9ed07	Move a few more convenience factory functions from Constant to LLVMContext. llvm-svn: 75840	2009-07-15 21:51:10 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Jakob Stoklund Olesen	ed0e1a0552	Implement support for promotion of AND/OR/XOR on integer types. The blackfin processor has a legal i16 type, but only logic operations on i32. llvm-svn: 75419	2009-07-12 18:10:18 +00:00
Jakob Stoklund Olesen	6b9f63cafa	Fix types in PromoteNode handling of CTPOP and friends. llvm-svn: 75418	2009-07-12 17:43:20 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
Tilmann Scheller	aea6059ed4	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
David Greene	30048bdb63	This increases the maximum for MVT::LAST_VALUETYPE This change doubles the allowable value for MVT::LAST_VALUETYPE. It does this by doing several things. 1. Introduces MVT::MAX_ALLOWED_LAST_VALUETYPE which in this change has a value of 64. This value contains the current maximum for the MVT::LAST_VALUETYPE. 2. Instead of checking "MVT::LAST_VALUETYPE <= 32", all of those uses now become "MVT::LAST_VALUETYPE <= MVT::MAX_ALLOWED_LAST_VALUETYPE" 3. Changes the dimension of the ValueTypeActions from 2 elements to four elements and adds comments ahead of the declaration indicating the it is "(MVT::MAX_ALLOWED_LAST_VALUETYPE/32) * 2". This at least lets us find what is affected if and when MVT::MAX_ALLOWED_LAST_VALUETYPE gets changed. 4. Adds initializers for the new elements of ValueTypeActions. This does NOT add any types in MVT. That would be done separately. This doubles the size of ValueTypeActions from 64 bits to 128 bits and gives us the freedom to add more types for AVX. llvm-svn: 74110	2009-06-24 19:41:55 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	aee3f62b75	Factor out a couple of helpers. llvm-svn: 72992	2009-06-06 07:04:42 +00:00
Eli Friedman	e1dc193f35	Re-commit r72514 and r72516 with a fixed version of BR_CC lowering. This patch removes some special cases for opcodes and does a bit of cleanup. llvm-svn: 72536	2009-05-28 20:40:34 +00:00
Bill Wendling	f193838d2b	Temporarily revert r72514 (and dependent patch r72516). It was causing this failure during llvm-gcc bootstrap: Assertion failed: (!Tmp2.getNode() && "Can't legalize BR_CC with legal condition!"), function ExpandNode, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 2923. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/libgcc2.c:1727: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. llvm-svn: 72530	2009-05-28 18:18:59 +00:00
Eli Friedman	9b9df77260	Remove a couple of useless functions. llvm-svn: 72516	2009-05-28 04:49:34 +00:00
Eli Friedman	3aa278394e	Remove special cases for more opcodes. This is basically the end of this series of patches for LegalizeDAG; the remaining special cases can't be removed without more infrastructure work. There's a FIXME for each relevant opcode near the beginning of SelectionDAGLegalize::LegalizeOp. llvm-svn: 72514	2009-05-28 04:39:57 +00:00
Eli Friedman	5df7202d3b	Remove special case for SETCC opcode; add some comments explaining why some special cases are necessary. llvm-svn: 72511	2009-05-28 03:56:57 +00:00
Eli Friedman	e1bc3798e6	Some minor cleanups. llvm-svn: 72509	2009-05-28 03:06:16 +00:00
Eli Friedman	ed795153c7	Minor cleanups; add a better explanation for the issue with BUILD_VECTOR. llvm-svn: 72469	2009-05-27 12:42:55 +00:00
Eli Friedman	2892d82378	Remove more special cases for opcodes. llvm-svn: 72468	2009-05-27 12:20:41 +00:00
Eli Friedman	3b251705fd	Remove special cases for more opcodes. llvm-svn: 72467	2009-05-27 07:58:35 +00:00
Eli Friedman	0e49431422	Removing more special cases from LegalizeDAG. llvm-svn: 72465	2009-05-27 07:32:27 +00:00
Eli Friedman	568839681c	Eliminate more special cases for opcodes. llvm-svn: 72464	2009-05-27 07:05:37 +00:00
Eli Friedman	d6f2834496	Remove more special cases from LegalizeDAG. llvm-svn: 72456	2009-05-27 03:33:44 +00:00
Eli Friedman	b3554158c5	Remove unused argument. llvm-svn: 72455	2009-05-27 02:21:29 +00:00
Eli Friedman	a8f9a0261e	Remove more opcode special cases. llvm-svn: 72454	2009-05-27 02:16:40 +00:00
Eli Friedman	21d349b3c5	Start of refactoring LegalizeDAG so that we don't need specialized handling for every single opcode. llvm-svn: 72447	2009-05-27 01:25:56 +00:00
Eli Friedman	4a951bf2ad	Delete a bunch of dead code from LegalizeDAG. llvm-svn: 72414	2009-05-26 08:55:52 +00:00
Eli Friedman	fd8b335ca4	Minor improvement to FCOPYSIGN to use BIT_CONVERT in cases where the corresponding integer type is legal. llvm-svn: 72373	2009-05-24 20:29:11 +00:00
Eli Friedman	fe87034cef	Rewrite ISD::FCOPYSIGN lowering to never use i64. Not really ideal, but it's late, and I don't have any better ideas at the moment. Fixes PR4257. llvm-svn: 72363	2009-05-24 10:21:20 +00:00
Eli Friedman	a4e1675dac	Remove checks of getTypeAction from LegalizeOp; we already assert that all results and all operands are legal, so this change shouldn't affect behavior at all. llvm-svn: 72359	2009-05-24 08:42:01 +00:00
Eli Friedman	5e0d150689	Disable type legalization in LegalizeDAG. This leaves around 4000 lines of dead code; I'll clean that up in subsequent commits. llvm-svn: 72358	2009-05-24 02:46:31 +00:00
Eli Friedman	7badee92ad	Fix a bug in the expansion of EXTRACT_SUBVECTOR in ExpandExtractFromVectorThroughStack. llvm-svn: 72351	2009-05-23 23:03:28 +00:00
Eli Friedman	40afdb63ec	Add a proper implementation of EXTRACT_SUBVECTOR legalization that doesn't split legal vector operands. This is necessary because the type legalization (and therefore, vector splitting) code will be going away soon. llvm-svn: 72349	2009-05-23 22:37:25 +00:00
Jay Foad	7d0479f2c2	Use v.data() instead of &v[0] when SmallVector v might be empty. llvm-svn: 72210	2009-05-21 09:52:38 +00:00
Dan Gohman	9521cadff7	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Anton Korobeynikov	e7a9661f31	Properly expand libcalls for urem / srem. Also make code more straightforward. llvm-svn: 71238	2009-05-08 18:51:08 +00:00
Duncan Sands	2338f6c57e	Add generic expansion of SUB when ADD and XOR are legal. Based on a patch by Micah Villmow. llvm-svn: 71078	2009-05-06 11:29:50 +00:00
Anton Korobeynikov	05b7a7c8f8	Properly handle sdiv / udiv / srem / urem libcalls llvm-svn: 70764	2009-05-03 13:18:16 +00:00
Anton Korobeynikov	f3fc92d6fc	Add libcall expansion for 16 and 128 bit muls llvm-svn: 70749	2009-05-03 13:13:51 +00:00
Bob Wilson	62a3124fb8	Allow CONCAT_VECTORS nodes to be legal or have custom lowering for some targets. Changes to take advantage of this will come later. llvm-svn: 70560	2009-05-01 17:55:32 +00:00
Argyrios Kyrtzidis	a5037484a4	Make DebugLoc independent of DwarfWriter. -Replace DebugLocTuple's Source ID with CompileUnit's GlobalVariable* -Remove DwarfWriter::getOrCreateSourceID -Make necessary changes for the above (fix callsites, etc.) llvm-svn: 70520	2009-04-30 23:22:31 +00:00
Bill Wendling	026e5d7667	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	7e6e352735	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Nate Begeman	5f829d896d	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Bill Wendling	084669a1c9	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Bill Wendling	56f2987a87	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	d0ae15946c	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Duncan Sands	bfa037705e	Now that PR2957 is resolved, remove a bunch of no-longer needed workarounds. llvm-svn: 70234	2009-04-27 19:33:03 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dan Gohman	8aa28b9c34	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Bob Wilson	f6c2195383	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Bob Wilson	f074ca7454	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Dan Gohman	de912e2475	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Dan Gohman	b425feb2aa	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	cc841a3810	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	2e55923fba	For inline asm output operand that matches an input. Encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Bill Wendling	c6869f4695	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Chris Lattner	21cf4bf235	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Evan Cheng	a49de9de2e	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Evan Cheng	86673f2806	Clean up dwarf writer, part 1. This eliminated the horrible recursive getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438	2009-02-25 07:04:34 +00:00
Bill Wendling	786c5973f7	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367	2009-02-24 02:35:30 +00:00
Scott Michel	9d31aca679	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Scott Michel	cf0da6c597	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Bill Wendling	3c50922ea0	--- Merging (from foreign repository) r64714 into '.': U include/llvm/CodeGen/DebugLoc.h U lib/CodeGen/SelectionDAG/LegalizeDAG.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp Enable debug location generation at -Os. This goes with the reapplication of the r63639 patch. llvm-svn: 64715	2009-02-17 01:04:54 +00:00
Bill Wendling	65c0fd4c44	Revert this. It was breaking stuff. llvm-svn: 64428	2009-02-13 02:16:35 +00:00
Bill Wendling	1c21ac3066	Turn off the old way of handling debug information in the code generator. Use the new way, where all of the information is passed on SDNodes and machine instructions. llvm-svn: 64427	2009-02-13 02:01:04 +00:00
Dale Johannesen	9c310711bb	Use getDebugLoc forwarder instead of getNode()->getDebugLoc. No functional change. llvm-svn: 64026	2009-02-07 19:59:05 +00:00
Dale Johannesen	62fd95d6ec	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	dc93bbc4b0	And one more file. llvm-svn: 63971	2009-02-06 21:55:48 +00:00
Dale Johannesen	021052a705	Remove non-DebugLoc versions of getLoad and getStore. Adjust the many callers of those versions. llvm-svn: 63767	2009-02-04 20:06:27 +00:00
Mon P Wang	34650735d0	Avoids generating a legalization assert for the case where a vector type is legal but when legalizing the operation, we split the vector type and generate a library call whose type needs to be promoted. For example, X86 with SSE on but MMX off, a divide v2i64 will be scalarized to 2 calls to a library using i64. llvm-svn: 63760	2009-02-04 19:38:14 +00:00
Dale Johannesen	679073b420	Remove non-DebugLoc forms of the exotic forms of Lod and Sto; patch uses. llvm-svn: 63716	2009-02-04 02:34:38 +00:00
Dale Johannesen	9888edee10	Fill in more omissions in DebugLog propagation. I think that's it for this directory. llvm-svn: 63690	2009-02-04 00:13:36 +00:00
Dale Johannesen	f1163e9a4d	Propagation in TargetLowering. Includes passing a DL into SimplifySetCC which gets called elsewhere. llvm-svn: 63583	2009-02-03 00:47:48 +00:00
Dale Johannesen	72ba6df1a9	Last DebugLoc propagation for this file. llvm-svn: 63574	2009-02-02 23:46:53 +00:00
Dale Johannesen	b5dd922a92	More DebugLoc propagation. This should be everything except LegalizeOp itself. llvm-svn: 63560	2009-02-02 22:49:46 +00:00
Dale Johannesen	a02e45ca19	DebugLoc propagation. ExpandOp and PromoteOp, among others. llvm-svn: 63555	2009-02-02 22:12:50 +00:00
Dale Johannesen	ad00f6e010	More DebugLoc propagation. llvm-svn: 63543	2009-02-02 20:41:04 +00:00
Dale Johannesen	8525d83aac	DebugLoc propagation for int<->fp conversions. llvm-svn: 63537	2009-02-02 19:03:57 +00:00
Duncan Sands	3ed768868d	Fix PR3453 and probably a bunch of other potential crashes or wrong code with codegen of large integers: eliminate the legacy getIntegerVTBitMask and getIntegerVTSignBit methods, which returned their value as a uint64_t, so couldn't handle huge types. llvm-svn: 63494	2009-02-01 18:06:53 +00:00
Duncan Sands	41826036b1	Fix PR3401: when using large integers, the type returned by getShiftAmountTy may be too small to hold shift values (it is an i8 on x86-32). Before and during type legalization, use a large but legal type for shift amounts: getPointerTy; afterwards use getShiftAmountTy, fixing up any shift amounts with a big type during operation legalization. Thanks to Dan for writing the original patch (which I shamelessly pillaged). llvm-svn: 63482	2009-01-31 15:50:11 +00:00
Dale Johannesen	555a375bb6	Make LowerCallTo and LowerArguments take a DebugLoc argument. Adjust all callers and overloaded versions. llvm-svn: 63444	2009-01-30 23:10:59 +00:00
Bill Wendling	8fb81f1b3d	Get rid of the non-DebugLoc-ified getNOT() method. llvm-svn: 63442	2009-01-30 23:03:19 +00:00
Dan Gohman	14d55f0a5c	Explicitly add PseudoSourceValue information when lowering BUILD_VECTOR and conversions to stack operations. llvm-svn: 63333	2009-01-29 21:02:43 +00:00
Dan Gohman	4aa1846215	Make isOperationLegal do what its name suggests, and introduce a new isOperationLegalOrCustom, which does what isOperationLegal previously did. Update a bunch of callers to use isOperationLegalOrCustom instead of isOperationLegal. In some case it wasn't obvious which behavior is desired; when in doubt I changed then to isOperationLegalOrCustom as that preserves their previous behavior. This is for the second half of PR3376. llvm-svn: 63212	2009-01-28 17:46:25 +00:00
Dan Gohman	b3bbde3e62	Use ValueType::bitsLT to simplify some code. llvm-svn: 63170	2009-01-28 03:10:52 +00:00
Dan Gohman	172ad92b29	Use ZERO_EXTEND instead of ANY_EXTEND when promoting shift amounts, to avoid implicitly assuming that target architectures will ignore the high bits. llvm-svn: 63169	2009-01-28 02:58:31 +00:00
Dan Gohman	fb58faf29e	Add an assertion to the form of SelectionDAG::getConstant that takes a uint64_t to verify that the value is in range for the given type, to help catch accidental overflow. Fix a few places that relied on getConstant implicitly truncating the value. llvm-svn: 63128	2009-01-27 20:39:34 +00:00
Nate Begeman	b09b0242ca	Fix an indent and a typo. llvm-svn: 62940	2009-01-24 22:12:48 +00:00
Bob Wilson	c58900504b	Add SelectionDAG::getNOT method to construct bitwise NOT operations, corresponding to the "not" and "vnot" PatFrags. Use the new method in some places where it seems appropriate. llvm-svn: 62768	2009-01-22 17:39:32 +00:00
Scott Michel	ed7d79fce4	CellSPU: - Ensure that (operation) legalization emits proper FDIV libcall when needed. - Fix various bugs encountered during llvm-spu-gcc build, along with various cleanups. - Start supporting double precision comparisons for remaining libgcc2 build. Discovered interesting DAGCombiner feature, which is currently solved via custom lowering (64-bit constants are not legal on CellSPU, but DAGCombiner insists on inserting one anyway.) - Update README. llvm-svn: 62664	2009-01-21 04:58:48 +00:00
Dan Gohman	91febd1330	More consts on TargetLowering references. llvm-svn: 62262	2009-01-15 16:58:17 +00:00
Dan Gohman	4bdf021e05	Use const with TargetLowering references in a few more places. llvm-svn: 62260	2009-01-15 16:43:02 +00:00
Devang Patel	5c6e1e3b7d	Use DebugInfo interface to lower dbg_* intrinsics. llvm-svn: 62127	2009-01-13 00:35:13 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Duncan Sands	8feb694e8f	Fix PR3274: when promoting the condition of a BRCOND node, promote from i1 all the way up to the canonical SetCC type. In order to discover an appropriate type to use, pass MVT::Other to getSetCCResultType. In order to be able to do this, change getSetCCResultType to take a type as an argument, not a value (this is also more logical). llvm-svn: 61542	2009-01-01 15:52:00 +00:00
Scott Michel	0c9259f149	Teach LeaglizeDAG that i64 mul can be a libcall. llvm-svn: 61463	2008-12-29 03:21:37 +00:00
Dan Gohman	12f2490489	Clean up the atomic opcodes in SelectionDAG. This removes all the _8, _16, _32, and _64 opcodes and replaces each group with an unsuffixed opcode. The MemoryVT field of the AtomicSDNode is now used to carry the size information. In tablegen, the size-specific opcodes are replaced by size-independent opcodes that utilize the ability to compose them with predicates. This shrinks the per-opcode tables and makes the code that handles atomics much more concise. llvm-svn: 61389	2008-12-23 21:37:04 +00:00
Mon P Wang	a501640ffa	Added support for vector widening. llvm-svn: 61209	2008-12-18 20:03:17 +00:00
Mon P Wang	015a7f57b2	Fix expansion of vsetcc to set the high bit for true instead of 1. llvm-svn: 61129	2008-12-17 08:49:47 +00:00
Duncan Sands	f312dc7729	Reapply r60997, this time without forgetting that target constants are allowed to have an illegal type. llvm-svn: 61006	2008-12-14 09:43:15 +00:00
Bill Wendling	e5af6f1990	Temporarily revert r60997. It was causing this failure: Running /Users/void/llvm/llvm.src/test/CodeGen/Generic/dg.exp ... FAIL: /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll Failed with exit(1) at line 1 while running: llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll \| llc \| /usr/bin/grep 68719476738 Assertion failed: ((TypesNeedLegalizing \|\| getTypeAction(VT) == Legal) && "Illegal type introduced after type legalization?"), function HandleOp, file /Users/void/llvm/llvm.src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 493. 0 llc 0x0085392e char const* std::find<char const, char>(char const, char const, char const&) + 98 1 llc 0x00853e63 llvm::sys::PrintStackTraceOnErrorSignal() + 593 2 libSystem.B.dylib 0x96cac09b _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1765097359 4 libSystem.B.dylib 0x96d24ec2 raise + 26 5 libSystem.B.dylib 0x96d3447f abort + 73 6 libSystem.B.dylib 0x96d26063 __assert_rtn + 101 7 llc 0x004f9018 llvm::cast_retty<llvm::SubprogramDesc, llvm::DebugInfoDesc>::ret_type llvm::cast<llvm::Sub ... llvm-svn: 61001	2008-12-13 23:53:00 +00:00
Duncan Sands	24092271cc	LegalizeDAG is not supposed to introduce illegal types into the DAG if they were not already there. Check this with an assertion. llvm-svn: 60997	2008-12-13 22:33:38 +00:00
Mon P Wang	f95bd2078d	Added basic support for expanding VSETCC llvm-svn: 60974	2008-12-13 08:15:14 +00:00
Duncan Sands	b6f09933c0	On big-endian machines it is wrong to do a full width register load followed by a truncating store for the copy, since the load will not place the value in the lower bits. Probably partial loads/stores can never happen here, but fix it anyway. llvm-svn: 60972	2008-12-13 07:18:38 +00:00
Duncan Sands	8f352fe100	When expanding unaligned loads and stores do not make use of illegal integer types: instead, use a stack slot and copying via integer registers. The existing code is still used if the bitconvert is to a legal integer type. This fires on the PPC testcases 2007-09-08-unaligned.ll and vec_misaligned.ll. It looks like equivalent code is generated with these changes, just permuted, but it's hard to tell. With these changes, nothing in LegalizeDAG produces illegal integer types anymore. This is a prerequisite for removing the LegalizeDAG type legalization code. While there I noticed that the existing code doesn't handle trunc store of f64 to f32: it turns this into an i64 store, which represents a 4 byte stack smash. I added a FIXME about this. Hopefully someone more motivated than I am will take care of it. llvm-svn: 60964	2008-12-12 21:47:02 +00:00
Evan Cheng	3270a1dec3	Fix add/sub expansion: don't create ADD / SUB with two results (seems like everyone is doing this these days :-). Patch by Daniel M Gessel! llvm-svn: 60958	2008-12-12 18:49:09 +00:00
Duncan Sands	e4bcb8e2dd	When using a 4 byte jump table on a 64 bit machine, do an extending load of the 4 bytes rather than a potentially illegal (type) i32 load followed by a sign extend. llvm-svn: 60945	2008-12-12 08:13:38 +00:00
Mon P Wang	9c2d26d208	Added support for SELECT v8i8 v4i16 for X86 (MMX) Added support for TRUNC v8i16 to v8i8 for X86 (MMX) llvm-svn: 60916	2008-12-12 01:25:51 +00:00
Mon P Wang	c68b3c4fc1	Whitespace clean up (tabs with spaces) llvm-svn: 60866	2008-12-11 00:44:22 +00:00
Bill Wendling	f482f379ef	Whitespace changes. llvm-svn: 60826	2008-12-10 02:01:32 +00:00
Bill Wendling	db8ec2d75a	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Mon P Wang	8a5366332f	In LegalizeOp, don't change the result type of CONVERT_RNDSAT when promoting one of its operand. llvm-svn: 60749	2008-12-09 07:27:39 +00:00
Mon P Wang	4dd832d241	Fix getNode to allow a vector for the shift amount for shifts of vectors. Fix the shift amount when unrolling a vector shift into scalar shifts. Fix problem in getShuffleScalarElt where it assumes that the input of a bit convert must be a vector. llvm-svn: 60740	2008-12-09 05:46:39 +00:00
Scott Michel	9b0b28e021	Non-functional change: make custom lowering for truncate stylistically consistent with the way it's generally done in other places. llvm-svn: 60439	2008-12-02 19:55:08 +00:00
Tilmann Scheller	318ccb0e62	make it possible to custom lower TRUNCATE (needed for the CellSPU target) llvm-svn: 60409	2008-12-02 12:12:25 +00:00
Mon P Wang	6e1c6ad127	Removed some unnecessary code in widening. llvm-svn: 60406	2008-12-02 07:35:08 +00:00
Duncan Sands	3d960941b1	There are no longer any places that require a MERGE_VALUES node with only one operand, so get rid of special code that only existed to handle that possibility. llvm-svn: 60349	2008-12-01 11:41:29 +00:00
Duncan Sands	6ed40141f7	Change the interface to the type legalization method ReplaceNodeResults: rather than returning a node which must have the same number of results as the original node (which means mucking around with MERGE_VALUES, and which is also easy to get wrong since SelectionDAG folding may mean you don't get the node you expect), return the results in a vector. llvm-svn: 60348	2008-12-01 11:39:25 +00:00
Sanjiv Gupta	7ae1a84465	Removing redundant semicolons. No functionality change. llvm-svn: 60149	2008-11-27 05:58:04 +00:00
Sanjiv Gupta	80810f8c6b	Allow custom lowering of ADDE/ADDC/SUBE/SUBC operations. llvm-svn: 60102	2008-11-26 11:19:00 +00:00
Bill Wendling	b4ff5322c1	A simplification for checking whether the signs of the operands and sum differ. Thanks, Duncan. llvm-svn: 60043	2008-11-25 19:40:17 +00:00
Bill Wendling	bf592fccd4	Now with the correct type for the 0. llvm-svn: 60016	2008-11-25 08:19:22 +00:00
Bill Wendling	d06c625b95	Get rid of unused variable. llvm-svn: 60015	2008-11-25 08:13:20 +00:00
Bill Wendling	4498b47677	Hacker's Delight says, "Signed integer overflow of addition occurs if and only if the operands have the same sign and the sum has sign opposite to that of the operands." llvm-svn: 60014	2008-11-25 08:12:19 +00:00
Bill Wendling	66835479d7	- Make lowering of "add with overflow" customizable by back-ends. - Mark "add with overflow" as having a custom lowering for X86. Give it a null lowering representation for now. llvm-svn: 59971	2008-11-24 19:21:46 +00:00
Evan Cheng	a8fd1f2c8e	Eliminate some unused variable compile time warnings. llvm-svn: 59952	2008-11-24 07:09:49 +00:00
Bill Wendling	2278f8f5e1	Add support for llvm.uadd.with.overflow. llvm-svn: 59926	2008-11-24 01:38:29 +00:00
Bill Wendling	be8e7f851c	- Move conversion of [SU]ADDO from DAG combiner into legalizer. - Add "promote integer type" stuff to the legalizer for these nodes. llvm-svn: 59847	2008-11-22 00:22:52 +00:00
Mon P Wang	f414cbc1fd	Add missing widen operations, fixed widening for extracting a subvector, and when loading/storing a widen vector, make sure that they are loaded and stored in consecutive order. llvm-svn: 59357	2008-11-15 06:05:52 +00:00
Mon P Wang	58fb9135e2	Added CONVERT_RNDSAT (conversion with rounding and saturation) SDNode to support targets that support these conversions. Users should avoid using this node as the current targets don't generating code for it. llvm-svn: 59001	2008-11-10 20:54:11 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Dale Johannesen	160be0ffda	Make FP tests requiring two compares work on PPC (PR 642). This is Chris' patch from the PR, modified to realize that SETUGT/SETULT occur legitimately with integers, plus two fixes in LegalizeDAG to pass a valid result type into LegalizeSetCC. The argument of TLI.getSetCCResultType is ignored on PPC, but I think I'm following usage elsewhere. llvm-svn: 58871	2008-11-07 22:54:33 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Mon P Wang	9a8d60a7c0	Widening cleanup llvm-svn: 58796	2008-11-06 05:31:54 +00:00
Dale Johannesen	db6b956585	80 columns llvm-svn: 58717	2008-11-04 20:52:49 +00:00
Duncan Sands	0207a3f897	Make VAARG work with x86 long double (which is 10 bytes long, but is passed in 12/16 bytes). llvm-svn: 58608	2008-11-03 11:51:11 +00:00
Mon P Wang	01b8a5a967	Add missing vsetcc expansion for widening llvm-svn: 58443	2008-10-30 18:21:52 +00:00
Mon P Wang	58c3794c27	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Dale Johannesen	28929589e7	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	97d3f6cfe3	Make the NaN test come second, heuristically assuming that NaNs are less common. llvm-svn: 57871	2008-10-21 03:12:54 +00:00
Evan Cheng	3b0f5e4d61	- Add target lowering hooks that specify which setcc conditions are illegal, i.e. conditions that cannot be checked with a single instruction. For example, SETONE and SETUEQ on x86. - Teach legalizer to implement illegal setcc as a and / or of a number of legal setcc nodes. For now, only implement FP conditions. e.g. SETONE is implemented as SETO & SETNE, SETUEQ is SETUO \| SETEQ. - Move x86 target over. llvm-svn: 57542	2008-10-15 02:05:31 +00:00
Evan Cheng	07d53b1d33	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Evan Cheng	da9b752883	FIX PR2794. Make sure SIGN_EXTEND_INREG nodes introduced by LegalizeSetCCOperands are leglized. Patch by Richard Pennington. llvm-svn: 57460	2008-10-13 18:46:18 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Dale Johannesen	54306fe499	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Andrew Lenharth	21dca9cbb1	Use Dan's supperior check llvm-svn: 57255	2008-10-07 18:27:23 +00:00
Andrew Lenharth	d69bdaef64	No need for \|= llvm-svn: 57249	2008-10-07 17:11:29 +00:00
Andrew Lenharth	6d409f08be	Use ADDC if it is valid at any smaller size. Do it right this time llvm-svn: 57248	2008-10-07 17:09:16 +00:00
Andrew Lenharth	6606f17e50	Use ADDC if it is valid at any smaller size. fixes test/Codegen/Generic/i128-addsub.ll on x86 llvm-svn: 57247	2008-10-07 17:03:15 +00:00
Andrew Lenharth	3a9be150be	Expand arith on machines without carry flags llvm-svn: 57243	2008-10-07 14:15:42 +00:00
Chris Lattner	2416896b3c	wrap some long lines and expand i32 mul's to libcalls, inspired by a patch by Mikael Lepisto! llvm-svn: 57077	2008-10-04 21:27:46 +00:00
Dale Johannesen	5d60c1ebb1	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dale Johannesen	867d549fce	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	3a293e7404	Fix typos in comments. llvm-svn: 56919	2008-10-01 15:07:49 +00:00
Dan Gohman	86aa16a69a	Optimize SelectionDAG's AssignTopologicalOrder even further. Completely eliminate the TopOrder std::vector. Instead, sort the AllNodes list in place. This also eliminates the need to call AllNodes.size(), a linear-time operation, before performing the sort. Also, eliminate the Sources temporary std::vector, since it essentially duplicates the sorted result as it is being built. This also changes the direction of the topological sort from bottom-up to top-down. The AllNodes list starts out in roughly top-down order, so this reduces the amount of reordering needed. Top-down is also more convenient for Legalize, and ISel needed only minor adjustments. llvm-svn: 56867	2008-09-30 18:30:35 +00:00
Dale Johannesen	f61a84ec43	Remove misuse of ReplaceNodeResults for atomics with valid types. No functional change. llvm-svn: 56808	2008-09-29 22:25:26 +00:00
Dale Johannesen	0e32a2c935	Add "inreg" field to CallSDNode (doesn't increase its size). Adjust various lowering functions to pass this info through from CallInst. Use it to implement sseregparm returns on X86. Remove X86_ssecall calling convention. llvm-svn: 56677	2008-09-26 19:31:26 +00:00
Richard Pennington	4b35e64504	bug 2812: Segmentation fault on a big emdiam processor. llvm-svn: 56609	2008-09-25 16:15:10 +00:00
Dan Gohman	e2947e1e07	Fix the alignment of loads from constant pool entries when the load address has an offset from the base of the constant pool entry. llvm-svn: 56479	2008-09-22 22:40:08 +00:00
Dan Gohman	64d6c6fe30	Change SelectionDAG::getConstantPool to always set the alignment of the ConstantPoolSDNode, using the target's preferred alignment for the constant type. In LegalizeDAG, when performing loads from the constant pool, the ConstantPoolSDNode's alignment is used in the calls to getLoad and getExtLoad. This change prevents SelectionDAG::getLoad/getExtLoad from incorrectly choosing the ABI alignment for constant pool loads when Alignment == 0. The incorrect alignment is only a performance issue when ABI alignment does not equal preferred alignment (i.e., on x86 it was generating MOVUPS instead of MOVAPS for v4f32 constant loads when the default ABI alignment for 128bit vectors is forced to 1 byte.) Patch by Paul Redmond! llvm-svn: 56253	2008-09-16 22:05:41 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	ec270fb640	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweight by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	baf6762e26	The sequence for ppcf128 compares was not IEEE safe in the presence of NaNs. llvm-svn: 56136	2008-09-12 00:30:56 +00:00
Evan Cheng	0fff397a13	A few more places where FPOW is being ignored. llvm-svn: 56032	2008-09-09 23:35:53 +00:00
Evan Cheng	f4e5de4583	Legalizer was missing code that expand fpow to a libcall. llvm-svn: 56028	2008-09-09 23:02:14 +00:00
Dale Johannesen	da2d80688b	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Dale Johannesen	41be0d4445	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes looks the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Dan Gohman	d56f73f2f2	Optimize SelectionDAG's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, use the SelectionDAG's topological sort in LegalizeDAG instead of having a separate implementation. llvm-svn: 55389	2008-08-26 21:42:18 +00:00
Dan Gohman	2af34bd309	Add libcalls for the new rounding opcodes. llvm-svn: 55133	2008-08-21 18:38:14 +00:00
Dan Gohman	c6337ac069	Add libm-oriented ISD opcodes for rounding operations. llvm-svn: 55130	2008-08-21 17:55:02 +00:00
Dan Gohman	550c9af91f	Improve support for vector casts in LLVM IR and CodeGen. llvm-svn: 54784	2008-08-14 20:04:46 +00:00
Nate Begeman	82f1925708	Fix broken CellSPU lowering, re-instate braces in Legalize llvm-svn: 54168	2008-07-29 19:07:27 +00:00
Nate Begeman	d63495ff25	Disable a fix in the previous patch, since it breaks CellSPU. The CellSPU codegen is broken, but needs to be fixed before we can put this back in. llvm-svn: 54164	2008-07-29 18:28:31 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	91e5dcb680	Tidy SDNode::use_iterator, and complete the transition to have it parallel its analogue, Value::value_use_iterator. The operator* method now returns the user, rather than the use. llvm-svn: 54127	2008-07-27 20:43:25 +00:00
Mon P Wang	7334350d31	When splitting a vector shuffle, fixed which type we used for the hi part llvm-svn: 54007	2008-07-25 01:30:26 +00:00
Dan Gohman	581cc87f57	Add titles to the various SelectionDAG viewGraph calls that include useful information like the name of the block being viewed and the current phase of compilation. llvm-svn: 53872	2008-07-21 20:00:07 +00:00
Duncan Sands	b0e3938651	Add VerifyNode, a place to put sanity checks on generic SDNode's (nodes with their own constructors should do sanity checking in the constructor). Add sanity checks for BUILD_VECTOR and fix all the places that were producing bogus BUILD_VECTORs, as found by "make check". My favorite is the BUILD_VECTOR with only two operands that was being used to build a vector with four elements! llvm-svn: 53850	2008-07-21 10:20:31 +00:00
Duncan Sands	77a3d05f1e	Factorize some code for determining which libcall to use. llvm-svn: 53713	2008-07-17 02:36:29 +00:00
Mon P Wang	97432f4f1b	Fixed potential bug if the source and target of a bit convert have different alignment llvm-svn: 53590	2008-07-15 05:28:34 +00:00
Dan Gohman	02c7c6cb33	Include a frame index in the "fixed stack" pseudo source value instead of using the frame index for the SVOffset, which was inconsistent. llvm-svn: 53486	2008-07-11 22:44:52 +00:00
Chris Lattner	87909d0629	Fix a bug in the soft-float handling of FCOPYSIGN that Duncan noticed when working on legalizetypes. Both legalizetypes and legalizeops now produce hte same code for CodeGen/ARM/fcopysign.ll. llvm-svn: 53435	2008-07-10 23:46:13 +00:00
Duncan Sands	5e6d1402c2	Add a mysteriously missing libcall, FPTOSINT_F80_I32. Be nice to 16 bit machines by supporting FP_TO_XINT expansion for these. llvm-svn: 53407	2008-07-10 15:33:02 +00:00
Evan Cheng	34ef1db87c	Do not CSE DEBUG_LOC, DBG_LABEL, DBG_STOPPOINT, DECLARE, and EH_LABEL SDNode's. This improves compile time slightly at -O0 -g. llvm-svn: 53246	2008-07-08 20:06:39 +00:00
Dan Gohman	56e3f63ec5	Add explicit keywords. llvm-svn: 53179	2008-07-07 18:00:37 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Evan Cheng	d8b83e1292	LegalizeSetCCOperands should legalize the result of ExpandLibCall. Patch by Richard Osborne. llvm-svn: 53169	2008-07-07 07:18:09 +00:00
Mon P Wang	5c755ff51b	Fixed generating incorrect aligned stores that I backout of r53031 that fixed problems in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53150	2008-07-05 20:40:31 +00:00
Duncan Sands	93e180342a	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Evan Cheng	fad8be450d	Backed out 53031. llvm-svn: 53110	2008-07-03 18:20:14 +00:00
Duncan Sands	739a0548c4	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Mon P Wang	4b7c1acf26	Fixed problem in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53031	2008-07-02 17:07:12 +00:00
Evan Cheng	4c609abd90	Eliminate a compile time warning. llvm-svn: 52982	2008-07-01 21:35:46 +00:00
Dan Gohman	fb19f9402b	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Dan Gohman	5c73a886b4	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Duncan Sands	1ae6ef83ee	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Chris Lattner	9d3740ed1c	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Dan Gohman	aa01afd47c	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Duncan Sands	5fb92e58de	Make custom lowering of ADD work correctly. This fixes PR2476; patch by Richard Osborne. The same problem exists for a bunch of other operators, but I'm ignoring this because they will be automagically fixed when the new LegalizeTypes infrastructure lands, since it already solves this problem centrally. llvm-svn: 52610	2008-06-22 09:42:16 +00:00
Dan Gohman	3792c470d5	Clean up some uses of std::distance, now that we have allnodes_size. llvm-svn: 52545	2008-06-20 17:15:19 +00:00
Evan Cheng	be0429c558	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Scott Michel	a7d8649f78	Fix spellnig error llvm-svn: 51917	2008-06-03 19:13:20 +00:00
Scott Michel	d831cc49e5	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Dan Gohman	8807147ada	Remove an unused variable. llvm-svn: 51721	2008-05-30 00:56:36 +00:00
Evan Cheng	9ac3631fa3	If the result of a BIT_CONVERT is a v1* vector, it doesn't mean its source is a v1* vector. llvm-svn: 51192	2008-05-16 17:19:05 +00:00
Nate Begeman	f79f52282c	Actually scalarize the operand to BIT_CONVERT instead of asking someone to do something with a v1 type. llvm-svn: 51160	2008-05-15 20:40:58 +00:00
Dan Gohman	fd3e3003f3	Whitespace cleanups. llvm-svn: 51089	2008-05-14 00:43:10 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Dan Gohman	ecb77385ab	Fix a missing break in the ISD::FLT_ROUNDS_ handling. Patch by giuma! llvm-svn: 50967	2008-05-12 16:07:15 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Scott Michel	be940424b3	Fix custom target lowering for zero/any/sign_extend: make sure that DAG.UpdateNodeOperands() is called before (not after) the call to TLI.LowerOperation(). llvm-svn: 50461	2008-04-30 00:26:38 +00:00
Nate Begeman	6f94f61317	Pull the code to perform an INSERT_VECTOR_ELT in memory out into its own function, and then use it to fix a bug in SplitVectorOp that expected inserts to always have constant insertion indices. llvm-svn: 50273	2008-04-25 18:07:40 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Dan Gohman	9752a8f3b4	Correct the SrcValue information in the Expand code for va_copy. llvm-svn: 49839	2008-04-17 02:09:26 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Duncan Sands	844d55a42a	Factor some libcall code. llvm-svn: 49583	2008-04-12 17:14:18 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Roman Levenstein	51f532f92d	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Evan Cheng	025cea1126	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Evan Cheng	0bd72c5ccd	More soft fp fixes. llvm-svn: 49016	2008-04-01 02:18:22 +00:00
Evan Cheng	4cabe4b452	Pasto. llvm-svn: 49014	2008-04-01 02:00:09 +00:00
Evan Cheng	611abc03ed	Add comment. llvm-svn: 49013	2008-04-01 01:51:26 +00:00
Evan Cheng	86e476b7cb	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Roman Levenstein	358e04a185	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Duncan Sands	d97eea372a	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Dale Johannesen	12c76db312	Make conversions of i8/i16 to ppcf128 work. llvm-svn: 48493	2008-03-18 17:28:38 +00:00
Nate Begeman	63eb03f800	Tabs -> spaces Use getIntPtrConstant in a couple places to shorten stuff up Handle splitting vector shuffles with undefs in the mask llvm-svn: 48351	2008-03-14 00:53:31 +00:00
Dan Gohman	b72127ac4c	More APInt-ification. llvm-svn: 48344	2008-03-13 22:13:53 +00:00
Dan Gohman	d6819da453	Generalize ExpandIntToFP to handle the case where the operand is legal and it's the result that requires expansion. This code is a little confusing because the TargetLoweringInfo tables for [US]INT_TO_FP use the operand type (the integer type) rather than the result type. llvm-svn: 48206	2008-03-11 01:59:03 +00:00
Dan Gohman	10f7d850cf	More APInt-ification. llvm-svn: 48201	2008-03-11 00:11:06 +00:00
Dan Gohman	f4300950f1	Implement more support for fp-to-i128 and i128-to-fp conversions. llvm-svn: 48189	2008-03-10 23:03:31 +00:00
Dan Gohman	272e234477	Fix mul expansion to check the correct number of bits for zero extension when checking if an unsigned multiply is safe. llvm-svn: 48171	2008-03-10 20:42:19 +00:00
Scott Michel	a6729e8666	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Chris Lattner	322c826c9d	Fix two problems in SelectionDAGLegalize::ExpandBUILD_VECTOR's handling of BUILD_VECTORS that only have two unique elements: 1. The previous code was nondeterminstic, because it walked a map in SDOperand order, which isn't determinstic. 2. The previous code didn't handle the case when one element was undef very well. Now we ensure that the generated shuffle mask has the undef vector on the RHS (instead of potentially being on the LHS) and that any elements that refer to it are themselves undef. This allows us to compile CodeGen/X86/vec_set-9.ll into: _test3: movd %rdi, %xmm0 punpcklqdq %xmm0, %xmm0 ret instead of: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret ... saving a register. llvm-svn: 48060	2008-03-09 00:29:42 +00:00
Evan Cheng	95cf661534	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Dale Johannesen	8ee39c61f2	Clarify that CALLSEQ_START..END may not be nested, and add some protection against creating such. llvm-svn: 47957	2008-03-05 19:14:03 +00:00
Chris Lattner	3dc3899007	Improve comment, pass in the original VT so that we can shrink a long double constant all the way to float, not stopping at double. llvm-svn: 47937	2008-03-05 06:46:58 +00:00
Dan Gohman	da7897c4e1	Codegen support for i128 UINT_TO_FP. This just fixes a bug in r47928 (Int64Ty is the correct type for the constant pool entry here) and removes the asserts, now that the code is capable of handling i128. llvm-svn: 47932	2008-03-05 02:07:31 +00:00
Evan Cheng	0a62cb44ce	Add a target lowering hook to control whether it's worthwhile to compress fp constant. For x86, if sse2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constant since fldt is very expensive. llvm-svn: 47931	2008-03-05 01:30:59 +00:00
Andrew Lenharth	357061a74d	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Dan Gohman	d9d874b0cd	Codegen support for i128 SINT_TO_FP. llvm-svn: 47928	2008-03-05 01:08:17 +00:00
Evan Cheng	38caf77419	Refactor ExpandConstantFP so it can optimize load from constpool of types larger than f64 into extload from smaller types. llvm-svn: 47883	2008-03-04 08:05:30 +00:00
Dan Gohman	f2bbfa3ba0	More APInt-ification. llvm-svn: 47864	2008-03-03 22:20:46 +00:00
Andrew Lenharth	d032c33300	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Dale Johannesen	208cc8f1b9	Add MVT::is128BitVector and is64BitVector. Shrink unaligned load/store code using them. Per review of unaligned load/store vector patch. llvm-svn: 47782	2008-03-01 03:40:57 +00:00
Dan Gohman	837a6dccd7	Use the new convertFromAPInt instead of convertFromZeroExtendedInteger, which allows more of the surrounding arithmetic to be done with APInt instead of uint64_t. llvm-svn: 47745	2008-02-29 01:44:25 +00:00
Dale Johannesen	c4c3de2b52	Fix an assertion message. llvm-svn: 47722	2008-02-28 18:36:51 +00:00
Chris Lattner	9824ffef0c	implement expand for ISD::DECLARE by just deleting it. llvm-svn: 47708	2008-02-28 05:53:40 +00:00
Dale Johannesen	bf76a08e7c	Handle load/store of misaligned vectors that are the same size as an int type by doing a bitconvert of load/store of the int type (same algorithm as floating point). This makes them work for ppc Altivec. There was some code that purported to handle loads of (some) vectors by splitting them into two smaller vectors, but getExtLoad rejects subvector loads, so this could never have worked; the patch removes it. llvm-svn: 47696	2008-02-27 22:36:00 +00:00
Dan Gohman	e5e32ec8f7	Remove the `else', at Evan's insistence. llvm-svn: 47686	2008-02-27 19:44:57 +00:00
Duncan Sands	96658d0189	Support for legalizing MEMBARRIER. llvm-svn: 47667	2008-02-27 08:53:44 +00:00
Dan Gohman	66272a545b	Teach Legalize how to expand an EXTRACT_ELEMENT. llvm-svn: 47656	2008-02-27 01:52:30 +00:00
Dan Gohman	432e4a6742	Make some static variables const. llvm-svn: 47566	2008-02-25 21:39:34 +00:00
Dan Gohman	1f372edd97	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Dan Gohman	f3057a939d	Fix a regression in 403.gcc and 186.crafty introduced in 47383. To test that a value is >= 32, check that all of the high bits are zero, not just one or more. llvm-svn: 47467	2008-02-22 01:12:31 +00:00
Andrew Lenharth	95528943e9	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Dan Gohman	34fc7dbf5b	Convert Legalize to use the APInt form of ComputeMaskedBits. llvm-svn: 47383	2008-02-20 16:57:27 +00:00
Andrew Lenharth	fedcf477b5	I cannot find a libgcc function for this builtin. Therefor expanding it to a noop (which is how it use to be treated). If someone who knows the x86 backend better than me could tell me how to get a lock prefix on an instruction, that would be nice to complete x86 support. llvm-svn: 47213	2008-02-16 14:46:26 +00:00
Andrew Lenharth	9b254eed32	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Scott Michel	a3cefeaf0c	Make tblgen a little smarter about constants smaller than i32. Currently, tblgen will complain if a sign-extended constant does not fit into a data type smaller than i32, e.g., i16. This causes a problem when certain hex constants are used, such as 0xff for byte masks or immediate xor values. tblgen will try the sign-extended value first and, if the sign extended value would overflow, it tries to see if the unsigned value will fit. Consequently, a software developer can now safely incant: (XORHIr16 R16C:$rA, 0xffff) which is somewhat clearer and more informative than incanting: (XORHIr16 R16C:$rA, (i16 -1)) even if the two are bitwise equivalent. Tblgen also outputs the 64-bit unsigned constant in the generated ISel code when getTargetConstant() is invoked. llvm-svn: 47188	2008-02-15 23:05:48 +00:00
Dan Gohman	a36ade5595	Use StoreSDNode::getValue instead of calling getOperand directly with a hard-coded operand number. llvm-svn: 47163	2008-02-15 18:11:59 +00:00
Duncan Sands	4c95dbd69f	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Nate Begeman	53e1b3f9d5	Change how FP immediates are handled. 1) ConstantFP is now expand by default 2) ConstantFP is not turned into TargetConstantFP during Legalize if it is legal. This allows ConstantFP to be handled like Constant, allowing for targets that can encode FP immediates as MachineOperands. As a bonus, fix up Itanium FP constants, which now correctly match, and match more constants! Hooray. llvm-svn: 47121	2008-02-14 08:57:00 +00:00
Nate Begeman	735ab3ce67	Support legalizing insert_vector_elt on targets where the element type is not legal. llvm-svn: 47048	2008-02-13 06:43:04 +00:00
Dan Gohman	54d3b5a1f5	From Chris' review: use cast instead of dyn_cast with an assert. llvm-svn: 46962	2008-02-11 18:58:42 +00:00
Duncan Sands	7377f5fbe3	Add a isBigEndian method to complement isLittleEndian. llvm-svn: 46954	2008-02-11 10:37:04 +00:00
Dan Gohman	16d4bc3dc0	Follow Chris' suggestion; change the PseudoSourceValue accessors to return pointers instead of references, since this is always what is needed. llvm-svn: 46857	2008-02-07 18:41:25 +00:00
Dan Gohman	2d489b5081	Re-apply the memory operand changes, with a fix for the static initializer problem, a minor tweak to the way the DAGISelEmitter finds load/store nodes, and a renaming of the new PseudoSourceValue objects. llvm-svn: 46827	2008-02-06 22:27:42 +00:00
Evan Cheng	efd142a920	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	263070ea2b	Rename RecordLabel to RecordSourceLine because that's what it is doing. llvm-svn: 46628	2008-02-01 02:05:57 +00:00
Evan Cheng	27b32b87ed	Revert 46556 and 46585. Dan please fix the PseudoSourceValue problem and re-commit. llvm-svn: 46623	2008-01-31 21:00:00 +00:00
Evan Cheng	1c6c16ea11	Add an extra operand to LABEL nodes which distinguishes between debug, EH, or misc labels. This fixes the EH breakage. However I am not convinced this is the solution. llvm-svn: 46609	2008-01-31 09:59:15 +00:00
Dan Gohman	9ba4d76816	Rename ISD::FLT_ROUNDS to ISD::FLT_ROUNDS_ to avoid conflicting with the real FLT_ROUNDS (defined in <float.h>). llvm-svn: 46587	2008-01-31 00:41:03 +00:00
Dan Gohman	3646fdda67	Create a new class, MemOperand, for describing memory references in the backend. Introduce a new SDNode type, MemOperandSDNode, for holding a MemOperand in the SelectionDAG IR, and add a MemOperand list to MachineInstr, and code to manage them. Remove the offset field from SrcValueSDNode; uses of SrcValueSDNode that were using it are all all using MemOperandSDNode now. Also, begin updating some getLoad and getStore calls to use the PseudoSourceValue objects. Most of this was written by Florian Brander, some reorganization and updating to TOT by me. llvm-svn: 46585	2008-01-31 00:25:39 +00:00
Dan Gohman	47a7d6fafe	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Nate Begeman	ef33767efb	Properly expand extract-element for non-power-of-2 codegen llvm-svn: 46486	2008-01-29 02:24:00 +00:00
Duncan Sands	95d46ef887	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Duncan Sands	88de26cffb	The final piece needed for storing arbitrary precision integers. Handle truncstore of a legal type to an unusual number of bits. Most of this code is not reachable unless the new legalize infrastructure is turned on. llvm-svn: 46249	2008-01-22 07:17:34 +00:00
Dale Johannesen	949e5a2f8a	Do not generate a FP_ROUND of f64 to f64. llvm-svn: 46195	2008-01-20 01:18:38 +00:00
Chris Lattner	1ea55cf816	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	72733e573b	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00
Chris Lattner	7ca4d5b1f3	merge a few pieces of code that do the store/load to stack pattern to use EmitStackConvert now. llvm-svn: 46066	2008-01-16 07:51:34 +00:00
Chris Lattner	87bc3e7ece	rename ExpandBIT_CONVERT to EmitStackConvert, generalizing it to allow it to emit different load and store kinds. llvm-svn: 46065	2008-01-16 07:45:30 +00:00
Chris Lattner	a2c7ff3386	simplify a bunch of code by using SelectionDAG::CreateStackTemporary instead of inlining its body. llvm-svn: 46062	2008-01-16 07:03:22 +00:00
Chris Lattner	91d86242f9	Change legalizeop of FP_ROUND and FP_EXTEND to not fall through into the ANY_EXTEND/ZERO_EXTEND/SIGN_EXTEND code to simplify it. Unmerge the code for FP_ROUND and FP_EXTEND from each other to make each one simpler. llvm-svn: 46061	2008-01-16 06:57:07 +00:00
Chris Lattner	ec224888a6	The type of the 'abort' node should be pointer type (because it's a function pointer) not MVT::Other. This fixes builtin_trap lowering on ppc, alpha, ia64 llvm-svn: 46018	2008-01-15 22:09:33 +00:00
Chris Lattner	ee8df1f4d3	Add support for targets that have a legal ISD::TRAP. llvm-svn: 46014	2008-01-15 21:58:08 +00:00
Anton Korobeynikov	6bbbc4cbfa	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Duncan Sands	53c954fa86	Output sinl for a long double FSIN node, not sin. Likewise fix up a bunch of other libcalls. While there I remove NEG_F32 and NEG_F64 since they are not used anywhere. This fixes 9 Ada ACATS failures. llvm-svn: 45833	2008-01-10 10:28:30 +00:00
Nate Begeman	5743da502e	If custom lowering of insert element fails, the result Val will be 0. Don't overwrite a variable used by the fallthrough code path in this case. llvm-svn: 45630	2008-01-05 20:47:37 +00:00
Duncan Sands	57a60f0466	Fix PR1833 - eh.exception and eh.selector return two values, which means doing extra legalization work. It would be easier to get this kind of thing right if there was some documentation... llvm-svn: 45472	2007-12-31 18:35:50 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	cab915f9cf	Implement expand support for MERGE_VALUEs that only produces one result. llvm-svn: 44304	2007-11-24 19:12:15 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Chris Lattner	09c0393d5e	ExpandUnalignedLoad doesn't handle vectors right at all apparently. Fix a couple of problems: 1. Don't assume the VT-1 is a VT that is half the size. 2. Treat vectors of FP in the vector path, not the FP path. This has a couple of remaining problems before it will work with the code in PR1811: the code below this change assumes that it can use extload/shift/or to construct the result, which isn't right for vectors. This also doesn't handle vectors of 1 or vectors that aren't pow-2. llvm-svn: 44243	2007-11-19 21:38:03 +00:00
Chris Lattner	6fa95ec19d	Implement vector expand support for shuffle_vector. This fixes PR1811. llvm-svn: 44242	2007-11-19 21:16:54 +00:00
Chris Lattner	67d77945e7	Implement splitting of UNDEF nodes. This is the first step towards fixing PR1811 llvm-svn: 44239	2007-11-19 20:21:32 +00:00
Dan Gohman	36347a26f9	Add support in SplitVectorOp for remainder operators. llvm-svn: 44233	2007-11-19 15:15:03 +00:00
Nate Begeman	d4d45c268c	Add support for vectors to int <-> float casts. llvm-svn: 44204	2007-11-17 03:58:34 +00:00
Anton Korobeynikov	66b91e66ec	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Nate Begeman	bd117f06ba	Basic non-power-of-2 vector support llvm-svn: 44181	2007-11-15 21:15:26 +00:00
Bill Wendling	f359fed9f9	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Duncan Sands	e795efea5b	Move MinAlign to MathExtras.h. llvm-svn: 43944	2007-11-09 13:41:39 +00:00
Evan Cheng	797d56ff17	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Evan Cheng	f14006f4d6	Didn't mean to check these in. llvm-svn: 43923	2007-11-09 01:28:33 +00:00
Evan Cheng	1bf166312b	Bug fix. Passive nodes are not in SUnitMap. llvm-svn: 43922	2007-11-09 01:27:11 +00:00
Dan Gohman	ccfc028283	Remainder operations must be either integer or floating-point. llvm-svn: 43781	2007-11-06 22:11:54 +00:00
Dan Gohman	08143e397d	Add support for vector remainder operations. llvm-svn: 43744	2007-11-05 23:35:22 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Dale Johannesen	b066c1f216	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00
Duncan Sands	1826deda68	The guaranteed alignment of ptr+offset is only the minimum of of offset and the alignment of ptr if these are both powers of 2. While the ptr alignment is guaranteed to be a power of 2, there is no reason to think that offset is. For example, if offset is 12 (the size of a long double on x86-32 linux) and the alignment of ptr is 8, then the alignment of ptr+offset will in general be 4, not 8. Introduce a function MinAlign, lifted from gcc, for computing the minimum guaranteed alignment. I've tried to fix up everywhere under lib/CodeGen/SelectionDAG/. I also changed some places that weren't wrong (because both values were a power of 2), as a defensive change against people copying and pasting the code. Hopefully someone who cares about alignment will review the rest of LLVM and fix up the remaining places. Since I'm on x86 I'm not very motivated to do this myself... llvm-svn: 43421	2007-10-28 12:59:45 +00:00
Dale Johannesen	a4a972e32d	Another expansion for i64 multiply, suitable for PPC. llvm-svn: 43314	2007-10-24 22:26:08 +00:00
Dale Johannesen	771188cf60	Fix a few places vector operations were not getting the operand's type from the right place. llvm-svn: 43195	2007-10-20 00:07:52 +00:00
Dale Johannesen	6802d0c96f	Redo "last ppc long double fix" as Chris wants. llvm-svn: 43189	2007-10-19 20:29:00 +00:00
Chris Lattner	064c31ebac	Fix a really nasty vector miscompilation bill recently introduced. llvm-svn: 43181	2007-10-19 16:47:35 +00:00
Rafael Espindola	846c19dd70	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Bill Wendling	de16ad1446	Negative indices aren't allowed here. llvm-svn: 43161	2007-10-19 01:10:49 +00:00
Bill Wendling	070aca5d25	Pointer arithmetic should be done with the index the same size as the pointer. llvm-svn: 43120	2007-10-18 08:32:37 +00:00
Dan Gohman	8f518b9875	Add support for ISD::SELECT in SplitVectorOp. llvm-svn: 43072	2007-10-17 14:48:28 +00:00
Duncan Sands	d42c812f4a	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems to correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Duncan Sands	052c843559	Fixes due to lack of type-safety for ValueType: (1) ValueType being passed instead of an opcode; (2) ValueType being passed for isVolatile (!) in getLoad. llvm-svn: 43028	2007-10-16 09:07:20 +00:00
Chris Lattner	d6f7d44eae	Move CreateStackTemporary out to SelectionDAG llvm-svn: 42995	2007-10-15 17:48:57 +00:00
Chris Lattner	b193517eed	One xform performed by LegalizeDAG is transformation of "store of fp" to "store of int". Make two changes: 1) only xform "store of f32" if i32 is a legal type for the target. 2) only xform "store of f64" if either i64 or i32 are legal for the target. 3) if i64 isn't legal, manually lower to 2 stores of i32 instead of letting a later pass of legalize do it. This is ugly, but helps future changes I'm about to commit. llvm-svn: 42980	2007-10-15 05:46:06 +00:00
Chris Lattner	fbbe570994	remove misleading comment. llvm-svn: 42970	2007-10-14 20:35:12 +00:00
Chris Lattner	ebe491ea9c	If a target doesn't have HasMULHU or HasUMUL_LOHI, ExpandOp would return without lo/hi set. Fall through to making a libcall instead. llvm-svn: 42969	2007-10-14 18:35:05 +00:00
Chris Lattner	5e6fe054a2	Add a simple optimization to simplify the input to truncate and truncstore instructions, based on the knowledge that they don't demand the top bits. llvm-svn: 42952	2007-10-13 06:35:54 +00:00
Dale Johannesen	61c574fc51	ppc long double. Implement fabs and fneg. llvm-svn: 42924	2007-10-12 19:02:17 +00:00
Dale Johannesen	a1a4a9ebfa	Implement i64->ppcf128 conversions. llvm-svn: 42919	2007-10-12 17:52:03 +00:00
Dan Gohman	4f056f3c10	Add support to SplitVectorOp for powi, where the second operand is a scalar integer. llvm-svn: 42906	2007-10-12 14:13:46 +00:00
Dale Johannesen	05ff9e8cda	PPC long double. Implement a couple more conversions. llvm-svn: 42888	2007-10-12 01:37:08 +00:00
Dan Gohman	2a7de41682	Codegen support for vector intrinsics. Factor out the code that expands the "nasty scalar code" for unrolling vectors into a separate routine, teach it how to handle mixed vector/scalar operands, as seen in powi, and use it for several operators, including sin, cos, powi, and pow. Add support in SplitVectorOp for fpow, fpowi and for several unary operators. llvm-svn: 42884	2007-10-11 23:57:53 +00:00
Dale Johannesen	6472eb63c2	Implement ppc long double->uint conversion. Make ppc long double constants print. llvm-svn: 42882	2007-10-11 23:32:15 +00:00
Dale Johannesen	007aa378ad	Next PPC long double bits. First cut at constants. No compile-time support for constant operations yet, just format transformations. Make readers and writers work. Split constants into 2 doubles in Legalize. llvm-svn: 42865	2007-10-11 18:07:22 +00:00
Dale Johannesen	666323eacd	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Dan Gohman	a160361c85	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Neil Booth	5f00973393	convertFromInteger, as originally written, expected sign-extended input. APInt unfortunately zero-extends signed integers, so Dale modified the function to expect zero-extended input. Make this assumption explicit in the function name. llvm-svn: 42732	2007-10-07 11:45:55 +00:00
Dale Johannesen	f864ac96d8	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Dale Johannesen	c0154c06d6	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Dan Gohman	12334acbfb	Legalize support for MUL_LOHI and DIVREM. llvm-svn: 42636	2007-10-05 14:17:22 +00:00
Evan Cheng	fd11ef4665	Silence a warning. llvm-svn: 42619	2007-10-05 01:09:32 +00:00
Dale Johannesen	9150652b21	Constant fold int-to-long-double conversions; use APFloat for int-to-float/double; use round-to-nearest for these (implementation-defined, seems to match gcc). llvm-svn: 42484	2007-09-30 18:19:03 +00:00
Dan Gohman	a90183e7d1	Teach SplitVectorOp how to split INSERT_VECTOR_ELT. llvm-svn: 42457	2007-09-28 23:53:40 +00:00
Dale Johannesen	789b5a505b	Fix long double -> uint64 conversion. llvm-svn: 42440	2007-09-28 18:44:17 +00:00
Dale Johannesen	25a00a63eb	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Evan Cheng	75439b3b78	Silence a compiler warning. llvm-svn: 42389	2007-09-27 07:35:39 +00:00
Dale Johannesen	f04d37d3a9	Fix f80 UNDEF. llvm-svn: 42359	2007-09-26 17:26:49 +00:00
Dan Gohman	6002818999	Use the correct result value type instead of using getValueType(0) in ExpandEXTRACT_VECTOR_ELT and SplitVectorOp. This fixes an abort in the included testcase. llvm-svn: 42264	2007-09-24 15:54:53 +00:00
Dale Johannesen	4230512f32	Change APFloat::convertFromInteger to take the incoming bit width instead of number of words allocated, which makes it actually work for int->APF conversions. Adjust callers. Add const to one of the APInt constructors to prevent surprising match when called with const argument. llvm-svn: 42210	2007-09-21 22:09:37 +00:00
Dale Johannesen	7d67e547b5	More long double fixes. x86_64 should build now. llvm-svn: 42155	2007-09-19 23:55:34 +00:00
Dale Johannesen	b59d25fe54	Fix longdouble -> uint conversion. llvm-svn: 42143	2007-09-19 17:53:26 +00:00
Dale Johannesen	7f724e9b94	Adjust per revew comments. llvm-svn: 42002	2007-09-16 16:51:49 +00:00
Dale Johannesen	98d3a08d8f	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Dale Johannesen	028084efe5	Revise previous patch per review comments. Next round of x87 long double stuff. Getting close now, basically works. llvm-svn: 41875	2007-09-12 03:30:33 +00:00
Dale Johannesen	245dceb06d	Add APInt interfaces to APFloat (allows directly access to bits). Use them in place of float and double interfaces where appropriate. First bits of x86 long double constants handling (untested, probably does not work). llvm-svn: 41858	2007-09-11 18:32:33 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Dale Johannesen	29e6ac4281	Implement misaligned FP loads and stores. llvm-svn: 41786	2007-09-08 19:29:23 +00:00
Dale Johannesen	d246b2ca5c	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Anton Korobeynikov	2bdec2a5ee	Fix use of declaration inside case block llvm-svn: 41584	2007-08-29 23:18:48 +00:00
Anton Korobeynikov	830b1cb4e9	Lower FRAME_TO_ADDR_OFFSET to zero by default (if not custom lowered) llvm-svn: 41578	2007-08-29 19:28:29 +00:00
Chris Lattner	2ed652f11d	Allow target constants to be illegal types. The target should know how to handle them. This fixes test/CodeGen/Generic/asm-large-immediate.ll llvm-svn: 41388	2007-08-25 01:00:22 +00:00
Evan Cheng	cb6d65e1bf	Avoid issue on 64-bit hosts. llvm-svn: 41143	2007-08-17 18:02:22 +00:00
Evan Cheng	631ccc6144	If dynamic_stackalloc alignment is > stack alignment, first issue an instruction to align the stack ptr before the decrement. llvm-svn: 41133	2007-08-16 23:50:06 +00:00

... 6 7 8 9 10 ...

1271 Commits