llvm-project

Commit Graph

Author	SHA1	Message	Date
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
Dan Gohman	733a64db57	Fix a bug where DAGCombine was producing an illegal ConstantFP node after legalize, and remove the workaround code from the ARM backend. llvm-svn: 78615	2009-08-10 23:15:10 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Dan Gohman	b717091e69	Make this comment more closely reflect the code. llvm-svn: 78569	2009-08-10 16:50:32 +00:00
Jakob Stoklund Olesen	dc6bccbaa6	Don't build illegal ops in DAGCombiner::SimplifyBinOpWithSameOpcodeHands(). Blackfin supports and/or/xor on i32 but not on i16. Teach DAGCombiner::SimplifyBinOpWithSameOpcodeHands to not produce illegal nodes after legalize ops. llvm-svn: 78497	2009-08-08 20:42:17 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Dan Gohman	3f323847bc	Avoid forming a SELECT_CC in a type that the target doesn't support. This isn't immediately interesting, because Legalize ends up lowering SELECT_CC if the target doesn't support it, but this simplifies the process. Also, if the SELECT_CC would be expanded in Legalize, it can potentially end up with two copies of the condition expression. By leaving it as SELECT+SETCC, the SELECT can be expanded into two SELECTs that use a single SETCC. The two comparisons are usually CSE'd, but depending on when various expressions get legalized, the comparison expression could involve calls to library functions, such that the comparison expression may not be able to be CSE'd. This will be needed by a future patch. llvm-svn: 77896	2009-08-02 16:19:38 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Jakob Stoklund Olesen	1ae0736830	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Owen Anderson	f945a9ed07	Move a few more convenience factory functions from Constant to LLVMContext. llvm-svn: 75840	2009-07-15 21:51:10 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Dale Johannesen	5234d3795f	Revert 72707 and 72709, for the moment. llvm-svn: 72712	2009-06-02 03:12:52 +00:00
Dale Johannesen	0b8ca79253	Make the implicit inputs and outputs of target-independent ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to) instead of MVT::Flag. Remove CARRY_FALSE in favor of 0; adjust all target-independent code to use this format. Most targets will still produce a Flag-setting target-dependent version when selection is done. X86 is converted to use i32 instead, which means TableGen needs to produce different code in xxxGenDAGISel.inc. This keys off the new supportsHasI1 bit in xxxInstrInfo, currently set only for X86; in principle this is temporary and should go away when all other targets have been converted. All relevant X86 instruction patterns are modified to represent setting and using EFLAGS explicitly. The same can be done on other targets. The immediate behavior change is that an ADC/ADD pair are no longer tightly coupled in the X86 scheduler; they can be separated by instructions that don't clobber the flags (MOV). I will soon add some peephole optimizations based on using other instructions that set the flags to feed into ADC. llvm-svn: 72707	2009-06-01 23:27:20 +00:00
Evan Cheng	86cdb4b345	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Evan Cheng	6673ff08fe	Incorporate patch feedbacks. llvm-svn: 72533	2009-05-28 18:41:02 +00:00
Evan Cheng	a9cda8abf2	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Torok Edwin	be6a9a151a	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Daniel Dunbar	a8c1658619	Silence Release-Asserts warnings. llvm-svn: 72011	2009-05-18 16:43:04 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Evan Cheng	cfc0513080	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Bill Wendling	026e5d7667	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	5f829d896d	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Bill Wendling	084669a1c9	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Bill Wendling	56f2987a87	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	d0ae15946c	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	be36f5ccda	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Dan Gohman	4539987920	Add a top-level comment about DAGCombiner's role in the compiler. llvm-svn: 70052	2009-04-25 17:09:45 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Bob Wilson	da188ebbbd	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	0e8d199f91	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	ad3e549a53	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Evan Cheng	fd81c73cde	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Bill Wendling	aa28be652c	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Mon P Wang	523c0852c6	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Mon P Wang	c86715631c	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Chris Lattner	4147f08e44	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Chris Lattner	43d6377f89	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Evan Cheng	aa887653f4	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Chris Lattner	4249b9a698	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Chris Lattner	ab5a443144	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Nate Begeman	a9e981225e	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Evan Cheng	a49de9de2e	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Scott Michel	9d31aca679	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Dan Gohman	e7fe80fcf9	Fix a bug that David Greene found in the DAGCombiner's logic that checks whether it's safe to transform a store of a bitcast value into a store of the original value. llvm-svn: 65201	2009-02-20 23:29:13 +00:00
Scott Michel	cf0da6c597	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	400dc2e2e4	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Dale Johannesen	f1163e9a4d	Propagation in TargetLowering. Includes passing a DL into SimplifySetCC which gets called elsewhere. llvm-svn: 63583	2009-02-03 00:47:48 +00:00
Duncan Sands	3ed768868d	Fix PR3453 and probably a bunch of other potential crashes or wrong code with codegen of large integers: eliminate the legacy getIntegerVTBitMask and getIntegerVTSignBit methods, which returned their value as a uint64_t, so couldn't handle huge types. llvm-svn: 63494	2009-02-01 18:06:53 +00:00
Bill Wendling	a6c75ffd73	Forgot some more DebugLoc propagations. llvm-svn: 63493	2009-02-01 11:19:36 +00:00
Duncan Sands	41826036b1	Fix PR3401: when using large integers, the type returned by getShiftAmountTy may be too small to hold shift values (it is an i8 on x86-32). Before and during type legalization, use a large but legal type for shift amounts: getPointerTy; afterwards use getShiftAmountTy, fixing up any shift amounts with a big type during operation legalization. Thanks to Dan for writing the original patch (which I shamelessly pillaged). llvm-svn: 63482	2009-01-31 15:50:11 +00:00
Bill Wendling	3b585af0ec	Don't use DebugLoc::getUnknownLoc(). Default to something hopefully sensible. llvm-svn: 63473	2009-01-31 03:12:48 +00:00
Bill Wendling	31b50991cb	More DebugLoc propagation. llvm-svn: 63454	2009-01-30 23:59:18 +00:00
Bill Wendling	27d9dd4b57	More DebugLoc propagation. llvm-svn: 63452	2009-01-30 23:36:47 +00:00
Bill Wendling	306bfc2213	More DebugLoc propagation in LOAD etc. methods. llvm-svn: 63451	2009-01-30 23:27:35 +00:00
Bill Wendling	0bd29743e3	More DebugLoc propagation in floating-point methods. llvm-svn: 63446	2009-01-30 23:15:49 +00:00
Bill Wendling	6fbf5495f8	Standardize comments about folding xforms. llvm-svn: 63443	2009-01-30 23:10:18 +00:00
Bill Wendling	8fb81f1b3d	Get rid of the non-DebugLoc-ified getNOT() method. llvm-svn: 63442	2009-01-30 23:03:19 +00:00
Bill Wendling	3dc5d2454e	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63441	2009-01-30 22:57:07 +00:00
Bill Wendling	cb9be5d174	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63440	2009-01-30 22:53:48 +00:00
Bill Wendling	4e0a61514b	Propagate debug loc info for BIT_CONVERT. llvm-svn: 63439	2009-01-30 22:44:24 +00:00
Bill Wendling	7bfa43b022	Propagate debug loc info for more *_EXTEND methods. llvm-svn: 63437	2009-01-30 22:33:24 +00:00
Bill Wendling	9b3dc8d848	Propagate debug loc info for ANY_EXTEND. llvm-svn: 63436	2009-01-30 22:27:33 +00:00
Bill Wendling	c409318562	Propagate debug loc info for some of the *_EXTEND functions. llvm-svn: 63434	2009-01-30 22:23:15 +00:00
Bill Wendling	b6b6f46fe4	- Propagate debug loc info for SELECT. - Added xform for (select X, 1, Y) and (select X, Y, 0), which was commented on, but missing. llvm-svn: 63428	2009-01-30 22:02:18 +00:00
Bill Wendling	d51e3ff540	Propagate debug loc info for Shifts. llvm-svn: 63424	2009-01-30 21:37:17 +00:00
Bill Wendling	35972a9460	Propagate debug loc info for XOR and MatchRotate. llvm-svn: 63420	2009-01-30 21:14:50 +00:00
Bill Wendling	f29b6e1318	Propagate debug loc info for OR. Also clean up some comments. llvm-svn: 63419	2009-01-30 20:59:34 +00:00
Bill Wendling	ff8acd684f	Perform obvious constant arithmetic folding. llvm-svn: 63417	2009-01-30 20:50:00 +00:00
Bill Wendling	8617191302	Propagate debug loc info for AND. Also clean up some comments. llvm-svn: 63416	2009-01-30 20:43:18 +00:00
Bill Wendling	781db7a1ad	Propagate debug loc info in SimplifyBinOpWithSameOpcodeHands. llvm-svn: 63411	2009-01-30 19:25:47 +00:00
Bill Wendling	9b3407e5bb	Propagate debug loc info in SimplifyNodeWithTwoResults. llvm-svn: 63376	2009-01-30 03:08:40 +00:00
Bill Wendling	faed065e5c	Propagate debug loc info for MULHS. llvm-svn: 63375	2009-01-30 03:00:18 +00:00
Bill Wendling	d033af09fd	Propagate debug loc info for SREM and UREM. llvm-svn: 63374	2009-01-30 02:57:00 +00:00
Bill Wendling	aff3e03765	Propagate debug loc info for UDIV. llvm-svn: 63373	2009-01-30 02:55:25 +00:00
Bill Wendling	5b663e7b53	Propagate debug loc info for SDIV. llvm-svn: 63372	2009-01-30 02:52:17 +00:00
Bill Wendling	b48dcf67e5	Forgot to propagate debug loc info here. llvm-svn: 63371	2009-01-30 02:49:26 +00:00
Bill Wendling	091f92f568	Propagate debug loc info for MUL. llvm-svn: 63369	2009-01-30 02:45:56 +00:00
Bill Wendling	48ff08ef3e	Propagate debug loc info in SUB. llvm-svn: 63368	2009-01-30 02:42:10 +00:00
Bill Wendling	6127757920	Propagate debug loc info in ADDC and ADDE. llvm-svn: 63367	2009-01-30 02:38:00 +00:00
Bill Wendling	c442348dd7	Propagate debug loc info in DAG combine's "ADD". llvm-svn: 63366	2009-01-30 02:31:17 +00:00
Bill Wendling	cdd96133bd	- Propagate debug loc info in combineSelectAndUse(). - Modify ReassociateOps so that the resulting SDValue is what the comment claims it is. llvm-svn: 63365	2009-01-30 02:23:43 +00:00
Bill Wendling	9c9a3b6665	Propagate debug location info for the token factor. llvm-svn: 63355	2009-01-30 01:13:16 +00:00
Bill Wendling	f6d0aff0bd	Add DebugLoc propagation to some of the methods in DAG combiner. llvm-svn: 63350	2009-01-30 00:45:56 +00:00
Dan Gohman	e58ab79f33	Make x86's BT instruction matching more thorough, and add some dagcombines that help it match in several more cases. Add several more cases to test/CodeGen/X86/bt.ll. This doesn't yet include matching for BT with an immediate operand, it just covers more register+register cases. llvm-svn: 63266	2009-01-29 01:59:02 +00:00
Dan Gohman	4aa1846215	Make isOperationLegal do what its name suggests, and introduce a new isOperationLegalOrCustom, which does what isOperationLegal previously did. Update a bunch of callers to use isOperationLegalOrCustom instead of isOperationLegal. In some case it wasn't obvious which behavior is desired; when in doubt I changed then to isOperationLegalOrCustom as that preserves their previous behavior. This is for the second half of PR3376. llvm-svn: 63212	2009-01-28 17:46:25 +00:00
Dan Gohman	fb58faf29e	Add an assertion to the form of SelectionDAG::getConstant that takes a uint64_t to verify that the value is in range for the given type, to help catch accidental overflow. Fix a few places that relied on getConstant implicitly truncating the value. llvm-svn: 63128	2009-01-27 20:39:34 +00:00
Dan Gohman	8e4ac9b71a	Take the next steps in making SDUse more consistent with LLVM Use, and tidy up SDUse and related code. - Replace the operator= member functions with a set method, like LLVM Use has, and variants setInitial and setNode, which take care up updating use lists, like LLVM Use's does. This simplifies code that calls these functions. - getSDValue() is renamed to get(), as in LLVM Use, though most places can either use the implicit conversion to SDValue or the convenience functions instead. - Fix some more node vs. value terminology issues. Also, eliminate the one remaining use of SDOperandPtr, and SDOperandPtr itself. llvm-svn: 62995	2009-01-26 04:35:06 +00:00
Dan Gohman	1275e28ded	Fold x-0 to x in unsafe-fp-math mode. This comes up in the testcase from PR3376, and in fact is sufficient to completely avoid the problem in that testcase. There's an underlying problem though; TLI.isOperationLegal considers Custom to be Legal, which might be ok in some cases, but that's what DAGCombiner is using in many places to test if something is legal when LegalOperations is true. When DAGCombiner is running after legalize, this isn't sufficient. I'll address this in a separate commit. llvm-svn: 62860	2009-01-23 19:10:37 +00:00
Bob Wilson	c2dc7ee5d0	Fix a minor bug in DAGCombiner's folding of SELECT. Folding "select C, 0, 1" to "C ^ 1" is only valid when C is known to be either 0 or 1. Most of the similar foldings in this function only handle "i1" types, but this one appears intentionally written to handle larger integer types. If C has an integer type larger than "i1", this needs to check if the high bits of a boolean are known to be zero. I also changed the comment to describe this folding as "C ^ 1" instead of "~C", since that is what the code does and since the latter would only be valid for "i1" types. The good news is that most LLVM targets use TargetLowering::ZeroOrOneBooleanContent so this change will not disable the optimization; the bad news is that I've been unable to come up with a testcase to demonstrate the problem. I have also removed a "FIXME" comment for folding "select C, X, 0" to "C & X", since the code looks correct to me. It could be made more aggressive by not limiting the type to "i1", but that would then require checking for TargetLowering::ZeroOrNegativeOneBooleanContent. Similar changes could be done for the other SELECT foldings, but it was decided to be not worth the trouble and complexity (see e.g., r44663). llvm-svn: 62790	2009-01-22 22:05:48 +00:00
Dan Gohman	1f3411de47	Don't create ISD::FNEG nodes after legalize if they aren't legal. Simplify x+0 to x in unsafe-fp-math mode. This avoids a bunch of redundant work in many cases, because in unsafe-fp-math mode, ISD::FADD with a constant is considered free to negate, so the DAGCombiner often negates x+0 to -0-x thinking it's free, when in reality the end result is -x, which is more expensive than x. Also, combine x*0 to 0. This fixes PR3374. llvm-svn: 62789	2009-01-22 21:58:43 +00:00
Bob Wilson	c58900504b	Add SelectionDAG::getNOT method to construct bitwise NOT operations, corresponding to the "not" and "vnot" PatFrags. Use the new method in some places where it seems appropriate. llvm-svn: 62768	2009-01-22 17:39:32 +00:00
Dan Gohman	7e6b932f18	Simplify ReduceLoadWidth's logic: it doesn't need several different special cases after producing the new reduced-width load, because the new load already has the needed adjustments built into it. This fixes several bugs due to the special cases, including PR3317. llvm-svn: 62692	2009-01-21 15:17:51 +00:00
Dan Gohman	161b7b66ac	Fix a dagcombine to not generate loads of non-round integer types, as its comment says, even in the case where it will be generating extending loads. This fixes PR3216. llvm-svn: 62557	2009-01-20 01:06:45 +00:00
Dan Gohman	cd0b1bf0a0	Fix SelectionDAG::ReplaceAllUsesWith to behave correctly when uses are added to the From node while it is processing From's use list, because of automatic local CSE. The fix is to avoid visiting any new uses. Fix a few places in the DAGCombiner that assumed that after a RAUW call, the From node has no users and may be deleted. This fixes PR3018. llvm-svn: 62533	2009-01-19 21:44:21 +00:00
Mon P Wang	e9e7abb6b8	Simplify extract element based on comments from Duncan Sands. llvm-svn: 62459	2009-01-18 06:43:40 +00:00
Mon P Wang	ca6d6dea0b	Simplify extract element of a scalar to vector. llvm-svn: 62383	2009-01-17 00:07:25 +00:00
Dan Gohman	38978ba972	Use the getNode() accessor instead of accessing the Node member directly, which is private as of r55504. llvm-svn: 62364	2009-01-16 21:47:21 +00:00
Chris Lattner	41828cdb0a	new nodes should be added to the worklist, not old nodes. llvm-svn: 62359	2009-01-16 21:15:56 +00:00
Dan Gohman	619ef48a52	Move a few containers out of ScheduleDAGInstrs::BuildSchedGraph and into the ScheduleDAGInstrs class, so that they don't get destructed and re-constructed for each block. This fixes a compile-time hot spot in the post-pass scheduler. To help facilitate this, tidy and do some minor reorganization in the scheduler constructor functions. llvm-svn: 62275	2009-01-15 19:20:50 +00:00
Dan Gohman	b9fa1d24f8	Fix a DAGCombiner abort on an invalid shift count constant. This fixes PR3250. llvm-svn: 61613	2009-01-03 19:22:06 +00:00
Duncan Sands	8feb694e8f	Fix PR3274: when promoting the condition of a BRCOND node, promote from i1 all the way up to the canonical SetCC type. In order to discover an appropriate type to use, pass MVT::Other to getSetCCResultType. In order to be able to do this, change getSetCCResultType to take a type as an argument, not a value (this is also more logical). llvm-svn: 61542	2009-01-01 15:52:00 +00:00
Dale Johannesen	ee573fcefc	Change comments so everybody can understand them, hopefully. llvm-svn: 61405	2008-12-23 23:47:22 +00:00
Dale Johannesen	acc84e5aa0	Add another permutation where we should get rid of a-a. llvm-svn: 61401	2008-12-23 23:01:27 +00:00
Dale Johannesen	d2a4685860	One more permutation of subtracting off a base value. llvm-svn: 61361	2008-12-23 01:59:54 +00:00
Dale Johannesen	f51dcef803	A new dag combine; several permutations of this are there under ADD, this one was missing. llvm-svn: 61107	2008-12-16 22:13:49 +00:00
Bill Wendling	1a317678bc	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with a associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Bill Wendling	40d2476adc	Clarify FIXME. llvm-svn: 60867	2008-12-11 01:26:44 +00:00
Mon P Wang	b5eb7205ea	Make fix for r60829 less conservative to allow the proper optimization for vec_extract-sse4.ll. llvm-svn: 60865	2008-12-11 00:26:16 +00:00
Bill Wendling	0864a75ebf	If ADD, SUB, or MUL have an overflow bit that's used, don't do transformation on them. The DAG combiner expects that nodes that are transformed have one value result. llvm-svn: 60857	2008-12-10 22:36:00 +00:00
Mon P Wang	4637c3c698	Fixed a bug when trying to optimize a extract vector element of a bit convert that changes the number of elements of a shuffle. llvm-svn: 60829	2008-12-10 03:59:02 +00:00
Dale Johannesen	54bdec238a	One more transformation. llvm-svn: 60432	2008-12-02 18:40:40 +00:00
Dale Johannesen	8c76670b5a	Add a few more transformations. llvm-svn: 60391	2008-12-02 01:30:54 +00:00
Dale Johannesen	73bc0ba4c9	Add a missing case in visitADD. llvm-svn: 60137	2008-11-27 00:43:21 +00:00
Duncan Sands	dc2dac181a	If the type legalizer actually legalized anything (this doesn't happen that often, since most code does not use illegal types) then follow it by a DAG combiner run that is allowed to generate illegal operations but not illegal types. I didn't modify the target combiner code to distinguish like this between illegal operations and illegal types, so it will not produce illegal operations as well as not producing illegal types. llvm-svn: 59960	2008-11-24 14:53:14 +00:00
Duncan Sands	8d6e2e13d5	Rename SetCCResultContents to BooleanContents. In practice these booleans are mostly produced by SetCC, however the concept is more general. llvm-svn: 59911	2008-11-23 15:47:28 +00:00
Bill Wendling	be8e7f851c	- Move conversion of [SU]ADDO from DAG combiner into legalizer. - Add "promote integer type" stuff to the legalizer for these nodes. llvm-svn: 59847	2008-11-22 00:22:52 +00:00
Bill Wendling	0b5be6c5e0	Default to converting UADDO to the generic form that SADDO is converted to. llvm-svn: 59801	2008-11-21 07:44:30 +00:00
Bill Wendling	8badb674eb	Remove chains. Unnecessary. llvm-svn: 59783	2008-11-21 02:22:59 +00:00
Bill Wendling	77538cc510	Rename "ADDO" to "SADDO" and "UADDO". The "UADDO" isn't equivalent to "ADDC" because the boolean it returns to indicate an overflow may not be treated like as a flag. It could be stored to memory, for instance. llvm-svn: 59780	2008-11-21 02:12:42 +00:00
Bill Wendling	74296c60ff	Implement the sadd_with_overflow intrinsic. This is converted into "ISD::ADDO". ISD::ADDO is lowered into a target-independent form that does the addition and then checks if the result is less than one of the operands. (If it is, then there was an overflow.) llvm-svn: 59779	2008-11-21 02:03:52 +00:00
Bill Wendling	49a5ce863e	Fix for PR3040: The CC was changed, but wasn't checked to see if it was legal if the DAG combiner was being run after legalization. Threw in a couple of checks just to make sure that it's okay. As far as the PR is concerned, no back-end target actually exhibited this problem, so there isn't an associated testcase. llvm-svn: 59035	2008-11-11 08:25:46 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Evan Cheng	c7b04a12bb	Type of shuffle mask has changed. llvm-svn: 58751	2008-11-05 06:04:18 +00:00
Chris Lattner	5fa1040130	Don't produce invalid comparisons after legalize. llvm-svn: 58320	2008-10-28 07:11:07 +00:00
Duncan Sands	c6d12bd665	Use a legal integer type for vector shuffle mask elements. Otherwise LegalizeTypes will, reasonably enough, legalize the mask, which may result in it no longer being a BUILD_VECTOR node (LegalizeDAG simply ignores the legality or not of vector masks). llvm-svn: 57782	2008-10-19 14:58:05 +00:00
Dan Gohman	2fe6bee5b6	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	a39b0a1f05	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Evan Cheng	07d53b1d33	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Dale Johannesen	54306fe499	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Dan Gohman	6e0548336a	Rename ConstantSDNode's getSignExtended to getSExtValue, for consistancy with ConstantInt, and re-implement it in terms of ConstantInt's getSExtValue. llvm-svn: 56700	2008-09-26 21:54:37 +00:00
Bill Wendling	dea91308ae	Reapplying r56550 llvm-svn: 56553	2008-09-24 10:25:02 +00:00
Eric Christopher	4e26a81371	Temporarily revert r56550 until missing commit can be added. llvm-svn: 56551	2008-09-24 08:30:44 +00:00
Bill Wendling	7c31464a0b	Refactor the constant folding code into it's own function. And call it from both the SelectionDAG and DAGCombiner code. The only functionality change is that now the DAG combiner is performing the constant folding for these operations instead of being a no-op. This is not in response to a bug, so there isn't a testcase. llvm-svn: 56550	2008-09-24 07:11:26 +00:00
Evan Cheng	13beeeb128	Per review feedback: Only perform (srl x, (trunc (and y, c))) -> (srl x, (and (trunc y), c)) etc. when both "trunc" and "and" have single uses. llvm-svn: 56452	2008-09-22 18:19:24 +00:00
Dan Gohman	ec270fb640	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweight by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dan Gohman	1df80f6b1c	In visitUREM, arrange for the temporary UDIV node to be revisited, consistent with the code in visitSREM. llvm-svn: 55923	2008-09-08 16:59:01 +00:00
Bill Wendling	5f7371d7b1	Revert my previous change -- the subtraction of two constants was a no-op before. This is taken care of in the selection DAG pass. In my opinion, this should be in one place or the other. I.e., it should probably be removed from the DAG combiner (along with the other arithmetic transformations on constants that are essentially no-ops). llvm-svn: 55889	2008-09-08 01:56:32 +00:00
Bill Wendling	df81749886	Convert // fold (sub c1, c2) -> c1-c2 from a no-op into an actual transformation. llvm-svn: 55886	2008-09-07 11:34:47 +00:00
Dan Gohman	921ddd69ba	Fix a search+replace-o. llvm-svn: 55824	2008-09-05 01:58:21 +00:00
Dan Gohman	634412fe35	Clean up uses of TargetLowering::getTargetMachine. llvm-svn: 55769	2008-09-04 15:39:15 +00:00
Bill Wendling	11284ea499	Another situation where ROTR is cheaper than ROTL. llvm-svn: 55577	2008-08-31 01:13:31 +00:00
Bill Wendling	4822a7ac8a	For this pattern, ROTR is the cheaper option. llvm-svn: 55576	2008-08-31 01:04:56 +00:00
Bill Wendling	fc72416447	- Fix comment so that it describes how the code really works: // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotl x, y) // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotr x, (sub 32, y)) Example: (x == 0xDEADBEEF and y == 4) (x << 4) \| (x >> 28) => 0xEADBEEF0 \| 0x0000000D => 0xEADBEEFD (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => 0xEADBEEFD - Fix comment and code for second version. It wasn't using the rot* propertly. // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotr x, y) // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotl x, (sub 32, y)) (x << 28) \| (x >> 4) => 0xD0000000 \| 0x0DEADBEE => 0xDDEADBEE (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => (0xEADBEEFD) llvm-svn: 55575	2008-08-31 00:37:27 +00:00
Gabor Greif	e12264bf41	fix some 80-col violations llvm-svn: 55571	2008-08-30 19:29:20 +00:00
Evan Cheng	cfb7f3abdf	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Evan Cheng	894be333f1	Fix 80 col. violations. llvm-svn: 55551	2008-08-29 23:20:46 +00:00
Evan Cheng	5e7658c2e4	Back out 55498. It broken Apple style bootstrapping. llvm-svn: 55549	2008-08-29 22:21:44 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Dan Gohman	f27e33baa7	Optimize DAGCombiner's worklist processing. Previously it started its work by putting all nodes in the worklist, requiring a big dynamic allocation. Now, DAGCombiner just iterates over the AllNodes list and maintains a worklist for nodes that are newly created or need to be revisited. This allows the worklist to stay small in most cases, so it can be a SmallVector. This has the side effect of making DAGCombine not miss a folding opportunity in alloca-align-rounding.ll. llvm-svn: 55498	2008-08-28 21:01:56 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Dan Gohman	837c13a029	Disable DAGCombine's alignment inference in "fast" codegen mode. llvm-svn: 55059	2008-08-20 16:30:28 +00:00
Dan Gohman	550c9af91f	Improve support for vector casts in LLVM IR and CodeGen. llvm-svn: 54784	2008-08-14 20:04:46 +00:00
Dan Gohman	127bb03b8c	Take the FrameOffset into account when computing the alignment of stack objects. This fixes PR2656. llvm-svn: 54646	2008-08-11 18:27:03 +00:00
Dan Gohman	345d63ccf2	Improve dagcombining for sext-loads and sext-in-reg nodes. llvm-svn: 54239	2008-07-31 00:50:31 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	91e5dcb680	Tidy SDNode::use_iterator, and complete the transition to have it parallel its analogue, Value::value_use_iterator. The operator* method now returns the user, rather than the use. llvm-svn: 54127	2008-07-27 20:43:25 +00:00
Evan Cheng	b8ff223f26	Fix pr2566: incorrect assumption about bit_convert. It doesn't not have to output a vector value. Patch by Nicolas Capens! llvm-svn: 53932	2008-07-22 20:42:56 +00:00
Dan Gohman	581cc87f57	Add titles to the various SelectionDAG viewGraph calls that include useful information like the name of the block being viewed and the current phase of compilation. llvm-svn: 53872	2008-07-21 20:00:07 +00:00
Duncan Sands	b0e3938651	Add VerifyNode, a place to put sanity checks on generic SDNode's (nodes with their own constructors should do sanity checking in the constructor). Add sanity checks for BUILD_VECTOR and fix all the places that were producing bogus BUILD_VECTORs, as found by "make check". My favorite is the BUILD_VECTOR with only two operands that was being used to build a vector with four elements! llvm-svn: 53850	2008-07-21 10:20:31 +00:00
Duncan Sands	32e387c461	Revert 53729, after waking up in the middle of the night realising that it was wrong :) I think the reason the same type was being used for the shufflevec of indices as for the actual indices is so that if one of them needs splitting then so does the other. After my patch it might be that the indices need splitting but not the rest, yet there is no good way of handling that. I think the right solution is to not have the shufflevec be an operand at all: just have it be the list of numbers it actually is, stored as extra info in the node. llvm-svn: 53768	2008-07-18 20:12:05 +00:00
Duncan Sands	656b256a1a	Use a legal type for elements of the vector_shuffle mask. These are just indices into the shuffled vector so their type is unrelated to the type of the shuffled elements (which is what was being used before). This fixes vec_shuffle-11.ll when using LegalizeTypes. What seems to have happened is that Dan's recent change r53687, which corrected the result type of the shuffle, somehow caused LegalizeTypes to notice that the mask operand was a BUILD_VECTOR with a legal type but elements of an illegal type (i64). LegalizeTypes legalized this by introducing a new BUILD_VECTOR of i32 and bitcasting it to the old type. But the mask operand is not supposed to be a bitcast but a straight BUILD_VECTOR of constants, causing a crash. llvm-svn: 53729	2008-07-17 19:28:41 +00:00
Dan Gohman	2714059079	Fix the result type of a VECTOR_SHUFFLE+BIT_CONVERT dagcombine. This was turned up by some new SelectionDAG assertion checks that I'm working on. llvm-svn: 53687	2008-07-16 16:13:58 +00:00
Dan Gohman	a76e60a77a	Use reserve. SelectionDAG::allnodes_size is linear, but that doesn't appear to outweigh the benefit of reducing heap traffic. If it does become a problem, we should teach SelectionDAG to keep a count of how many nodes are live, because there are several other places where that information would be useful as well. llvm-svn: 52926	2008-06-30 21:04:06 +00:00
Dan Gohman	6f7b5a6392	When folding a bitcast into a load or store, preserve the alignment information of the original load or store, which is checked to be at least as good, and possibly better. llvm-svn: 52849	2008-06-28 00:45:22 +00:00
Chris Lattner	df1cbdd645	duncan points out that isOperationLegal includes a check for type legality. Thanks Duncan! llvm-svn: 52786	2008-06-26 17:16:00 +00:00
Chris Lattner	b1e66ce3bb	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and visaversa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Dan Gohman	b4e2637e9b	Duncan pointed out this code could be tidied. llvm-svn: 52624	2008-06-23 15:29:14 +00:00
Dan Gohman	546505e7e1	Simplify some getNode calls. llvm-svn: 52604	2008-06-21 22:06:07 +00:00
Duncan Sands	37c1f5267b	Allow these transforms for types like i256 while still excluding types like i1 (not byte sized) and i120 (loading an i120 requires loading an i64, an i32, an i16 and an i8, which is expensive). llvm-svn: 52310	2008-06-16 08:14:38 +00:00
Duncan Sands	075293ff46	The transforms in visitEXTRACT_VECTOR_ELT are not valid if the load is volatile. Hopefully all wrong DAG combiner transforms of volatile loads and stores have now been caught. llvm-svn: 52293	2008-06-15 20:12:31 +00:00
Duncan Sands	b1bfff53fe	Remove a redundant AfterLegalize check. Turn on some code when !AfterLegalize - but since this whole code section is turned off by an "if (0)" it's not really turning anything on. llvm-svn: 52276	2008-06-14 17:48:34 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	bf17080ec2	Sometimes (rarely) nodes held in LegalizeTypes maps can be deleted. This happens when RAUW replaces a node N with another equivalent node E, deleting the first node. Solve this by adding (N, E) to ReplacedNodes, which is already used to remap nodes to replacements. This means that deleted nodes are being allowed in maps, which can be delicate: the memory may be reused for a new node which might get confused with the old deleted node pointer hanging around in the maps, so detect this and flush out maps if it occurs (ExpungeNode). The expunging operation is expensive, however it never occurs during a llvm-gcc bootstrap or anywhere in the nightly testsuite. It occurs three times in "make check": Alpha/illegal-element-type.ll, PowerPC/illegal-element-type.ll and X86/mmx-shift.ll. If expunging proves to be too expensive then there are other more complicated ways of solving the problem. In the normal case this patch adds the overhead of a few more map lookups, which is hopefully negligable. llvm-svn: 52214	2008-06-11 11:42:12 +00:00
Duncan Sands	67d0f332d5	Various tweaks related to apint codegen. No functionality change for non-funky-sized integers. llvm-svn: 52151	2008-06-09 15:48:25 +00:00
Duncan Sands	93b6609ae2	Remove some DAG combiner assumptions about sizes of integer types. Fix the isMask APInt method to actually work (hopefully) rather than crashing because it adds apints of different bitwidths. It looks like isShiftedMask is also broken, but I'm leaving that one to the APInt people (it is not used anywhere). llvm-svn: 52142	2008-06-09 11:32:28 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	643b3a0581	Add #includes to make some dependencies explicit. llvm-svn: 51496	2008-05-23 20:40:06 +00:00
Dan Gohman	c1a4e212a3	Code simplification. llvm-svn: 51345	2008-05-20 20:56:33 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Dan Gohman	c968c1f592	Evan pointed out that folding sext to zext may not be correct if the zext is not legal. llvm-svn: 50368	2008-04-28 18:47:17 +00:00
Dan Gohman	3eb10f758e	Teach DAGCombine to convert (sext x) to (zext x) when the sign-bit of x is known to be zero. llvm-svn: 50357	2008-04-28 16:58:24 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Roman Levenstein	51f532f92d	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Evan Cheng	025cea1126	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Dan Gohman	f549b26254	Fix a DAGCombiner optimization to respect volatile qualification. llvm-svn: 48994	2008-03-31 20:32:52 +00:00
Roman Levenstein	358e04a185	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Evan Cheng	df1690dc7c	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48792	2008-03-25 20:08:07 +00:00
Evan Cheng	fe7610f37f	Remove an unneeded test. llvm-svn: 48755	2008-03-24 23:55:16 +00:00
Evan Cheng	31604a62f6	Teach DAG combiner to commute commutable binary nodes in order to achieve sdisel CSE. llvm-svn: 48673	2008-03-22 01:55:50 +00:00
Christopher Lamb	3e9f49716e	Check even more carefully before applying this DAGCombine transform. llvm-svn: 48580	2008-03-20 04:31:39 +00:00
Evan Cheng	7a3e750fd2	Fix this xform: (sra (shl X, m), result_size) -> (sign_extend (trunc (shl X, result_size - n - m))) llvm-svn: 48578	2008-03-20 02:18:41 +00:00
Christopher Lamb	8fe9109469	Fix X86's isTruncateFree to not claim that truncate to i1 is free. This fixes Bill's testcase that failed for r48491. llvm-svn: 48542	2008-03-19 08:30:06 +00:00
Bill Wendling	efb4d9ef80	Temporarily revert r48491. It's breaking test/CodeGen/X86/xorl.ll. llvm-svn: 48510	2008-03-18 22:29:51 +00:00
Christopher Lamb	3e408d4d82	Target independent DAG transform to use truncate for field extraction + sign extend on targets where this is profitable. Passes nightly on x86-64. llvm-svn: 48491	2008-03-18 16:46:39 +00:00
Dan Gohman	b72127ac4c	More APInt-ification. llvm-svn: 48344	2008-03-13 22:13:53 +00:00
Evan Cheng	99ee78ef63	Clean up my own mess. X86 lowering normalize vector 0 to v4i32. However DAGCombine can fold (sub x, x) -> 0 after legalization. It can create a zero vector of a type that's not expected (e.g. v8i16). We don't want to disable the optimization since leaving a (sub x, x) is really bad. Add isel patterns for other types of vector 0 to ensure correctness. It's highly unlikely to happen other than in bugpoint reduced test cases. llvm-svn: 48279	2008-03-12 07:02:50 +00:00
Evan Cheng	0903aef2ff	Total brain cramp. llvm-svn: 48274	2008-03-12 02:05:05 +00:00
Evan Cheng	b9e4280e94	Somewhat better solution. llvm-svn: 48170	2008-03-10 19:58:22 +00:00
Scott Michel	a6729e8666	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Evan Cheng	831ae49599	Doh llvm-svn: 48140	2008-03-10 07:59:01 +00:00
Evan Cheng	b5d11980d9	Avoid creating BUILD_VECTOR of all zero elements of "non-normalized" type (e.g. v8i16 on x86) after legalizer. Instruction selection does not expect to see them. In all likelihood this can only be an issue in a bugpoint reduced test case. llvm-svn: 48136	2008-03-10 07:19:13 +00:00
Evan Cheng	567d2e5b57	Rename isOperand() to isOperandOf() (and other similar methods). It always confuses me. llvm-svn: 47872	2008-03-04 00:41:45 +00:00
Dan Gohman	e1c4f99549	Misc. APInt-ification in the DAGCombiner. llvm-svn: 47869	2008-03-03 23:51:38 +00:00
Dan Gohman	ae2b6fbb8e	Convert SimplifyDemandedMask and ShrinkDemandedConstant to use APInt. Change several cases in SimplifyDemandedMask that don't ever do any simplifying to reuse the logic in ComputeMaskedBits instead of duplicating it. llvm-svn: 47648	2008-02-27 00:25:32 +00:00
Chris Lattner	07c83cc86e	Fix PR2096, a regression introduced with my patch last night. This also fixes cfrac, flops, and 175.vpr llvm-svn: 47605	2008-02-26 17:09:59 +00:00
Chris Lattner	e7c14013f5	Fix isNegatibleForFree to not return true for ConstantFP nodes after legalize. Just because a constant is legal (e.g. 0.0 in SSE) doesn't mean that its negated value is legal (-0.0). We could make this stronger by checking to see if the negated constant is actually legal post negation, but it doesn't seem like a big deal. llvm-svn: 47591	2008-02-26 07:04:54 +00:00
Dan Gohman	1f372edd97	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Dan Gohman	360c86aed5	Add explicit keywords. llvm-svn: 47382	2008-02-20 16:44:09 +00:00
Dan Gohman	d0ff91dac5	Convert DAGCombiner to use the APInt form of ComputeMaskedBits. llvm-svn: 47381	2008-02-20 16:33:30 +00:00
Anton Korobeynikov	035eaacd1f	Update gcc 4.3 warnings fix patch with recent head changes llvm-svn: 47368	2008-02-20 11:10:28 +00:00
Evan Cheng	6200c225e0	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. llvm-svn: 47290	2008-02-18 23:04:32 +00:00
Chris Lattner	ee322b44a4	teach dag combiner how to eliminate MERGE_VALUES nodes. llvm-svn: 47052	2008-02-13 07:25:05 +00:00
Duncan Sands	7377f5fbe3	Add a isBigEndian method to complement isLittleEndian. llvm-svn: 46954	2008-02-11 10:37:04 +00:00
Bill Wendling	9c2ce9a32d	Return "(c1 + c2)" instead of yet another ADD node (which made this a no-op). llvm-svn: 46922	2008-02-10 08:10:24 +00:00
Chris Lattner	d2d166ea9f	the world doesn't need my debugging code. llvm-svn: 46678	2008-02-03 07:01:05 +00:00
Chris Lattner	b2b9d6f0fb	Change the 'global modification' APIs in SelectionDAG to take a new DAGUpdateListener object pointer instead of just returning a vector of deleted nodes. This makes the interfaces more efficient (no more allocating a vector [at least a malloc], filling it in, then walking it) and more clean. This also allows the client to be notified of nodes that are changed but not deleted. llvm-svn: 46677	2008-02-03 06:49:24 +00:00
Dan Gohman	47a7d6fafe	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Dan Gohman	70de4cb1cd	Use empty() instead of comparing size() with zero. llvm-svn: 46514	2008-01-29 13:02:09 +00:00
Chris Lattner	2ee91f4300	Fix PowerPC/./2007-10-18-PtrArithmetic.ll llvm-svn: 46424	2008-01-27 23:32:17 +00:00
Chris Lattner	d0496d0433	fix a crash on CodeGen/X86/vector-rem.ll llvm-svn: 46422	2008-01-27 23:21:58 +00:00
Chris Lattner	888560d62c	Implement some dag combines that allow doing fneg/fabs/fcopysign in integer registers if used by a bitconvert or using a bitconvert. This allows us to avoid constant pool loads and use cheaper integer instructions when the values come from or end up in integer regs anyway. For example, we now compile CodeGen/X86/fp-in-intregs.ll to: _test1: movl $2147483648, %eax xorl 4(%esp), %eax ret _test2: movl $1065353216, %eax orl 4(%esp), %eax andl $3212836864, %eax ret Instead of: _test1: movss 4(%esp), %xmm0 xorps LCPI2_0, %xmm0 movd %xmm0, %eax ret _test2: movss 4(%esp), %xmm0 andps LCPI3_0, %xmm0 movss LCPI3_1, %xmm1 andps LCPI3_2, %xmm1 orps %xmm0, %xmm1 movd %xmm1, %eax ret bitconverts can happen due to various calling conventions that require fp values to passed in integer regs in some cases, e.g. when returning a complex. llvm-svn: 46414	2008-01-27 17:42:27 +00:00
Chris Lattner	e30e33af4f	Infer alignment of loads and increase their alignment when we can tell they are from the stack. This allows us to compile stack-align.ll to: _test: movsd LCPI1_0, %xmm0 movapd %xmm0, %xmm1 * andpd 4(%esp), %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret instead of: _test: movsd LCPI1_0, %xmm0 movsd 4(%esp), %xmm1 ** andpd %xmm0, %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret llvm-svn: 46401	2008-01-26 19:45:50 +00:00
Chris Lattner	31e9edce1c	Fix some bugs in SimplifyNodeWithTwoResults where it would call deletenode to delete a node even if it was not dead in some cases. Instead, just add it to the worklist. Also, make sure to use the CombineTo methods, as it was doing things that were unsafe: the top level combine loop could touch dangling memory. This fixes CodeGen/Generic/2008-01-25-dag-combine-mul.ll llvm-svn: 46384	2008-01-26 01:09:19 +00:00
Chris Lattner	cb3cf546c3	reduce indentation llvm-svn: 46377	2008-01-25 23:34:24 +00:00
Chris Lattner	2d7a830ff3	Add skeletal code to increase the alignment of loads and stores when we can infer it. This will eventually help stuff, though it doesn't do much right now because all fixed FI's have an alignment of 1. llvm-svn: 46349	2008-01-25 07:20:16 +00:00
Chris Lattner	34ed27c46d	clarify a comment, thanks Duncan. llvm-svn: 46313	2008-01-24 17:10:01 +00:00
Chris Lattner	e97fa8cdf0	Fix this buggy transformation. Two observations: 1. we already know the value is dead, so don't bother replacing it with undef. 2. The very case the comment describes actually makes the load live which asserts in deletenode. If we do the replacement and the node becomes live, just treat it as new. This fixes a failure on X86/2008-01-16-InvalidDAGCombineXform.ll with some local changes in my tree. llvm-svn: 46306	2008-01-24 07:57:06 +00:00
Chris Lattner	d66eac62fd	The dag combiner is missing revisiting nodes that it really should, and thus leaving dead stuff around. This gets fed into the isel pass and causes certain foldings from happening because nodes have extraneous uses floating around. For example, if we turned foo(bar(x)) -> baz(x), we sometimes left bar(x) around. llvm-svn: 46305	2008-01-24 07:18:21 +00:00
Chris Lattner	0feb1b0f84	fold fp_round(fp_round(x)) -> fp_round(x). llvm-svn: 46304	2008-01-24 06:45:35 +00:00
Chris Lattner	1ea55cf816	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	7eabed3521	code cleanups, no functionality change. llvm-svn: 46126	2008-01-17 07:20:38 +00:00
Chris Lattner	72733e573b	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00

... 3 4 5 6 7 ...

817 Commits