This new version is much more aggressive about doing "full" reduction in
cases where it reduces register pressure, and also more aggressive about
rewriting induction variables to count down (or up) to zero when doing so
reduces register pressure.
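As a rough illustration (hypothetical IR with made-up value names): a loop whose only remaining use of the induction variable is the exit test, e.g.
%i.next = add i64 %i, 1
%exitcond = icmp eq i64 %i.next, %n
keeps %n live across the whole loop just for the compare; rewritten to count down,
%j.next = add i64 %j, -1
%exitcond = icmp eq i64 %j.next, 0
the compare is against zero and %n no longer needs to stay in a register inside the loop.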
It currently uses fairly simplistic algorithms for finding reuse
opportunities, but it introduces a new framework that allows it to combine
multiple strategies at once to form hybrid solutions, instead of doing
all full-reduction or all base+index.
llvm-svn: 94061
of int initializers), change some methods to be static functions,
use raw_ostream::write_hex instead of a smallstring dance with
APValue::toStringUnsigned(S, 16).
llvm-svn: 93991
doing global variable classification anymore) and hookized, sink almost
all target-specific global variable emission code into AsmPrinter and out
of each target.
Some notes:
1. PIC16 does completely custom and crazy stuff, so it is not changed.
2. XCore has some custom handling for extra directives. I'll look at it next.
3. This switches linux/ppc to use .globl instead of .global. If .globl is
actually wrong, let me know and I'll fix it.
4. This makes linux/ppc get a lot of random cases right which were obviously
wrong before; it is probably now a bit healthier.
5. Blackfin will probably start getting .comm and other things that it didn't
before. If this is undesirable, it should explicitly opt out of these
things by clearing the relevant fields of MCAsmInfo.
This leads to a nice diffstat:
14 files changed, 127 insertions(+), 830 deletions(-)
llvm-svn: 93858
GCC would put weak zero-initialized mutable data in the .bss section, whereas
we would put it into a crazy '.gnu.linkonce.b.test,"aw",@nobits'
section. Fixing this will allow simplifications next up.
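For instance, a weak zero-initialized global along the lines of
@test = weak global i32 0
should simply end up in .bss rather than in its own .gnu.linkonce.b.test section.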
llvm-svn: 93844
Instcombine does this, but apparently there are situations where this pattern will escape the optimizer and / or be created by isel. Here is a case that's seen in JavaScriptCore:
%t1 = sub i32 0, %a
%t2 = add i32 %t1, -1
The dag combiner pattern: ((c1-A)+c2) -> (c1+c2)-A
will fold it to -1 - %a.
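In IR terms, the folded result is just a single instruction along the lines of:
%t2 = sub i32 -1, %a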
llvm-svn: 93773
adding an "i" to the suffix, indicating that the elements are integers, is
accepted but not part of the standard syntax. This helps us pass a few more
of the Neon tests from gcc.
llvm-svn: 93677
different BlockAddress labels, but nothing semantically important.
Add a FIXME that BlockAddress codegen is broken if the LLVM BB has
an empty name (e.g. strip was run).
llvm-svn: 93303
For now, this pass is fairly conservative. It only performs the replacement when both the pre- and post-extension values are used in the block. It will miss cases where the post-extension values are live, but not used.
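A rough sketch of the case it does handle (illustrative IR with made-up names; the pass itself runs on machine instructions):
%e = sext i16 %v to i32
%a = add i32 %e, 1    ; post-extension value used
%b = add i16 %v, 1    ; pre-extension value used in the same block
Here the later use of the pre-extension value %v can be rewritten in terms of the low 16 bits of the register holding %e.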
llvm-svn: 93278
has an immediate with at least 32 bits of leading zeros, to avoid needing to
materialize that immediate in a register first.
FileCheckize, tidy, and extend a testcase to cover this case.
This fixes rdar://7527390.
llvm-svn: 93160
new AsmPrinter. This is perhaps less elegant than describing them
in terms of MOV32r0 and subreg operations, but it allows the
current register allocator to rematerialize them.
llvm-svn: 93158
ignore alignment requirements for SIMD memory operands. This
is useful on architectures like the AMD 10h that do not trap on
unaligned references if a status bit is twiddled at startup time.
llvm-svn: 93151
R11, and then asserting that the target was in R9. Since R9 isn't reserved for
the target anymore, and is used as an argument, this patch changes the
assertion.
llvm-svn: 93065
really does need to be a vector type, because
TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type,
and it needs to be able to distinguish between vectors and scalars.
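For reference, one typical way such a node arises is from a vector sext of a trunc, e.g. (hypothetical names):
%t = trunc <4 x i32> %x to <4 x i16>
%s = sext <4 x i16> %t to <4 x i32>
which dag combine can turn into a SIGN_EXTEND_INREG of %x whose extra type operand should be <4 x i16>, not plain i16.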
Also, fix some more issues with legalization of vector casts.
llvm-svn: 93043
When folding an and(any_ext(load)), both the any_ext and the
load have to have only a single use.
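A sketch of why (hypothetical IR; the fold itself happens on the dag, where the extension is an any_ext):
%v = load i8* %p
%e = zext i8 %v to i32
%a = and i32 %e, 15
%b = add i32 %e, 1    ; second use of the extended value
If %e (or the load) has another user like %b, folding the load and extension into the and does not remove them; they are still needed for the other use.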
This removes the anyext-uses.ll testcase, which had started failing,
because it is unreduced and it is unclear what it is testing.
llvm-svn: 92950
(OP (trunc x), (trunc y)) -> (trunc (OP x, y))
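For example, a direct instance of the pattern (hypothetical names):
%x32 = trunc i64 %x to i32
%y32 = trunc i64 %y to i32
%r = and i32 %x32, %y32
can instead do the and on the i64 values and truncate the result once.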
Unfortunately this simple change causes dag combine to loop infinitely. The problem is the shrink demanded ops optimization tends to canonicalize expressions in the opposite manner. That is badness. This patch disables those optimizations in dag combine; instead they are done as a late pass in sdisel.
This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look past ISD::TRUNCATE in various places.
llvm-svn: 92849
(X != null) | (Y != null) --> (X|Y) != 0
(X == null) & (Y == null) --> (X|Y) == 0
so that instcombine can stop doing this for pointers. This is part of PR3351,
which is a case where instcombine doing this for pointers (inserting ptrtoint)
is pessimizing code.
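For example, IR along these lines (hypothetical names):
%c1 = icmp ne i8* %X, null
%c2 = icmp ne i8* %Y, null
%c = or i1 %c1, %c2
can now be handled during codegen directly, so instcombine does not need to introduce ptrtoint casts to get the (X|Y) != 0 form.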
llvm-svn: 92406
multiply sequence when the power is a constant integer. Before, our
codegen for std::pow(.., int) always turned into a libcall, which was
really inefficient.
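For example (assuming the powi intrinsic form that an integer-power std::pow call lowers to):
%p = call double @llvm.powi.f64(double %x, i32 4)
can be emitted as two multiplies instead of a libcall:
%x2 = fmul double %x, %x
%p = fmul double %x2, %x2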
This should also make many gfortran programs happier I'd imagine.
llvm-svn: 92388
compare. On other targets we end up with a call to memcmp because we don't
want 16 individual byte loads. We should be able to use movups as well, but
we're failing to select the generated icmp.
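A sketch of the kind of input involved (hypothetical IR):
%r = call i32 @memcmp(i8* %a, i8* %b, i64 16)
%c = icmp eq i32 %r, 0
The intent is to do the comparison with a pair of 16-byte loads (movups when the data may be unaligned) rather than a libcall or 16 byte-sized loads, once the generated icmp can be selected.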
llvm-svn: 92107
SDISel. This optimization was causing simplifylibcalls to
introduce type-unsafe nastiness. This is the first step, I'll be
expanding the memcmp optimizations shortly, covering things that
we really really wouldn't want simplifylibcalls to do.
llvm-svn: 92098
be non-optimal. To be precise, we should avoid folding loads if the instructions
only update part of the destination register, and the non-updated part is not
needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks
the partial register dependency and it can improve performance. e.g.
movss (%rdi), %xmm0
cvtss2sd %xmm0, %xmm0
instead of
cvtss2sd (%rdi), %xmm0
An alternative method to break dependency is to clear the register first. e.g.
xorps %xmm0, %xmm0
cvtss2sd (%rdi), %xmm0
llvm-svn: 91672