Commit Graph

2762 Commits

Author SHA1 Message Date
David Greene eb103c404b Shorten up this testcase.
llvm-svn: 93187
2010-01-11 21:50:35 +00:00
Evan Cheng 7bdf339602 Revert 93158. It's breaking quite a few x86_64 tests.
llvm-svn: 93185
2010-01-11 21:13:41 +00:00
Jakob Stoklund Olesen d2a1bee2d4 Avoid adding PHI arguments for a predecessor that has gone away when a BRCOND was constant folded.
This fixes PR5980.

llvm-svn: 93184
2010-01-11 21:02:33 +00:00
Dan Gohman e99a3c191e Use a 32-bit and with implicit zero-extension instead of a 64-bit and if it
has an immediate with at least 32 bits of leading zeros, to avoid needing to
materialize that immediate in a register first.

FileCheckize, tidy, and extend a testcase to cover this case.

This fixes rdar://7527390.

llvm-svn: 93160
2010-01-11 17:58:34 +00:00
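
A minimal LLVM IR sketch of the pattern this peephole targets (a hypothetical function, not the commit's FileChecked testcase): because the immediate's upper 32 bits are zero, the backend can use a 32-bit andl, whose result is implicitly zero-extended to 64 bits, instead of materializing the constant in a register for a 64-bit andq.

define i64 @mask_low_bits(i64 %x) {
entry:
  ; 0x0FFFFFFF: the immediate has more than 32 leading zero bits
  %masked = and i64 %x, 268435455
  ret i64 %masked
}
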
Dan Gohman 3a55686345 Re-instate MOV64r0 and MOV16r0, with adjustments to work with the
new AsmPrinter. This is perhaps less elegant than describing them
in terms of MOV32r0 and subreg operations, but it allows the
current register allocator to rematerialize them.

llvm-svn: 93158
2010-01-11 17:37:57 +00:00
Dan Gohman 31e8637ac2 Generalize this check to avoid depending on a specific register assignment.
llvm-svn: 93157
2010-01-11 17:24:27 +00:00
Dan Gohman 355ebc7f58 Make this test less trivial, to avoid spurious failures.
llvm-svn: 93156
2010-01-11 17:23:56 +00:00
Evan Cheng 64d9f40557 Select an OR with immediate as an ADD if the input bits are known zero. This allows the instruction to be 3-addressified if needed.
llvm-svn: 93152
2010-01-11 17:03:47 +00:00
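
A small illustrative IR function (hypothetical, not from the commit): the shl guarantees the low four bits are zero, so the or cannot carry and is equivalent to an add; selecting it as an ADD lets the two-address pass rewrite it as a three-address lea when the source register is still needed.

define i32 @or_known_zero(i32 %a) {
entry:
  %x = shl i32 %a, 4        ; low four bits of %x are known zero
  %y = or i32 %x, 7         ; no overlap with the known-zero bits, so it is really an add
  ret i32 %y
}
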
David Greene 206351a1ff Implement a feature (-vector-unaligned-mem) to allow targets to
ignore alignment requirements for SIMD memory operands.  This
is useful on architectures like the AMD 10h that do not trap on
unaligned references if a status bit is twiddled at startup time.

llvm-svn: 93151
2010-01-11 16:29:42 +00:00
Jeffrey Yasskin bb857e5d68 Fix http://llvm.org/PR5729: x86-64 tail calls were putting their targets into
R11, and then asserting that the target was in R9.  Since R9 isn't reserved for
the target anymore, and is used as an argument, this patch changes the
assertion.

llvm-svn: 93065
2010-01-09 18:56:43 +00:00
Dan Gohman 6bd3ef82ff Revert an earlier change to SIGN_EXTEND_INREG for vectors. The VTSDNode
really does need to be a vector type, because
TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type,
and it needs to be able to distinguish between vectors and scalars.

Also, fix some more issues with legalization of vector casts.

llvm-svn: 93043
2010-01-09 02:13:55 +00:00
Evan Cheng cc6d56bd3b Fix a critical bug in 64-bit atomic operation lowering for 32-bit. The results of the cmpxchg8b instructions are being thrown away when it branches back to the top of the checking loop. This means the loop always compares against the old value and this can result in a deadlock.
llvm-svn: 93028
2010-01-08 23:41:50 +00:00
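
For context, the cmpxchg8b retry loop described above is what a 64-bit atomic update lowers to on 32-bit x86. A minimal sketch in present-day IR syntax (the atomicrmw instruction postdates this commit, which dealt with the llvm.atomic.* intrinsics of the time):

define i64 @increment_counter(ptr %counter) {
entry:
  ; on 32-bit x86 this becomes a cmpxchg8b loop that retries until it succeeds
  %old = atomicrmw add ptr %counter, i64 1 seq_cst
  ret i64 %old
}
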
Evan Cheng 58ec4fec88 ReplaceAllUsesOfValueWith may delete nodes other than the one being replaced. Do not delete dead nodes again.
llvm-svn: 92988
2010-01-08 02:36:12 +00:00
Chris Lattner dab2cd543f Fix rdar://7517201, a regression introduced by r92849.
When folding an and(any_ext(load)), both the any_ext and the
load must have only a single use.

This removes the anyext-uses.ll testcase which started failing
because it is unreduced and unclear what it is testing.

llvm-svn: 92950
2010-01-07 21:59:23 +00:00
Evan Cheng 16b75ce19c APInt'fy TargetLowering::SimplifySetCC to fix PR5963.
llvm-svn: 92943
2010-01-07 20:58:44 +00:00
Evan Cheng 90dc43fcf5 Fix a minor regression from my dag combiner changes. One more place which needs to look past truncates.
llvm-svn: 92885
2010-01-07 00:54:06 +00:00
Jakob Stoklund Olesen f1522d612f Add comments.
llvm-svn: 92883
2010-01-07 00:51:04 +00:00
Jakob Stoklund Olesen 29a64c9575 Add Target hook to duplicate machine instructions.
Some instructions refer to unique labels, and so cannot be trivially cloned
with CloneMachineInstr.

llvm-svn: 92873
2010-01-06 23:47:07 +00:00
Evan Cheng 166a4e6caa Teach dag combine to fold the following transformation more aggressively:
(OP (trunc x), (trunc y)) -> (trunc (OP x, y))

Unfortunately, this simple change causes dag combine to loop infinitely. The problem is that the shrink-demanded-ops optimization tends to canonicalize expressions in the opposite manner. This patch disables those optimizations in dag combine; instead, they are done as a late pass in sdisel.

This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look past ISD::TRUNCATE in various places.

llvm-svn: 92849
2010-01-06 19:38:29 +00:00
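
A sketch of the IR shape behind the transformation (hypothetical function): rather than truncating both operands and adding at the narrow width, the combiner performs the add at the wide width and truncates once.

define i32 @add_of_truncs(i64 %x, i64 %y) {
entry:
  %xt = trunc i64 %x to i32
  %yt = trunc i64 %y to i32
  ; combined to (trunc (add i64 %x, %y)) in the DAG
  %sum = add i32 %xt, %yt
  ret i32 %sum
}
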
Dan Gohman f34b289057 Move this test from test/Transforms/IndVarSimplify to
test/CodeGen/X86, as it doesn't use -indvars, and it does use
llc -march=x86-64.

llvm-svn: 92799
2010-01-05 22:52:54 +00:00
Bill Wendling 03f0af372c Don't assign the shift the same type as the variable being shifted. This could
result in illegal types for the SHL operator.

llvm-svn: 92797
2010-01-05 22:39:10 +00:00
Dan Gohman fb4193625a Delete useless trailing semicolons.
llvm-svn: 92740
2010-01-05 17:55:26 +00:00
Dan Gohman 8c63ee7e28 Make this test more portable.
llvm-svn: 92514
2010-01-04 21:23:34 +00:00
Dan Gohman 52183c3cc9 Add some tests and update an existing test to reflect recent
x86 isel peeps.

llvm-svn: 92509
2010-01-04 20:53:54 +00:00
Anton Korobeynikov d91a14dba5 Fix invalid chain folding for memory variant of sdiv / udiv
llvm-svn: 92472
2010-01-04 10:31:54 +00:00
Chris Lattner 1dae8766b1 fix PR5930, allowing the asmprinter to emit the difference between
two labels as a truncate.

llvm-svn: 92455
2010-01-03 18:33:18 +00:00
Chris Lattner f6a585fc2f add PR#
llvm-svn: 92451
2010-01-03 18:10:58 +00:00
Chris Lattner a7cfc43af8 differences between two blockaddresses don't cause a
global variable initializer to require relocations.

llvm-svn: 92450
2010-01-03 18:09:40 +00:00
Chris Lattner 909c71c96a allow this to work on linux hosts.
llvm-svn: 92407
2010-01-02 00:22:15 +00:00
Chris Lattner 1eea3b0ada Teach codegen to handle:
(X != null) | (Y != null) --> (X|Y) != 0
 (X == null) & (Y == null) --> (X|Y) == 0

so that instcombine can stop doing this for pointers.  This is part of PR3351,
which is a case where instcombine doing this for pointers (inserting ptrtoint)
is pessimizing code.

llvm-svn: 92406
2010-01-02 00:00:03 +00:00
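
A sketch of the IR instcombine can now leave alone (hypothetical function, written in current opaque-pointer syntax): codegen lowers the pair of null checks as a single or of the two pointers compared against zero, so instcombine no longer has to insert ptrtoint casts to get the same code.

define i1 @both_null(ptr %p, ptr %q) {
entry:
  %p.null = icmp eq ptr %p, null
  %q.null = icmp eq ptr %q, null
  ; lowered as (p | q) == 0 rather than two separate tests
  %both = and i1 %p.null, %q.null
  ret i1 %both
}
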
Chris Lattner 6eef072eb6 rename file.
llvm-svn: 92405
2010-01-01 23:55:04 +00:00
Chris Lattner 39f18e545e Teach codegen to lower llvm.powi to an efficient (but not optimal)
multiply sequence when the power is a constant integer.  Before, our
codegen for std::pow(.., int) always turned into a libcall, which was
really inefficient.

This should also make many gfortran programs happier I'd imagine.

llvm-svn: 92388
2010-01-01 03:32:16 +00:00
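
A sketch of the kind of call this affects (hypothetical function; the intrinsic is spelled with the single type suffix used at the time): with a constant power, codegen can expand the call into a short multiply chain, e.g. x^5 = ((x*x)*(x*x))*x, three multiplies instead of a libcall.

declare double @llvm.powi.f64(double, i32)

define double @pow5(double %x) {
entry:
  ; constant power: expandable to a multiply sequence
  %r = call double @llvm.powi.f64(double %x, i32 5)
  ret double %r
}
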
Chris Lattner 5967840a5f Make this more likely to generate a libcall.
llvm-svn: 92387
2010-01-01 03:26:51 +00:00
Sanjiv Gupta 015215ca86 Extern declaration for unordered.f32 libcall was not being emitted. Fixed that.
llvm-svn: 92242
2009-12-29 03:24:34 +00:00
Sanjiv Gupta 1ecffe13b2 Fixed llc crash for zext (i1 -> i8) loads.
llvm-svn: 92201
2009-12-28 04:53:24 +00:00
Chris Lattner f5e3ed64d5 handle equality memcmp of 8 bytes on x86-64 with two unaligned loads and a
compare.  On other targets we end up with a call to memcmp because we don't
want 16 individual byte loads.  We should be able to use movups as well, but
we're failing to select the generated icmp.

llvm-svn: 92107
2009-12-24 01:07:17 +00:00
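
A sketch of the equality-only memcmp pattern (hypothetical function, current pointer syntax): on x86-64 the call becomes two unaligned 8-byte loads and a single compare.

declare i32 @memcmp(ptr, ptr, i64)

define i1 @eight_bytes_equal(ptr %a, ptr %b) {
entry:
  ; only the ==/!= result is used, so the call can be replaced
  ; by two i64 loads and an icmp on x86-64
  %res = call i32 @memcmp(ptr %a, ptr %b, i64 8)
  %eq = icmp eq i32 %res, 0
  ret i1 %eq
}
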
Chris Lattner 1a32ede6fd move an optimization for memcmp out of simplifylibcalls and into
SDISel.  This optimization was causing simplifylibcalls to 
introduce type-unsafe nastiness.  This is the first step, I'll be 
expanding the memcmp optimizations shortly, covering things that
we really really wouldn't want simplifylibcalls to do.

llvm-svn: 92098
2009-12-24 00:37:38 +00:00
Sanjiv Gupta cd419eebce Reapply 91904.
llvm-svn: 91996
2009-12-23 11:19:09 +00:00
Sanjiv Gupta 6920c17f1f deleting empty file.
llvm-svn: 91994
2009-12-23 10:35:24 +00:00
Sanjiv Gupta f7b4f89588 Reverting 91904.
llvm-svn: 91993
2009-12-23 09:46:01 +00:00
Dale Johannesen a864a67185 Use more sensible type for flags in asms. PR 5570.
Patch by Sylve`re Teissier (sorry, ASCII only).

llvm-svn: 91988
2009-12-23 07:32:51 +00:00
Eric Christopher fdb33458fc Update objectsize intrinsic and associated dependencies. Fix
lowering code and update testcases.

llvm-svn: 91979
2009-12-23 02:51:48 +00:00
Anton Korobeynikov ef3fdc1cbd Add testcase for PR5703
llvm-svn: 91931
2009-12-22 22:37:23 +00:00
Evan Cheng 71d7eaa87e Remove target attribute break-sse-dep. Instead, do not fold loads into SSE partial-update instructions unless optimizing for size.
llvm-svn: 91910
2009-12-22 17:47:23 +00:00
Sanjiv Gupta 8c5f05fcee While converting one of the operands to a memory operand, we need to check that it is Legal and does not result in a cyclic dependency.
llvm-svn: 91904
2009-12-22 14:25:37 +00:00
Sanjiv Gupta 8ac077df57 Emit the direction operand in binary insns that store to memory.
llvm-svn: 91777
2009-12-19 13:52:01 +00:00
Sanjiv Gupta bda8002e7f Test cases for changes done in 91768.
llvm-svn: 91773
2009-12-19 11:38:14 +00:00
Evan Cheng b175de6356 Increase opportunities to optimize (brcond (srl (and c1), c2)).
llvm-svn: 91717
2009-12-18 21:31:31 +00:00
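
A sketch of the kind of IR that gives rise to this node pattern (hypothetical function, current pointer syntax): testing a single bit of a flags word produces an and, a right shift, and a conditional branch.

define void @maybe_set(i32 %flags, ptr %out) {
entry:
  %masked = and i32 %flags, 16          ; isolate bit 4
  %bit = lshr i32 %masked, 4            ; the srl of the masked value
  %is.set = icmp ne i32 %bit, 0
  br i1 %is.set, label %set, label %done

set:
  store i32 1, ptr %out
  br label %done

done:
  ret void
}
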
Evan Cheng 4cf30b72bf On recent Intel u-arch's, folding loads into some unary SSE instructions can
be non-optimal. To be precise, we should avoid folding loads if the instructions
only update part of the destination register, and the non-updated part is not
needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks
the partial register dependency and it can improve performance. e.g.

movss (%rdi), %xmm0
cvtss2sd %xmm0, %xmm0

instead of
cvtss2sd (%rdi), %xmm0

An alternative method to break dependency is to clear the register first. e.g.
xorps %xmm0, %xmm0
cvtss2sd (%rdi), %xmm0

llvm-svn: 91672
2009-12-18 07:40:29 +00:00
Dan Gohman 51fbfb726f Tidy up this testcase and add test for tailcall optimization
with unreachable.

llvm-svn: 91650
2009-12-18 01:05:06 +00:00