Commit Graph

1239 Commits

Author SHA1 Message Date
Chris Lattner 5ab6d8b3fc Eliminate more token factors by taking advantage of transitivity:
if TF depends on A and B, and A depends on B, TF just needs to depend on
A.  With Jim's alias-analysis stuff enabled, this compiles the testcase in
PR892 into:

__Z4test3Val:
        subl $44, %esp
        call L__Z3foov$stub
        movl %edx, 28(%esp)
        movl %eax, 32(%esp)
        movl %eax, 24(%esp)
        movl %edx, 36(%esp)
        movl 52(%esp), %ecx
        movl %ecx, 4(%esp)
        movl %eax, 8(%esp)
        movl %edx, 12(%esp)
        movl 48(%esp), %eax
        movl %eax, (%esp)
        call L__Z3bar3ValS_$stub
        addl $44, %esp
        ret

instead of:

__Z4test3Val:
        subl $44, %esp
        call L__Z3foov$stub
        movl %eax, 24(%esp)
        movl %edx, 28(%esp)
        movl 24(%esp), %eax
        movl %eax, 32(%esp)
        movl 28(%esp), %eax
        movl %eax, 36(%esp)
        movl 32(%esp), %eax
        movl 36(%esp), %ecx
        movl 52(%esp), %edx
        movl %edx, 4(%esp)
        movl %eax, 8(%esp)
        movl %ecx, 12(%esp)
        movl 48(%esp), %eax
        movl %eax, (%esp)
        call L__Z3bar3ValS_$stub
        addl $44, %esp
        ret

llvm-svn: 30821
2006-10-08 22:57:01 +00:00
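
A minimal illustration of that pruning with a made-up Node type (not the
actual DAGCombiner code): an operand of a token factor is redundant
whenever another operand already transitively depends on it.

#include <cassert>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for a DAG node: each node lists the nodes it
// depends on (its operands).  Not the real SDNode class.
struct Node {
  std::vector<Node*> Deps;
};

// Does From transitively depend on To?
static bool dependsOn(const Node *From, const Node *To,
                      std::unordered_set<const Node*> &Seen) {
  if (From == To) return true;
  if (!Seen.insert(From).second) return false;  // already visited
  for (const Node *D : From->Deps)
    if (dependsOn(D, To, Seen))
      return true;
  return false;
}

// Drop token-factor operand B when some other operand A already depends
// on B: keeping A alone is enough to order B's effects.
static void pruneTokenFactor(std::vector<Node*> &Ops) {
  std::vector<Node*> Kept;
  for (Node *B : Ops) {
    bool Redundant = false;
    for (Node *A : Ops) {
      if (A == B) continue;
      std::unordered_set<const Node*> Seen;
      if (dependsOn(A, B, Seen)) { Redundant = true; break; }
    }
    if (!Redundant)
      Kept.push_back(B);
  }
  Ops = Kept;
}

int main() {
  Node nB, nA;
  nA.Deps.push_back(&nB);                  // A depends on B
  std::vector<Node*> TF = {&nA, &nB};      // TF depends on A and B
  pruneTokenFactor(TF);
  assert(TF.size() == 1 && TF[0] == &nA);  // TF just needs to depend on A
  return 0;
}
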
Jim Laskey 0463e08005 Combiner alias analysis passes MultiSource (release-asserts).
llvm-svn: 30818
2006-10-07 23:37:56 +00:00
Chris Lattner f9f90bc239 Fix a bug legalizing zero-extending i64 loads into 32-bit loads. The bottom
part was always forced to be a sextload, even when we needed a zextload.

llvm-svn: 30782
2006-10-07 00:58:36 +00:00
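
For reference, the two expansions written as plain C++ (hypothetical
function names, intended semantics only): a zero-extending i64 load must
clear the high word, while a sign-extending one replicates the sign bit;
the bug applied the second shape to the first case.

#include <cassert>
#include <cstdint>

uint64_t zext_load_i8_to_i64(const int8_t *p) {
  uint32_t lo = (uint8_t)*p;            // zextload i8 -> i32 (the fixed path)
  uint32_t hi = 0;                      // high word is all zeros
  return ((uint64_t)hi << 32) | lo;
}

int64_t sext_load_i8_to_i64(const int8_t *p) {
  int32_t lo = *p;                      // sextload i8 -> i32
  uint32_t hi = (uint32_t)(lo >> 31);   // high word replicates the sign bit
  return (int64_t)(((uint64_t)hi << 32) | (uint32_t)lo);
}

int main() {
  int8_t v = -1;
  assert(zext_load_i8_to_i64(&v) == 0xFF);  // zext: high bits clear
  assert(sext_load_i8_to_i64(&v) == -1);    // sext: high bits set
  return 0;
}
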
Chris Lattner a389a612bb initialize ivar
llvm-svn: 30780
2006-10-06 22:52:08 +00:00
Chris Lattner 9d75324ddf Jump tables handle PIC
llvm-svn: 30776
2006-10-06 22:32:29 +00:00
Chris Lattner f5839a0816 Fix a miscompilation of:
long long foo(long long X) {
  return (long long)(signed char)(int)X;
}

Instead of:

_foo:
        extsb r2, r4
        srawi r3, r4, 31
        mr r4, r2
        blr

we now produce:

_foo:
        extsb r4, r4
        srawi r3, r4, 31
        blr

This fixes a miscompilation in ConstantFolding.cpp.

llvm-svn: 30768
2006-10-06 17:34:12 +00:00
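
The reference semantics of the test case, written out in C++ (an
illustration, not the compiler change itself): the inner casts truncate
X to 8 bits and the outer cast sign-extends back, which is why one extsb
on the original register plus a srawi for the sign word suffices.

#include <cassert>
#include <cstdint>

int64_t foo(int64_t X) {
  int8_t c = (int8_t)(int32_t)X;  // (signed char)(int)X: truncate to 8 bits
  return (int64_t)c;              // sign-extend back to long long
}

int main() {
  assert(foo(0x1FF) == -1);    // low byte 0xFF sign-extends to -1
  assert(foo(0x17F) == 0x7F);  // low byte 0x7F stays positive
  return 0;
}
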
Evan Cheng df9ac47e5e Make use of getStore().
llvm-svn: 30759
2006-10-05 23:01:46 +00:00
Evan Cheng af309d29b1 Add getStore() helper function to create ISD::STORE nodes.
llvm-svn: 30758
2006-10-05 22:57:11 +00:00
Jim Laskey 6549d22ef9 Alias analysis code cleanups.
llvm-svn: 30753
2006-10-05 15:07:25 +00:00
Evan Cheng f80dfa83a0 Fix some typos that can cause a flag value to have more than one use.
llvm-svn: 30727
2006-10-04 22:23:53 +00:00
Jim Laskey 708d0db2d8 More extensive alias analysis.
llvm-svn: 30721
2006-10-04 16:53:27 +00:00
Evan Cheng 5d9fd977d3 Combine ISD::EXTLOAD, ISD::SEXTLOAD, ISD::ZEXTLOAD into ISD::LOADX. Add an
extra operand to LOADX to specify the exact value extension type.

llvm-svn: 30714
2006-10-04 00:56:09 +00:00
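
A compilable sketch of the design with made-up type names (modern LLVM
spells the extension kind ISD::LoadExtType): one extending-load node
that records its extension kind replaces three near-identical opcodes.

enum class ExtType { Any, Sign, Zero };  // was EXTLOAD / SEXTLOAD / ZEXTLOAD

struct LoadXNode {
  ExtType Ext;       // the new extra operand: exact value extension type
  unsigned MemBits;  // width of the value in memory
  unsigned ResBits;  // width of the loaded result
};

int main() {
  LoadXNode L{ExtType::Zero, 8, 32};  // what used to be ISD::ZEXTLOAD i8->i32
  (void)L;
  return 0;
}
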
Evan Cheng 91d76cb27f Fix an obvious typo.
llvm-svn: 30711
2006-10-03 23:08:27 +00:00
Jim Laskey e73a22514d Debugging cruft
llvm-svn: 30688
2006-10-02 13:01:17 +00:00
Jim Laskey 1368c265da Add ability to annotate (color) nodes in a viewGraph.
llvm-svn: 30686
2006-10-02 12:26:53 +00:00
Chris Lattner a9caf95591 refactor critical edge breaking out into the SplitCritEdgesForPHIConstants method.
This is a baby step towards fixing PR925.

llvm-svn: 30643
2006-09-28 06:17:10 +00:00
Andrew Lenharth c19ef92403 Comments on JumpTableness
llvm-svn: 30615
2006-09-26 20:02:30 +00:00
Jim Laskey 60832693a7 Load chain check is not needed
llvm-svn: 30613
2006-09-26 17:44:58 +00:00
Jim Laskey dde51671e5 Chain can be any operand
llvm-svn: 30611
2006-09-26 09:32:41 +00:00
Jim Laskey 5f3e0af9d0 Wrong size for load
llvm-svn: 30610
2006-09-26 08:14:06 +00:00
Jim Laskey b4a864d533 Can't move a load node if its chain is not used.
llvm-svn: 30609
2006-09-26 07:37:42 +00:00
Jim Laskey 7aa0638aa9 Fix accidental enabling of bad code
llvm-svn: 30601
2006-09-25 21:11:32 +00:00
Jim Laskey b5534e5c28 Fix chain dropping in load and drop unused stores in ret blocks.
llvm-svn: 30600
2006-09-25 19:32:58 +00:00
Jim Laskey d07be232ba Core antialiasing for load and store.
llvm-svn: 30597
2006-09-25 16:29:54 +00:00
Andrew Lenharth 783a4a9d86 Add support for other relocation bases to jump tables, as well as custom asm directives
llvm-svn: 30593
2006-09-24 19:45:58 +00:00
Evan Cheng 77c0757f8b PIC jump table entries are always 32-bit. This fixes PIC jump table support on X86-64.
llvm-svn: 30590
2006-09-24 05:22:38 +00:00
Evan Cheng 449a0c7e33 Make it work for DAG combine of multi-value nodes.
llvm-svn: 30573
2006-09-21 19:04:05 +00:00
Jim Laskey 35f7eebb49 core corrections
llvm-svn: 30570
2006-09-21 17:35:47 +00:00
Jim Laskey 5d19d59017 Basic "in frame" alias analysis.
llvm-svn: 30568
2006-09-21 16:28:59 +00:00
Chris Lattner 082db3f9aa fold (aext (and (trunc x), cst)) -> (and x, cst).
llvm-svn: 30561
2006-09-21 06:40:43 +00:00
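
A quick check of the identity on concrete widths (illustration only;
the any_extend is modeled as zext, one of its legal refinements): the
mask's high bits are already zero, so applying it before or after the
width change yields the same value.

#include <cassert>
#include <cstdint>

int main() {
  const uint8_t cst = 0x3C;
  for (uint32_t x : {0u, 1u, 0xABCDu, 0xFFFFFFFFu, 0x1234FF00u}) {
    uint32_t folded   = x & cst;                       // (and x, cst)
    uint32_t original = (uint32_t)(uint8_t)((uint8_t)x & cst);
    assert(folded == original);                        // same result
  }
  return 0;
}
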
Chris Lattner fa9f92cf65 Check the right value type. This fixes 186.crafty on x86
llvm-svn: 30560
2006-09-21 06:17:39 +00:00
Chris Lattner 8d8a3bf9c9 Compile:
int %test(ulong *%tmp) {
        %tmp = load ulong* %tmp         ; <ulong> [#uses=1]
        %tmp.mask = shr ulong %tmp, ubyte 50            ; <ulong> [#uses=1]
        %tmp.mask = cast ulong %tmp.mask to ubyte
        %tmp2 = and ubyte %tmp.mask, 3          ; <ubyte> [#uses=1]
        %tmp2 = cast ubyte %tmp2 to int         ; <int> [#uses=1]
        ret int %tmp2
}

to:

_test:
        movl 4(%esp), %eax
        movl 4(%eax), %eax
        shrl $18, %eax
        andl $3, %eax
        ret

instead of:

_test:
        movl 4(%esp), %eax
        movl 4(%eax), %eax
        shrl $18, %eax
        # TRUNCATE movb %al, %al
        andb $3, %al
        movzbl %al, %eax
        ret

llvm-svn: 30558
2006-09-21 06:14:31 +00:00
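
Why the shorter sequence only touches the high dword (a small
self-check, not part of the commit): a right shift by 50 reads bits
that live entirely in the upper 32 bits, so the little-endian load of
4(%eax) followed by shrl $18 (50 - 32) and andl $3 is exact.

#include <cassert>
#include <cstdint>

int main() {
  for (uint64_t x : {0ull, ~0ull, 0x0004000000000000ull}) {
    uint32_t hi = (uint32_t)(x >> 32);            // movl 4(%eax), %eax
    assert(((x >> 50) & 3) == ((hi >> 18) & 3));  // shrl $18; andl $3
  }
  return 0;
}
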
Chris Lattner a31f0a622b Generalize (zext (truncate x)) and (sext (truncate x)) folding to work when
the src/dst are not the same size.  This catches things like "truncate
32-bit X to 8 bits, then zext to 16", which happens a bit on X86.

llvm-svn: 30557
2006-09-21 06:00:20 +00:00
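
The generalized fold reduces to a mask, as this small self-check shows
for the 32-to-8-to-16-bit case mentioned above:

#include <cassert>
#include <cstdint>

int main() {
  for (uint32_t x : {0u, 0x80u, 0x1FFu, 0xDEADBEEFu}) {
    uint16_t viaTruncZext = (uint16_t)(uint8_t)x;  // trunc to 8, zext to 16
    uint16_t viaMask      = (uint16_t)(x & 0xFF);  // single mask instead
    assert(viaTruncZext == viaMask);
  }
  return 0;
}
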
Chris Lattner c8cd62d381 Compile:
int test3(int a, int b) { return (a < 0) ? a : 0; }

to:

_test3:
        srawi r2, r3, 31
        and r3, r2, r3
        blr

instead of:

_test3:
        cmpwi cr0, r3, 1
        li r2, 0
        blt cr0, LBB2_2 ;entry
LBB2_1: ;entry
        mr r3, r2
LBB2_2: ;entry
        blr


This implements: PowerPC/select_lt0.ll:seli32_a_a

llvm-svn: 30517
2006-09-20 06:41:35 +00:00
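
The branch-free identity behind the srawi/and pair, checked in C++
(arithmetic right shift of negative values is assumed, as on the
targets involved):

#include <cassert>
#include <cstdint>

int main() {
  for (int32_t a : {-2147483647, -5, -1, 0, 1, 12345}) {
    int32_t mask = a >> 31;            // srawi: all ones iff a < 0
    int32_t sel  = a & mask;           // and: selects a or 0
    assert(sel == ((a < 0) ? a : 0));  // matches the branchy select
  }
  return 0;
}
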
Chris Lattner 8746e2cd57 Fold the full generality of (any_extend (truncate x))
llvm-svn: 30514
2006-09-20 06:29:17 +00:00
Chris Lattner 8b68decb27 Two things:
1. Teach SimplifySetCC that '(srl (ctlz x), 5) == 0' is really x != 0.
2. Teach visitSELECT_CC to use SimplifySetCC instead of calling it and
   ignoring the result.  This allows us to compile:

bool %test(ulong %x) {
  %tmp = setlt ulong %x, 4294967296
  ret bool %tmp
}

to:

_test:
        cntlzw r2, r3
        cmplwi cr0, r3, 1
        srwi r2, r2, 5
        li r3, 0
        beq cr0, LBB1_2 ;
LBB1_1: ;
        mr r3, r2
LBB1_2: ;
        blr

instead of:

_test:
        addi r2, r3, -1
        cntlzw r2, r2
        cntlzw r3, r3
        srwi r2, r2, 5
        cmplwi cr0, r2, 0
        srwi r2, r3, 5
        li r3, 0
        bne cr0, LBB1_2 ;
LBB1_1: ;
        mr r3, r2
LBB1_2: ;
        blr

This isn't wonderful, but it's an improvement.

llvm-svn: 30513
2006-09-20 06:19:26 +00:00
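
Why point 1 is sound, as a self-check (ctlz is modeled via the
GCC/Clang builtin __builtin_clz, guarded because that builtin is
undefined at zero): ctlz of a 32-bit value lies in [0, 32], and only
ctlz(0) == 32 has bit 5 set.

#include <cassert>
#include <cstdint>

static uint32_t ctlz32(uint32_t x) {
  return x ? (uint32_t)__builtin_clz(x) : 32u;  // define the x == 0 case
}

int main() {
  for (uint32_t x : {0u, 1u, 2u, 0x80000000u, 0xFFFFFFFFu}) {
    bool viaCtlz = ((ctlz32(x) >> 5) == 0);  // (srl (ctlz x), 5) == 0
    assert(viaCtlz == (x != 0));
  }
  return 0;
}
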
Chris Lattner 875ea0cdbd Expand 64-bit shifts more optimally when we know whether the high bit of the
shift amount is one or zero.  For example, for:

long long foo1(long long X, int C) {
  return X << (C|32);
}

long long foo2(long long X, int C) {
  return X << (C&~32);
}

we get:

_foo1:
        movb $31, %cl
        movl 4(%esp), %edx
        andb 12(%esp), %cl
        shll %cl, %edx
        xorl %eax, %eax
        ret
_foo2:
        movb $223, %cl
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        andb 12(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        ret

instead of:

_foo1:
        subl $4, %esp
        movl %ebx, (%esp)
        movb $32, %bl
        movl 8(%esp), %eax
        movl 12(%esp), %edx
        movb %bl, %cl
        orb 16(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        xorl %ecx, %ecx
        testb %bl, %bl
        cmovne %eax, %edx
        cmovne %ecx, %eax
        movl (%esp), %ebx
        addl $4, %esp
        ret
_foo2:
        subl $4, %esp
        movl %ebx, (%esp)
        movb $223, %cl
        movl 8(%esp), %eax
        movl 12(%esp), %edx
        andb 16(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        xorl %ecx, %ecx
        xorb %bl, %bl
        testb %bl, %bl
        cmovne %eax, %edx
        cmovne %ecx, %eax
        movl (%esp), %ebx
        addl $4, %esp
        ret

llvm-svn: 30506
2006-09-20 03:38:48 +00:00
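
The foo1 case in C++ form (illustrative helper name): once bit 5 of
the shift amount is known to be set, the low word of the result is
always zero and the high word is just the low word shifted by the
remaining amount, so no cmov-based selection of word halves is needed.

#include <cassert>
#include <cstdint>

static uint64_t shl64_amount_ge_32(uint64_t x, unsigned c) {
  unsigned amt = (c | 32) & 63;     // bit 5 known set, amount in [32, 63]
  uint32_t lo  = (uint32_t)x;
  uint32_t hi  = lo << (amt & 31);  // only the low word survives
  return (uint64_t)hi << 32;        // result's low word is zero
}

int main() {
  for (unsigned c : {0u, 1u, 7u, 31u})
    for (uint64_t x : {1ull, 0x0123456789ABCDEFull})
      assert(shl64_amount_ge_32(x, c) == (x << ((c | 32) & 63)));
  return 0;
}
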
Chris Lattner 5a42ebcff3 Fold extract_element(cst) to cst
llvm-svn: 30478
2006-09-19 05:02:39 +00:00
Chris Lattner 4c059f4962 Minor speedup for legalize by avoiding some malloc traffic
llvm-svn: 30477
2006-09-19 04:51:23 +00:00
Evan Cheng 1fc7c363e6 Fix a typo.
llvm-svn: 30474
2006-09-18 23:28:33 +00:00
Evan Cheng 4bfaf0bd2c Allow i32 UDIV, SDIV, UREM, SREM to be expanded into libcalls.
llvm-svn: 30470
2006-09-18 21:49:04 +00:00
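
What "expanded into libcalls" means concretely, sketched with the
conventional libgcc/compiler-rt routine names (the exact names and
calling conventions are target- and runtime-dependent):

#include <cstdint>

extern "C" int32_t  __divsi3 (int32_t  a, int32_t  b);  // SDIV
extern "C" uint32_t __udivsi3(uint32_t a, uint32_t b);  // UDIV
extern "C" int32_t  __modsi3 (int32_t  a, int32_t  b);  // SREM
extern "C" uint32_t __umodsi3(uint32_t a, uint32_t b);  // UREM

// On a target without a hardware divider, 'a / b' lowers to an
// ordinary call to the runtime routine instead of a DIV node.
int32_t quotient(int32_t a, int32_t b) {
  return __divsi3(a, b);
}
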
Andrew Lenharth c50458fb90 absolute addresses must match pointer size
llvm-svn: 30461
2006-09-18 17:59:35 +00:00
Chris Lattner e50f5d1fb1 Oh yeah, this is needed too
llvm-svn: 30407
2006-09-16 05:08:34 +00:00
Chris Lattner 1b63391fdf simplify control flow, no functionality change
llvm-svn: 30403
2006-09-16 00:21:44 +00:00
Chris Lattner fbadbda6ba Allow custom expand of mul
llvm-svn: 30402
2006-09-16 00:09:24 +00:00
Chris Lattner 46d710e6ea Fold (X & C1) | (Y & C2) -> (X|Y) & C3 when possible.
This implements CodeGen/X86/and-or-fold.ll

llvm-svn: 30379
2006-09-14 21:11:37 +00:00
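
The simplest instance of the fold, where the two masks are equal and
factor out of the OR, saving an AND (a self-check, not the combiner's
code):

#include <cassert>
#include <cstdint>

int main() {
  const uint32_t C = 0x00FF00FF;
  for (uint32_t x : {0u, 0xDEADBEEFu})
    for (uint32_t y : {0u, 0x12345678u})
      assert(((x & C) | (y & C)) == ((x | y) & C));
  return 0;
}
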
Chris Lattner 97614c86ce Split rotate matching code out to its own function. Make it stronger by
matching things like ((x >> c1) & c2) | ((x << c3) & c4) to (rot x, c5) & c6

llvm-svn: 30376
2006-09-14 20:50:57 +00:00
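
The matched shape on one concrete instance (illustrative constants;
each mask here selects only bits its own shift can produce, which is
what lets the two masks fold into a single mask over the rotate):

#include <cassert>
#include <cstdint>

static uint32_t rotl32(uint32_t x, unsigned r) {
  return (x << r) | (x >> (32 - r));  // r in [1, 31]
}

int main() {
  uint32_t x = 0xDEADBEEF;
  unsigned c3 = 8, c1 = 32 - c3;      // opposite shift counts sum to 32
  uint32_t c2 = 0x0000000F, c4 = 0x00FF0000;
  uint32_t matched = ((x >> c1) & c2) | ((x << c3) & c4);
  uint32_t folded  = rotl32(x, c3) & (c2 | c4);  // (rot x, c5) & c6
  assert(matched == folded);
  return 0;
}
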
Chris Lattner 84cc1f7cb8 If LSR went through a lot of trouble to put constants (e.g. the addr of a global)
in a specific BB, don't undo this!  This allows us to compile
CodeGen/X86/loop-hoist.ll into:

_foo:
        xorl %eax, %eax
***     movl L_Arr$non_lazy_ptr, %ecx
        movl 4(%esp), %edx
LBB1_1: #cond_true
        movl %eax, (%ecx,%eax,4)
        incl %eax
        cmpl %edx, %eax
        jne LBB1_1      #cond_true
LBB1_2: #return
        ret

instead of:

_foo:
        xorl %eax, %eax
        movl 4(%esp), %ecx
LBB1_1: #cond_true
***     movl L_Arr$non_lazy_ptr, %edx
        movl %eax, (%edx,%eax,4)
        incl %eax
        cmpl %ecx, %eax
        jne LBB1_1      #cond_true
LBB1_2: #return
        ret

This was noticed in 464.h264ref.  This doesn't usually affect PPC,
but strikes X86 all the time.

llvm-svn: 30290
2006-09-13 06:02:42 +00:00
Chris Lattner 72b503bcad Compile X << 1 (where X is a long long) to:

        addl %ecx, %ecx
        adcl %eax, %eax

instead of:

        movl %ecx, %edx
        addl %edx, %edx
        shrl $31, %ecx
        addl %eax, %eax
        orl %ecx, %eax

and to:

        addc r5, r5, r5
        adde r4, r4, r4

instead of:

        slwi r2,r9,1
        srwi r0,r11,31
        slwi r3,r11,1
        or r2,r0,r2

on PPC.

llvm-svn: 30284
2006-09-13 03:50:39 +00:00
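
The trick in C++ form (a self-check, not the lowering code itself):
doubling a 64-bit value is an add of the low word to itself followed
by an add-with-carry of the high word, exactly addl/adcl on x86 and
addc/adde on PPC.

#include <cassert>
#include <cstdint>

int main() {
  for (uint64_t x : {0ull, 1ull, 0x80000000ull, 0xFEDCBA9876543210ull}) {
    uint32_t lo = (uint32_t)x, hi = (uint32_t)(x >> 32);
    uint32_t newLo = lo + lo;               // addl %ecx, %ecx
    uint32_t carry = (newLo < lo) ? 1 : 0;  // carry out of the low add
    uint32_t newHi = hi + hi + carry;       // adcl %eax, %eax
    assert((((uint64_t)newHi << 32) | newLo) == (x << 1));
  }
  return 0;
}
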
Evan Cheng 45fe3bc72c Added support for machine-specific constant pool values. These are useful for
representing expressions that can only be resolved at link time, etc.

llvm-svn: 30278
2006-09-12 21:00:35 +00:00