llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	7dc8d2d025	Testcase for r89415. llvm-svn: 89417	2009-11-20 00:32:16 +00:00
Dan Gohman	94e617627d	Extend CaptureTracking to indicate when a value is never stored, even if it is not ultimately captured. Teach BasicAliasAnalysis that a local object address which does not escape and is never stored does not alias with a value resulting from a load. llvm-svn: 89398	2009-11-19 21:57:48 +00:00
Dan Gohman	cbc6ebb6fd	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Daniel Dunbar	0b2099ad5f	Unbreak test, Bruno please check. llvm-svn: 89329	2009-11-19 07:18:49 +00:00
Evan Cheng	b18525937c	More consistent thumb1 asm printing. llvm-svn: 89328	2009-11-19 06:57:41 +00:00
Evan Cheng	2a6c92fcb6	Shrink ldr / str [sp, imm0-1024] to 16-bit instructions. llvm-svn: 89326	2009-11-19 06:32:27 +00:00
Bruno Cardoso Lopes	4713b282ce	- Add sugregister logic to handle f64=(f32,f32). - Support mips1 like load/store of doubles: Instead of: sdc $f0, X($3) Generate: swc $f0, X($3) swc $f1, X+4($3) llvm-svn: 89322	2009-11-19 06:06:13 +00:00
Bill Wendling	77f0ea6b93	Test from Dhrystone to make sure that we're not emitting an aligned load for a string that's aligned at 8-bytes instead of 16-bytes. llvm-svn: 89295	2009-11-19 01:33:57 +00:00
Bob Wilson	6456fb94f5	Fix buildbots. llvm-svn: 89274	2009-11-18 23:30:38 +00:00
Richard Osborne	3bd09434a6	Add XCore support for indirectbr / blockaddress. llvm-svn: 89273	2009-11-18 23:20:42 +00:00
Bob Wilson	108aadf972	Tail duplication still needs to iterate. Duplicating new instructions onto the tail of a block may make that block a new candidate for duplication. llvm-svn: 89264	2009-11-18 22:52:37 +00:00
Bill Wendling	e9e9121f94	Not all ASM has # for comments. llvm-svn: 89250	2009-11-18 21:54:13 +00:00
Jakob Stoklund Olesen	575c3f3d72	Fix PR5300. When TwoAddressInstructionPass deletes a dead instruction, make sure that all register kills are accounted for. The 2-addr register does not get special treatment. llvm-svn: 89246	2009-11-18 21:33:35 +00:00
Jakob Stoklund Olesen	4797e58d6b	Fix inverted test and add testcase from failing self-host. llvm-svn: 89167	2009-11-18 00:02:18 +00:00
Jakob Stoklund Olesen	50ee5e7ddb	Remove fragile test. llvm-svn: 89150	2009-11-17 21:52:40 +00:00
Jim Grosbach	cdde77c6a3	Enable arm jumpt table adjustment. llvm-svn: 89143	2009-11-17 21:24:11 +00:00
Anton Korobeynikov	a2873f4d59	Forgot to commit test fixes llvm-svn: 89138	2009-11-17 20:38:36 +00:00
Jakob Stoklund Olesen	fffff88a3c	Enable -split-phi-edges by default, except when -regalloc=local. The local register allocator doesn't like it when LiveVariables is run. We should also disable edge splitting under -O0, but that has to wait a bit. llvm-svn: 89125	2009-11-17 19:15:50 +00:00
Evan Cheng	ba4e5da727	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Evan Cheng	84efacfaad	Revert 89021. It's miscompiling llvm-gcc driver driver at -O0. llvm-svn: 89082	2009-11-17 09:55:52 +00:00
Jakob Stoklund Olesen	9f0d55d8d8	Enable -split-phi-edges by default llvm-svn: 89021	2009-11-17 01:07:22 +00:00
Evan Cheng	d33400e636	MOV64rm should be marked isReMaterializable. llvm-svn: 89019	2009-11-17 00:55:55 +00:00
Jim Grosbach	0ad7efbace	Convert to FileCheck llvm-svn: 89007	2009-11-17 00:20:26 +00:00
Jim Grosbach	4781c3caf8	Convert to FileCheck llvm-svn: 89002	2009-11-17 00:03:38 +00:00
Jim Grosbach	805d195649	Cleanup. Missed removing these when converting. Oops. llvm-svn: 89001	2009-11-17 00:00:33 +00:00
Dan Gohman	b43e1ff236	Fix this test - there don't appear to be any actual Reload Reuses in this testcase. llvm-svn: 88998	2009-11-16 23:49:55 +00:00
Dan Gohman	9dede3b383	Revert r87049, which was the workaround for the regression triggered by the recent FixedStackPseudoSourceValue-related changes, now that the specific bug that affected it is fixed, in r88954. llvm-svn: 88997	2009-11-16 23:43:42 +00:00
Jeffrey Yasskin	0632b53bfe	Revert the test from r88984. It relies on being able to mmap 16GB of address space (though it only uses a small fraction of that), and the buildbots disallow that. Also add a comment to the Makefile's ulimit line warning future developers that changing it won't work. llvm-svn: 88994	2009-11-16 23:32:30 +00:00
Jim Grosbach	1deb0b9f53	Convert to FileCheck llvm-svn: 88991	2009-11-16 23:19:29 +00:00
Jeffrey Yasskin	10d3604a9e	Make X86-64 in the Large model always emit 64-bit calls. The large code model is documented at http://www.x86-64.org/documentation/abi.pdf and says that calls should assume their target doesn't live within the 32-bit pc-relative offset that fits in the call instruction. To do this, we turn off the global-address->target-global-address conversion in X86TargetLowering::LowerCall(). The first attempt at this broke the lazy JIT because it can separate the movabs(imm->reg) from the actual call instruction. The lazy JIT receives the address of the movabs as a relocation and needs to record the return address from the call; and then when that call happens, it needs to patch the movabs with the newly-compiled target. We could thread the call instruction into the relocation and record the movabs<->call mapping explicitly, but that seems to require at least as much new complication in the code generator as this change. To fix this, we make lazy functions _always_ go through a call stub. You'd think we'd only have to force lazy calls through a stub on difficult platforms, but that turns out to break indirect calls through a function pointer. The right fix for that is to distinguish between calls and address-of operations on uncompiled functions, but that's complex enough to leave for someone else to do. Another attempt at this defined a new CALL64i pseudo-instruction, which expanded to a 2-instruction sequence in the assembly output and was special-cased in the X86CodeEmitter's emitInstruction() function. That broke indirect calls in the same way as above. This patch also removes a hack forcing Darwin to the small code model. Without far-call-stubs, the small code model requires things of the JITMemoryManager that the DefaultJITMemoryManager can't provide. Thanks to echristo for lots of testing! llvm-svn: 88984	2009-11-16 22:41:33 +00:00
Evan Cheng	f25ef4ffb0	- Check memoperand alignment instead of checking stack alignment. Most load / store folding instructions are not referencing spill stack slots. - Mark MOVUPSrm re-materializable. llvm-svn: 88974	2009-11-16 21:56:03 +00:00
Jim Grosbach	9b32e22ad1	Convert to FileCheck llvm-svn: 88947	2009-11-16 20:04:15 +00:00
Lang Hames	16f6b3e607	Added a testcase for PR5495. llvm-svn: 88946	2009-11-16 20:03:13 +00:00
Jim Grosbach	980d94164d	Convert to FileCheck llvm-svn: 88942	2009-11-16 19:46:46 +00:00
Jim Grosbach	c670bdc311	tbb opt off by default llvm-svn: 88921	2009-11-16 17:24:45 +00:00
David Greene	25905c8336	Support spill comments. Have the asm printer emit a comment if an instruction is a spill or reload and have the spiller mark copies it introdues so the asm printer can also annotate those. llvm-svn: 88911	2009-11-16 15:12:23 +00:00
Evan Cheng	597f7b6ee3	Check if subreg index is zero. llvm-svn: 88899	2009-11-16 06:31:49 +00:00
Evan Cheng	11bf4493d4	For some targets, a copy can use a register multiple times, e.g. ppc. llvm-svn: 88895	2009-11-16 05:52:06 +00:00
Evan Cheng	8ca5d4b9ad	xfail for now. It has been failing. llvm-svn: 88892	2009-11-16 05:44:04 +00:00
Bruno Cardoso Lopes	537e409c58	- Fix a small bug while handling target constant pools (one param was missing). - Add a smarter constant pool loading, instead of: lui $2, %hi($CPI1_0) addiu $2, $2, %lo($CPI1_0) lwc1 $f0, 0($2) Generate: lui $2, %hi($CPI1_0) lwc1 $f0, %lo($CPI1_0)($2) llvm-svn: 88886	2009-11-16 04:33:42 +00:00
Jim Grosbach	01c1cae34d	Detect need for autoalignment of the stack earlier to catch spills more conservatively. eliminateFrameIndex() machinery adjust to handle addr mode 6 (vld1/vst1) used for spills. Fix tests to expect aligned Q-reg spilling llvm-svn: 88874	2009-11-15 21:45:34 +00:00
Nick Lewycky	95148689c9	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug else. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	6a6ac7e105	Correct typo. llvm-svn: 88831	2009-11-15 06:16:57 +00:00
Nick Lewycky	e29fa4c7a1	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Jim Grosbach	f16a3b7a9f	remove xfail llvm-svn: 88817	2009-11-14 21:57:35 +00:00
Richard Osborne	d5f2745965	Add XCore support for arbitrary-sized aggregate returns. llvm-svn: 88802	2009-11-14 19:33:35 +00:00
Nick Lewycky	c53e2ecf02	Teach BasicAA that a constant expression can't alias memory provably not allocated until runtime (such as an alloca). Patch by Hans Wennborg! llvm-svn: 88760	2009-11-14 06:15:14 +00:00
Evan Cheng	16797a1f55	Added getSubRegIndex(A,B) that returns subreg index of A to B. Use it to replace broken code in VirtRegRewriter. llvm-svn: 88753	2009-11-14 03:42:17 +00:00
Evan Cheng	6ad7da96fe	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
Evan Cheng	e3b312fec9	Add radar number. llvm-svn: 88739	2009-11-14 02:11:32 +00:00
Evan Cheng	d2c10508cd	Fix PR5412: Fix an inverted check and another missing sub-register check. llvm-svn: 88738	2009-11-14 02:09:09 +00:00
Dan Gohman	a627e26d39	Enable the tail call optimization when the caller returns undef. llvm-svn: 88737	2009-11-14 02:06:30 +00:00
Evan Cheng	66401c90da	When expanding t2STRDi8 r, r to two stores, add kill markers correctly. llvm-svn: 88734	2009-11-14 01:50:00 +00:00
Evan Cheng	78fa302e7d	Fix PR5411. Bug in UpdateKills. A reg def partially define its super-registers. llvm-svn: 88719	2009-11-13 23:16:41 +00:00
David Greene	659c1a9d78	Move DebugInfo checks into EmitComments and remove them from target-specific AsmPrinters. Not all comments need DebugInfo. Re-enable the line numbers comment test. llvm-svn: 88697	2009-11-13 21:34:57 +00:00
Dan Gohman	225fa59cac	When optimizing for size, don't tail-merge unless it's likely to be a code-size win, and not when it's only likely to be code-size neutral, such as when only a single instruction would be eliminated and a new branch would be required. This fixes rdar://7392894. llvm-svn: 88692	2009-11-13 21:02:15 +00:00
Evan Cheng	d190b8216f	Fix PR5410: LiveVariables lost subreg def: D0<def,dead> = ... ... = S0<use, kill> S0<def> = ... ... D0<def> = The first D0 def is correctly marked dead, however, livevariables should have added an implicit def of S0 or we end up with a use without a def. llvm-svn: 88690	2009-11-13 20:36:40 +00:00
Dan Gohman	f80dc08059	Don't let a noalias difference disrupt the tailcall optimization. llvm-svn: 88672	2009-11-13 18:49:38 +00:00
Dale Johannesen	5f4eecf961	Adjust isConstantSplat to allow for big-endian targets. PPC is such a target; make it work. llvm-svn: 87060	2009-11-13 01:45:18 +00:00
Daniel Dunbar	3f75f5ddcb	Update test. llvm-svn: 87049	2009-11-13 01:01:58 +00:00
Jim Grosbach	1025a4998b	Clean up testcase a bit. Simplify case blocks and adjust switch instruction to not take an undefined value as input. llvm-svn: 86997	2009-11-12 17:19:09 +00:00
Benjamin Kramer	5218176bc6	Fix typo in run line. llvm-svn: 86984	2009-11-12 12:35:27 +00:00
Gabor Greif	13431c6cdf	typo llvm-svn: 86980	2009-11-12 09:44:17 +00:00
Chris Lattner	eb9acbfb05	implement a nice little efficiency hack in the inliner. Since we're now running IPSCCP early, and we run functionattrs interlaced with the inliner, we often (particularly for small or noop functions) completely propagate all of the information about a call to its call site in IPSSCP (making a call dead) and functionattrs is smart enough to realize that the function is readonly (because it is interlaced with inliner). To improve compile time and make the inliner threshold more accurate, realize that we don't have to inline dead readonly function calls. Instead, just delete the call. This happens all the time for C++ codes, here are some counters from opt/llvm-ld counting the number of times calls were deleted vs inlined on various apps: Tramp3d opt: 5033 inline - Number of call sites deleted, not inlined 24596 inline - Number of functions inlined llvm-ld: 667 inline - Number of functions deleted because all callers found 699 inline - Number of functions inlined 483.xalancbmk opt: 8096 inline - Number of call sites deleted, not inlined 62528 inline - Number of functions inlined llvm-ld: 217 inline - Number of allocas merged together 2158 inline - Number of functions inlined 471.omnetpp: 331 inline - Number of call sites deleted, not inlined 8981 inline - Number of functions inlined llvm-ld: 171 inline - Number of functions deleted because all callers found 629 inline - Number of functions inlined Deleting a call is much faster than inlining it, and is insensitive to the size of the callee. :) llvm-svn: 86975	2009-11-12 07:56:08 +00:00
Evan Cheng	5d85a46f76	RegScavenger::enterBasicBlock should always reset register state. llvm-svn: 86972	2009-11-12 07:49:10 +00:00
Evan Cheng	85a9f430e9	- Teach LSR to avoid changing cmp iv stride if it will create an immediate that cannot be folded into target cmp instruction. - Avoid a phase ordering issue where early cmp optimization would prevent the later count-to-zero optimization. - Add missing checks which could cause LSR to reuse stride that does not have users. - Fix a bug in count-to-zero optimization code which failed to find the pre-inc iv's phi node. - Remove, tighten, loosen some incorrect checks disable valid transformations. - Quite a bit of code clean up. llvm-svn: 86969	2009-11-12 07:35:05 +00:00
Chris Lattner	5f6b8b2bcb	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	380ccbaeaa	should not commit when distracted. llvm-svn: 86929	2009-11-12 02:04:17 +00:00
Chris Lattner	e2a63f2798	We now thread some impossible condition information with LVI. llvm-svn: 86927	2009-11-12 01:55:20 +00:00
Chris Lattner	ba45616958	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	b584d1e456	move some stuff into DEBUG's and turn on lazy-value-info for the basic.ll testcase. llvm-svn: 86918	2009-11-12 01:22:16 +00:00
Dan Gohman	09478e975d	Tail merge at any size when there are two potentials blocks and one can be made to fall through into the other. llvm-svn: 86909	2009-11-12 00:39:10 +00:00
Kenneth Uildriks	9f34406a90	x86 users can now return arbitrary sized structs. Structs too large to fit in return registers will be returned through a hidden sret parameter introduced during SelectionDAG construction. llvm-svn: 86876	2009-11-11 19:59:24 +00:00
Dan Gohman	64b5d0f468	Add support for tail duplication to BranchFolding, and extend tail merging support to handle more cases. - Recognize several cases where tail merging is beneficial even when the tail size is smaller than the generic threshold. - Make use of MachineInstrDesc::isBarrier to help detect non-fallthrough blocks. - Check for and avoid disrupting fall-through edges in more cases. llvm-svn: 86871	2009-11-11 19:48:59 +00:00
Devang Patel	addf8b1ac6	Reenable StackTracke.cpp test. llvm-svn: 86861	2009-11-11 19:08:42 +00:00
Duncan Sands	ba61fed5d3	Don't trivially delete unused calls to llvm.invariant.start. This allows llvm.invariant.start to be used without necessarily being paired with a call to llvm.invariant.end. If you run the entire optimization pipeline then such calls are in fact deleted (adce does it), but that's actually a good thing since we probably do want them to be zapped late in the game. There should really be an integration test that checks that the llvm.invariant.start call lasts long enough that all passes that do interesting things with it get to do their stuff before it is deleted. But since no passes do anything interesting with it yet this will have to wait for later. llvm-svn: 86840	2009-11-11 15:34:13 +00:00
Evan Cheng	7e5e40c75e	Add nounwind. llvm-svn: 86814	2009-11-11 07:11:02 +00:00
Chris Lattner	3e308fb0ee	remove condprop testcases. llvm-svn: 86804	2009-11-11 05:25:16 +00:00
Daniel Dunbar	6a77f51520	Add missing run line. Devang, please check. llvm-svn: 86795	2009-11-11 03:10:03 +00:00
Bill Wendling	d656f8ec4c	Fix test to work on every platform. llvm-svn: 86786	2009-11-11 01:44:22 +00:00
Bill Wendling	5831283cb5	Fix test to work on every platform. llvm-svn: 86785	2009-11-11 01:41:32 +00:00
Devang Patel	b90dac093a	XFAIL for now. llvm-svn: 86784	2009-11-11 01:41:10 +00:00
Bill Wendling	676f44062e	Make sure that the exception handling data has the same visibility as the function it's generated for. llvm-svn: 86779	2009-11-11 01:24:59 +00:00
Devang Patel	78319c67ca	Do not assume first function scope seen represents current function. llvm-svn: 86771	2009-11-11 00:31:36 +00:00
Chris Lattner	6e960c8657	oops, didn't mean to commit this, no harm, but add a todoops, didn't mean to commit this, no harm, but add a todoo llvm-svn: 86768	2009-11-11 00:27:54 +00:00
Chris Lattner	741c94c719	Stub out a new lazy value info pass, which will eventually vend value constraint information to the optimizer. llvm-svn: 86767	2009-11-11 00:22:30 +00:00
Devang Patel	4450f26621	While creating DbgScopes, do not forget parent scope. llvm-svn: 86763	2009-11-11 00:18:40 +00:00
Evan Cheng	12f146d8f7	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Bill Wendling	47739b20fd	Test this on Darwin only. llvm-svn: 86752	2009-11-10 23:18:33 +00:00
Dale Johannesen	6f7d5b22bb	Emit correct code when making a ConstantPool entry for a vector constant whose component type is not a legal type for the target. (If the target ConstantPool cannot handle this type either, it has an opportunity to merge elements. In practice any target with 8-bit bytes must support i8 as data). 7320806 (partial). llvm-svn: 86751	2009-11-10 23:16:41 +00:00
Chris Lattner	9518fbb54e	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Bill Wendling	fc9469f311	Modify how the prologue encoded the "move" information for the FDE. GCC generates a sequence similar to this: __Z4funci: LFB2: mflr r0 LCFI0: stmw r30,-8(r1) LCFI1: stw r0,8(r1) LCFI2: stwu r1,-80(r1) LCFI3: mr r30,r1 LCFI4: where LCFI3 and LCFI4 are used by the FDE to indicate what the FP, LR, and other things are. We generated something more like this: Leh_func_begin1: mflr r0 stw r31, 20(r1) stw r0, 8(r1) Llabel1: stwu r1, -80(r1) Llabel2: mr r31, r1 Note that we are missing the "mr" instruction. This patch makes it more like the GCC output. llvm-svn: 86729	2009-11-10 22:14:04 +00:00
Chris Lattner	02e2cee7dc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	80e7e5a429	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	87fe40b32d	Generalize lsr code that optimize loop to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Dan Gohman	1f31f6e265	Optimize test more. llvm-svn: 86714	2009-11-10 21:02:18 +00:00
Duncan Sands	1925d3a1d1	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Victor Hernandez	fcc77b1c02	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Chris Lattner	17529ac0c5	optimize test llvm-svn: 86672	2009-11-10 07:44:36 +00:00
Chris Lattner	1559bedcc7	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00

1 2 3 4 5 ...

8657 Commits