llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael J. Spencer	2d9860cbec	Object: Renable the tests now that none of the build bots complain about aliasing. llvm-svn: 123964	2011-01-21 05:07:13 +00:00
Nick Lewycky	6a083cf820	Don't try to pull vector bitcasts that change the number of elements through a select. A vector select is pairwise on each element so we'd need a new condition with the right number of elements to select on. Fixes PR8994. llvm-svn: 123963	2011-01-21 02:30:43 +00:00
Nick Lewycky	39b12c059d	Add a constant folding of casts from zero to zero. Fixes PR9011! While here, I'd like to complain about how vector is not an aggregate type according to llvm::Type::isAggregateType(), but they're listed under aggregate types in the LangRef and zero vectors are stored as ConstantAggregateZero. llvm-svn: 123956	2011-01-21 01:12:09 +00:00
Evan Cheng	028ccbfcbf	Don't be overly aggressive with CSE of "ldr constantpool". If it's a pc-relative value, the "add pc" must be CSE'ed at the same time. We could follow the same approach as T2 by adding pseudo instructions that combine the ldr + "add pc". But the better approach is to use movw + movt (which I will enable soon), so I'll leave this as a TODO. llvm-svn: 123949	2011-01-20 23:55:07 +00:00
Tobias Grosser	f07426b40d	Implement requiredTransitive The PassManager did not implement the transitivity of requiredTransitive. This was unnoticed since 2006. llvm-svn: 123942	2011-01-20 21:03:22 +00:00
Bruno Cardoso Lopes	1f69de3983	Add testcases for clz encoding llvm-svn: 123937	2011-01-20 19:27:16 +00:00
Bruno Cardoso Lopes	e965f06f7f	Fix the encoding and parsing of clrex instruction llvm-svn: 123936	2011-01-20 19:18:32 +00:00
Bruno Cardoso Lopes	d8f9b37f31	Add cdp/cdp2 instructions for thumb/thumb2 llvm-svn: 123929	2011-01-20 18:32:09 +00:00
Devang Patel	a573d5c16d	Disable objdump-trivial-object.test. It is broken on powerpc-darwin9. llvm-svn: 123928	2011-01-20 18:08:44 +00:00
Bruno Cardoso Lopes	33461ecc82	- Use a more appropriate name for Owen's ARM Parser isMCR hack since the same operands can be present in cdp/cdp2 instructions. Also increase the hack with cdp/cdp2 instructions. - Fix the encoding of cdp/cdp2 instructions for ARM (no thumb and thumb2 yet) and add testcases for t hem. llvm-svn: 123927	2011-01-20 18:06:58 +00:00
Bruno Cardoso Lopes	4d4b490fb7	Add mcr2 and mrc2 support to thumb2 targets llvm-svn: 123919	2011-01-20 16:58:48 +00:00
Bruno Cardoso Lopes	cf99dc7eb9	Add mcr* and mr*c support to thumb targets llvm-svn: 123917	2011-01-20 16:35:57 +00:00
Michael J. Spencer	11b293b304	Disable this test until I can figure out why it's broken. Not xfailed because it usese 100% CPU and times out, so it's annoying to run it. llvm-svn: 123915	2011-01-20 16:24:07 +00:00
Kalle Raiskila	6e5a54b36c	Allow sign-extending of i8 and i16 to i128 on SPU. llvm-svn: 123912	2011-01-20 15:49:06 +00:00
Duncan Sands	8fb2c3827c	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Eric Christopher	785db078b4	Expand invalid return values for umulo and smulo. Handle these similarly to add/sub by doing the normal operation and then checking for overflow afterwards. This generally relies on the DAG handling the later invalid operations as well. Fixes the 64-bit part of rdar://8622122 and rdar://8774702. llvm-svn: 123908	2011-01-20 08:54:28 +00:00
Evan Cheng	f2e914be15	Add test. llvm-svn: 123906	2011-01-20 08:38:21 +00:00
Evan Cheng	b8b0ad80a8	Sorry, several patches in one. TargetInstrInfo: Change produceSameValue() to take MachineRegisterInfo as an optional argument. When in SSA form, targets can use it to make more aggressive equality analysis. Machine LICM: 1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead. 2. Fix a bug which prevent CSE of instructions which are not re-materializable. 3. Use improved form of produceSameValue. ARM: 1. Teach ARM produceSameValue to look pass some PIC labels. 2. Look for operands from different loads of different constant pool entries which have same values. 3. Re-implement PIC GA materialization using movw + movt. Combine the pair with a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible to re-materialize the instruction, allow machine LICM to hoist the set of instructions out of the loop and make it possible to CSE them. It's a bit hacky, but it significantly improve code quality. 4. Some minor bug fixes as well. With the fixes, using movw + movt to materialize GAs significantly outperform the load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap and 176.gcc ~10%. llvm-svn: 123905	2011-01-20 08:34:58 +00:00
Michael J. Spencer	2d67ed8f3b	Object: Add some tests! llvm-svn: 123899	2011-01-20 06:39:15 +00:00
Venkatraman Govindaraju	058e12476c	Sparc backend: Implements a delay slot filler that attempt to fill delay slots with useful instructions. llvm-svn: 123884	2011-01-20 05:08:26 +00:00
Eric Christopher	bb14f65672	If we can, lower the multiply part of a umulo/smulo call to a libcall with an invalid type then split the result and perform the overflow check normally. Fixes the 32-bit parts of rdar://8622122 and rdar://8774702. llvm-svn: 123864	2011-01-20 00:29:24 +00:00
Devang Patel	2d9e532a3a	Fix debug info for merged global. llvm-svn: 123862	2011-01-20 00:02:16 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Bruno Cardoso Lopes	d6335ce508	Fix the encoding of mrrc and mcrr family of instructions. Also add testcases for mcr and mrc llvm-svn: 123837	2011-01-19 16:56:52 +00:00
Rafael Espindola	fc355bc070	Add unnamed_addr when we can show that address of a global is not used. llvm-svn: 123834	2011-01-19 16:32:21 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Owen Anderson	dac7a0174e	When matching asm operands, always try to match the most restricted type first. Unfortunately, while this is the "right" thing to do, it breaks some ARM asm parsing tests because MemMode5 and ThumbMemModeReg are ambiguous. This is tricky to resolve since neither is a subset of the other. XFAIL the test for now. The old way was broken in other ways, just ways we didn't happen to be testing, and our ARM asm parsing is going to require significant revisiting at a later point anyways. llvm-svn: 123786	2011-01-18 23:01:21 +00:00
Bruno Cardoso Lopes	2082057b18	Create two new generic classes to represent the following VMRS/VMSR variations: vmrs reg, fpexc vmrs reg, fpsid vmsr fpexc, reg vmsr fpsid, reg llvm-svn: 123783	2011-01-18 21:58:20 +00:00
Bruno Cardoso Lopes	cba727f291	Fix MRS encoding for arm and thumb. llvm-svn: 123778	2011-01-18 21:31:35 +00:00
Bruno Cardoso Lopes	e86a7ad01a	Fix the encoding of t2ISB by using the right class and also parse it correctly llvm-svn: 123776	2011-01-18 21:17:09 +00:00
Dan Gohman	44da55b7be	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other is access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Bruno Cardoso Lopes	e6290ccf9b	Follow the current hack set and enable the correct parsing of bkpt while in thumb mode. llvm-svn: 123772	2011-01-18 20:55:11 +00:00
Chris Lattner	86d56c651d	fix rdar://8878965, a regression I introduced with the recent llvm.objectsize changes. llvm-svn: 123771	2011-01-18 20:53:04 +00:00
Bruno Cardoso Lopes	7f639c11d7	Add support for parsing and encoding ARM's official syntax for the BFI instruction llvm-svn: 123770	2011-01-18 20:45:56 +00:00
Bruno Cardoso Lopes	4dc73fa075	Add support for mips32 madd and msub instructions. Patch by Akira Hatanaka llvm-svn: 123760	2011-01-18 19:29:17 +00:00
Duncan Sands	99589d07e9	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00
Duncan Sands	9b8e2bd8ef	Simplify (X<<1)-X into X. According to my auto-simplier this is the most common missed simplification in fully optimized code. It occurs sporadically in the testsuite, and many times in 403.gcc: the final bitcode has 131 fewer subtractions after this change. The reason that the multiplies are not eliminated is the same reason that instcombine did not catch this: they are used by other instructions (instcombine catches this with a more general transform which in general is only profitable if the operands have only one use). llvm-svn: 123754	2011-01-18 09:24:58 +00:00
Daniel Dunbar	66e91d4a58	McARM: Start marking T2 address operands as such, for the benefit of the parser. llvm-svn: 123722	2011-01-18 03:06:03 +00:00
Benjamin Kramer	45d183ccf0	Fix an off-by-one error in ctpop combining. llvm-svn: 123664	2011-01-17 18:00:28 +00:00
Devang Patel	3ec1f198e5	Update tests to accomodate unnamed_addr introduction. llvm-svn: 123663	2011-01-17 17:54:17 +00:00
Benjamin Kramer	24c5184dca	Add a DAGCombine to turn (ctpop x) u< 2 into (x & x-1) == 0. This shaves off 4 popcounts from the hacked 186.crafty source. This is enabled even when a native popcount instruction is available. The combined code is one operation longer but it should be faster nevertheless. llvm-svn: 123621	2011-01-17 12:04:57 +00:00
Kalle Raiskila	7e7b4ac751	Don't crash SPU BE with memory accesses with big alignmnet. llvm-svn: 123620	2011-01-17 11:59:20 +00:00
Evan Cheng	dfce83c8f5	Materialize GA addresses with movw + movt pairs for Darwin in PIC mode. e.g. movw r0, :lower16:(L_foo$non_lazy_ptr-(LPC0_0+4)) movt r0, :upper16:(L_foo$non_lazy_ptr-(LPC0_0+4)) LPC0_0: add r0, pc, r0 It's not yet enabled by default as some tests are failing. I suspect bugs in down stream tools. llvm-svn: 123619	2011-01-17 08:03:18 +00:00
Nick Lewycky	872a453ada	Test for lazy value info's ability to prove the absense of NULLs in pointers. llvm-svn: 123601	2011-01-16 21:57:20 +00:00
Michael J. Spencer	4e51541319	Make everyone happy this time. llvm-svn: 123599	2011-01-16 21:34:34 +00:00
Anders Carlsson	d3db83349e	Teach DAE to look for functions whose arguments are unused, and change all callers to pass in an undefvalue instead. llvm-svn: 123596	2011-01-16 21:25:33 +00:00
Michael J. Spencer	12a620fd58	Try and fix this test. For some reason llvm-ar thinks that the file exists when it shouldn't, but I have no way to verify that it doesn't actually exist on the buildbot. llvm-svn: 123594	2011-01-16 20:52:58 +00:00
Rafael Espindola	ec517cdf24	Update tests. llvm-svn: 123591	2011-01-16 18:02:57 +00:00
Rafael Espindola	751677a040	Don't merge two constants if we care about the address of both. This fixes the original testcase in PR8927. It also causes a clang binary built with a patched clang to increase in size by 0.21%. We can probably get some of the size back by writing a pass that detects that a global never has its pointer compared and adds unnamed_addr to it (maybe extend global opt). It is also possible that there are some other cases clang could add unnamed_addr to. I will investigate extending globalopt next. llvm-svn: 123584	2011-01-16 17:05:09 +00:00

1 2 3 4 5 ...

12066 Commits