llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	5709254bd5	Fix test. llvm-svn: 120730	2010-12-02 20:17:34 +00:00
Duncan Sands	050d93cb5a	This test dates from the time when llvm-gcc had problems if two types were named the same, so it had to qualify type names according to the enclosing scope to ensure uniqueness. This is no longer needed for correctness (though it may be helpful when reading the IR), so this test has lost its importance. Zap it because dragonegg will never be able to produce the qualified type name since modern gcc zaps language specific info (such as whether a type is nested inside another - needed to get X::Y here) before dragonegg is reached. llvm-svn: 120721	2010-12-02 18:19:23 +00:00
NAKAMURA Takumi	2dc7d5536a	test/Archive/extract.ll: Use cmp instead of diff. Thanks to Danil Malyshev! llvm-svn: 120698	2010-12-02 09:16:14 +00:00
Evan Cheng	419ea286ee	Fix and re-enable tail call optimization of expanded libcalls. llvm-svn: 120622	2010-12-01 22:59:46 +00:00
Rafael Espindola	5fe5f45352	Rename temporary symbols if they conflict with artificial symbols created by the assembler. This was blocking parsing any large .s produced by clang for example. Fixes PR8596. llvm-svn: 120603	2010-12-01 20:46:11 +00:00
Owen Anderson	943fb60b1f	Add correct encodings for STRD and LDRD, including fixup support. Additionally, update these to unified syntax. llvm-svn: 120589	2010-12-01 19:18:46 +00:00
Evan Cheng	a695abde49	Speculatively disable x86 portion of r120501 to appease the x86_64 buildbot. llvm-svn: 120549	2010-12-01 03:27:20 +00:00
Jason W Kim	29805961d8	ARM/MC/ELF relocation "hello world" for movw/movt. Lifted adjustFixupValue() from Darwin for sharing w ELF. Test added TODO: refactor ELFObjectWriter::RecordRelocation more. Possibly share more code with Darwin? Lots more relocations... llvm-svn: 120534	2010-12-01 02:40:06 +00:00
Chris Lattner	1c577b54b0	fix a bozo bug I introduced in r119930, causing a miscompile of 20040709-1.c from the gcc testsuite. I was using the size of a pointer instead of the pointee. This fixes rdar://8713376 llvm-svn: 120519	2010-12-01 01:24:55 +00:00
NAKAMURA Takumi	c8bf78e7f3	test/Archive: FileCheck-ize, and remove *.toc. These may be CRLF-tolerant. llvm-svn: 120506	2010-12-01 00:09:25 +00:00
Evan Cheng	d4b0873c06	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Chris Lattner	903add84d9	Enhance DSE to handle the variable index case in PR8657. llvm-svn: 120498	2010-11-30 23:43:23 +00:00
Chris Lattner	d513faf41f	remove fixme comment too. llvm-svn: 120493	2010-11-30 23:25:01 +00:00
Chris Lattner	370797a1fb	check in all files. This is now handled by my previous DSE commit. llvm-svn: 120492	2010-11-30 23:23:59 +00:00
Chris Lattner	c0f3379ae0	teach DSE to use GetPointerBaseWithConstantOffset to analyze may-aliasing stores that partially overlap with different base pointers. This implements PR6043 and the non-variable part of PR8657 llvm-svn: 120485	2010-11-30 23:05:20 +00:00
Chris Lattner	b63ba73b1b	enhance isRemovable to refuse to delete volatile mem transfers now that DSE hacks on them. This fixes a regression I introduced, by generalizing DSE to hack on transfers. llvm-svn: 120445	2010-11-30 19:12:10 +00:00
Owen Anderson	6187e66801	Add tests for more forms of Thumb2 loads and stores. llvm-svn: 120436	2010-11-30 18:15:21 +00:00
Che-Liang Chiou	e9baf13657	ptx: add command-line options for gpu target and ptx version llvm-svn: 120423	2010-11-30 10:14:14 +00:00
Eric Christopher	8e9fbcf0f0	Not all platforms use _<func>. Duh. llvm-svn: 120418	2010-11-30 09:23:54 +00:00
Bill Wendling	811c936ed5	Add parsing for the Thumb t_addrmode_s4 addressing mode. This can almost certainly be made more generic. But it does allow us to parse something like: ldr r3, [r2, r4] correctly in Thumb mode. llvm-svn: 120408	2010-11-30 07:44:32 +00:00
Chris Lattner	58b779e9c2	Rewrite the main DSE loop to be written in terms of reasoning about pairs of AA::Location's instead of looking for MemDep's "Def" predicate. This is more powerful and general, handling memset/memcpy/store all uniformly, and implementing PR8701 and probably obsoleting parts of memcpyoptimizer. This also fixes an obscure bug with init.trampoline and i8 stores, but I'm not surprised it hasn't been hit yet. Enhancing init.trampoline to carry the size that it stores would allow DSE to be much more aggressive about optimizing them. llvm-svn: 120406	2010-11-30 07:23:21 +00:00
Eric Christopher	fa6657cec0	Rewrite mwait and monitor support and custom lower arguments. Fixes PR8573. llvm-svn: 120404	2010-11-30 07:20:12 +00:00
Anders Carlsson	e3ea1cba79	Add a puts optimization that converts puts() to putchar('\n'). llvm-svn: 120398	2010-11-30 06:19:18 +00:00
Anders Carlsson	77e9892afd	Fix a typo. llvm-svn: 120394	2010-11-30 06:03:55 +00:00
Anders Carlsson	631d06bbce	Rename this test to FPuts.ll since it actually tests fputs. llvm-svn: 120393	2010-11-30 05:59:26 +00:00
Chris Lattner	6c7f64e0bc	remove a use of llvm-dis llvm-svn: 120383	2010-11-30 02:04:15 +00:00
Chris Lattner	c2e3445273	merge one more away llvm-svn: 120375	2010-11-30 01:06:43 +00:00
Chris Lattner	7578d0df51	I already merged partial-overwrite.ll -> PartialStore.ll Merge context-sensitive.ll -> simple.ll and upgrade it. llvm-svn: 120374	2010-11-30 01:05:07 +00:00
Chris Lattner	43e3a98675	clean up DSE tests, removing some poorly reduced and useless old test, merging more into other larger .ll files, filecheckizing along the way. llvm-svn: 120373	2010-11-30 01:00:34 +00:00
Chris Lattner	90c4947df7	enhance basicaa to return "Mod" for a memcpy call when the queried location doesn't overlap the source, and add a testcase. llvm-svn: 120370	2010-11-30 00:43:16 +00:00
Chris Lattner	9a146372b5	Teach basicaa that memset's modref set is at worst "mod" and never contains "ref". Enhance DSE to use a modref query instead of a store-specific hack to generalize the "ignore may-alias stores" optimization to handle memset and memcpy. llvm-svn: 120368	2010-11-30 00:28:45 +00:00
Owen Anderson	e22c7322b8	Correct Thumb2 encodings for a much wider range of loads and stores. llvm-svn: 120364	2010-11-30 00:14:31 +00:00
Chris Lattner	c3c754f750	my previous patch would cause us to start deleting some volatile stores, fix and add a testcase. llvm-svn: 120363	2010-11-30 00:12:39 +00:00
Bob Wilson	431ac4ef50	Add support for NEON VLD3-dup instructions. The encoding for alignment in VLD4-dup instructions is still a work in progress. llvm-svn: 120356	2010-11-30 00:00:35 +00:00
Owen Anderson	50d662b6cb	Provide Thumb2 encodings for basic loads and stores. llvm-svn: 120340	2010-11-29 22:44:32 +00:00
Evan Cheng	9a133f623c	Mark Darwin call instructions as using "r7" to prevent the frame-register assignment instructions from being moved below / above calls. rdar://8690640 llvm-svn: 120339	2010-11-29 22:43:27 +00:00
Benjamin Kramer	a22f0ce1a3	Add missing colon. llvm-svn: 120336	2010-11-29 22:39:38 +00:00
Benjamin Kramer	e6840ef4b3	Fix some broken CHECK lines. llvm-svn: 120332	2010-11-29 22:34:55 +00:00
Chris Lattner	2e8793482c	fix PR8677, patch by Jakub Staszak! llvm-svn: 120325	2010-11-29 21:59:31 +00:00
Frits van Bommel	28218aa8f1	Transform (extractvalue (load P), ...) to (load (gep P, 0, ...)) if the load has no other uses, shrinking the load. llvm-svn: 120323	2010-11-29 21:56:20 +00:00
Frits van Bommel	40a80ac963	Update this test to keep testing the -instcombine transform it's supposed to be testing instead of triggering the improved constant folding for insertvalue and extractvalue. llvm-svn: 120319	2010-11-29 20:55:40 +00:00
Frits van Bommel	a98214de10	Teach ConstantFoldInstruction() how to fold insertvalue and extractvalue. llvm-svn: 120316	2010-11-29 20:36:52 +00:00
Bob Wilson	77ab165afe	Add support for NEON VLD3-dup instructions. llvm-svn: 120312	2010-11-29 19:35:29 +00:00
Kalle Raiskila	1ff0bfa28f	Handle lshr for i128 correctly on SPU also when shiftamount > 7. llvm-svn: 120288	2010-11-29 14:44:28 +00:00
Kalle Raiskila	dc620afd1e	Enable PostRA scheduling for SPU. This speeds up selected test cases with up to 5% - no slowdowns observed. llvm-svn: 120286	2010-11-29 10:30:25 +00:00
NAKAMURA Takumi	6ea8a947e8	test: Check the feature 'loadable_module' with load modules in %llvmshlibdir. %llvmshlibdir should be 'bin' on Cygming. llvm-svn: 120282	2010-11-29 07:58:32 +00:00
Bill Wendling	232e52cfb7	Add more Thumb encodings. llvm-svn: 120279	2010-11-29 01:07:48 +00:00
Bill Wendling	ccba1a8d95	More Thumb encodings. llvm-svn: 120278	2010-11-29 01:00:43 +00:00
Bill Wendling	9600e97c60	Add Thumb encodings for REV instructions. llvm-svn: 120277	2010-11-29 00:42:50 +00:00
NAKAMURA Takumi	4fc56f0be7	test: Use $SharedLibDir for loadable modules. On Cygming, loadable modules are not in lib/ but bin. llvm-svn: 120274	2010-11-29 00:20:21 +00:00
NAKAMURA Takumi	5114d0afe3	test: Add the new feature 'loadable_module'. llvm-svn: 120273	2010-11-29 00:20:09 +00:00
Bill Wendling	775899eb2e	Add more Thumb encodings. llvm-svn: 120272	2010-11-29 00:18:15 +00:00
Chris Lattner	7e8a99b1c3	fix PR8686, accepting a 'b' suffix at the end of all the setcc instructions. I choose to handle this with an asmparser hack, though it could be handled by changing all the instruction definitions to allow be "setneb" instead of "setne". The asm parser hack is better in this case, because we want the disassembler to produce setne, not setneb. llvm-svn: 120260	2010-11-28 20:23:50 +00:00
Bob Wilson	2d790df105	Add support for NEON VLD2-dup instructions. llvm-svn: 120236	2010-11-28 06:51:26 +00:00
Rafael Espindola	5d882894d8	Lower TLS_addr32 and TLS_addr64. llvm-svn: 120225	2010-11-27 20:43:02 +00:00
Rafael Espindola	eab0800695	Implement the data16 prefix. llvm-svn: 120224	2010-11-27 20:29:45 +00:00
NAKAMURA Takumi	f80507c28c	CMake: lit(check.vcproj) can run with multiple configurations on Visual Studio. Unittests need LLVM_BUILD_MODE to pick up each test. Confirmed on CentOS5, Mingw, MSYS, and with possible configurations on VS8 and VS10. llvm-svn: 120212	2010-11-27 13:10:11 +00:00
Bob Wilson	c92eea0175	Add NEON VLD1-dup instructions (load 1 element to all lanes). llvm-svn: 120194	2010-11-27 06:35:16 +00:00
Daniel Dunbar	1440fd3539	macho-dump: Fix typo. llvm-svn: 120185	2010-11-27 04:00:06 +00:00
NAKAMURA Takumi	c54a9692ce	test/site.exp.in: Add "emitir", for now, fixing up r120156. CMake depends on site.exp.in, though, "emitir" might be unused. llvm-svn: 120174	2010-11-26 08:30:15 +00:00
Duncan Sands	7904068186	Remove explicit uses of -emit-llvm, the test infrastructure adds it automatically. Use -S with llvm-gcc rather than -c, so tests can work when llvm-gcc is really dragonegg (which can output IR with -S but not -c). Yes, dragonegg supports objective-c++ (poorly though). llvm-svn: 120164	2010-11-25 21:48:20 +00:00
Duncan Sands	8182ac6a05	Remove explicit uses of -emit-llvm, the test infrastructure adds it automatically. Use -S with llvm-gcc rather than -c, so tests can work when llvm-gcc is really dragonegg (which can output IR with -S but not -c). Yes, dragonegg supports objective-c (poorly though). llvm-svn: 120163	2010-11-25 21:46:07 +00:00
Duncan Sands	e6c974b230	Use -S rather than -c for the benefit of dragonegg. llvm-svn: 120161	2010-11-25 21:41:35 +00:00
Duncan Sands	5fe97a0490	Remove explicit uses of -emit-llvm, the test infrastructure adds it automatically. Use -S with llvm-gcc rather than -c, so tests can work when llvm-gcc is really dragonegg (which can output IR with -S but not -c). llvm-svn: 120160	2010-11-25 21:39:17 +00:00
Duncan Sands	0be0ae625d	Judging from the comment, the system assembler is supposed to assemble the output of this test. Since it was producing bitcode, that clearly wasn't happening! Have it produce target assembler and assemble that instead. llvm-svn: 120159	2010-11-25 21:26:21 +00:00
Duncan Sands	b32d19de6a	Remove explicit uses of -emit-llvm, the test infrastructure adds it automatically. Use -S with llvm-gcc rather than -c, so tests can work when llvm-gcc is really dragonegg (which can output IR with -S but not -c). llvm-svn: 120158	2010-11-25 21:24:35 +00:00
Duncan Sands	2b5243d096	Dragonegg cannot output bitcode, only human readable IR, so use -S rather than -c. llvm-svn: 120157	2010-11-25 21:21:59 +00:00
Duncan Sands	c78fbf9877	Use LLVMCC_EMITIR_FLAG rather than hard-coding "-emit-llvm". llvm-svn: 120156	2010-11-25 21:19:52 +00:00
Rafael Espindola	7c2acd022e	Use multiple 0x66 prefixes so that all nops up to 15 bytes are a single instruction. llvm-svn: 120147	2010-11-25 17:14:16 +00:00
Rafael Espindola	f8e127eaf6	Factor some code to parseSectionFlags and fix the default type of a section. llvm-svn: 120145	2010-11-25 15:32:56 +00:00
Nick Lewycky	b8de00ee07	Treat a call of function pointer like a load of the pointer when considering whether the pointer can be replaced with the global variable it is a copy of. Fixes PR8680. llvm-svn: 120126	2010-11-24 22:04:20 +00:00
Rafael Espindola	9f75d5df0b	Behave a bit more like gnu as and use the symbol (instead of the section) for any relocation to a symbol defined in a tls section. llvm-svn: 120121	2010-11-24 21:57:39 +00:00
Rafael Espindola	708ac4d6ad	Relocate with the symbol if the relocation is of kind NTPOFF. Patch by David Meyer, I added the test. llvm-svn: 120104	2010-11-24 19:23:50 +00:00
Rafael Espindola	e98d483b71	Fix and add tests for all cases in x86 and x86_64 where gnu as implicitly sets the type of a symbol to STT_TLS. llvm-svn: 120100	2010-11-24 18:51:21 +00:00
Rafael Espindola	af9a7a3e92	Testcase for r120017. llvm-svn: 120099	2010-11-24 18:03:57 +00:00
Kalle Raiskila	97fc68774c	Allow for 'fcmp ogt' in SPU. Fix by Visa Putkinen! llvm-svn: 120090	2010-11-24 11:42:17 +00:00
Rafael Espindola	4e70ac7b68	If a symbol is used as tls, mark it as tls even if not declare as so. Probably fixes PR8659. llvm-svn: 120076	2010-11-24 02:19:40 +00:00
Benjamin Kramer	94a622af4c	The srem -> urem transform is not safe for any divisor that's not a power of two. E.g. -5 % 5 is 0 with srem and 1 with urem. Also addresses Frits van Bommel's comments. llvm-svn: 120049	2010-11-23 20:33:57 +00:00
Bob Wilson	d7d2cf7842	Recognize sign/zero-extended constant BUILD_VECTORs for VMULL operations. We need to check if the individual vector elements are sign/zero-extended values. For now this only handles constants values. Radar 8687140. llvm-svn: 120034	2010-11-23 19:38:38 +00:00
Benjamin Kramer	b5afa65b0a	InstCombine: Reduce "X shift (A srem B)" to "X shift (A urem B)" iff B is positive. This allows to transform the rem in "1 << ((int)x % 8);" to an and. llvm-svn: 120028	2010-11-23 18:52:42 +00:00
Duncan Sands	adc7771f18	Exploit distributive laws (eg: And distributes over Or, Mul over Add, etc) in a fairly systematic way in instcombine. Some of these cases were already dealt with, in which case I removed the existing code. The case of Add has a bunch of funky logic which covers some of this plus a few variants (considers shifts to be a form of multiplication), which I didn't touch. The simplification performed is: AB+AC -> A(B+C). The improvement is to do this in cases that were not already handled [such as AB-AC -> A(B-C), which was reported on the mailing list], and also to do it more often by not checking for "only one use" if "B+C" simplifies. llvm-svn: 120024	2010-11-23 14:23:47 +00:00
Kalle Raiskila	e1b6c273b8	Division by pow-of-2 is not cheap on SPU, do it with shifts. llvm-svn: 120022	2010-11-23 13:27:59 +00:00
Rafael Espindola	3c7cab1402	Produce a relocation for pcrel absolute values. Based on a patch by David Meyer. llvm-svn: 120006	2010-11-23 07:20:12 +00:00
Chris Lattner	e5afa15b77	duncan's spider sense was right, I completely reversed the condition on this instcombine xform. This fixes a miscompilation of 403.gcc. llvm-svn: 119988	2010-11-23 02:42:04 +00:00
Chris Lattner	adc29567fc	filecheckize llvm-svn: 119987	2010-11-23 02:26:52 +00:00
Benjamin Kramer	f1ebb63161	InstCombine: Implement X - A*-B -> X + A*B. llvm-svn: 119984	2010-11-22 20:31:27 +00:00
Evan Cheng	eb56dca4fd	Fix epilogue codegen to avoid leaving the stack pointer in an invalid state. Previously Thumb2 would restore sp from fp like this: mov sp, r7 sub, sp, #4 If an interrupt is taken after the 'mov' but before the 'sub', callee-saved registers might be clobbered by the interrupt handler. Instead, try restoring directly from sp: add sp, #4 Or, if necessary (with VLA, etc.) use a scratch register to compute sp and then restore it: sub.w r4, r7, #8 mov sp, r7 rdar://8465407 llvm-svn: 119977	2010-11-22 18:12:04 +00:00
Duncan Sands	c133c54426	If a GEP index simply advances by multiples of a type of zero size, then replace the index with zero. llvm-svn: 119974	2010-11-22 16:32:50 +00:00
Kalle Raiskila	77d11d054c	Fix a bug with extractelement on SPU. In the attached testcase, the element was never extracted (missing rotate). llvm-svn: 119973	2010-11-22 16:28:26 +00:00
Benjamin Kramer	24656c9583	Implement the "if (X == 6 || X == 4)" -> "if ((X|2) == 6)" optimization. This currently only catches the most basic case, a two-case switch, but can be extended later. llvm-svn: 119964	2010-11-22 09:45:38 +00:00
Wesley Peck	f1d3800e65	Implement branch analysis in the MBlaze backend. llvm-svn: 119951	2010-11-21 21:53:36 +00:00
Duncan Sands	cf4bceba49	Add a rather pointless InstructionSimplify transform, inspired by recent constant folding improvements: if P points to a type of size zero, turn "gep P, N" into "P". More generally, if a gep index type has size zero, instcombine could replace the index with zero, but that is not done here. llvm-svn: 119942	2010-11-21 13:53:09 +00:00
Bill Wendling	c01d679928	Add encoding for ARM "trap" instruction. llvm-svn: 119938	2010-11-21 11:05:29 +00:00
Chris Lattner	b4cd1819fa	implement PR8524, apparently mainline gas accepts movq as an alias for movd when transfering between i64 gprs and mmx regs. llvm-svn: 119931	2010-11-21 08:18:57 +00:00
Chris Lattner	e48c31ce33	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Chris Lattner	6e22221b37	file checkize llvm-svn: 119926	2010-11-21 07:32:40 +00:00
Chris Lattner	f7e896138e	optimize: void a(int x) { if (((1<<x)&8)==0) b(); } into "x != 3", which occurs over 100 times in 403.gcc but in no other program in llvm-test. llvm-svn: 119922	2010-11-21 06:44:42 +00:00
Rafael Espindola	26cb15a549	Handle PCRel relocations with absolute values. Fixes PR8656. llvm-svn: 119917	2010-11-21 00:48:25 +00:00
Chris Lattner	58f9f58716	Implement PR8644: forwarding a memcpy value to a byval, allowing the memcpy to be eliminated. Unfortunately, the requirements on byval's without explicit alignment are really weak and impossible to predict in the mid-level optimizer, so this doesn't kick in much with current frontends. The fix is to change clang to set alignment on all byval arguments. llvm-svn: 119916	2010-11-21 00:28:59 +00:00
Andrew Trick	cf7fefb25c	Removing the useless test that I added recently. It was meant as an example, but not complicated enough to merit another test. llvm-svn: 119898	2010-11-20 07:26:51 +00:00
Owen Anderson	c0177aeb71	Add a test for CodeGenPrepare's ability to look through PHI nodes when performing addressing mode folding, introduced in r119853. llvm-svn: 119857	2010-11-19 22:34:53 +00:00
Dale Johannesen	c9242c577e	Prefetch has a MemOperand now. FileCheckize a test. This finishes up 8460971. llvm-svn: 119848	2010-11-19 21:49:38 +00:00
Mon P Wang	88ff56caa3	Make isScalarToVector to return false if the node is a scalar. This will prevent DAGCombine from making an illegal transformation of bitcast of a scalar to a vector into a scalar_to_vector. llvm-svn: 119819	2010-11-19 19:08:12 +00:00
Kevin Enderby	8be14414f6	Added support for the Mach-O .symbol_resolver directive. rdar://8673046 llvm-svn: 119816	2010-11-19 18:39:33 +00:00
Bill Wendling	945b776b6e	Add MC encodings for some Thumb instructions. Test for a few of them. The "bx lr" instruction cannot be tested just yet. It requires matching a "condition code", but adding one of those makes things go south quickly... llvm-svn: 119774	2010-11-19 01:33:10 +00:00
Bill Wendling	2063b84297	Add support for parsing the writeback ("!") token. llvm-svn: 119761	2010-11-18 23:43:05 +00:00
Owen Anderson	690fa953e1	More tests. llvm-svn: 119756	2010-11-18 23:30:10 +00:00
Owen Anderson	3517585249	Fix encodings for pkhbt, and fix some tests where I accidentally tested ARM mode instead of Thumb2. llvm-svn: 119755	2010-11-18 23:29:56 +00:00
Tanya Lattner	cd68095650	Fix bug in DAGCombiner for ARM that was trying to do a ShiftCombine on illegal types (vector should be split first). Added test case. llvm-svn: 119749	2010-11-18 22:06:46 +00:00
Owen Anderson	3fec5ff14b	More Thumb2 encodings. llvm-svn: 119737	2010-11-18 21:15:19 +00:00
Owen Anderson	3625098459	Fill out the set of Thumb2 multiplication operator encodings. llvm-svn: 119733	2010-11-18 20:32:18 +00:00
Duncan Sands	12f3b3b44f	The DAGCombiner was threading select over pairs of extending loads even if the extension types were not the same. The result was that if you fed a select with sext and zext loads, as in the testcase, then it would get turned into a zext (or sext) of the select, which is wrong in the cases when it should have been an sext (resp. zext). Reported and diagnosed by Sebastien Deldon. llvm-svn: 119728	2010-11-18 20:05:18 +00:00
Duncan Sands	aef146b890	Factor code for testing whether replacing one value with another preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727	2010-11-18 19:59:41 +00:00
Eric Christopher	b006fc9c07	Rewrite stack callee saved spills and restores to use push/pop instructions. Remove movePastCSLoadStoreOps and associated code for simple pointer increments. Update routines that depended upon other opcodes for save/restore. Adjust all testcases accordingly. llvm-svn: 119725	2010-11-18 19:40:05 +00:00
Owen Anderson	c21c100f3d	Completely rework the datastructure GVN uses to represent the value number to leader mapping. Previously, this was a tree of hashtables, and a query recursed into the table for the immediate dominator ad infinitum if the initial lookup failed. This led to really bad performance on tall, narrow CFGs. We can instead replace it with what is conceptually a multimap of value numbers to leaders (actually represented by a hashtable with a list of Value*'s as the value type), and then determine which leader from that set to use very cheaply thanks to the DFS numberings maintained by DominatorTree. Because there are typically few duplicates of a given value, this scan tends to be quite fast. Additionally, we use a custom linked list and BumpPtr allocation to avoid any unnecessary allocation in representing the value-side of the multimap. This change brings with it a 15% (!) improvement in the total running time of GVN on 403.gcc, which I think is pretty good considering that includes all the "real work" being done by MemDep as well. The one downside to this approach is that we can no longer use GVN to perform simple conditional progation, but that seems like an acceptable loss since we now have LVI and CorrelatedValuePropagation to pick up the slack. If you see conditional propagation that's not happening, please file bugs against LVI or CVP. llvm-svn: 119714	2010-11-18 18:32:40 +00:00
Dan Gohman	2e1fc849b2	Add support for PHI-translating sext, zext, and trunc instructions, enabling more PRE. PR8586. llvm-svn: 119704	2010-11-18 17:05:13 +00:00
Chris Lattner	731caac7c6	remove a pointless restriction from memcpyopt. It was refusing to optimize two memcpy's like this: copy A <- B copy C <- A if it couldn't prove that noalias(B,C). We can eliminate the copy by producing a memmove instead of memcpy. llvm-svn: 119694	2010-11-18 08:00:57 +00:00
Chris Lattner	bbb0f9661d	filecheckize, this is still not optimal, see PR8643 llvm-svn: 119693	2010-11-18 07:49:32 +00:00
Chris Lattner	ac5701319b	allow eliminating an alloca that is just copied from an constant global if it is passed as a byval argument. The byval argument will just be a read, so it is safe to read from the original global instead. This allows us to promote away the %agg.tmp alloca in PR8582 llvm-svn: 119686	2010-11-18 06:41:51 +00:00
Chris Lattner	f183d5c4be	enhance the "alloca is just a memcpy from constant global" to ignore calls that obviously can't modify the alloca because they are readonly/readnone. llvm-svn: 119683	2010-11-18 06:26:49 +00:00
Chris Lattner	7aeae25c78	fix a small oversight in the "eliminate memcpy from constant global" optimization. If the alloca that is "memcpy'd from constant" also has a memcpy from it, ignore it: it is a load. We now optimize the testcase to: define void @test2() { %B = alloca %T %a = bitcast %T* @G to i8* %b = bitcast %T* %B to i8* call void @llvm.memcpy.p0i8.p0i8.i64(i8* %b, i8* %a, i64 124, i32 4, i1 false) call void @bar(i8* %b) ret void } previously we would generate: define void @test() { %B = alloca %T %b = bitcast %T* %B to i8* %G.0 = getelementptr inbounds %T* @G, i32 0, i32 0 %tmp3 = load i8* %G.0, align 4 %G.1 = getelementptr inbounds %T* @G, i32 0, i32 1 %G.15 = bitcast [123 x i8]* %G.1 to i8* %1 = bitcast [123 x i8]* %G.1 to i984* %srcval = load i984* %1, align 1 %B.0 = getelementptr inbounds %T* %B, i32 0, i32 0 store i8 %tmp3, i8* %B.0, align 4 %B.1 = getelementptr inbounds %T* %B, i32 0, i32 1 %B.12 = bitcast [123 x i8]* %B.1 to i8* %2 = bitcast [123 x i8]* %B.1 to i984* store i984 %srcval, i984* %2, align 1 call void @bar(i8* %b) ret void } llvm-svn: 119682	2010-11-18 06:20:47 +00:00
Chris Lattner	9434184142	filecheckize llvm-svn: 119681	2010-11-18 06:16:43 +00:00
Rafael Espindola	67c6ab8865	Change CodeGen to use .loc directives. This produces a lot more readable output and testing is easier. A good example is the unknown-location.ll test that now can just look for ".loc 1 0 0". We also don't use a DW_LNE_set_address for every address change anymore. llvm-svn: 119613	2010-11-18 02:04:25 +00:00
Dale Johannesen	ed0d840838	Do not throw away alignment when generating the DAG for memset; we may need it to decide between MOVAPS and MOVUPS later. Adjust a test that was looking for wrong code. PR 3866 / 8675131. llvm-svn: 119605	2010-11-18 01:35:23 +00:00
Owen Anderson	d127e7174b	Try again at providing Thumb2 encodings for basic multiplication operators. llvm-svn: 119601	2010-11-18 01:08:42 +00:00
John Thompson	488051f1a6	Fixed to use input redirection for source - to eliminate .s output. llvm-svn: 119599	2010-11-18 00:50:20 +00:00
Owen Anderson	28883834e1	Revert r119593 while I figure out my testing disagrees with the buildbot. llvm-svn: 119597	2010-11-18 00:42:51 +00:00
Owen Anderson	64aaddcd64	Provide correct Thumb2 encodings for basic multiplication operators. llvm-svn: 119593	2010-11-18 00:19:10 +00:00
John Thompson	ddc7ce548c	Bug 8621 fix - pointer cast stripped from inline asm constraint argument. llvm-svn: 119590	2010-11-17 23:58:47 +00:00
Wesley Peck	307e4688c5	Now that the MBlaze backend is in its own directory, split the test cases into multiple files for different types of instructions. llvm-svn: 119580	2010-11-17 22:54:43 +00:00
Owen Anderson	55425e7f78	Second attempt at correct encodings for Thumb2 bitfield instructions. llvm-svn: 119575	2010-11-17 22:16:31 +00:00
Dale Johannesen	0659c8f157	These tests are looking for library function names that appear to differ on Linux. Try to make them pass on Linux. Would be good for a Linux person to review this. llvm-svn: 119572	2010-11-17 21:57:32 +00:00
Bob Wilson	881b45ccdf	Change ARMGlobalMerge to keep BSS globals in separate pools. This completes the fixes for Radar 8673120. llvm-svn: 119566	2010-11-17 21:25:39 +00:00
Bob Wilson	4c8ab19c22	Fix ARMGlobalMerge pass to check if globals are entirely within range. It is generally not sufficient to check if the starting offset is in range of the maximum offset that can be efficiently used for the target. llvm-svn: 119565	2010-11-17 21:25:36 +00:00
Bob Wilson	59182fb4b5	Change the symbol for merged globals from "merged" to "_MergedGlobals". This makes it more clear that the symbol is an internal, compiler-generated name and gives a little more description about its contents. llvm-svn: 119564	2010-11-17 21:25:33 +00:00
Bob Wilson	f796d4b469	Fix the ARMGlobalMerge pass to look at variable sizes instead of pointer sizes. It was mistakenly looking at the pointer type when checking for the size of global variables. This is a partial fix for Radar 8673120. llvm-svn: 119563	2010-11-17 21:25:27 +00:00
Owen Anderson	6c37ceb182	Revert r119551, which broke buildbots. llvm-svn: 119555	2010-11-17 20:48:51 +00:00
Owen Anderson	7464116bde	Provide Thumb2 encodings for bitfield instructions. llvm-svn: 119551	2010-11-17 20:35:29 +00:00
Evan Cheng	7f8ab6ee8b	Remove ARM isel hacks that fold large immediates into a pair of add, sub, and, and xor. The 32-bit move immediates can be hoisted out of loops by machine LICM but the isel hacks were preventing them. Instead, let peephole optimization pass recognize registers that are defined by immediates and the ARM target hook will fold the immediates in. Other changes include 1) do not fold and / xor into cmp to isel TST / TEQ instructions if there are multiple uses. This happens when the 'and' is live out, machine sink would have sinked the computation and that ends up pessimizing code. The peephole pass would recognize situations where the 'and' can be toggled to define CPSR and eliminate the comparison anyway. 2) Move peephole pass to after machine LICM, sink, and CSE to avoid blocking important optimizations. rdar://8663787, rdar://8241368 llvm-svn: 119548	2010-11-17 20:13:28 +00:00
Owen Anderson	bced7ae046	More miscellaneous Thumb2 encodings. llvm-svn: 119546	2010-11-17 19:57:38 +00:00
Benjamin Kramer	07726c7d52	InstCombine: Add a missing irem identity (X % X -> 0). llvm-svn: 119538	2010-11-17 19:11:46 +00:00
Rafael Espindola	b67912d5cd	Add support for .int. llvm-svn: 119512	2010-11-17 16:24:40 +00:00
Rafael Espindola	5c996894bd	Add support for .2byte, .4byte and .8byte. Fixes PR8631. llvm-svn: 119511	2010-11-17 16:15:42 +00:00
Che-Liang Chiou	c03d04ee1f	Add simple arithmetics and %type directive for PTX llvm-svn: 119485	2010-11-17 08:08:49 +00:00
Bill Wendling	9898ac97fd	Proper encoding for VLDM and VSTM instructions. The register lists for these instructions have to distinguish between lists of single- and double-precision registers in order for the ASM matcher to do a proper job. In all other respects, a list of single- or double-precision registers are the same as a list of GPR registers. llvm-svn: 119460	2010-11-17 04:32:08 +00:00
Dale Johannesen	fc4feca7d8	Test for llvm-gcc patch 119392. llvm-svn: 119393	2010-11-16 21:57:15 +00:00
Duncan Sands	5ffc298bc7	In which I discover the existence of loops. Threading an operation over a phi node by applying it to each operand may be wrong if the operation and the phi node are mutually interdependent (the testcase has a simple example of this). So only do this transform if it would be correct to perform the operation in each predecessor of the block containing the phi, i.e. if the other operands all dominate the phi. This should fix the FFMPEG snow.c regression reported by İsmail Dönmez. llvm-svn: 119347	2010-11-16 12:16:38 +00:00
Rafael Espindola	7d19efd6ff	A bit more of gnu as compatibility when handling relocations with aliases. llvm-svn: 119328	2010-11-16 04:11:46 +00:00
Bill Wendling	92756fff57	Test encodings for LDM and STM. llvm-svn: 119315	2010-11-16 01:38:20 +00:00
Jakob Stoklund Olesen	e2b8858611	Fix PR8612 in the standard spiller, take two. The live range of a register defined by an early clobber starts at the use slot, not the def slot. Except when it is an early clobber tied to a use operand. Then it starts at the def slot like a standard def. llvm-svn: 119305	2010-11-16 00:40:59 +00:00


11734 Commits