llvm-project

Commit Graph

Author	SHA1	Message	Date
Cameron Zwarich	6b0c4c9b6c	Move DominanceFrontier from VMCore to Analysis. llvm-svn: 123747	2011-01-18 06:06:27 +00:00
Daniel Dunbar	62ea26fb6f	McARM: Use accessors where appropriate. llvm-svn: 123746	2011-01-18 05:55:27 +00:00
Daniel Dunbar	bcd8eb0bac	McARM: Fill in ASMOperand::dump() for memory operands. llvm-svn: 123745	2011-01-18 05:55:21 +00:00
Daniel Dunbar	510740eea7	McARM: Make ARMOperand use a union where appropriate. llvm-svn: 123744	2011-01-18 05:55:15 +00:00
Cameron Zwarich	ce25e88218	There is no point in verifying an analysis that is never updated. llvm-svn: 123743	2011-01-18 05:44:04 +00:00
Daniel Dunbar	f5164f40c5	McARM: Unify ParseMemory() successfull return. llvm-svn: 123740	2011-01-18 05:34:24 +00:00
Daniel Dunbar	1d5e954965	McARM: Early exit on failure (NEFC). llvm-svn: 123739	2011-01-18 05:34:17 +00:00
Daniel Dunbar	7ed455990d	McARM: Always keep an offset expression, if used (instead of assuming == 0 if used but not present), and simplify logic. Also, clean up various non-sensicalisms in isMemModeRegThumb() and isMemModeImmThumb(). llvm-svn: 123738	2011-01-18 05:34:11 +00:00
Daniel Dunbar	5d99420e11	McARM: Add a variety of asserts on the sanity of memory operands. llvm-svn: 123737	2011-01-18 05:34:05 +00:00
Daniel Dunbar	d8da9e0fe6	McARM: Use a consistent marker for not-set OffsetRegNum. llvm-svn: 123736	2011-01-18 05:33:57 +00:00
Cameron Zwarich	fc210c79b7	Convert a std::map to a DenseMap for another 1.7% speedup on -scalarrepl. llvm-svn: 123732	2011-01-18 04:50:38 +00:00
Cameron Zwarich	6968c41ac8	Make a std::vector a SmallVector<*, 32> like the other vectors in the same function. This seems to be about a 1.5% speedup of -scalarrepl on test-suite with SPEC2000 and SPEC2006. llvm-svn: 123731	2011-01-18 04:41:32 +00:00
Rafael Espindola	ecd5b9abe9	Reduce indentation and remove commented out code. llvm-svn: 123729	2011-01-18 04:36:06 +00:00
Cameron Zwarich	66f3c66b9d	Remove some now-unused DominanceFrontier methods. llvm-svn: 123726	2011-01-18 04:21:57 +00:00
Cameron Zwarich	b703654edc	Remove code for updating dominance frontiers and some outdated references to dominance and post-dominance frontiers. llvm-svn: 123725	2011-01-18 04:11:31 +00:00
Cameron Zwarich	4694e69540	Remove outdated references to dominance frontiers. llvm-svn: 123724	2011-01-18 03:53:26 +00:00
Daniel Dunbar	66e91d4a58	McARM: Start marking T2 address operands as such, for the benefit of the parser. llvm-svn: 123722	2011-01-18 03:06:03 +00:00
Daniel Dunbar	e46bc4cec5	Formatting tweak. llvm-svn: 123718	2011-01-18 01:59:30 +00:00
Daniel Dunbar	f413213a48	Support/CommandLine: Add "Did you mean" print for mismatched operands. llvm-svn: 123717	2011-01-18 01:59:24 +00:00
Eric Christopher	542f8a5221	The stub routine that we're calling uses test and so clobbers the flags. llvm-svn: 123712	2011-01-18 01:37:20 +00:00
Chris Lattner	ea4e983d70	minor change to rafael's recent patches: if something is constant but requires a unique address, we can still put it in a readonly section, just not a mergable one. llvm-svn: 123711	2011-01-18 01:23:44 +00:00
Jeffrey Yasskin	249fcd4499	Remove unused variables found by gcc-4.6's -Wunused-but-set-variable. llvm-svn: 123707	2011-01-18 00:51:23 +00:00
Stuart Hastings	4fa832aab0	Remove checking that prevented overlapping CALLSEQ_START/CALLSEQ_END ranges, add legalizer support for nested calls. Necessary for ARM byval support. Radar 7662569. llvm-svn: 123704	2011-01-18 00:09:27 +00:00
NAKAMURA Takumi	8a07451a6e	Windows/PathV2.inc: For CryptAcquireContext(), CRYPT_VERIFYCONTEXT may be specified for easy use. llvm-svn: 123687	2011-01-17 22:41:34 +00:00
NAKAMURA Takumi	53f893af54	Windows/PathV2.inc: MoveFileEx() can behave like Posix's mv(1) to specify MOVEFILE_COPY_ALLOWED \| MOVEFILE_REPLACE_EXISTING. llvm-svn: 123686	2011-01-17 22:41:25 +00:00
NAKAMURA Takumi	bb4ea1fef9	lib/Support/Windows/Signals.inc: "Showstopper" dialogs may be suppressed with SetErrorMode() on Windows 7. llvm-svn: 123685	2011-01-17 22:41:15 +00:00
Owen Anderson	459e079912	Remove dead code, that I apparently wrote a while back. We seem to be doing well enough without whatever this was trying to do. When/if someone has the time to do some empirical evaluations, it might be worth it to figure out what this code was trying to do and see if it's worth resurrecting/fixing. llvm-svn: 123684	2011-01-17 22:39:54 +00:00
Douglas Gregor	69e6206b19	Add a missing <cctype> include, from Joerg Sonnenberger! llvm-svn: 123670	2011-01-17 19:17:01 +00:00
Jim Grosbach	834c373e3f	Trailing whitespace. llvm-svn: 123665	2011-01-17 18:34:03 +00:00
Benjamin Kramer	45d183ccf0	Fix an off-by-one error in ctpop combining. llvm-svn: 123664	2011-01-17 18:00:28 +00:00
Devang Patel	3ec1f198e5	Update tests to accomodate unnamed_addr introduction. llvm-svn: 123663	2011-01-17 17:54:17 +00:00
Cameron Zwarich	b410858a5f	Roll r123609 back in with two changes that fix test failures with expensive checks enabled: 1) Use '<' to compare integers in a comparison function rather than '<='. 2) Use the uniqued set DefBlocks rather than Info.DefiningBlocks to initialize the priority queue. The speedup of scalarrepl on test-suite + SPEC2000 + SPEC2006 is a bit less, at just under 16% rather than 17%. llvm-svn: 123662	2011-01-17 17:38:41 +00:00
Devang Patel	ea49cb04a5	Revert rr123550. It causes clang build failure on darwin9. llvm-svn: 123661	2011-01-17 17:34:43 +00:00
Michael J. Spencer	992efd12a7	Archive: Fix temp path names. llvm-svn: 123660	2011-01-17 16:43:30 +00:00
Oscar Fuentes	e65ed1c794	Add some platform checks. Also fix a typo on a Makefile. Patch by arrowdodger! llvm-svn: 123659	2011-01-17 16:35:14 +00:00
Michael J. Spencer	b74799090a	Support/raw_ostream: Fix uninitalized variable in raw_fd_ostream constructor. llvm-svn: 123643	2011-01-17 15:53:12 +00:00
Jay Foad	fe87364215	Remove useless Tag enumeration. llvm-svn: 123623	2011-01-17 15:18:06 +00:00
Kalle Raiskila	06c6d5cdb6	Split up RotateShift itinerary in SPU. 'rotq' and 'shlq' instructions go to the odd pipeline, wheras the inter-vector equivalents 'rot', 'shl' go to the even. llvm-svn: 123622	2011-01-17 13:33:19 +00:00
Benjamin Kramer	24c5184dca	Add a DAGCombine to turn (ctpop x) u< 2 into (x & x-1) == 0. This shaves off 4 popcounts from the hacked 186.crafty source. This is enabled even when a native popcount instruction is available. The combined code is one operation longer but it should be faster nevertheless. llvm-svn: 123621	2011-01-17 12:04:57 +00:00
Kalle Raiskila	7e7b4ac751	Don't crash SPU BE with memory accesses with big alignmnet. llvm-svn: 123620	2011-01-17 11:59:20 +00:00
Evan Cheng	dfce83c8f5	Materialize GA addresses with movw + movt pairs for Darwin in PIC mode. e.g. movw r0, :lower16:(L_foo$non_lazy_ptr-(LPC0_0+4)) movt r0, :upper16:(L_foo$non_lazy_ptr-(LPC0_0+4)) LPC0_0: add r0, pc, r0 It's not yet enabled by default as some tests are failing. I suspect bugs in down stream tools. llvm-svn: 123619	2011-01-17 08:03:18 +00:00
Cameron Zwarich	67431d7943	Roll out r123609 due to failures on the llvm-x86_64-linux-checks bot. llvm-svn: 123618	2011-01-17 07:26:51 +00:00
Francois Pichet	7cade2cd2f	Suppress warning "switch statement contains 'default' but no 'case' labels" on MSVC. llvm-svn: 123610	2011-01-17 02:07:17 +00:00
Cameron Zwarich	814cd9233e	Eliminate the use of dominance frontiers in PromoteMemToReg. In addition to eliminating a potentially quadratic data structure, this also gives a 17% speedup when running -scalarrepl on test-suite + SPEC2000 + SPEC2006. My initial experiment gave a greater speedup around 25%, but I moved the dominator tree level computation from dominator tree construction to PromoteMemToReg. Since this approach to computing IDFs has a much lower overhead than the old code using precomputed DFs, it is worth looking at using this new code for the second scalarrepl pass as well. llvm-svn: 123609	2011-01-17 01:08:59 +00:00
Michael J. Spencer	5ce56081c7	UnRevert "Revert "Archive: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1."" llvm-svn: 123605	2011-01-16 23:39:59 +00:00
Michael J. Spencer	ec202ee69a	Fix rename. llvm-svn: 123604	2011-01-16 22:18:41 +00:00
Nick Lewycky	872a453ada	Test for lazy value info's ability to prove the absense of NULLs in pointers. llvm-svn: 123601	2011-01-16 21:57:20 +00:00
Michael J. Spencer	4e51541319	Make everyone happy this time. llvm-svn: 123599	2011-01-16 21:34:34 +00:00
Anton Korobeynikov	27fc8f6467	Provide instruction sizes for ARMv5 variants of MUL instructions. This fixes PR8987 llvm-svn: 123598	2011-01-16 21:28:33 +00:00
Anders Carlsson	6a5171ba68	Update README.txt to remove the DAE enhancement. llvm-svn: 123597	2011-01-16 21:26:15 +00:00
Anders Carlsson	d3db83349e	Teach DAE to look for functions whose arguments are unused, and change all callers to pass in an undefvalue instead. llvm-svn: 123596	2011-01-16 21:25:33 +00:00
Michael J. Spencer	405e958ac3	UnRevert "Revert the archive part of "Support/PathV2: Add identify_magic."" This reverts commit dd103021a889a986a181ce36ed7b0e8dc9b645e1. llvm-svn: 123595	2011-01-16 21:13:51 +00:00
Michael J. Spencer	12a620fd58	Try and fix this test. For some reason llvm-ar thinks that the file exists when it shouldn't, but I have no way to verify that it doesn't actually exist on the buildbot. llvm-svn: 123594	2011-01-16 20:52:58 +00:00
Michael J. Spencer	c2caa2133c	Revert the archive part of "Support/PathV2: Add identify_magic." llvm-svn: 123593	2011-01-16 19:56:42 +00:00
Rafael Espindola	ec517cdf24	Update tests. llvm-svn: 123591	2011-01-16 18:02:57 +00:00
Chris Lattner	7c9f4c9c2b	tidy up a comment, as suggested by duncan llvm-svn: 123590	2011-01-16 17:46:19 +00:00
Rafael Espindola	cba4c33949	Only put unnamed_addr constants in mergeable sections. Fixes PR8297. llvm-svn: 123585	2011-01-16 17:19:34 +00:00
Rafael Espindola	751677a040	Don't merge two constants if we care about the address of both. This fixes the original testcase in PR8927. It also causes a clang binary built with a patched clang to increase in size by 0.21%. We can probably get some of the size back by writing a pass that detects that a global never has its pointer compared and adds unnamed_addr to it (maybe extend global opt). It is also possible that there are some other cases clang could add unnamed_addr to. I will investigate extending globalopt next. llvm-svn: 123584	2011-01-16 17:05:09 +00:00
Jay Foad	bbb91f2b22	Simplify the construction and destruction of Uses. Simplify User::dropHungOffUses(). llvm-svn: 123580	2011-01-16 15:30:52 +00:00
Owen Anderson	ec3b10fc56	Reduce and merge testcases. llvm-svn: 123579	2011-01-16 09:13:31 +00:00
Chris Lattner	35a2e65bcb	fix PR8514, a bug where the "heroic" transformation of shift/and into and/shift would cause nodes to move around and a dangling pointer to happen. The code tried to avoid this with a HandleSDNode, but got the details wrong. llvm-svn: 123578	2011-01-16 08:48:11 +00:00
Jay Foad	5ded9df82a	Remove unnecessary specialization OperandTraits<User>. llvm-svn: 123577	2011-01-16 08:23:16 +00:00
Jay Foad	59809c7a62	Move the implementation of the User class into a new source file, User.cpp. llvm-svn: 123575	2011-01-16 08:10:57 +00:00
Chris Lattner	e5f8de8639	fix PR8932, a case where arg promotion could infinitely promote. llvm-svn: 123574	2011-01-16 08:09:24 +00:00
Chris Lattner	ed1fb92cfe	simplify a little llvm-svn: 123573	2011-01-16 07:11:21 +00:00
Chris Lattner	c326ebd118	add some commentary llvm-svn: 123572	2011-01-16 06:39:44 +00:00
Chris Lattner	6fab2e9418	if an alloca is only ever accessed as a unit, and is accessed with load/store instructions, then don't try to decimate it into its individual pieces. This will just make a mess of the IR and is pointless if none of the elements are individually accessed. This was generating really terrible code for std::bitset (PR8980) because it happens to be lowered by clang as an {[8 x i8]} structure instead of {i64}. The testcase now is optimized to: define i64 @test2(i64 %X) { br label %L2 L2: ; preds = %0 ret i64 %X } before we generated: define i64 @test2(i64 %X) { %sroa.store.elt = lshr i64 %X, 56 %1 = trunc i64 %sroa.store.elt to i8 %sroa.store.elt8 = lshr i64 %X, 48 %2 = trunc i64 %sroa.store.elt8 to i8 %sroa.store.elt9 = lshr i64 %X, 40 %3 = trunc i64 %sroa.store.elt9 to i8 %sroa.store.elt10 = lshr i64 %X, 32 %4 = trunc i64 %sroa.store.elt10 to i8 %sroa.store.elt11 = lshr i64 %X, 24 %5 = trunc i64 %sroa.store.elt11 to i8 %sroa.store.elt12 = lshr i64 %X, 16 %6 = trunc i64 %sroa.store.elt12 to i8 %sroa.store.elt13 = lshr i64 %X, 8 %7 = trunc i64 %sroa.store.elt13 to i8 %8 = trunc i64 %X to i8 br label %L2 L2: ; preds = %0 %9 = zext i8 %1 to i64 %10 = shl i64 %9, 56 %11 = zext i8 %2 to i64 %12 = shl i64 %11, 48 %13 = or i64 %12, %10 %14 = zext i8 %3 to i64 %15 = shl i64 %14, 40 %16 = or i64 %15, %13 %17 = zext i8 %4 to i64 %18 = shl i64 %17, 32 %19 = or i64 %18, %16 %20 = zext i8 %5 to i64 %21 = shl i64 %20, 24 %22 = or i64 %21, %19 %23 = zext i8 %6 to i64 %24 = shl i64 %23, 16 %25 = or i64 %24, %22 %26 = zext i8 %7 to i64 %27 = shl i64 %26, 8 %28 = or i64 %27, %25 %29 = zext i8 %8 to i64 %30 = or i64 %29, %28 ret i64 %30 } In this case, instcombine was able to eliminate the nonsense, but in PR8980 enough PHIs are in play that instcombine backs off. It's better to not generate this stuff in the first place. llvm-svn: 123571	2011-01-16 06:18:28 +00:00
Chris Lattner	7cd8cf7d24	Use an irbuilder to get some trivial constant folding when doing a store of a constant. llvm-svn: 123570	2011-01-16 05:58:24 +00:00
Chris Lattner	adb1a233b1	remove a dead check, this was needed before we had an explicit veto on uses of phis. llvm-svn: 123569	2011-01-16 05:37:55 +00:00
Chris Lattner	d55581ded8	enhance FoldOpIntoPhi in instcombine to try harder when a phi has multiple uses. In some cases, all the uses are the same operation, so instcombine can go ahead and promote the phi. In the testcase this pushes an add out of the loop. llvm-svn: 123568	2011-01-16 05:28:59 +00:00
Evan Cheng	572756ac11	Spill R4 if it's going to be used to restore SP from FP. llvm-svn: 123567	2011-01-16 05:14:33 +00:00
Chris Lattner	ea7131a062	remove the AllowAggressive argument to FoldOpIntoPhi. It is forced to false in the first line of the function because it isn't a good idea, even for compares. llvm-svn: 123566	2011-01-16 05:14:26 +00:00
Chris Lattner	ff2e737714	more cleanups: use the IR builder. llvm-svn: 123565	2011-01-16 05:08:00 +00:00
Chris Lattner	25ce280511	tidy up code. llvm-svn: 123564	2011-01-16 04:37:29 +00:00
Owen Anderson	4e54efd625	Improve the safety of my globalopt enhancement by ensuring that the bitcast of the stored value to the new store type is always. Also, add a testcase. llvm-svn: 123563	2011-01-16 04:33:33 +00:00
Chris Lattner	08f43456c9	fix PR8983, a broken assertion. llvm-svn: 123562	2011-01-16 03:43:53 +00:00
Venkatraman Govindaraju	1b0e2cbf3f	Implement AnalyzeBranch in Sparc Backend. llvm-svn: 123561	2011-01-16 03:15:11 +00:00
Chris Lattner	218092e68e	fix PR8981, a crash trying to form a conditional inc with a floating point compare. llvm-svn: 123560	2011-01-16 02:56:53 +00:00
Chris Lattner	2d186574a6	reapply my fix for PR8961 with a tweak to properly handle multi-instruction sequences like calls. Many thanks to Jakob for finding a testcase. llvm-svn: 123559	2011-01-16 02:27:38 +00:00
Chris Lattner	8b4952fcf7	simplify this code, it is still broken but will follow up on llvm-commits. llvm-svn: 123558	2011-01-16 02:05:10 +00:00
Michael J. Spencer	2ff30b84f8	Revert "Archive: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1." llvm-svn: 123557	2011-01-16 01:43:22 +00:00
Chandler Carruth	ef28abefd0	Simplify a README.txt entry significantly to expose the core issue. llvm-svn: 123556	2011-01-16 01:40:23 +00:00
Chris Lattner	c703334ff1	one of michael's recent patches broke this, temporarily disable it so the bots go green llvm-svn: 123555	2011-01-16 01:04:49 +00:00
Chris Lattner	1e209b87ad	remove the partial specialization pass. It is unmaintained and has bugs. llvm-svn: 123554	2011-01-16 00:27:10 +00:00
Michael J. Spencer	53dcdc7420	Archive: Fix spelling. llvm-svn: 123552	2011-01-15 21:43:45 +00:00
Michael J. Spencer	a0ce763290	Archive: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1. llvm-svn: 123551	2011-01-15 21:43:37 +00:00
Michael J. Spencer	8685f387eb	Support/GraphWriter: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1. llvm-svn: 123550	2011-01-15 21:43:25 +00:00
Benjamin Kramer	bec03ea725	Add an assert so we don't silently miscompile ctpop for bit widths > 128. llvm-svn: 123549	2011-01-15 21:19:37 +00:00
Michael J. Spencer	94b2ab3556	Support/PathV2: Add identify_magic. llvm-svn: 123548	2011-01-15 20:39:36 +00:00
Benjamin Kramer	fff2517edc	Reimplement CTPOP legalization with the "best" algorithm from http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel In a silly microbenchmark on a 65 nm core2 this is 1.5x faster than the old code in 32 bit mode and about 2x faster in 64 bit mode. It's also a lot shorter, especially when counting 64 bit population on a 32 bit target. I hope this is fast enough to replace Kernighan-style counting loops even when the input is rather sparse. llvm-svn: 123547	2011-01-15 20:30:30 +00:00
Michael J. Spencer	b587180fa7	Unittests/Support/Path: Tweak test. llvm-svn: 123546	2011-01-15 18:52:49 +00:00
Michael J. Spencer	7887466adc	Support/PathV2: Implement has_magic in terms of get_magic. llvm-svn: 123545	2011-01-15 18:52:41 +00:00
Michael J. Spencer	ee1699c362	Support/PathV2: Implement get_magic. llvm-svn: 123544	2011-01-15 18:52:33 +00:00
Nick Lewycky	4a1ff16b29	Add missing whitespace. llvm-svn: 123543	2011-01-15 18:42:52 +00:00
Nick Lewycky	0296a481f9	Make constmerge a two-pass algorithm so that it won't miss merging opporuntities. Fixes PR8978. llvm-svn: 123541	2011-01-15 18:14:21 +00:00
Oscar Fuentes	25ac830e72	Make config.h.cmake similar to config.h.in Patch by arrowdodger! llvm-svn: 123539	2011-01-15 13:35:37 +00:00
Benjamin Kramer	ed5f2e504e	Try to unbreak selfhost. llvm-svn: 123537	2011-01-15 11:25:34 +00:00
Nick Lewycky	540f9536c8	Add a cache that protects mergefunc's internals from more surprises in DenseSet. Also, replace tabs with spaces. Yes, it's 2011. llvm-svn: 123535	2011-01-15 10:16:23 +00:00
Nick Lewycky	367f98f000	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block. Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test. llvm-svn: 123533	2011-01-15 09:16:12 +00:00
Rafael Espindola	f1ed781aea	Add a clarification about merging constants with and without unnamed_addr. llvm-svn: 123530	2011-01-15 08:20:57 +00:00
Rafael Espindola	489e505adf	Allow unnamed_addr on declarations. llvm-svn: 123529	2011-01-15 08:15:00 +00:00
Chris Lattner	af26390790	temporarily revert r123526. While working on a follow-on patch I realize that ConstantFoldTerminator doesn't preserve dominfo. llvm-svn: 123527	2011-01-15 07:51:19 +00:00
Chris Lattner	8df83c4a24	fix rdar://8785296 - -fcatch-undefined-behavior generates inefficient code The basic issue is that isel (very reasonably!) expects conditional branches to be folded, so CGP leaving around a bunch dead computation feeding conditional branches isn't such a good idea. Just fold branches on constants into unconditional branches. llvm-svn: 123526	2011-01-15 07:36:13 +00:00
Chris Lattner	ee588defc6	simplify code, no functionality change. llvm-svn: 123525	2011-01-15 07:29:01 +00:00
Chris Lattner	1b93be501d	Now that instruction optzns can update the iterator as they go, we can have objectsize folding recursively simplify away their result when it folds. It is important to catch this here, because otherwise we won't eliminate the cross-block values at isel and other times. llvm-svn: 123524	2011-01-15 07:25:29 +00:00
Chris Lattner	7a2771440f	make the current instruction iterator an ivar, allowing xforms that potentially invalidate it (like inline asm lowering) to be sunk into their proper place, cleaning up a ton of code. llvm-svn: 123523	2011-01-15 07:14:54 +00:00
Chris Lattner	9c10d587f6	implement an instcombine xform that canonicalizes casts outside of and-with-constant operations. This fixes rdar://8808586 which observed that we used to compile: union xy { struct x { _Bool b[15]; } x; __attribute__((packed)) struct y { __attribute__((packed)) unsigned long b0to7; __attribute__((packed)) unsigned int b8to11; __attribute__((packed)) unsigned short b12to13; __attribute__((packed)) unsigned char b14; } y; }; struct x foo(union xy *xy) { return xy->x; } into: _foo: ## @foo movq (%rdi), %rax movabsq $1095216660480, %rcx ## imm = 0xFF00000000 andq %rax, %rcx movabsq $-72057594037927936, %rdx ## imm = 0xFF00000000000000 andq %rax, %rdx movzbl %al, %esi orq %rdx, %rsi movq %rax, %rdx andq $65280, %rdx ## imm = 0xFF00 orq %rsi, %rdx movq %rax, %rsi andq $16711680, %rsi ## imm = 0xFF0000 orq %rdx, %rsi movl %eax, %edx andl $-16777216, %edx ## imm = 0xFFFFFFFFFF000000 orq %rsi, %rdx orq %rcx, %rdx movabsq $280375465082880, %rcx ## imm = 0xFF0000000000 movq %rax, %rsi andq %rcx, %rsi orq %rdx, %rsi movabsq $71776119061217280, %r8 ## imm = 0xFF000000000000 andq %r8, %rax orq %rsi, %rax movzwl 12(%rdi), %edx movzbl 14(%rdi), %esi shlq $16, %rsi orl %edx, %esi movq %rsi, %r9 shlq $32, %r9 movl 8(%rdi), %edx orq %r9, %rdx andq %rdx, %rcx movzbl %sil, %esi shlq $32, %rsi orq %rcx, %rsi movl %edx, %ecx andl $-16777216, %ecx ## imm = 0xFFFFFFFFFF000000 orq %rsi, %rcx movq %rdx, %rsi andq $16711680, %rsi ## imm = 0xFF0000 orq %rcx, %rsi movq %rdx, %rcx andq $65280, %rcx ## imm = 0xFF00 orq %rsi, %rcx movzbl %dl, %esi orq %rcx, %rsi andq %r8, %rdx orq %rsi, %rdx ret We now compile this into: _foo: ## @foo ## BB#0: ## %entry movzwl 12(%rdi), %eax movzbl 14(%rdi), %ecx shlq $16, %rcx orl %eax, %ecx shlq $32, %rcx movl 8(%rdi), %edx orq %rcx, %rdx movq (%rdi), %rax ret A small improvement :-) llvm-svn: 123520	2011-01-15 06:32:33 +00:00
Chris Lattner	c23ca1f217	fix typo llvm-svn: 123519	2011-01-15 06:27:35 +00:00
Chris Lattner	76580f0ec3	Fix m_Not and m_Neg to not match random ConstantInt's. Before these would try hard to match constants by inverting the bits and recursively matching. There are two problems with this: 1) some patterns would match when we didn't want them to (theoretical) 2) this is insanely expensive to do, and most often pointless. This was apparently useful in just 2 instcombine cases, which I added code to handle explicitly. This change speeds up 'opt' time on 176.gcc by 1% and produces bitwise identical code. llvm-svn: 123518	2011-01-15 05:52:27 +00:00
Chris Lattner	e20dd530d0	one more instcombine variant that is needed to work with future changes, no functionality change currently. llvm-svn: 123517	2011-01-15 05:50:18 +00:00
Chris Lattner	497459d5fd	fix typo llvm-svn: 123516	2011-01-15 05:42:47 +00:00
Chris Lattner	f3c4eefff8	Catch ~x < cst just like ~x < ~y, we currently handle this through means that are about to disappear. llvm-svn: 123515	2011-01-15 05:41:33 +00:00
Chris Lattner	311aa63c87	reduce indentation llvm-svn: 123514	2011-01-15 05:40:29 +00:00
Eric Christopher	cc385c0c97	80-col. llvm-svn: 123505	2011-01-15 00:25:09 +00:00
Chris Lattner	b68ec5c339	Generalize LoadAndStorePromoter a bit and switch LICM to use it. llvm-svn: 123501	2011-01-15 00:12:35 +00:00
Bob Wilson	b7a3c42eae	Fix a comment. llvm-svn: 123497	2011-01-15 00:09:18 +00:00
Eric Christopher	2af9551ebf	Fix 80-cols. llvm-svn: 123494	2011-01-14 23:50:53 +00:00
Ted Kremenek	e92b6e436d	Update CMake build. llvm-svn: 123491	2011-01-14 22:58:11 +00:00
Bob Wilson	03912aba9a	Fix some tablegen issues to allow using zero_reg for InstAlias definitions. This is needed to allow an InstAlias for an instruction with an "OptionalDef" result register (like ARM's cc_out) where you want to set the optional register to reg0. llvm-svn: 123490	2011-01-14 22:58:09 +00:00
Ted Kremenek	6677f65de1	Fix memory leak found by clang static analyzer. llvm-svn: 123487	2011-01-14 22:34:17 +00:00
Ted Kremenek	b5241b2b59	'HiReg' is written but never read. Nuke its declaration and its assignments. Found by clang static analyzer. llvm-svn: 123486	2011-01-14 22:34:13 +00:00
Owen Anderson	3e2f6cf7ae	Fix a false-positive warning. llvm-svn: 123480	2011-01-14 22:31:13 +00:00
Dan Gohman	abac063b7a	Delete an assignment to ThisBB which isn't needed, and tidy up some comments. llvm-svn: 123479	2011-01-14 22:26:16 +00:00
Owen Anderson	9eb7cb48e4	Enhance GlobalOpt to be able evaluate initializers that involve stores through bitcasts, at least in simple cases. This fixes clang's CodeGenCXX/virtual-base-dtor.cpp llvm-svn: 123477	2011-01-14 22:19:20 +00:00
Anton Korobeynikov	9be547cfd3	Add a possibility to switch between CFI directives- and table-based frame description emission. Currently all the backends use table-based stuff. llvm-svn: 123476	2011-01-14 21:58:08 +00:00
Anton Korobeynikov	4d9de6be4b	Cleanup llvm-svn: 123475	2011-01-14 21:57:58 +00:00
Anton Korobeynikov	b46ef57de5	Add CFI directives-based frame information emission. Not hooked yet. llvm-svn: 123474	2011-01-14 21:57:53 +00:00
Anton Korobeynikov	61d167e92b	Split stuff as a preparation for CFI directives-based frame information emission llvm-svn: 123473	2011-01-14 21:57:45 +00:00
Anton Korobeynikov	e2bea1c82e	Use common style for .cfi directives llvm-svn: 123472	2011-01-14 21:57:39 +00:00
Andrew Trick	9ccce77893	Support for precise scheduling of the instruction selection DAG, disabled in this checkin. Sorry for the large diffs due to refactoring. New functionality is all guarded by EnableSchedCycles. Scheduling the isel DAG is inherently imprecise, but we give it a best effort: - Added MayReduceRegPressure to allow stalled nodes in the queue only if there is a regpressure need. - Added BUHasStall to allow checking for either dependence stalls due to latency or resource stalls due to pipeline hazards. - Added BUCompareLatency to encapsulate and standardize the heuristics for minimizing stall cycles (vs. reducing register pressure). - Modified the bottom-up heuristic (now in BUCompareLatency) to prioritize nodes by their depth rather than height. As long as it doesn't stall, height is irrelevant. Depth represents the critical path to the DAG root. - Added hybrid_ls_rr_sort::isReady to filter stalled nodes before adding them to the available queue. Related Cleanup: most of the register reduction routines do not need to be templates. llvm-svn: 123468	2011-01-14 21:11:41 +00:00
Chris Lattner	b498f9aff3	switch SRoA to use LoadAndStorePromoter instead of its own copy of the code. llvm-svn: 123457	2011-01-14 19:50:47 +00:00
Chris Lattner	95294b8796	Add a new LoadAndStorePromoter class, which implements the general "promote a bunch of load and stores" logic, allowing the code to be shared and reused. llvm-svn: 123456	2011-01-14 19:36:13 +00:00
Jay Foad	cbe1505617	OperandTraits<>::Layout isn't used for anything. Remove it. llvm-svn: 123452	2011-01-14 18:41:56 +00:00
Rafael Espindola	b1ebba9ec3	Update llvm-gcc's tests. llvm-svn: 123447	2011-01-14 17:01:20 +00:00
Oscar Fuentes	959d253476	Reorder macros on config.h.cmake to easily compare it against config.h.in. Patch by arrowdodger! llvm-svn: 123445	2011-01-14 16:41:03 +00:00
Devang Patel	610c41e7b0	Disable debug mode. llvm-svn: 123443	2011-01-14 15:55:50 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	c3eb0f4b2e	Rename this test. llvm-svn: 123440	2011-01-14 14:16:33 +00:00
Chris Lattner	8d7716a220	switch the second scalarrepl pass to use SSAUpdater. We run two scalarrepl passes: one early in the cleanup code and one late interlaced with the inliner. The second one is important because inlining and other scalar optzns can unpin allocas, allowing them to be split up and promoted. While important for performance, this is also relatively rare, and we would previously force a (non-lazy) computation of DomFrontiers, which happened even if nothing became unpinned. With this patch, the first pass of scalarrepl still promotes the vast bulk of allocas in programs, but hte second pass has changed to use SSAUpdater, which is more "sparse" and lazy. This speeds up opt -O3 time on kimwitu++ (a c++ app) by about 1%. The numbers are interesting: the first pass promotes ~17500 allocas. The second pass promotes about 1600. For non-C++ codes, the compile time win should be greater, because the second pass of scalarrepl does less. llvm-svn: 123437	2011-01-14 08:21:08 +00:00
Chris Lattner	9987a6f49b	split SROA into two passes: one that uses DomFrontiers (-scalarrepl) and one that uses SSAUpdater (-scalarrepl-ssa) llvm-svn: 123436	2011-01-14 08:13:00 +00:00
Jay Foad	1d4a8fe156	Remove casts between Value and Constant, which won't work if a static_cast from Constant* to Value* has to adjust the "this" pointer. This is groundwork for PR889. llvm-svn: 123435	2011-01-14 08:07:43 +00:00
Chris Lattner	543384efb4	Implement full support for promoting allocas to registers using SSAUpdater instead of DomTree/DomFrontier. This may be interesting for reducing compile time. This is currently disabled, but seems to work just fine. When this is enabled, we eliminate two runs of dominator frontier, one in the "early per-function" optimizations and one in the "interlaced with inliner" function passes. llvm-svn: 123434	2011-01-14 07:50:47 +00:00
Chris Lattner	5e0fef8531	relax testcase a bit. llvm-svn: 123433	2011-01-14 07:46:33 +00:00
Jakob Stoklund Olesen	ab3d6ecbd2	Try for the third time to teach getFirstTerminator() about debug values. This time let's rephrase to trick gcc-4.3 into not miscompiling. llvm-svn: 123432	2011-01-14 06:33:45 +00:00
Chris Lattner	e93e4f118c	revert my fastisel patch again which apparently still gives the llvm-gcc-i386-linux-selfhost buildbot heartburn... llvm-svn: 123431	2011-01-14 06:14:33 +00:00
Chris Lattner	5ca1391003	reapply r123414 now that the botz are calmed down and the fix is already in. llvm-svn: 123427	2011-01-14 04:24:28 +00:00
Chris Lattner	90f3a9a1c7	indentation llvm-svn: 123426	2011-01-14 04:23:53 +00:00
Evan Cheng	d4a5c05c97	Completed :lower16: / :upper16: support for movw / movt pairs on Darwin. - Fixed :upper16: fix up routine. It should be shifting down the top 16 bits first. - Added support for Thumb2 :lower16: and :upper16: fix up. - Added :upper16: and :lower16: relocation support to mach-o object writer. llvm-svn: 123424	2011-01-14 02:38:49 +00:00
Jakob Stoklund Olesen	c38102889f	Revert r123419. It still breaks llvm-gcc-i386-linux-selfhost. llvm-svn: 123423	2011-01-14 02:12:54 +00:00
Chris Lattner	21a64979f1	r123414 broke llvm-gcc bootstrap apparently, revert llvm-svn: 123422	2011-01-14 02:07:32 +00:00
Chris Lattner	3be81e9bd7	Set the insertion point correctly for instructions generated by load folding: they should go before the new instruction not after it. llvm-svn: 123420	2011-01-14 01:33:40 +00:00
Jakob Stoklund Olesen	c0767e029d	Try again to teach getFirstTerminator() about debug values. Fix some callers to better deal with debug values. llvm-svn: 123419	2011-01-14 01:17:53 +00:00
Owen Anderson	e3ed20ce9c	Rather than doing early instcombine, try doing early CSE instead. This should still handle most important simplifications, as well as resolving phase ordering issues where instcombine would inhibit important CSE'ing opportunities, for instance on BitBench/drop3. llvm-svn: 123418	2011-01-14 00:41:11 +00:00
Duncan Sands	7f60dc1eb0	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Owen Anderson	ae6ce377c2	Don't bother conditionalizing the use of SROA in -O1 mode. We're already running it unconditionally later in the pipeline. llvm-svn: 123416	2011-01-14 00:36:40 +00:00
Chris Lattner	0c34cb429e	fix PR8961 - a fast isel miscompilation where we'd insert a new instruction after sext's generated for addressing that got folded. Previously we compiled test5 into: _test5: ## @test5 ## BB#0: movq -8(%rsp), %rax ## 8-byte Reload movq (%rdi,%rax), %rdi addq %rdx, %rdi movslq %esi, %rax movq %rax, -8(%rsp) ## 8-byte Spill movq %rdi, %rax ret which is insane and wrong. Now we produce: _test5: ## @test5 ## BB#0: movslq %esi, %rax movq (%rdi,%rax), %rax addq %rdx, %rax ret llvm-svn: 123414	2011-01-14 00:01:01 +00:00
Jakob Stoklund Olesen	088b30aa48	Better terminator avoidance. This approach also works when the terminator doesn't have a slot index. (Which can happen??) llvm-svn: 123413	2011-01-13 23:35:53 +00:00
Evan Cheng	52899a9c34	Add comment about Thumb2 fixup comments being completely bogus. llvm-svn: 123411	2011-01-13 23:27:39 +00:00
Tobias Grosser	b1d11c19da	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Owen Anderson	a098d1505d	Recognize alternative register names like ip -> r12. Fixes <rdar://problem/8857982>. llvm-svn: 123409	2011-01-13 22:50:36 +00:00
Jakob Stoklund Olesen	bbb1a54b84	Fix a few more places that should use MBB::getLastNonDebugInstr(). llvm-svn: 123408	2011-01-13 22:47:43 +00:00
Owen Anderson	ec47597ecd	As far as I can tell, unified syntax uses c0-c15 instead of cr0-cr15 for mcr and friends. llvm-svn: 123407	2011-01-13 22:38:16 +00:00
Chris Lattner	b6c3aff1cb	typo llvm-svn: 123406	2011-01-13 22:11:56 +00:00
Chris Lattner	b9cdf393a4	memcpy + metadata = bliss :) llvm-svn: 123405	2011-01-13 22:08:15 +00:00
Owen Anderson	c3c7f5dd56	Add support to the ARM MC infrastructure to support mcr and friends. This requires supporting the symbolic immediate names used for these instructions, fixing their pretty-printers, and adding proper encoding information for them. With this, we can properly pretty-print and encode assembly like: mrc p15, #0, r3, c13, c0, #3 Fixes <rdar://problem/8857858>. llvm-svn: 123404	2011-01-13 21:46:02 +00:00
Evan Cheng	0447d30939	Relax an assertion. On archs like ARM, an immediate field may be scattered. So it's possible for some bits of every 8 bits to be encoded already, and the rest still needs to be fixed up. llvm-svn: 123403	2011-01-13 21:45:26 +00:00
Jakob Stoklund Olesen	05a0b55e76	Temporary workaround for an i386 crash in LiveDebugVariables. llvm-svn: 123400	2011-01-13 21:28:55 +00:00
Jakob Stoklund Olesen	4bc5e38960	Teach frame lowering to ignore debug values after the terminators. llvm-svn: 123399	2011-01-13 21:28:52 +00:00
Bob Wilson	657f227d08	Tidy comments, indentation, and 80-column violations. llvm-svn: 123397	2011-01-13 21:10:12 +00:00
Bob Wilson	328e91bbe1	Fix whitespace. llvm-svn: 123396	2011-01-13 20:59:44 +00:00
Kevin Enderby	b084be90e8	Fix ARMAsmParser::ParseOperand() to allow it to parse . as a branch target and directional local labels like 1f and 2b. llvm-svn: 123393	2011-01-13 20:32:36 +00:00
Devang Patel	331df548c3	Little help to debug the bugpoint itself. Patch by Bob Wilson. llvm-svn: 123390	2011-01-13 19:48:54 +00:00
Devang Patel	35f4ae26d6	Speculatively revert r123384 to make llvm-gcc-i386-linux-selfhost buildbot happy. llvm-svn: 123389	2011-01-13 19:27:50 +00:00
Oscar Fuentes	0a1b6b8ff2	Add some platform tests. Patch by arrowdodger! llvm-svn: 123388	2011-01-13 19:17:28 +00:00
Jim Grosbach	4424e7c4b8	When updating a tSpill/tRestore instruction to be a tSTRr/tLDRr, correctly set up the source operands. The original instr has an immediate operand that should be replaced with the frame reg operand rather than just adding the reg operand. Previously, the instruction ended up with too many operands causing an assert() when adding the default predicate. rdar://8825456 llvm-svn: 123387	2011-01-13 19:16:48 +00:00
Jakob Stoklund Olesen	0e233ae183	Teach MachineBasicBlock::getFirstTerminator to ignore debug values. It will still return an iterator that points to the first terminator or end(), but there may be DBG_VALUE instructions following the first terminator. llvm-svn: 123384	2011-01-13 18:41:05 +00:00
Bob Wilson	c8056a952e	Check for empty structs, and for consistency, zero-element arrays. llvm-svn: 123383	2011-01-13 18:26:59 +00:00
Bob Wilson	08713d3c5f	Extend SROA to handle arrays accessed as homogeneous structs and vice versa. This is a minor extension of SROA to handle a special case that is important for some ARM NEON operations. Some of the NEON intrinsics return multiple values, which are handled as struct types containing multiple elements of the same vector type. The corresponding return types declared in the arm_neon.h header have equivalent arrays. We need SROA to recognize that it can split up those arrays and structs into separate vectors, even though they are not always accessed with the same type. SROA already handles loads and stores of an entire alloca by using insertvalue/extractvalue to access the individual pieces, and that code works the same regardless of whether the type is a struct or an array. So, all that needs to be done is to check for compatible arrays and homogeneous structs. llvm-svn: 123381	2011-01-13 17:45:11 +00:00
Bob Wilson	12eec40c83	Make SROA more aggressive with allocas containing padding. SROA only split up structs and arrays one level at a time, so padding can only cause trouble if it is located in between the struct or array elements. llvm-svn: 123380	2011-01-13 17:45:08 +00:00
Oscar Fuentes	b7c43b8c59	Disable RTTI when building unit tests. This avoids errors at link time. llvm-svn: 123377	2011-01-13 15:31:45 +00:00
Oscar Fuentes	f8e26b123c	Platform tests for argz_* functions. Patch by arrowdodger! llvm-svn: 123376	2011-01-13 15:06:32 +00:00
Duncan Sands	ad000d8f16	Remove some wrong code which fortunately was never executed (as explained in the comment I added): an extern weak global may have a null address. llvm-svn: 123373	2011-01-13 10:43:08 +00:00
Duncan Sands	8d25a7c3a0	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Evan Cheng	965b3c7323	Model :upper16: and :lower16: as ARM specific MCTargetExpr. This is a step in the right direction. It eliminated some hacks and will unblock codegen work. But it's far from being done. It doesn't reject illegal expressions, e.g. (FOO - :lower16:BAR). It also doesn't work in Thumb2 mode at all. llvm-svn: 123369	2011-01-13 07:58:56 +00:00
Eric Christopher	da2d2f4d1f	Experiment with changing the default 32-bit linux stack alignment to 16 bytes for PR8969. Update all testcases accordingly. llvm-svn: 123367	2011-01-13 06:47:10 +00:00
Rafael Espindola	9ebe8ce68c	Keep unnamed_addr when linking. llvm-svn: 123364	2011-01-13 05:12:34 +00:00
Rafael Espindola	026d152e58	Reject uses of unnamed_addr in declarations. llvm-svn: 123358	2011-01-13 01:30:30 +00:00
Kevin Enderby	4d58d5f88f	Add a FIXME and two asserts for now in the ARMAsmParser when it sees .code 16 or .code 32 if the TargetMachine's isThumb() boolean does not match. The correct fix is to switch ARM subtargets at that point and is tracked by rdar://8856789 which is bigger task. llvm-svn: 123353	2011-01-13 01:07:01 +00:00
Dan Gohman	958620dd6d	Fix r123346 to handle scalar types too. llvm-svn: 123352	2011-01-13 01:06:51 +00:00
Jakob Stoklund Olesen	9472847bcc	Add missing space in debug output llvm-svn: 123351	2011-01-13 00:57:35 +00:00
Jason W Kim	9322997b60	Change call to Error() to assert() llvm-svn: 123350	2011-01-13 00:27:00 +00:00
Jason W Kim	39e36e7ab4	Style clean up - break up the breaks. llvm-svn: 123347	2011-01-13 00:07:51 +00:00
Dan Gohman	6e017a1134	Apply the patch from PR8958, which allows llc to get slightly further on the associated testcase before aborting. llvm-svn: 123346	2011-01-12 23:56:26 +00:00
Michael J. Spencer	d9960c69b5	Support/Path: Deprecate PathV1::IsSymlink and replace all uses with PathV2::is_symlink. llvm-svn: 123345	2011-01-12 23:55:06 +00:00
Jakob Stoklund Olesen	74ded57bb8	Try again enabling LiveDebugVariables. llvm-svn: 123342	2011-01-12 23:36:21 +00:00
Jason W Kim	1455842275	Added clarifying comment llvm-svn: 123341	2011-01-12 23:25:02 +00:00
Jason W Kim	e9eae0f887	JimG sez: "The value-kinds look like masks, but they're not consistently used that way, unfortunately. If you want to change them to work additively instead of a one-variant-kind-per-symbolref, that's great and I completely agree it's worth doing, but it really should be a separate patch. Until then, this isn't correct." So I am reverting this bit until a more opportune time. llvm-svn: 123340	2011-01-12 23:21:49 +00:00
Jakob Stoklund Olesen	e63dfeee36	Don't emit a DBG_VALUE for a spill slot that the rewriter decided not to use after all. llvm-svn: 123339	2011-01-12 23:14:07 +00:00
Jakob Stoklund Olesen	2ffee66e10	Fix braino in dominator tree walk. llvm-svn: 123338	2011-01-12 23:14:04 +00:00
Jakob Stoklund Olesen	1a3534afc4	Sometimes, old virtual registers can linger on DBG_VALUE instructions. Make sure we don't crash in that case, but simply turn them into %noreg instead. llvm-svn: 123335	2011-01-12 22:37:49 +00:00
Jakob Stoklund Olesen	013c4649c0	Teach VirtRegRewriter to update slot indexes when erasing instructions. It was leaving dangling pointers in the slot index maps. llvm-svn: 123334	2011-01-12 22:28:51 +00:00
Jakob Stoklund Olesen	71a3853332	Annotate VirtRegRewriter debug output with slot indexes. llvm-svn: 123333	2011-01-12 22:28:48 +00:00
Jakob Stoklund Olesen	58b6f4d832	Verify slot index ordering. The slot indexes must be monotonically increasing through the function. llvm-svn: 123324	2011-01-12 21:27:48 +00:00
Jakob Stoklund Olesen	62e01eefde	Assert if anybody tries to put a slot index on a DBG_VALUE instruction. llvm-svn: 123323	2011-01-12 21:27:45 +00:00
Jakob Stoklund Olesen	b5b4a5d0ba	Verify that machine instruction parent pointers are consistent. llvm-svn: 123322	2011-01-12 21:27:41 +00:00
Bill Wendling	e6ff05c59d	Sort the register list based on the actual register numbers rather than the enum values we give to them. <rdar://problem/8823730> llvm-svn: 123321	2011-01-12 21:20:59 +00:00
Devang Patel	30f3ebbc1f	Use SmallVector instead of SmallPtrSet and avoid non-deterministic behavior. llvm-svn: 123318	2011-01-12 19:12:45 +00:00
Matt Beaumont-Gay	3077bb64e9	Mostly undo r123297, but move the default case in EvaluateAsPCRel to the top of the switch block to appease GCC. llvm-svn: 123317	2011-01-12 18:02:55 +00:00
Nick Lewycky	7ecc2fc4ca	Add another note taken from the gcc bugzilla. llvm-svn: 123315	2011-01-12 09:06:19 +00:00
Venkatraman Govindaraju	d964580fea	Implement RETURNADDR and FRAMEADDR lowering in SPARC backend. llvm-svn: 123310	2011-01-12 05:08:36 +00:00
Venkatraman Govindaraju	ee347f8091	Remove SPARC backend getpcx instruction's Uses. Also, insert an assert to ensure %o7 is not assigned as the destination of getpcx instruction. llvm-svn: 123304	2011-01-12 03:52:59 +00:00
Chris Lattner	dd5f60b7a7	revert 123144, reenabling the rest of memset formation. llvm-svn: 123302	2011-01-12 03:25:15 +00:00
Venkatraman Govindaraju	3b71b0ae3d	Fix SPARC backend call instruction so that arguments passed through registers are correctly marked as used instead of passing all possible argument registers as used. llvm-svn: 123301	2011-01-12 03:18:21 +00:00
Chris Lattner	654098f411	revert r123146 which disabled code that wasn't the root cause of the bootstrap miscompare issue. llvm-svn: 123299	2011-01-12 01:52:23 +00:00
Chris Lattner	fa7c29d255	revert r123149, reenabling an improvement to memcpyopt that wasn't the source of the bootstrap problem. llvm-svn: 123298	2011-01-12 01:43:46 +00:00
Matt Beaumont-Gay	ea43172297	Prefer llvm_unreachable to assert(0) llvm-svn: 123297	2011-01-12 01:42:42 +00:00
Jason W Kim	9c5b65d289	1. Support ELF pcrel relocations for movw/movt: R_ARM_MOVT_PREL and R_ARM_MOVW_PREL_NC. 2. Fix minor bug in ARMAsmPrinter - treat bitfield flag as a bitfield, not an enum. 3. Add support for 3 new elf section types (no-ops) llvm-svn: 123294	2011-01-12 00:19:25 +00:00
Jason W Kim	1f7bc0707d	Workaround for bug 8721. .s Test added. llvm-svn: 123292	2011-01-11 23:53:41 +00:00
Jakob Stoklund Olesen	43812bfa92	The world is not ready for LiveDebugVariables yet. llvm-svn: 123290	2011-01-11 23:20:33 +00:00
Jakob Stoklund Olesen	12cc296bd4	Remove the PR8954 workaround. llvm-svn: 123288	2011-01-11 22:56:41 +00:00
Jakob Stoklund Olesen	f2407aa98b	Fix a non-deterministic loop in llvm::MergeBlockIntoPredecessor. DT->changeImmediateDominator() trivially ignores identity updates, so there is really no need for the uniqueing provided by SmallPtrSet. I expect this to fix PR8954. llvm-svn: 123286	2011-01-11 22:54:38 +00:00
Jakob Stoklund Olesen	8c98495f43	Enable LiveDebugVariables by default. llvm-svn: 123282	2011-01-11 22:45:28 +00:00
Venkatraman Govindaraju	4d6ade0e31	SPARC backend: correct ICC/FCC uses for ADDX and SELECT_CC llvm-svn: 123281	2011-01-11 22:38:28 +00:00
Cameron Zwarich	cb9c4f85ec	Dial back the speculative fix for PR8954 a bit, so that we only recompute dominators once at the beginning of GVN instead of once per iteration. llvm-svn: 123278	2011-01-11 22:14:42 +00:00
Jakob Stoklund Olesen	803f48bcd1	Don't insert DBG_VALUE instructions after the first terminator. For one, MachineBasicBlock::getFirstTerminator() doesn't understand what is happening, and it also makes sense to have all control flow run through the DBG_VALUE. llvm-svn: 123277	2011-01-11 22:11:16 +00:00
Evan Cheng	e45d685895	Clean up ARM subtarget code by using Triple ADT. llvm-svn: 123276	2011-01-11 21:46:47 +00:00
Devang Patel	447cb38fbe	Appropriately truncate debug info range in dwarf output. This is not yet completely enabled. llvm-svn: 123274	2011-01-11 21:42:10 +00:00
Jakob Stoklund Olesen	819eb4ed0b	Put the Dominator improvements back in. They were not the cause of bootstrap miscomparisons. llvm-svn: 123273	2011-01-11 21:23:09 +00:00
Cameron Zwarich	51eb403907	Attempt to fix the bootstrap buildbot. Rafael says this works for him on x86-64 Linux. llvm-svn: 123270	2011-01-11 20:23:34 +00:00
Jakob Stoklund Olesen	32bd3a1e9a	Speculatively revert the recent improvements to Dominators.h in an attempt to track down the gcc bootstrap miscompare. llvm-svn: 123254	2011-01-11 19:26:30 +00:00
Daniel Dunbar	09264124c1	McARM: Fill in GetMnemonicAcceptInfo(). llvm-svn: 123253	2011-01-11 19:06:29 +00:00
Daniel Dunbar	6492807291	McARM: Write a silly Python script to compute some hard coded info from the generated ARM match table, which is substantially more efficient than dealing with tblgen. llvm-svn: 123252	2011-01-11 19:06:26 +00:00
Owen Anderson	0022a4b417	Remove dead variable, const-ref-ize an APInt. llvm-svn: 123248	2011-01-11 18:26:37 +00:00
Chris Lattner	d41db8f9cb	this pass claims to preserve scev, make sure to tell it about deletions. llvm-svn: 123247	2011-01-11 18:14:50 +00:00
Bob Wilson	e5863d6639	Fix a comment: We now have intrinsics for vcvtr. llvm-svn: 123246	2011-01-11 17:56:41 +00:00
Chris Lattner	d30de95520	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Chris Lattner	abd2dfd3dc	Fix PR8946, a missing reg/reg form of movdqu. llvm-svn: 123242	2011-01-11 17:04:55 +00:00
Daniel Dunbar	5a384c86b2	McARM: Sketch some logic for determining when to add carry set and predication code operands based on the "canonical mnemonic". llvm-svn: 123239	2011-01-11 15:59:53 +00:00
Daniel Dunbar	9d944b3fcc	McARM: Add more hard coded logic to SplitMnemonicAndCC to also split out the carry setting flag from the mnemonic. Note that this currently involves me disabling a number of working cases in arm_instructions.s, this is a hopefully short term evil which will be rapidly fixed (and greatly surpassed), assuming my current approach flies. llvm-svn: 123238	2011-01-11 15:59:50 +00:00
Jay Foad	c8adf5f458	FixedNumOperandTraits and VariadicOperandTraits assumed that, given a "this" pointer for any subclass of User, you could static_cast it to User* and then reinterpret_cast that to Use* to get the end of the operand list. This isn't a safe assumption in general, because the static_cast might adjust the "this" pointer. Fixed by having these OperandTraits classes take an extra template parameter, which is the subclass of User. This is groundwork for PR889. llvm-svn: 123235	2011-01-11 15:07:38 +00:00
Frits van Bommel	8e158495f1	Factor the actual simplification out of SimplifyIndirectBrOnSelect and into a new helper function so it can be reused in e.g. an upcoming SimplifySwitchOnSelect. No functional change. llvm-svn: 123234	2011-01-11 12:52:11 +00:00
Oscar Fuentes	c8cb58e130	Add to the CMake build some options and platform tests supported by the traditional build. Patch by arrowdodger! llvm-svn: 123233	2011-01-11 12:31:54 +00:00
Oscar Fuentes	77c4f70123	Made llvm_replace_compiler_option more robust. Use it on llvm_process_sources. llvm-svn: 123232	2011-01-11 12:31:34 +00:00
Kalle Raiskila	a5538cdbf9	Fix a thinko in 123226 that caused test failures on "other" platforms. llvm-svn: 123229	2011-01-11 11:27:56 +00:00
Eric Christopher	31bb4c5811	Revert the testcase from the previous reverted commit. llvm-svn: 123227	2011-01-11 09:20:44 +00:00
Kalle Raiskila	be9ad1e631	Add a "nop filler" pass to SPU. Filling no-ops is done just before emitting of assembly, when the instruction stream is final. No-ops are inserted to align the instructions so the dual-issue of the pipeline is utilized. This speeds up generated code with a minimum of 1% on a select set of algorithms. This pass may be redundant if the instruction scheduler and all subsequent passes that modify the instruction stream (prolog+epilog inserter, register scavenger, are there others?) are made aware of the instruction alignments. llvm-svn: 123226	2011-01-11 09:07:54 +00:00
Eric Christopher	23bf3bafb7	Temporarily revert 123133, it's causing some regressions and I'm trying to get a testcase. llvm-svn: 123225	2011-01-11 09:02:09 +00:00
Chris Lattner	193ce7c4d1	update memdep when an instruction is deleted. This code isn't actually reached in the testcase in PR8954, but it's safe and good practice. llvm-svn: 123224	2011-01-11 08:19:16 +00:00
Chris Lattner	e2523b287c	when MergeBlockIntoPredecessor merges two blocks, update MemDep if it is floating around in the ether. llvm-svn: 123223	2011-01-11 08:16:49 +00:00
Chris Lattner	f6ae904e34	Fix FoldSingleEntryPHINodes to update memdep and AA when it deletes phi nodes. It is called from MergeBlockIntoPredecessor which is called from GVN, which claims to preserve these. I'm skeptical that this is the actual problem behind PR8954, but this is a stab in the right direction. llvm-svn: 123222	2011-01-11 08:13:40 +00:00
Chris Lattner	dfcfcb49fa	random cleanups llvm-svn: 123221	2011-01-11 08:00:40 +00:00
Chris Lattner	054d2a8525	merge tests into one crash.ll test. llvm-svn: 123220	2011-01-11 07:50:07 +00:00
Chris Lattner	63fe78de68	remove a bogus assertion: the latch block of a loop is not neccesarily an uncond branch to the header. This fixes PR8955 (the assertion tripping). llvm-svn: 123219	2011-01-11 07:47:59 +00:00
Chris Lattner	23109cb319	the GEP faq says that only inbounds geps are guaranteed to not overflow. llvm-svn: 123218	2011-01-11 06:44:41 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Michael J. Spencer	0d771edeee	Support/Path: Deprecate PathV1::isDirectory and replace all uses with PathV2::is_directory. llvm-svn: 123209	2011-01-11 01:21:55 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Chandler Carruth	b1e7f557b7	Teach constant folding to perform conversions from constant floating point values to their integer representation through the SSE intrinsic calls. This is the last part of a README.txt entry for which I have real world examples. llvm-svn: 123206	2011-01-11 01:07:24 +00:00
Chandler Carruth	fdf4969149	FileCheck-ize a test, and move a no-longer calling test case to another file and make it actually test something... llvm-svn: 123205	2011-01-11 01:07:20 +00:00
Owen Anderson	d490c2d2ae	Fix a random missed optimization by making InstCombine more aggressive when determining which bits are demanded by a comparison against a constant. llvm-svn: 123203	2011-01-11 00:36:45 +00:00
Eric Christopher	1bb2c00f65	Move ExpandAtomic into the integer expansion routines - it's only used there. llvm-svn: 123202	2011-01-11 00:36:08 +00:00
Eric Christopher	3904343c68	Even if we don't have 7 bytes of stack space we may need to save and restore the stack pointer from the frame pointer on thumbv6. Fixes rdar://8819685 llvm-svn: 123196	2011-01-11 00:16:04 +00:00
Eric Christopher	d5bbeba8d0	Expand on the safeness of restoring the sp from the fp a bit more. llvm-svn: 123193	2011-01-10 23:10:59 +00:00
Dale Johannesen	d2b48119b0	Fix PR 8916 (qv for analysis), at least the immediate problem. There's an inherent tension in DAGCombine between assuming that things will be put in canonical form, and the Depth mechanism that disables transformations when recursion gets too deep. It would not surprise me if there's a lot of little bugs like this one waiting to be discovered. The mechanism seems fragile and I'd suggest looking at it from a design viewpoint. llvm-svn: 123191	2011-01-10 21:53:07 +00:00
Chris Lattner	78cdd2a6c6	+0.0 vs -0.0 differences can be handled by looking at the user of the operation in some cases. llvm-svn: 123190	2011-01-10 21:01:17 +00:00
Daniel Dunbar	c0e8756ba9	McARM: Flush out hard coded known non-predicated mnemonic list. llvm-svn: 123189	2011-01-10 21:01:03 +00:00
Daniel Dunbar	2d01239fe7	McARM: Mark some T2 ...s instructions as codegen only, they aren't real instructions but are restricted pseudo forms. llvm-svn: 123177	2011-01-10 15:26:39 +00:00
Daniel Dunbar	6e3aedd830	ARM/MC: Mark several '...S' instructions as codegen only, they aren't real instructions but are restricted pseudo forms. llvm-svn: 123176	2011-01-10 15:26:35 +00:00
Daniel Dunbar	2be732ab5f	MC/ARM/AsmParser: Minor nitty fixes. llvm-svn: 123175	2011-01-10 15:26:21 +00:00
Daniel Dunbar	4035383937	MC/AsmMatcher: Fix indirect 80-col viola. llvm-svn: 123174	2011-01-10 15:26:11 +00:00
Anton Korobeynikov	5fb942a307	Fix merge fallout llvm-svn: 123172	2011-01-10 12:56:18 +00:00
Anton Korobeynikov	441ae5b88c	Update CMake stuff llvm-svn: 123171	2011-01-10 12:39:23 +00:00
Anton Korobeynikov	2f93128109	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Daniel Dunbar	876bb0180f	MC/ARM/AsmParser: Split out SplitMnemonicAndCC(). llvm-svn: 123169	2011-01-10 12:24:52 +00:00
Chandler Carruth	352d9b14b3	Cleanup some of the constant folding code to consistently test intrinsic IDs when available rather than using a mixture of IDs and textual name comparisons. llvm-svn: 123165	2011-01-10 09:02:58 +00:00
Chris Lattner	6c8b8dd522	fit in 80 cols and use MBB::isSuccessor instead of a hand rolled std::find. llvm-svn: 123164	2011-01-10 07:51:31 +00:00
Chandler Carruth	cf414cf0a6	Teach instcombine about the rest of the SSE and SSE2 conversion intrinsics element dependencies. Reviewed by Nick. llvm-svn: 123161	2011-01-10 07:19:37 +00:00
Jakob Stoklund Olesen	2fb5b31578	Simplify a bunch of isVirtualRegister() and isPhysicalRegister() logic. These functions not longer assert when passed 0, but simply return false instead. No functional change intended. llvm-svn: 123155	2011-01-10 02:58:51 +00:00
Chandler Carruth	7bb282ebb1	Fold two related tests into the newly FileCheck-ized test, migrating them to FileCheck as well. llvm-svn: 123154	2011-01-10 02:53:58 +00:00
Chandler Carruth	ef7aac5961	Clean up and FileCheck-ize a test. llvm-svn: 123153	2011-01-10 02:53:54 +00:00
Michael J. Spencer	83bd49d4f9	Fix Whitespace. llvm-svn: 123152	2011-01-10 02:34:40 +00:00
Michael J. Spencer	58df2e00b2	Support/Path: Deprecate PathV1::exists and replace all uses with PathV2::fs::exists. llvm-svn: 123151	2011-01-10 02:34:23 +00:00
Chris Lattner	88bc848ab6	another random stab in the dark trying to fix llvm-gcc-i386-linux-selfhost llvm-svn: 123149	2011-01-10 02:34:11 +00:00
Chris Lattner	ec1387cf4b	fix typo llvm-svn: 123148	2011-01-10 02:33:34 +00:00
Chris Lattner	4662bd4b13	another (more) aggressive attempt to bring llvm-gcc-i386-linux-selfhost back to life. llvm-svn: 123146	2011-01-10 00:47:34 +00:00
Chris Lattner	eef1455020	expand on a note llvm-svn: 123145	2011-01-10 00:33:01 +00:00
Chris Lattner	1017fa6746	temporarily disable memset formation from memsets in an effort to restore buildbot stability. llvm-svn: 123144	2011-01-09 23:52:48 +00:00
Chris Lattner	1032965cbe	add a testcase I missed in previous commit. llvm-svn: 123143	2011-01-09 23:52:31 +00:00
Chris Lattner	5b358c6825	typo llvm-svn: 123142	2011-01-09 23:48:41 +00:00
Chris Lattner	320370e3ca	xref a PR # llvm-svn: 123141	2011-01-09 23:42:22 +00:00
Jakob Stoklund Olesen	8786bda222	Remove TargetRegisterInfo::NoRegister. Fix the TargetRegisterInfo::NoRegister places where someone preferred typing 'TargetRegisterInfo::NoRegister' instead of typing '0'. Note that TableGen is already emitting xx::NoRegister in xxGenRegisterNames.inc. llvm-svn: 123140	2011-01-09 23:20:48 +00:00
Chris Lattner	67f82314af	add a fixme: ir isn't expressive enough. llvm-svn: 123139	2011-01-09 23:02:10 +00:00
Chris Lattner	28f140a33e	Step #4 in improving trip count analysis: HowFarToZero can analyze NUW AddRec's much more aggressively. We now get a trip count for @test2 in nsw.ll llvm-svn: 123138	2011-01-09 22:58:47 +00:00
Jakob Stoklund Olesen	8828e482b6	Change virtual register numbering to make more space for physical registers. The numbering plan is now: 0 NoRegister. [1;2^30) Physical registers. [2^30;2^31) Stack slots. [2^31;2^32) Virtual registers. (With -1u and -2u used by DenseMapInfo.) Each segment is filled from the left, so any mistaken interpretation should quickly cause crashes. FirstVirtualRegister has been removed. TargetRegisterInfo provides predicates conversion functions that should be used instead of interpreting register numbers manually. It is now legal to pass NoRegister to isPhysicalRegister() and isVirtualRegister(). The result is false in both cases. It is quite rare to represent stack slots in this way, so isPhysicalRegister() and isVirtualRegister() require that isStackSlot() be checked first if it can possibly return true. This allows a very fast implementation of the common predicates. llvm-svn: 123137	2011-01-09 22:42:48 +00:00
Chris Lattner	dff679f4b6	rearrange some code, no functionality change. llvm-svn: 123136	2011-01-09 22:39:48 +00:00
Chandler Carruth	d011d5317c	Add a note about the inability to model FP -> int conversions which perform rounding other than truncation in the IR. Common C code for this turns into really an LLVM intrinsic call that blocks a lot of further optimizations. llvm-svn: 123135	2011-01-09 22:36:18 +00:00
Chris Lattner	a44274cb4f	Step #3 to improving trip count analysis: If we fold a + {b,+,stride} into {a+b,+,stride} (because a is LIV), then the resultant AddRec is NUW/NSW if the client says it is. llvm-svn: 123133	2011-01-09 22:31:26 +00:00
Chris Lattner	fc87752d55	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, that the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Jakob Stoklund Olesen	d82ac37594	Remove MachineRegisterInfo::getLastVirtReg(), it was giving wrong results when no virtual registers have been allocated. It was only used to resize IndexedMaps, so provide an IndexedMap::resize() method such that Map.grow(MRI.getLastVirtReg()); can be replaced with the simpler Map.resize(MRI.getNumVirtRegs()); This works correctly when no virtuals are allocated, and it bypasses the to/from index conversions. llvm-svn: 123130	2011-01-09 21:58:20 +00:00

... 4 5 6 7 8 ...

69526 Commits