llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	7c9f4c9c2b	tidy up a comment, as suggested by duncan llvm-svn: 123590	2011-01-16 17:46:19 +00:00
Rafael Espindola	cba4c33949	Only put unnamed_addr constants in mergeable sections. Fixes PR8297. llvm-svn: 123585	2011-01-16 17:19:34 +00:00
Rafael Espindola	751677a040	Don't merge two constants if we care about the address of both. This fixes the original testcase in PR8927. It also causes a clang binary built with a patched clang to increase in size by 0.21%. We can probably get some of the size back by writing a pass that detects that a global never has its pointer compared and adds unnamed_addr to it (maybe extend global opt). It is also possible that there are some other cases clang could add unnamed_addr to. I will investigate extending globalopt next. llvm-svn: 123584	2011-01-16 17:05:09 +00:00
Jay Foad	bbb91f2b22	Simplify the construction and destruction of Uses. Simplify User::dropHungOffUses(). llvm-svn: 123580	2011-01-16 15:30:52 +00:00
Owen Anderson	ec3b10fc56	Reduce and merge testcases. llvm-svn: 123579	2011-01-16 09:13:31 +00:00
Chris Lattner	35a2e65bcb	fix PR8514, a bug where the "heroic" transformation of shift/and into and/shift would cause nodes to move around and a dangling pointer to happen. The code tried to avoid this with a HandleSDNode, but got the details wrong. llvm-svn: 123578	2011-01-16 08:48:11 +00:00
Jay Foad	5ded9df82a	Remove unnecessary specialization OperandTraits<User>. llvm-svn: 123577	2011-01-16 08:23:16 +00:00
Jay Foad	59809c7a62	Move the implementation of the User class into a new source file, User.cpp. llvm-svn: 123575	2011-01-16 08:10:57 +00:00
Chris Lattner	e5f8de8639	fix PR8932, a case where arg promotion could infinitely promote. llvm-svn: 123574	2011-01-16 08:09:24 +00:00
Chris Lattner	ed1fb92cfe	simplify a little llvm-svn: 123573	2011-01-16 07:11:21 +00:00
Chris Lattner	c326ebd118	add some commentary llvm-svn: 123572	2011-01-16 06:39:44 +00:00
Chris Lattner	6fab2e9418	if an alloca is only ever accessed as a unit, and is accessed with load/store instructions, then don't try to decimate it into its individual pieces. This will just make a mess of the IR and is pointless if none of the elements are individually accessed. This was generating really terrible code for std::bitset (PR8980) because it happens to be lowered by clang as an {[8 x i8]} structure instead of {i64}. The testcase now is optimized to: define i64 @test2(i64 %X) { br label %L2 L2: ; preds = %0 ret i64 %X } before we generated: define i64 @test2(i64 %X) { %sroa.store.elt = lshr i64 %X, 56 %1 = trunc i64 %sroa.store.elt to i8 %sroa.store.elt8 = lshr i64 %X, 48 %2 = trunc i64 %sroa.store.elt8 to i8 %sroa.store.elt9 = lshr i64 %X, 40 %3 = trunc i64 %sroa.store.elt9 to i8 %sroa.store.elt10 = lshr i64 %X, 32 %4 = trunc i64 %sroa.store.elt10 to i8 %sroa.store.elt11 = lshr i64 %X, 24 %5 = trunc i64 %sroa.store.elt11 to i8 %sroa.store.elt12 = lshr i64 %X, 16 %6 = trunc i64 %sroa.store.elt12 to i8 %sroa.store.elt13 = lshr i64 %X, 8 %7 = trunc i64 %sroa.store.elt13 to i8 %8 = trunc i64 %X to i8 br label %L2 L2: ; preds = %0 %9 = zext i8 %1 to i64 %10 = shl i64 %9, 56 %11 = zext i8 %2 to i64 %12 = shl i64 %11, 48 %13 = or i64 %12, %10 %14 = zext i8 %3 to i64 %15 = shl i64 %14, 40 %16 = or i64 %15, %13 %17 = zext i8 %4 to i64 %18 = shl i64 %17, 32 %19 = or i64 %18, %16 %20 = zext i8 %5 to i64 %21 = shl i64 %20, 24 %22 = or i64 %21, %19 %23 = zext i8 %6 to i64 %24 = shl i64 %23, 16 %25 = or i64 %24, %22 %26 = zext i8 %7 to i64 %27 = shl i64 %26, 8 %28 = or i64 %27, %25 %29 = zext i8 %8 to i64 %30 = or i64 %29, %28 ret i64 %30 } In this case, instcombine was able to eliminate the nonsense, but in PR8980 enough PHIs are in play that instcombine backs off. It's better to not generate this stuff in the first place. llvm-svn: 123571	2011-01-16 06:18:28 +00:00
Chris Lattner	7cd8cf7d24	Use an irbuilder to get some trivial constant folding when doing a store of a constant. llvm-svn: 123570	2011-01-16 05:58:24 +00:00
Chris Lattner	adb1a233b1	remove a dead check, this was needed before we had an explicit veto on uses of phis. llvm-svn: 123569	2011-01-16 05:37:55 +00:00
Chris Lattner	d55581ded8	enhance FoldOpIntoPhi in instcombine to try harder when a phi has multiple uses. In some cases, all the uses are the same operation, so instcombine can go ahead and promote the phi. In the testcase this pushes an add out of the loop. llvm-svn: 123568	2011-01-16 05:28:59 +00:00
Evan Cheng	572756ac11	Spill R4 if it's going to be used to restore SP from FP. llvm-svn: 123567	2011-01-16 05:14:33 +00:00
Chris Lattner	ea7131a062	remove the AllowAggressive argument to FoldOpIntoPhi. It is forced to false in the first line of the function because it isn't a good idea, even for compares. llvm-svn: 123566	2011-01-16 05:14:26 +00:00
Chris Lattner	ff2e737714	more cleanups: use the IR builder. llvm-svn: 123565	2011-01-16 05:08:00 +00:00
Chris Lattner	25ce280511	tidy up code. llvm-svn: 123564	2011-01-16 04:37:29 +00:00
Owen Anderson	4e54efd625	Improve the safety of my globalopt enhancement by ensuring that the bitcast of the stored value to the new store type is always. Also, add a testcase. llvm-svn: 123563	2011-01-16 04:33:33 +00:00
Chris Lattner	08f43456c9	fix PR8983, a broken assertion. llvm-svn: 123562	2011-01-16 03:43:53 +00:00
Venkatraman Govindaraju	1b0e2cbf3f	Implement AnalyzeBranch in Sparc Backend. llvm-svn: 123561	2011-01-16 03:15:11 +00:00
Chris Lattner	218092e68e	fix PR8981, a crash trying to form a conditional inc with a floating point compare. llvm-svn: 123560	2011-01-16 02:56:53 +00:00
Chris Lattner	2d186574a6	reapply my fix for PR8961 with a tweak to properly handle multi-instruction sequences like calls. Many thanks to Jakob for finding a testcase. llvm-svn: 123559	2011-01-16 02:27:38 +00:00
Chris Lattner	8b4952fcf7	simplify this code, it is still broken but will follow up on llvm-commits. llvm-svn: 123558	2011-01-16 02:05:10 +00:00
Michael J. Spencer	2ff30b84f8	Revert "Archive: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1." llvm-svn: 123557	2011-01-16 01:43:22 +00:00
Chandler Carruth	ef28abefd0	Simplify a README.txt entry significantly to expose the core issue. llvm-svn: 123556	2011-01-16 01:40:23 +00:00
Chris Lattner	c703334ff1	one of michael's recent patches broke this, temporarily disable it so the bots go green llvm-svn: 123555	2011-01-16 01:04:49 +00:00
Chris Lattner	1e209b87ad	remove the partial specialization pass. It is unmaintained and has bugs. llvm-svn: 123554	2011-01-16 00:27:10 +00:00
Michael J. Spencer	53dcdc7420	Archive: Fix spelling. llvm-svn: 123552	2011-01-15 21:43:45 +00:00
Michael J. Spencer	a0ce763290	Archive: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1. llvm-svn: 123551	2011-01-15 21:43:37 +00:00
Michael J. Spencer	8685f387eb	Support/GraphWriter: Replace all internal uses of PathV1 with PathV2. The external API still uses PathV1. llvm-svn: 123550	2011-01-15 21:43:25 +00:00
Benjamin Kramer	bec03ea725	Add an assert so we don't silently miscompile ctpop for bit widths > 128. llvm-svn: 123549	2011-01-15 21:19:37 +00:00
Michael J. Spencer	94b2ab3556	Support/PathV2: Add identify_magic. llvm-svn: 123548	2011-01-15 20:39:36 +00:00
Benjamin Kramer	fff2517edc	Reimplement CTPOP legalization with the "best" algorithm from http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel In a silly microbenchmark on a 65 nm core2 this is 1.5x faster than the old code in 32 bit mode and about 2x faster in 64 bit mode. It's also a lot shorter, especially when counting 64 bit population on a 32 bit target. I hope this is fast enough to replace Kernighan-style counting loops even when the input is rather sparse. llvm-svn: 123547	2011-01-15 20:30:30 +00:00
Michael J. Spencer	b587180fa7	Unittests/Support/Path: Tweak test. llvm-svn: 123546	2011-01-15 18:52:49 +00:00
Michael J. Spencer	7887466adc	Support/PathV2: Implement has_magic in terms of get_magic. llvm-svn: 123545	2011-01-15 18:52:41 +00:00
Michael J. Spencer	ee1699c362	Support/PathV2: Implement get_magic. llvm-svn: 123544	2011-01-15 18:52:33 +00:00
Nick Lewycky	4a1ff16b29	Add missing whitespace. llvm-svn: 123543	2011-01-15 18:42:52 +00:00
Nick Lewycky	0296a481f9	Make constmerge a two-pass algorithm so that it won't miss merging opporuntities. Fixes PR8978. llvm-svn: 123541	2011-01-15 18:14:21 +00:00
Oscar Fuentes	25ac830e72	Make config.h.cmake similar to config.h.in Patch by arrowdodger! llvm-svn: 123539	2011-01-15 13:35:37 +00:00
Benjamin Kramer	ed5f2e504e	Try to unbreak selfhost. llvm-svn: 123537	2011-01-15 11:25:34 +00:00
Nick Lewycky	540f9536c8	Add a cache that protects mergefunc's internals from more surprises in DenseSet. Also, replace tabs with spaces. Yes, it's 2011. llvm-svn: 123535	2011-01-15 10:16:23 +00:00
Nick Lewycky	367f98f000	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block. Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test. llvm-svn: 123533	2011-01-15 09:16:12 +00:00
Rafael Espindola	f1ed781aea	Add a clarification about merging constants with and without unnamed_addr. llvm-svn: 123530	2011-01-15 08:20:57 +00:00
Rafael Espindola	489e505adf	Allow unnamed_addr on declarations. llvm-svn: 123529	2011-01-15 08:15:00 +00:00
Chris Lattner	af26390790	temporarily revert r123526. While working on a follow-on patch I realize that ConstantFoldTerminator doesn't preserve dominfo. llvm-svn: 123527	2011-01-15 07:51:19 +00:00
Chris Lattner	8df83c4a24	fix rdar://8785296 - -fcatch-undefined-behavior generates inefficient code The basic issue is that isel (very reasonably!) expects conditional branches to be folded, so CGP leaving around a bunch dead computation feeding conditional branches isn't such a good idea. Just fold branches on constants into unconditional branches. llvm-svn: 123526	2011-01-15 07:36:13 +00:00
Chris Lattner	ee588defc6	simplify code, no functionality change. llvm-svn: 123525	2011-01-15 07:29:01 +00:00
Chris Lattner	1b93be501d	Now that instruction optzns can update the iterator as they go, we can have objectsize folding recursively simplify away their result when it folds. It is important to catch this here, because otherwise we won't eliminate the cross-block values at isel and other times. llvm-svn: 123524	2011-01-15 07:25:29 +00:00
Chris Lattner	7a2771440f	make the current instruction iterator an ivar, allowing xforms that potentially invalidate it (like inline asm lowering) to be sunk into their proper place, cleaning up a ton of code. llvm-svn: 123523	2011-01-15 07:14:54 +00:00
Chris Lattner	9c10d587f6	implement an instcombine xform that canonicalizes casts outside of and-with-constant operations. This fixes rdar://8808586 which observed that we used to compile: union xy { struct x { _Bool b[15]; } x; __attribute__((packed)) struct y { __attribute__((packed)) unsigned long b0to7; __attribute__((packed)) unsigned int b8to11; __attribute__((packed)) unsigned short b12to13; __attribute__((packed)) unsigned char b14; } y; }; struct x foo(union xy *xy) { return xy->x; } into: _foo: ## @foo movq (%rdi), %rax movabsq $1095216660480, %rcx ## imm = 0xFF00000000 andq %rax, %rcx movabsq $-72057594037927936, %rdx ## imm = 0xFF00000000000000 andq %rax, %rdx movzbl %al, %esi orq %rdx, %rsi movq %rax, %rdx andq $65280, %rdx ## imm = 0xFF00 orq %rsi, %rdx movq %rax, %rsi andq $16711680, %rsi ## imm = 0xFF0000 orq %rdx, %rsi movl %eax, %edx andl $-16777216, %edx ## imm = 0xFFFFFFFFFF000000 orq %rsi, %rdx orq %rcx, %rdx movabsq $280375465082880, %rcx ## imm = 0xFF0000000000 movq %rax, %rsi andq %rcx, %rsi orq %rdx, %rsi movabsq $71776119061217280, %r8 ## imm = 0xFF000000000000 andq %r8, %rax orq %rsi, %rax movzwl 12(%rdi), %edx movzbl 14(%rdi), %esi shlq $16, %rsi orl %edx, %esi movq %rsi, %r9 shlq $32, %r9 movl 8(%rdi), %edx orq %r9, %rdx andq %rdx, %rcx movzbl %sil, %esi shlq $32, %rsi orq %rcx, %rsi movl %edx, %ecx andl $-16777216, %ecx ## imm = 0xFFFFFFFFFF000000 orq %rsi, %rcx movq %rdx, %rsi andq $16711680, %rsi ## imm = 0xFF0000 orq %rcx, %rsi movq %rdx, %rcx andq $65280, %rcx ## imm = 0xFF00 orq %rsi, %rcx movzbl %dl, %esi orq %rcx, %rsi andq %r8, %rdx orq %rsi, %rdx ret We now compile this into: _foo: ## @foo ## BB#0: ## %entry movzwl 12(%rdi), %eax movzbl 14(%rdi), %ecx shlq $16, %rcx orl %eax, %ecx shlq $32, %rcx movl 8(%rdi), %edx orq %rcx, %rdx movq (%rdi), %rax ret A small improvement :-) llvm-svn: 123520	2011-01-15 06:32:33 +00:00
Chris Lattner	c23ca1f217	fix typo llvm-svn: 123519	2011-01-15 06:27:35 +00:00
Chris Lattner	76580f0ec3	Fix m_Not and m_Neg to not match random ConstantInt's. Before these would try hard to match constants by inverting the bits and recursively matching. There are two problems with this: 1) some patterns would match when we didn't want them to (theoretical) 2) this is insanely expensive to do, and most often pointless. This was apparently useful in just 2 instcombine cases, which I added code to handle explicitly. This change speeds up 'opt' time on 176.gcc by 1% and produces bitwise identical code. llvm-svn: 123518	2011-01-15 05:52:27 +00:00
Chris Lattner	e20dd530d0	one more instcombine variant that is needed to work with future changes, no functionality change currently. llvm-svn: 123517	2011-01-15 05:50:18 +00:00
Chris Lattner	497459d5fd	fix typo llvm-svn: 123516	2011-01-15 05:42:47 +00:00
Chris Lattner	f3c4eefff8	Catch ~x < cst just like ~x < ~y, we currently handle this through means that are about to disappear. llvm-svn: 123515	2011-01-15 05:41:33 +00:00
Chris Lattner	311aa63c87	reduce indentation llvm-svn: 123514	2011-01-15 05:40:29 +00:00
Eric Christopher	cc385c0c97	80-col. llvm-svn: 123505	2011-01-15 00:25:09 +00:00
Chris Lattner	b68ec5c339	Generalize LoadAndStorePromoter a bit and switch LICM to use it. llvm-svn: 123501	2011-01-15 00:12:35 +00:00
Bob Wilson	b7a3c42eae	Fix a comment. llvm-svn: 123497	2011-01-15 00:09:18 +00:00
Eric Christopher	2af9551ebf	Fix 80-cols. llvm-svn: 123494	2011-01-14 23:50:53 +00:00
Ted Kremenek	e92b6e436d	Update CMake build. llvm-svn: 123491	2011-01-14 22:58:11 +00:00
Bob Wilson	03912aba9a	Fix some tablegen issues to allow using zero_reg for InstAlias definitions. This is needed to allow an InstAlias for an instruction with an "OptionalDef" result register (like ARM's cc_out) where you want to set the optional register to reg0. llvm-svn: 123490	2011-01-14 22:58:09 +00:00
Ted Kremenek	6677f65de1	Fix memory leak found by clang static analyzer. llvm-svn: 123487	2011-01-14 22:34:17 +00:00
Ted Kremenek	b5241b2b59	'HiReg' is written but never read. Nuke its declaration and its assignments. Found by clang static analyzer. llvm-svn: 123486	2011-01-14 22:34:13 +00:00
Owen Anderson	3e2f6cf7ae	Fix a false-positive warning. llvm-svn: 123480	2011-01-14 22:31:13 +00:00
Dan Gohman	abac063b7a	Delete an assignment to ThisBB which isn't needed, and tidy up some comments. llvm-svn: 123479	2011-01-14 22:26:16 +00:00
Owen Anderson	9eb7cb48e4	Enhance GlobalOpt to be able evaluate initializers that involve stores through bitcasts, at least in simple cases. This fixes clang's CodeGenCXX/virtual-base-dtor.cpp llvm-svn: 123477	2011-01-14 22:19:20 +00:00
Anton Korobeynikov	9be547cfd3	Add a possibility to switch between CFI directives- and table-based frame description emission. Currently all the backends use table-based stuff. llvm-svn: 123476	2011-01-14 21:58:08 +00:00
Anton Korobeynikov	4d9de6be4b	Cleanup llvm-svn: 123475	2011-01-14 21:57:58 +00:00
Anton Korobeynikov	b46ef57de5	Add CFI directives-based frame information emission. Not hooked yet. llvm-svn: 123474	2011-01-14 21:57:53 +00:00
Anton Korobeynikov	61d167e92b	Split stuff as a preparation for CFI directives-based frame information emission llvm-svn: 123473	2011-01-14 21:57:45 +00:00
Anton Korobeynikov	e2bea1c82e	Use common style for .cfi directives llvm-svn: 123472	2011-01-14 21:57:39 +00:00
Andrew Trick	9ccce77893	Support for precise scheduling of the instruction selection DAG, disabled in this checkin. Sorry for the large diffs due to refactoring. New functionality is all guarded by EnableSchedCycles. Scheduling the isel DAG is inherently imprecise, but we give it a best effort: - Added MayReduceRegPressure to allow stalled nodes in the queue only if there is a regpressure need. - Added BUHasStall to allow checking for either dependence stalls due to latency or resource stalls due to pipeline hazards. - Added BUCompareLatency to encapsulate and standardize the heuristics for minimizing stall cycles (vs. reducing register pressure). - Modified the bottom-up heuristic (now in BUCompareLatency) to prioritize nodes by their depth rather than height. As long as it doesn't stall, height is irrelevant. Depth represents the critical path to the DAG root. - Added hybrid_ls_rr_sort::isReady to filter stalled nodes before adding them to the available queue. Related Cleanup: most of the register reduction routines do not need to be templates. llvm-svn: 123468	2011-01-14 21:11:41 +00:00
Chris Lattner	b498f9aff3	switch SRoA to use LoadAndStorePromoter instead of its own copy of the code. llvm-svn: 123457	2011-01-14 19:50:47 +00:00
Chris Lattner	95294b8796	Add a new LoadAndStorePromoter class, which implements the general "promote a bunch of load and stores" logic, allowing the code to be shared and reused. llvm-svn: 123456	2011-01-14 19:36:13 +00:00
Jay Foad	cbe1505617	OperandTraits<>::Layout isn't used for anything. Remove it. llvm-svn: 123452	2011-01-14 18:41:56 +00:00
Rafael Espindola	b1ebba9ec3	Update llvm-gcc's tests. llvm-svn: 123447	2011-01-14 17:01:20 +00:00
Oscar Fuentes	959d253476	Reorder macros on config.h.cmake to easily compare it against config.h.in. Patch by arrowdodger! llvm-svn: 123445	2011-01-14 16:41:03 +00:00
Devang Patel	610c41e7b0	Disable debug mode. llvm-svn: 123443	2011-01-14 15:55:50 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	c3eb0f4b2e	Rename this test. llvm-svn: 123440	2011-01-14 14:16:33 +00:00
Chris Lattner	8d7716a220	switch the second scalarrepl pass to use SSAUpdater. We run two scalarrepl passes: one early in the cleanup code and one late interlaced with the inliner. The second one is important because inlining and other scalar optzns can unpin allocas, allowing them to be split up and promoted. While important for performance, this is also relatively rare, and we would previously force a (non-lazy) computation of DomFrontiers, which happened even if nothing became unpinned. With this patch, the first pass of scalarrepl still promotes the vast bulk of allocas in programs, but hte second pass has changed to use SSAUpdater, which is more "sparse" and lazy. This speeds up opt -O3 time on kimwitu++ (a c++ app) by about 1%. The numbers are interesting: the first pass promotes ~17500 allocas. The second pass promotes about 1600. For non-C++ codes, the compile time win should be greater, because the second pass of scalarrepl does less. llvm-svn: 123437	2011-01-14 08:21:08 +00:00
Chris Lattner	9987a6f49b	split SROA into two passes: one that uses DomFrontiers (-scalarrepl) and one that uses SSAUpdater (-scalarrepl-ssa) llvm-svn: 123436	2011-01-14 08:13:00 +00:00
Jay Foad	1d4a8fe156	Remove casts between Value and Constant, which won't work if a static_cast from Constant* to Value* has to adjust the "this" pointer. This is groundwork for PR889. llvm-svn: 123435	2011-01-14 08:07:43 +00:00
Chris Lattner	543384efb4	Implement full support for promoting allocas to registers using SSAUpdater instead of DomTree/DomFrontier. This may be interesting for reducing compile time. This is currently disabled, but seems to work just fine. When this is enabled, we eliminate two runs of dominator frontier, one in the "early per-function" optimizations and one in the "interlaced with inliner" function passes. llvm-svn: 123434	2011-01-14 07:50:47 +00:00
Chris Lattner	5e0fef8531	relax testcase a bit. llvm-svn: 123433	2011-01-14 07:46:33 +00:00
Jakob Stoklund Olesen	ab3d6ecbd2	Try for the third time to teach getFirstTerminator() about debug values. This time let's rephrase to trick gcc-4.3 into not miscompiling. llvm-svn: 123432	2011-01-14 06:33:45 +00:00
Chris Lattner	e93e4f118c	revert my fastisel patch again which apparently still gives the llvm-gcc-i386-linux-selfhost buildbot heartburn... llvm-svn: 123431	2011-01-14 06:14:33 +00:00
Chris Lattner	5ca1391003	reapply r123414 now that the botz are calmed down and the fix is already in. llvm-svn: 123427	2011-01-14 04:24:28 +00:00
Chris Lattner	90f3a9a1c7	indentation llvm-svn: 123426	2011-01-14 04:23:53 +00:00
Evan Cheng	d4a5c05c97	Completed :lower16: / :upper16: support for movw / movt pairs on Darwin. - Fixed :upper16: fix up routine. It should be shifting down the top 16 bits first. - Added support for Thumb2 :lower16: and :upper16: fix up. - Added :upper16: and :lower16: relocation support to mach-o object writer. llvm-svn: 123424	2011-01-14 02:38:49 +00:00
Jakob Stoklund Olesen	c38102889f	Revert r123419. It still breaks llvm-gcc-i386-linux-selfhost. llvm-svn: 123423	2011-01-14 02:12:54 +00:00
Chris Lattner	21a64979f1	r123414 broke llvm-gcc bootstrap apparently, revert llvm-svn: 123422	2011-01-14 02:07:32 +00:00
Chris Lattner	3be81e9bd7	Set the insertion point correctly for instructions generated by load folding: they should go before the new instruction not after it. llvm-svn: 123420	2011-01-14 01:33:40 +00:00
Jakob Stoklund Olesen	c0767e029d	Try again to teach getFirstTerminator() about debug values. Fix some callers to better deal with debug values. llvm-svn: 123419	2011-01-14 01:17:53 +00:00
Owen Anderson	e3ed20ce9c	Rather than doing early instcombine, try doing early CSE instead. This should still handle most important simplifications, as well as resolving phase ordering issues where instcombine would inhibit important CSE'ing opportunities, for instance on BitBench/drop3. llvm-svn: 123418	2011-01-14 00:41:11 +00:00
Duncan Sands	7f60dc1eb0	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00

1 2 3 4 5 ...

69271 Commits