llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	7b7caf51e9	Support for .ifdef / .ifndef in the assembler parser. Patch by Joerg Sonnenberger. llvm-svn: 125120	2011-02-08 22:29:56 +00:00
Jakob Stoklund Olesen	93dda45ada	Also handle the situation where an indirect branch is the first (and last) instruction in a basic block. llvm-svn: 125116	2011-02-08 21:46:11 +00:00
Jakob Stoklund Olesen	f2b16dc847	Add LiveIntervals::addKillFlags() to recompute kill flags after register allocation. This is a lot easier than trying to get kill flags right during live range splitting and rematerialization. llvm-svn: 125113	2011-02-08 21:13:03 +00:00
Jakob Stoklund Olesen	4d83c691f6	Trim debug spew llvm-svn: 125109	2011-02-08 19:33:58 +00:00
Jakob Stoklund Olesen	c6a2041d99	Avoid folding a load instruction into an instruction that redefines the register. The target hook doesn't know how to do that. (Neither do I). llvm-svn: 125108	2011-02-08 19:33:55 +00:00
David Greene	10b0db1d5f	[AVX] Implement BUILD_VECTOR lowering for 256-bit vectors. For anything but the simplest of cases, lower a 256-bit BUILD_VECTOR by splitting it into 128-bit parts and recombining. llvm-svn: 125105	2011-02-08 19:04:41 +00:00
Jakob Stoklund Olesen	1749935173	Add SplitEditor::overlapIntv() to create small ranges where both registers are live. If a live range is used by a terminator instruction, and that live range needs to leave the block on the stack or in a different register, it can be necessary to have both sides of the split live at the terminator instruction. Example: %vreg2 = COPY %vreg1 JMP %vreg1 Becomes after spilling %vreg2: SPILL %vreg1 JMP %vreg1 The spill doesn't kill the register as is normally the case. llvm-svn: 125102	2011-02-08 18:50:21 +00:00
Jakob Stoklund Olesen	3d11c8eaf2	Add assertion. llvm-svn: 125101	2011-02-08 18:50:18 +00:00
Andrew Trick	4b4918788b	Fix PostRA antidependence breaker. Avoid using the same register for two def operands or and earlyclobber def and use operand. This fixes PR8986 and improves on the prior fix for rdar://problem/8959122. llvm-svn: 125089	2011-02-08 17:39:46 +00:00
Evan Cheng	558ccef74f	Temporary workaround for a bad bug introduced by r121082 which replaced t2LDRpci with t2LDRi12. There are a couple of problems with this. 1. The encoding for the literal and immediate constant are different. Note bit 7 of the literal case is 'U' so it can be negative. 2. t2LDRi12 is now narrowed to tLDRpci before constant island pass is run. So we end up never using the Thumb2 instruction, which ends up creating a lot more constant islands. llvm-svn: 125074	2011-02-08 03:07:03 +00:00
Dan Gohman	de7f699754	Don't split any loop backedges, including backedges of loops other than the active loop. This is generally desirable, and it avoids trouble in situations such as the testcase in PR9123, though the failure mode depends on use-list order, so it is infeasible to test. llvm-svn: 125065	2011-02-08 00:55:13 +00:00
Jakob Stoklund Olesen	55fc1d0b3e	Add LiveIntervals::shrinkToUses(). After uses of a live range are removed, recompute the live range to only cover the remaining uses. This is necessary after rematerializing the value before some (but not all) uses. llvm-svn: 125058	2011-02-08 00:03:05 +00:00
Benjamin Kramer	8d6a8c130b	SimplifyCFG: Track the number of used icmps when turning a icmp chain into a switch. If we used only one icmp, don't turn it into a switch. Also prevent the switch-to-icmp transform from creating identity adds, noticed by Marius Wachtler. llvm-svn: 125056	2011-02-07 22:37:28 +00:00
Bruno Cardoso Lopes	36dd43fda6	Add support for parsing dmb/dsb instructions llvm-svn: 125055	2011-02-07 22:09:15 +00:00
Devang Patel	639dd997eb	Remove comment about an argument that was removed couple of years ago. llvm-svn: 125054	2011-02-07 21:58:52 +00:00
Bruno Cardoso Lopes	c9253b4deb	Remove the MCR asm parser hack and start using the custom target specific asm parsing of operands introduced in r125030. As a small note, besides using a more generic approach we can also have more descriptive output when debugging llvm-mc, example: mcr p7, #1, r5, c1, c1, #4 note: parsed instruction: ['mcr', <ARMCC::al>, <coprocessor number: 7>, 1, <register 73>, <coprocessor register: 1>, <coprocessor register: 1>, 4] llvm-svn: 125052	2011-02-07 21:41:25 +00:00
Chris Lattner	e17322b3b7	fix comment change. llvm-svn: 125047	2011-02-07 20:03:14 +00:00
David Greene	79651c527b	[AVX] Insert/extract subvector lowering support. This includes a couple of utility functions that will be used in other places for more AVX lowering. llvm-svn: 125029	2011-02-07 19:36:54 +00:00
Jason W Kim	e5ce4c9bcd	ARM/MC/ELF Lowercase .cpu attributes in .s, but make them uppercase in .o llvm-svn: 125025	2011-02-07 19:07:11 +00:00
Evan Cheng	e1a4ac9b5b	Fix an obvious typo which caused an isel assertion. rdar://8964854. llvm-svn: 125023	2011-02-07 18:50:47 +00:00
Bob Wilson	06fce87c4a	Add codegen support for using post-increment NEON load/store instructions. The vld1-lane, vld1-dup and vst1-lane instructions do not yet support using post-increment versions, but all the rest of the NEON load/store instructions should be handled now. llvm-svn: 125014	2011-02-07 17:43:21 +00:00
Bob Wilson	a609b8954e	Change VLD3/4 and VST3/4 for quad registers to not update the address register. These operations are expanded to pairs of loads or stores, and the first one uses the address register update to produce the address for the second one. So far, the second load/store has also updated the address register, just for convenience, since that output has never been used. In anticipation of actually supporting post-increment updates for these operations, this changes the non-updating operations to use a non-updating load/store for the second instruction. llvm-svn: 125013	2011-02-07 17:43:15 +00:00
Bob Wilson	42e67b5f73	Fix some NEON instruction itineraries. llvm-svn: 125012	2011-02-07 17:43:12 +00:00
Bob Wilson	f3c8df3202	Fix a comment: addrmode6 no longer includes the optional writeback flag. llvm-svn: 125011	2011-02-07 17:43:09 +00:00
Bob Wilson	3dfe815358	Remove inaccurate comments: so_imm and t2_so_imm operands are not encoded until the instructions are emitted or printed. llvm-svn: 125010	2011-02-07 17:43:06 +00:00
Bob Wilson	0d95ed90cc	Move code for OffsetCompare struct closer to where it is used. llvm-svn: 125009	2011-02-07 17:43:03 +00:00
Chris Lattner	a676c0fc77	implement .ll and .bc support for nsw/nuw on shl and exact on lshr/ashr. Factor some code better. llvm-svn: 125006	2011-02-07 16:40:21 +00:00
Duncan Sands	867cb633b4	Add an m_Div pattern for matching either a udiv or an sdiv and use it to simplify the "(X/Y)*Y->X when the division is exact" transform. llvm-svn: 125004	2011-02-07 09:36:32 +00:00
Jason W Kim	202630c6ee	Teach ARM/MC/ELF about gcc compatible reloc output to get past odd linkage failures with relocations. The code committed is a first cut at compatibility for emitted relocations in ELF .o. Why do this? because existing ARM tools like emitting relocs symbols as explicit relocations, not as section-offset relocs. Result is that with these changes, 1) relocs are now substantially identical what to gcc outputs. 2) larger apps (including many spec2k tests) compile, cross-link, and pass Added reminder fixme to tests for future conversion to .s form. llvm-svn: 124996	2011-02-07 01:11:15 +00:00
Jason W Kim	85b0af177f	Rework some .ARM.attribute work for improved gcc compatibility. Unified EmitTextAttribute for both Asm and Obj emission (.cpu only) Added necessary cortex-A8 related attrs for codegen compat tests. llvm-svn: 124995	2011-02-07 00:49:53 +00:00
Chris Lattner	6e57b15228	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Chris Lattner	35315d065b	enhance vmcore to know that udiv's can be exact, and add a trivial instcombine xform to exercise this. Nothing forms exact udivs yet though. This is progress on PR8862 llvm-svn: 124992	2011-02-06 21:44:57 +00:00
Eric Christopher	b54605b8e2	Remove premature optimization that avoided calculating argument weights if we weren't going to inline the function. The rest of the code using this was removed. Fixes PR9154. llvm-svn: 124991	2011-02-06 21:27:46 +00:00
Anders Carlsson	ecf8e159e3	Simplify test, as suggested by Chris. llvm-svn: 124990	2011-02-06 20:22:49 +00:00
Anders Carlsson	49d81a3d7e	Remove a virtual inheritance case that clang can devirtualize fully now. llvm-svn: 124989	2011-02-06 20:16:49 +00:00
Anders Carlsson	d21b06a0db	When loading from a constant, fold inttoptr if the integer type and the resulting pointer type both have the same size. llvm-svn: 124987	2011-02-06 20:11:56 +00:00
Nick Lewycky	cb1a4c26ee	Simplify away redundant test, and document what's going on. llvm-svn: 124977	2011-02-06 05:04:00 +00:00
Nick Lewycky	f8797fda44	Remove specialized comparison of InlineAsm objects. They're uniqued on creation now, and this wasn't comparing some of their relevant bits anyhow. llvm-svn: 124976	2011-02-06 04:33:50 +00:00
Anders Carlsson	36c6d23074	Fix another warning. llvm-svn: 124961	2011-02-05 18:33:43 +00:00
Anders Carlsson	630125ab27	Fix a clang warning. llvm-svn: 124960	2011-02-05 18:19:35 +00:00
NAKAMURA Takumi	03a541f5c4	Windows/DynamicLibrary.inc: Split explicit symbols into explicit_symbols.inc. config.h.* have conditions whether each symbol is defined or not. Autoconf and CMake may check symbols in libgcc.a for JIT on Mingw. llvm-svn: 124950	2011-02-05 15:11:53 +00:00
NAKAMURA Takumi	1850c80afb	Target/X86: Tweak allocating shadow area (aka home) on Win64. It must be enough for caller to allocate one. llvm-svn: 124949	2011-02-05 15:11:32 +00:00
NAKAMURA Takumi	b21c3db920	lib/Target/X86/X86ISelLowering.cpp: Introduce a new variable "IsWin64". No functional changes. llvm-svn: 124948	2011-02-05 15:11:13 +00:00
NAKAMURA Takumi	b1bbdd8d44	lib/Target/X86/X86JITInfo.cpp: Add Win64 stuff. llvm-svn: 124947	2011-02-05 15:11:03 +00:00
NAKAMURA Takumi	f7f319d4d3	Target/X86: Fix whitespace. llvm-svn: 124946	2011-02-05 15:10:54 +00:00
NAKAMURA Takumi	3e600a29d3	Windows/Program.inc: Quote arguments when dubious characters (used by cmd.exe or MSYS shell) are included to invoke CreateProcess(). Thanks to Danil Malyshev. llvm-svn: 124945	2011-02-05 08:53:12 +00:00
Andrew Trick	f841571404	Fix an anti-dep breaker corner case. <rdar://problem/8959122> illegal register operands for UMULL instruction in cfrac nightly test I'm stil working on a unit test, but the case is: rx = movcc rx, r3 r2 = ldr r2, r3 = umull r2, r2 The anti-dep breaker should not convert this into an illegal instruction: r2, r2 = umull llvm-svn: 124932	2011-02-05 02:58:46 +00:00
Eric Christopher	ceb4671ddd	Fix cut and paste error spotted by Jakob. llvm-svn: 124930	2011-02-05 02:48:47 +00:00
Jakob Stoklund Olesen	4ee8990278	Be more strict about the first/last interference-free use. If the interference overlaps the instruction, we cannot separate it. llvm-svn: 124918	2011-02-05 01:06:39 +00:00
Jakob Stoklund Olesen	7b73528064	Add assertions to verify that the new interval is clear of the interference. If these inequalities don't hold, we are creating a live range split that won't allocate. llvm-svn: 124917	2011-02-05 01:06:36 +00:00
Eric Christopher	2dfbd7e0c1	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
David Greene	96d07a82b2	[AVX] Revert 124910 until clients are ready. llvm-svn: 124912	2011-02-05 00:24:41 +00:00
David Greene	bdd481507a	[AVX] Add some utilities to insert and extract 128-bit subvectors. This allows us to easily support 256-bit operations that don't have native 256-bit support. This applies to integer operations, certain types of shuffles and various othher things. llvm-svn: 124910	2011-02-04 23:29:33 +00:00
Jakob Stoklund Olesen	e8ac8e93a1	Apparently, it is possible for a block with a landing pad successor to have no calls. In that case we simply ignore the landing pad and split live ranges before the first terminator. llvm-svn: 124907	2011-02-04 23:11:13 +00:00
Devang Patel	116a9d7c38	Merge .debug_loc entries whenever possible to reduce debug_loc size. llvm-svn: 124904	2011-02-04 22:57:18 +00:00
Nick Lewycky	d650b30488	Mark that the return is using EAX so that we don't use it for some other purpose. Fixes PR9080! llvm-svn: 124903	2011-02-04 22:44:08 +00:00
Jakob Stoklund Olesen	80a2878b5d	Be more accurate about live range splitting at the end of blocks. If interference reaches the last split point, it is effectively live out and should be marked as 'MustSpill'. This can make a difference when the terminator uses a register. There is no way that register can be reused in the outgoing CFG bundle, even if it isn't live out. llvm-svn: 124900	2011-02-04 21:42:06 +00:00
Jason W Kim	4761fba833	Teach ARM/MC/ELF about EF_ARM_EABI_VERSION. The magic number is set to 5 to match the current doc. Added FIXME reminder Make it really configurable later. llvm-svn: 124899	2011-02-04 21:41:11 +00:00
Jason W Kim	d2e2f56c36	Teach ARM/MC/ELF to handle R_ARM_JUMP24 relocation type for conditional jumps. (yes, this is different from R_ARM_CALL) - Adds a new method getARMBranchTargetOpValue() which handles the necessary distinction between the conditional and unconditional br/bl needed for ARM/ELF At least for ARM mode, the needed fixup for conditional versus unconditional br/bl is identical, but the ARM docs and existing ARM tools expect this reloc type... Added a few FIXME's for future naming fixups in ARMInstrInfo.td llvm-svn: 124895	2011-02-04 19:47:15 +00:00
Jakob Stoklund Olesen	096bd8837f	Add LiveIntervals::getLastSplitPoint(). A live range cannot be split everywhere in a basic block. A split must go before the first terminator, and if the variable is live into a landing pad, the split must happen before the call that can throw. llvm-svn: 124894	2011-02-04 19:33:11 +00:00
Jakob Stoklund Olesen	fefe6ebc73	Verify that one of the ranges produced by region splitting is allocatable. We should not be attempting a region split if it won't lead to at least one directly allocatable interval. That could cause infinite splitting loops. llvm-svn: 124893	2011-02-04 19:33:07 +00:00
Daniel Dunbar	6619340462	MC/AsmParser: Add support for allowing the conversion process to fail (via custom conversion functions). llvm-svn: 124872	2011-02-04 17:12:23 +00:00
David Greene	653f1eed2d	[AVX] Support VSINSERTF128 with more patterns and appropriate infrastructure. This makes lowering 256-bit vectors to 128-bit vectors simple when 256-bit vector support is not available. llvm-svn: 124868	2011-02-04 16:08:29 +00:00
NAKAMURA Takumi	5a3ff5b5a0	Make Win32's header file name lower for cross build on case-sensitive filesystem. llvm-svn: 124864	2011-02-04 12:53:04 +00:00
Andrew Trick	d0548ae750	Introducing a new method of tracking register pressure. We can't precisely track pressure on a selection DAG, but we can at least keep it balanced. This design accounts for various interesting aspects of selection DAGS: register and subregister copies, glued nodes, dead nodes, unused registers, etc. Added SUnit::NumRegDefsLeft and ScheduleDAGSDNodes::RegDefIter. Note: I disabled PrescheduleNodesWithMultipleUses when register pressure is enabled, based on no evidence other than I don't think it makes sense to have both enabled. llvm-svn: 124853	2011-02-04 03:18:17 +00:00
Devang Patel	26ffa01889	DebugLoc associated with a machine instruction is used to emit location entries. DebugLoc associated with a DBG_VALUE is used to identify lexical scope of the variable. After register allocation, while inserting DBG_VALUE remember original debug location for the first instruction and reuse it, otherwise dwarf writer may be mislead in identifying the variable's scope. llvm-svn: 124845	2011-02-04 01:43:25 +00:00
Evan Cheng	f7073d1445	Update comments. llvm-svn: 124843	2011-02-04 01:10:12 +00:00
Jakob Stoklund Olesen	3295a99fe9	Skip unused values. llvm-svn: 124842	2011-02-04 00:59:23 +00:00
Jakob Stoklund Olesen	b336c50c81	Also compute interference intervals for blocks with no uses. When the live range is live through a block that doesn't use the register, but that has interference, region splitting wants to split at the top and bottom of the basic block. llvm-svn: 124839	2011-02-04 00:39:20 +00:00
Jakob Stoklund Olesen	66d0f39904	Verify kill flags conservatively. Allow a live range to end with a kill flag, but don't allow a kill flag that doesn't end the live range. This makes the machine code verifier more useful during register allocation when kill flag computation is deferred. llvm-svn: 124838	2011-02-04 00:39:18 +00:00
Bob Wilson	813bdf6e58	Do not sign extend floating-point values in the asm parser. llvm-svn: 124831	2011-02-03 23:17:47 +00:00
Andrew Trick	3f924e4e87	whitespace llvm-svn: 124827	2011-02-03 23:00:17 +00:00
Benjamin Kramer	62aa46b852	SimplifyCFG: Also transform switches that represent a range comparison but are not sorted into sub+icmp. This transforms another 1000 switches in gcc.c. llvm-svn: 124826	2011-02-03 22:51:41 +00:00
Bob Wilson	fb0bd049da	Fix 80-column violations and whitespace. llvm-svn: 124819	2011-02-03 21:46:10 +00:00
Jakob Stoklund Olesen	4a6518e6a8	Ensure that the computed interference intervals actually overlap their basic blocks. llvm-svn: 124815	2011-02-03 20:29:43 +00:00
Jakob Stoklund Olesen	db4cf7e4a4	Tweak debug output from SlotIndexes. llvm-svn: 124814	2011-02-03 20:29:41 +00:00
Jakob Stoklund Olesen	d8f62e2a62	Add debug output and asserts to the phi-connecting code. llvm-svn: 124813	2011-02-03 20:29:39 +00:00
Jakob Stoklund Olesen	8c0254870b	Fix coloring bug when mapping values in the middle of a live-through block. If the found value is not live-through the block, we should only add liveness up to the requested slot index. When the value is live-through, the whole block should be colored. Bug found by SSA verification in the machine code verifier. llvm-svn: 124812	2011-02-03 20:29:36 +00:00
Jakob Stoklund Olesen	f12e120743	Return live range end points from SplitEditor::enter/leave. These end points come from the inserted copies, and can be passed directly to useIntv. This simplifies the coloring code. llvm-svn: 124799	2011-02-03 17:04:16 +00:00
Jakob Stoklund Olesen	2b855eb69c	Silence an MSVC warning llvm-svn: 124798	2011-02-03 17:04:12 +00:00
David Greene	c4da110fd2	[AVX] VEXTRACTF128 support. This commit includes patterns for matching EXTRACT_SUBVECTOR to VEXTRACTF128 along with support routines to examine and translate index values. VINSERTF128 comes next. With these two in place we can begin supporting more AVX operations as INSERT/EXTRACT can be used as a fallback when 256-bit support is not available. llvm-svn: 124797	2011-02-03 15:50:00 +00:00
Richard Osborne	a31b9c2f7c	Add XCore intrinsics for resource instructions. llvm-svn: 124794	2011-02-03 13:14:25 +00:00
Duncan Sands	06504025d2	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Eric Christopher	ede6267993	Reapply this. llvm-svn: 124779	2011-02-03 06:18:29 +00:00
Eric Christopher	21933539f2	Temporarily revert 124765 in an attempt to find the cycle breaking bootstrap. llvm-svn: 124778	2011-02-03 05:40:54 +00:00
Rafael Espindola	d11311f291	Fix PR9127 by reversing the operands even if they have more then one use. Reversing the operands allows us to fold, but doesn't force us to. Also, at this point the DAG is still being optimized, so the check for hasOneUse is not very precise. llvm-svn: 124773	2011-02-03 03:58:05 +00:00
Daniel Dunbar	4fed88704d	raw_fd_ostream: Add a SetUseAtomicWrites() method (uses writev). llvm-svn: 124771	2011-02-03 03:32:32 +00:00
Jakob Stoklund Olesen	dca2917e25	Defer SplitKit value mapping until all defs are available. The greedy register allocator revealed some problems with the value mapping in SplitKit. We would sometimes start mapping values before all defs were known, and that could change a value from a simple 1-1 mapping to a multi-def mapping that requires ssa update. The new approach collects all defs and register assignments first without filling in any live intervals. Only when finish() is called, do we compute liveness and mapped values. At this time we know with certainty which values map to multiple values in a split range. This also has the advantage that we can compute live ranges based on the remaining uses after rematerializing at split points. The current implementation has many opportunities for compile time optimization. llvm-svn: 124765	2011-02-03 00:54:23 +00:00
Devang Patel	df0dd7dc69	Fix typo in comment. llvm-svn: 124759	2011-02-03 00:13:47 +00:00
Devang Patel	be933b470a	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	3a9e65efb6	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Duncan Sands	5747abab10	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Bob Wilson	09a6b46c89	Update comment to match my recent change. llvm-svn: 124725	2011-02-02 17:29:40 +00:00
Benjamin Kramer	f4ea1d5f79	SimplifyCFG: Turn switches into sub+icmp+branch if possible. This makes the job of the later optzn passes easier, allowing the vast amount of icmp transforms to chew on it. We transform 840 switches in gcc.c, leading to a 16k byte shrink of the resulting binary on i386-linux. The testcase from README.txt now compiles into decl %edi cmpl $3, %edi sbbl %eax, %eax andl $1, %eax ret llvm-svn: 124724	2011-02-02 15:56:22 +00:00
Richard Osborne	8607a67d37	Add support for trampolines on the XCore. llvm-svn: 124722	2011-02-02 14:57:41 +00:00
Duncan Sands	fdfdbd091d	Remove NoVendor and NoOS, added in commit 123990, from Triple. While it may be useful to understand "none", this is not the place for it. Tweak the fix to Normalize while there: the fix added in 123990 works correctly, but I like this way better. Finally, now that Triple understands some non-trivial environment values, teach the unittests about them. llvm-svn: 124720	2011-02-02 10:08:38 +00:00
Nick Lewycky	a46c898314	Remove wasteful caching. This isn't needed for correctness because any function that might have changed been affected by a merge elsewhere will have been removed from the function set, and it isn't needed for performance because we call grow() ahead of time to prevent reallocations. llvm-svn: 124717	2011-02-02 05:31:01 +00:00
Dan Gohman	c6f0bda839	Conservatively, clear optional flags, such as nsw, when performing reassociation. No testcase, because I wasn't able to create a testcase which actually demonstrates a problem. llvm-svn: 124713	2011-02-02 02:05:46 +00:00
Dan Gohman	08d2c98c23	Fix reassociate to clear optional flags, such as nsw. llvm-svn: 124712	2011-02-02 02:02:34 +00:00
Sean Callanan	26fc7858db	Fixed a bug in the disassembler where the mandatory 0x66 prefix would be misinterpreted in some cases on 32-bit x86 platforms. Thanks to Olivier Meurant for identifying the bug. llvm-svn: 124709	2011-02-02 01:09:02 +00:00
Evan Cheng	d42641c6b5	Given a pair of floating point load and store, if there are no other uses of the load, then it may be legal to transform the load and store to integer load and store of the same width. This is done if the target specified the transformation as profitable. e.g. On arm, this can transform: vldr.32 s0, [] vstr.32 s0, [] to ldr r12, [] str r12, [] rdar://8944252 llvm-svn: 124708	2011-02-02 01:06:55 +00:00
Bob Wilson	59513209aa	PR9081: Split up LDM instruction with deprecated use of both LR and PC. This is completely untested but pretty straightforward, so hopefully I got it right. llvm-svn: 124694	2011-02-01 22:30:51 +00:00
Matt Beaumont-Gay	29c8c8fe92	Take Bill Wendling's suggestion for structuring a couple of asserts. llvm-svn: 124688	2011-02-01 22:12:50 +00:00
Anton Korobeynikov	1f3bc9b5e6	Fix imm printing for logical instructions. Patch by Brian G. Lucas! llvm-svn: 124679	2011-02-01 20:22:53 +00:00
Jay Foad	142777224c	Make SwitchInst::removeCase() more efficient. llvm-svn: 124659	2011-02-01 09:22:34 +00:00
Duncan Sands	a29ea9aa4c	Add a m_Undef pattern for convenience. This is so that code that uses pattern matching can also pattern match undef, creating a more uniform style. llvm-svn: 124657	2011-02-01 09:06:20 +00:00
Duncan Sands	4b397fcdc2	Add a m_SignBit pattern for convenience. llvm-svn: 124656	2011-02-01 08:50:33 +00:00
Duncan Sands	cf0ff030a8	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Carl Norum	ecd90b5946	Test commit - fix a double 'should' in a comment. llvm-svn: 124652	2011-02-01 07:38:42 +00:00
Rafael Espindola	4a9b18d07b	Correctly merge available_externally and regular definitions when they have different visibilities. llvm-svn: 124650	2011-02-01 05:33:52 +00:00
Evan Cheng	dfc85ed01e	Fix bogus assert condition noticed by Csaba Raduly. llvm-svn: 124645	2011-02-01 01:50:49 +00:00
Eric Christopher	46308e666a	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Evan Cheng	d22a4a1fd6	Patches to build EFI with Clang/LLVM. By Carl Norum. llvm-svn: 124639	2011-02-01 01:14:13 +00:00
Devang Patel	56cc5fdf09	Keep track of incoming argument's location while emitting LiveIns. llvm-svn: 124611	2011-01-31 21:38:14 +00:00
Roman Divacky	9a58919c8e	Enumerate .code16/32/64 instead of checking .code prefix. This unbreaks some ARM tests. llvm-svn: 124608	2011-01-31 21:19:43 +00:00
Roman Divacky	bd59dff739	Error on all .code* directives instead of just .code16 as they all lead to a silent miscompilation of code. llvm-svn: 124603	2011-01-31 20:56:49 +00:00
David Greene	f3c6873544	Fix vector sign extend to put the source and destination types in the correct places. llvm-svn: 124601	2011-01-31 20:39:01 +00:00
Chris Lattner	865fe3b283	add a note, progress unblocked by PR8575 being fixed. llvm-svn: 124599	2011-01-31 20:23:28 +00:00
Richard Osborne	272e084bca	Fix bug where ReduceLoadWidth was creating illegal ZEXTLOAD instructions. llvm-svn: 124587	2011-01-31 17:41:44 +00:00
Anton Korobeynikov	221f4faa92	Save a mapping between original and cloned constpool entries. llvm-svn: 124570	2011-01-30 22:07:39 +00:00
Anton Korobeynikov	fe3a6e049d	Clarify the LSDASection NULL check llvm-svn: 124569	2011-01-30 22:07:31 +00:00
Anders Carlsson	f23a6da271	Recognize and simplify (A+B) == A -> B == 0 A == (A+B) -> B == 0 llvm-svn: 124567	2011-01-30 22:01:13 +00:00
Jakob Stoklund Olesen	9af7afcb7f	Respect the -tail-dup-size command line option even when optimizing for size. This is similar to the -unroll-threshold option. There should be no change in behavior when -tail-dup-size is not explicit on the llc command line. llvm-svn: 124564	2011-01-30 20:38:12 +00:00
Duncan Sands	2e5a58da8f	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	b67edc6a29	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Benjamin Kramer	946e1522b6	Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x, c1+c2) when c1 equals the amount of bits that are truncated off. This happens all the time when a smul is promoted to a larger type. On x86-64 we now compile "int test(int x) { return x/10; }" into movslq %edi, %rax imulq $1717986919, %rax, %rax movq %rax, %rcx shrq $63, %rcx sarq $34, %rax <- used to be "shrq $32, %rax; sarl $2, %eax" addl %ecx, %eax This fires 96 times in gcc.c on x86-64. llvm-svn: 124559	2011-01-30 16:38:43 +00:00
Nick Lewycky	c15dd6f07c	Fix 'fcmp one' constant folding. Noticed by inspection. llvm-svn: 124557	2011-01-30 01:49:58 +00:00
Nick Lewycky	7c75f0c031	Fix some formatting and upgrade comments from llvm 1.x to 2.x syntax. llvm-svn: 124556	2011-01-30 01:48:50 +00:00
Nick Lewycky	97a2895e73	Add the select optimization recently added to instcombine to constant folding. This is the one where one of the branches of the select is another select on the same condition. llvm-svn: 124547	2011-01-29 20:35:06 +00:00
Francois Pichet	326e4a2966	Unbreak the MSVC build. The DEBUG() call at line 606 demands to see raw_ostream's definition. I have no idea why this seems to only break MSVC. llvm-svn: 124545	2011-01-29 20:06:16 +00:00
Nick Lewycky	b89d9a4412	Fix comment. llvm-svn: 124544	2011-01-29 19:55:23 +00:00
Frits van Bommel	2a55951d08	Call SimplifyFDivInst() in InstCombiner::visitFDiv(). llvm-svn: 124535	2011-01-29 17:50:27 +00:00
Frits van Bommel	c2549661af	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	2e9e4f1be3	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Benjamin Kramer	65bb14d368	Add the missing sub identity "A-(A-B) -> B" to DAGCombine. This happens e.g. for code like "X - X%10" where we lower the modulo operation to a series of multiplies and shifts that are then subtracted from X, leading to this missed optimization. llvm-svn: 124532	2011-01-29 12:34:05 +00:00
Evan Cheng	73c29178ac	Add a test for TCE return duplication. llvm-svn: 124527	2011-01-29 04:53:35 +00:00
Evan Cheng	d983eba7dc	Re-apply r124518 with fix. Watch out for invalidated iterator. llvm-svn: 124526	2011-01-29 04:46:23 +00:00
Evan Cheng	65b8ccf6ac	Revert r124518. It broke Linux self-host. llvm-svn: 124522	2011-01-29 02:43:04 +00:00
Evan Cheng	d4eff31476	Re-commit r124462 with fixes. Tail recursion elim will now dup ret into unconditional predecessor to enable TCE on demand. llvm-svn: 124518	2011-01-29 01:29:26 +00:00
Andrew Trick	24f5ff0f23	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Roman Divacky	cd9ae95ae7	Error on .code16 instead of producing wrong (32bit) code. llvm-svn: 124498	2011-01-28 19:29:48 +00:00
Duncan Sands	e4b4d0c16d	This dyn_cast should be a cast. Pointed out by Frits van Bommel. llvm-svn: 124497	2011-01-28 18:53:08 +00:00
Duncan Sands	65995fa2a0	Thread divisions over selects and phis. This doesn't fire much and has basically zero effect on the testsuite (it improves two Ada testcases). llvm-svn: 124496	2011-01-28 18:50:50 +00:00
Bob Wilson	775eec2280	PR9030: Fix disassembly of ARM "mov pc, lr" instruction. Patch by Jyun-Yan You. llvm-svn: 124492	2011-01-28 17:50:30 +00:00
Duncan Sands	771e82a863	My auto-simplifier noticed that ((X/Y)Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (ZY)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "ZY" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Oscar Fuentes	e789bdb870	Fix libffi usage when it is on a custom path. llvm-svn: 124486	2011-01-28 16:49:05 +00:00
Roman Divacky	7e9e290952	Add support for parsing .float llvm-svn: 124485	2011-01-28 14:20:32 +00:00
Nick Lewycky	cfb284cf96	Rename functions to follow coding standard. Also rejiggers comments. No functionality change. llvm-svn: 124482	2011-01-28 08:43:14 +00:00
Nick Lewycky	aaf401241a	Add a doxygen comment for this class. llvm-svn: 124480	2011-01-28 08:19:00 +00:00
Nick Lewycky	564fcca856	Reorder for readability. (Chris, is this what you meant?) llvm-svn: 124479	2011-01-28 07:36:21 +00:00
Evan Cheng	aaa9606b2f	Revert r124462. There are a few big regressions that I need to fix first. llvm-svn: 124478	2011-01-28 07:12:38 +00:00
Nick Lewycky	c5eb3733f7	Reduce the number of functions we look at in the first pass, and preallocate the function equality set. llvm-svn: 124475	2011-01-28 05:48:15 +00:00
Nick Lewycky	0af77fd45b	Fix build with stdcxx by using llvm::next. Patch by Joerg Sonnenberger! llvm-svn: 124472	2011-01-28 04:00:15 +00:00
Nick Lewycky	b074e32641	Fold select + select where both selects are on the same condition. llvm-svn: 124469	2011-01-28 03:28:10 +00:00
Rafael Espindola	6c17d54891	Print the visibility of declarations. llvm-svn: 124468	2011-01-28 03:20:10 +00:00
Nico Weber	4ada0d9164	PR8951: Support for .equiv in integrated assembler, patch by Jörg Sonnenberger! llvm-svn: 124467	2011-01-28 03:04:41 +00:00
Evan Cheng	417fca86c4	- Stop simplifycfg from duplicating "ret" instructions into unconditional branches. PR8575, rdar://5134905, rdar://8911460. - Allow codegen tail duplication to dup small return blocks after register allocation is done. llvm-svn: 124462	2011-01-28 02:19:21 +00:00
Evan Cheng	bb8420a070	Fix PLD encoding. llvm-svn: 124458	2011-01-27 23:48:34 +00:00
Kevin Enderby	e9f2f0cb0b	Changed llvm-mc arm target to give an error if .syntax divided is used. Since only .syntax unified is supported. llvm-svn: 124454	2011-01-27 23:22:36 +00:00
Oscar Fuentes	800a2afbb3	Use the paths to libffi's header and library even when no custom location was stated with FFI_INCLUDE_DIR/FFI_LIBRARY_DIR. llvm-svn: 124449	2011-01-27 22:58:34 +00:00
David Greene	34f7c0d8aa	[AVX] Clean up the code to configure target lowering for AVX. Specify how to lower more/new operations. This is a prerequisite for adding additional AVX lowering. llvm-svn: 124447	2011-01-27 22:38:56 +00:00
Andrew Trick	c0ca67601a	Remove a temporary workaround for a lencod miscompile. Depends on the fix in r124442. llvm-svn: 124443	2011-01-27 21:28:51 +00:00
Andrew Trick	13bb644fdd	VirtRegRewriter fix: update kill flags, which are used by the scavenger. rdar://problem/8893967: JM/lencod miscompile at -arch armv7 -mthumb -O3 Added ResurrectKill to remove kill flags after we decide to reused a physical register. And (hopefully) ensure that we call it in all the right places. Sorry, I'm not checking in a unit test given that it's a miscompile I can't reproduce easily with a toy example. Failures in the rewriter depend on a series of heuristic decisions maked during one of the many upstream phases in codegen. This case would require coercing regalloc to generate a couple of rematerialzations in a way that causes the scavenger to reuse the same register at just the wrong point. The general way to test this is to implement kill flags verification. Then we could have a simple, robust compile-only unit test. That would be worth doing if the whole pass was not about to disappear. At this point we focus verification work on the next generation of regalloc. llvm-svn: 124442	2011-01-27 21:26:43 +00:00
Benjamin Kramer	57e3d65884	Unbreak the build. llvm-svn: 124426	2011-01-27 20:30:54 +00:00
Nick Lewycky	e2d46d30ae	Expound upon this comparison! llvm-svn: 124406	2011-01-27 19:51:31 +00:00
Nick Lewycky	5a37e950e1	Use dyn_cast instead of isa+cast. llvm-svn: 124404	2011-01-27 19:42:43 +00:00
Devang Patel	1cec755494	Speculatively revert r124380. llvm-svn: 124397	2011-01-27 19:15:01 +00:00
Devang Patel	3b266a2780	While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. Take 2. This includes fix for dragonegg crash. llvm-svn: 124380	2011-01-27 17:43:53 +00:00
Roman Divacky	ed5efb4053	Add support for specifying register name in cfi-register/offset/def as well as register number. llvm-svn: 124379	2011-01-27 17:16:37 +00:00
Roman Divacky	36b1b47c5a	Introduce virtual ParseRegister method in TargetAsmParser. Create override of this method in X86/ARM/MBlaze. llvm-svn: 124378	2011-01-27 17:14:22 +00:00
Jay Foad	9f32cfd35e	Fix indentation. llvm-svn: 124375	2011-01-27 14:44:55 +00:00
Nick Lewycky	13e04aef2a	Fix surprising missed optimization in mergefunc where we forgot to consider that relationships like "i8* null" is equivalent to "i32* null". llvm-svn: 124368	2011-01-27 08:38:19 +00:00
Bob Wilson	2d69fb4184	Avoid modifying the OneClassForEachPhysReg map while iterating over it. Linear scan regalloc is currently assuming that any register aliased with a member of a regclass must also be in at least one regclass. That is not always true. For example, for X86, RIP is in a regclass but IP is not. If you're unlucky, this can cause a crash by invalidating the iterator. llvm-svn: 124365	2011-01-27 07:26:15 +00:00
Eric Christopher	331cc5218d	Use the incoming VT not the VT of where we're trying to store to determine if we can store a value. Also, the exclusion is or, not and. Fixes rdar://8920247. llvm-svn: 124357	2011-01-27 05:44:56 +00:00
NAKAMURA Takumi	f3e20b9f0f	lib/Target/X86/X86ISelDAGToDAG.cpp: __main should be WINCALL64 on Win64. CALL64 marks %xmm* as dead. llvm-svn: 124354	2011-01-27 03:20:19 +00:00
Matt Beaumont-Gay	a148c59231	Try harder to not have unused variables. llvm-svn: 124350	2011-01-27 02:39:27 +00:00
Matt Beaumont-Gay	0cddbf2bdf	Opt-mode -Wunused-variable cleanup llvm-svn: 124346	2011-01-27 01:47:50 +00:00
Devang Patel	92b7077f9e	Reapply 124301 llvm-svn: 124339	2011-01-27 00:13:27 +00:00
Bill Wendling	fb4ee9bbde	Initialize variable to get rid of clang warning. llvm-svn: 124331	2011-01-26 22:21:35 +00:00
Jay Foad	b0c5e35929	Simplify User::operator delete(). llvm-svn: 124330	2011-01-26 21:56:10 +00:00
Devang Patel	b370bf329a	Revert 124301. llvm-svn: 124327	2011-01-26 21:41:22 +00:00
Devang Patel	084e0628e0	Revert r124302 llvm-svn: 124320	2011-01-26 21:12:32 +00:00
Bill Wendling	5a13d4fa8f	Add support for printing out floating point values from the ARM assembly parser. The parser will always give us a binary representation of the floating point number. llvm-svn: 124318	2011-01-26 20:57:43 +00:00
Eric Christopher	cd55a46c31	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
David Greene	bab5e6ed0e	[AVX] Add INSERT_SUBVECTOR and support it on x86. This provides a default implementation for x86, going through the stack in a similr fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VINSERTF128 if AVX is available. llvm-svn: 124307	2011-01-26 19:13:22 +00:00
Devang Patel	a11210b1b8	While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. llvm-svn: 124302	2011-01-26 18:55:05 +00:00
Devang Patel	9d4eb2f480	Process valid SDDbgValues even if the node does not have any order assigned. llvm-svn: 124301	2011-01-26 18:42:32 +00:00
Devang Patel	1448e7c8b6	Refactor. llvm-svn: 124300	2011-01-26 18:20:04 +00:00
David Greene	b6f1611928	[AVX] Support EXTRACT_SUBVECTOR on x86. This provides a default implementation of EXTRACT_SUBVECTOR for x86, going through the stack in a similr fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VEXTRACTF128 if AVX is available. llvm-svn: 124292	2011-01-26 15:38:49 +00:00
Bruno Cardoso Lopes	178c1e0c9b	fix the encoding and add testcases for ARM nop, yield, wfe and wfi instructions llvm-svn: 124288	2011-01-26 13:28:14 +00:00
Duncan Sands	69bdb585b2	Fix PR9039, a use-after-free in reassociate. The issue was that the operand being factorized (and erased) could occur several times in Ops, resulting in freed memory being used when the next occurrence in Ops was analyzed. llvm-svn: 124287	2011-01-26 10:08:38 +00:00
Nick Lewycky	91543447a6	AttrListPtr has an overloaded operator== which does this for us, we should use it. No functionality change! llvm-svn: 124286	2011-01-26 09:23:19 +00:00
Nick Lewycky	82d4db8662	Teach mergefunc that intptr_t is the same width as a pointer. We still can't merge vector<intptr_t>::push_back() and vector<void>::push_back() because Enumerate() doesn't realize that "i64 null" and "i8** null" are equivalent. llvm-svn: 124285	2011-01-26 09:13:58 +00:00
Nick Lewycky	fb622f9920	There are no vectors of pointer or arrays, so we don't need to check vector elements for type equivalence. llvm-svn: 124284	2011-01-26 08:50:18 +00:00
Duncan Sands	8a33733228	APInt has a method for determining whether a number is a power of 2 which is more efficient than countPopulation - use it. llvm-svn: 124283	2011-01-26 08:44:16 +00:00
Nick Lewycky	d9e6b4a8ff	Fix memory corruption. If one of the SCEV creation functions calls another but doesn't return immediately after then the insert position in UniqueSCEVs will be out of date. No test because this is a memory corruption issue. Fixes PR9051! llvm-svn: 124282	2011-01-26 08:40:22 +00:00
Eric Christopher	078159e310	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Bill Wendling	d13d13496f	Add needed braces. llvm-svn: 124273	2011-01-26 02:06:22 +00:00
NAKAMURA Takumi	0cfdac078e	Target/X86: Tweak win64's tailcall. llvm-svn: 124272	2011-01-26 02:04:09 +00:00
NAKAMURA Takumi	9d29eff198	Fix whitespace. llvm-svn: 124270	2011-01-26 02:03:37 +00:00
NAKAMURA Takumi	c780782560	lib/Target/X86/X86RegisterInfo.cpp: Fix whitespace. llvm-svn: 124268	2011-01-26 01:28:06 +00:00
NAKAMURA Takumi	86278dc3ea	lib/Target/X86/X86RegisterInfo.cpp: Fix a typo in comment. llvm-svn: 124267	2011-01-26 01:27:58 +00:00
Eric Christopher	58f157a677	Coding style formatting changes. llvm-svn: 124260	2011-01-26 01:09:59 +00:00
Jakob Stoklund Olesen	b308902024	Rename member variables to follow the rest of LLVM. No functional change. llvm-svn: 124257	2011-01-26 00:50:53 +00:00
Devang Patel	efc6b16e4b	Provide an interface to transfer SDDbgValue from one SDNode to another. llvm-svn: 124245	2011-01-25 23:27:42 +00:00
Bill Wendling	57990c4910	Revert 124230. It was causing test failures. llvm-svn: 124233	2011-01-25 21:48:36 +00:00
Bill Wendling	624cef696d	The floating point value is encoded in its binary form as an Imm. Convert it appropriately so that it prints out the decimal representation. llvm-svn: 124230	2011-01-25 21:27:46 +00:00
Bill Wendling	cdbf17b179	Add support for parsing a Real value. It stores the Real value as its binary encoding. It's up to the individual back-ends to convert it to their preferred representation when printing. llvm-svn: 124229	2011-01-25 21:26:41 +00:00
Rafael Espindola	563eb4bb6c	Move unnamed_addr after the function arguments on Sabre's request. llvm-svn: 124209	2011-01-25 19:09:56 +00:00
Devang Patel	70f8e5962a	Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. llvm-svn: 124203	2011-01-25 18:09:58 +00:00
Devang Patel	04b649d48a	This assertion is too restrictive, it does not apply for dangling dbg value nodes (nodes where dbg.value intrinsic preceds use of the value). llvm-svn: 124202	2011-01-25 18:09:33 +00:00
Duncan Sands	9e9d5b25e2	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	fced7620f5	See if this fixes llvm-gcc bootstrap. llvm-svn: 124184	2011-01-25 12:15:09 +00:00
Duncan Sands	d395108394	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x \| 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X\|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Nick Lewycky	f1cec164ce	Teach mergefunc how to emit aliases safely again -- but keep it turned it off for now. It's controlled by the HasGlobalAliases variable which is not attached to any flag yet. llvm-svn: 124182	2011-01-25 08:56:50 +00:00
Eric Christopher	cd087f2512	Reorganize this so that the early exit and special cases come early rather than interspersed. No functional change. llvm-svn: 124168	2011-01-25 01:34:31 +00:00
Evan Cheng	d6093ff4cb	Don't merge restore with tail call instruction. llvm-svn: 124167	2011-01-25 01:28:33 +00:00
Anton Korobeynikov	f3a62314f3	Provide correct registers for EH stuff on ARM llvm-svn: 124151	2011-01-24 22:38:45 +00:00
Anton Korobeynikov	b15beb2ae1	Support printing exception section into the current one. This is the case when LSDASection is blank llvm-svn: 124150	2011-01-24 22:38:40 +00:00
Devang Patel	533479544b	Speculatively revert r124138. llvm-svn: 124142	2011-01-24 20:04:37 +00:00
Devang Patel	8cc5355c90	Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. llvm-svn: 124138	2011-01-24 19:24:37 +00:00
Andrew Trick	a293c49f0d	Temporarily workaround JM/lencod miscompile (SIGSEGV). rdar://problem/8893967 llvm-svn: 124137	2011-01-24 19:08:15 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Chris Lattner	f277b5d434	fix PR8928 by clearing a stale map, patch by Jakub Staszak! llvm-svn: 124132	2011-01-24 18:36:51 +00:00
Rafael Espindola	689939e648	Handle strings in section names the same way as gas: * If the name is a single string, we remove the quotes * If the name starts without a quote, we include any quotes in the name llvm-svn: 124127	2011-01-24 18:02:54 +00:00
Dan Gohman	3ac8cd614f	Add a comment. llvm-svn: 124126	2011-01-24 17:54:18 +00:00
Daniel Dunbar	72d523beab	Support/CommandLine: Fix LookupNearestOption to also search extra option names. llvm-svn: 124124	2011-01-24 17:27:17 +00:00
Chris Lattner	bf638d2a0d	fix a missing shuffle pattern, PR9009. Patch by Artiom Myaskouvskey! llvm-svn: 124102	2011-01-24 03:42:46 +00:00
Chris Lattner	b4017769ae	fix PR9017, a bug where we'd assert when promoting in unreachable code. llvm-svn: 124100	2011-01-24 03:29:07 +00:00
Chris Lattner	23289c385a	fix PR9015, a crash linking recursive metadata. llvm-svn: 124099	2011-01-24 03:18:24 +00:00
Chris Lattner	9685603260	this isn't a memset, we do convert dest[i] to one though :) llvm-svn: 124097	2011-01-24 02:32:00 +00:00
Chris Lattner	b830ee5250	with recent work, we now optimize this into: define i32 @foo(i32 %x) nounwind readnone ssp { entry: %tobool = icmp eq i32 %x, 0 %tmp5 = select i1 %tobool, i32 2, i32 1 ret i32 %tmp5 } llvm-svn: 124091	2011-01-24 01:12:18 +00:00
Chris Lattner	d83e7b0ff6	enhance SRoA to promote allocas that are used by PHI nodes. This often occurs because instcombine sinks loads and inserts phis. This kicks in on such apps as 175.vpr, eon, 403.gcc, xalancbmk and a bunch of times in spec2006 in some app that uses std::deque. This resolves the last of rdar://7339113. llvm-svn: 124090	2011-01-24 01:07:11 +00:00
Chris Lattner	a960725d18	Enhance SRoA to promote allocas that are used by selects in some common cases. This triggers a surprising number of times in SPEC2K6 because min/max idioms end up doing this. For example, code from the STL ends up looking like this to SRoA: %202 = load i64* %__old_size, align 8, !tbaa !3 %203 = load i64* %__old_size, align 8, !tbaa !3 %204 = load i64* %__n, align 8, !tbaa !3 %205 = icmp ult i64 %203, %204 %storemerge.i = select i1 %205, i64* %__n, i64* %__old_size %206 = load i64* %storemerge.i, align 8, !tbaa !3 We can now promote both the __n and the __old_size allocas. This addresses another chunk of rdar://7339113, poor codegen on stringswitch. llvm-svn: 124088	2011-01-23 22:04:55 +00:00
Chris Lattner	9879965f4b	teach Value::isDereferenceablePointer that byval arguments are always dereferencable, noticed by inspection. llvm-svn: 124085	2011-01-23 21:15:29 +00:00
Anders Carlsson	773bc67eff	Add a memset loop that LoopIdiomRecognize doesn't recognize. llvm-svn: 124082	2011-01-23 20:31:00 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Rafael Espindola	c5efca47fc	Initialize MCNoExecStack. llvm-svn: 124079	2011-01-23 18:50:12 +00:00
Rafael Espindola	b3eca9bb71	Add support for the --noexecstack option. llvm-svn: 124077	2011-01-23 17:55:27 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Rafael Espindola	8bac423ddb	Add support for lowercase variants. llvm-svn: 124071	2011-01-23 16:11:25 +00:00
Chris Lattner	9491dee24e	Enhance SRoA to be more aggressive about scalarization of aggregate allocas that have PHI or select uses of their element pointers. This can often happen when instcombine sinks two loads into a successor, inserting a phi or select. With this patch, we can scalarize the alloca, but the pinned elements are not yet promoted. This is still a win for large aggregates where only one element is used. This fixes rdar://8904039 and part of rdar://7339113 (poor codegen on stringswitch). llvm-svn: 124070	2011-01-23 08:27:54 +00:00
Cameron Zwarich	07d6fe34b3	Convert two std::vectors to SmallVectors for a 3.4% speedup running -scalarrepl on test-suite + SPEC2000 & SPEC2006. llvm-svn: 124068	2011-01-23 08:03:04 +00:00
Chris Lattner	8acbb79506	have AllocaInfo store the alloca being inspected, simplifying callers. No functionality change. llvm-svn: 124067	2011-01-23 07:29:29 +00:00
Chris Lattner	3e56c29068	Rearrange some code a bit. Change MarkUnsafe to handle the "Transformation preventing inst" printing, so that -scalarrepl -debug will always print the rejected instruction. No functionality change. llvm-svn: 124066	2011-01-23 07:05:44 +00:00
Chris Lattner	a587ab7b94	remove an old hack that avoided creating MMX datatypes. The X86 backend has been fixed. llvm-svn: 124064	2011-01-23 06:40:33 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Rafael Espindola	4b7b7fba38	Delay the creation of eh_frame so that the user can change the defaults. Add support for SHT_X86_64_UNWIND. llvm-svn: 124059	2011-01-23 05:43:40 +00:00
Rafael Espindola	0e7e34e476	Remove more duplicated code. llvm-svn: 124056	2011-01-23 04:43:11 +00:00
Rafael Espindola	aea4958ea6	Remove duplicated code. llvm-svn: 124054	2011-01-23 04:28:49 +00:00

... 3 4 5 6 7 ...

45495 Commits