llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	873106a932	Force a triple to make this test pass on Darwin. llvm-svn: 132228	2011-05-27 23:12:48 +00:00
Cameron Zwarich	75d99e4b70	Add a GR32_NOREX_NOSP register class and fix a bug where getMatchingSuperRegClass() was saying that the matching superregister class of GR32_NOREX in GR64_NOREX_NOSP is GR64_NOREX, which drops the NOSP constraint. This fixes PR10032. llvm-svn: 132225	2011-05-27 22:26:04 +00:00
Rafael Espindola	b8e08be77d	Fix a regression I recently introduced by removing DwarfRegNum of subregisters: When a value is in a subregister, at least report the location as being the superregister. We should extend the .td files to encode the bit range so that we can produce a DW_OP_bit_piece. llvm-svn: 132224	2011-05-27 22:15:01 +00:00
Rafael Espindola	d23bfb8a7a	Make size computation less brittle. llvm-svn: 132222	2011-05-27 22:05:41 +00:00
Charles Davis	041ec4aada	Add the suffix to the Win64 EH data sections' names if given. Add a test for this. XFAIL'd, because the COFF AsmParser can't handle .section yet. llvm-svn: 132220	2011-05-27 21:38:47 +00:00
Chad Rosier	bbdca744d4	Typo is test case llvm-svn: 132214	2011-05-27 20:16:57 +00:00
Jakob Stoklund Olesen	2348f3133f	Make room for register allocation to improve. llvm-svn: 132213	2011-05-27 20:15:06 +00:00
Evan Cheng	518bcd0ef4	Don't use movw / movt for iOS static codegen for now to workaround some tools issues. rdar://9514789 llvm-svn: 132211	2011-05-27 20:11:27 +00:00
Jakob Stoklund Olesen	63a9cef5c2	Delete a test that is no longer relevant. According to PR2536, the old spiller had trouble with the IMPLICIT_DEF in this code: %reg1028<def> = MOV16rm %reg0, 1, %reg0, <ga:g_5>, Mem:LD(2,2) [g_5 + 0] %reg1039<def> = IMPLICIT_DEF %reg1038<def> = INSERT_SUBREG %reg1039, %reg1028, 2 %reg1025<def> = AND32ri %reg1038, 65534, %%EFLAGS<imp-def> However, today we emit a zero-extending load instead: %vreg10<def> = MOVZX32rm16 %noreg, 1, %noreg, <ga:@g_5>, %noreg; %mem:LD2[@g_5] GR32:%vreg10 %vreg0<def> = AND32ri %vreg10, 65534, %%EFLAGS<imp-def,dead>; %GR32:%vreg0,%vreg10 This makes the test pointless since it no longer creates the spiller hazard. llvm-svn: 132210	2011-05-27 20:02:42 +00:00
Chad Rosier	3252177f16	CRC32 intrinsics were renamed at revision 132163. This submission fixes aliasing issues with the old and new names as well as adds test cases for the auto-upgrader. Fixes rdar 9472944. llvm-svn: 132207	2011-05-27 19:38:10 +00:00
Evan Cheng	97c9f84f68	Add iOS test llvm-svn: 132203	2011-05-27 19:04:21 +00:00
John McCall	bd04b74bb2	Fix the inliner to maintain the current de facto invoke semantics: - the selector for the landing pad must provide all available information about the handlers, filters, and cleanups within that landing pad - calls to _Unwind_Resume must be converted to branches to the enclosing lpad so as to avoid re-entering the unwinder when the lpad claimed it was going to handle the exception in some way This is quite specific to libUnwind-based unwinding. In an effort to not interfere too badly with other unwinders, and with existing hacks in frontends, this only triggers on _Unwind_Resume (not _Unwind_Resume_or_Rethrow) and does nothing with selectors if it cannot find a selector call for either lpad. llvm-svn: 132200	2011-05-27 18:34:38 +00:00
Eli Friedman	3a8d9625b0	And fix the test in r132194. llvm-svn: 132196	2011-05-27 18:14:28 +00:00
Eli Friedman	fe84bd659c	Fix a silly mistake (which trips over an assertion) in r132099. rdar://9515076 llvm-svn: 132194	2011-05-27 18:02:04 +00:00
Devang Patel	3c6aed2d98	Select DW_AT_const_value size based on variable size. llvm-svn: 132193	2011-05-27 16:45:18 +00:00
Charles Davis	ea5dc3a67b	Assorted fixes for Win64 EH unwind info emission: - Flip order of bitfields. This gets our output matching GAS. - Handle case where the end of the prolog wasn't specified. - If the resulting unwind info struct is less than 8 bytes, pad to 8 bytes. Add a test for the latter two. llvm-svn: 132188	2011-05-27 15:10:25 +00:00
Benjamin Kramer	749ef5f420	InstCombine: Make switch folding with equality compares more aggressive by trying instsimplify on the arm where we know the compared value. Stuff like "x == y ? y : x&y" now folds into "x&y". llvm-svn: 132185	2011-05-27 13:00:16 +00:00
Cameron Zwarich	34ef49dc74	Fix PR10029 - VerifyCoalescing failure on patterns_dfa.c of 445.gobmk. llvm-svn: 132181	2011-05-27 05:04:51 +00:00
Charles Davis	43a421e3d5	Add a test for Win64 EH unwind information emission. llvm-svn: 132180	2011-05-27 03:54:43 +00:00
Chad Rosier	b362884ca9	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8\|16\|32] have been renamed to .crc32.32.[8\|16\|32] and crc64.[8\|16\|32] have been renamed to .crc32.64.[8\|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Devang Patel	42ddaa10d3	During branch folding avoid inserting redundant DBG_VALUE machine instructions. llvm-svn: 132148	2011-05-26 21:47:59 +00:00
Galina Kistanova	7defeeae67	Make few ExecutionEngine tests XFAIL for ARM, since ExecutionEngine is broken for ARM, please remove the following XFAIL when it will be fixed. llvm-svn: 132135	2011-05-26 19:17:14 +00:00
Akira Hatanaka	aa560006ed	Add support for C++ exception handling. llvm-svn: 132131	2011-05-26 18:59:03 +00:00
Eli Friedman	c48f7c212e	Fix test on Windows. llvm-svn: 132126	2011-05-26 18:00:32 +00:00
Charles Davis	567a1ad7c5	Add a test for the chained directives that I forgot last time. llvm-svn: 132110	2011-05-26 05:17:43 +00:00
Stuart Hastings	493a12bf5e	Reverting 132105: it broke some LLVM-GCC DejaGNU tests. llvm-svn: 132108	2011-05-26 04:09:49 +00:00
Charles Davis	006e1c39d0	Test .seh_startchained and .seh_endchained parsing. Rework how the MCWin64EHUnwindInfo instances are stored. Fix issues with chained unwind areas exposed by the test that were related to this. The ChainedParent field had the wrong address, because when the chained unwind info was added, the addresses shifted around. Now we store the pointers to the structures, which are now allocated from the MC heap. llvm-svn: 132106	2011-05-26 02:45:47 +00:00
Stuart Hastings	276f231c2f	Correctly handle a one-word struct passed byval on x86_64. rdar://problem/6920088 llvm-svn: 132105	2011-05-26 02:44:56 +00:00
Andrew Trick	7fac79e255	indvars: incremental fixes for -disable-iv-rewrite and testcases. Use a proper worklist for use-def traversal without holding onto an iterator. Now that we process all IV uses, we need complete logic for resusing existing derived IV defs. See HoistStep. llvm-svn: 132103	2011-05-26 00:46:11 +00:00
Eli Friedman	c70355195c	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Akira Hatanaka	fa63d3096d	Define WeakRefDirective. llvm-svn: 132098	2011-05-25 23:30:30 +00:00
Eli Friedman	865866e7fe	PR9998: ashr exact %x, 31 is not equivalent to sdiv exact %x, -2147483648. llvm-svn: 132097	2011-05-25 23:26:20 +00:00
Charles Davis	2f6ecea19d	Add tests for .seh_setframe and .seh_handlerdata parsing. Fix issues with them. I had to add a special SwitchSectionNoChange method to MCStreamer just for .seh_handlerdata. If this isn't OK, please let me know, and I'll find some other way to fix .seh_handlerdata streaming. llvm-svn: 132084	2011-05-25 21:43:45 +00:00
Eric Christopher	8c5e4192e6	Implement the 'm' modifier. Note that it only works for memory operands. Part of rdar://9119939 llvm-svn: 132081	2011-05-25 20:51:58 +00:00
Akira Hatanaka	44eba3ac49	Custom-lower FCOPYSIGN nodes. llvm-svn: 132074	2011-05-25 19:32:07 +00:00
Charles Davis	828b00c0e1	Add tests for .seh_savereg and .seh_savexmm parsing. Once again, fix the buggy methods that parse these directives. llvm-svn: 132045	2011-05-25 04:51:25 +00:00
Cameron Zwarich	3088e0a179	Make tTAILJMPr/tTAILJMPrND emit a tBX without a preceding MOV of PC to LR. This fixes <rdar://problem/9495913> llvm-svn: 132042	2011-05-25 04:45:27 +00:00
Andrew Trick	eb3c36e69c	indvars: fixed IV cloning in -disable-iv-rewrite mode with associated cleanup and overdue test cases. llvm-svn: 132038	2011-05-25 04:42:22 +00:00
Charles Davis	b0c4f39173	Add a test for .seh_pushframe parsing. Fix the bug exposed by it (and another one I found by inspection). llvm-svn: 132037	2011-05-25 04:08:15 +00:00
Rafael Espindola	fc9bae6f8b	Replace the -unwind-tables option with a per function flag. This is more LTO friendly as we can now correctly merge files compiled with or without -fasynchronous-unwind-tables. llvm-svn: 132033	2011-05-25 03:44:17 +00:00
Akira Hatanaka	aac670c1c8	Fix lowering of DYNAMIC_STACKALLOC nodes. llvm-svn: 132030	2011-05-25 02:20:00 +00:00
Charles Davis	fc1e7ce850	Add a test for the .seh_handler directive. Fix problems with the parsing method exposed by the test. While we're at it, simplify the .seh_proc parsing method. llvm-svn: 132028	2011-05-25 01:33:42 +00:00
Bruno Cardoso Lopes	5445213a25	Fix PR9762 Enable the parsing of the operand "cpsr_all" for the ARM msr instruction llvm-svn: 132026	2011-05-25 00:35:03 +00:00
Eric Christopher	1b724948e9	Implement the arm 'L' asm modifier. Part of rdar://9119939 llvm-svn: 132024	2011-05-24 23:27:13 +00:00
Eric Christopher	b1dda56ac2	Implement the immediate part of the 'B' modifier. Part of rdar://9119939 llvm-svn: 132023	2011-05-24 23:15:43 +00:00
Eric Christopher	7617883ce3	Add support for the arm 'y' asm modifier. Fixes part of rdar://9444657 llvm-svn: 132011	2011-05-24 22:10:34 +00:00
Akira Hatanaka	2486729839	Test case for r132003. llvm-svn: 132005	2011-05-24 21:28:18 +00:00
Charles Davis	f4ce8fde18	Test basic SEH directive-parsing functionality. Fix a latent bug exposed by this test. llvm-svn: 132004	2011-05-24 21:22:53 +00:00
Akira Hatanaka	ce4037ebcf	Fix test case. llvm-svn: 131988	2011-05-24 19:37:15 +00:00
Akira Hatanaka	0f30561bae	Revision 131986 test case. llvm-svn: 131987	2011-05-24 19:29:37 +00:00
Cameron Zwarich	d7707fc911	Fix "make check" in Release by removing debug-only options from an 'opt' invocation. llvm-svn: 131972	2011-05-24 18:26:09 +00:00
Dan Gohman	0573b55c2b	Make DecomposeGEPExpression check SimplifyInstruction only after checking for a GEP, so that it matches what GetUnderlyingObject does. This fixes an obscure bug turned up by bugpoint in the testcase for PR9931. llvm-svn: 131971	2011-05-24 18:24:08 +00:00
Cameron Zwarich	843bc7d673	Make LoadAndStorePromoter preserve debug info and create llvm.dbg.values when promoting allocas to SSA variables. Fixes <rdar://problem/9479036>. llvm-svn: 131953	2011-05-24 03:10:43 +00:00
Rafael Espindola	0f33be1b87	Fix the defaults for .eh_frame. We were marking it as writable. llvm-svn: 131951	2011-05-24 02:50:20 +00:00
Evan Cheng	88f9137fd7	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Andrew Trick	37f0082804	FileCheck-ize a couple of IV unit tests. llvm-svn: 131946	2011-05-24 01:02:49 +00:00
Andrew Trick	1ea0243bd0	Test case for r130799 - indvars: Added canExpandBackEdgeTakenCount. llvm-svn: 131939	2011-05-24 00:17:53 +00:00
Akira Hatanaka	6af5bd2537	Add pattern for double-to-integer conversion. Patch by Sasa Stankovic. llvm-svn: 131927	2011-05-23 22:16:43 +00:00
Dan Gohman	6c4a319088	When checking for signed multiplication overflow, watch out for INT_MIN and -1. This fixes PR9845. llvm-svn: 131919	2011-05-23 21:07:39 +00:00
Akira Hatanaka	f9e5750fc8	Change StackDirection from StackGrowsUp to StackGrowsDown. The following improvements are accomplished as a result of applying this patch: - Fixed frame objects' offsets (relative to either the virtual frame pointer or the stack pointer) are set before instruction selection is completed. There is no need to wait until Prologue/Epilogue Insertion is run to set them. - Calculation of final offsets of fixed frame objects is straightforward. It is no longer necessary to assign negative offsets to fixed objects for incoming arguments in order to distinguish them from the others. - Since a fixed object has its relative offset set during instruction selection, there is no need to conservatively set its alignment to 4. - It is no longer necessary to reorder non-fixed frame objects in MipsFrameLowering::adjustMipsStackFrame. llvm-svn: 131915	2011-05-23 20:16:59 +00:00
Devang Patel	9987d3098b	Test case for r131908. llvm-svn: 131909	2011-05-23 17:49:29 +00:00
Devang Patel	c4d9a84159	While replacing all uses of a SDValue with another value, do not forget to transfer SDDbgValue. llvm-svn: 131907	2011-05-23 17:35:08 +00:00
Chris Lattner	026f5e61f0	fix a really nasty basicaa mod/ref calculation bug that was causing miscompilation of UnitTests/ObjC/messages-2.m with the recent optimizer improvements. llvm-svn: 131897	2011-05-23 05:15:43 +00:00
Cameron Zwarich	bc90690b24	Fix <rdar://problem/9476260> by having tail calls always generate 32-bit branches in Darwin Thumb2 code. Tail calls are already disabled on Thumb1. llvm-svn: 131894	2011-05-23 01:57:17 +00:00
Chris Lattner	8aff4f8efc	Transform any logical shift of a power of two into an exact/NUW shift when in a known-non-zero context. llvm-svn: 131887	2011-05-23 00:21:50 +00:00
Chris Lattner	83791ced7b	Teach valuetracking that byval arguments with a specified alignment are aligned. Teach memcpyopt to not give up all hope when confonted with an underaligned memcpy feeding an overaligned byval. If the source of the memcpy can be determined to be adequeately aligned, or if it can be forced to be, we can eliminate the memcpy. This addresses PR9794. We now compile the example into: define i32 @f(%struct.p* nocapture byval align 8 %q) nounwind ssp { entry: %call = call i32 @g(%struct.p* byval align 8 %q) nounwind ret i32 %call } in both x86-64 and x86-32 mode. We still don't get a tailcall though, because tailcalls apparently can't handle byval. llvm-svn: 131884	2011-05-23 00:03:39 +00:00
Chris Lattner	af5fecb747	add test from PR9164 llvm-svn: 131876	2011-05-22 22:35:34 +00:00
Chris Lattner	819278891a	testcase for PR9378 llvm-svn: 131875	2011-05-22 22:32:53 +00:00
Chris Lattner	713d52364f	implement PR9315, constant folding exp2 in terms of pow (since hosts without C99 runtimes don't have exp2). llvm-svn: 131872	2011-05-22 22:22:35 +00:00
Renato Golin	4cd5187f5b	RTABI chapter 4.3.4 specifies __eabi_mem* calls. Specifically, __eabi_memset accepts parameters (ptr, size, value) in a different order than GNU's memset (ptr, value, size), therefore the special lowering in AAPCS mode. Implementation by Evzen Muller. llvm-svn: 131868	2011-05-22 21:41:23 +00:00
Chris Lattner	7c99f19d9f	Carve out a place in instcombine to put transformations which work knowing that their result is non-zero. Implement an example optimization (PR9814), which allows us to transform: A / ((1 << B) >>u 2) into: A >>u (B-2) which we compile into: _divu3: ## @divu3 leal -2(%rsi), %ecx shrl %cl, %edi movl %edi, %eax ret instead of: _divu3: ## @divu3 movb %sil, %cl movl $1, %esi shll %cl, %esi shrl $2, %esi movl %edi, %eax xorl %edx, %edx divl %esi, %eax ret llvm-svn: 131860	2011-05-22 18:18:41 +00:00
Johnny Chen	a0c9c75df2	Fix Bug 9386 - ARM disassembler failed to disassemble conditional bx Modified the patch to .td file supplied by Jyun-Yan You. Add a test case and modified ARMDisassemblerCore.cpp a little bit. llvm-svn: 131859	2011-05-22 17:51:04 +00:00
Chris Lattner	c4ca7ab7e7	Fix PR9815: I was trying to get out of "generating code and then failing to form a memset, then having to delete it" but my approximation isn't safe for self recurrent loops. Instead of doign a hack, just do it the right way. llvm-svn: 131858	2011-05-22 17:39:56 +00:00
Frits van Bommel	ad964559ef	Add a parameter to ConstantFoldTerminator() that callers can use to ask it to also clean up the condition of any conditional terminator it folds to be unconditional, if that turns the condition into dead code. This just means it calls RecursivelyDeleteTriviallyDeadInstructions() in strategic spots. It defaults to the old behavior. I also changed -simplifycfg, -jump-threading and -codegenprepare to use this to produce slightly better code without any extra cleanup passes (AFAICT this was the only place in -simplifycfg where now-dead conditions of replaced terminators weren't being cleaned up). The only other user of this function is -sccp, but I didn't read that thoroughly enough to figure out whether it might be holding pointers to instructions that could be deleted by this. llvm-svn: 131855	2011-05-22 16:24:18 +00:00
Chris Lattner	408cfef6f0	I missed a checking with my GVN change. llvm-svn: 131851	2011-05-22 07:20:02 +00:00
Chris Lattner	1a1acc2191	fix PR9856, an incorrectly conservative assertion: a global can be "stored once" even if its address is compared. llvm-svn: 131849	2011-05-22 07:15:13 +00:00
Chris Lattner	f0d59072de	fix PR9841 by having GVN not process dead loads. This was causing it to get into infinite loops when it would widen a load (which can necessarily leave around dead loads). llvm-svn: 131847	2011-05-22 07:03:34 +00:00
Chris Lattner	a10327f531	remove a trivial test, make some other tests less trivial. llvm-svn: 131846	2011-05-22 07:02:43 +00:00
Chris Lattner	cc87723178	make this test less trivial. llvm-svn: 131845	2011-05-22 06:59:33 +00:00
Nick Lewycky	d60e135cfe	Commit test change, forgotten as part of r131838. llvm-svn: 131839	2011-05-22 05:31:47 +00:00
Nick Lewycky	a68ec83b36	Teach the inliner to emit llvm.lifetime.start/end, to scope the local variables of the inlinee to the code representing the original function. llvm-svn: 131838	2011-05-22 05:22:10 +00:00
Nick Lewycky	1c8af13719	Fix grammar in test. llvm-svn: 131831	2011-05-22 01:16:00 +00:00
Duncan Sands	5ec65765e6	Revert commit 131781, to see if it fixes the x86-64 dragonegg buildbot. Original log message: When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131809	2011-05-21 20:54:46 +00:00
Benjamin Kramer	2fd48f2730	Implement mulo x, 2 -> addo x, x in DAGCombiner. llvm-svn: 131800	2011-05-21 18:31:55 +00:00
Benjamin Kramer	e08fb1dce9	Merge and FileCheckize test cases. llvm-svn: 131799	2011-05-21 18:31:48 +00:00
Benjamin Kramer	fda5dc4968	Revert "InstCombine: Turn mul.with.overflow(X, 2) into the cheaper add.with.overflow(X, X)" It's better to do this in codegen, mul.with.overflow(X, 2) is more canonical because it has only one use on "X". llvm-svn: 131798	2011-05-21 18:31:42 +00:00
Benjamin Kramer	691731eb9c	InstCombine: Turn mul.with.overflow(X, 2) into the cheaper add.with.overflow(X, X) llvm-svn: 131789	2011-05-21 09:22:06 +00:00
Dan Gohman	8b20187c82	When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131781	2011-05-21 01:05:08 +00:00
Eli Friedman	60afcc2a6f	Add fast-isel support for byval calls on x86. llvm-svn: 131764	2011-05-20 22:21:04 +00:00
Rafael Espindola	652bfdb1ab	adds some attributes to attribute section when cpu is "xscale" (this is what used in Android NDK, when architecture is ARMv5) patch by Koan-Sin Tan llvm-svn: 131751	2011-05-20 20:10:34 +00:00
Rafael Espindola	1866808384	fixes target address tBL and tBLX and sets relocation type of tBL/tBLX to R_ARM_THM_CALL (ARM ELF 4.7.1.6) Patch by koan-sin tan. llvm-svn: 131748	2011-05-20 20:01:01 +00:00
Stuart Hastings	91f1d24736	Re-commit 131641 with fixes; de-pseudoize MOVSX16rr8 and friends. rdar://problem/8614450 llvm-svn: 131746	2011-05-20 19:04:40 +00:00
Akira Hatanaka	43407fe633	Make $fp and $ra callee-saved registers and let PrologEpilogInserter handle saving and restoring them. llvm-svn: 131745	2011-05-20 18:39:33 +00:00
Chad Rosier	ad00f3d0b9	Fixed regression due to commit 131709, which disables vararg tail call optimizations on Win64 llvm-svn: 131740	2011-05-20 17:49:39 +00:00
Benjamin Kramer	0bf26746d9	Rename the "sandybridge" subtarget to "corei7-avx", for GCC compatibility. llvm-svn: 131730	2011-05-20 15:11:26 +00:00
Cameron Zwarich	e0a52df6e5	Fix PR9960 by teaching SimpleRegisterCoalescing::AdjustCopiesBackFrom() to preserve the phikill flag. llvm-svn: 131717	2011-05-20 03:54:04 +00:00
Akira Hatanaka	fe4f9d5977	Fix bug in which nodes that write to argument registers do not get glued with the JALR node. Patch by Sasa Stankovic llvm-svn: 131714	2011-05-20 02:30:51 +00:00
Chad Rosier	552f8c4819	Don't attempt to tail call optimize for Win64. llvm-svn: 131709	2011-05-20 00:59:28 +00:00
Evan Cheng	e8d2e9eb35	Revert r131664 and fix it in instcombine instead. rdar://9467055 llvm-svn: 131708	2011-05-20 00:54:37 +00:00
Eli Friedman	22da799428	Add fast-isel support for zeroext and signext ret instructions on x86. llvm-svn: 131689	2011-05-19 22:16:13 +00:00
Rafael Espindola	1edfb17bc2	Looks like OS X assemblers (including MC) don't like foo: bar = foo .quad bar Avoid producing it. Fixes PR9951. llvm-svn: 131687	2011-05-19 22:05:56 +00:00
Eric Christopher	4014e5e208	Oddly people want to use the 'r' constraint for fp constants on x86. Fixes rdar://9218925 Fixes PR9601 llvm-svn: 131682	2011-05-19 21:33:47 +00:00
Eli Friedman	e53a77d3a6	Fix up this test to use explicit triples (Win64 passes a different number of arguments in registers). llvm-svn: 131676	2011-05-19 21:13:08 +00:00
Jason W Kim	d0c937d4b2	This fixes one divergence between LLVM and binutils for ARM in the text section. Assume the following bit of annotated assembly: .section .data.rel.ro,"aw",%progbits .align 2 .LAlpha: .long startval(GOTOFF) .text .align 2 .type main,%function .align 4 main: ;;; assume "main" starts at offset 0x20 0x0 push {r11, lr} 0x4 movw r0, :lower16:(.LAlpha-(.LBeta+8)) ;;; ==> (.AddrOf(.LAlpha) - ((.AddrOf(.LBeta) - .AddrOf(".")) + 8) ;;; ==> (??? - ((16-4) + 8) = -20 0x8 movt r0, :upper16:(.LAlpha-(.LBeta+8)) ;;; ==> (.AddrOf(.LAlpha) - ((.AddrOf(.LBeta) - .AddrOf(".")) + 8) ;;; ==> (??? - ((16-8) + 8) = -16 0xc ... blah .LBeta: 0x10 add r0, pc, r0 0x14 ... blah .LGamma: 0x18 add r1, pc, r1 Above snippet results in the following relocs in the .o file for the first pair of movw/movt instructions 00000024 R_ARM_MOVW_PREL_NC .LAlpha 00000028 R_ARM_MOVT_PREL .LAlpha And the encoded instructions in the .o file for main: must be 00000020 <main>: 20: e92d4800 push {fp, lr} 24: e30f0fec movw r0, #65516 ; 0xffec i.e. -20 28: e34f0ff0 movt r0, #65520 ; 0xfff0 i.e. -16 However, llc (prior to this commit) generates the following sequence 00000020 <main>: 20: e92d4800 push {fp, lr} 24: e30f0fec movw r0, #65516 ; 0xffec - i.e. -20 28: e34f0fff movt r0, #65535 ; 0xffff - i.e. -1 What has to happen in the ArmAsmBackend is that if the relocation is PC relative, the 16 bits encoded as part of movw and movt must be both addends, not addresses. It makes sense to encode addresses by right shifting the value by 16, but the result is incorrect for PIC. i.e., the right shift by 16 for movt is ONLY valid for the NON-PCRel case. This change agrees with what GNU as does, and makes the PIC code run. MC/ARM/elf-movt.s covers this case. llvm-svn: 131674	2011-05-19 20:55:25 +00:00
Rafael Espindola	0fc5e89c82	ADD64ri32 sign extends its argument, so we need to use a R_X86_64_32S. Fixes PR9934. We really need to start tblgening the relocation info :-( llvm-svn: 131669	2011-05-19 20:32:34 +00:00
Akira Hatanaka	9e6a8cca5d	Align i64 arguments to 64 bit boundaries. llvm-svn: 131668	2011-05-19 20:29:48 +00:00
Evan Cheng	2b9bd38678	crc32 with 64-bit output zeros upper 32-bits. rdar://9467055 llvm-svn: 131664	2011-05-19 18:57:12 +00:00
Stuart Hastings	ae012a7525	Move test to Transforms/InstCombine. llvm-svn: 131634	2011-05-19 05:53:22 +00:00
Rafael Espindola	3f60a0b411	Add test for PR9946. llvm-svn: 131621	2011-05-19 02:35:26 +00:00
Eli Friedman	41e509a33d	More instcombine cleanup, towards improving debug line info. llvm-svn: 131604	2011-05-18 23:58:37 +00:00
Tanya Lattner	1d11720ae4	Handle perfect shuffle case that generates a vrev for vectors of floats. Add test case. llvm-svn: 131582	2011-05-18 21:44:54 +00:00
Dan Gohman	3268e4d692	When forming an ICmpZero LSRUse, normalize the non-IV operand of the comparison, so that the resulting expression is fully normalized. This fixes PR9939. llvm-svn: 131576	2011-05-18 21:02:18 +00:00
Johnny Chen	071634612d	Disassembly of tBcc was wrongly adding 4 to the SignExtend'ed imm8:'0' immediate operand. llvm-svn: 131565	2011-05-18 20:32:41 +00:00
Chad Rosier	f4e832b14e	Enables vararg functions that pass all arguments via registers to be optimized into tail-calls when possible. llvm-svn: 131560	2011-05-18 19:59:50 +00:00
Eli Friedman	49346010f8	More instcombine cleanup aimed towards improving debug line info. llvm-svn: 131559	2011-05-18 19:57:14 +00:00
Stuart Hastings	51d696766c	An imminent fix to the x86_64 byval logic will expose a flaw in the x86_64 sibcall logic. I've filed PR9943 for the sibcall problem, and this patch alters the testcase to work around the flaw. When PR9943 is fixed, this patch should be reverted. llvm-svn: 131557	2011-05-18 19:19:17 +00:00
Eli Friedman	3f46c3e702	Force a triple on a couple of tests; we don't support fast-isel of ret on Win64. llvm-svn: 131540	2011-05-18 17:16:37 +00:00
Stuart Hastings	38849debb5	Merge pmovzx test case into existing file. llvm-svn: 131539	2011-05-18 17:02:04 +00:00
Justin Holewinski	bbdcd17d44	PTX: add flag to disable mad/fma selection Patch by Dan Bailey llvm-svn: 131537	2011-05-18 15:42:23 +00:00
Duncan Sands	7f64656d21	Tighten up checking of the validity of casts. (1) The IR parser would happily accept things like "sext <2 x i32> to <999 x i64>". It would also accept "sext <2 x i32> to i64", though the verifier would catch that later. Fixed by having castIsValid check that vector lengths match except when doing a bitcast. (2) When creating a cast instruction, check that the cast is valid (this was already done when creating constexpr casts). While there, replace getScalarSizeInBits (used to allow more vector casts) with getPrimitiveSizeInBits in getCastOpcode and isCastable since vector to vector casts are now handled explicitly by passing to the element types; i.e. this bit should result in no functional change. llvm-svn: 131532	2011-05-18 09:21:57 +00:00
Tanya Lattner	48b182c3a4	In r131488 I misunderstood how VREV works. It splits the vector in half and splits each half. Therefore, the real problem was that we were using a VREV64 for a 4xi16, when we should have been using a VREV32. Updated test case and reverted change to the PerfectShuffle Table. llvm-svn: 131529	2011-05-18 06:42:21 +00:00
Eli Friedman	96254a0d53	Start trying to make InstCombine preserve more debug info. The idea here is to set the debug location on the IRBuilder, which will be then right location in most cases. This should magically give many transformations debug locations, and fixing places which are missing a debug location will usually just means changing the code creating it to use the IRBuilder. As an example, the change to InstCombineCalls catches a common case where a call to a bitcast of a function is rewritten. Chris, does this approach look reasonable? llvm-svn: 131516	2011-05-18 01:28:27 +00:00
Eli Friedman	7d7ad8374f	Make some of the fast-isel tests actually test fast-isel (and fix test failures). llvm-svn: 131510	2011-05-18 00:00:10 +00:00
Stuart Hastings	5bd18b6638	X86 pmovsx/pmovzx ignore the upper half of their inputs. rdar://problem/6945110 llvm-svn: 131493	2011-05-17 22:13:31 +00:00
Tanya Lattner	c7e291b354	vrev is incorrectly defined in the perfect shuffle table. The ordering is backwards (should be 0x3210 versus 0x1032) which exposed a bug when doing a shuffle on a 4xi16. I've attached a test case. llvm-svn: 131488	2011-05-17 20:48:40 +00:00
Galina Kistanova	dd45646a47	Move test for appropriate directory. llvm-svn: 131477	2011-05-17 19:06:43 +00:00
Eli Friedman	7b27942fe7	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. This is r131438 with a couple small fixes. llvm-svn: 131474	2011-05-17 18:29:03 +00:00
Stuart Hastings	a7ae4552af	Drop lli, revise test. llvm-svn: 131452	2011-05-17 02:38:59 +00:00
Eli Friedman	7335e8a720	Back out r131444 and r131438; they're breaking nightly tests. I'll look into it more tomorrow. llvm-svn: 131451	2011-05-17 02:36:59 +00:00
Eli Friedman	e5f7f26df0	Fix test. llvm-svn: 131444	2011-05-17 00:39:14 +00:00
Evan Cheng	54459240e3	Add target triple so test doesn't fail on Windows machines. llvm-svn: 131439	2011-05-17 00:15:58 +00:00
Eli Friedman	83ba150f3a	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. llvm-svn: 131438	2011-05-17 00:13:47 +00:00
Jakob Stoklund Olesen	4edf17d91f	Teach LiveInterval::isZeroLength about null SlotIndexes. When instructions are deleted, they leave tombstone SlotIndex entries. The isZeroLength method should ignore these null indexes. This causes RABasic to sometimes spill a callee-saved register in the abi-isel.ll test, so don't run that test with -regalloc=basic. Prioritizing register allocation according to spill weight can cause more registers to be used. llvm-svn: 131436	2011-05-16 23:50:05 +00:00
Eli Friedman	d4a3609d30	Remove dead code. Fix associated test to use FileCheck. llvm-svn: 131424	2011-05-16 21:28:22 +00:00
Eli Friedman	a4d4a0162d	Make fast-isel work correctly s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Eli Friedman	9ac944774f	Basic fast-isel of extractvalue. Not too helpful on its own, given the IR clang generates for cases like this, but it should become more useful soon. llvm-svn: 131417	2011-05-16 20:27:46 +00:00
Rafael Espindola	e90c1cb221	sets bit 0 of the function address of thumb function in .symtab ("T is 1 if the target symbol S has type STT_FUNC and the symbol addresses a Thumb instruction ;it is 0 otherwise." from "ELF for the ARM Architecture" 4.7.1.2) Patch by Koan-Sin Tan! llvm-svn: 131406	2011-05-16 16:17:21 +00:00
Rafael Espindola	2050af838d	Don't do tail calls in a function that call setjmp. The stack might be corrupted when setjmp returns again. llvm-svn: 131399	2011-05-16 03:05:33 +00:00
Benjamin Kramer	cb7e56e592	Disable test harder. llvm-svn: 131363	2011-05-14 19:30:39 +00:00
Stuart Hastings	3c2fd1cf62	Disable this test while I revise it. rdar://problem/9267970 llvm-svn: 131350	2011-05-14 18:39:05 +00:00
Benjamin Kramer	d96205c4e5	SimplifyCFG: Use ComputeMaskedBits to prune dead cases from switch instructions. llvm-svn: 131345	2011-05-14 15:57:25 +00:00
Stuart Hastings	66a82b966e	Avoid combining GEPs that might overflow at runtime. rdar://problem/9267970 Patch by Julien Lerouge! llvm-svn: 131339	2011-05-14 05:55:10 +00:00
Rafael Espindola	df9db7ed92	Don't produce a vmovntdq if we don't have AVX support. llvm-svn: 131330	2011-05-14 00:30:01 +00:00
Rafael Espindola	8255d1e223	Move test. llvm-svn: 131315	2011-05-13 21:35:17 +00:00
Rafael Espindola	a9289bedf8	Move test. llvm-svn: 131314	2011-05-13 21:33:32 +00:00
Galina Kistanova	60e17fe806	Move platform-dependent test to appropriate directory. llvm-svn: 131302	2011-05-13 19:45:05 +00:00
Rafael Espindola	e53b7d1a11	Make codegen able to handle values of empty types. This is one way to fix PR9900. I will keep it open until sable is able to comment on it. llvm-svn: 131294	2011-05-13 15:18:06 +00:00
Stuart Hastings	aa02c0847d	Since I can't reproduce the failures from 131261, re-trying with a simplified version. <rdar://problem/9298790> llvm-svn: 131274	2011-05-13 00:51:54 +00:00
Stuart Hastings	8d57d8ea64	Revert 131266 and 131261 due to buildbot complaints. rdar://problem/9298790 llvm-svn: 131269	2011-05-13 00:15:17 +00:00
Stuart Hastings	ef4940254f	Tweak 131261 (thumb2-cbnz.ll) to generate the intended cbnz. rdar://problem/9298790 llvm-svn: 131266	2011-05-13 00:10:03 +00:00
Stuart Hastings	89f1b47e3a	Non-fast-isel followup to 129634; correctly handle branches controlled by non-CMP expressions. The executable test case (129821) would test this as well, if we had an "-O0 -disable-arm-fast-isel" LLVM-GCC tester. Alas, the ARM assembly would be very difficult to check with FileCheck. The thumb2-cbnz.ll test is affected; it generates larger code (tst.w vs. cmp #0), but I believe the new version is correct. rdar://problem/9298790 llvm-svn: 131261	2011-05-12 23:36:41 +00:00
Galina Kistanova	9e56e51fab	Correction. Use explicit target triple in the test. llvm-svn: 131252	2011-05-12 21:55:34 +00:00
Evan Cheng	43054e6159	Re-enable branchfolding common code hoisting optimization. Fixed a liveness test bug and also taught it to update liveins. llvm-svn: 131241	2011-05-12 20:30:01 +00:00
Stuart Hastings	114ecbd0f4	Move this test to CodeGen/Thumb. rdar://problem/9416774 llvm-svn: 131196	2011-05-11 19:41:28 +00:00
Devang Patel	34a6620748	Identify end of prologue (and beginning of function body) using DW_LNS_set_prologue_end line table opcode. llvm-svn: 131194	2011-05-11 19:22:19 +00:00
Stuart Hastings	c7c465c573	Reduced test case. rdar://problem/9416774 llvm-svn: 131191	2011-05-11 17:29:25 +00:00
Owen Anderson	b745623b71	Fix encoding of Thumb BLX register instructions. Patch by Koan-Sin Tan. llvm-svn: 131189	2011-05-11 17:00:48 +00:00
Stuart Hastings	e1d075f2aa	And lo, I was given a testcase for 131152. rdar://problem/9416774 llvm-svn: 131184	2011-05-11 16:00:21 +00:00
Nadav Rotem	8a7beb80f0	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Nadav Rotem	8f971c27fb	Add custom lowering of X86 vector SRA/SRL/SHL when the shift amount is a splat vector. llvm-svn: 131179	2011-05-11 08:12:09 +00:00
Rafael Espindola	2a09d65979	Revert 131172 as it is causing clang to miscompile itself. I will try to provide a reduced testcase. llvm-svn: 131176	2011-05-11 03:27:17 +00:00
Evan Cheng	05fc35e275	Add a late optimization to BranchFolding that hoist common instruction sequences at the start of basic blocks to their common predecessor. It's actually quite common (e.g. about 50 times in JM/lencod) and has shown to be a nice code size benefit. e.g. pushq %rax testl %edi, %edi jne LBB0_2 ## BB#1: xorb %al, %al popq %rdx ret LBB0_2: xorb %al, %al callq _foo popq %rdx ret => pushq %rax xorb %al, %al testl %edi, %edi je LBB0_2 ## BB#1: callq _foo LBB0_2: popq %rdx ret rdar://9145558 llvm-svn: 131172	2011-05-11 01:03:01 +00:00
Rafael Espindola	14e1b58405	Add triple. llvm-svn: 131169	2011-05-10 23:14:29 +00:00
Rafael Espindola	19c1a56287	Produce a __debug_frame section on darwin ARM when appropriate. llvm-svn: 131151	2011-05-10 21:04:45 +00:00
Rafael Espindola	99f6735532	On MachO, unlike ELF, there should be no relocation to produce the CIE pointer. llvm-svn: 131149	2011-05-10 20:59:42 +00:00
Rafael Espindola	d0d2354258	The EH symbols are only needed in eh_frame, not debug_frame. llvm-svn: 131146	2011-05-10 19:51:53 +00:00
Rafael Espindola	fdc3e6fab6	Use .cfi_sections to put the unwind info in .debug_frame when possible. With this clang will use .debug_frame in, for example, clang -g -c -m32 test.c This matches gcc's behaviour. It looks like .debug_frame is a bit bigger than .eh_frame, but has the big advantage of not being allocated. llvm-svn: 131140	2011-05-10 18:39:09 +00:00
Rafael Espindola	27390b4a0e	In a debug_frame the cfi offset is to the start of the debug_frame section! llvm-svn: 131129	2011-05-10 15:20:23 +00:00
Justin Holewinski	3c0447259c	PTX: add test cases for cvt, fneg, and selp Patch by Dan Bailey llvm-svn: 131128	2011-05-10 14:53:13 +00:00
Rafael Espindola	1ecb12fc57	Add support for producing .deubg_frame sections. llvm-svn: 131121	2011-05-10 03:54:12 +00:00
Benjamin Kramer	d724a590e5	X86: Add a bunch of peeps for add and sub of SETB. "b + ((a < b) ? 1 : 0)" compiles into cmpl %esi, %edi adcl $0, %esi instead of cmpl %esi, %edi sbbl %eax, %eax andl $1, %eax addl %esi, %eax This saves a register, a false dependency on %eax (Intel's CPUs still don't ignore it) and it's shorter. llvm-svn: 131070	2011-05-08 18:36:07 +00:00
Duncan Sands	af32728a57	The comparision "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Jakob Stoklund Olesen	a5c889982a	Emit a proper error message when register allocators run out of registers. This can't be just an assertion, users can always write impossible inline assembly. Such an assembly statement should be included in the error message. llvm-svn: 131024	2011-05-06 21:58:30 +00:00
Nick Lewycky	64c9284411	It's valid to take the blockaddress of a different function, so remove this assert in the bitcode writer. No change needed because the ValueEnumerator holds a whole-module numbering anyhow. Fixes PR9857! llvm-svn: 131016	2011-05-06 21:09:44 +00:00
Galina Kistanova	a335f5aeeb	Move few target-dependant tests to appropriate directories. llvm-svn: 131002	2011-05-06 18:24:46 +00:00
Rafael Espindola	20ce0c0ce0	Pass -disable-cfi to llc. llvm-svn: 130999	2011-05-06 18:01:58 +00:00
Rafael Espindola	ac893d6898	Pass -disable-cfi. llvm-svn: 130995	2011-05-06 17:44:58 +00:00
Justin Holewinski	11d70b6b32	PTX: add PTX 2.3 language target Patch by Wei-Ren Chen llvm-svn: 130980	2011-05-06 11:40:36 +00:00
Duncan Sands	a071c82900	Fix PR9820: a read-only call differs from a load in that a load doesn't return the pointer being dereferenced, it returns the pointee, but a call might return the pointer itself. llvm-svn: 130979	2011-05-06 10:30:37 +00:00
Eli Friedman	5401962643	Re-revert r130877; it's apparently causing a regression on 197.parser, possibly related to cbnz formation. llvm-svn: 130977	2011-05-06 05:23:07 +00:00
Eli Friedman	8a20e66926	PR9838: Fix transform introduced in r127064 to not trigger when only one side of the icmp is an exact shift. llvm-svn: 130954	2011-05-05 21:59:18 +00:00
Rafael Espindola	a4982bddf3	Don't produce a __debug_frame. I tested both gdb on a bootstrapped clang and and the gdb testsuite on OS X (snow leopard) and both are happy using __eh_frame. llvm-svn: 130937	2011-05-05 18:43:39 +00:00
Galina Kistanova	b93a130120	Many LLVM tests relies on standard output stream be in the binary mode. Which is not always the case (on Windows in particular). The patch adds a test to verify that the standard output stream is actually in the binary mode. llvm-svn: 130936	2011-05-05 18:40:27 +00:00
Eli Friedman	441a01a2b8	Avoid extra vreg copies for arguments passed in registers. Specifically, this can make MachineCSE more effective in some cases (especially in small functions). PR8361 / part of rdar://problem/8259436 . llvm-svn: 130928	2011-05-05 16:53:34 +00:00
Jakob Stoklund Olesen	f118fae233	Fix test to be less sensitive to coalescing. This should unbreak llvm-gcc-i386-linux-selfhost. llvm-svn: 130927	2011-05-05 16:48:00 +00:00
Jakob Stoklund Olesen	17d4f9bbcc	Prepare remaining tests for -join-physreg going away. llvm-svn: 130893	2011-05-04 23:54:59 +00:00
Jakob Stoklund Olesen	369bddf5ad	Fix a batch of x86 tests to be coalescer independent. Most of these tests require a single mov instruction that can come either before or after a 2-addr instruction. -join-physregs changes the behavior, but the results are equivalent. llvm-svn: 130891	2011-05-04 23:54:51 +00:00
Dan Gohman	dd550305e6	Give this test an explicit register allocator, so that it can work even if the default register allocator is changed. llvm-svn: 130883	2011-05-04 23:14:02 +00:00
Bill Wendling	2a40131f6b	SjLj EH could produce a machine basic block that legitimately has more than one landing pad as its successor. SjLj exception handling jumps to the correct landing pad via a switch statement that's generated right before code-gen. Loosen the constraint in the machine instruction verifier to allow for this. Note, this isn't the most rigorous check since we cannot determine where that switch statement came from. But it's marginally better than turning this check off when SjLj exceptions are used. <rdar://problem/9187612> llvm-svn: 130881	2011-05-04 22:54:05 +00:00
Eli Friedman	0fe4608af2	Re-commit r130862 with a minor change to avoid an iterator running off the edge in some cases. Original message: Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130877	2011-05-04 22:10:36 +00:00
Galina Kistanova	e53ae508ec	This test fails on ARM. The test shouldn't explicitly specify alignment (and alignment 4 is wrong) and requires hard-float. llvm-svn: 130875	2011-05-04 21:57:44 +00:00
Eli Friedman	3bd79ba856	Back out r130862; it appears to be breaking bootstrap. llvm-svn: 130867	2011-05-04 20:48:42 +00:00
Eli Friedman	a16fc2fec0	Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130862	2011-05-04 19:54:24 +00:00
Jakob Stoklund Olesen	28a93a49bb	Fix more register and coalescing dependencies. llvm-svn: 130859	2011-05-04 19:02:11 +00:00
Jakob Stoklund Olesen	d7fd7bfc31	Explicitly request physreg coalesing for a bunch of Thumb2 unit tests. These tests all follow the same pattern: mov r2, r0 movs r0, #0 $CMP r2, r1 it eq moveq r0, #1 bx lr The first 'mov' can be eliminated by rematerializing 'movs r0, #0' below the test instruction: $CMP r0, r1 mov.w r0, #0 it eq moveq r0, #1 bx lr So far, only physreg coalescing can do that. The register allocators won't yet split live ranges just to eliminate copies. They can learn, but this particular problem is not likely to show up in real code. It only appears because r0 is used for both the function argument and return value. llvm-svn: 130858	2011-05-04 19:02:07 +00:00
Jakob Stoklund Olesen	e7528c45ea	FileCheckize and break dependence on coalescing order. llvm-svn: 130856	2011-05-04 19:02:01 +00:00
Jakob Stoklund Olesen	067ba3c23c	Explicitly request -join-physregs for some tests that depend on it. llvm-svn: 130855	2011-05-04 19:01:59 +00:00
Devang Patel	39ecf816c5	Do not emit location expression size twice. llvm-svn: 130854	2011-05-04 19:00:57 +00:00
Akira Hatanaka	3bace5d223	Remove LLVM IR metadata in test case committed in r130847. llvm-svn: 130849	2011-05-04 18:28:36 +00:00
Akira Hatanaka	23e8ecf125	Prevent instructions using $gp from being placed between a jalr and the instruction that restores the clobbered $gp. llvm-svn: 130847	2011-05-04 17:54:27 +00:00

... 2 3 4 5 6 ...

13265 Commits