llvm-project

Commit Graph

Author	SHA1	Message	Date
Yiannis Tsiouris	dbb4adf134	Add a GC plugin for Erlang llvm-svn: 177867	2013-03-25 13:47:46 +00:00
Owen Anderson	c81616b0a9	Remove the type legality check from the SelectionDAGBuilder when it lowers @llvm.fmuladd to ISD::FMA nodes. Performing this check unilaterally prevented us from generating FMAs when the incoming IR contained illegal vector types which would eventually be legalized to underlying types that did support FMA. For example, an @llvm.fmuladd on an OpenCL float16 should become a sequence of float4 FMAs, not float4 fmul+fadd's. NOTE: Because we still call the target-specific profitability hook, individual targets can reinstate the old behavior, if desired, by simply performing the legality check inside their callback hook. They can also perform more sophisticated legality checks, if, for example, some illegal vector types can be productively implemented as FMAs, but not others. llvm-svn: 177820	2013-03-23 08:26:53 +00:00
David Blaikie	30ce0788e7	Refactor out the DIFile parameter to DILexicalBlock to refer to the raw file/directory pair llvm-svn: 177742	2013-03-22 17:33:20 +00:00
David Blaikie	f333dc9571	Reorder the DIFile field in DILexicalBlock to become a prefix common with other DIScopes llvm-svn: 177703	2013-03-22 05:47:44 +00:00
David Blaikie	0d7d62e4b2	Move the DIFile in DISubprogram to the beginning to be a common prefix along with other DIScopes llvm-svn: 177674	2013-03-21 22:29:36 +00:00
David Blaikie	cc8d090163	Remove unused field in DISubprogram llvm-svn: 177661	2013-03-21 20:28:52 +00:00
David Blaikie	efb0d65ed7	Debug info: refactor the first field of DICompileUnit to be a raw file/directory pair This removes the DICompileUnit special case from DIScope. llvm-svn: 177610	2013-03-20 23:58:12 +00:00
David Blaikie	3b88852a2d	Debug Info: Swap the 2nd and 3rd parameters to DICompileUnit to match the common DIScope prefix llvm-svn: 177595	2013-03-20 22:52:54 +00:00
David Blaikie	43a729d165	Remove unused field in DICompileUnit llvm-svn: 177590	2013-03-20 22:34:33 +00:00
Michael Liao	0f4ea0c4a9	Fix PR15296 - Move SRA/SRL/SHL lowering support from DAG combination to DAG lowering to support extended 256-bit integer in AVX but not AVX2. llvm-svn: 177478	2013-03-20 02:33:21 +00:00
David Blaikie	200b6ed80f	Refactor the DIFile (2nd) parameter to DITypes to be an MDNode reference to a raw directory/file pair This makes DIType's first non-tag parameter the same as DIFile's, allowing them to both share the common implementation of getFilename/getDirectory in DIScope. llvm-svn: 177467	2013-03-20 00:26:26 +00:00
Justin Holewinski	d068943809	Propagate DAG node ordering during type legalization and instruction selection A node's ordering is only propagated during legalization if (a) the new node does not have an ordering (is not a CSE'd node), or (b) the new node has an ordering that is higher than the node being legalized. llvm-svn: 177465	2013-03-20 00:10:32 +00:00
David Blaikie	abec74b38e	Move the DIFile operand to DITypes from the 4th operand to the 2nd. This is another step along the way to making all DIScopes have a common prefix which can be added to in a general manner to support using directives (DW_TAG_imported_module). llvm-svn: 177462	2013-03-19 23:25:22 +00:00
Nadav Rotem	0f1bc60d51	Optimize sext <4 x i8> and <4 x i16> to <4 x i64>. Patch by Ahmad, Muhammad T <muhammad.t.ahmad@intel.com> llvm-svn: 177421	2013-03-19 18:38:27 +00:00
David Blaikie	3053029310	Remove unnecessary leading comment characters in lit-only file llvm-svn: 177327	2013-03-18 22:08:16 +00:00
David Blaikie	99067af791	Include '.test' suffix in target specific lit configs that need it Apparently my final cleanup to use a relevant suffix for these tests before committing r176831 caused them to stop running since lit wasn't configured to run tests with that suffix in those directories (why don't we just have a global suffix list?). So, add the suffix to the relevant directories & fix the test that has bitrotted over the last week due to my debug info schema changes. llvm-svn: 177315	2013-03-18 20:31:44 +00:00
David Blaikie	8fb8224578	Split out filename & directory from DIFile to start generalizing over DIScopes This is the first step to making all DIScopes have a common metadata prefix (so that things (using directives, for example) that can appear in any scope can be added to that common prefix). DIFile is itself a DIScope so the common prefix of all DIScopes cannot be a DIFile - instead it's the raw filename/directory name pair. llvm-svn: 177239	2013-03-17 21:13:55 +00:00
Craig Topper	612f7bfa4d	Add X86 code emitter support AVX encoded MRMDestReg instructions. Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao. llvm-svn: 177221	2013-03-16 03:44:31 +00:00
Nadav Rotem	4a4827ce21	Add a triple to the test. llvm-svn: 177131	2013-03-15 00:10:23 +00:00
Nadav Rotem	adfa5eaf8c	Unaligned loads should use the VMOVUPS opcode. llvm-svn: 177130	2013-03-14 23:49:44 +00:00
Chad Rosier	4b54f594b4	[fast-isel] The X86FastISel::FastLowerArguments function doesn't properly handle the win64 calling convention. rdar://13423768 llvm-svn: 177113	2013-03-14 21:25:04 +00:00
Michael Liao	20d287044c	Fix PR15309 - Fix the typo on type checking llvm-svn: 177010	2013-03-14 06:57:42 +00:00
David Blaikie	aabfe4f997	Simplify file/directory name handling in DILexicalBlock llvm-svn: 176993	2013-03-13 22:52:59 +00:00
David Blaikie	0d221159a0	Remove the unused 4th operand for DIFile debug info metadata llvm-svn: 176983	2013-03-13 22:05:21 +00:00
David Blaikie	1ca2f36289	Refactor filename/directory in DICompileUnit into a DIFile This is the next step towards making the metadata for DIScopes have a common prefix rather than having to delegate based on their tag type. llvm-svn: 176913	2013-03-13 00:01:35 +00:00
David Blaikie	452c3ff649	Remove unused "isMain" field from DICompileUnit llvm-svn: 176910	2013-03-12 22:43:04 +00:00
David Blaikie	a4f770d51c	Update debug info test cases with empty SplitDebugFilename field. This could be 'null' or the empty string, DIDescriptor::getStringField coalesces the two cases anyway so it's just a matter of legible/efficient representation. The change in behavior of the DICompileUnit::get* functions could be subsumed by the full verification check - but ideally that should just be an assertion if we could front-load the actual debug info metadata failure paths. llvm-svn: 176907	2013-03-12 22:25:36 +00:00
Jan Wen Voung	6dc3076080	Revert the test moves from 176733. Use "REQUIRES: asserts" instead. llvm-svn: 176873	2013-03-12 16:27:52 +00:00
David Blaikie	47922fb006	Upgrading debug info test cases to be (more) compatible with the current debug info format. These cases were found by further work to remove support for debug info versioning. Common cleanups (other than changing the version info in the tag field) included adding the last parameter to compile_units (recently added for fission support) and other cases of trailing fields in lexical blocks, compile units, and subprograms. llvm-svn: 176834	2013-03-11 22:37:40 +00:00
David Blaikie	789beb5300	Remove duplicate test contents. llvm-svn: 176831	2013-03-11 22:10:14 +00:00
Nick Lewycky	48beb21185	Fix a crasher newly introduced in r176659/r176649, where fast-isel tries to lower an expect intrinsic that is a constant expression. llvm-svn: 176830	2013-03-11 21:44:37 +00:00
NAKAMURA Takumi	a60c7a0f4b	llvm/test/CodeGen/X86/handle-move.ll: Mark it as XFAIL:cygming. Investigating. llvm-svn: 176808	2013-03-11 16:30:26 +00:00
NAKAMURA Takumi	1e02e73c30	Suppress atomic(32\|64).ll as XFAIL on win32 codegen. Investigating. llvm-svn: 176798	2013-03-11 08:39:48 +00:00
Benjamin Kramer	01b75cc0f2	Test case hygiene. llvm-svn: 176772	2013-03-09 18:25:40 +00:00
Jan Wen Voung	7857a64909	Disable statistics on Release builds and move tests that depend on -stats. Summary: Statistics are still available in Release+Asserts (any +Asserts builds), and stats can also be turned on with LLVM_ENABLE_STATS. Move some of the FastISel stats that were moved under DEBUG() back out of DEBUG(), since stats are disabled across the board now. Many tests depend on grepping "-stats" output. Move those into a orig_dir/Stats/. so that they can be marked as unsupported when building without statistics. Differential Revision: http://llvm-reviews.chandlerc.com/D486 llvm-svn: 176733	2013-03-08 22:56:31 +00:00
Jakob Stoklund Olesen	8d1aaf21cf	Rewrite the physreg part of findLastUseBefore(). To find the last use of a register unit, start from the bottom and scan upwards until a user is found. <rdar://problem/13353090> llvm-svn: 176706	2013-03-08 18:08:57 +00:00
David Blaikie	e7838a4c53	Another test fix for r176671. llvm-svn: 176679	2013-03-08 02:27:40 +00:00
David Blaikie	d17cfedf8b	Couple of test fixes for r176671. Not sure why these aren't failing on my linux machine, but this should cover it. llvm-svn: 176678	2013-03-08 02:26:16 +00:00
David Blaikie	e5a2f704a4	Upgrade tests to the latest debug info format. Mostly this is just changing the named metadata (llvm.dbg.sp, llvm.dbg.gv, llvm.dbg.<func>.lv, etc -> llvm.dbg.cu), adding a few fields to older records (DIVariable: flags/inlined-at, DICompileUnit: sp/gv/types, DISubprogram: local variables list) The tests to update were discovered by a change I'm working on to remove debug info version support - so any tests using old debug info versions I haven't updated probably are bad tests or just not actually designed to test debug info. llvm-svn: 176671	2013-03-08 00:23:31 +00:00
Benjamin Kramer	5e5fd6bb4f	Move testcase, this is testing extraction not inserting. llvm-svn: 176635	2013-03-07 18:51:02 +00:00
Benjamin Kramer	2c3d0df8ee	X86: Fold EXTRACT_SUBVECTORs of a BUILD_VECTOR into a smaller BUILD_VECTOR. That can usually be lowered efficiently and is common in sandybridge code. It would be nice to do this in DAGCombiner but we can't insert arbitrary BUILD_VECTORs this late. Fixes PR15462. llvm-svn: 176634	2013-03-07 18:48:40 +00:00
Michael Liao	d5cac37dc5	Fix two remaining issue after fixing PR15355 when CMOV is not available - Phi nodes should be replaced/updated after lowering CMOV into branch because 'mainMBB' updating operand in Phi node is changed. - Add EFLAGS in livein before lowering the 2nd CMOV. It's necessary as we will reuse the EFLAGS generated before the 1st lowered CMOV, which won't clobber EFLAGS. However, we need explicitly specify that. - '-attr=-cmov' test case are added. llvm-svn: 176598	2013-03-07 01:01:29 +00:00
Michael Liao	da22b30be5	Fix PR15355 - Clear 'mayStore' flag when loading from the atomic variable before the spin loop - Clear kill flag from one use to multiple use in registers forming the address to that atomic variable - don't use a physical register as live-in register in BB (neither entry nor landing pad.) by copying it into virtual register (patch by Cameron Zwarich) llvm-svn: 176538	2013-03-06 00:17:04 +00:00
Preston Gurd	485296d1e8	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Michael Liao	d10584e38b	Add regression tests (WORKSFORME) - These tests wont't crash on trunk but would be better to add them so that they don't break again in the future. llvm-svn: 176369	2013-03-01 19:23:37 +00:00
Michael Liao	6af16fc3b7	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only return scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specificed scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	31a9bbcd4a	Add a test case for r176066. llvm-svn: 176119	2013-02-26 20:22:30 +00:00
Chad Rosier	d2686ffa56	Remove a few unused arguments. llvm-svn: 176109	2013-02-26 18:39:31 +00:00
Kostya Serebryany	cf880b9443	Unify clang/llvm attributes for asan/tsan/msan (LLVM part) These are two related changes (one in llvm, one in clang). LLVM: - rename address_safety => sanitize_address (the enum value is the same, so we preserve binary compatibility with old bitcode) - rename thread_safety => sanitize_thread - rename no_uninitialized_checks -> sanitize_memory CLANG: - add __attribute__((no_sanitize_address)) as a synonym for __attribute__((no_address_safety_analysis)) - add __attribute__((no_sanitize_thread)) - add __attribute__((no_sanitize_memory)) for S in address thread memory If -fsanitize=S is present and __attribute__((no_sanitize_S)) is not set llvm attribute sanitize_S llvm-svn: 176075	2013-02-26 06:58:09 +00:00
Michael Liao	ab97668061	Fix PR10499 - Check whether SSE is available before lowering all 1s vector building with PCMPEQD, which is only available from SSE2 llvm-svn: 176058	2013-02-25 23:01:03 +00:00
Chad Rosier	0adc042392	Remove extraneous attribute number. llvm-svn: 176053	2013-02-25 22:06:05 +00:00
Chad Rosier	a92ef4ba5b	[fast-isel] Add X86FastIsel::FastLowerArguments to handle functions with 6 or fewer scalar integer (i32 or i64) arguments. It completely eliminates the need for SDISel for trivial functions. Also, add the new llc -fast-isel-abort-args option, which is similar to -fast-isel-abort option, but for formal argument lowering. llvm-svn: 176052	2013-02-25 21:59:35 +00:00
Andrew Trick	7cf4361912	pre-RA-sched fix: only reevaluate physreg interferences when necessary. Fixes rdar:13279013: scheduler was blowing up on select instructions. llvm-svn: 176037	2013-02-25 19:11:48 +00:00
Chandler Carruth	05920b1847	Fix the root cause of PR15348 by correctly handling alignment 0 on memory intrinsics in the SDAG builder. When alignment is zero, the lang ref says that no alignment assumptions can be made. This is the exact opposite of the internal API contracts of the DAG where alignment 0 indicates that the alignment can be made to be anything desired. There is another, more explicit alignment that is better suited for the role of "no alignment at all": an alignment of 1. Map the intrinsic alignment to this early so that we don't end up generating aligned DAGs. It is really terrifying that we've never seen this before, but we suddenly started generating a large number of alignment 0 memcpys due to the new code to do memcpy-based copying of POD class members. That patch contains a bug that rounds bitfield alignments down when they are the first field. This can in turn produce zero alignments. This fixes weird crashes I've seen in library users of LLVM on 32-bit hosts, etc. llvm-svn: 176022	2013-02-25 14:20:21 +00:00
Nadav Rotem	b532fca92c	Revert r169638 because it broke Mesa llvmpipe tests. Fix PR15239. llvm-svn: 175985	2013-02-24 07:09:35 +00:00
Benjamin Kramer	ee23dcb461	X86: Disable cmov-memory patterns on subtargets without cmov. Fixes PR15115. llvm-svn: 175962	2013-02-23 10:40:58 +00:00
Peter Collingbourne	e049fd2c31	Fix test by matching movaps instead of AVX-only vmovaps llvm-svn: 175914	2013-02-22 19:53:30 +00:00
Peter Collingbourne	7b57621fb3	x86_64: designate most general purpose and SSE registers as callee save under coldcc llvm-svn: 175911	2013-02-22 19:19:44 +00:00
Pete Cooper	23e8b6b8c9	Remove unused CHECK lines copied from another test llvm-svn: 175905	2013-02-22 18:16:21 +00:00
Pete Cooper	047f81a5df	Fix isa<> check which could never be true. It was incorrectly checking a Function* being an IntrinsicInst* which isn't possible. It should always have been checking the CallInst* instead. Added test case for x86 which ensures we only get one constant load. It was 2 before this change. rdar://problem/13267920 llvm-svn: 175853	2013-02-22 01:50:38 +00:00
Benjamin Kramer	3238dc0c61	DAGCombiner: Make the post-legalize vector op optimization more aggressive. A legal BUILD_VECTOR goes in and gets constant folded into another legal BUILD_VECTOR so we don't lose any legality here. The problematic PPC optimization that made this check necessary was fixed recently. llvm-svn: 175759	2013-02-21 15:24:35 +00:00
Michael Liao	7fb39669ef	Fix PR15267 - When extloading from a vector with non-byte-addressable element, e.g. <4 x i1>, the current logic breaks. Extend the current logic to fix the case where the element type is not byte-addressable by loading all bytes, bit-extracting/packing each element. llvm-svn: 175642	2013-02-20 18:04:21 +00:00
Elena Demikhovsky	0ccdd1315b	I optimized the following patterns: sext <4 x i1> to <4 x i64> sext <4 x i8> to <4 x i64> sext <4 x i16> to <4 x i64> I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns: (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT))) The sext_in_reg (v4i32 x) may be lowered to shl+sar operations. The "sar" does not exist on 64-bit operation, so lowering sext_in_reg (v4i64 x) has no vector solution. I also added a cost of this operations to the AVX costs table. llvm-svn: 175619	2013-02-20 12:42:54 +00:00
Chad Rosier	45a52fa097	[ms-inline asm] Force the use of a base pointer if the MachineFunction includes MS-style inline assembly. This is a follow-on to r175334. Forcing a FP to be emitted doesn't ensure it will be used. Therefore, force the base pointer as well. We now treat MS inline assembly in the same way we treat functions with dynamic stack realignment and VLAs. This guarantees the BP will be used to reference parameters and locals. rdar://13218191 llvm-svn: 175576	2013-02-19 23:50:45 +00:00
Eli Bendersky	c66b7b2582	Fix typo llvm-svn: 175530	2013-02-19 17:11:48 +00:00
Benjamin Kramer	b3aa2b8497	Fix GCMetadaPrinter::finishAssembly not executed, patch by Yiannis Tsiouris. Due to the execution order of doFinalization functions, the GC information were deleted before AsmPrinter::doFinalization was executed. Thus, the GCMetadataPrinter::finishAssembly was never called. The patch fixes that by moving the code of the GCInfoDeleter::doFinalization to Printer::doFinalization. llvm-svn: 175528	2013-02-19 16:51:44 +00:00
Benjamin Kramer	53bc37ca2a	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Benjamin Kramer	fb9ea4e659	Force a cpu for test. It failed on atom due to different scheduling decisions. llvm-svn: 175401	2013-02-17 18:26:11 +00:00
Jakub Staszak	6a62c29f6b	Replace "check:" wth "CHECK:". Also fix one test by changing "vpermilps" to "vpshufd". llvm-svn: 175357	2013-02-16 12:16:56 +00:00
Bill Wendling	61375d8953	Reinitialize the ivars in the subtarget so that they can be reset with the new features. llvm-svn: 175336	2013-02-16 01:36:26 +00:00
Chad Rosier	925c9b499e	[ms-inline asm] Do not omit the frame pointer if we have ms-inline assembly. If the frame pointer is omitted, and any stack changes occur in the inline assembly, e.g.: "pusha", then any C local variable or C argument references will be incorrect. I pass no judgement on anyone who would do such a thing. ;) rdar://13218191 llvm-svn: 175334	2013-02-16 01:25:28 +00:00
Bill Wendling	e9434778f7	Temporary revert of 175320. llvm-svn: 175322	2013-02-15 23:22:32 +00:00
Bill Wendling	a060d0efd8	Reinitialize the ivars in the subtarget. When we're recalculating the feature set of the subtarget, we need to have the ivars in their initial state. llvm-svn: 175320	2013-02-15 23:18:01 +00:00
Paul Redmond	f29ddfe93f	enable SDISel sincos optimization for GNU environments - add sincos to runtime library if target triple environment is GNU - added canCombineSinCosLibcall() which checks that sincos is in the RTL and if the environment is GNU then unsafe fpmath is enabled (required to preserve errno) - extended sincos-opt lit test Reviewed by: Hal Finkel llvm-svn: 175283	2013-02-15 18:45:18 +00:00
Nadav Rotem	495b1a43c1	Dont merge consecutive loads/stores into vectors when noimplicitfloat is used. llvm-svn: 175190	2013-02-14 18:28:52 +00:00
Elena Demikhovsky	d0a0cc80cd	Fixed a bug in X86TargetLowering::LowerVectorIntExtend() (assertion failure). Added a test. llvm-svn: 175144	2013-02-14 08:20:26 +00:00
Andrew Trick	6871e5f4e5	Reapply "s/grep/FileCheck/ in some tests" This reverts commit fd1335e982bbf93c5f450ed4fd29f9f787435c85. Use a triple this time. llvm-svn: 175134	2013-02-14 03:45:08 +00:00
Andrew Trick	836bf1526b	Revert "s/grep/FileCheck/ in some tests" This reverts commit 8b75e6bc35fb3f9c1e788dbd05084c0f4a60a0f3. The FileCheck tests are not equivalent: test/CodeGen/X86/tailcall-structret.ll:6:10: error: expected string not found in input ; CHECK: jmp init ^ <stdin>:1:2: note: scanning from here .section __TEXT,__text,regular,pure_instructions ^ <stdin>:13:2: note: possible intended match here jmp _init ## TAILCALL ^ llvm-svn: 175124	2013-02-14 03:00:57 +00:00
Eli Bendersky	3ffeb68dd7	s/grep/FileCheck/ in some tests llvm-svn: 175093	2013-02-13 22:00:37 +00:00
Eli Bendersky	04553985d7	s/grep/FileCheck/ in some tests llvm-svn: 175089	2013-02-13 21:46:38 +00:00
Chad Rosier	da05cf7ba7	[ms-inline asm] Fix up test case for non-Darwin platforms. llvm-svn: 175087	2013-02-13 21:41:58 +00:00
Chad Rosier	282edd7caa	[ms-inline-asm] Add support for memory references that have non-immediate displacements. rdar://12974533 llvm-svn: 175083	2013-02-13 21:33:44 +00:00
Benjamin Kramer	8e2637e2b0	X86: Disable generation of rep;movsl when %esi is used as a base pointer. This happens when there is both stack realignment and a dynamic alloca in the function. If we overwrite %esi (rep;movsl uses fixed registers) we'll lose the base pointer and the next register spill will write into oblivion. Fixes PR15249 and unbreaks firefox on i386/freebsd. Mozilla uses dynamic allocas and freebsd a 4 byte stack alignment. llvm-svn: 175057	2013-02-13 13:40:35 +00:00
Elena Demikhovsky	9e0df7cb01	Prevent insertion of "vzeroupper" before call that preserves YMM registers, since a caller uses preserved registers across the call. llvm-svn: 175043	2013-02-13 08:02:04 +00:00
Eric Christopher	389ee71b0a	Check i1 as well as i8 variables for 8 bit registers for x86 inline assembly. llvm-svn: 175036	2013-02-13 06:01:05 +00:00
Eric Christopher	2398a9a175	Finish obviously broken thought. llvm-svn: 175035	2013-02-13 06:01:00 +00:00
Paul Redmond	7e7e3de43d	Fix the lit test added in r174972 Patch by: Kevin Schoedel llvm-svn: 174974	2013-02-12 16:07:27 +00:00
Paul Redmond	288604ed0c	PR14562 - Truncation of left shift became undef DAGCombiner::ReduceLoadWidth was converting (trunc i32 (shl i64 v, 32)) into (shl i32 v, 32) into undef. To prevent this, check the shift count against the final result size. Patch by: Kevin Schoedel Reviewed by: Nadav Rotem llvm-svn: 174972	2013-02-12 15:21:21 +00:00
Eli Bendersky	ef4558abd3	This is a follow-up on r174446, now taking Atom processors into account. Atoms use LEA for updating SP in prologs/epilogs, and the exact LEA opcode depends on the data model. Also reapplying the test case which was added and then reverted (because of Atom failures), this time specifying explicitly the CPU in addition to the triple. The test case now checks all variations (data mode, cpu Atom vs. Core). llvm-svn: 174542	2013-02-06 20:43:57 +00:00
Eli Bendersky	c4446856e3	Remove this test in the meantime, since it won't pass on Atom. Atom uses lea to move the stack pointer in prologs/epilogs. I will fix the test and add it back later. llvm-svn: 174484	2013-02-06 03:15:00 +00:00
Manman Ren	d2c38d684a	Attempt to recover gdb bot after r174445. Failure: undefined symbol 'Lline_table_start0'. Root-cause: we use a symbol subtraction to calculate at_stmt_list, but the line table entries are not dumped in the assembly. Fix: use zero instead of a symbol subtraction for Compile Unit 0. llvm-svn: 174479	2013-02-06 00:59:41 +00:00
Eli Bendersky	59a6fb0381	Test for r174446 llvm-svn: 174464	2013-02-05 23:31:48 +00:00
Manman Ren	4e042a6be6	Dwarf: support for LTO where a single object file can have multiple line tables We generate one line table for each compilation unit in the object file. Reviewed by Eric and Kevin. rdar://problem/13067005 llvm-svn: 174445	2013-02-05 21:52:47 +00:00
Owen Anderson	de89ecf1fc	Reapply r174343, with a fix for a scary DAG combine bug where it failed to differentiate between the alignment of the base point of a load, and the overall alignment of the load. This caused infinite loops in DAG combine with the original application of this patch. ORIGINAL COMMIT LOG: When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174431	2013-02-05 19:24:39 +00:00
Jakob Stoklund Olesen	eb1084ee54	Add a test case for PR14750. This was fixed by r174402. llvm-svn: 174405	2013-02-05 18:04:15 +00:00
NAKAMURA Takumi	3753b28cd2	Revert r174343, "When the target-independent DAGCombiner inferred a higher alignment for a load," It caused hangups in compiling clang/lib/Parse/ParseDecl.cpp and clang/lib/Driver/Tools.cpp in stage2 on some hosts. llvm-svn: 174374	2013-02-05 14:44:16 +00:00
Owen Anderson	a47fdbb032	When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174343	2013-02-05 06:25:30 +00:00
Benjamin Kramer	2c9da989c2	X86: Open up some opportunities for constant folding by postponing shift lowering. Fixes PR15141. llvm-svn: 174327	2013-02-04 15:19:33 +00:00
Benjamin Kramer	548ffa274a	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors. This required disabling a PowerPC optimization that did the following: input: x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16> lowered to: tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8> x = ADD tmp, tmp The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now. Fix some trivially foldable X86 tests too. llvm-svn: 174325	2013-02-04 15:19:18 +00:00
David Blaikie	33111dfea0	Remove the (apparently) unnecessary debug info metadata indirection. The main lists of debug info metadata attached to the compile_unit had an extra layer of metadata nodes they went through for no apparent reason. This patch removes that (& still passes just as much of the GDB 7.5 test suite). If anyone can show evidence as to why these extra metadata nodes are there I'm open to reverting this patch & documenting why they're there. llvm-svn: 174266	2013-02-02 05:56:24 +00:00
Shuxin Yang	cadd8a068e	rdar://13126763 Fix a bug in DAGCombine. The symptom is mistakenly optimizing expression "x + xx" into "x 3.0". llvm-svn: 174239	2013-02-02 00:22:03 +00:00
David Sehr	8114a7a651	Two changes relevant to LEA and x32: 1) allows the use of RIP-relative addressing in 32-bit LEA instructions under x86-64 (ILP32 and LP64) 2) separates the size of address registers in 64-bit LEA instructions from control by ILP32/LP64. llvm-svn: 174208	2013-02-01 19:28:09 +00:00
Lang Hames	dd47804394	When lowering memcpys to loads and stores, make sure we don't promote alignments past the natural stack alignment. llvm-svn: 174085	2013-01-31 20:23:43 +00:00
Eric Christopher	4e3e94c13d	Check and allow floating point registers to select the size of the register for inline asm. This conforms to how gcc allows for effective casting of inputs into gprs (fprs is already handled). llvm-svn: 174008	2013-01-31 00:50:46 +00:00
Eli Bendersky	6c84b90b70	Replace some more greps with FileChecks in tests llvm-svn: 174006	2013-01-31 00:44:12 +00:00
Eli Bendersky	a320e00e74	Rewrite this test properly with a FileCheck instead of greps llvm-svn: 173997	2013-01-31 00:11:52 +00:00
Evan Cheng	9449ec956f	Forgot the test case before. llvm-svn: 173988	2013-01-30 22:57:00 +00:00
Benjamin Kramer	05cc93964a	When the legalizer is splitting vector shifts, the result may not have the right shift amount type. Fix that by adding a cast to the shift expander. This came up with vector shifts on sse-less X86 CPUs. <2 x i64> = shl <2 x i64> <2 x i64> -> i64,i64 = shl i64 i64; shl i64 i64 -> i32,i32,i32,i32 = shl_parts i32 i32 i64; shl_parts i32 i32 i64 Now we cast the last two i64s to the right type. Fixes the crash in PR14668. llvm-svn: 173615	2013-01-27 11:19:11 +00:00
Benjamin Kramer	99c68dd964	X86: Do splat promotion later, so the optimizer can chew on it first. This catches many cases where we can emit a more efficient shuffle for a specific mask or when the mask contains undefs. Once the splat is lowered to unpacks we can't do that anymore. There is a possibility of moving the promotion after pshufb matching, but I'm not sure if pshufb with a mask loaded from memory is faster than 3 shuffles, so I avoided that for now. llvm-svn: 173569	2013-01-26 11:44:21 +00:00
Benjamin Kramer	7268a05178	FileCheckize and merge some tests. llvm-svn: 173568	2013-01-26 11:14:32 +00:00
Eli Bendersky	597fc1233a	In this patch, we teach X86_64TargetMachine that it has a ILP32 (defined by the x32 ABI) mode, in which case its pointers are 32-bits in size. This knowledge is also added to X86RegisterInfo that now returns the appropriate registers in getPointerRegClass. There are many outcomes to this change. In order to keep the patches separate and manageable, we start by focusing on some simple testable cases. The patch adds a test with passing a pointer to a function - focusing on the difference between the two data models for x86-64. Another test is added for handling of 'sret' arguments (and functionality is added in X86ISelLowering to make it work). A note on naming: the "x32 ABI" document refers to the AMD64 architecture (in LLVM it's distinguished by being is64Bits() in the x86 subtarget) with two variations: the LP64 (default) data model, and the ILP32 data model. This patch adds predicates to the subtarget which are consistent with this naming scheme. llvm-svn: 173503	2013-01-25 22:07:43 +00:00
Eli Bendersky	e6abe83258	Now that llvm-dwarfdump supports flags to specify which DWARF section to dump, use them in tests that run llvm-dwarfdump. This is in order to make tests as specific as possible. llvm-svn: 173498	2013-01-25 21:44:53 +00:00
Andrew Trick	e2c3f5c982	MIsched: Improve the interface to SchedDFS analysis (subtrees). Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state. llvm-svn: 173425	2013-01-25 06:33:57 +00:00
Andrew Trick	44f750a3e5	MISched: Add SchedDFSResult to ScheduleDAGMI to formalize the interface and allow other strategies to select it. llvm-svn: 173413	2013-01-25 04:01:04 +00:00
Bill Wendling	7c8f96a91b	Add the heuristic to differentiate SSPStrong from SSPRequired. The requirements of the strong heuristic are: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) llvm-svn: 173231	2013-01-23 06:43:53 +00:00
Bill Wendling	d154e283f2	Add the IR attribute 'sspstrong'. SSPStrong applies a heuristic to insert stack protectors in these situations: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) This patch implements the SSPString attribute to be equivalent to SSPRequired. This will change in a subsequent patch. llvm-svn: 173230	2013-01-23 06:41:41 +00:00
Michael Liao	3dffc5e2b7	Fix an issue of pseudo atomic instruction DAG schedule - Add list of physical registers clobbered in pseudo atomic insts Physical registers are clobbered when pseudo atomic instructions are expanded. Add them in clobber list to prevent DAG scheduler to mis-schedule them after these insns are declared side-effect free. - Add test case from Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 173200	2013-01-22 21:47:38 +00:00
NAKAMURA Takumi	9439237063	llvm/test/CodeGen/X86/win_ftol2.ll: Add -cpu=generic to appease valgrind. On valgrind the processor is reported; Host CPU: athlon-fx llvm-svn: 172983	2013-01-20 15:40:02 +00:00
Nadav Rotem	9450fcfff1	Revert 172708. The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends. This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical. Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model. llvm-svn: 172968	2013-01-20 08:35:56 +00:00
Nadav Rotem	7b3120b9ae	On Sandybridge split unaligned 256bit stores into two xmm-sized stores. llvm-svn: 172894	2013-01-19 08:38:41 +00:00
Nadav Rotem	7431211214	On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction. llvm-svn: 172868	2013-01-18 23:10:30 +00:00
NAKAMURA Takumi	b72e763325	llvm/test/CodeGen/X86/Atomics-64.ll: Tweak for 2nd RUN not to overwrite %t. It sometimes causes spurious failure on lit win32. Feel free to prune or suppress each output. llvm-svn: 172823	2013-01-18 14:52:02 +00:00
Elena Demikhovsky	f6a30e05d5	Optimization for the following SIGN_EXTEND pairs: v8i8 -> v8i64, v8i8 -> v8i32, v4i8 -> v4i64, v4i16 -> v4i64 for AVX and AVX2. Bug 14865. llvm-svn: 172708	2013-01-17 09:59:53 +00:00
Benjamin Kramer	bcd14a0f26	X86: Add patterns for X86ISD::VSEXT in registers. Those can occur when something between the sextload and the store is on the same chain and blocks isel. Fixes PR14887. llvm-svn: 172353	2013-01-13 11:37:04 +00:00
Preston Gurd	99c6990457	Update patch for the pad short functions pass for Intel Atom (only). Adds a check for -Oz, changes the code to not re-visit BBs, and skips over DBG_VALUE instrs. Patch by Andy Zhang. llvm-svn: 172258	2013-01-11 22:06:56 +00:00
Tim Northover	3a51aab390	Simplify writing floating types to assembly. This removes previous special cases for each floating-point type in favour of a shared codepath. llvm-svn: 172189	2013-01-11 10:36:13 +00:00
NAKAMURA Takumi	e46e8225f4	llvm/test/CodeGen/X86/ms-inline-asm.ll: Fixup; Globals doesn't have leading underscore in symbol on linux. llvm-svn: 172139	2013-01-10 23:02:48 +00:00
Evan Cheng	c8444b159a	PR14896: Handle memcpy from constant string where the memcpy size is larger than the string size. llvm-svn: 172124	2013-01-10 22:13:27 +00:00
Chad Rosier	a4bc9437a2	[ms-inline asm] Add support for calling functions from inline assembly. Part of rdar://12991541 llvm-svn: 172121	2013-01-10 22:10:27 +00:00
Evan Cheng	5652a8df32	Fix a DAG combine bug visitBRCOND() is transforming br(xor(x, y)) to br(x != y). It cahced XOR's operands before calling visitXOR() but failed to update the operands when visitXOR changed the XOR node. rdar://12968664 llvm-svn: 171999	2013-01-09 20:56:40 +00:00
Nadav Rotem	3f5825c6c1	add -march to the test llvm-svn: 171956	2013-01-09 07:04:23 +00:00
Nadav Rotem	977e0be4a0	Efficient lowering of vector sdiv when the divisor is a splatted power of two constant. PR 14848. The lowered sequence is based on the existing sequence the target-independent DAG Combiner creates for the scalar case. Patch by Zvi Rackover. llvm-svn: 171953	2013-01-09 05:14:33 +00:00
Preston Gurd	a01daace88	Pad Short Functions for Intel Atom The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. This patch has been updated to address Nadav's review comments - Optimize only at >= O1 and don't do optimization if -Os is set - Stores MachineBasicBlock* instead of BBNum - Uses DenseMap instead of std::map - Fixes placement of braces Patch by Andy Zhang. llvm-svn: 171879	2013-01-08 18:27:24 +00:00
Craig Topper	4f1c7256f9	Fix suffix handling for parsing and printing of cvtsi2ss, cvtsi2sd, cvtss2si, cvttss2si, cvtsd2si, and cvttsd2si to match gas behavior. cvtsi2* should parse with an 'l' or 'q' suffix or no suffix at all. No suffix should be treated the same as 'l' suffix. Printing should always print a suffix. Previously we didn't parse or print an 'l' suffix. cvtt2si/cvt2si should parse with an 'l' or 'q' suffix or not suffix at all. No suffix should use the destination register size to choose encoding. Printing should not print a suffix. Original 'l' suffix issue with cvtsi2* pointed out by Michael Kuperstein. llvm-svn: 171668	2013-01-06 20:39:29 +00:00
Evan Cheng	3fb03e23a4	Fix for PR14739. It's not safe to fold a load into a call across a store. Thanks to Nick Lewycky for the initial patch. llvm-svn: 171665	2013-01-06 19:00:15 +00:00
Craig Topper	92a70b1e65	Recommit r171461 which was incorrectly reverted. Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171608	2013-01-05 07:39:25 +00:00
Nadav Rotem	478b6a47ec	Revert revision 171524. Original message: URL: http://llvm.org/viewvc/llvm-project?rev=171524&view=rev Log: The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171603	2013-01-05 05:42:48 +00:00
Preston Gurd	e36b685a94	The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171524	2013-01-04 20:54:54 +00:00
Nadav Rotem	c616a5408a	Revert revision: 171467. This transformation is incorrect and makes some tests fail. Original message: Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171468	2013-01-04 17:35:21 +00:00
Elena Demikhovsky	5f2f06d2d9	Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171467	2013-01-03 08:48:33 +00:00
Michael Gottesman	820aac1c78	Revert "Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks." This reverts commit r171461 since it breaks the following tests: Clang :: Analysis/outofbound-notwork.c Clang :: Analysis/string-fail.c Clang :: CXX/basic/basic.lookup/basic.lookup.qual/p6-0x.cpp Clang :: CXX/basic/basic.lookup/basic.lookup.unqual/p15.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.fct.spec/p4.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.stc/p10.cpp Clang :: CXX/temp/temp.param/p14.cpp Clang :: CXX/temp/temp.res/temp.dep.res/temp.point/p1.cpp Clang :: CodeGen/2009-02-13-zerosize-union-field-ppc.c Clang :: CodeGen/blocks-2.c Clang :: CodeGen/libcalls-d.c Clang :: CodeGen/libcalls-ld.c Clang :: CodeGenCXX/conversion-function.cpp Clang :: CodeGenCXX/debug-info-limit-type.cpp Clang :: CodeGenCXX/inheriting-constructor.cpp Clang :: FixIt/fixit-errors.c Clang :: FixIt/fixit-pmem.cpp Clang :: Modules/namespaces.cpp Clang :: PCH/changed-files.c Clang :: PCH/pr4489.c Clang :: PCH/source-manager-stack.c Clang :: Parser/cxx-ambig-decl-expr-xfail.cpp Clang :: SemaCXX/switch-implicit-fallthrough-cxx98.cpp Clang :: SemaTemplate/instantiate-function-1.mm llvm-svn: 171466	2013-01-03 08:18:30 +00:00
Craig Topper	7c27cc9fd0	Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171461	2013-01-03 06:40:20 +00:00
Jakob Stoklund Olesen	725d57682b	Fix PR14732 by handling all kinds of IMPLICIT_DEF live ranges. Most IMPLICIT_DEF instructions are removed by the ProcessImplicitDefs pass, and a few are reinserted by PHIElimination when a PHI argument is <undef>. RegisterCoalescer was assuming that all IMPLICIT_DEF live ranges look like those created by PHIElimination, and that their live range never leaves the basic block. The PR14732 test case does tricks with PHI nodes that causes a longer IMPLICIT_DEF live range to appear. This happens very rarely, but RegisterCoalescer should be able to handle it. llvm-svn: 171435	2013-01-03 00:47:51 +00:00
Tom Stellard	567f886eb0	DAGCombiner: Avoid generating illegal vector INT_TO_FP nodes DAGCombiner::reduceBuildVecConvertToConvertBuildVec() was making two mistakes: 1. It was checking the legality of scalar INT_TO_FP nodes and then generating vector nodes. 2. It was passing the result value type to TargetLoweringInfo::getOperationAction() when it should have been passing the value type of the first operand. llvm-svn: 171420	2013-01-02 22:13:01 +00:00
Nadav Rotem	c8d7047fa9	AVX: Fix a bug in WidenMaskArithmetic. llvm-svn: 171397	2013-01-02 17:40:39 +00:00
Dmitri Gribenko	56bf2e1830	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171250	2012-12-30 02:33:22 +00:00
Nadav Rotem	3da9ac72fa	AVX: Move the ZEXT/ANYEXT DAGCo optimizations to the lowering of these optimizations. The old test cases still cover all of these lowering/optimizations. The single change that we have is that now anyext does not need to zero a register, because it does not use the exact code path as the zero_extend. llvm-svn: 171178	2012-12-28 05:45:24 +00:00
Nadav Rotem	2a054b4475	On AVX/AVX2 the type v8i1 is legalized to v8i16, which is an XMM sized register. In most cases we actually compare or select YMM-sized registers and mixing the two types creates horrible code. This commit optimizes some of the transition sequences. PR14657. llvm-svn: 171148	2012-12-27 08:15:45 +00:00
NAKAMURA Takumi	40aa3285f4	llvm/test/CodeGen/X86: FileCheck-ize two tests in r171083. llvm-svn: 171084	2012-12-26 03:19:30 +00:00
NAKAMURA Takumi	334f685328	llvm/test/CodeGen/X86: Disable avx in two tests corresponding to r171082. llvm-svn: 171083	2012-12-26 03:08:55 +00:00

1 2 3 4 5 ...

3986 Commits