llvm-project

Commit Graph

Author	SHA1	Message	Date
Pete Cooper	23e8b6b8c9	Remove unused CHECK lines copied from another test llvm-svn: 175905	2013-02-22 18:16:21 +00:00
Renato Golin	adc1b07002	More tests to global struct vectorizer llvm-svn: 175898	2013-02-22 16:18:31 +00:00
Kristof Beyls	0ba797e8f7	Make ARMAsmPrinter generate the correct alignment specifier syntax in instructions. The Printer will now print instructions with the correct alignment specifier syntax, like vld1.8 {d16}, [r0:64] llvm-svn: 175884	2013-02-22 10:01:33 +00:00
Bill Wendling	a032374ea0	Use references to attribute groups on the call/invoke instructions. Listing all of the attributes for the callee of a call/invoke instruction is way too much and makes the IR unreadable. Use references to attributes instead. llvm-svn: 175877	2013-02-22 09:09:42 +00:00
Reed Kotler	4416cdadd5	Expand mips16 SelT form pseudos/macros. llvm-svn: 175862	2013-02-22 05:10:51 +00:00
Pete Cooper	047f81a5df	Fix isa<> check which could never be true. It was incorrectly checking a Function* being an IntrinsicInst* which isn't possible. It should always have been checking the CallInst* instead. Added test case for x86 which ensures we only get one constant load. It was 2 before this change. rdar://problem/13267920 llvm-svn: 175853	2013-02-22 01:50:38 +00:00
Eli Bendersky	705085da37	Previously, parsing capability of the .debug_frame section was added to lib/DebugInfo, with dumping in llvm-dwarfdump. This patch adds initial ability to parse and dump CFA instructions contained in entries. To keep it manageable, the patch omits some more advanced capabilities (accounted in TODOs): * Parsing of instructions with BLOCK arguments (expression lists) * Dumping of actual instruction arguments (currently only names are dumped). This is quite tricky since the dumper has to effectively "interpret" the instructions. llvm-svn: 175820	2013-02-21 22:53:19 +00:00
Renato Golin	cf928cb53f	Allow GlobalValues to vectorize with AliasAnalysis Storing the load/store instructions with the values and inspect them using Alias Analysis to make sure they don't alias, since the GEP pointer operand doesn't take the offset into account. Trying hard to not add any extra cost to loads and stores that don't overlap on global values, AA is only calculated if all of the previous attempts failed. Using biggest vector register size as the stride for the vectorization access, as we're being conservative and the cost model (which calculates the real vectorization factor) is only run after the legalization phase. We might re-think this relationship in the future, but for now, I'd rather be safe than sorry. llvm-svn: 175818	2013-02-21 22:39:03 +00:00
Anshuman Dasgupta	d062c70444	Hexagon: Expand cttz, ctlz, and ctpop for now. llvm-svn: 175783	2013-02-21 19:39:40 +00:00
Jakob Stoklund Olesen	2ff4dc0ff2	Make RAFast::UsedInInstr indexed by register units. This fixes some problems with too conservative checking where we were marking all aliases of a register as used, and then also checking all aliases when allocating a register. <rdar://problem/13249625> llvm-svn: 175782	2013-02-21 19:35:21 +00:00
Bill Schmidt	27917785ae	Large code model support for PowerPC. Large code model is identical to medium code model except that the addis/addi sequence for "local" accesses is never used. All accesses use the addis/ld sequence. The coding changes are straightforward; most of the patch is taken up with creating variants of the medium model tests for large model. llvm-svn: 175767	2013-02-21 17:12:27 +00:00
Benjamin Kramer	3238dc0c61	DAGCombiner: Make the post-legalize vector op optimization more aggressive. A legal BUILD_VECTOR goes in and gets constant folded into another legal BUILD_VECTOR so we don't lose any legality here. The problematic PPC optimization that made this check necessary was fixed recently. llvm-svn: 175759	2013-02-21 15:24:35 +00:00
Tom Stellard	0d171c8877	R600: Fix for Unigine when MachineSched is enabled Fixes for-loop.cl piglit test Patch By: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175742	2013-02-21 15:06:59 +00:00
Michel Danzer	7f02a8c7a7	R600/SI: Make sure M0 is loaded for V_INTERP_MOV_F32 NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175733	2013-02-21 08:57:10 +00:00
Reed Kotler	97ba5f2772	Expand the sel pseudo/macro. This generates basic blocks where previously there were inline br .+4 instructions. Soon everything can enjoy the full instruction scheduling experience. llvm-svn: 175718	2013-02-21 04:22:38 +00:00
Jack Carter	dc46338e2d	Mips specific standalone assembler addressing mode %hi and %lo. The constructs %hi() and %lo() represent the high and low 16 bits of the address. Because the 16 bit offset field of an LW instruction is interpreted as signed, if bit 15 of the low part is 1 then the low part will act as a negative and 1 needs to be added to the high part. Contributor: Vladimir Medic llvm-svn: 175707	2013-02-21 02:09:31 +00:00
Bill Schmidt	f5b474c6c6	PPCDAGToDAGISel::PostprocessISelDAG() This patch implements the PPCDAGToDAGISel::PostprocessISelDAG virtual method to perform post-selection peephole optimizations on the DAG representation. One optimization is implemented here: folds to clean up complex addressing expressions for thread-local storage and medium code model. It will also be useful for large code model sequences when those are added later. I originally thought about doing this on the MI representation prior to register assignment, but it's difficult to do effective global dead code elimination at that point. DCE is trivial on the DAG representation. A typical example of a candidate code sequence in assembly: addis 3, 2, globalvar@toc@ha addi 3, 3, globalvar@toc@l lwz 5, 0(3) When the final instruction is a load or store with an immediate offset of zero, the offset from the add-immediate can replace the zero, provided the relocation information is carried along: addis 3, 2, globalvar@toc@ha lwz 5, globalvar@toc@l(3) Since the addi can in general have multiple uses, we need to only delete the instruction when the last use is removed. llvm-svn: 175697	2013-02-21 00:38:25 +00:00
Jack Carter	1ac5322e61	ELF symbol table field st_other support, excluding visibility bits. Mips specific standalone assembler directive "set at". This directive changes the general purpose register that the assembler will use when given the symbolic register name $at. This does not include negative testing. That will come in a future patch. A side effect of this patch recognizes the different GPR register names for temporaries between old abi and new abi so a test case for that is included. Contributor: Vladimir Medic llvm-svn: 175686	2013-02-20 23:11:17 +00:00
Bill Schmidt	4437ad7d94	Stabilize vec_constants.ll llvm-svn: 175683	2013-02-20 22:43:03 +00:00
Arnold Schwaighofer	3f9568e921	DAGCombiner: Fold pointless truncate, bitcast, buildvector series (2xi32) (truncate ((2xi64) bitcast (buildvector i32 a, i32 x, i32 b, i32 y))) can be folded into a (2xi32) (buildvector i32 a, i32 b). Such a DAG would cause unnecessary vdup instructions followed by vmovn instructions. We generate this code on ARM NEON for a setcc olt, 2xf64, 2xf64. For example, in the vectorized version of the code below. double A[N]; double B[N]; void test_double_compare_to_double() { int i; for(i=0;i<N;i++) A[i] = (double)(A[i] < B[i]); } radar://13191881 Fixes bug 15283. llvm-svn: 175670	2013-02-20 21:33:32 +00:00
Bill Schmidt	c6cbecc2c7	Additional fixes for bug 15155. This handles the cases where the 6-bit splat element is odd, converting to a three-instruction sequence to add or subtract two splats. With this fix, the XFAIL in test/CodeGen/PowerPC/vec_constants.ll is removed. llvm-svn: 175663	2013-02-20 20:41:42 +00:00
Michael Liao	7fb39669ef	Fix PR15267 - When extloading from a vector with non-byte-addressable element, e.g. <4 x i1>, the current logic breaks. Extend the current logic to fix the case where the element type is not byte-addressable by loading all bytes, bit-extracting/packing each element. llvm-svn: 175642	2013-02-20 18:04:21 +00:00
Bill Schmidt	6631e94838	Fix bug 14779 for passing anonymous aggregates [patch by Kai Nacke]. The PPC backend doesn't handle these correctly. This patch uses logic similar to that in the X86 and ARM backends to track these arguments properly. llvm-svn: 175635	2013-02-20 17:31:41 +00:00
Jyotsna Verma	7503a62bce	Hexagon: Move HexagonMCInst.h to MCTargetDesc/HexagonMCInst.h. Add HexagonMCInst class which adds various Hexagon VLIW annotations. In addition, this class also includes some APIs related to the constant extenders. llvm-svn: 175634	2013-02-20 16:13:27 +00:00
Bill Schmidt	51e7951e24	Fix PR15155: lost vadd/vsplat optimization. During lowering of a BUILD_VECTOR, we look for opportunities to use a vector splat. When the splatted value fits in 5 signed bits, a single splat does the job. When it doesn't fit in 5 bits but does fit in 6, and is an even value, we can splat on half the value and add the result to itself. This last optimization hasn't been working recently because of improved constant folding. To circumvent this, create a pseudo VADD_SPLAT that can be expanded during instruction selection. llvm-svn: 175632	2013-02-20 15:50:31 +00:00
Elena Demikhovsky	0ccdd1315b	I optimized the following patterns: sext <4 x i1> to <4 x i64> sext <4 x i8> to <4 x i64> sext <4 x i16> to <4 x i64> I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns: (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT))) The sext_in_reg (v4i32 x) may be lowered to shl+sar operations. The "sar" does not exist on 64-bit operation, so lowering sext_in_reg (v4i64 x) has no vector solution. I also added a cost for these operations to the AVX costs table. llvm-svn: 175619	2013-02-20 12:42:54 +00:00
Kostya Serebryany	699ac28aa5	[asan] instrument invoke insns with noreturn attribute (as well as call insns) llvm-svn: 175617	2013-02-20 12:35:15 +00:00
Logan Chien	53c18d8ac7	Fix thumbv5e frame lowering assertion failure. It is possible that frame pointer is not found in the callee saved info, thus FramePtrSpillFI may be incorrect if we don't check the result of hasFP(MF). Besides, if we enable the stack coloring algorithm, there will be an assertion to ensure the slot is live. But in the test case, %var1 is not live in the prologue of the function, and we will get the assertion failure. Note: There is similar code in ARMFrameLowering.cpp. llvm-svn: 175616	2013-02-20 12:21:33 +00:00
Bill Wendling	4cdb88983d	Use the attribute group reference instead of the attribute directly. llvm-svn: 175609	2013-02-20 07:48:23 +00:00
Bill Wendling	90bc19cd91	Modify the LLVM assembly output so that it uses references to represent function attributes. This makes the LLVM assembly look better. E.g.: define void @foo() #0 { ret void } attributes #0 = { nounwind noinline ssp } llvm-svn: 175605	2013-02-20 07:21:42 +00:00
Reed Kotler	7b503c2b03	Expand pseudos/macros: SltCCRxRy16, SltiCCRxImmX16, SltiuCCRxImmX16, SltuCCRxRy16 $T8 shows up as register $24 when emitted from C++ code so we had to change some tests that were already there for this functionality. llvm-svn: 175593	2013-02-20 05:45:15 +00:00
Michael J. Spencer	6a8746b7e6	[llvm-readobj] Add ELF .dynamic table dumping. llvm-svn: 175592	2013-02-20 02:37:12 +00:00
Chad Rosier	45a52fa097	[ms-inline asm] Force the use of a base pointer if the MachineFunction includes MS-style inline assembly. This is a follow-on to r175334. Forcing a FP to be emitted doesn't ensure it will be used. Therefore, force the base pointer as well. We now treat MS inline assembly in the same way we treat functions with dynamic stack realignment and VLAs. This guarantees the BP will be used to reference parameters and locals. rdar://13218191 llvm-svn: 175576	2013-02-19 23:50:45 +00:00
Jack Carter	10c97e5ca0	ELF symbol table field st_other support, excluding visibility bits. Mips (o32 abi) specific e_header setting. EF_MIPS_ABI_O32 needs to be set in the ELF header flags for o32 abi output. Contributor: Reed Kotler llvm-svn: 175569	2013-02-19 22:29:00 +00:00
Jack Carter	1ba1f3cec8	ELF symbol table field st_other support, excluding visibility bits. Mips (Mips16) specific e_header setting. EF_MIPS_ARCH_ASE_M16 needs to be set in the ELF header flags for Mips16. Contributor: Reed Kotler llvm-svn: 175566	2013-02-19 22:14:34 +00:00
Nadav Rotem	0186347c4c	Fix a bug in mayHaveSideEffects. Functions that do not return are now considered as instructions with side effects. rdar://13227456 llvm-svn: 175553	2013-02-19 20:02:09 +00:00
Jim Grosbach	3fa275e6f7	ARM: Allocation hints must make sure to be in the alloc order. When creating an allocation hint for a register pair, make sure the hint for the physical register reference is still in the allocation order. rdar://13240556 llvm-svn: 175541	2013-02-19 18:55:36 +00:00
Eli Bendersky	c66b7b2582	Fix typo llvm-svn: 175530	2013-02-19 17:11:48 +00:00
Benjamin Kramer	b3aa2b8497	Fix GCMetadataPrinter::finishAssembly not executed, patch by Yiannis Tsiouris. Due to the execution order of doFinalization functions, the GC information was deleted before AsmPrinter::doFinalization was executed. Thus, the GCMetadataPrinter::finishAssembly was never called. The patch fixes that by moving the code of the GCInfoDeleter::doFinalization to Printer::doFinalization. llvm-svn: 175528	2013-02-19 16:51:44 +00:00
Arnold Schwaighofer	e5083442b2	ARM NEON: Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element and the bitcast, we will unnecessarily generate moves between scalar and vector registers. The patch fixes this by using a COPY_TO_REGCLASS and a EXTRACT_SUBREG to extract the element from the vector instead. radar://13191881 llvm-svn: 175520	2013-02-19 15:27:05 +00:00
Kostya Serebryany	3ece9beaf1	[asan] instrument memory accesses with unusual sizes This patch makes asan instrument memory accesses with unusual sizes (e.g. 5 bytes or 10 bytes), e.g. long double or packed structures. Instrumentation is done with two 1-byte checks (first and last bytes) and if the error is found __asan_report_load_n(addr, real_size) or __asan_report_store_n(addr, real_size) is called. Also, call these two new functions in memset/memcpy instrumentation. asan-rt part will follow. llvm-svn: 175507	2013-02-19 11:29:21 +00:00
Reed Kotler	3e457f505e	Expand pseudos/macros BteqzT8SltiX16, BteqzT8SltiuX16, BtnezT8SltiX16, BtnezT8SltiuX16 . llvm-svn: 175486	2013-02-19 03:56:57 +00:00
Bill Wendling	c98e4fef1a	Temporarily revert r175470 for more review. llvm-svn: 175476	2013-02-19 00:52:45 +00:00
Reed Kotler	d82171990f	Expand pseudos BteqzT8CmpiX16 and BtnezT8CmpiX16. llvm-svn: 175474	2013-02-19 00:20:58 +00:00
Bill Wendling	66651e4c2f	Check to see if the 'no-builtin' attribute is set before simplifying a library call. llvm-svn: 175470	2013-02-18 23:17:16 +00:00
Chad Rosier	f666b761bd	Comment out the rdar number. llvm-svn: 175460	2013-02-18 21:59:15 +00:00
Chad Rosier	f3f8f443e1	[fast-isel] Remove an invalid assert. If the memcpy has an odd length with an alignment of 2, this would incorrectly assert on the last 1 byte copy. rdar://13202135 llvm-svn: 175459	2013-02-18 21:46:28 +00:00
Benjamin Kramer	53bc37ca2a	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Vincent Lejeune	1ce13f553e	R600/SI: Use MULADD_IEEE/V_MAD_F32 instruction for mad pattern llvm-svn: 175446	2013-02-18 14:11:28 +00:00
Kostya Serebryany	7ca384bc1a	[asan] revert r175266 as it breaks code with packed structures. supporting long double will require a more general solution llvm-svn: 175442	2013-02-18 13:47:02 +00:00
Tim Northover	67d3c09332	AArch64: adjust tests which rely on a default JIT Profiling tests do need a JIT. They'll pass if a cross-compiler targeting AArch64 by default has been built, but fail if a native AArch64 compiler has been built. Therefore XFAIL is inappropriate and we mark them unsupported. ExecutionEngine tests are JIT by definition, they should also be unsupported. Transforms/LICM only uses the interpreter to check the output is still sane after optimisation. It can be switched to use an interpreter. llvm-svn: 175433	2013-02-18 11:08:37 +00:00
Reed Kotler	1460738710	Expand macro/pseudo instructions BtnezT8SltX16 and BtnezT8SltuX16. llvm-svn: 175420	2013-02-18 05:43:03 +00:00
Reed Kotler	c40f4e5899	Expand pseudo/macro BteqzT8SltX16. llvm-svn: 175417	2013-02-18 04:04:26 +00:00
Reed Kotler	7e4bc6067b	Expand macro/pseudo BteqzT8CmpX16. llvm-svn: 175416	2013-02-18 03:06:29 +00:00
Reed Kotler	cb37409b92	Beginning of expanding all current mips16 macro/pseudo instruction sequences. This expansion will be moved to expandISelPseudos as soon as I can figure out how to do that. There are other instructions which use this ExpandFEXT_T8I816_ins and as soon as I have finished expanding them all, I will delete the macro asm string text so it has no way to be used in the future. llvm-svn: 175413	2013-02-18 00:59:04 +00:00
Richard Osborne	53fff94527	[XCore] Add missing 2r instructions. These instructions are not targeted by the compiler but it is needed for the MC layer. llvm-svn: 175407	2013-02-17 22:38:05 +00:00
Richard Osborne	f5a3ffcba9	[XCore] Add TSETR instruction. This instruction is not targeted by the compiler but it is needed for the MC layer. llvm-svn: 175406	2013-02-17 22:32:41 +00:00
Richard Osborne	2192615d9f	[XCore] Add missing u10 / lu10 instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 175404	2013-02-17 20:44:48 +00:00
Richard Osborne	3814491fb1	[XCore] Add missing u6 / lu6 instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 175403	2013-02-17 20:43:17 +00:00
Benjamin Kramer	fb9ea4e659	Force a cpu for test. It failed on atom due to different scheduling decisions. llvm-svn: 175401	2013-02-17 18:26:11 +00:00
Hal Finkel	76e65e4542	BBVectorize: Fix an invalid reference bug This fixes PR15289. This bug was introduced (recently) in r175215; collecting all std::vector references for candidate pairs to delete at once is invalid because subsequent lookups in the owning DenseMap could invalidate the references. bugpoint was able to reduce a useful test case. Unfortunately, because whether or not this asserts depends on memory layout, this test case will sometimes appear to produce valid output. Nevertheless, running under valgrind will reveal the error. llvm-svn: 175397	2013-02-17 15:59:26 +00:00
Bill Wendling	23242098e7	The transform is: (or (bool?A:B),(bool?C:D)) --> (bool?(or A,C):(or B,D)) By the time the OR is visited, both the SELECTs have been visited and not optimized and the OR itself hasn't been transformed so we do this transform in the hopes that the new ORs will be optimized. The transform is explicitly disabled for vector-selects until "codegen matures to handle them better". Patch by Muhammad Tauqir! llvm-svn: 175380	2013-02-16 23:41:36 +00:00
Benjamin Kramer	64bf78046e	MCParser: Reject .balign with non-pow2 alignments. GNU as rejects them and there are configure scripts in the wild that check if the assembler rejects ".align 3" to determine whether the alignment is in bytes or powers of two. llvm-svn: 175360	2013-02-16 15:00:16 +00:00
Jakub Staszak	6a62c29f6b	Replace "check:" with "CHECK:". Also fix one test by changing "vpermilps" to "vpshufd". llvm-svn: 175357	2013-02-16 12:16:56 +00:00
Bill Wendling	61375d8953	Reinitialize the ivars in the subtarget so that they can be reset with the new features. llvm-svn: 175336	2013-02-16 01:36:26 +00:00
Chad Rosier	925c9b499e	[ms-inline asm] Do not omit the frame pointer if we have ms-inline assembly. If the frame pointer is omitted, and any stack changes occur in the inline assembly, e.g.: "pusha", then any C local variable or C argument references will be incorrect. I pass no judgement on anyone who would do such a thing. ;) rdar://13218191 llvm-svn: 175334	2013-02-16 01:25:28 +00:00
Joerg Sonnenberger	e2bb314a30	Derive ELF section type from the name in some cases where GNU as does so. llvm-svn: 175327	2013-02-16 00:32:53 +00:00
Bill Wendling	e9434778f7	Temporary revert of 175320. llvm-svn: 175322	2013-02-15 23:22:32 +00:00
Bill Wendling	a060d0efd8	Reinitialize the ivars in the subtarget. When we're recalculating the feature set of the subtarget, we need to have the ivars in their initial state. llvm-svn: 175320	2013-02-15 23:18:01 +00:00
Derek Schuff	8878bcc9e7	If bundle alignment is enabled, do not add data to a fragment with instructions With bundle alignment, instructions all get their own MCFragments (unless they are in a bundle-locked group). For instructions with fixups, this is an MCDataFragment. Emitting actual data (e.g. for .long) attempts to re-use MCDataFragments, which we don't want in this case since it leads to fragments which exceed the bundle size. So, don't reuse them in this case. Also adds a test and fixes some formatting. llvm-svn: 175316	2013-02-15 22:50:52 +00:00
Pekka Jaaskelainen	62848c9c24	Forgot to 'svn add' the LoopVectorizer tests for the new parallel loop metadata, sorry. llvm-svn: 175311	2013-02-15 21:50:19 +00:00
Paul Redmond	f29ddfe93f	enable SDISel sincos optimization for GNU environments - add sincos to runtime library if target triple environment is GNU - added canCombineSinCosLibcall() which checks that sincos is in the RTL and if the environment is GNU then unsafe fpmath is enabled (required to preserve errno) - extended sincos-opt lit test Reviewed by: Hal Finkel llvm-svn: 175283	2013-02-15 18:45:18 +00:00
Arnaud A. de Grandmaison	61c167c62b	Teach InstCombine to work with smaller legal types in icmp (shl %v, C1), C2 This allows working with a smaller constant, which is friendly to targets that can compare to immediates. It also avoids inserting a shift in favor of a trunc, which can be free on some targets. This used to work until LLVM-3.1, but regressed with the 3.2 release. llvm-svn: 175270	2013-02-15 14:35:47 +00:00
Kostya Serebryany	a968568165	[asan] support long double on 64-bit. See https://code.google.com/p/address-sanitizer/issues/detail?id=151 llvm-svn: 175266	2013-02-15 12:46:06 +00:00
Tim Northover	3533ad6bbd	AArch64: remove ConstantIsland pass & put literals in separate section. This implements the review suggestion to simplify the AArch64 backend. If we later discover that we really need the extra complexity of the ConstantIslands pass for performance reasons it can be resurrected. llvm-svn: 175258	2013-02-15 09:33:43 +00:00
Tim Northover	5466e36fb5	AArch64: refactor frame handling to use movz/movk for overlarge offsets. In the near future litpools will be in a different section, which means that any access to them is at least two instructions. This makes the case for a movz/movk pair (if total offset <= 32-bits) even more compelling. llvm-svn: 175257	2013-02-15 09:33:26 +00:00
Bill Wendling	26b95756c1	Simplify the 'operator<' for the attribute object. llvm-svn: 175252	2013-02-15 05:25:26 +00:00
Anna Zaks	269d1fa991	Revert "Fix testcase for attribute ordering." This reverts commit 58f20a3cbfca7384fe5e25e095f18572736a4792. llvm-svn: 175249	2013-02-15 04:15:53 +00:00
Anna Zaks	61040b915d	Revert "Fix testcase for attribute ordering." This reverts commit 997c6516ca161073a1d516ebca7c0ca7722f64e2. llvm-svn: 175248	2013-02-15 04:15:50 +00:00
Bill Wendling	f7d8d767fb	Fix testcase for attribute ordering. llvm-svn: 175238	2013-02-15 01:04:46 +00:00
Reed Kotler	f022147790	Fix minor mips16 issues in directives for function prologue. Probably this does not matter but makes it more gcc compatible which avoids possible subtle problems. Also, turned back on a disabled check in helloworld.ll. llvm-svn: 175237	2013-02-15 01:04:38 +00:00
Bill Wendling	fa1d248ccf	Fix testcase for attribute ordering. llvm-svn: 175236	2013-02-15 00:58:25 +00:00
Joel Jones	0f8617b17e	The ARM NEON vector compare instructions take three arguments. However, the assembler should also accept a two arg form, as the documentation specifies that the first (destination) register is optional. This patch uses TwoOperandAliasConstraint to add the two argument form. It also fixes an 80-column formatting problem in: test/MC/ARM/neon-bitwise-encoding <rdar://problem/12909419> Clang rejects ARM NEON assembly instructions llvm-svn: 175221	2013-02-14 23:18:40 +00:00
Kay Tiong Khoo	7b564da474	death to extra whitespace llvm-svn: 175200	2013-02-14 19:15:14 +00:00
Kay Tiong Khoo	f809c6491d	added basic support for Intel ADX instructions -feature flag, instructions definitions, test cases llvm-svn: 175196	2013-02-14 19:08:21 +00:00
Nadav Rotem	495b1a43c1	Don't merge consecutive loads/stores into vectors when noimplicitfloat is used. llvm-svn: 175190	2013-02-14 18:28:52 +00:00
Weiming Zhao	c598700788	Re-apply r175088 for bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM Update test case to use -mtriple=arm-linux-gnueabi llvm-svn: 175186	2013-02-14 18:10:21 +00:00
Vincent Lejeune	f940fd05bd	R600: Do not fold single instruction with more than 3 kcache reads It fixes around 100 tfb piglit tests and 16 glean tests. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175183	2013-02-14 16:57:19 +00:00
Rafael Espindola	86d5345988	Revert r15266. This fixes llvm.org/pr15266. llvm-svn: 175173	2013-02-14 16:23:08 +00:00
Krzysztof Parzyszek	f71a69d608	Add testcase for llvm-dwarfdump to test parsing of the pubnames data. llvm-svn: 175168	2013-02-14 16:10:58 +00:00
Kristof Beyls	2efb59a719	Make ARMAsmParser accept the correct alignment specifier syntax in instructions. The parser will now accept instructions with alignment specifiers written like vld1.8 {d16}, [r0:64] , while also still accepting the incorrect syntax vld1.8 {d16}, [r0, :64] llvm-svn: 175164	2013-02-14 14:46:12 +00:00
Elena Demikhovsky	5b9d426907	Moved line-info.ll to DebugInfo\X86 directory llvm-svn: 175150	2013-02-14 09:07:34 +00:00
Elena Demikhovsky	70247a807b	The test failed on Windows. I've changed the platform to run to "x86_64-apple-darwin". llvm-svn: 175146	2013-02-14 08:23:08 +00:00
Elena Demikhovsky	d0a0cc80cd	Fixed a bug in X86TargetLowering::LowerVectorIntExtend() (assertion failure). Added a test. llvm-svn: 175144	2013-02-14 08:20:26 +00:00
Michel Danzer	51d5eb2f63	R600: Add lit tests for texture sampling instruction selection. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175138	2013-02-14 07:43:51 +00:00
Andrew Trick	6871e5f4e5	Reapply "s/grep/FileCheck/ in some tests" This reverts commit fd1335e982bbf93c5f450ed4fd29f9f787435c85. Use a triple this time. llvm-svn: 175134	2013-02-14 03:45:08 +00:00
Nick Lewycky	06417743cf	Teach the DataLayout aware constant folder to be much more aggressive towards 'and' instructions. This is a pattern that shows up a lot in ubsan binaries. llvm-svn: 175128	2013-02-14 03:23:37 +00:00
Andrew Trick	836bf1526b	Revert "s/grep/FileCheck/ in some tests" This reverts commit 8b75e6bc35fb3f9c1e788dbd05084c0f4a60a0f3. The FileCheck tests are not equivalent: test/CodeGen/X86/tailcall-structret.ll:6:10: error: expected string not found in input ; CHECK: jmp init ^ <stdin>:1:2: note: scanning from here .section __TEXT,__text,regular,pure_instructions ^ <stdin>:13:2: note: possible intended match here jmp _init ## TAILCALL ^ llvm-svn: 175124	2013-02-14 03:00:57 +00:00
Weiming Zhao	090edf7e67	temporarily revert the patch due to some conflicts llvm-svn: 175107	2013-02-13 23:24:40 +00:00
Anshuman Dasgupta	e96f804eba	Hexagon: add support for predicate-GPR copies. llvm-svn: 175102	2013-02-13 22:56:34 +00:00
Tom Stellard	91da4e9199	R600: Add support for 128-bit parameters NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175096	2013-02-13 22:05:20 +00:00
Eli Bendersky	3ffeb68dd7	s/grep/FileCheck/ in some tests llvm-svn: 175093	2013-02-13 22:00:37 +00:00
Eli Bendersky	04553985d7	s/grep/FileCheck/ in some tests llvm-svn: 175089	2013-02-13 21:46:38 +00:00
Weiming Zhao	0632a4b002	Bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM llvm-svn: 175088	2013-02-13 21:43:02 +00:00
Chad Rosier	da05cf7ba7	[ms-inline asm] Fix up test case for non-Darwin platforms. llvm-svn: 175087	2013-02-13 21:41:58 +00:00
Jyotsna Verma	d92252469e	Hexagon: Use absolute addressing mode loads/stores for global+offset instead of redefining separate instructions for them. llvm-svn: 175086	2013-02-13 21:38:46 +00:00
Chad Rosier	282edd7caa	[ms-inline-asm] Add support for memory references that have non-immediate displacements. rdar://12974533 llvm-svn: 175083	2013-02-13 21:33:44 +00:00
Reed Kotler	f662cff689	For Mips 16, add the optimization where the 16 bit form of addiu sp can be used if the offset fits in 11 bits. This makes use of the fact that the abi requires sp to be 8 byte aligned so the actual offset can fit in 8 bits. It will be shifted left and sign extended before being actually used. The assembler or direct object emitter will shift right the 11 bit signed field by 3 bits. We don't need to deal with that here. llvm-svn: 175073	2013-02-13 20:28:27 +00:00
Manman Ren	7a4c8a783c	Clean up LDV, no functionality change. Remove dead functions: renameRegister Move private member variables from LDV to Impl Remove ssp/uwtable from testing case llvm-svn: 175072	2013-02-13 20:23:48 +00:00
David Peixotto	6eecb28d3a	PR14992 - Tablegen incorrectly converts ARM tLDMIA_UPD pseudo to tLDMIA Fixed bug in tablegen conversion when source pseudo instruction has a different number of arguments than the destination instruction. llvm-svn: 175066	2013-02-13 19:21:47 +00:00
Benjamin Kramer	8e2637e2b0	X86: Disable generation of rep;movsl when %esi is used as a base pointer. This happens when there is both stack realignment and a dynamic alloca in the function. If we overwrite %esi (rep;movsl uses fixed registers) we'll lose the base pointer and the next register spill will write into oblivion. Fixes PR15249 and unbreaks firefox on i386/freebsd. Mozilla uses dynamic allocas and freebsd a 4 byte stack alignment. llvm-svn: 175057	2013-02-13 13:40:35 +00:00
Reed Kotler	9cb8e7b9f5	Make jumptables work for -static llvm-svn: 175044	2013-02-13 08:32:14 +00:00
Elena Demikhovsky	9e0df7cb01	Prevent insertion of "vzeroupper" before call that preserves YMM registers, since a caller uses preserved registers across the call. llvm-svn: 175043	2013-02-13 08:02:04 +00:00
Eric Christopher	389ee71b0a	Check i1 as well as i8 variables for 8 bit registers for x86 inline assembly. llvm-svn: 175036	2013-02-13 06:01:05 +00:00
Eric Christopher	2398a9a175	Finish obviously broken thought. llvm-svn: 175035	2013-02-13 06:01:00 +00:00
Kostya Serebryany	3838f27905	[tsan] disable load widening in ThreadSanitizer mode llvm-svn: 175034	2013-02-13 05:59:45 +00:00
Manman Ren	f019cd62da	Debug Info: LiveDebugVariable can remove DBG_VALUEs, make sure we emit them back. RegisterCoalescer used to depend on LiveDebugVariable. LDV removes DBG_VALUEs without emitting them at the end. We fix this by removing LDV from RegisterCoalescer. Also add an assertion to make sure we call emitDebugValues if DBG_VALUEs are removed at runOnMachineFunction. rdar://problem/13183203 Reviewed by Andy & Jakob llvm-svn: 175023	2013-02-13 01:14:49 +00:00
Krzysztof Parzyszek	5974de4e7d	Remove target-specific info from the testcase for DWARF/pubnames. llvm-svn: 174992	2013-02-12 18:53:21 +00:00
Chad Rosier	f72d06a919	[ms-inline asm] Add support for lexing binary integers with a [bB] suffix. This is complicated by backward labels (e.g., 0b can be both a backward label and a binary zero). The current implementation assumes [0-9]b is always a label and thus it's possible for 0b and 1b to not be interpreted correctly for ms-style inline assembly. However, this is relatively simple to fix in the inline assembly (i.e., drop the [bB]). This patch also limits backward labels to [0-9]b, so that only 0b and 1b are ambiguous. Part of rdar://12470373 llvm-svn: 174983	2013-02-12 18:29:02 +00:00
Krzysztof Parzyszek	228daa6986	Allow optionally generating pubnames section in DWARF info. Introduce option "generate-dwarf-pubnames" to control it, set to "false" by default. llvm-svn: 174981	2013-02-12 18:00:14 +00:00
Kay Tiong Khoo	c5c9713fcf	added test cases for r174920 (prefetch disassembly) llvm-svn: 174979	2013-02-12 17:07:44 +00:00
Paul Redmond	7e7e3de43d	Fix the lit test added in r174972 Patch by: Kevin Schoedel llvm-svn: 174974	2013-02-12 16:07:27 +00:00
Jyotsna Verma	39f7a2b7a0	Hexagon: Add support to generate predicated absolute addressing mode instructions. llvm-svn: 174973	2013-02-12 16:06:23 +00:00
Paul Redmond	288604ed0c	PR14562 - Truncation of left shift became undef DAGCombiner::ReduceLoadWidth was converting (trunc i32 (shl i64 v, 32)) into (shl i32 v, 32) into undef. To prevent this, check the shift count against the final result size. Patch by: Kevin Schoedel Reviewed by: Nadav Rotem llvm-svn: 174972	2013-02-12 15:21:21 +00:00
Justin Holewinski	be8dc6499a	[NVPTX] Disable vector registers Vectors were being manually scalarized by the backend. Instead, let the target-independent code do all of the work. The manual scalarization was from a time before good target-independent support for scalarization in LLVM. However, this forces us to specially-handle vector loads and stores, which we can turn into PTX instructions that produce/consume multiple operands. llvm-svn: 174968	2013-02-12 14:18:49 +00:00
Kostya Serebryany	e2e32b32e8	[asan] fix tests for the new ABI llvm-svn: 174959	2013-02-12 11:14:24 +00:00
Bill Wendling	7321fec934	Test for string attributes and for attribute group output. llvm-svn: 174954	2013-02-12 09:14:20 +00:00
Arnold Schwaighofer	89aef93841	ARM cost model: Add vector reverse shuffle costs A reverse shuffle is lowered to a vrev and possibly a vext instruction (quad word). radar://13171406 llvm-svn: 174933	2013-02-12 02:40:39 +00:00
Arnold Schwaighofer	1f3d3ca769	ARM NEON: Handle v16i8 and v8i16 reverse shuffles Lower reverse shuffles to a vrev64 and a vext instruction instead of the default legalization of storing and loading to the stack. This is important because we generate reverse shuffles in the loop vectorizer when we reverse store to an array. uint8_t Arr[N]; for (i = 0; i < N; ++i) Arr[N - i - 1] = ... radar://13171760 llvm-svn: 174929	2013-02-12 01:58:32 +00:00
Chad Rosier	8bc655605b	[ms-inline asm] Add support for lexing hexadecimal integers with a [hH] suffix. Part of rdar://12470373 llvm-svn: 174926	2013-02-12 01:00:01 +00:00
Michael Ilseman	74a6da963b	Optimization: bitcast (<1 x ...> insertelement ..., X, ...) to ... ==> bitcast X to ... llvm-svn: 174905	2013-02-11 21:41:44 +00:00
Krzysztof Parzyszek	9a278f108a	Extend Hexagon hardware loop generation to handle various additional cases: - variety of compare instructions, - loops with no preheader, - arbitrary lower and upper bounds. llvm-svn: 174904	2013-02-11 21:37:55 +00:00
Michael Ilseman	35f82ff833	Remove trailing whitespace llvm-svn: 174903	2013-02-11 21:36:49 +00:00
Kay Tiong Khoo	d30b1a2ac7	fixed disassembly of some i386 system insts with intel syntax; added file for test cases for i386 intel syntax llvm-svn: 174900	2013-02-11 19:46:36 +00:00
Justin Holewinski	4f12f53353	[NVPTX] Remove NoCapture from address space conversion intrinsics. NoCapture is not valid in this case, and was causing incorrect optimizations. llvm-svn: 174896	2013-02-11 18:56:35 +00:00
Tim Northover	09995ac069	AArch64: generate dwarfdump test rather than include .o in subversion llvm-svn: 174891	2013-02-11 16:28:12 +00:00
Tim Northover	acaa788be6	AArch64: Add basic relocation processing for llvm-dwarfdump. This allows llvm-dwarfdump to handle the relocations needed, at least for LLVM-produced code. llvm-svn: 174874	2013-02-11 11:16:02 +00:00
Tim Northover	45a0d77c48	AArch64: Undo change to how test was run This broke on Windows, presumably due to interleaving of output streams. llvm-svn: 174873	2013-02-11 10:51:41 +00:00
Tim Northover	60baeb984f	Make use of DiagnosticType to provide better AArch64 diagnostics. This gives a DiagnosticType to all AsmOperands in sight. This replaces all "invalid operand" diagnostics with something more specific. The messages given should still be sufficiently vague that they're not usually actively misleading when LLVM guesses your instruction incorrectly. llvm-svn: 174871	2013-02-11 09:29:37 +00:00
Bill Wendling	84ba97698e	FileCheck-ize the tests. llvm-svn: 174865	2013-02-11 08:34:57 +00:00
Kostya Serebryany	d688bab563	[tsan/msan] adding thread_safety and uninitialized_checks attributes llvm-svn: 174864	2013-02-11 08:13:54 +00:00
Andrew Trick	bc7059032b	LSR IVChain improvement. Handle chains in which the same offset is used for both loads and stores to the same array. Fixes rdar://11410078. llvm-svn: 174789	2013-02-09 01:11:01 +00:00
Manman Ren	d2c95eb995	Dwarf: do not use line_table_start in at_stmt_list since we do not always emit line table entries in assembly. llvm-svn: 174785	2013-02-09 00:41:44 +00:00
Reed Kotler	b9bf8dca47	Add the 16 bit version of addiu. To the assembler, the 16 and 32 bit are the same so we put in the comment field an indicator when we think we are emitting the 16 bit version. For the direct object emitter, the difference is important as well as for other passes which need an accurate count of program size. There will be other similar putbacks to this for various instructions. llvm-svn: 174747	2013-02-08 21:42:56 +00:00
Hal Finkel	2581905f81	DAGCombiner: Constant folding around pre-increment loads/stores Previously, even when a pre-increment load or store was generated, we often needed to keep a copy of the original base register for use with other offsets. If all of these offsets are constants (including the offset which was combined into the addressing mode), then this is clearly unnecessary. This change adjusts these other offsets to use the new incremented address. llvm-svn: 174746	2013-02-08 21:35:47 +00:00
Bob Wilson	67bbf3aa0c	Revert 172027 and 174336. Remove diagnostics about over-aligned stack objects. Aside from the question of whether we report a warning or an error when we can't satisfy a requested stack object alignment, the current implementation of this is not good. We're not providing any source location in the diagnostics and the current warning is not connected to any warning group so you can't control it. We could improve the source location somewhat, but we can do a much better job if this check is implemented in the front-end, so let's do that instead. <rdar://problem/13127907> llvm-svn: 174741	2013-02-08 20:35:15 +00:00
Bill Schmidt	62fe7a5b17	Refine fix to bug 15041. Thanks to help from Nadav and Hal, I have a more reasonable (and even correct!) approach. This specifically penalizes the insertelement and extractelement operations for the performance hit that will occur on PowerPC processors. llvm-svn: 174725	2013-02-08 18:19:17 +00:00
Chad Rosier	22d275f7b8	[SimplifyLibCalls] Library call simplification doesn't work if the call site isn't using the default calling convention. However, if the transformation is from a call to inline IR, then the calling convention doesn't matter. rdar://13157990 llvm-svn: 174724	2013-02-08 18:00:14 +00:00
Arnold Schwaighofer	594fa2dc2b	ARM cost model: Address computation in vector mem ops not free Adds a function to target transform info to query for the cost of address computation. The cost model analysis pass now also queries this interface. The code in LoopVectorize adds the cost of address computation as part of the memory instruction cost calculation. Only there, we know whether the instruction will be scalarized or not. Increase the penalty for inserting into D registers on swift. This becomes necessary because we now always assume that address computation has a cost and three is a closer value to the architecture. radar://13097204 llvm-svn: 174713	2013-02-08 14:50:48 +00:00
Alexey Samsonov	897f2cf408	Update tests for DWARF parser: store sources next to pre-built object files and provide build instructions llvm-svn: 174711	2013-02-08 14:34:33 +00:00
Reed Kotler	66165c8f96	When Mips16 frames grow large, the immediate field may exceed the maximum allowed size for the instruction. This code uses RegScavenger to fix this. We sometimes need 2 registers for Mips16 so we must handle things differently than how register scavenger is normally used. llvm-svn: 174696	2013-02-08 03:57:41 +00:00
Andrew Trick	1bd53c3675	Revert "Have InstCombine call SipmlifyCall when handling calls. Test case included." This reverts commit 3854a5d90fee52af1065edbed34521fff6cdc18d. This causes a clang unit test to hang: vtable-available-externally.cpp. llvm-svn: 174692	2013-02-08 01:55:39 +00:00
Michael Ilseman	6092dc5455	Have InstCombine call SipmlifyCall when handling calls. Test case included. llvm-svn: 174675	2013-02-07 23:01:35 +00:00
Akira Hatanaka	061d1ea5da	[mips] Add definition of JALR instruction which has two register operands. Change the original JALR instruction with one register operand to be a pseudo-instruction. llvm-svn: 174657	2013-02-07 19:48:00 +00:00
Michael Ilseman	5485729b9a	Identify and simplify idempotent intrinsics. Test case included. llvm-svn: 174650	2013-02-07 19:26:05 +00:00
Michael J. Spencer	3a967eac1e	[Object][ELF] Fix crash on no dynamic section. llvm-svn: 174639	2013-02-07 18:26:45 +00:00
Arnold Schwaighofer	213fced704	ARM cost model: Add costs for vector selects Vector selects are cheap on NEON. They get lowered to a vbsl instruction. radar://13158753 llvm-svn: 174631	2013-02-07 16:10:15 +00:00
Tom Stellard	e06163a9a6	R600: Add support for SET_DX10 instructions These instructions compare two floating point values and return an integer true (-1) or false (0) value. When compiling code generated by the Mesa GLSL frontend, the SET_DX10 instructions save us four instructions for most branch decisions that use floating-point comparisons. llvm-svn: 174609	2013-02-07 14:02:35 +00:00
Tom Stellard	6d867e8d4d	R600: Add tests for unsupported condition codes. All of the le and lt variants are unsupported. llvm-svn: 174608	2013-02-07 14:02:33 +00:00
Tom Stellard	b40ada9b85	R600: Fix assembly name for SETGT_INT llvm-svn: 174607	2013-02-07 14:02:27 +00:00
Owen Anderson	bfd2ce96c7	Remove this testcase until I can figure out how to properly conditionalize it. llvm-svn: 174591	2013-02-07 07:01:54 +00:00
Owen Anderson	589baf98b4	Another attempt at getting the XFAIL line right for this test. llvm-svn: 174588	2013-02-07 06:26:55 +00:00
Reed Kotler	4a230ffa96	Make sure we call externals from libraries properly when -static. For example, when we are doing mips16 hard float or soft float. llvm-svn: 174583	2013-02-07 04:34:51 +00:00
Reed Kotler	ec60f7d335	Enable jumps when in -static mode. llvm-svn: 174580	2013-02-07 03:49:51 +00:00
Michael Ilseman	1dd6f2a5ba	Preserve fast-math flags after reassociation and commutation. Update test cases llvm-svn: 174571	2013-02-07 01:40:15 +00:00
Michael Ilseman	10f2055812	whitespace llvm-svn: 174569	2013-02-07 01:27:13 +00:00
Owen Anderson	389f7dc7a2	Fix CMake detection of various cmath functions, and XFAIL the test on platforms that are known to be missing them. llvm-svn: 174564	2013-02-07 00:54:05 +00:00
Owen Anderson	d4ebfd8400	Significantly generalize our ability to constant fold floating point intrinsics, including ones on half types. llvm-svn: 174555	2013-02-06 22:43:31 +00:00
Eli Bendersky	d5d4c89bf0	Fix typo llvm-svn: 174553	2013-02-06 22:34:46 +00:00
Eli Bendersky	9fcfe1ed89	Add a comment to the test that points to the source from which the input object file was generated. llvm-svn: 174551	2013-02-06 22:17:40 +00:00
Eli Bendersky	c0d905c0e9	Add a test for checking the current .debug_frame dumping capability. The test is a binary placed in test/DebugInfo/Inputs, with a source C file used for reference/reproducing. The source's first line is a clang build command for reproducing the binary. llvm-svn: 174543	2013-02-06 20:55:06 +00:00
Eli Bendersky	ef4558abd3	This is a follow-up on r174446, now taking Atom processors into account. Atoms use LEA for updating SP in prologs/epilogs, and the exact LEA opcode depends on the data model. Also reapplying the test case which was added and then reverted (because of Atom failures), this time specifying explicitly the CPU in addition to the triple. The test case now checks all variations (data mode, cpu Atom vs. Core). llvm-svn: 174542	2013-02-06 20:43:57 +00:00
Guy Benyei	5ea04c385f	Canonicalize line endings to Linux style also when the --strict-whitespace flag is in use. This flag is supposed to affect horizontal whitespaces only. llvm-svn: 174541	2013-02-06 20:40:38 +00:00
Tim Northover	228d9d3aa2	Implement external weak (ELF) symbols on AArch64 Weakly defined symbols should evaluate to 0 if they're undefined at link-time. This is impossible to do with the usual address generation patterns, so we should use a literal pool entry to materialise the address. llvm-svn: 174518	2013-02-06 16:43:33 +00:00
Tim Northover	a80c4c1a08	Add AArch64 CRC32 instructions These instructions are a late addition to the architecture, and may yet end up behind an optional attribute, but for now they're available at all times. llvm-svn: 174496	2013-02-06 09:13:13 +00:00
Tim Northover	91a51c5a7c	Add icache prefetch operations to AArch64 This adds hints to the various "prfm" instructions so that they can affect the instruction cache as well as the data cache. llvm-svn: 174495	2013-02-06 09:04:56 +00:00
Eli Bendersky	c4446856e3	Remove this test in the meantime, since it won't pass on Atom. Atom uses lea to move the stack pointer in prologs/epilogs. I will fix the test and add it back later. llvm-svn: 174484	2013-02-06 03:15:00 +00:00
Manman Ren	d2c38d684a	Attempt to recover gdb bot after r174445. Failure: undefined symbol 'Lline_table_start0'. Root-cause: we use a symbol subtraction to calculate at_stmt_list, but the line table entries are not dumped in the assembly. Fix: use zero instead of a symbol subtraction for Compile Unit 0. llvm-svn: 174479	2013-02-06 00:59:41 +00:00
Eli Bendersky	59a6fb0381	Test for r174446 llvm-svn: 174464	2013-02-05 23:31:48 +00:00
Manman Ren	4e042a6be6	Dwarf: support for LTO where a single object file can have multiple line tables We generate one line table for each compilation unit in the object file. Reviewed by Eric and Kevin. rdar://problem/13067005 llvm-svn: 174445	2013-02-05 21:52:47 +00:00
Akira Hatanaka	dec25266d7	[mips] Do not use function CC_MipsN_VarArg unless the function being analyzed is a vararg function. The original code was examining flag OutputArg::IsFixed to determine whether CC_MipsN_VarArg or CC_MipsN should be called. This is not correct, since this flag is often set to false when the function being analyzed is a non-variadic function. llvm-svn: 174442	2013-02-05 21:18:11 +00:00
Benjamin Kramer	944e0abf04	InstCombine: Fix and simplify the inttoptr side too. llvm-svn: 174438	2013-02-05 20:22:40 +00:00
Michael Gottesman	a750006ad6	Added missing newline to end of test case. llvm-svn: 174433	2013-02-05 19:39:44 +00:00
Owen Anderson	de89ecf1fc	Reapply r174343, with a fix for a scary DAG combine bug where it failed to differentiate between the alignment of the base point of a load, and the overall alignment of the load. This caused infinite loops in DAG combine with the original application of this patch. ORIGINAL COMMIT LOG: When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174431	2013-02-05 19:24:39 +00:00
Benjamin Kramer	e477875873	InstCombine: Harden code to work with vectors of pointers and simplify it a bit. Found by running instcombine on a fabricated test case for the constant folder. llvm-svn: 174430	2013-02-05 19:21:56 +00:00
Jyotsna Verma	6031625b03	Hexagon: Use TFR_cond with cmpb.[eq,gt,gtu] to handle zext( set[ne,eq,gt,ugt] (...) ) type of dag patterns. llvm-svn: 174429	2013-02-05 19:20:45 +00:00
Benjamin Kramer	a5a9ec5755	ConstantFolding: Fix a crash when encountering a truncating inttoptr. This was introduced in r173293. llvm-svn: 174424	2013-02-05 19:04:36 +00:00
Jyotsna Verma	d53b25b47e	Hexagon: Add testcase for post-increment store instructions. llvm-svn: 174419	2013-02-05 18:23:51 +00:00
Chad Rosier	92a54f6d4c	[SjLj Prepare] When demoting an invoke instruction to the stack, if the normal edge is critical, then split it so we can insert the store. rdar://13126179 llvm-svn: 174418	2013-02-05 18:23:10 +00:00
Jyotsna Verma	50ca6dd8a7	Hexagon: Use multiclass for absolute addressing mode stores. llvm-svn: 174412	2013-02-05 18:15:34 +00:00
Jakob Stoklund Olesen	eb1084ee54	Add a test case for PR14750. This was fixed by r174402. llvm-svn: 174405	2013-02-05 18:04:15 +00:00
Derek Schuff	90aa1d8abe	[MC] Bundle alignment: Invalidate relaxed fragments Currently, when a fragment is relaxed, its size is modified, but its offset is not (it gets laid out as a side effect of checking whether it needs relaxation), then all subsequent fragments are invalidated because their offsets need to change. When bundling is enabled, relaxed fragments need to get laid out again, because the increase in size may push it over a bundle boundary. So instead of only invalidating subsequent fragments, also invalidate the fragment that gets relaxed, which causes it to get laid out again. This patch also fixes some trailing whitespace and fixes the bundling-related debug output of MCFragments. llvm-svn: 174401	2013-02-05 17:55:27 +00:00
Tom Stellard	7d41161a2d	R600: Add tests for instruction predicates llvm-svn: 174393	2013-02-05 17:09:13 +00:00
Tom Stellard	2e5e7a5bef	R600: Emit function name in the AsmPrinter Emitting the function name allows us to check for it in the FileCheck tests so we can make sure FileCheck is checking the output of the correct function. llvm-svn: 174392	2013-02-05 17:09:11 +00:00
Jyotsna Verma	6f635b5488	Hexagon: Add V4 compare instructions. Enable relationship mapping for the existing instructions. llvm-svn: 174389	2013-02-05 16:42:24 +00:00
NAKAMURA Takumi	7ec43d9b37	Formatting. llvm-svn: 174380	2013-02-05 15:32:16 +00:00
NAKAMURA Takumi	6635fe56d3	llvm/test/Transforms/LoopVectorize/X86/vector_ptr_load_store.ll: "-debug" requires +Asserts. llvm-svn: 174379	2013-02-05 15:32:10 +00:00
Arnold Schwaighofer	22174f5d5a	Loop Vectorizer: Handle pointer stores/loads in getWidestType() In the loop vectorizer cost model, we used to ignore stores/loads of a pointer type when computing the widest type within a loop. This meant that if we had only stores/loads of pointers in a loop we would return a widest type of 8bits (instead of 32 or 64 bit) and therefore a vector factor that was too big. Now, if we see a consecutive store/load of pointers we use the size of a pointer (from data layout). This problem occurred in SingleSource/Benchmarks/Shootout-C++/hash.cpp (reduced test case is the first test in vector_ptr_load_store.ll). radar://13139343 llvm-svn: 174377	2013-02-05 15:08:02 +00:00
NAKAMURA Takumi	3753b28cd2	Revert r174343, "When the target-independent DAGCombiner inferred a higher alignment for a load," It caused hangups in compiling clang/lib/Parse/ParseDecl.cpp and clang/lib/Driver/Tools.cpp in stage2 on some hosts. llvm-svn: 174374	2013-02-05 14:44:16 +00:00
Logan Chien	4b724429b8	Link .ARM.exidx with corresponding text section. The sh_link in the ELF section header of .ARM.exidx should be filled with the section index of the corresponding text section. llvm-svn: 174372	2013-02-05 14:18:59 +00:00
Arnold Schwaighofer	a804bbee9b	ARM cost model: Cost for scalar integer casts and floating point conversions Also adds some costs for vector integer float conversions. llvm-svn: 174371	2013-02-05 14:05:55 +00:00
Jack Carter	428a06cc75	This patch sets the Mips ELF header flag for MicroMips architectures. Contributor: Zoran Jovanovic llvm-svn: 174360	2013-02-05 09:30:03 +00:00
Jack Carter	9c1a027fe8	This patch sets the EmitAlias flag in td files and enables the instruction printer to print aliased instructions. Due to usage of RegisterOperands a change in common code (utils/TableGen/AsmWriterEmitter.cpp) is required to get the correct register value if it is a RegisterOperand. Contributor: Vladimir Medic llvm-svn: 174358	2013-02-05 08:32:10 +00:00
Eric Christopher	6a421a944d	Add support for testing the output of the abbrev table for the skeleton CU as part of the DWARF5 split dwarf proposal. llvm-svn: 174351	2013-02-05 07:32:00 +00:00
Eric Christopher	7a2cdf798b	Add support for emitting a stub DW_AT_GNU_dwo_id as part of the DWARF5 split dwarf proposal. llvm-svn: 174350	2013-02-05 07:31:55 +00:00
Michael Gottesman	e2376cdf71	Add code to GlobalVariable.h so that global variables marked as externally_initialized return false for hasDefiniteInitializer and hasUniqueInitializer. rdar://12580965. llvm-svn: 174345	2013-02-05 06:53:26 +00:00
Owen Anderson	a47fdbb032	When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174343	2013-02-05 06:25:30 +00:00
Michael Gottesman	27e7ef326a	Added LLVM Asm/Bitcode Reader/Writer support for new IR keyword externally_initialized. llvm-svn: 174340	2013-02-05 05:57:38 +00:00
Manman Ren	86b1d868ba	[Stack Alignment] emit warning instead of a hard error Per discussion in rdar://13127907, we should emit a hard error only if people write code where the requested alignment is larger than achievable and assumes the low bits are zeros. A warning should be good enough when we are not sure if the source code assumes the low bits are zeros. rdar://13127907 llvm-svn: 174336	2013-02-04 23:45:08 +00:00
Jyotsna Verma	7ab68fbd1d	Hexagon: Add V4 combine instructions and some more Def Pats for V2. llvm-svn: 174331	2013-02-04 15:52:56 +00:00
Benjamin Kramer	c35d526489	Disable a couple more vector splat optimizations on PPC. I didn't see those because the test case used "not grep". FileCheck the test and XFAIL it, preserving the old optimization, so this can be fixed eventually. llvm-svn: 174330	2013-02-04 15:52:32 +00:00
Benjamin Kramer	2c9da989c2	X86: Open up some opportunities for constant folding by postponing shift lowering. Fixes PR15141. llvm-svn: 174327	2013-02-04 15:19:33 +00:00
Benjamin Kramer	548ffa274a	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors. This required disabling a PowerPC optimization that did the following: input: x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16> lowered to: tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8> x = ADD tmp, tmp The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now. Fix some trivially foldable X86 tests too. llvm-svn: 174325	2013-02-04 15:19:18 +00:00
Tim Northover	37b131f607	Update debugging test for change in expected metadata. llvm-svn: 174321	2013-02-04 12:15:00 +00:00
David Blaikie	2811f8ac28	[DebugInfo] remove more node indirection (this time from the subprogram's variable lists) llvm-svn: 174305	2013-02-04 05:56:36 +00:00
Arnold Schwaighofer	98f1012f9b	ARM cost model: Penalize insertelement into D subregisters Swift has a renaming dependency if we load into D subregisters. We don't have a way of distinguishing between insertelement operations of values from loads and other values. Therefore, we are pessimistic for now (The performance problem showed up in example 14 of gcc-loops). radar://13096933 llvm-svn: 174300	2013-02-04 02:52:05 +00:00
David Blaikie	33111dfea0	Remove the (apparently) unnecessary debug info metadata indirection. The main lists of debug info metadata attached to the compile_unit had an extra layer of metadata nodes they went through for no apparent reason. This patch removes that (& still passes just as much of the GDB 7.5 test suite). If anyone can show evidence as to why these extra metadata nodes are there I'm open to reverting this patch & documenting why they're there. llvm-svn: 174266	2013-02-02 05:56:24 +00:00
Reed Kotler	f8933f83f0	Start static relocation implementation for mips16. This checkin makes hello world work. llvm-svn: 174264	2013-02-02 04:07:35 +00:00
Manman Ren	053e4ff008	Removing ssp and uwtable from the testcase llvm-svn: 174259	2013-02-02 01:34:38 +00:00
Shuxin Yang	cadd8a068e	rdar://13126763 Fix a bug in DAGCombine. The symptom is mistakenly optimizing expression "x + x*x" into "x * 3.0". llvm-svn: 174239	2013-02-02 00:22:03 +00:00
Manman Ren	e697d3cd2e	[Dwarf] avoid emitting multiple AT_const_value for static members. Testing case is reduced from MultiSource/BenchMarks/Prolangs-C++/deriv1. rdar://problem/13071590 llvm-svn: 174235	2013-02-01 23:54:37 +00:00
Bill Schmidt	52742c25ae	LLVM enablement for some older PowerPC CPUs llvm-svn: 174230	2013-02-01 22:59:51 +00:00
Dan Gohman	9ee4bc1abc	Add a testcase for some past-the-end address subtleties. llvm-svn: 174210	2013-02-01 19:37:52 +00:00
David Sehr	8114a7a651	Two changes relevant to LEA and x32: 1) allows the use of RIP-relative addressing in 32-bit LEA instructions under x86-64 (ILP32 and LP64) 2) separates the size of address registers in 64-bit LEA instructions from control by ILP32/LP64. llvm-svn: 174208	2013-02-01 19:28:09 +00:00
Jyotsna Verma	10f5c2db4e	Hexagon: Test case to confirm generation of indexed loads with zero offset. llvm-svn: 174196	2013-02-01 16:40:06 +00:00
Benjamin Kramer	c05aa958b1	InstSimplify: stripAndComputeConstantOffsets can be called with vectors of pointers too. Prepare it for vectors of pointers and handle simple cases. We don't handle complicated cases because accumulateConstantOffset bails on pointer vectors. Fixes selfhost on i386. llvm-svn: 174179	2013-02-01 15:21:10 +00:00
Tim Northover	e3d4236402	Add explicit triples to AArch64 tests Only Linux is supported at the moment, and other platforms quickly fault. As a result these tests would fail on non-Linux hosts. It may be worth making the tests more generic again as more platforms are supported. llvm-svn: 174170	2013-02-01 11:40:47 +00:00
Nadav Rotem	4349f6963e	Revert r174152. The shift amount may overflow and in that case this transformation is illegal. llvm-svn: 174156	2013-02-01 07:59:33 +00:00
Nadav Rotem	1d584029ae	Optimize shift lefts of a constant by a value plus constant into a single shift. llvm-svn: 174152	2013-02-01 06:45:40 +00:00
Dan Gohman	b3e2d3a638	Rewrite instsimplify's handling of icmp on pointer values to remove the remaining use of AliasAnalysis concepts such as isIdentifiedObject to prove pointer inequality. @external_compare in test/Transforms/InstSimplify/compare.ll shows a simple case where a noalias argument can be equal to a global variable address, and while AliasAnalysis can get away with saying that these pointers don't alias, instsimplify cannot say that they are not equal. llvm-svn: 174122	2013-02-01 00:11:13 +00:00
Dan Gohman	995d40e1e2	An alloca can be equal to an argument. It can't alias an alloca, but it could be equal, since there's nothing preventing a caller from correctly predicting the stack location of an alloca. llvm-svn: 174119	2013-01-31 23:49:33 +00:00
Bill Wendling	1c7cc8ae90	Remove the AttrBuilder form of the Attribute::get creators. The AttrBuilder is for building a collection of attributes. The Attribute object holds only one attribute. So it's not really useful for the Attribute object to have a creator which takes an AttrBuilder. This has two fallouts: 1. The AttrBuilder no longer holds its internal attributes in a bit-mask form. 2. The attributes are now ordered alphabetically (hence why the tests have changed). llvm-svn: 174110	2013-01-31 23:16:25 +00:00
Tom Stellard	4926921bd4	R600: Fold clamp, neg, abs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174099	2013-01-31 22:11:54 +00:00
Manman Ren	aec2ce7db4	Linker: correctly link in dbg.declare This is a re-worked version of r174048. Given source IR: call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !14), !dbg !15 we used to generate call void @llvm.dbg.declare(metadata !27, metadata !28), !dbg !29 !27 = metadata !{null} With this patch, we will correctly generate call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !27), !dbg !28 Looking up %argc.addr in ValueMap will return null, since %argc.addr is already correctly set up, we can use identity mapping. rdar://problem/13089880 llvm-svn: 174093	2013-01-31 21:19:18 +00:00
Lang Hames	dd47804394	When lowering memcpys to loads and stores, make sure we don't promote alignments past the natural stack alignment. llvm-svn: 174085	2013-01-31 20:23:43 +00:00
Derek Schuff	b76ec3bb5e	[MC] bundle alignment: prevent padding instructions from crossing bundle boundaries llvm-svn: 174067	2013-01-31 17:00:03 +00:00
Tim Northover	e0e3aefdd3	Add AArch64 as an experimental target. This patch adds support for AArch64 (ARM's 64-bit architecture) to LLVM in the "experimental" category. Currently, it won't be built unless requested explicitly. This initial commit should have support for: + Assembly of all scalar (i.e. non-NEON, non-Crypto) instructions (except the late addition CRC instructions). + CodeGen features required for C++03 and C99. + Compilation for the "small" memory model: code+static data < 4GB. + Absolute and position-independent code. + GNU-style (i.e. "__thread") TLS. + Debugging information. The principal omission, currently, is performance tuning. This patch excludes the NEON support also reviewed due to an outbreak of batshit insanity in our legal department. That will be committed soon bringing the changes to precisely what has been approved. Further reviews would be gratefully received. llvm-svn: 174054	2013-01-31 12:12:40 +00:00
Pekka Jaaskelainen	995a3e731d	Made the min-trip-count-switch test X86-specific to avoid breakage with builds without X86-support. llvm-svn: 174052	2013-01-31 10:33:22 +00:00
Alexey Samsonov	5234a8ed9f	Revert r173946. This breaks compilation of googletest with Clang llvm-svn: 174048	2013-01-31 08:02:11 +00:00
Michael Gottesman	41e4ac4224	Filecheckized 2x tests in SimplifyCFG and removed their date prefix to fit with current llvm style for test names. llvm-svn: 174011	2013-01-31 01:04:23 +00:00
Eric Christopher	4e3e94c13d	Check and allow floating point registers to select the size of the register for inline asm. This conforms to how gcc allows for effective casting of inputs into gprs (fprs is already handled). llvm-svn: 174008	2013-01-31 00:50:46 +00:00
Eli Bendersky	6c84b90b70	Replace some more greps with FileChecks in tests llvm-svn: 174006	2013-01-31 00:44:12 +00:00
Eli Bendersky	a320e00e74	Rewrite this test properly with a FileCheck instead of greps llvm-svn: 173997	2013-01-31 00:11:52 +00:00
Dan Gohman	6a61fccb96	Fix ConstantFold's folding of icmp instructions to recognize that, for example, a one-past-the-end pointer from one global variable may be equal to the base pointer of another global variable. llvm-svn: 173995	2013-01-31 00:01:45 +00:00
Hal Finkel	e1df90958d	PPC QPX requires a 32-byte aligned stack On systems which support the QPX vector instructions, the stack must be 32-byte aligned. llvm-svn: 173993	2013-01-30 23:43:27 +00:00
Evan Cheng	9449ec956f	Forgot the test case before. llvm-svn: 173988	2013-01-30 22:57:00 +00:00
Hal Finkel	efb305e54c	Add definitions for the PPC a2q core marked as having QPX available This is the first commit of a large series which will add support for the QPX vector instruction set to the PowerPC backend. This instruction set is used on the IBM Blue Gene/Q supercomputers. llvm-svn: 173973	2013-01-30 21:17:42 +00:00
Manman Ren	81dcc62805	Linker: correctly link in dbg.declare Given source IR: call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !14), !dbg !15 we used to generate call void @llvm.dbg.declare(metadata !27, metadata !28), !dbg !29 !27 = metadata !{null} With this patch, we will correctly generate call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !27), !dbg !28 Looking up %argc.addr in ValueMap will return null, since %argc.addr is already correctly set up, we can use identity mapping. llvm-svn: 173946	2013-01-30 17:42:15 +00:00
Eli Bendersky	2e2ce49e59	Add a special ARM trap encoding for NaCl. More details in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130128/163783.html Patch by JF Bastien llvm-svn: 173943	2013-01-30 16:30:19 +00:00
Logan Chien	a436e4c7e4	Add missing header and test cases for r173939. llvm-svn: 173941	2013-01-30 15:48:50 +00:00
Nadav Rotem	513bd8a73c	InstCombine: canonicalize sext-and --> select sext-not-and --> select. Patch by Muhammad Tauqir Ahmad. llvm-svn: 173901	2013-01-30 06:35:22 +00:00
Saleem Abdulrasool	26127bd746	build: add --with-python option This adds a new --with-python option to allow configuration of the python binary for building. If not specified, $PATH will be searched for common python binary names (python, python2, python3). If specified, and the path is not executable, it will attempt to search $PATH. Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> Reviewed-by: Eric Christopher <echristo@gmail.com>, Daniel Dunbar <daniel@zuster.org> llvm-svn: 173890	2013-01-30 04:07:37 +00:00
Jack Carter	718da0b53b	This patch implements runtime ARM specific setting of ELF header e_flags. Contributor: Jack Carter llvm-svn: 173885	2013-01-30 02:24:33 +00:00
Jack Carter	7f378104b6	This patch implements runtime Mips specific setting of ELF header e_flags. Contributor: Jack Carter llvm-svn: 173884	2013-01-30 02:16:36 +00:00
Jack Carter	1bd90ff6cc	This patch reworks how llvm targets set and update ELF header e_flags. Currently gathering information such as symbol, section and data is done by collecting it in an MCAssembler object. From MCAssembler and MCAsmLayout objects ELFObjectWriter::WriteObject() forms and streams out the ELF object file. This patch just adds a few members to the MCAssembler class to store and access the e_flag settings. It allows for runtime additions to the e_flag by assembler directives. The standalone assembler can get to MCAssembler from getParser().getStreamer().getAssembler(). This patch is the generic infrastructure and will be followed by patches for ARM and Mips for their target specific use. Contributor: Jack Carter llvm-svn: 173882	2013-01-30 02:09:52 +00:00
Akira Hatanaka	4385564e97	[mips] Test case for r173862. Patch by Sasa Stankovic. llvm-svn: 173863	2013-01-30 00:28:15 +00:00
Renato Golin	5e9d55eca0	Adding simple cast cost to ARM Changing ARMBaseTargetMachine to return ARMTargetLowering instead of the generic one (similar to x86 code). Tests showing which instructions were added to cast when necessary or cost zero when not. Downcast to 16 bits are not lowered in NEON, so costs are not there yet. llvm-svn: 173849	2013-01-29 23:31:38 +00:00
Michael J. Spencer	54b24e1000	[MC][COFF] Delay handling symbol aliases when writing Fixes PR14447 and PR9034. Patch by Nico Rieck! llvm-svn: 173839	2013-01-29 22:10:07 +00:00
Pekka Jaaskelainen	f50ab84bb1	LoopVectorize: convert TinyTripCountVectorThreshold constant to a command line switch. llvm-svn: 173837	2013-01-29 21:42:08 +00:00
David Blaikie	9a7a7a9a6f	Support artificial parameters in function types. Provides the functionality for Clang change r172911 - I just had this still lying around. llvm-svn: 173820	2013-01-29 19:35:24 +00:00
Tim Northover	a0edd3ee66	Fix 64-bit atomic operations in Thumb mode. The ARM and Thumb variants of LDREXD and STREXD have different constraints and take different operands. Previously the code expanding atomic operations didn't take this into account and asserted in Thumb mode. llvm-svn: 173780	2013-01-29 09:06:13 +00:00
Craig Topper	c048154b9b	Merge SSE and AVX shuffle instructions in the comment printer. llvm-svn: 173777	2013-01-29 07:54:31 +00:00
Bill Wendling	f2955aa3f2	Convert getAttributes() to return an AttributeSetNode. The AttributeSetNode contains all of the attributes. This removes one (hopefully last) use of the Attribute class as a container of multiple attributes. llvm-svn: 173761	2013-01-29 03:20:31 +00:00
Andrew Kaylor	6d8776a514	Add support for source and line information to IntelJITEventListener for object emitted by MCJIT. llvm-svn: 173712	2013-01-28 19:52:37 +00:00
Bill Schmidt	2e4ae4e154	This patch addresses bug 15031. The common code in the post-RA scheduler to break anti-dependencies on the critical path contained a flaw. In the reported case, an anti-dependency between the overlapping registers %X4 and %R4 exists: %X29<def> = OR8 %X4, %X4 %R4<def>, %X3<def,dead,tied3> = LBZU 1, %X3<kill,tied1> The unpatched code breaks the dependency by replacing %R4 and its uses with %R3, the first register on the available list. However, %R3 and %X3 overlap, so this creates two overlapping definitions on the same instruction. The fix is straightforward, preventing selection of a register that overlaps any other defined register on the same instruction. The test case is reduced from the bug report, and verifies that we no longer produce "lbzu 3, 1(3)" when breaking this anti-dependency. llvm-svn: 173706	2013-01-28 18:36:58 +00:00
Evgeniy Stepanov	6f85ef300d	[msan] Mostly disable msan-handle-icmp-exact. It is way too slow. Change the default option value to 0. Always do exact shadow propagation for unsigned ICmp with constants, it is cheap (under 1% cpu time) and required for correctness. llvm-svn: 173682	2013-01-28 11:42:28 +00:00
Craig Topper	5c683972bc	Fix 256-bit PALIGNR comment decoding to understand that it works on independent 256-bit lanes. llvm-svn: 173674	2013-01-28 07:41:18 +00:00
Richard Osborne	038d24f90c	[XCore] Add missing l2rus instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173634	2013-01-27 22:28:30 +00:00
Richard Osborne	f2ecd40929	[XCore] Add missing l2r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173629	2013-01-27 21:26:02 +00:00
Richard Osborne	7fe8f63544	[XCore] Add missing 1r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173624	2013-01-27 20:46:21 +00:00
Richard Osborne	8f56317287	[XCore] Add missing 0r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173623	2013-01-27 20:42:57 +00:00
Benjamin Kramer	05cc93964a	When the legalizer is splitting vector shifts, the result may not have the right shift amount type. Fix that by adding a cast to the shift expander. This came up with vector shifts on sse-less X86 CPUs. <2 x i64> = shl <2 x i64> <2 x i64> -> i64,i64 = shl i64 i64; shl i64 i64 -> i32,i32,i32,i32 = shl_parts i32 i32 i64; shl_parts i32 i32 i64 Now we cast the last two i64s to the right type. Fixes the crash in PR14668. llvm-svn: 173615	2013-01-27 11:19:11 +00:00
Chandler Carruth	329b590e6e	Re-revert r173342, without losing the compile time improvements, flat out bug fixes, or functionality preserving refactorings. llvm-svn: 173610	2013-01-27 06:42:03 +00:00
David Blaikie	9f4b70dde0	PR14566: Debug Info: Removing top level lexical blocks This adds support for LLVM to accept metadata that doesn't include a top level lexical block in a function. Specifically LLVM couldn't handle this when there were file changes relating to these blocks. I've updated a few test cases to ensure other functionality (such as inlining) isn't affected by this change, but haven't pervasively updated all the test cases. llvm-svn: 173592	2013-01-26 21:55:23 +00:00
Benjamin Kramer	6a93596538	X86: Decode PALIGN operands so I don't have to do it in my head. llvm-svn: 173572	2013-01-26 13:31:37 +00:00
Benjamin Kramer	99c68dd964	X86: Do splat promotion later, so the optimizer can chew on it first. This catches many cases where we can emit a more efficient shuffle for a specific mask or when the mask contains undefs. Once the splat is lowered to unpacks we can't do that anymore. There is a possibility of moving the promotion after pshufb matching, but I'm not sure if pshufb with a mask loaded from memory is faster than 3 shuffles, so I avoided that for now. llvm-svn: 173569	2013-01-26 11:44:21 +00:00
Benjamin Kramer	7268a05178	FileCheckize and merge some tests. llvm-svn: 173568	2013-01-26 11:14:32 +00:00
Andrew Kaylor	9a8ff813f3	Add DIContext::getLineInfoForAddressRange() function and test. This function allows a caller to obtain a table of line information for a function using the function's address and size. llvm-svn: 173537	2013-01-26 00:28:05 +00:00
NAKAMURA Takumi	8653bcf024	llvm/test/CMakeLists.txt: Add a dependency to llvm-rtdyld in check-llvm. llvm-svn: 173528	2013-01-25 23:24:07 +00:00
Hal Finkel	4e5ca9e578	Initial implementation of PPCTargetTransformInfo This provides a place to add customized operation cost information and control some other target-specific IR-level transformations. The only non-trivial logic in this checkin assigns a higher cost to unaligned loads and stores (covered by the included test case). llvm-svn: 173520	2013-01-25 23:05:59 +00:00
Andrew Kaylor	d55d7019fc	Add support for applying in-memory relocations to the .debug_line section and, in the case of ELF files, using symbol addresses when available for relocations to the .debug_info section. Also extending the llvm-rtdyld tool to add the ability to dump line number information for testing purposes. llvm-svn: 173517	2013-01-25 22:50:58 +00:00
Reid Kleckner	1aa3784960	XFAIL close-stderr on win32 The test runner does not rewrite instances of /dev/null inside the quoted sh command. /dev/null does not exist, so opt will fail to open it, and return a non-zero exit code. llvm-svn: 173509	2013-01-25 22:12:54 +00:00
Reid Kleckner	0198e00318	Set the +x bit on two batch scripts Cygwin git-svn will faithfully forward the svn properties all the way down to the NTFS executable permission. Without the +x bit, tests using these scripts fail with "Access Denied". llvm-svn: 173508	2013-01-25 22:12:50 +00:00
Reid Kleckner	ab083f727b	FileCheck-ify some grep tests These tests in particular try to use escaped square brackets as an argument to grep, which is failing for me with native win32 python. It appears the backslash is being lost near the CreateProcess*() call. llvm-svn: 173506	2013-01-25 22:11:46 +00:00
Eli Bendersky	597fc1233a	In this patch, we teach X86_64TargetMachine that it has an ILP32 (defined by the x32 ABI) mode, in which case its pointers are 32-bits in size. This knowledge is also added to X86RegisterInfo that now returns the appropriate registers in getPointerRegClass. There are many outcomes to this change. In order to keep the patches separate and manageable, we start by focusing on some simple testable cases. The patch adds a test with passing a pointer to a function - focusing on the difference between the two data models for x86-64. Another test is added for handling of 'sret' arguments (and functionality is added in X86ISelLowering to make it work). A note on naming: the "x32 ABI" document refers to the AMD64 architecture (in LLVM it's distinguished by being is64Bits() in the x86 subtarget) with two variations: the LP64 (default) data model, and the ILP32 data model. This patch adds predicates to the subtarget which are consistent with this naming scheme. llvm-svn: 173503	2013-01-25 22:07:43 +00:00
Eli Bendersky	158ea095c0	Add back a RUN line removed by mistake by a previous commit llvm-svn: 173502	2013-01-25 21:58:09 +00:00
Richard Osborne	6b86eec819	Add instruction encodings / disassembly support for l4r instructions. llvm-svn: 173501	2013-01-25 21:55:32 +00:00
Eli Bendersky	e6abe83258	Now that llvm-dwarfdump supports flags to specify which DWARF section to dump, use them in tests that run llvm-dwarfdump. This is in order to make tests as specific as possible. llvm-svn: 173498	2013-01-25 21:44:53 +00:00
Hal Finkel	1a57ba57a2	Improve the !add TableGen test case. Suggested by Sean Silva. llvm-svn: 173481	2013-01-25 20:29:25 +00:00
Eli Bendersky	7a94daa170	Add command-line flags for DWARF dumping. Flags for dumping specific DWARF sections added in lib/DebugInfo and llvm-dwarfdump. llvm-svn: 173480	2013-01-25 20:26:43 +00:00
Richard Osborne	a19fa86a70	Add instruction encodings / disassembly support for l5r instructions. llvm-svn: 173479	2013-01-25 20:20:07 +00:00
Evgeniy Stepanov	fac8403249	[msan] Implement exact shadow propagation for relational ICmp. Only for integers, pointers, and vectors of those. No floats. Instrumentation seems very heavy, and may need to be replaced with some approximation in the future. llvm-svn: 173452	2013-01-25 15:31:10 +00:00
Hal Finkel	c7d4dc13a4	Add an addition operator to TableGen This adds an !add(a, b) operator to tablegen; this will be used to cleanup the PPC register definitions. llvm-svn: 173445	2013-01-25 14:49:08 +00:00
Silviu Baranga	3eb45a03af	Fixed the condition codes for the atomic64 min/umin code generation on ARM. If the subtraction of the higher 32 bit parts gives a 0 result, we need to do the store operation. llvm-svn: 173437	2013-01-25 10:39:49 +00:00
Andrew Trick	e2c3f5c982	MISched: Improve the interface to SchedDFS analysis (subtrees). Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state. llvm-svn: 173425	2013-01-25 06:33:57 +00:00
Chandler Carruth	ceff222dea	Switch this code away from Value::isUsedInBasicBlock. That code either loops over instructions in the basic block or the use-def list of the value, neither of which are really efficient when repeatedly querying about values in the same basic block. What's more, we already know that the CondBB is small, and so we can do a much more efficient test by counting the uses in CondBB, and seeing if those account for all of the uses. Finally, we shouldn't blanket fail on any such instruction, instead we should conservatively assume that those instructions are part of the cost. Note that this actually fixes a bug in the pass because isUsedInBasicBlock has a really terrible bug in it. I'll fix that in my next commit, but the fix for it would make this code suddenly take the compile time hit I thought it already was taking, so I wanted to go ahead and migrate this code to a faster & better pattern. The bug in isUsedInBasicBlock was also causing other tests to test the wrong thing entirely: for example we weren't actually disabling speculation for floating point operations as intended (and tested), but the test passed because we failed to speculate them due to the isUsedInBasicBlock failure. llvm-svn: 173417	2013-01-25 05:40:09 +00:00
Andrew Trick	44f750a3e5	MISched: Add SchedDFSResult to ScheduleDAGMI to formalize the interface and allow other strategies to select it. llvm-svn: 173413	2013-01-25 04:01:04 +00:00
Jack Carter	07c818d2da	This patch implements parsing the .word directive for the Mips assembler. Contributor: Vladimir Medic llvm-svn: 173407	2013-01-25 01:31:34 +00:00
Akira Hatanaka	28aed9ca85	[mips] Set the neverHasSideEffects flag on some of the floating point instructions. llvm-svn: 173401	2013-01-25 00:20:39 +00:00
Benjamin Kramer	1c4e323fdd	Reapply chandlerc's r173342 now that the miscompile it was triggering is fixed. Original commit message: Plug TTI into the speculation logic, giving it a real cost interface that can be specialized by targets. The goal here is not to be more aggressive, but to just be more accurate with very obvious cases. There are instructions which are known to be truly free and which were not being modeled as such in this code -- see the regression test which is distilled from an inner loop of zlib. Everywhere the TTI cost model is insufficiently conservative I've added explicit checks with FIXME comments to go add proper modelling of these cost factors. If this causes regressions, the likely solution is to make TTI even more conservative in its cost estimates, but test cases will help here. llvm-svn: 173357	2013-01-24 16:44:25 +00:00

18664 Commits