llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	3053029310	Remove unnecessary leading comment characters in lit-only file llvm-svn: 177327	2013-03-18 22:08:16 +00:00
Manman Ren	1217112d11	Check whether a pointer is non-null (isKnownNonNull) in isKnownNonZero. This handles the case where we have an inbounds GEP with alloca as the pointer. This fixes the regression in PR12750 and rdar://13286434. Note that we can also fix this by handling some GEP cases in isKnownNonNull. llvm-svn: 177321	2013-03-18 21:23:25 +00:00
David Blaikie	99067af791	Include '.test' suffix in target specific lit configs that need it Apparently my final cleanup to use a relevant suffix for these tests before committing r176831 caused them to stop running since lit wasn't configured to run tests with that suffix in those directories (why don't we just have a global suffix list?). So, add the suffix to the relevant directories & fix the test that has bitrotted over the last week due to my debug info schema changes. llvm-svn: 177315	2013-03-18 20:31:44 +00:00
Hal Finkel	21f2a43ab4	Fix large count and negative constant count handling in PPCCTRLoops This commit fixes an assert that would occur on loops with large constant counts (like looping for ((uint32_t) -1) iterations on PPC64). The existing code did not handle counts that it computed to be negative (asserting instead), but these can be created with valid inputs. This bug was discovered by bugpoint while I was attempting to isolate a completely different problem. Also, in writing test cases for the negative-count problem, I discovered that the ori/lsi handling was broken (there was a typo which caused the logic that was supposed to detect these pairs and extract the iteration count to always fail). This has now also been corrected (and is covered by one of the new test cases). llvm-svn: 177295	2013-03-18 17:40:44 +00:00
Hal Finkel	12337e4e7d	Cleanup initial-value constants in PPCCTRLoops Because the initial-value constants had not been added to the list of instructions considered for DCE the resulting code had redundant constant-materialization instructions. llvm-svn: 177294	2013-03-18 17:40:27 +00:00
David Tweed	d505b24277	Initially forgotten-to-svn-add test case for r177279. llvm-svn: 177280	2013-03-18 12:07:24 +00:00
Kostya Serebryany	10cc12f2b7	[asan] when creating string constants, set unnamed_addr and align 1 so that equal strings are merged by the linker. Observed up to 1% binary size reduction. Thanks to Anton Korobeynikov for the suggestion llvm-svn: 177264	2013-03-18 09:38:39 +00:00
Kostya Serebryany	6b5b58deeb	[asan] don't instrument functions with available_externally linkage. This saves a bit of compile time and reduces the number of redundant global strings generated by asan (https://code.google.com/p/address-sanitizer/issues/detail?id=167 ) llvm-svn: 177250	2013-03-18 07:33:49 +00:00
Craig Topper	0498b88d48	Post process ADC/SBB and use a shorter encoding if they use a sign extended immediate. llvm-svn: 177243	2013-03-18 03:34:55 +00:00
Craig Topper	7e9a1cb199	Refactor some duplicated code into helper functions. llvm-svn: 177242	2013-03-18 02:53:34 +00:00
Michael Gottesman	a8b60a4fda	Reduced dont-infinite-loop-during-block-escape-analysis.ll with bugpoint and moved it to retain-block-escape-analysis.ll. NOTE I verified that the original bug behind dont-infinite-loop-during-block-escape-analysis.ll occurs when using opt on retain-block-escape-analysis.ll. llvm-svn: 177240	2013-03-17 21:31:12 +00:00
David Blaikie	8fb8224578	Split out filename & directory from DIFile to start generalizing over DIScopes This is the first step to making all DIScopes have a common metadata prefix (so that things (using directives, for example) that can appear in any scope can be added to that common prefix). DIFile is itself a DIScope so the common prefix of all DIScopes cannot be a DIFile - instead it's the raw filename/directory name pair. llvm-svn: 177239	2013-03-17 21:13:55 +00:00
David Blaikie	2e488d1f0d	Generalize debug info test to be resilient to changes in metadata node numbering llvm-svn: 177238	2013-03-17 21:08:22 +00:00
Michael Gottesman	9782183126	The promised test case for r175939. This test makes sure that the ObjCARC escape analysis looks at the uses of instructions which copy the block pointer value by checking all four cases where that can occur. llvm-svn: 177232	2013-03-17 08:42:58 +00:00
Hal Finkel	fcc51d4ff1	Improve PPC VR (Altivec) register spilling This change cleans up two issues with Altivec register spilling: 1. The spilling code was inefficient (using two instructions, an add and a load, when just one would do) 2. The code assumed that r0 would always be available (true for now, but this will change) The new code handles VR spilling just like GPR spills but forced into r+r mode. As a result, when any VR spills are present, we must now always allocate the register-scavenger spill slot. llvm-svn: 177231	2013-03-17 04:43:44 +00:00
Hal Finkel	57080382e6	Remove FIXMEs in PPC test cases related to unaligned loads/stores As pointed out by Bill in response to r177160, these two FIXMEs can also be removed. llvm-svn: 177229	2013-03-16 23:02:31 +00:00
Craig Topper	612f7bfa4d	Add X86 code emitter support AVX encoded MRMDestReg instructions. Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao. llvm-svn: 177221	2013-03-16 03:44:31 +00:00
Arnold Schwaighofer	9d7a3827e4	ARM cost model: Fix costs for some vector selects I was too pessimistic in r177105. Vector selects that fit into a legal register type lower just fine. I was misled by the code fragment that I was using. The stores/loads that I saw in those cases came from lowering the conditional off an address. Changing the code fragment to: %T0_3 = type <8 x i18> %T1_3 = type <8 x i1> define void @func_blend3(%T0_3* %loadaddr, %T0_3* %loadaddr2, %T1_3* %blend, %T0_3* %storeaddr) { %v0 = load %T0_3* %loadaddr %v1 = load %T0_3* %loadaddr2 ==> FROM: ;%c = load %T1_3* %blend ==> TO: %c = icmp slt %T0_3 %v0, %v1 ==> USE: %r = select %T1_3 %c, %T0_3 %v0, %T0_3 %v1 store %T0_3 %r, %T0_3* %storeaddr ret void } revealed this mistake. radar://13403975 llvm-svn: 177170	2013-03-15 18:31:01 +00:00
Silviu Baranga	82dd6ac3bc	Adding an A15 specific optimization pass for interactions between S/D/Q registers. The pass handles all the required transformations pre-regalloc. llvm-svn: 177169	2013-03-15 18:28:25 +00:00
Benjamin Kramer	2f5457141a	ARM: Fix an old refacto. Fixes PR15520. llvm-svn: 177167	2013-03-15 17:27:39 +00:00
Hal Finkel	8d7fbc9dad	Enable unaligned memory access on PPC for scalar types Unaligned access is supported on PPC for non-vector types, and is generally more efficient than manually expanding the loads and stores. A few of the existing test cases were using expanded unaligned loads and stores to test other features (like load/store with update), and for these test cases, unaligned access remains disabled. llvm-svn: 177160	2013-03-15 15:27:13 +00:00
Arnold Schwaighofer	f5284ff61f	ARM cost model: Fix cost of fptrunc and fpext instructions A vector fptrunc and fpext simply gets split into scalar instructions. radar://13192358 llvm-svn: 177159	2013-03-15 15:10:47 +00:00
Hal Finkel	b0fac42987	Protect PPC Altivec patterns with a predicate In preparation for the addition of other SIMD ISA extensions (such as QPX) we need to make sure that all Altivec patterns are properly predicated on having Altivec support. No functionality change intended (one test case needed to be updated b/c it assumed that Altivec intrinsics would be supported without enabling Altivec support). llvm-svn: 177152	2013-03-15 13:21:21 +00:00
Alexey Samsonov	cd27b98d38	Fixup for r176933: more careful setup of path to llvm-symbolizer llvm-svn: 177144	2013-03-15 07:27:49 +00:00
Rafael Espindola	ef9d3494b2	Fix the FDE encoding to be relative on ELF. This is a very late complement to r130637 which fixed this on x86_64. Fixes pr15448. Since it looks like that every elf architecture uses this encoding when using cfi, make it the default for elf. Just exclude mips64el. It has a lovely .ll -> .o test (ef_frame.ll) that tests that nothing changes in the binary content of the .eh_frame produced by llc. Oblige it. llvm-svn: 177141	2013-03-15 05:51:57 +00:00
Hal Finkel	bb420f10e9	Allocate the RS spill slot for any PPC function with spills and a large stack frame For spills into a large stack frame, the FI-elimination code uses the register scavenger to obtain a free GPR for use with an r+r-addressed load or store. When there are no available GPRs, the scavenger gets one by using its spill slot. Previously, we were not always allocating that spill slot and the RS would assert when the spill slot was needed. I don't currently have a small test that triggered the assert, but I've created a small regression test that verifies that the spill slot is now added when the stack frame is sufficiently large. llvm-svn: 177140	2013-03-15 05:06:04 +00:00
Nadav Rotem	4a4827ce21	Add a triple to the test. llvm-svn: 177131	2013-03-15 00:10:23 +00:00
Nadav Rotem	adfa5eaf8c	Unaligned loads should use the VMOVUPS opcode. llvm-svn: 177130	2013-03-14 23:49:44 +00:00
Arnold Schwaighofer	9b55e31bcb	LoopVectorizer: Insert some white space to make test case more readable Also remove some unneeded function attributes. llvm-svn: 177114	2013-03-14 21:31:09 +00:00
Chad Rosier	4b54f594b4	[fast-isel] The X86FastISel::FastLowerArguments function doesn't properly handle the win64 calling convention. rdar://13423768 llvm-svn: 177113	2013-03-14 21:25:04 +00:00
Hal Finkel	e987a311ba	Not all PPC functions with a frame pointer need a RS spill slot We used to add a spill slot for the register scavenger whenever the function has a frame pointer. This is unnecessarily conservative: We may need the spill slot for dynamic stack allocations, and functions with dynamic stack allocations always have a FP, but we might also have a FP for other reasons (such as the user explicitly disabling frame-pointer elimination), and we don't necessarily need a spill slot for those functions. The structsinregs test needed adjustment because it disables FP elimination. llvm-svn: 177106	2013-03-14 19:34:32 +00:00
Arnold Schwaighofer	8070b382ec	ARM cost model: Increase cost of some vector selects we do terrible on By terrible I mean we store/load from the stack. This matters on PAQp8 in _Z5trainPsS_ii (which is inlined into Mixer::update) where we decide to vectorize a loop with a VF of 8 resulting in a 25% degradation on a cortex-a8. LV: Found an estimated cost of 2 for VF 8 For instruction: icmp slt i32 LV: Found an estimated cost of 2 for VF 8 For instruction: select i1, i32, i32 The bug that tracks the CodeGen part is PR14868. radar://13403975 llvm-svn: 177105	2013-03-14 19:17:02 +00:00
Jyotsna Verma	ec613665c2	Hexagon: Removed asserts regarding alignment and offset. We are warning the user about the alignment, so we should not assert. llvm-svn: 177103	2013-03-14 19:08:03 +00:00
Arnold Schwaighofer	4991ce9d49	Add missing asserts flag to test - it uses debug flags llvm-svn: 177102	2013-03-14 19:01:58 +00:00
Arnold Schwaighofer	c63cf3a0ae	LoopVectorize: Invert case when we use a vector cmp value to query select cost We generate a select with a vectorized condition argument when the condition is NOT loop invariant. Not the other way around. llvm-svn: 177098	2013-03-14 18:54:36 +00:00
Shuxin Yang	2eca602f8b	Perform factorization as a last resort of unsafe fadd/fsub simplification. Rules include: 1) x*y +/- x*z => x*(y +/- z) (the order of operands doesn't matter) 2) y/x +/- z/x => (y +/- z)/x The transformation is disabled if the new add/sub expr "y +/- z" is a denormal/NaN/infinity. rdar://12911472 llvm-svn: 177088	2013-03-14 18:08:26 +00:00
Adrian Prantl	ed6d955416	Test that we emit a DW_AT_location for self captured by a block. This is the backend part of a CFE test with the same name. llvm-svn: 177087	2013-03-14 17:54:13 +00:00
Vincent Lejeune	0a22bc4156	R600: Factorize code handling Const Read Port limitation llvm-svn: 177078	2013-03-14 15:50:45 +00:00
Alexey Samsonov	819eddc3ce	[ASan] emit instrumentation for initialization order checking by default llvm-svn: 177063	2013-03-14 12:38:58 +00:00
Chandler Carruth	a1c54bbe34	PR14972: SROA vs. GVN exposed a really bad bug in SROA. The fundamental problem is that SROA didn't allow for overly wide loads where the bits past the end of the alloca were masked away and the load was sufficiently aligned to ensure there is no risk of page fault, or other trapping behavior. With such widened loads, SROA would delete the load entirely rather than clamping it to the size of the alloca in order to allow mem2reg to fire. This was exposed by a test case that neatly arranged for GVN to run first, widening certain loads, followed by an inline step, and then SROA which miscompiles the code. However, I see no reason why this hasn't been plaguing us in other contexts. It seems deeply broken. Diagnosing all of the above took all of 10 minutes of debugging. The really annoying aspect is that fixing this completely breaks the pass. ;] There was an implicit reliance on the fact that no loads or stores extended past the alloca once we decided to rewrite them in the final stage of SROA. This was used to encode information about whether the loads and stores had been split across multiple partitions of the original alloca. That required threading explicit tracking of whether a use of a partition is split across multiple partitions. Once that was done, another problem arose: we allowed splitting of integer loads and stores iff they were loads and stores to the entire alloca. This is a really arbitrary limitation, and splitting at least some integer loads and stores is crucial to maximize promotion opportunities. My first attempt was to start removing the restriction entirely, but currently that does Very Bad Things by causing many common alloca patterns to be fully decomposed into i8 operations and lots of or-ing together to produce larger integers on demand. The code bloat is terrifying. 
That is still the right end-goal, but substantial work must be done to either merge partitions or ensure that small i8 values are eagerly merged in some other pass. Sadly, figuring all this out took essentially all the time and effort here. So the end result is that we allow splitting only when the load or store at least covers the alloca. That ensures widened loads and stores don't hurt SROA, and that we don't rampantly decompose operations more than we have previously. All of this was already fairly well tested, and so I've just updated the tests to cover the wide load behavior. I can add a test that crafts the pass ordering magic which caused the original PR, but that seems really brittle and to provide little benefit. The fundamental problem is that widened loads should Just Work. llvm-svn: 177055	2013-03-14 11:32:24 +00:00
Craig Topper	872999737d	Fix a bug in the calculation of the VEX.B bit for FMA4 rr with the VEX.W bit set. The VEX.B was being calculated from the wrong operand. Fixes at least some portion of PR14185. llvm-svn: 177014	2013-03-14 07:40:52 +00:00
Michael Liao	20d287044c	Fix PR15309 - Fix the typo on type checking llvm-svn: 177010	2013-03-14 06:57:42 +00:00
Jiong Wang	5bbb96d7df	test commit: remove blank line. llvm-svn: 177009	2013-03-14 05:43:59 +00:00
Nick Lewycky	3d28d4dee7	Remove a change to the debug info in this test, that I made while testing something else and forgot to remove. llvm-svn: 177007	2013-03-14 05:28:10 +00:00
Nick Lewycky	d11060d971	Try using %S to find the emitted .gcno file. llvm-svn: 177006	2013-03-14 05:23:30 +00:00
Nick Lewycky	fdfed3e9c9	Refactor GCOV's six constructor arguments into a struct with a getter that constructs default arguments. It can now take default arguments from cl::opt'ions. Add a new -default-gcov-version=... option, and actually test it! Sink the reverse-order of the version into GCOVProfiling, hiding it from our users. llvm-svn: 177002	2013-03-14 05:13:26 +00:00
David Blaikie	aabfe4f997	Simplify file/directory name handling in DILexicalBlock llvm-svn: 176993	2013-03-13 22:52:59 +00:00
David Blaikie	3254616cc3	Remove an extra operand to a DIFile metadata entry (extra cleanup/fallout from r176983 - not sure why I didn't catch this locally) llvm-svn: 176988	2013-03-13 22:33:09 +00:00
David Blaikie	0d221159a0	Remove the unused 4th operand for DIFile debug info metadata llvm-svn: 176983	2013-03-13 22:05:21 +00:00
Arnold Schwaighofer	9f2e0fa52e	ARM cost model: Add test case to make sure we would notice a change in CodeGen In r176898 I updated the cost model to reflect the fact that sext/zext/cast on v8i32 <-> v8i8 and v16i32 <-> v16i8 are expensive. This test case is so that we make sure to update the cost model once we fix CodeGen. llvm-svn: 176955	2013-03-13 16:25:55 +00:00
Evgeniy Stepanov	3aa627b09d	Add llvm-symbolizer as test dependency. It is required when building tests with ASan or MSan. llvm-svn: 176941	2013-03-13 09:35:18 +00:00
Evgeniy Stepanov	e9c2d3f950	Set symbolizer path in the test environment. This is needed to get symbolized stack traces when running LLVM tests under (A|M)San. llvm-svn: 176933	2013-03-13 06:58:09 +00:00
David Blaikie	1ca2f36289	Refactor filename/directory in DICompileUnit into a DIFile This is the next step towards making the metadata for DIScopes have a common prefix rather than having to delegate based on their tag type. llvm-svn: 176913	2013-03-13 00:01:35 +00:00
David Blaikie	452c3ff649	Remove unused "isMain" field from DICompileUnit llvm-svn: 176910	2013-03-12 22:43:04 +00:00
David Blaikie	a4f770d51c	Update debug info test cases with empty SplitDebugFilename field. This could be 'null' or the empty string, DIDescriptor::getStringField coalesces the two cases anyway so it's just a matter of legible/efficient representation. The change in behavior of the DICompileUnit::get* functions could be subsumed by the full verification check - but ideally that should just be an assertion if we could front-load the actual debug info metadata failure paths. llvm-svn: 176907	2013-03-12 22:25:36 +00:00
Arnold Schwaighofer	90774f3c8f	ARM cost model: Increase the cost for vector casts that use the stack Increase the cost of v8/v16-i8 to v8/v16-i32 casts and truncates as the backend currently lowers those using stack accesses. This was responsible for a significant degradation on MultiSource/Benchmarks/Trimaran/enc-pc1/enc-pc1 where we vectorize one loop to a vector factor of 16. After this patch we select a vector factor of 4 which will generate reasonable code. unsigned char cle[32]; void test(short c) { unsigned short compte; for (compte = 0; compte <= 31; compte++) { cle[compte] = cle[compte] ^ c; } } radar://13220512 llvm-svn: 176898	2013-03-12 21:19:22 +00:00
David Blaikie	854b2af17f	Correct invalid debug info metadata Code review feedback on r176838 by Patrik Hägglund. llvm-svn: 176884	2013-03-12 19:04:24 +00:00
Jan Wen Voung	6dc3076080	Revert the test moves from 176733. Use "REQUIRES: asserts" instead. llvm-svn: 176873	2013-03-12 16:27:52 +00:00
Hal Finkel	01271c6022	Don't reserve R2 on Darwin/PPC Now that only the register-scavenger version of the CR spilling code remains, we no longer need the Darwin R2 hack. Darwin can use R0 as a spare register in any case where the System V ABI uses it (R0 is special architecturally, and so is reserved under all common ABIs). A few test cases needed to be updated to reflect the register-allocation changes. llvm-svn: 176868	2013-03-12 15:18:14 +00:00
Patrik Hagglund	ba6f3221d6	In r169695, the address space limit for tests was replaced with a data segment limit. Now, as a complement, add a stack space limit. Otherwise, tests may grow undesirably large on infinite recursion. (Seen at r176838, test/Assembler/2010-02-05-FunctionLocalMetadataBecomesNull.ll) llvm-svn: 176862	2013-03-12 12:38:10 +00:00
NAKAMURA Takumi	e781913ac4	llvm/test/CodeGen/R600/schedule-*.ll: Let them require +Asserts. llvm-svn: 176835	2013-03-11 23:16:30 +00:00
David Blaikie	47922fb006	Upgrading debug info test cases to be (more) compatible with the current debug info format. These cases were found by further work to remove support for debug info versioning. Common cleanups (other than changing the version info in the tag field) included adding the last parameter to compile_units (recently added for fission support) and other cases of trailing fields in lexical blocks, compile units, and subprograms. llvm-svn: 176834	2013-03-11 22:37:40 +00:00
David Blaikie	789beb5300	Remove duplicate test contents. llvm-svn: 176831	2013-03-11 22:10:14 +00:00
Nick Lewycky	48beb21185	Fix a crasher newly introduced in r176659/r176649, where fast-isel tries to lower an expect intrinsic that is a constant expression. llvm-svn: 176830	2013-03-11 21:44:37 +00:00
Kevin Enderby	f15856ebb4	Fixes disassembler crashes on 2013 Haswell RTM instructions. rdar://13318048 llvm-svn: 176828	2013-03-11 21:17:13 +00:00
Bill Wendling	9534d8885f	Don't remove a landing pad if the invoke requires a table entry. An invoke may require a table entry. For instance, when the function it calls is expected to throw. <rdar://problem/13360379> llvm-svn: 176827	2013-03-11 20:53:00 +00:00
Vincent Lejeune	e5ecf10a02	R600: Fix JUMP handling so that MachineInstr verification can occur This allows R600 Target to use the newly created -verify-misched llc flag llvm-svn: 176819	2013-03-11 18:15:06 +00:00
NAKAMURA Takumi	a60c7a0f4b	llvm/test/CodeGen/X86/handle-move.ll: Mark it as XFAIL:cygming. Investigating. llvm-svn: 176808	2013-03-11 16:30:26 +00:00
NAKAMURA Takumi	1e02e73c30	Suppress atomic(32|64).ll as XFAIL on win32 codegen. Investigating. llvm-svn: 176798	2013-03-11 08:39:48 +00:00
Lang Hames	82d48e7fb0	Remove date from test case file name. The PR number provides a unique ID already. llvm-svn: 176796	2013-03-11 03:49:23 +00:00
Lang Hames	be3d971143	Don't glue users to extract_subreg when selecting the llvm.arm.ldrexd intrinsic - it can cause impossible-to-schedule subgraphs to be introduced. PR15053. llvm-svn: 176777	2013-03-09 22:56:09 +00:00
Benjamin Kramer	fc0c7bf0d7	Fix test case. llvm-svn: 176773	2013-03-09 18:34:27 +00:00
Benjamin Kramer	01b75cc0f2	Test case hygiene. llvm-svn: 176772	2013-03-09 18:25:40 +00:00
Arnold Schwaighofer	4090b61ac3	LoopVectorizer: Ignore dbg.value instructions We want vectorization to happen at -g. Ignore calls to the dbg.value intrinsic and don't transfer them to the vectorized code. radar://13378964 llvm-svn: 176768	2013-03-09 15:56:34 +00:00
Nick Lewycky	c1f9694d05	We need a shndx if the number of sections breaks SHN_LORESERVE. This condition for choosing to emit a shndx was simply testing the wrong variable. llvm-svn: 176762	2013-03-09 09:31:44 +00:00
Jan Wen Voung	7857a64909	Disable statistics on Release builds and move tests that depend on -stats. Summary: Statistics are still available in Release+Asserts (any +Asserts builds), and stats can also be turned on with LLVM_ENABLE_STATS. Move some of the FastISel stats that were moved under DEBUG() back out of DEBUG(), since stats are disabled across the board now. Many tests depend on grepping "-stats" output. Move those into a orig_dir/Stats/. so that they can be marked as unsupported when building without statistics. Differential Revision: http://llvm-reviews.chandlerc.com/D486 llvm-svn: 176733	2013-03-08 22:56:31 +00:00
David Blaikie	1f7ff93cda	Remove -print-dbginfo as it is unused & bitrotten. This pass hasn't been touched in two years & would fail with assertions against the current debug info metadata format (the only test case for it still uses a many-versions old debug info metadata format) llvm-svn: 176707	2013-03-08 18:17:46 +00:00
Jakob Stoklund Olesen	8d1aaf21cf	Rewrite the physreg part of findLastUseBefore(). To find the last use of a register unit, start from the bottom and scan upwards until a user is found. <rdar://problem/13353090> llvm-svn: 176706	2013-03-08 18:08:57 +00:00
Benjamin Kramer	10a74ed434	Force cpu in test. llvm-svn: 176702	2013-03-08 17:01:18 +00:00
Benjamin Kramer	37c2d65c5a	Insert the reduction start value into the first bypass block to preserve domination. Fixes PR15344. llvm-svn: 176701	2013-03-08 16:58:37 +00:00
Tom Stellard	5e524897ed	R600: Optimize another selectcc case fold selectcc (selectcc x, y, a, b, cc), b, a, b, setne -> selectcc x, y, a, b, cc Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176700	2013-03-08 15:37:11 +00:00
Tom Stellard	2add82de09	R600: Improve custom lowering of select_cc Two changes: 1. Prefer SET* instructions when possible 2. Handle the CND*_INT case with floating-point args Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176699	2013-03-08 15:37:09 +00:00
Tom Stellard	492ebeabe9	R600: Change operation action from Custom to Expand for BR_CC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176698	2013-03-08 15:37:07 +00:00
Tom Stellard	e8f9f2877b	R600: Change operation action from Custom to Expand for SETCC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176697	2013-03-08 15:37:05 +00:00
Tom Stellard	d93ef7afaa	LegalizeDAG: Respect the result of TLI.getBooleanContents() when expanding SETCC llvm-svn: 176695	2013-03-08 15:37:02 +00:00
Vincent Lejeune	2bc2730765	R600: Change addresspace in fold-kcache.ll AddressSpace definition has changed in a previous commit, reflect it to avoid false failure. llvm-svn: 176693	2013-03-08 15:34:07 +00:00
Tim Northover	c8f1a5de9f	AArch64: specify full triple in test as only Linux works for now. llvm-svn: 176692	2013-03-08 15:27:30 +00:00
Christian Konig	21442994a7	R600/SI: adjust test to recent changes Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176691	2013-03-08 14:44:00 +00:00
Jyotsna Verma	7825e064b9	Hexagon: Add patterns for zero extended loads from i1->i64. llvm-svn: 176689	2013-03-08 14:15:15 +00:00
Tim Northover	95f4892d4c	AArch64: expand sincos operations, we don't support them. Patch based on Mans Rullgard's. llvm-svn: 176688	2013-03-08 13:55:07 +00:00
David Blaikie	e7838a4c53	Another test fix for r176671. llvm-svn: 176679	2013-03-08 02:27:40 +00:00
David Blaikie	d17cfedf8b	Couple of test fixes for r176671. Not sure why these aren't failing on my linux machine, but this should cover it. llvm-svn: 176678	2013-03-08 02:26:16 +00:00
Bill Wendling	2d915e2c15	Revert r176154 in favor of a better approach. Code generation makes some basic assumptions about the IR it's been given. In particular, if there is only one 'invoke' in the function, then that invoke won't be going away. However, with the advent of the `llvm.donothing' intrinsic, those invokes may go away. If all of them go away, the landing pad no longer has any users. This confuses the back-end, which asserts. This happens with SjLj exceptions, because that's the model that modifies the IR based on there being invokes, etc. in the function. Remove any invokes of `llvm.donothing' during SjLj EH preparation. This will give us a CFG that the back-end won't be confused about. If all of the invokes in a function are removed, then the SjLj EH prepare pass won't insert the bogus code that relies upon the invokes being there. <rdar://problem/13228754&13316637> llvm-svn: 176677	2013-03-08 02:21:08 +00:00
David Blaikie	e5a2f704a4	Upgrade tests to the latest debug info format. Mostly this is just changing the named metadata (llvm.dbg.sp, llvm.dbg.gv, llvm.dbg.<func>.lv, etc -> llvm.dbg.cu), adding a few fields to older records (DIVariable: flags/inlined-at, DICompileUnit: sp/gv/types, DISubprogram: local variables list) The tests to update were discovered by a change I'm working on to remove debug info version support - so any tests using old debug info versions I haven't updated probably are bad tests or just not actually designed to test debug info. llvm-svn: 176671	2013-03-08 00:23:31 +00:00
Chad Rosier	9c1796f877	[fast-isel] Add support for the expect intrinsic. rdar://13370942 llvm-svn: 176649	2013-03-07 20:42:17 +00:00
Jyotsna Verma	c7dcc2fbc5	Hexagon: Handle i8, i16 and i1 Var Args. llvm-svn: 176647	2013-03-07 20:28:34 +00:00
Jyotsna Verma	2ba0c0b927	Hexagon: Add support to lower block address. llvm-svn: 176637	2013-03-07 19:10:28 +00:00
Benjamin Kramer	5e5fd6bb4f	Move testcase, this is testing extraction not inserting. llvm-svn: 176635	2013-03-07 18:51:02 +00:00
Benjamin Kramer	2c3d0df8ee	X86: Fold EXTRACT_SUBVECTORs of a BUILD_VECTOR into a smaller BUILD_VECTOR. That can usually be lowered efficiently and is common in sandybridge code. It would be nice to do this in DAGCombiner but we can't insert arbitrary BUILD_VECTORs this late. Fixes PR15462. llvm-svn: 176634	2013-03-07 18:48:40 +00:00
Jim Grosbach	48a91abc10	SDAG: Handle scalarizing an extend of a <1 x iN> vector. Just scalarize the element and rebuild a vector of the result type from that. rdar://13281568 llvm-svn: 176614	2013-03-07 05:47:54 +00:00
Manman Ren	1e4272085d	Debug Info: store the files and directories for each compile unit. We now emit a line table for each compile unit. To reduce the prologue size of each line table, the files and directories used by each compile unit are stored in std::map<unsigned, std::vector< > > instead of std::vector< >. The prologue for a lto'ed image can be as big as 93K. Duplicating 93K for each compile unit causes a huge increase of debug info. With this patch, each prologue will only emit the files required by the compile unit. rdar://problem/13342023 llvm-svn: 176605	2013-03-07 01:42:00 +00:00
Andrew Trick	a0a5ca06b9	SimplifyCFG fix for volatile load/store. Fixes rdar:13349374. Volatile loads and stores need to be preserved even if the language standard says they are undefined. "volatile" in this context means "get out of the way compiler, let my platform handle it". Additionally, this is the only way I know of with llvm to write to the first page (when hardware allows) without dropping to assembly. llvm-svn: 176599	2013-03-07 01:03:35 +00:00
Michael Liao	d5cac37dc5	Fix two remaining issues after fixing PR15355 when CMOV is not available - Phi nodes should be replaced/updated after lowering CMOV into branch because 'mainMBB' updating operand in Phi node is changed. - Add EFLAGS in livein before lowering the 2nd CMOV. It's necessary as we will reuse the EFLAGS generated before the 1st lowered CMOV, which won't clobber EFLAGS. However, we need to explicitly specify that. - '-attr=-cmov' test cases are added. llvm-svn: 176598	2013-03-07 01:01:29 +00:00
Akira Hatanaka	0f693a8a77	[mips] Custom-legalize BR_JT. In N64-static, GOT address is needed to compute the branch address. llvm-svn: 176580	2013-03-06 21:32:03 +00:00
Shuxin Yang	408bdad5b4	Memory Dependence Analysis (not mem-dep test) takes advantage of "invariant.load" metadata. The "invariant.load" metadata indicates the memory unit being accessed is immutable. A load annotated with this metadata can be moved across any store. As I am not sure if it is legal to move such loads across barrier/fence, this change does not allow such transformation. rdar://11311484 Thank Arnold for code review. llvm-svn: 176562	2013-03-06 17:48:48 +00:00
Jim Grosbach	95d2eb95c3	InstCombine: Don't shrink allocas when combining with a bitcast. When considering folding a bitcast of an alloca into the alloca itself, make sure we don't shrink the amount of memory being allocated, or things rapidly go sideways. rdar://13324424 llvm-svn: 176547	2013-03-06 05:44:53 +00:00
Akira Hatanaka	a9cf03fbd7	[mips] Add a line which checks function name. Rename file. llvm-svn: 176543	2013-03-06 01:58:03 +00:00
Michael Liao	da22b30be5	Fix PR15355 - Clear 'mayStore' flag when loading from the atomic variable before the spin loop - Clear kill flag from one use to multiple use in registers forming the address to that atomic variable - don't use a physical register as live-in register in BB (neither entry nor landing pad.) by copying it into virtual register (patch by Cameron Zwarich) llvm-svn: 176538	2013-03-06 00:17:04 +00:00
Akira Hatanaka	1454ed8ad3	[mips] Remove android calling convention. This calling convention was added just to handle functions which return vector of floats. The fix committed in r165585 solves the problem. llvm-svn: 176530	2013-03-05 23:22:30 +00:00
Akira Hatanaka	e092f72956	[mips] Fix MipsCC::analyzeReturn so that, in soft-float mode, fp128 gets returned in registers $2 and $4. llvm-svn: 176527	2013-03-05 22:54:59 +00:00
Akira Hatanaka	5f3ba9e595	[mips] Fix MipsTargetLowering::LowerCallResult and LowerReturn to correctly handle fp128 returns. llvm-svn: 176523	2013-03-05 22:41:55 +00:00
Akira Hatanaka	3b7391d140	[mips] Fix MipsTargetLowering::LowerCall to pass fp128 arguments in floating point registers. llvm-svn: 176521	2013-03-05 22:20:28 +00:00
Akira Hatanaka	4b634fa3b3	[mips] Correct handling of fp128 (long double) formals and read long double parameters from floating point registers if target is mips64 hard float. llvm-svn: 176520	2013-03-05 22:13:04 +00:00
Jyotsna Verma	457801f7ab	reverting patch 176508. llvm-svn: 176513	2013-03-05 20:29:23 +00:00
Jyotsna Verma	7179e712dd	Hexagon: Add support for lowering block address. llvm-svn: 176508	2013-03-05 19:37:46 +00:00
Jyotsna Verma	0eeea14e3e	Hexagon: Expand addc, adde, subc and sube. llvm-svn: 176505	2013-03-05 19:04:47 +00:00
Eli Bendersky	59d7cb2386	Fixes a test by replacing .align by .p2align and setting triples explicitly. Patch by David Sehr llvm-svn: 176502	2013-03-05 18:56:14 +00:00
Jyotsna Verma	f4e324f4fb	Hexagon: Add encoding bits to the TFR64 instructions. Set isMoveImm, isAsCheapAsAMove flags for TFRI instructions. llvm-svn: 176499	2013-03-05 18:42:28 +00:00
David Sehr	af76f18fc3	Add a test that .align directives on capable processors use long NOPs. llvm-svn: 176490	2013-03-05 16:46:54 +00:00
Vincent Lejeune	3b6f20e944	R600: Turn BUILD_VECTOR into Reg_Sequence Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176487	2013-03-05 15:04:49 +00:00
Vincent Lejeune	a199d01e4d	R600: Use MUL_IEEE for trig/fdiv intrinsic Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176485	2013-03-05 15:04:37 +00:00
NAKAMURA Takumi	3ae057c6ab	llvm/test/CodeGen/Mips/mips64-f128.ll: Add explicit -mtriple=mips64el-unknown-unknown to appease win32. FIXME: Is it expected for win32 to affect mips targets? llvm-svn: 176471	2013-03-05 02:18:59 +00:00
NAKAMURA Takumi	cc91d50289	llvm/test/CodeGen/Thumb/iabs.ll: Add explicit -mtriple=thumb-unknown-unknown to appease win32 hosts. llvm-svn: 176470	2013-03-05 02:18:52 +00:00
David Sehr	4c8979cd4d	The current X86 NOP padding uses one long NOP followed by the remainder in one-byte NOPs. If the processor actually executes those NOPs, as it sometimes does with aligned bundling, this can have a performance impact. From my micro-benchmarks run on my one machine, a 15-byte NOP followed by twelve one-byte NOPs is about 20% worse than a 15 followed by a 12. This patch changes NOP emission to emit as many 15-byte (the maximum) as possible followed by at most one shorter NOP. llvm-svn: 176464	2013-03-05 00:02:23 +00:00
Lang Hames	30be8a30cc	Check isDiscardableIfUnused, rather than hasLocalLinkage, when bumping GlobalValue linkage up to ExternalLinkage in the ExtractGV pass. This prevents linkonce and linkonce_odr symbols from being DCE'd. llvm-svn: 176459	2013-03-04 22:40:44 +00:00
Akira Hatanaka	c7828356aa	[mips] Print move instructions. "move $4, $5" is printed instead of "or $4, $5, $zero". llvm-svn: 176455	2013-03-04 22:25:01 +00:00
Jack Carter	0e149b04f6	Mips specific inline assembler constraint 'R' 'R' An address that can be used in a non-macro load or store. This patch includes a positive test case. llvm-svn: 176452	2013-03-04 21:33:15 +00:00
Eli Bendersky	4e1db8d7f7	Reapply r176381, writing the CHECKs in a more forgiving manner to account for running llvm-objdump on Darwin. llvm-svn: 176443	2013-03-04 18:20:31 +00:00
Preston Gurd	485296d1e8	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Jim Grosbach	a3c5c769d6	ARM: Creating a vector from a lane of another. The VDUP instruction source register doesn't allow a non-constant lane index, so make sure we don't construct a ARM::VDUPLANE node asking it to do so. rdar://13328063 http://llvm.org/bugs/show_bug.cgi?id=13963 llvm-svn: 176413	2013-03-02 20:16:24 +00:00
Arnold Schwaighofer	99cba9697a	ARM NEON: Fix v2f32 float intrinsics Mark them as expand, they are not legal as our backend does not match them. llvm-svn: 176410	2013-03-02 19:38:33 +00:00
Nuno Lopes	589443bd93	recommit r172363 & r171325 (reverted in r172756) This adds minimalistic support for PHI nodes to llvm.objectsize() evaluation fingers crossed so that it doesn't break clang bootstrap again.. llvm-svn: 176408	2013-03-02 11:36:24 +00:00
Arnold Schwaighofer	20ef54f4c1	X86 cost model: Adjust cost for custom lowered vector multiplies This matters for example in following matrix multiply: int mmult(int rows, int cols, int m1, int m2, int m3) { int i, j, k, val; for (i=0; i<rows; i++) { for (j=0; j<cols; j++) { val = 0; for (k=0; k<cols; k++) { val += m1[i][k] * m2[k][j]; } m3[i][j] = val; } } return(m3); } Taken from the test-suite benchmark Shootout. We estimate the cost of the multiply to be 2 while we generate 9 instructions for it and end up being quite a bit slower than the scalar version (48% on my machine). Also, properly differentiate between avx1 and avx2. On avx-1 we still split the vector into 2 128bits and handle the subvector muls like above with 9 instructions. Only on avx-2 will we have a cost of 9 for v4i64. I changed the test case in test/Transforms/LoopVectorize/X86/avx1.ll to use an add instead of a mul because with a mul we now no longer vectorize. I did verify that the mul would be indeed more expensive when vectorized with 3 kernels: for (i ...) r += a[i] * 3; for (i ...) m1[i] = m1[i] * 3; // This matches the test case in avx1.ll and a matrix multiply. In each case the vectorized version was considerably slower. radar://13304919 llvm-svn: 176403	2013-03-02 04:02:52 +00:00
Nadav Rotem	739e37a0d2	PR14448 - prevent the loop vectorizer from vectorizing the same loop twice. The LoopVectorizer often runs multiple times on the same function due to inlining. When this happens the loop vectorizer often vectorizes the same loops multiple times, increasing code size and adding unneeded branches. With this patch, the vectorizer during vectorization puts metadata on scalar loops and marks them as 'already vectorized' so that it knows to ignore them when it sees them a second time. PR14448. llvm-svn: 176399	2013-03-02 01:33:49 +00:00
Michael Gottesman	ee45c03fec	Revert "Rewrite a test to count emitted instructions without using -stats" This reverts commit aac7922b8fe7ae733d3fe6697e6789fd730315dc. I am reverting the commit since it broke the phase 1 public buildbot for a few hours. http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RA/builds/2137 llvm-svn: 176394	2013-03-02 00:53:20 +00:00
Akira Hatanaka	ece459bb66	[mips] Fix inefficient code generation. This patch eliminates the need to emit a constant move instruction when this pattern is matched: (select (setgt a, Constant), T, F) The pattern above effectively turns into this: (conditional-move (setlt a, Constant + 1), F, T) llvm-svn: 176384	2013-03-01 21:52:08 +00:00
Eli Bendersky	0091e2ff00	Rewrite a test to count emitted instructions without using -stats Also removed the comments of "should produce..." because they completely don't match the actually produced output. llvm-svn: 176381	2013-03-01 21:34:37 +00:00
Akira Hatanaka	3d055580a9	Set properties for f128 type. llvm-svn: 176378	2013-03-01 21:11:44 +00:00
Eli Bendersky	10ab5e72e1	Rewrite a test to check actual output rather than intermediate implementation detail. The way this test was written, it was relying on an implementation detail (fixups) and hence was very brittle (relying, among other things, on the exact ordering of statistics printed by MC). The test was rewritten to check a more observable output difference. While it doesn't cover 100% of the things the original test covered, it's a good practice to write regression tests this way. If we want to check that internal details and invariants hold, such tests should be expressed as unit tests. llvm-svn: 176377	2013-03-01 20:54:00 +00:00
Edwin Vane	510c341517	No need to force-create clang-tools-extra lit.site.cfg The make (all) target takes care of creating lit configs and auto-generating tests. The problem with the original 'lit.site.cfg' target is it's not recursive and doesn't fully create everything necessary for testing clang-tools-extra. llvm-svn: 176374	2013-03-01 19:58:58 +00:00
Michael Liao	d10584e38b	Add regression tests (WORKSFORME) - These tests won't crash on trunk but would be better to add them so that they don't break again in the future. llvm-svn: 176369	2013-03-01 19:23:37 +00:00
Chad Rosier	b3864609cf	Generate an error message instead of asserting or segfaulting when we can't handle indirect register inputs. rdar://13322011 llvm-svn: 176367	2013-03-01 19:12:05 +00:00
Benjamin Kramer	12f98fae98	LoopVectorize: Don't hang forever if a PHI only has skipped PHI uses. Fixes PR15384. llvm-svn: 176366	2013-03-01 19:07:31 +00:00
Michael Liao	6af16fc3b7	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only returns scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specified scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	9660343b42	Add support for using non-pic code for arm and thumb1 when emitting the sjlj dispatch code. As far as I can tell the thumb2 code is behaving as expected. I was able to compile and run the associated test case for both arm and thumb1. rdar://13066352 llvm-svn: 176363	2013-03-01 18:30:38 +00:00
Christian Konig	3c54770365	R600/SI: fix sampler tests after fixing wait insertions Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176359	2013-03-01 17:39:05 +00:00
Jyotsna Verma	8425643728	Hexagon: Add constant extender support framework. llvm-svn: 176358	2013-03-01 17:37:13 +00:00
Akira Hatanaka	e9e588dd72	[mips] Remove unused option. Fix 80-column violations. llvm-svn: 176330	2013-03-01 02:17:02 +00:00
Akira Hatanaka	8f7bfb39be	[mips] Add the capability to search delay slot filling instructions in successor basic blocks. Currently this is off by default. llvm-svn: 176329	2013-03-01 02:03:51 +00:00
Akira Hatanaka	e01ff9dc60	[mips] Add capability to search in the forward direction for instructions that can fill the delay slot. Currently, this is off by default. llvm-svn: 176320	2013-03-01 00:50:52 +00:00
Akira Hatanaka	eb33ced08f	[mips] Define class MemDefsUses. This class tracks dependence between memory instructions using underlying objects of memory operands. llvm-svn: 176313	2013-03-01 00:16:31 +00:00
Quentin Colombet	e684a6d4aa	Fix a bug in instcombine for fmul in fast math mode. The instcombine recognized pattern looks like: a = b * c d = a +/- Cst or a = b * c d = Cst +/- a When creating the new operands for fadd or fsub instruction following the related fmul, the first operand was created with the second original operand (M0 was created with C1) and the second with the first (M1 with Opnd0). The fix consists in creating the new operands with the appropriate original operand, i.e., M0 with Opnd0 and M1 with C1. llvm-svn: 176300	2013-02-28 21:12:40 +00:00
Benjamin Kramer	f7cfac7a14	Cost model support for lowered math builtins. We make the cost for calling libm functions extremely high as emitting the calls is expensive and causes spills (on x86) so performance suffers. We still vectorize important calls like ceilf and friends on SSE4.1. and fabs. Differential Revision: http://llvm-reviews.chandlerc.com/D466 llvm-svn: 176287	2013-02-28 19:09:33 +00:00
Tim Northover	ce17020c97	AArch64: remove post-encoder method from FCMP (immediate) instructions. The work done by the post-encoder (setting architecturally unused bits to 0 as required) can be done by the existing operand that covers the "#0.0". This removes at least one use of the discouraged PostEncoderMethod uses. llvm-svn: 176261	2013-02-28 14:46:14 +00:00
Tim Northover	c3c5c0971d	AArch64: be more careful resorting to inefficient addressing for weak vars. If an otherwise weak var is actually defined in this unit, it can't be undefined at runtime so we can use normal global variable sequences (ADRP/ADD) to access it. llvm-svn: 176259	2013-02-28 14:36:31 +00:00
Tim Northover	b9d4fd210b	AArch64: don't drop GlobalAddress offset when handling extern_weak decls. llvm-svn: 176258	2013-02-28 14:36:24 +00:00
Tim Northover	9fafdf6d5a	AArch64: Use cbnz instead of cmp/b.ne pair for atomic operations. llvm-svn: 176253	2013-02-28 13:52:07 +00:00
Evgeniy Stepanov	00062b4498	[msan] Implement sanitize_memory attribute. Shadow checks are disabled and memory loads always produce fully initialized values in functions that don't have a sanitize_memory attribute. Value and argument shadow is propagated as usual. This change also updates blacklist behaviour to match the above. llvm-svn: 176247	2013-02-28 11:25:14 +00:00
Renato Golin	0d89178ba3	Corrections for XFAIL armv5 tests Most of the tests that behave differently on llvm-arm-linux buildbot did so because the triple wasn't set correctly to armv5, so we can revert most of the special behaviour added previously. Some tests still need the special treatment, though. llvm-svn: 176243	2013-02-28 10:05:10 +00:00
Manman Ren	584e4c0eda	Debug Info: for static member variables, always put AT_MIPS_linkage_name to the definition DIE (TAG_variable), and put AT_MIPS_linkage_name to TAG_member when DarwinGDBCompat is true. Darwin GDB needs AT_MIPS_linkage_name at both places to work. Follow-up patch to r176143. rdar://problem/13291234 llvm-svn: 176220	2013-02-27 23:21:02 +00:00
Jim Grosbach	5f21587648	ARM: FMA is legal only if VFP4 is available. rdar://13306723 llvm-svn: 176212	2013-02-27 21:31:12 +00:00
Tim Northover	29931ab21d	ARM: permit full range of valid ADR immediates. This fixes an issue where trying to assemble valid ADR instructions would cause LLVM to hit a failed assertion. Patch by Keith Walker. llvm-svn: 176189	2013-02-27 16:43:09 +00:00
Benjamin Kramer	dc145816fd	LoopVectorize: Vectorize math builtin calls. This properly asks TargetLibraryInfo if a call is available and if it is, it can be translated into the corresponding LLVM builtin. We don't vectorize sqrt() yet because I'm not sure about the semantics for negative numbers. The other intrinsic should be exact equivalents to the libm functions. Differential Revision: http://llvm-reviews.chandlerc.com/D465 llvm-svn: 176188	2013-02-27 15:24:19 +00:00
Meador Inge	9b47f6414b	IR: Don't constant fold GEP bitcasts between different address spaces PR15262 reported a bug where the following instruction: i8 getelementptr inbounds i8* bitcast ([4 x i8] addrspace(12)* @buf to i8), i32 2 was getting folded into: addrspace(12) getelementptr inbounds ([4 x i8] addrspace(12)* @buf, i32 0, i32 2) This caused instcombine to crash because the original instruction and the folded instruction have different types. The issue was fixed by disallowing bitcasts between different address spaces to be folded away. llvm-svn: 176156	2013-02-27 02:26:42 +00:00
Manman Ren	683f59b36c	SelectionDAG: If llvm.donothing has a landingpad, we should clear CurrentCallSite to avoid an assertion failure: assert(MMI.getCurrentCallSite() == 0 && "Overlapping call sites!"); rdar://problem/13228754 llvm-svn: 176154	2013-02-27 02:11:57 +00:00
Manman Ren	5ae44d2b75	Debug Info: for static member variables, add AT_MIPS_linkage_name to the definition DIE, to make old GDB happy. We have a regression for old GDB when Clang uses DW_TAG_member to declare static members inside a class, instead of DW_TAG_variable. This patch will fix this regression. rdar://problem/13291234 llvm-svn: 176143	2013-02-27 00:02:32 +00:00
Michael Ilseman	a7b93c1e5f	Constant fold vector bitcasts of halves similarly to how floats and doubles are folded. Test case included. llvm-svn: 176131	2013-02-26 22:51:07 +00:00
Manman Ren	fe494749e4	Revert r176120 as it caused a failure at static-member.cpp llvm-svn: 176129	2013-02-26 22:35:53 +00:00
Bill Schmidt	8ea7af8e44	Fix PR15332 (patch by Florian Zeitz). There's no need to generate a stack frame for PPC32 SVR4 when there are no local variables assigned to the stack, i.e., when no red zone is needed. (PPC64 supports a red zone, but PPC32 does not.) llvm-svn: 176124	2013-02-26 21:28:57 +00:00
Manman Ren	5222195831	Debug Info: for static member variables, move AT_MIPS_linkage_name from TAG_member inside a class to the specification DIE. Having AT_MIPS_linkage_name on TAG_member caused old gdb (GNU 6.3.50) to error out. Also gcc 4.7 has AT_MIPS_linkage_name on the specification DIE. rdar://problem/13291234 llvm-svn: 176120	2013-02-26 20:48:29 +00:00
Chad Rosier	31a9bbcd4a	Add a test case for r176066. llvm-svn: 176119	2013-02-26 20:22:30 +00:00
Jim Grosbach	94a2260a7f	AsmParser: More generic support for integer type suffixes. For integer constants, allow 'L', 'UL' as well as 'ULL' and 'LL'. This provides better support for shared headers between .s and .c files that define bunches of constant values. rdar://9321056 llvm-svn: 176118	2013-02-26 20:17:10 +00:00
Chad Rosier	d2686ffa56	Remove a few unused arguments. llvm-svn: 176109	2013-02-26 18:39:31 +00:00
Renato Golin	e7693537d8	Proper XFAILs for ARMv7 / v5 llvm-svn: 176095	2013-02-26 17:16:27 +00:00
Bill Schmidt	441907dc09	Fix PR15359. The PowerPC TLS relocation types were not previously added to the necessary list in MCELFStreamer::fixSymbolsInTLSFixups(). Now they are! llvm-svn: 176094	2013-02-26 16:41:03 +00:00
Kostya Serebryany	cf880b9443	Unify clang/llvm attributes for asan/tsan/msan (LLVM part) These are two related changes (one in llvm, one in clang). LLVM: - rename address_safety => sanitize_address (the enum value is the same, so we preserve binary compatibility with old bitcode) - rename thread_safety => sanitize_thread - rename no_uninitialized_checks -> sanitize_memory CLANG: - add __attribute__((no_sanitize_address)) as a synonym for __attribute__((no_address_safety_analysis)) - add __attribute__((no_sanitize_thread)) - add __attribute__((no_sanitize_memory)) for S in address thread memory If -fsanitize=S is present and __attribute__((no_sanitize_S)) is not set llvm attribute sanitize_S llvm-svn: 176075	2013-02-26 06:58:09 +00:00
Michael Liao	ab97668061	Fix PR10499 - Check whether SSE is available before lowering all 1s vector building with PCMPEQD, which is only available from SSE2 llvm-svn: 176058	2013-02-25 23:01:03 +00:00
Chad Rosier	0adc042392	Remove extraneous attribute number. llvm-svn: 176053	2013-02-25 22:06:05 +00:00
Chad Rosier	a92ef4ba5b	[fast-isel] Add X86FastIsel::FastLowerArguments to handle functions with 6 or fewer scalar integer (i32 or i64) arguments. It completely eliminates the need for SDISel for trivial functions. Also, add the new llc -fast-isel-abort-args option, which is similar to -fast-isel-abort option, but for formal argument lowering. llvm-svn: 176052	2013-02-25 21:59:35 +00:00
Andrew Trick	7cf4361912	pre-RA-sched fix: only reevaluate physreg interferences when necessary. Fixes rdar:13279013: scheduler was blowing up on select instructions. llvm-svn: 176037	2013-02-25 19:11:48 +00:00
Chad Rosier	669bb3ee77	[ms-inline asm] Add support for the pushad/popad mnemonics. rdar://13254235 llvm-svn: 176036	2013-02-25 19:06:27 +00:00
Matt Beaumont-Gay	0e760da5fc	'Hexadecimal' has two 'a's and only one 'i'. llvm-svn: 176031	2013-02-25 18:11:18 +00:00
Bill Schmidt	b454829981	Fix missing relocation for TLS addressing peephole optimization. Report and fix due to Kai Nacke. Testcase update by me. llvm-svn: 176029	2013-02-25 16:44:35 +00:00
Chandler Carruth	05920b1847	Fix the root cause of PR15348 by correctly handling alignment 0 on memory intrinsics in the SDAG builder. When alignment is zero, the lang ref says that no alignment assumptions can be made. This is the exact opposite of the internal API contracts of the DAG where alignment 0 indicates that the alignment can be made to be anything desired. There is another, more explicit alignment that is better suited for the role of "no alignment at all": an alignment of 1. Map the intrinsic alignment to this early so that we don't end up generating aligned DAGs. It is really terrifying that we've never seen this before, but we suddenly started generating a large number of alignment 0 memcpys due to the new code to do memcpy-based copying of POD class members. That patch contains a bug that rounds bitfield alignments down when they are the first field. This can in turn produce zero alignments. This fixes weird crashes I've seen in library users of LLVM on 32-bit hosts, etc. llvm-svn: 176022	2013-02-25 14:20:21 +00:00
Benjamin Kramer	ee40b9a2d4	CVP: If we have a PHI with an incoming select, try to skip the select. This is a common pattern with dyn_cast and similar constructs, when the PHI no longer depends on the select it can often be turned into a simpler construct or even get hoisted out of the loop. PR15340. llvm-svn: 175995	2013-02-24 15:34:43 +00:00
Benjamin Kramer	b867fea5e6	Fix invalid IR in test, missing incoming value for PHI node. llvm-svn: 175994	2013-02-24 15:34:29 +00:00
Nadav Rotem	b532fca92c	Revert r169638 because it broke Mesa llvmpipe tests. Fix PR15239. llvm-svn: 175985	2013-02-24 07:09:35 +00:00
Renato Golin	0890ace58a	Some more tests for the global structure vectorizer llvm-svn: 175964	2013-02-23 12:48:30 +00:00
Benjamin Kramer	ee23dcb461	X86: Disable cmov-memory patterns on subtargets without cmov. Fixes PR15115. llvm-svn: 175962	2013-02-23 10:40:58 +00:00
Reed Kotler	dacee2bb44	Expand pseudos/macros for Selt. This is the last of the complex macros. The rest is some small misc. stuff. llvm-svn: 175950	2013-02-23 03:09:56 +00:00
Jim Grosbach	9be2d71512	ARM: Convenience aliases for 'srs*' instructions. Handle an implied 'sp' operand. rdar://11466783 llvm-svn: 175940	2013-02-23 00:52:09 +00:00
Eric Christopher	dae389bb98	Use getSplitDebugFilename when constructing the skeleton cu and update testcase accordingly to give the correct name to the cu. llvm-svn: 175934	2013-02-22 23:50:08 +00:00
Akira Hatanaka	02b0e48f6a	[mips] Emit call16 operator instead of got_disp. The former allows lazy binding. llvm-svn: 175920	2013-02-22 21:10:03 +00:00
Peter Collingbourne	e049fd2c31	Fix test by matching movaps instead of AVX-only vmovaps llvm-svn: 175914	2013-02-22 19:53:30 +00:00
Peter Collingbourne	7b57621fb3	x86_64: designate most general purpose and SSE registers as callee save under coldcc llvm-svn: 175911	2013-02-22 19:19:44 +00:00
Pete Cooper	23e8b6b8c9	Remove unused CHECK lines copied from another test llvm-svn: 175905	2013-02-22 18:16:21 +00:00
Renato Golin	adc1b07002	More tests to global struct vectorizer llvm-svn: 175898	2013-02-22 16:18:31 +00:00
Kristof Beyls	0ba797e8f7	Make ARMAsmPrinter generate the correct alignment specifier syntax in instructions. The Printer will now print instructions with the correct alignment specifier syntax, like vld1.8 {d16}, [r0:64] llvm-svn: 175884	2013-02-22 10:01:33 +00:00
Bill Wendling	a032374ea0	Use references to attribute groups on the call/invoke instructions. Listing all of the attributes for the callee of a call/invoke instruction is way too much and makes the IR unreadable. Use references to attributes instead. llvm-svn: 175877	2013-02-22 09:09:42 +00:00
Reed Kotler	4416cdadd5	Expand mips16 SelT form pseudos/macros. llvm-svn: 175862	2013-02-22 05:10:51 +00:00
Pete Cooper	047f81a5df	Fix isa<> check which could never be true. It was incorrectly checking a Function* being an IntrinsicInst* which isn't possible. It should always have been checking the CallInst* instead. Added test case for x86 which ensures we only get one constant load. It was 2 before this change. rdar://problem/13267920 llvm-svn: 175853	2013-02-22 01:50:38 +00:00
Eli Bendersky	705085da37	Previously, parsing capability of the .debug_frame section was added to lib/DebugInfo, with dumping in llvm-dwarfdump. This patch adds initial ability to parse and dump CFA instructions contained in entries. To keep it manageable, the patch omits some more advanced capabilities (accounted in TODOs): * Parsing of instructions with BLOCK arguments (expression lists) * Dumping of actual instruction arguments (currently only names are dumped). This is quite tricky since the dumper has to effectively "interpret" the instructions. llvm-svn: 175820	2013-02-21 22:53:19 +00:00
Renato Golin	cf928cb53f	Allow GlobalValues to vectorize with AliasAnalysis Storing the load/store instructions with the values and inspect them using Alias Analysis to make sure they don't alias, since the GEP pointer operand doesn't take the offset into account. Trying hard to not add any extra cost to loads and stores that don't overlap on global values, AA is only calculated if all of the previous attempts failed. Using biggest vector register size as the stride for the vectorization access, as we're being conservative and the cost model (which calculates the real vectorization factor) is only run after the legalization phase. We might re-think this relationship in the future, but for now, I'd rather be safe than sorry. llvm-svn: 175818	2013-02-21 22:39:03 +00:00
Anshuman Dasgupta	d062c70444	Hexagon: Expand cttz, ctlz, and ctpop for now. llvm-svn: 175783	2013-02-21 19:39:40 +00:00
Jakob Stoklund Olesen	2ff4dc0ff2	Make RAFast::UsedInInstr indexed by register units. This fixes some problems with too conservative checking where we were marking all aliases of a register as used, and then also checking all aliases when allocating a register. <rdar://problem/13249625> llvm-svn: 175782	2013-02-21 19:35:21 +00:00
Bill Schmidt	27917785ae	Large code model support for PowerPC. Large code model is identical to medium code model except that the addis/addi sequence for "local" accesses is never used. All accesses use the addis/ld sequence. The coding changes are straightforward; most of the patch is taken up with creating variants of the medium model tests for large model. llvm-svn: 175767	2013-02-21 17:12:27 +00:00
Benjamin Kramer	3238dc0c61	DAGCombiner: Make the post-legalize vector op optimization more aggressive. A legal BUILD_VECTOR goes in and gets constant folded into another legal BUILD_VECTOR so we don't lose any legality here. The problematic PPC optimization that made this check necessary was fixed recently. llvm-svn: 175759	2013-02-21 15:24:35 +00:00
Tom Stellard	0d171c8877	R600: Fix for Unigine when MachineSched is enabled Fixes for-loop.cl piglit test Patch By: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175742	2013-02-21 15:06:59 +00:00
Michel Danzer	7f02a8c7a7	R600/SI: Make sure M0 is loaded for V_INTERP_MOV_F32 NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175733	2013-02-21 08:57:10 +00:00
Reed Kotler	97ba5f2772	Expand the sel pseudo/macro. This generates basic blocks where previously there were inline br .+4 instructions. Soon everything can enjoy the full instruction scheduling experience. llvm-svn: 175718	2013-02-21 04:22:38 +00:00
Jack Carter	dc46338e2d	Mips specific standalone assembler addressing mode %hi and %lo. The constructs %hi() and %lo() represent the high and low 16 bits of the address. Because the 16 bit offset field of an LW instruction is interpreted as signed, if bit 15 of the low part is 1 then the low part will act as a negative and 1 needs to be added to the high part. Contributor: Vladimir Medic llvm-svn: 175707	2013-02-21 02:09:31 +00:00
Bill Schmidt	f5b474c6c6	PPCDAGToDAGISel::PostprocessISelDAG() This patch implements the PPCDAGToDAGISel::PostprocessISelDAG virtual method to perform post-selection peephole optimizations on the DAG representation. One optimization is implemented here: folds to clean up complex addressing expressions for thread-local storage and medium code model. It will also be useful for large code model sequences when those are added later. I originally thought about doing this on the MI representation prior to register assignment, but it's difficult to do effective global dead code elimination at that point. DCE is trivial on the DAG representation. A typical example of a candidate code sequence in assembly: addis 3, 2, globalvar@toc@ha addi 3, 3, globalvar@toc@l lwz 5, 0(3) When the final instruction is a load or store with an immediate offset of zero, the offset from the add-immediate can replace the zero, provided the relocation information is carried along: addis 3, 2, globalvar@toc@ha lwz 5, globalvar@toc@l(3) Since the addi can in general have multiple uses, we need to only delete the instruction when the last use is removed. llvm-svn: 175697	2013-02-21 00:38:25 +00:00
Jack Carter	1ac5322e61	ELF symbol table field st_other support, excluding visibility bits. Mips specific standalone assembler directive "set at". This directive changes the general purpose register that the assembler will use when given the symbolic register name $at. This does not include negative testing. That will come in a future patch. A side effect of this patch recognizes the different GPR register names for temporaries between old abi and new abi so a test case for that is included. Contributor: Vladimir Medic llvm-svn: 175686	2013-02-20 23:11:17 +00:00
Bill Schmidt	4437ad7d94	Stabilize vec_constants.ll llvm-svn: 175683	2013-02-20 22:43:03 +00:00
Arnold Schwaighofer	3f9568e921	DAGCombiner: Fold pointless truncate, bitcast, buildvector series (2xi32) (truncate ((2xi64) bitcast (buildvector i32 a, i32 x, i32 b, i32 y))) can be folded into a (2xi32) (buildvector i32 a, i32 b). Such a DAG would cause unnecessary vdup instructions followed by vmovn instructions. We generate this code on ARM NEON for a setcc olt, 2xf64, 2xf64. For example, in the vectorized version of the code below. double A[N]; double B[N]; void test_double_compare_to_double() { int i; for(i=0;i<N;i++) A[i] = (double)(A[i] < B[i]); } radar://13191881 Fixes bug 15283. llvm-svn: 175670	2013-02-20 21:33:32 +00:00
Bill Schmidt	c6cbecc2c7	Additional fixes for bug 15155. This handles the cases where the 6-bit splat element is odd, converting to a three-instruction sequence to add or subtract two splats. With this fix, the XFAIL in test/CodeGen/PowerPC/vec_constants.ll is removed. llvm-svn: 175663	2013-02-20 20:41:42 +00:00
Michael Liao	7fb39669ef	Fix PR15267 - When extloading from a vector with non-byte-addressable element, e.g. <4 x i1>, the current logic breaks. Extend the current logic to fix the case where the element type is not byte-addressable by loading all bytes, bit-extracting/packing each element. llvm-svn: 175642	2013-02-20 18:04:21 +00:00
Bill Schmidt	6631e94838	Fix bug 14779 for passing anonymous aggregates [patch by Kai Nacke]. The PPC backend doesn't handle these correctly. This patch uses logic similar to that in the X86 and ARM backends to track these arguments properly. llvm-svn: 175635	2013-02-20 17:31:41 +00:00
Jyotsna Verma	7503a62bce	Hexagon: Move HexagonMCInst.h to MCTargetDesc/HexagonMCInst.h. Add HexagonMCInst class which adds various Hexagon VLIW annotations. In addition, this class also includes some APIs related to the constant extenders. llvm-svn: 175634	2013-02-20 16:13:27 +00:00
Bill Schmidt	51e7951e24	Fix PR15155: lost vadd/vsplat optimization. During lowering of a BUILD_VECTOR, we look for opportunities to use a vector splat. When the splatted value fits in 5 signed bits, a single splat does the job. When it doesn't fit in 5 bits but does fit in 6, and is an even value, we can splat on half the value and add the result to itself. This last optimization hasn't been working recently because of improved constant folding. To circumvent this, create a pseudo VADD_SPLAT that can be expanded during instruction selection. llvm-svn: 175632	2013-02-20 15:50:31 +00:00
Elena Demikhovsky	0ccdd1315b	I optimized the following patterns: sext <4 x i1> to <4 x i64> sext <4 x i8> to <4 x i64> sext <4 x i16> to <4 x i64> I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns: (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT))) The sext_in_reg (v4i32 x) may be lowered to shl+sar operations. The "sar" does not exist for 64-bit operations, so lowering sext_in_reg (v4i64 x) has no vector solution. I also added the cost of these operations to the AVX costs table. llvm-svn: 175619	2013-02-20 12:42:54 +00:00
Kostya Serebryany	699ac28aa5	[asan] instrument invoke insns with noreturn attribute (as well as call insns) llvm-svn: 175617	2013-02-20 12:35:15 +00:00
Logan Chien	53c18d8ac7	Fix thumbv5e frame lowering assertion failure. It is possible that the frame pointer is not found in the callee saved info, thus FramePtrSpillFI may be incorrect if we don't check the result of hasFP(MF). Besides, if we enable the stack coloring algorithm, there will be an assertion to ensure the slot is live. But in the test case, %var1 is not live in the prologue of the function, and we will get the assertion failure. Note: There is similar code in ARMFrameLowering.cpp. llvm-svn: 175616	2013-02-20 12:21:33 +00:00
Bill Wendling	4cdb88983d	Use the attribute group reference instead of the attribute directly. llvm-svn: 175609	2013-02-20 07:48:23 +00:00
Bill Wendling	90bc19cd91	Modify the LLVM assembly output so that it uses references to represent function attributes. This makes the LLVM assembly look better. E.g.: define void @foo() #0 { ret void } attributes #0 = { nounwind noinline ssp } llvm-svn: 175605	2013-02-20 07:21:42 +00:00
Reed Kotler	7b503c2b03	Expand pseudos/macros: SltCCRxRy16, SltiCCRxImmX16, SltiuCCRxImmX16, SltuCCRxRy16 $T8 shows up as register $24 when emitted from C++ code so we had to change some tests that were already there for this functionality. llvm-svn: 175593	2013-02-20 05:45:15 +00:00
Michael J. Spencer	6a8746b7e6	[llvm-readobj] Add ELF .dynamic table dumping. llvm-svn: 175592	2013-02-20 02:37:12 +00:00
Chad Rosier	45a52fa097	[ms-inline asm] Force the use of a base pointer if the MachineFunction includes MS-style inline assembly. This is a follow-on to r175334. Forcing a FP to be emitted doesn't ensure it will be used. Therefore, force the base pointer as well. We now treat MS inline assembly in the same way we treat functions with dynamic stack realignment and VLAs. This guarantees the BP will be used to reference parameters and locals. rdar://13218191 llvm-svn: 175576	2013-02-19 23:50:45 +00:00
Jack Carter	10c97e5ca0	ELF symbol table field st_other support, excluding visibility bits. Mips (o32 abi) specific e_header setting. EF_MIPS_ABI_O32 needs to be set in the ELF header flags for o32 abi output. Contributor: Reed Kotler llvm-svn: 175569	2013-02-19 22:29:00 +00:00
Jack Carter	1ba1f3cec8	ELF symbol table field st_other support, excluding visibility bits. Mips (Mips16) specific e_header setting. EF_MIPS_ARCH_ASE_M16 needs to be set in the ELF header flags for Mips16. Contributor: Reed Kotler llvm-svn: 175566	2013-02-19 22:14:34 +00:00
Nadav Rotem	0186347c4c	Fix a bug in mayHaveSideEffects. Functions that do not return are now considered as instructions with side effects. rdar://13227456 llvm-svn: 175553	2013-02-19 20:02:09 +00:00
Jim Grosbach	3fa275e6f7	ARM: Allocation hints must make sure to be in the alloc order. When creating an allocation hint for a register pair, make sure the hint for the physical register reference is still in the allocation order. rdar://13240556 llvm-svn: 175541	2013-02-19 18:55:36 +00:00
Eli Bendersky	c66b7b2582	Fix typo llvm-svn: 175530	2013-02-19 17:11:48 +00:00
Benjamin Kramer	b3aa2b8497	Fix GCMetadataPrinter::finishAssembly not executed, patch by Yiannis Tsiouris. Due to the execution order of doFinalization functions, the GC information was deleted before AsmPrinter::doFinalization was executed. Thus, the GCMetadataPrinter::finishAssembly was never called. The patch fixes that by moving the code of the GCInfoDeleter::doFinalization to Printer::doFinalization. llvm-svn: 175528	2013-02-19 16:51:44 +00:00
Arnold Schwaighofer	e5083442b2	ARM NEON: Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element and the bitcast, we will unnecessarily generate moves between scalar and vector registers. The patch fixes this by using a COPY_TO_REGCLASS and an EXTRACT_SUBREG to extract the element from the vector instead. radar://13191881 llvm-svn: 175520	2013-02-19 15:27:05 +00:00
Kostya Serebryany	3ece9beaf1	[asan] instrument memory accesses with unusual sizes This patch makes asan instrument memory accesses with unusual sizes (e.g. 5 bytes or 10 bytes), e.g. long double or packed structures. Instrumentation is done with two 1-byte checks (first and last bytes), and if an error is found, __asan_report_load_n(addr, real_size) or __asan_report_store_n(addr, real_size) is called. Also, call these two new functions in memset/memcpy instrumentation. asan-rt part will follow. llvm-svn: 175507	2013-02-19 11:29:21 +00:00
Reed Kotler	3e457f505e	Expand pseudos/macros BteqzT8SltiX16, BteqzT8SltiuX16, BtnezT8SltiX16, BtnezT8SltiuX16 . llvm-svn: 175486	2013-02-19 03:56:57 +00:00
Bill Wendling	c98e4fef1a	Temporarily revert r175470 for more review. llvm-svn: 175476	2013-02-19 00:52:45 +00:00
Reed Kotler	d82171990f	Expand pseudos BteqzT8CmpiX16 and BtnezT8CmpiX16. llvm-svn: 175474	2013-02-19 00:20:58 +00:00
Bill Wendling	66651e4c2f	Check to see if the 'no-builtin' attribute is set before simplifying a library call. llvm-svn: 175470	2013-02-18 23:17:16 +00:00
Chad Rosier	f666b761bd	Comment out the rdar number. llvm-svn: 175460	2013-02-18 21:59:15 +00:00
Chad Rosier	f3f8f443e1	[fast-isel] Remove an invalid assert. If the memcpy has an odd length with an alignment of 2, this would incorrectly assert on the last 1 byte copy. rdar://13202135 llvm-svn: 175459	2013-02-18 21:46:28 +00:00
Benjamin Kramer	53bc37ca2a	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Vincent Lejeune	1ce13f553e	R600/SI: Use MULADD_IEEE/V_MAD_F32 instruction for mad pattern llvm-svn: 175446	2013-02-18 14:11:28 +00:00
Kostya Serebryany	7ca384bc1a	[asan] revert r175266 as it breaks code with packed structures. supporting long double will require a more general solution llvm-svn: 175442	2013-02-18 13:47:02 +00:00
Tim Northover	67d3c09332	AArch64: adjust tests which rely on a default JIT Profiling tests do need a JIT. They'll pass if a cross-compiler targeting AArch64 by default has been built, but fail if a native AArch64 compiler has been built. Therefore XFAIL is inappropriate and we mark them unsupported. ExecutionEngine tests are JIT by definition, they should also be unsupported. Transforms/LICM only uses the interpreter to check the output is still sane after optimisation. It can be switched to use an interpreter. llvm-svn: 175433	2013-02-18 11:08:37 +00:00
Reed Kotler	1460738710	Expand macro/pseudo instructions BtnezT8SltX16 and BtnezT8SltuX16. llvm-svn: 175420	2013-02-18 05:43:03 +00:00
Reed Kotler	c40f4e5899	Expand pseudo/macro BteqzT8SltX16. llvm-svn: 175417	2013-02-18 04:04:26 +00:00
Reed Kotler	7e4bc6067b	Expand macro/pseudo BteqzT8CmpX16. llvm-svn: 175416	2013-02-18 03:06:29 +00:00
Reed Kotler	cb37409b92	Beginning of expanding all current mips16 macro/pseudo instruction sequences. This expansion will be moved to expandISelPseudos as soon as I can figure out how to do that. There are other instructions which use this ExpandFEXT_T8I816_ins and as soon as I have finished expanding them all, I will delete the macro asm string text so it has no way to be used in the future. llvm-svn: 175413	2013-02-18 00:59:04 +00:00

18809 Commits