llvm-project

Commit Graph

Author	SHA1	Message	Date
Renato Golin	c0a3c1d66b	Add @llvm.clear_cache builtin Implementing the LLVM part of the call to __builtin___clear_cache which translates into an intrinsic @llvm.clear_cache and is lowered by each target, either to a call to __clear_cache or nothing at all in case the caches are unified. Updating LangRef and adding some tests for the implemented architectures. Other archs will have to implement the method in case this builtin has to be compiled for it, since the default behaviour is to bail unimplemented. A Clang patch is required for the builtin to be lowered into the llvm intrinsic. This will be done next. llvm-svn: 204802	2014-03-26 12:52:28 +00:00
Hal Finkel	732f0f73a7	[PowerPC] Lower VSELECT using xxsel when VSX is available With VSX there is a real vector select instruction, and so we should use it. Note that VSELECT will still scalarize for v2f64 because the corresponding SetCC result type (v2i64) is not currently a legal type. llvm-svn: 204801	2014-03-26 12:49:28 +00:00
Daniel Sanders	6dad838f3a	[mips] Add tests for t0-t3 for N32/N64 These are aliases of t4-t7 and are provided for compatibility with both the original ABI documentation (using t4-t7) and GNU As (using t0-t3) llvm-svn: 204797	2014-03-26 11:46:34 +00:00
Daniel Sanders	a4b0c74765	[mips] The register names depend on the ABI being N32/N64 rather than the arch being mips64 Summary: Added test cases for O32 and N32 on MIPS64. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3175 llvm-svn: 204796	2014-03-26 11:39:07 +00:00
Timur Iskhodzhanov	b5b7a61646	Follow-up to r204790: don't try to emit line tables if there are no functions with DI in the TU llvm-svn: 204795	2014-03-26 11:24:36 +00:00
Daniel Sanders	85f482b02f	[mips] $s8 is an alias for $fp in all ABI's, not just N32/N64. llvm-svn: 204793	2014-03-26 11:05:24 +00:00
Daniel Sanders	91d4407cd8	[mips] Move the CHECK lines in mips*-register-names.s to make it more obvious which CHECK matches with which insn This reveals a small mistake in mips-register-names.s ($sp is tested twice and $s8 is not tested) which will be fixed in a follow-up commit. llvm-svn: 204792	2014-03-26 10:54:30 +00:00
Timur Iskhodzhanov	6a35c15589	Add tests for r204790 llvm-svn: 204791	2014-03-26 09:51:45 +00:00
Timur Iskhodzhanov	e32ef937eb	Use -LABEL checks in the COFF debug info tests llvm-svn: 204788	2014-03-26 08:45:02 +00:00
Rafael Espindola	65481d7b97	Revert "Prevent alias from pointing to weak aliases." This reverts commit r204781. I will follow up with the msan folks to see what they were trying to do with aliases to weak aliases. llvm-svn: 204784	2014-03-26 06:14:40 +00:00
Hal Finkel	bd4de9d478	[PowerPC] Generate logical vector VSX instructions These instructions are essentially the same as their Altivec counterparts, but have access to the larger VSX register file. llvm-svn: 204782	2014-03-26 04:55:40 +00:00
Rafael Espindola	3b712a84a9	Prevent alias from pointing to weak aliases. Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias2 are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204781	2014-03-26 04:48:47 +00:00
David Blaikie	62dd7df612	DebugInfo: Add fission-related sections to COFF Allows this test to pass on COFF platforms so we don't need to restrict this test to a single target anymore. llvm-svn: 204780	2014-03-26 03:05:10 +00:00
Rafael Espindola	85a8491a93	Correctly detect if a symbol uses a reserved section index or not. The logic was incorrect for variables, causing them to end up in the wrong section if the section had an index >= 0xff00. llvm-svn: 204771	2014-03-26 00:16:43 +00:00
Quentin Colombet	6f12ae0d5c	[X86] Add broadcast instructions to the table used by ExeDepsFix pass. Adds the different broadcast instructions to the ReplaceableInstrsAVX2 table. That way the ExeDepsFix pass can take better decisions when AVX2 broadcasts are across domain (int <-> float). In particular, prior to this patch we were generating: vpbroadcastd LCPI1_0(%rip), %ymm2 vpand %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 ## <- domain change penalty Now, we generate the following nice sequence where everything is in the float domain: vbroadcastss LCPI1_0(%rip), %ymm2 vandps %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 <rdar://problem/16354675> llvm-svn: 204770	2014-03-26 00:10:22 +00:00
Rafael Espindola	10be0837ac	Create .symtab_shndxr only when needed. We need .symtab_shndxr if and only if a symbol references a section with an index >= 0xff00. The old code was trying to figure out if the section was needed ahead of time, making it fairly dependent on the code actually writing the table. It was also somewhat conservative and would create the section in cases where it was not needed. If I remember correctly, the old structure was there so that the sections were created in the same order gas creates them. That was valuable when MC's support for ELF was new and we tested with elf-dump.py. This patch refactors the symbol table creation to another class and makes it obvious that .symtab_shndxr is really only created when we are about to output a reference to a section index >= 0xff00. While here, also improve the tests to use macros. One file is one section short of needing .symtab_shndxr, the second one has just the right number. llvm-svn: 204769	2014-03-25 23:44:25 +00:00
Hal Finkel	174e590966	[PowerPC] Select between VSX A-type and M-type FMA instructions just before RA The VSX instruction set has two types of FMA instructions: A-type (where the addend is taken from the output register) and M-type (where one of the product operands is taken from the output register). This adds a small pass that runs just after MI scheduling (and, thus, just before register allocation) that mutates A-type instructions (that are created during isel) into M-type instructions when: 1. This will eliminate an otherwise-necessary copy of the addend 2. One of the product operands is killed by the instruction The "right" moment to make this decision is in between scheduling and register allocation, because only there do we know whether or not one of the product operands is killed by any particular instruction. Unfortunately, this also makes the implementation somewhat complicated, because the MIs are not in SSA form and we need to preserve the LiveIntervals analysis. As a simple example, if we have: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 ... %vreg9<def,tied1> = XSMADDADP %vreg9<tied0>, %vreg17, %vreg19, %RM<imp-use>; VSLRC:%vreg9,%vreg17,%vreg19 ... We can eliminate the copy by changing from the A-type to the M-type instruction. This means: %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 is replaced by: %vreg16<def,tied1> = XSMADDMDP %vreg16<tied0>, %vreg18, %vreg9, %RM<imp-use>; VSLRC:%vreg16,%vreg18,%vreg9 and we remove: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 llvm-svn: 204768	2014-03-25 23:29:21 +00:00
NAKAMURA Takumi	3a485fba1f	llvm/test/DebugInfo/empty.ll: Suppress crash for targeting pecoff while investigating. llvm-svn: 204766	2014-03-25 23:16:44 +00:00
Adam Nemet	4beef4c90d	[X86] Generate VPSHUFB for in-place v16i16 shuffles This used to resort to splitting the 256-bit operation into two 128-bit shuffles and then recombining the results. Fixes <rdar://problem/16167303> llvm-svn: 204735	2014-03-25 17:47:06 +00:00
Richard Osborne	0af4aa9a19	[InstCombine] Don't fold bitcast into store if it would need addrspacecast Summary: Previously the code didn't check if the before and after types for the store were pointers to different address spaces. This resulted in instcombine using a bitcast to convert between pointers to different address spaces, causing an assertion due to the invalid cast. It is not appropriate to use addrspacecast in this case because it is not guaranteed to be a no-op cast. Instead bail out and do not do the transformation. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3117 llvm-svn: 204733	2014-03-25 17:21:41 +00:00
Matt Arsenault	86673ba836	R600: Add failing testcase for <3 x i32> stores. This is supposed to have the same store size and alignment as <4 x i32>, but currently is split into a 64-bit and 32-bit store. llvm-svn: 204729	2014-03-25 16:50:55 +00:00
Benjamin Kramer	e75eaca32f	ScalarEvolution: Compute exit counts for loops with a power-of-2 step. If we have a loop of the form for (unsigned n = 0; n != (k & -32); n += 32) {} then we know that n is always divisible by 32 and the loop must terminate. Even if we have a condition where the loop counter will overflow it'll always hold this invariant. PR19183. Our loop vectorizer creates this pattern and it's also occasionally formed by loop counters derived from pointers. llvm-svn: 204728	2014-03-25 16:25:12 +00:00
Evgeniy Stepanov	86f318e8b4	[msan] Relax the test some more. This may or may not fix the bots. r204720 did not. llvm-svn: 204721	2014-03-25 14:32:05 +00:00
Evgeniy Stepanov	d2b07ddfac	[msan] Make some tests less strict. This may or may not fix the bots. llvm-svn: 204720	2014-03-25 14:15:14 +00:00
Evgeniy Stepanov	fc742acc8c	[msan] More precise instrumentation of select IR. Some bits of select result may be initialized even if select condition is not. https://code.google.com/p/memory-sanitizer/issues/detail?id=50 llvm-svn: 204716	2014-03-25 13:08:34 +00:00
Daniel Sanders	71a89d92f6	[mips] '.set at=$0' should be equivalent to '.set noat' Differential Revision: http://llvm-reviews.chandlerc.com/D3171 llvm-svn: 204714	2014-03-25 13:01:06 +00:00
Cameron McInally	45dc489403	Fix AVX2 Gather execution domains. llvm-svn: 204713	2014-03-25 12:36:38 +00:00
Daniel Sanders	b1d7e53a26	[mips] Correct testcase for .set at=$reg and emit the new warnings for numeric registers too. Summary: Remove the XFAIL added in my previous commit and correct the test such that it correctly tests the expansion of the assembler temporary. Also added a test to check that $at is always $1 when written by the user. Corrected the new assembler temporary warnings so that they are emitted for numeric registers too. Differential Revision: http://llvm-reviews.chandlerc.com/D3169 llvm-svn: 204711	2014-03-25 11:16:03 +00:00
Daniel Sanders	e231ae9e3a	[mips] Fix assembler temporary expansion and add associated warnings about the use of $at. Summary: The assembler temporary is normally $at ($1) but can be reassigned using '.set at=$reg'. Regardless of which register is nominated as the assembler temporary, $at remains $1 when written by the user. Adds warnings under the following conditions: * The register nominated as the assembler temporary is used by the user. * '.set noat' is in effect and $at is used by the user. Both of these only work for named registers. I have a follow up commit that makes it work for numeric registers as well. XFAIL set-at-directive.s since it incorrectly tests that $at is redefined by '.set at=$reg'. Testcases will follow in a separate commit. Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3167 llvm-svn: 204710	2014-03-25 10:57:07 +00:00
David Majnemer	273bff4713	WinCOFF: Add support for -fdata-sections This is a pretty straightforward translation for COFF, we just need to stick the data in a COMDAT section marked as IMAGE_COMDAT_SELECT_NODUPLICATES. N.B. We must be careful to avoid sticking entities with private linkage in COMDAT groups. COFF is pretty hostile to the renaming of entities so we must be careful to disallow GlobalVariables with unstable names. llvm-svn: 204703	2014-03-25 06:14:26 +00:00
David Blaikie	3ffe4dd67f	DebugInfo: Add GNU_addr_base and GNU_ranges_base only when there are addresses or ranges Based on code review feedback from Eric in r204672. llvm-svn: 204702	2014-03-25 05:34:24 +00:00
Saleem Abdulrasool	1425622ad8	test: fix CHECK lines Thanks to gix for pointing out that the CHECK-LABEL lines were incorrect! llvm-svn: 204700	2014-03-25 03:39:39 +00:00
David Blaikie	9c550ac4e7	DebugInfo: Support debug_loc under fission Implement debug_loc.dwo, as well as llvm-dwarfdump support for dumping this section. Outlined in the DWARF5 spec and http://gcc.gnu.org/wiki/DebugFission the debug_loc.dwo section has more variation than the standard debug_loc, allowing 3 different forms of entry (plus the end of list entry). GCC seems to, and Clang certainly, only use one form, so I've just implemented dumping support for that for now. It wasn't immediately obvious that there was a good refactoring to share the implementation of dumping support between debug_loc and debug_loc.dwo, so they're separate for now - ideas welcome or I may come back to it at some point. As per a comment in the code, we could choose different forms that may reduce the number of debug_addr entries we emit, but that will require further study. llvm-svn: 204697	2014-03-25 01:44:02 +00:00
Manman Ren	78cf02a07b	Register Allocator: check other options before using a CSR for the first time. When register allocator's stage is RS_Spill, we choose spill over using the CSR for the first time, if the spill cost is lower than CSRCost. When register allocator's stage is < RS_Split, we choose pre-splitting over using the CSR for the first time, if the cost of splitting is lower than CSRCost. CSRCost is set with command-line option "regalloc-csr-first-time-cost". The default value is 0 to generate the same codes as before this commit. With a value of 15 (1 << 14 is the entry frequency), I measured performance gain of 3% on 253.perlbmk and 1.7% on 197.parser, with instrumented PGO, on an arm device. rdar://16162005 llvm-svn: 204690	2014-03-25 00:16:25 +00:00
Kevin Enderby	89299400ac	Fix crashes when assembler directives are used that are not for Mach-O object files by generating an error instead. rdar://16335232 llvm-svn: 204687	2014-03-25 00:05:50 +00:00
David Blaikie	96dea0581e	DebugInfo: Add DW_AT_GNU_ranges_base to skeleton CUs This is used to avoid relocations in the dwo file by allowing DW_AT_ranges specified in debug_info.dwo to be relative to this base address. (r204667 implements the base-relative DW_AT_ranges side of this) llvm-svn: 204672	2014-03-24 21:31:35 +00:00
David Blaikie	26b2bd04fd	DebugInfo: Implement relative addressing for DW_AT_ranges under fission This removes the debug_ranges relocations from debug_info.dwo (but doesn't implement the DW_AT_GNU_ranges_base which is also necessary for correct functioning) llvm-svn: 204668	2014-03-24 21:07:27 +00:00
David Blaikie	3c9a3cc495	DebugInfo: Don't emit relocations to abbreviations in debug_info.dwo llvm-svn: 204667	2014-03-24 20:53:02 +00:00
Matt Arsenault	684dc80b6d	R600/SI: Fix extra mov from legalizing 64-bit SALU ops. Check the register class of each operand individually to avoid an extra copy to a vgpr. llvm-svn: 204662	2014-03-24 20:08:13 +00:00
Matt Arsenault	248b7b6ba1	R600/SI: Sub-optimal fix for 64-bit immediates with SALU ops. No longer asserts, but now you get moves loading legal immediates into the split 32-bit operations. llvm-svn: 204661	2014-03-24 20:08:09 +00:00
Matt Arsenault	f35182c783	R600/SI: Fix 64-bit bit ops that require the VALU. Try to match scalar and first like the other instructions. Expand 64-bit ands to a pair of 32-bit ands since that is not available on the VALU. llvm-svn: 204660	2014-03-24 20:08:05 +00:00
Matt Arsenault	a7f1e0c44f	R600: Implement isNarrowingProfitable. llvm-svn: 204658	2014-03-24 19:43:31 +00:00
Ulrich Weigand	cae3a17a21	[PowerPC] Generate little-endian object files As a first step towards real little-endian code generation, this patch changes the PowerPC MC layer to actually generate little-endian object files. This involves passing the little-endian flag through the various layers, including down to createELFObjectWriter so we actually get basic little-endian ELF objects, emitting instructions in little-endian order, and handling fixups and relocations as appropriate for little-endian. The bulk of the patch is to update most test cases in test/MC/PowerPC to verify both big- and little-endian encodings. (The only test cases not updated are those that create actual big-endian ABI code, like the TLS tests.) Note that while the object files are now little-endian, the generated code itself is not yet updated, in particular, it still does not adhere to the ELFv2 ABI. llvm-svn: 204634	2014-03-24 18:16:09 +00:00
Quentin Colombet	2d5c156b96	[X86][ISelDAG] Add missing fallback patterns for avx2 broadcast instructions. Those patterns are used when the load cannot be folded into the related broadcast during the select phase. This happens when the load gets additional uses that were not anticipated during the previous lowering phases (constant vector to constant load, then constant load reused) or when selection DAG is not able to prove that folding the load will not create a cycle in the DAG. <rdar://problem/16074331> llvm-svn: 204631	2014-03-24 17:54:19 +00:00
Matt Arsenault	ad41d7b531	R600/SI: Fix 64-bit private loads. llvm-svn: 204630	2014-03-24 17:50:46 +00:00
Eli Bendersky	a7b6679774	Add test to test/CodeGen/NVPTX for "alloca buffer" arguments. Make sure such IR gets properly lowered to PTX. llvm-svn: 204624	2014-03-24 16:52:30 +00:00
Daniel Sanders	d89b13625e	[mips] Add error message when trying to use $at in '.set noat' mode. Summary: Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3158 llvm-svn: 204621	2014-03-24 16:48:01 +00:00
Daniel Sanders	68fd4c784c	[mips] Add regression tests for parenthetic expressions in MIPS assembly. Summary: These expressions already worked but weren't tested. Patch by Robert N. M. Watson and David Chisnall (it was originally two patches) Their work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3156 llvm-svn: 204612	2014-03-24 15:42:21 +00:00
Daniel Sanders	01f9fc06e7	[mips] Allow dsubu to take an immediate as an alias for dsubiu. Summary: Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3155 llvm-svn: 204611	2014-03-24 15:38:00 +00:00
Daniel Sanders	a771fefb72	[mips] Implement shorthand add / sub forms for MIPS. Summary: - If only two registers are passed to a three-register operation, then the first argument is both source and destination register. - If a non-register is passed as the last argument, generate the immediate version of the instruction. Also mark DADD commutative and add scheduling information (to the generic scheduler), and implement DSUB. Patch by David Chisnall His work was sponsored by: DARPA, AFRL CC: theraven Differential Revision: http://llvm-reviews.chandlerc.com/D3148 llvm-svn: 204605	2014-03-24 14:05:39 +00:00
Justin Holewinski	ba2fa6de4f	[NVPTX] Add isel patterns for addrspacecast llvm-svn: 204600	2014-03-24 11:17:53 +00:00
Rafael Espindola	cfee7efde9	Teach llvm-readobj to print human friendly description of reserved sections. llvm-svn: 204584	2014-03-24 05:00:34 +00:00
Karthik Bhat	195e9dd91b	Allow constant folding of ceil function whenever feasible llvm-svn: 204583	2014-03-24 04:36:06 +00:00
Rafael Espindola	717aeb6d7b	Add back tests that were reverted in r204203. They pass again with the fix in r204581. llvm-svn: 204582	2014-03-24 03:48:15 +00:00
Rafael Espindola	022bb76879	Propagate section from base to derived symbol. We were already propagating the section in a = b With this patch we also propagate it for a = b + 1 llvm-svn: 204581	2014-03-24 03:43:21 +00:00
Justin Bogner	db1225d061	llvm-profdata: Check for bad data in the show command llvm-svn: 204573	2014-03-23 20:55:53 +00:00
David Majnemer	9338984f57	WinCOFF: Add support for -ffunction-sections This is a pretty straightforward translation for COFF, we just need to stick the function in a COMDAT section marked as IMAGE_COMDAT_SELECT_NODUPLICATES. llvm-svn: 204565	2014-03-23 17:47:39 +00:00
Hal Finkel	4a912250fa	[PowerPC] Make use of VSX f64 <-> i64 conversion instructions When VSX is available, these instructions should be used in preference to the older variants that only have access to the scalar floating-point registers. llvm-svn: 204559	2014-03-23 05:35:00 +00:00
Lang Hames	459b5dc39e	Revert r204076 for now - it caused significant regressions in a number of benchmarks. <rdar://problem/16368461> llvm-svn: 204558	2014-03-23 04:22:31 +00:00
Duncan P. N. Exon Smith	4680361d7c	InstrProf: Check pointer size in raw profile Since the profile can come from 32-bit machines, we need to check the pointer size. Change the magic number to facilitate this. Adds tests for reading 32-bit and 64-bit binaries (both big- and little-endian). The tests write a binary using printf in RUN lines (like raw-magic-but-no-header.test). Assuming the bots don't complain, this seems like a better way forward for testing RawInstrProfReader than committing binary files. <rdar://problem/16400648> llvm-svn: 204557	2014-03-23 03:38:12 +00:00
Rafael Espindola	a6e3a599d1	Propagate types from symbol to aliases. This is similar, but not identical to what gas does. The logic in MC is to just compute the symbol table after parsing the entire file. GAS is mixed, given .type b, @object a = b b: .type b, @function It will propagate the change and make 'a' a function. Given .type b, @object b: a = b .type b, @function the type of 'a' is still object. Since we do the computation in the end, we produce a function in both cases. llvm-svn: 204555	2014-03-23 03:33:20 +00:00
Justin Bogner	957a2944df	llvm-profdata: Don't pipe stderr into show for the tests Some text shows up on stderr when using guard malloc, and this test was trying to treat that as input to llvm-profdata show. There's no reason to pipe stderr into show at all here. llvm-svn: 204549	2014-03-22 23:53:43 +00:00
Saleem Abdulrasool	44419fc3cd	ARM IAS: properly handle function entries in .thumb When a label is parsed, check if there is type information available for the label. If so, check if the symbol is a function. If the symbol is a function and we are in thumb mode and no explicit thumb_func has been emitted, adjust the symbol data to indicate that the function definition is a thumb function. The application of this inferencing is improved value handling in the object file (the required thumb bit is set on symbols which are thumb functions). It also helps improve compatibility with binutils. The one complication that arises from this handling is the MCAsmStreamer. The default implementation of getOrCreateSymbolData in MCStreamer does not support tracking the symbol data. In order to support the semantics of thumb functions, track symbol data in assembly streamer. Although O(n) in number of labels in the TU, this is already done in various other streamers and as such the memory overhead is not a practical concern in this scenario. llvm-svn: 204544	2014-03-22 19:26:18 +00:00
Hal Finkel	55805eb562	[PowerPC] Fix the VSX v2f64 return register v2f64 values, like other 128-bit values, are returned under VSX in register vs34 (Altivec register v2). llvm-svn: 204543	2014-03-22 18:24:43 +00:00
Juergen Ributzka	e802d507b0	[Constant Hoisting] Fix multiple entries for the same basic block in PHI nodes. A PHI node usually has only one value/basic block pair per incoming basic block. In the case of a switch statement it is possible that a following PHI node may have more than one such pair per incoming basic block. E.g.: %0 = phi i64 [ 123456, %case2 ], [ 654321, %Entry ], [ 654321, %Entry ] This is valid and the verifier doesn't complain, because both values are the same. Constant hoisting materializes the constant for each operand separately and the value is still the same, but the variable names have changed. As a result the verifier can't recognize anymore that they are the same value and complains. This fix adds special update code for PHI node in constant hoisting to prevent this corner case. This fixes <rdar://problem/16394449> llvm-svn: 204537	2014-03-22 01:49:27 +00:00
Andrea Di Biagio	5b0aacf1c7	[DAG] Fix an assertion failure caused by an invalid cast in method 'BuildVectorSDNode::isConstantSplat' This patch renames method 'isConstantSplat' as 'getConstantSplatValue' (mainly for consistency reasons), and rewrites its logic to ensure that we always perform a legal 'cast<ConstantSDNode>'. Added test shift-combine-crash.ll to verify that DAGCombiner no longer crashes with an assertion failure in the attempt to simplify a vector shift by a vector of all undef counts. llvm-svn: 204536	2014-03-22 01:47:22 +00:00
Rafael Espindola	66f96fe0cb	Fix the value computation in sym_a: sym_d = sym_a + 1 This is the smallest fix I was able to extract from what got reverted in r204203. llvm-svn: 204527	2014-03-21 22:00:29 +00:00
Manman Ren	c935560568	Register allocator: add condition to hoist a spill to outer loop. We make sure a spill is not hoisted to a hotter outer loop by adding a condition. Hoist a spill to outer loop if there are multiple dependents (it can be beneficial if more than one dependent is hoisted) or if DepSV (the hoisting source) is hotter than SV (the hoisting destination). rdar://16268194 llvm-svn: 204522	2014-03-21 21:46:24 +00:00
Duncan P. N. Exon Smith	af777bb37c	InstrProf: Cleanup binary profdata testcase Cleanup the current binary testcase for profile data. - Rename it to something more specific. - Remove the text comparison. - Check the output of llvm-profdata show. llvm-svn: 204518	2014-03-21 21:20:35 +00:00
Duncan P. N. Exon Smith	745a2bf0b8	InstrProf: Change magic number to have non-text characters Include non-text characters in the magic number so that text files can't match. <rdar://problem/15950346> llvm-svn: 204513	2014-03-21 20:42:37 +00:00
Duncan P. N. Exon Smith	531bb481e2	InstrProf: Actually detect bad headers <rdar://problem/15950346> llvm-svn: 204510	2014-03-21 20:42:28 +00:00
David Blaikie	330ec978a6	DebugInfo: Omit DW_AT_addr_base from skeletal type units. Type units have no addresses, so there's no need for DW_AT_addr_base. This removes another relocation from every skeletal type unit and brings LLVM's skeletal type units in line with GCC's (containing only GNU_dwo_name (strp), comp_dir (strp), and GNU_pubnames (flag_present)). Cary's got some ideas about using str_index in the .o file to reduce those last two relocations (well, replace two relocations with one relocation (pointing to the string index) and two indices) llvm-svn: 204506	2014-03-21 20:27:21 +00:00
Chad Rosier	b7747e31ef	[AArch64] Add SchedRW lists to NEON instructions. Previously, only regular AArch64 instructions were annotated with SchedRW lists. This patch does the same for NEON enabling these instructions to be scheduled by the MIScheduler. Additionally, store operations are now modeled and a few SchedRW lists were updated for bug fixes (e.g. multiple def operands). Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 204505	2014-03-21 19:34:41 +00:00
Duncan P. N. Exon Smith	24b4b65339	InstrProf: Read raw binary profile in llvm-profdata Read a raw binary profile that corresponds to a memory dump from the runtime profile. The test is a binary file generated from cfe/trunk/test/Profile/c-general.c with the new compiler-rt runtime and the matching text version of the input. It includes instructions on how to regenerate. <rdar://problem/15950346> llvm-svn: 204496	2014-03-21 18:26:05 +00:00
Matt Arsenault	8e2581b11e	R600/SI: Move instruction patterns to scalar versions. Some of them also had the pattern on both, so this removes the duplication. llvm-svn: 204492	2014-03-21 18:01:18 +00:00
Rafael Espindola	d2bd8def3f	Remove redundant test. This is tested from MC already. llvm-svn: 204491	2014-03-21 18:00:51 +00:00
Rafael Espindola	734f105379	Move codegen test over to MC. llvm-svn: 204490	2014-03-21 17:55:34 +00:00
Justin Bogner	b9bd7f85a7	ProfileData: Introduce InstrProfWriter using the naive text format This isn't a format we'll want to write out in practice, but moving it to the writer library simplifies llvm-profdata and isolates it from further changes to the format. This also allows us to update the tests to not rely on the text output format. llvm-svn: 204489	2014-03-21 17:46:22 +00:00
Rafael Espindola	c07cc8f370	Convert test to using cfi. An unnamed global in llvm still produces a regular symbol. llvm-svn: 204488	2014-03-21 17:38:01 +00:00
Paul Robinson	f03ff490ed	Refactor llvm/test/lit.cfg to use lit.util.which. llvm-svn: 204486	2014-03-21 17:31:35 +00:00
Rafael Espindola	7618632517	Remove redundant test. The production of the .eh symbols is done from MC now and we already have tests for it. llvm-svn: 204483	2014-03-21 17:26:35 +00:00
Justin Bogner	f8d791983c	ProfileData: Introduce the InstrProfReader interface and a text reader This introduces the ProfileData library and updates llvm-profdata to use this library for reading profiles. InstrProfReader is an abstract base class that will be subclassed for both the raw instrprof data from compiler-rt and the efficient instrprof format that will be used for PGO. llvm-svn: 204482	2014-03-21 17:24:48 +00:00
Rafael Espindola	d8eb29ecfd	Split out the MC part of this test. llvm-svn: 204481	2014-03-21 17:16:11 +00:00
Daniel Sanders	f88a29e66a	[mips] Correct lowering of VECTOR_SHUFFLE to VSHF. Summary: VECTOR_SHUFFLE concatenates the vectors in a vectorwise fashion. <0b00, 0b01> + <0b10, 0b11> -> <0b00, 0b01, 0b10, 0b11> VSHF concatenates the vectors in a bitwise fashion: <0b00, 0b01> + <0b10, 0b11> -> 0b0100 + 0b1110 -> 0b01001110 <0b10, 0b11, 0b00, 0b01> We must therefore swap the operands to get the correct result. The test case that discovered the issue was MultiSource/Benchmarks/nbench. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3142 llvm-svn: 204480	2014-03-21 16:56:51 +00:00
Tom Stellard	1583409e33	R600/SI: Handle MUBUF instructions in SIInstrInfo::moveToVALU() llvm-svn: 204476	2014-03-21 15:51:57 +00:00
Tom Stellard	e038720702	R600/SI: Handle S_MOV_B64 in SIInstrInfo::moveToVALU() llvm-svn: 204475	2014-03-21 15:51:54 +00:00
Tom Stellard	edfd81d965	Sink: Don't sink static allocas from the entry block CodeGen treats allocas outside the entry block as dynamically sized stack objects. llvm-svn: 204473	2014-03-21 15:51:51 +00:00
Richard Sandiford	dc6c2c953d	[SystemZ] Add support for z196 float<->unsigned conversions These complement the older float<->signed instructions. llvm-svn: 204451	2014-03-21 10:56:30 +00:00
Kevin Qin	67b9c50c53	Fix test command line to avoid generating output file. llvm-svn: 204437	2014-03-21 07:20:29 +00:00
Juergen Ributzka	f0dff49ad0	[Constant Hoisting] Make the constant materialization cost operand dependent Extend the target hook to take also the operand index into account when calculating the cost of the constant materialization. Related to <rdar://problem/16381500> llvm-svn: 204435	2014-03-21 06:04:45 +00:00
Juergen Ributzka	5429c06b90	[Constant Hoisting] Change the algorithm to only track constants for instructions. Originally the algorithm would search for expensive constants and track their users, which could be instructions and constant expressions. This change only tracks the constants for instructions, but constant expressions are indirectly covered too. If an operand is a constant expression, then we look through the expression to find any expensive constants. The algorithm now keeps track of the instruction and the operand index where the constant is used. This allows more precise hoisting of constant materialization code for PHI instructions, because we only hoist to the basic block of the incoming operand. Before we had to find the idom of all PHI operands and hoist the materialization code there. This also makes updating of instructions easier. Before we had to keep track of the original constant, find it in the instructions, and then replace it. Now we can just simply update the operand. Related to <rdar://problem/16381500> llvm-svn: 204433	2014-03-21 06:04:36 +00:00
Jiangning Liu	db55b02e1c	This reverts commit r203762, "ARM: support emission of complex SO expressions". The commit r203762 introduced silent failure for complext SO expression, and it's even worse than compiler crash. llvm-svn: 204427	2014-03-21 02:51:01 +00:00
Kevin Qin	275ce91243	Fix an assertion caused by using inline asm with indirect register inputs. llvm-svn: 204425	2014-03-21 02:14:50 +00:00
Kevin Qin	b2c78b07d6	[AArch64] Remove .data_region directive from AArch64. .data_region is only used in Darwin, so it shouldn't be generated for other OS. Currently AArch64 doesn't support darwin yet, so I removed it from AArch64. When Darwin is supported someday, we can add it back and associate it with Darwin. llvm-svn: 204424	2014-03-21 02:12:48 +00:00
Rafael Espindola	f1b10242c0	Convert a CodeGen test into a MC test. llvm-svn: 204421	2014-03-21 00:55:42 +00:00
Rui Ueyama	827c8a2b07	Object/COFF: Support large relocation table. NumberOfRelocations field in COFF section table is only 16-bit wide. If an object has more than 65535 relocations, the number of relocations is stored to VirtualAddress field in the first relocation field, and a special flag (IMAGE_SCN_LNK_NRELOC_OVFL) is set to Characteristics field. In test we cheated a bit. I made up a test file so that it has IMAGE_SCN_LNK_NRELOC_OVFL flag but the number of relocations is much smaller than 65535. This is to avoid checking in a large test file just to test a file with many relocations. Differential Revision: http://llvm-reviews.chandlerc.com/D3139 llvm-svn: 204418	2014-03-21 00:44:19 +00:00
Rafael Espindola	2544330a29	Port test to cfi. llvm-svn: 204416	2014-03-21 00:30:24 +00:00
Rafael Espindola	fc72577d92	Convert another CodeGen test into a MC test. llvm-svn: 204412	2014-03-20 23:35:00 +00:00
Weiming Zhao	0152485679	Fix PR19136: [ARM] Fix Folding SP Update into vpush/vpop Since MBB->computeRegisterLiveness() returns Dead for sub regs like s0, d0 is used in vpop instead of updating sp, which causes s0 to be dead before its use. This patch checks the liveness of each subreg to make sure the reg is actually dead. llvm-svn: 204411	2014-03-20 23:28:16 +00:00
Greg Fitzgerald	1843227551	llvm-objdump output hex to match binutils' objdump Patch by Ted Woodward llvm-svn: 204409	2014-03-20 22:55:15 +00:00
Rafael Espindola	df1be1f4c5	Convert CodeGen test into a more specific MC test. llvm-svn: 204406	2014-03-20 22:05:59 +00:00
Rafael Espindola	c889a278fa	Remove unused options from test. llvm-svn: 204401	2014-03-20 21:38:04 +00:00
Rafael Espindola	98629c4e4d	Don't use EmitAbsValue with symbol references. The function exists to force an expression to be absolute, but it is not possible to force a symbol reference since a = b .long a means something else. This is an alternative fix for pr9951 that uses an assert. It then deletes the old pr9951 test that was testing nothing already. llvm-svn: 204399	2014-03-20 21:26:38 +00:00
Juergen Ributzka	46357931ab	Revert "[Constant Hoisting] Extend coverage of the constant hoisting pass." I will break this up into smaller pieces for review and recommit. llvm-svn: 204393	2014-03-20 20:17:13 +00:00
Juergen Ributzka	6dab520c70	[Constant Hoisting] Extend coverage of the constant hoisting pass. This commit extends the coverage of the constant hoisting pass, adds additional debug output and updates the function names according to the style guide. Related to <rdar://problem/16381500> llvm-svn: 204389	2014-03-20 19:55:52 +00:00
Mark Seaborn	b6118c5b17	Remove LowerInvoke's obsolete "-enable-correct-eh-support" option This option caused LowerInvoke to generate code using SJLJ-based exception handling, but there is no code left that interprets the jmp_buf stack that the resulting code maintained (llvm.sjljeh.jblist). This option has been obsolete for a while, and replaced by SjLjEHPrepare. This leaves the default behaviour of LowerInvoke, which is to convert invokes to calls. Differential Revision: http://llvm-reviews.chandlerc.com/D3136 llvm-svn: 204388	2014-03-20 19:54:47 +00:00
Eric Christopher	384f3feb2d	Reapply DW_AT_low/high_pc patch: Use the range machinery for DW_AT_ranges and DW_AT_high/lo_pc. This commit moves us from a single range per subprogram to extending ranges if we are: a) In the same section, and b) In the same enclosing CU. This means we have more fine grained ranges for compile units, and fewer ranges overall when we have multiple functions in the same CU adjacent to each other in the object file. Also remove all of the earlier hacks around this functionality for function sections etc. Also update all of the testcases to take into account the merging functionality. with a fix for location entries in the debug_loc section: Make sure that debug loc entries are relative to the low_pc of the compile unit. This means that when we only have a single range that the offset should be just relative to the low_pc of the unit, for multiple ranges for a CU this means that we'll be relative to 0 which we emit along with DW_AT_ranges. This mostly shows up with linked binaries, so add a testcase with multiple CUs so that our location is going to be offset of a CU with a non-zero low_pc. llvm-svn: 204377	2014-03-20 19:16:16 +00:00
David Blaikie	7ac51493d6	Add comments from Eric's review of r204094. llvm-svn: 204358	2014-03-20 17:05:45 +00:00
Mark Seaborn	277fbe1bfe	Add a test for LowerInvoke that doesn't use "-enable-correct-eh-support" None of the existing tests for LowerInvoke check LowerInvoke's output, and all but one use "-enable-correct-eh-support", which is obsolete, so those tests will be removed when that option is removed. To make sure LowerInvoke will still have test coverage, this adds a test for its default mode which converts invokes to calls. Differential Revision: http://llvm-reviews.chandlerc.com/D3124 llvm-svn: 204344	2014-03-20 14:12:47 +00:00
Kai Nacke	93fe5e810d	[MIPS] Add cpu octeon and some instructions The Octeon cpu from Cavium Networks is mips64r2 based and has an extended instruction set. In order to utilize this with LLVM, a new cpu feature "octeon" and a subtarget feature "cnmips" are added. A small set of new instructions (baddu, dmul, pop, dpop, seq, sne) is also added. LLVM generates dmul, pop and dpop instructions with option -mcpu=octeon or -mattr=+cnmips. llvm-svn: 204337	2014-03-20 11:51:58 +00:00
Alexander Potapenko	7aafd31dad	[ASan] Add -asan-module to the ASan .ll tests. After the -asan pass had been split into -asan (function-level) and -asan-module (module-level) some of the tests have silently stopped working, because they didn't instrument the globals anymore. We've decided to have every test using both passes, irrespective of the presence of globals in it. llvm-svn: 204335	2014-03-20 11:16:34 +00:00
Alexander Potapenko	04969e8b31	[ASan] Do not instrument globals from the llvm.metadata section. Fixes https://code.google.com/p/address-sanitizer/issues/detail?id=279. llvm-svn: 204331	2014-03-20 10:48:34 +00:00
Zoran Jovanovic	a0f5328984	Provide an operand for microMIPS wait instruction. llvm-svn: 204329	2014-03-20 10:41:37 +00:00
Zoran Jovanovic	87d13e5ec1	Implementation of microMIPS 16-bit instructions MOVE and JALR. Differential Revision: http://llvm-reviews.chandlerc.com/D3112 llvm-svn: 204325	2014-03-20 10:18:24 +00:00
Zoran Jovanovic	28221d8bc1	Mark alias symbols as microMIPS if necessary. Differential Revision: http://llvm-reviews.chandlerc.com/D3080 llvm-svn: 204323	2014-03-20 09:44:49 +00:00
Craig Topper	ccb38c5588	Test case for r204305. llvm-svn: 204316	2014-03-20 06:45:10 +00:00
David Majnemer	798e548955	Object: Output .file symbols properly obj2yaml would emit the NUL bytes padding the auxiliary file symbol records. Trimming them looks nicer. llvm-svn: 204314	2014-03-20 06:29:02 +00:00
Saleem Abdulrasool	39f773f939	Reapply 'ARM IAS: support .thumb_set' Re-apply the change after it was reverted because of conflicts caused by another change being reverted. llvm-svn: 204306	2014-03-20 06:05:33 +00:00
Hao Liu	40b5ab8e5b	[ARM]Fix an assertion failure in A15SDOptimizer about DPair reg class by treating DPair as QPR. llvm-svn: 204304	2014-03-20 05:36:59 +00:00
Rafael Espindola	7fadc0ea7d	Look through variables when computing relocations. Given bar = foo + 4 .long bar MC would eat the 4. GNU as includes it in the relocation. The rule seems to be that a variable that defines a symbol is used in the relocation and one that does not define a symbol is evaluated and the result included in the relocation. Fixing this unfortunately required some other changes: * Since the variable is now evaluated, it would prevent the ELF writer from noticing the weakref marker the elf streamer uses. This patch then replaces that with a VariantKind in MCSymbolRefExpr. * Using VariantKind then requires us to look past other VariantKind to see .weakref bar,foo call bar@PLT doing this also fixes zed = foo +2 call zed@PLT so that is a good thing. * Looking past VariantKind means that the relocation selection has to use the fixup instead of the target. This is a reboot of the previous fixes for MC. I will watch the sanitizer buildbot and wait for a build before adding back the previous fixes. llvm-svn: 204294	2014-03-20 02:12:01 +00:00
Eric Christopher	e9551ec1a0	Revert "Use the range machinery for DW_AT_ranges and DW_AT_high/lo_pc." This appears to trigger failures with optimization and function arguments somehow. This reverts commit r204277. llvm-svn: 204286	2014-03-20 00:12:06 +00:00
Eric Christopher	e33c990616	Use the range machinery for DW_AT_ranges and DW_AT_high/lo_pc. This commit moves us from a single range per subprogram to extending ranges if we are: a) In the same section, and b) In the same enclosing CU. This means we have more fine grained ranges for compile units, and fewer ranges overall when we have multiple functions in the same CU adjacent to each other in the object file. Also remove all of the earlier hacks around this functionality for function sections etc. Also update all of the testcases to take into account the merging functionality. llvm-svn: 204277	2014-03-19 22:42:36 +00:00
Matt Arsenault	d06ebd93e6	R600/SI: Add support for 64-bit LDS writes llvm-svn: 204274	2014-03-19 22:19:54 +00:00
Matt Arsenault	b943348cb9	R600/SI: Add support for 64-bit LDS loads. v2: -Use correct opcode for DS_READ_64 llvm-svn: 204273	2014-03-19 22:19:52 +00:00
Matt Arsenault	99ed78926b	R600/SI: Match i16 immediate offset of LDS instructions. llvm-svn: 204272	2014-03-19 22:19:49 +00:00
Matt Arsenault	43eeee182a	R600/SI: Fix test checking wrong instruction operand. The source and destination happen to be the same register. llvm-svn: 204271	2014-03-19 22:19:45 +00:00
Matt Arsenault	547aff20f5	R600/SI: Don't display the GDS bit. It isn't actually used now, and probably never will be, plus it makes tests less annoying. I also think SC prints GDS instructions as a separate instruction name. llvm-svn: 204270	2014-03-19 22:19:43 +00:00
Matheus Almeida	004d61f698	[mips] Making sure that a '.set noreorder' directive is correctly parsed and emitted and that no NOPs are emitted in a 'noreorder section'. llvm-svn: 204250	2014-03-19 16:20:19 +00:00
Evgeniy Stepanov	2275a01a44	Set debug info for instructions inserted in SplitBlockAndInsertIfThen. llvm-svn: 204230	2014-03-19 12:56:38 +00:00
David Majnemer	ddf28f2b79	Object: Provide a richer means of describing auxiliary symbols The current state of affairs has auxiliary symbols described as a big bag of bytes. This is less than satisfying, it detracts from the YAML file as being human readable. Instead, allow for symbols to optionally contain their auxiliary data. This allows us to have a much higher level way of describing things like weak symbols, function definitions and section definitions. This depends on D3105. Differential Revision: http://llvm-reviews.chandlerc.com/D3092 llvm-svn: 204214	2014-03-19 04:47:47 +00:00
Justin Bogner	618bcea714	llvm-profdata: Make "merge" into a subcommand. We'll be adding a few more subcommands in the near future. llvm-svn: 204211	2014-03-19 02:20:46 +00:00
Justin Bogner	38fff8682b	llvm-profdata: Update to use the naive text format with function hash This also uses line_iterator to simplify the parsing logic. llvm-svn: 204210	2014-03-19 02:20:42 +00:00
Rafael Espindola	a73744e894	Make the test harder by using a non-zero offset. llvm-svn: 204205	2014-03-19 00:26:58 +00:00
Rafael Espindola	7bbd5c2636	Revert "Add back r203962, r204028 and r204059." This reverts commit r204178. llvm-svn: 204203	2014-03-19 00:13:43 +00:00
David Blaikie	47f4b82d8b	DebugInfo: Use the comp_dir of the referencing type units when building debug_line.dwo This isn't a complete fix - it falls back to non-comp_dir when multiple compile units are in play. Adding a map of comp_dir to table is part of the more general solution, but I gave up (in the short term) when I realized I'd also have to calculate the size of each type unit so as to produce correct DW_AT_stmt_list attributes. llvm-svn: 204202	2014-03-19 00:11:28 +00:00
Eli Bendersky	2281ef91e6	Expose "noduplicate" attribute as a property for intrinsics. The "noduplicate" function attribute exists to prevent certain optimizations from duplicating calls to the function. This is important on platforms where certain function call duplications are unsafe (for example execution barriers for CUDA and OpenCL). This patch makes it possible to specify intrinsics as "noduplicate" and translates that to the appropriate function attribute. llvm-svn: 204200	2014-03-18 23:51:07 +00:00
Rui Ueyama	f078eff39c	Object/COFF: Add function to check if section number is reserved one. Differential Revision: http://llvm-reviews.chandlerc.com/D3103 llvm-svn: 204199	2014-03-18 23:37:53 +00:00
NAKAMURA Takumi	2e21f63462	Move yet another test that requires ARM to an ARM test directory. llvm-svn: 204198	2014-03-18 23:12:09 +00:00
Jim Grosbach	e93b257c6a	Move tests that require ARM to an ARM test directory. llvm-svn: 204197	2014-03-18 22:43:59 +00:00
Duncan P. N. Exon Smith	cb1c81afa0	Fix use_iterator crash in ObjCArc from r203364 The use_iterator redesign in r203364 introduced an increment past the end of a range in -objc-arc-contract. Added an explicit check for the end of the range. <rdar://problem/16333235> llvm-svn: 204195	2014-03-18 22:32:43 +00:00
Jim Grosbach	448334a738	Darwin: Add assembler directives to create version-min load commands. Allow object files to be tagged with a version-min load command for iOS or MacOSX. Teach macho-dump to understand the version-min load commands for testcases. rdar://11337778 llvm-svn: 204190	2014-03-18 22:09:05 +00:00
Rafael Espindola	574bfa12fa	Add back r203962, r204028 and r204059. This reverts commit r204137. This includes a fix for handling aliases of aliases. llvm-svn: 204178	2014-03-18 20:40:38 +00:00
Hans Wennborg	aec21ce43e	X86 memcpy lowering: use "rep movs" even when esi is used as base pointer For functions where esi is used as base pointer, we would previously fall back from lowering memcpy with "rep movs" because that clobbers esi. With this patch, we just store esi in another physical register, and restore it afterwards. This adds a little bit of register pressure, but the more efficient memcpy should be worth it. Differential Revision: http://llvm-reviews.chandlerc.com/D2968 llvm-svn: 204174	2014-03-18 20:04:34 +00:00
Michael Zolotukhin	7ac41056c8	Fix test lsr-normalization.ll broken in r204161. llvm-svn: 204166	2014-03-18 18:17:59 +00:00
Raul E. Silvera	a9dafe6793	Add support for scalarizing/splitting vector bswap. Summary: SLP Vectorization of intrinsics (r203707) has exposed cases where the expansion of vector bswap is failing (PR19151). Reviewers: hfinkel CC: chandlerc Differential Revision: http://llvm-reviews.chandlerc.com/D3104 llvm-svn: 204163	2014-03-18 17:49:12 +00:00
Michael Zolotukhin	ed0a7761e5	Add stride normalization to SCEV Normalize/Denormalize transformation. llvm-svn: 204161	2014-03-18 17:34:03 +00:00
Andrea Di Biagio	28f46d9f39	[DAGCombiner] teach how to simplify xor/and/or nodes according to the following rules: 1) (AND (shuf (A, C, Mask), shuf (B, C, Mask)) -> shuf (AND (A, B), C, Mask) 2) (OR (shuf (A, C, Mask), shuf (B, C, Mask)) -> shuf (OR (A, B), C, Mask) 3) (XOR (shuf (A, C, Mask), shuf (B, C, Mask)) -> shuf (XOR (A, B), V_0, Mask) 4) (AND (shuf (C, A, Mask), shuf (C, B, Mask)) -> shuf (C, AND (A, B), Mask) 5) (OR (shuf (C, A, Mask), shuf (C, B, Mask)) -> shuf (C, OR (A, B), Mask) 6) (XOR (shuf (C, A, Mask), shuf (C, B, Mask)) -> shuf (V_0, XOR (A, B), Mask) llvm-svn: 204160	2014-03-18 17:12:59 +00:00
Bill Schmidt	ff9622ef0e	Fix PR19144: Incorrect offset generated for int-to-fp conversion at -O0. When converting a signed 32-bit integer to double-precision floating point on hardware without a lfiwax instruction, we have to instead use a lfd followed by fcfid. We were erroneously offsetting the address by 4 bytes in preparation for either a lfiwax or lfiwzx when generating the lfd. This fixes that silly error. This was not caught in the test suite since the conversion tests were run with -mcpu=pwr7, which implies availability of lfiwax. I've added another test case for older hardware that checks the code we expect in the absence of lfiwax and other flavors of fcfid. There are fewer tests in this test case because we punt to DAG selection in more cases on older hardware. (We must generate complex fiddly sequences in those cases, and there is marginal benefit in duplicating that logic in fast-isel.) llvm-svn: 204155	2014-03-18 14:32:50 +00:00
Evgeniy Stepanov	302964ee92	[msan] Origin tracking with history. LLVM part of MSan implementation of advanced origin tracking, when we record not only creation point, but all locations where an uninitialized value was stored to memory, too. llvm-svn: 204151	2014-03-18 13:30:56 +00:00
Diego Novillo	213bb00245	Tolerate unmangled names in sample profiles. Summary: The compiler does not always generate linkage names. If a function has been inlined and its body elided, its linkage name may not be generated. When the binary executes, the profiler will use its unmangled name when attributing samples. This results in unmangled names in the input profile. We are currently failing hard when this happens. However, in this case all that happens is that we fail to attribute samples to the inlined function. While this means fewer optimization opportunities, it should not cause a compilation failure. This patch accepts all valid function names, regardless of whether they were mangled or not. Reviewers: chandlerc CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3087 llvm-svn: 204142	2014-03-18 12:03:12 +00:00
Alexander Kornienko	64de613751	Revert r203962 and two revisions depending on it: r204028 and r204059. The revision I'm reverting breaks handling of transitive aliases. This blocks us and breaks sanitizer bootstrap: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/2651 (and checked locally by Alexey). This revision is the result of: svn merge -r204059:204058 -r204028:204027 -r203962:203961 . + the regression test added to test/MC/ELF/alias.s Another way to reproduce the regression with clang: $ cat q.c void a1(); void a2() __attribute__((alias("a1"))); void a3() __attribute__((alias("a2"))); void a1() {} $ ~/work/llvm-build/bin/clang-3.5-good -c q.c && mv q.o good.o && \ ~/work/llvm-build/bin/clang-3.5-bad -c q.c && mv q.o bad.o && \ objdump -t good.o bad.o good.o: file format elf64-x86-64 SYMBOL TABLE: 0000000000000000 l df ABS 0000000000000000 q.c 0000000000000000 l d .text 0000000000000000 .text 0000000000000000 l d .data 0000000000000000 .data 0000000000000000 l d .bss 0000000000000000 .bss 0000000000000000 l d .comment 0000000000000000 .comment 0000000000000000 l d .note.GNU-stack 0000000000000000 .note.GNU-stack 0000000000000000 l d .eh_frame 0000000000000000 .eh_frame 0000000000000000 g F .text 0000000000000006 a1 0000000000000000 g F .text 0000000000000006 a2 0000000000000000 g F .text 0000000000000006 a3 bad.o: file format elf64-x86-64 SYMBOL TABLE: 0000000000000000 l df ABS 0000000000000000 q.c 0000000000000000 l d .text 0000000000000000 .text 0000000000000000 l d .data 0000000000000000 .data 0000000000000000 l d .bss 0000000000000000 .bss 0000000000000000 l d .comment 0000000000000000 .comment 0000000000000000 l d .note.GNU-stack 0000000000000000 .note.GNU-stack 0000000000000000 l d .eh_frame 0000000000000000 .eh_frame 0000000000000000 g F .text 0000000000000006 a1 0000000000000000 g F .text 0000000000000006 a2 0000000000000000 g .text 0000000000000000 a3 llvm-svn: 204137	2014-03-18 10:36:11 +00:00
NAKAMURA Takumi	7a1ac3b89b	CodeGen/R600/v_cndmask.ll: Relax an expression to unbreak msvcrt. V_CNDMASK_B32_e64 v0, v0, -1.#QNAN0e+00, s[2:3], 0, 0, 0, 0 FIXME: We really need to implement our formatter... llvm-svn: 204118	2014-03-18 06:17:22 +00:00
NAKAMURA Takumi	4dc097ad7c	DebugInfo/lto-comp-dir.ll: Tweak for dos path. llvm-svn: 204117	2014-03-18 06:01:14 +00:00
Adrian Prantl	1a1647cab6	Switch the type field in DIVariable and DIGlobalVariable over to DITypeRefs. This allows us to catch more opportunities for ODR-based type uniquing during LTO. Paired commit with CFE which updates some testcases to verify the new DIBuilder behavior. llvm-svn: 204106	2014-03-18 02:34:58 +00:00
David Blaikie	8287aff1cc	DebugInfo: Avoid emitting standard opcode lengths in debug_line.dwo headers where opcodes are never used anyway Introduce a slightly tighter wrapper around the header structure that handles this use case. (MCDwarfDwoLineTable) llvm-svn: 204101	2014-03-18 02:13:23 +00:00
David Blaikie	4a2f95f60e	DebugInfo: Implement debug_line.dwo for file names used in type units during -gsplit-dwarf This removes an attribute (and more importantly, a relocation) from skeleton type units and removes some unnecessary file names from the debug_line section that remains in the .o (and linked executable) file. There's still a few places we could shave off some more space here: * use compilation dir of the underlying compilation unit (since all the type units share that compilation dir - though this would be more complicated in LTO cases where they don't (keep a map of compilation dir->line table header?)) * Remove some of the unnecessary header fields from the line table since they're not needed in this situation (about 12 bytes per table). llvm-svn: 204099	2014-03-18 01:17:26 +00:00
David Blaikie	9a6f9a4c68	DebugInfo: Flag test as requiring object emission support Cleans up buildbot failures on R600 and similar. llvm-svn: 204095	2014-03-18 00:12:25 +00:00
David Blaikie	e05274d7d9	DebugInfo: Do not rely on the compilation dir (index 0) for files in line tables shared between compilation units When emitting assembly there's no support for emitting separate line tables for each compilation unit - so LLVM emits .loc directives producing a single line table. Line tables have an implicit directory (index 0) equal to the compilation directory (DW_AT_comp_dir) of the compilation unit that references them. If multiple compilation units (with possibly disparate compilation directories) reference the same line table, we must avoid relying on this ambiguous directory. Achieve this by simply not setting the compilation directory on the line table when we're in this situation (multiple units while emitting assembly). llvm-svn: 204094	2014-03-18 00:11:48 +00:00
David Blaikie	c7f29dc068	DebugInfo: Move line table zero-directory-index (compilation dir) handling into MCDwarf Our handling of compilation directory in DwarfDebug was broken (incorrectly using the 'last' compilation directory (that of the last CU in the metadata list) for all function emission in any CU). By moving this handling down into MCDwarf the issue is fixed as the compilation dir is tracked correctly per line table. llvm-svn: 204089	2014-03-17 23:29:40 +00:00
Dan Gohman	172c5d3451	Use range metadata instead of introducing selects. When GlobalOpt has determined that a GlobalVariable only ever has two values, it would convert the GlobalVariable to a boolean, and introduce SelectInsts at every load, to choose between the two possible values. These SelectInsts introduce overhead and other unpleasantness. This patch makes GlobalOpt just add range metadata to loads from such GlobalVariables instead. This enables the same main optimization (as seen in test/Transforms/GlobalOpt/integer-bool.ll), without introducing selects. The main downside is that it doesn't get the memory savings of shrinking such GlobalVariables, but this is expected to be negligible. llvm-svn: 204076	2014-03-17 19:57:04 +00:00
Kevin Enderby	8d761cc56d	Making a guess to fix the test case with r204056 to get the build bot working. llvm-svn: 204073	2014-03-17 19:00:03 +00:00
Matt Arsenault	fae02989b7	R600: Match sign_extend_inreg to BFE instructions llvm-svn: 204072	2014-03-17 18:58:11 +00:00
Matt Arsenault	985b9de485	Make DAGCombiner work on vector bitshifts with constant splat vectors. llvm-svn: 204071	2014-03-17 18:58:01 +00:00
Saleem Abdulrasool	11543a9953	ARM IAS: support .thumb_set This performs the equivalent of a .set directive in that it creates a symbol which is an alias for another symbol or value which may possibly be yet undefined. This directive also has the added property in that it marks the aliased symbol as being a thumb function entry point, in the same way that the .thumb_func directive does. The current implementation fails one test due to an unrelated issue. Functions within .thumb sections are not marked as thumb_func. The result is that the aliasee function is not valued correctly. llvm-svn: 204059	2014-03-17 17:13:54 +00:00
Adam Nemet	24381f1cb7	[VectorLegalizer/X86] Don't unvectorize fp_to_uint for v8f32->v8i16 Rather than LegalizeAction::Expand, this needs LegalizeAction::Promote to get promoted to fp_to_sint v8f32->v8i32. This is a legal operation on AVX. For that to work properly, we also need to teach the legalizer about the specific promotion required here. The default vector promotion uses bitcasting to a vector type of the same total size. We want to promote the vector element type, effectively widening the operation and then truncating the result. This is analogous to the current logic of how int_to_fp is promoted. The change also factors out some code from the int_to_fp promotion code to ValueType::widenIntegerVectorElementType. This is now shared between int_to_fp and fp_to_int. There is no longer need for the custom lowering of fp_to_sint f32->v8i16 in X86. It can now go through the new target-independent fp_to_*int promotion logic. I also checked that no other target uses Promote for these ops yet, so there shouldn't be any unexpected change in behavior. Fixes <rdar://problem/16202247> llvm-svn: 204058	2014-03-17 17:06:14 +00:00
Tom Stellard	d0084464b5	R600/SI: Fix implementation of isInlineConstant() used by the verifier The type of the immediates should not matter as long as the encoding is equivalent to the encoding of one of the legal inline constants. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 204056	2014-03-17 17:03:52 +00:00
Tom Stellard	fbe435de63	R600/SI: Use correct dest register class for V_READFIRSTLANE_B32 This instruction writes to a 32-bit SGPR. This change required adding the 32-bit VCC_LO and VCC_HI registers, because the full VCC register is 64 bits. This fixes verifier errors on several of the indirect addressing piglit tests. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 204055	2014-03-17 17:03:51 +00:00
NAKAMURA Takumi	3b3a4d9dac	llvm/test/MC/MachO/gen-dwarf-cpp.s: Relax an expression to match DOS pat. llvm-svn: 204030	2014-03-17 05:31:54 +00:00
Rafael Espindola	f863a3e2ec	Consider the base pointer for setting the symbol type. This is really a consistency fix. Since given a = b we propagate the information, we should propagate it too given a = b + (1 - 1) Fixes pr19145. llvm-svn: 204028	2014-03-17 04:29:51 +00:00
David Blaikie	c714ef4581	DebugInfo: Improve reuse of file table entries in asm debug info The previous deduping strategy was woefully inadequate - it only considered the most recent file used and avoided emitting a duplicate in that case - never considering the a/b/a scenario. It was also lacking when it came to directory paths as the previous filename would never match the current if the filename had been split into file and directory components. This change builds caching functionality into the line table at the lowest level in an optional form (a file number of 0 indicates that one should be chosen and returned) and will eventually be reused by the normal source level debugging DWARF emission. llvm-svn: 204027	2014-03-17 01:52:11 +00:00
David Blaikie	8bef7cd876	Test case llvm-svn: 204026	2014-03-17 01:52:04 +00:00
Nico Rieck	8678acd5ed	llvm-readobj: Print referred symbol name for CLR token definition llvm-svn: 204024	2014-03-17 01:46:52 +00:00
Nico Rieck	effcd4ba7a	llvm-readobj: Add test for COFF auxiliary symbols as used by C++/CLI llvm-svn: 204023	2014-03-17 01:46:28 +00:00
Lang Hames	7c8189c6d3	[X86] New and improved VZeroUpperInserter optimization. - Adds support for inserting vzerouppers before tail-calls. This is enabled implicitly by having MachineInstr::copyImplicitOps preserve regmask operands, which allows VZeroUpperInserter to see where tail-calls use vector registers. - Fixes a bug that caused the previous version of this optimization to miss some vzeroupper insertion points in loops. (Loops-with-vector-code that followed loops-without-vector-code were mistakenly overlooked by the previous version). - New algorithm never revisits instructions. Fixes <rdar://problem/16228798> llvm-svn: 204021	2014-03-17 01:22:54 +00:00
Benjamin Kramer	049784ec50	Use a fixed subtarget for test so atom scheduling can't change the addresses this test relies on. llvm-svn: 204014	2014-03-15 23:01:29 +00:00
NAKAMURA Takumi	64587433ce	llvm/test/Transforms/SampleProfile/syntax.ll: Suppress checking the message catalog in ENOENT. It is locale-dependent on Windows. llvm-svn: 203997	2014-03-15 02:32:21 +00:00
Rui Ueyama	cec949af13	Object/COFF: change data type of SymbolNumber from int16 to uint16. Microsoft PE/COFF Spec clearly states that the field is of signed integer type. However, in reality, it's unsigned. If cl.exe needs to create a large number of sections for COMDAT sections, it will just create more than 32768 sections. Handling a large section number as a negative number is not correct. I think this is a spec bug. Differential Revision: http://llvm-reviews.chandlerc.com/D3088 llvm-svn: 203986	2014-03-15 00:04:08 +00:00
Adrian Prantl	2e4e62e2cc	Debug info: Unique types before emitting them to DWARF, where applicable. llvm-svn: 203983	2014-03-14 23:08:29 +00:00
Adrian Prantl	d1e6a4e189	Debug Info: Fix LTO type uniquing for C++ member declarations based on the ODR. This adds an OdrMemberMap to DwarfDebug which is used to unique C++ member function declarations based on the unique identifier of their containing class and their mangled name. We can't use the usual DIRef mechanism here because DIScopes are indexed using their entire MDNode, including decl_file and decl_line, which need not be unique (see testcase). Prior to this change multiple redundant member function declarations would end up in the same uniqued DW_TAG_class_type. llvm-svn: 203982	2014-03-14 23:08:25 +00:00
Adrian Prantl	5a4b90deae	Re-add checks that were in this testcase before it was converted to dwarfdump. llvm-svn: 203981	2014-03-14 23:08:21 +00:00
Diego Novillo	a32aa3251c	Use DiagnosticInfo facility. Summary: The sample profiler pass emits several error messages. Instead of just aborting the compiler with report_fatal_error, we can emit better messages using DiagnosticInfo. This adds a new sub-class of DiagnosticInfo to handle the sample profiler. Reviewers: chandlerc, qcolombet CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3086 llvm-svn: 203976	2014-03-14 21:58:59 +00:00
Eric Christopher	09d1c0f85d	Remove command line option for CU hashing. This is on by default now. Fix up testcases and use of flag. llvm-svn: 203973	2014-03-14 21:20:07 +00:00
Eric Christopher	a2a6e927c8	Make the arbitrary section name be something mach-o compatible. llvm-svn: 203972	2014-03-14 21:16:54 +00:00
Eric Christopher	4dd947aa02	If we see that we're emitting code for a function that doesn't have any lexical scopes then go ahead and turn on DW_AT_ranges for the compile unit since we would be claiming to describe in the CU a range for which we don't have information in the CU otherwise. llvm-svn: 203969	2014-03-14 20:53:49 +00:00
Eric Christopher	3a70d0083f	Remove the -generate-dwarf-cu-ranges flag. Rewrite a couple of testcases that covered areas normally reached by turning it on so that they follow the logic instead. llvm-svn: 203968	2014-03-14 20:53:43 +00:00
Rafael Espindola	8953f81f67	Correctly handle an ELF symbol defined with "a = b + expr". We were marking the symbol as absolute instead of computing b's offset + the expression value. This fixes pr19126. llvm-svn: 203962	2014-03-14 20:09:04 +00:00
Ulrich Weigand	f445399870	[ppc64] Avoid copy relocs in named rodata sections Commit r181723 introduced code to avoid placing initialized variables needing relocations into the .rodata section, which avoids copy relocs that do not work as expected on ppc64 function references. The same treatment is also needed for named .rodata.XXX sections. This patch changes PPC64LinuxTargetObjectFile::SelectSectionForGlobal to modify "Kind" before calling the default SelectSectionForGlobal routine, instead of first calling the default routine and then just checking for the (main) .rodata section afterwards. llvm-svn: 203921	2014-03-14 12:45:22 +00:00
Oliver Stannard	f010b9850c	Generalise assembly tests to not rely on anonymous symbol names llvm-svn: 203909	2014-03-14 09:10:26 +00:00
Evgeniy Stepanov	49e2625144	AddressSanitizer instrumentation for MOV and MOVAPS. This is an initial version of *Sanitizer instrumentation of assembly code. Patch by Yuri Gorshenin. llvm-svn: 203908	2014-03-14 08:58:04 +00:00
Simon Atanasyan	a3130a4bad	[yaml2obj][ELF] Assign a name (.shstrtab) to the section that holds section names. llvm-svn: 203897	2014-03-14 06:53:16 +00:00
Eric Christopher	af7eca2da4	Use DW_AT_linkage_name when we're emitting DWARF4 or above. llvm-svn: 203867	2014-03-13 23:26:25 +00:00
Rafael Espindola	2fb5bc33a3	Remove the linker_private and linker_private_weak linkages. These linkages were introduced some time ago, but it was never very clear what exactly their semantics were or what they should be used for. Some investigation found these uses: * utf-16 strings in clang. * non-unnamed_addr strings produced by the sanitizers. It turns out they were just working around a more fundamental problem. For some sections a MachO linker needs a symbol in order to split the section into atoms, and llvm had no idea that was the case. I fixed that in r201700 and it is now safe to use the private linkage. When the object ends up in a section that requires symbols, llvm will use a 'l' prefix instead of a 'L' prefix and things just work. With that, these linkages were already dead, but there was a potential future user in the objc metadata information. I am still looking at CGObjcMac.cpp, but at this point I am convinced that linker_private and linker_private_weak are not what they need. The objc uses are currently split in * Regular symbols (no '\01' prefix). LLVM already directly provides whatever semantics they need. * Uses of a private name (start with "\01L" or "\01l") and private linkage. We can drop the "\01L" and "\01l" prefixes as soon as llvm agrees with clang on L being ok or not for a given section. I have two patches in code review for this. * Uses of private name and weak linkage. The last case is the one that one could think would fit one of these linkages. That is not the case. The semantics are * the linker will merge these symbols by name. * the linker will hide them in the final DSO. Given that the merging is done by name, any of the private (or internal) linkages would be a bad match. They allow llvm to rename the symbols, and that is really not what we want. From the llvm point of view, these objects should really be (linkonce|weak)(_odr)?. For now, just keeping the "\01l" prefix is probably the best for these symbols. If we one day want to have a more direct support in llvm, IMHO what we should add is not a linkage, it is just a hidden_symbol attribute. It would be applicable to multiple linkages. For example, on weak it would produce the current behavior we have for objc metadata. On internal, it would be equivalent to private (and we should then remove private). llvm-svn: 203866	2014-03-13 23:18:37 +00:00
Owen Anderson	9b8f9c3d95	Fix a bug in InstCombine where we would incorrectly attempt to construct a bitcast between pointers of two different address spaces if they happened to have the same pointer size. llvm-svn: 203862	2014-03-13 22:51:43 +00:00
Kevin Enderby	3de14bc77e	Add -mtriple=x86_64-linux to this test case to fix the build bots. The original commit was r203829. llvm-svn: 203844	2014-03-13 20:31:19 +00:00
Ekaterina Romanova	8d62008ecb	Fix for http://llvm.org/bugs/show_bug.cgi?id=18590 This patch fixes the bug in the peephole optimization that folds a load which defines one vreg into the one and only use of that vreg. With debug info, a DBG_VALUE that referenced the vreg was considered to be a use, preventing the optimization. The fix is to ignore DBG_VALUEs during the optimization, and undef a DBG_VALUE that references a vreg that gets removed. Patch by Trevor Smigiel! llvm-svn: 203829	2014-03-13 18:47:12 +00:00
Rafael Espindola	4269b9eed5	Use printable names to implement directional labels. This changes the implementation of local directional labels to use a dedicated map. With that it can then just use CreateTempSymbol, which is what the rest of MC uses. CreateTempSymbol doesn't do a great job at making sure the names are unique (or being efficient when the names are not needed), but that should probably be fixed in a followup patch. This fixes pr18928. llvm-svn: 203826	2014-03-13 18:09:26 +00:00
Tom Stellard	08ef1233c6	R600: LDS instructions shouldn't implicitly define OQAP LDS instructions are pseudo instructions which model the OQAP defs and uses within a single instruction. This fixes a hang in the opencv MedianFilter tests. llvm-svn: 203818	2014-03-13 17:13:04 +00:00
Mark Seaborn	3f73533a9d	Cleanup: Remove use of old "-enable-correct-eh-support" option from a test This option enables LowerInvoke's obsolete SJLJ EH support, but the target used in this test (ARM Darwin) no longer uses the LowerInvoke pass, so the option has no effect here. This target currently uses the newer SjLjEHPrepare pass instead. This cleanup will help with removing "-enable-correct-eh-support". Differential Revision: http://llvm-reviews.chandlerc.com/D3064 llvm-svn: 203810	2014-03-13 16:23:00 +00:00
Hans Wennborg	89050436e6	[ARM] Use symbolic register names in .cfi directives only with IAS (PR19110) This is a follow-up to r203635. Saleem pointed out that since symbolic register names are much easier to read, it would be good if we could turn them off only when we really need to because we're using an external assembler. Differential Revision: http://llvm-reviews.chandlerc.com/D3056 llvm-svn: 203806	2014-03-13 15:56:41 +00:00
Manuel Jacob	a7c48f99ae	CodeGenPrep: sink extends of illegal types into use block. Summary: This helps the instruction selector to lower an i64 * i64 -> i128 multiplication into a single instruction on targets which support it. This is an update of D2973 which was reverted because of a bug reported as PR19084. Reviewers: t.p.northover, chapuni Reviewed By: t.p.northover CC: llvm-commits, alex, chapuni Differential Revision: http://llvm-reviews.chandlerc.com/D3021 llvm-svn: 203797	2014-03-13 13:36:25 +00:00

23444 Commits