llvm-project

Commit Graph

Author	SHA1	Message	Date
Erik Verbruggen	59a1219846	InstCombine: merge constants in both operands of icmp. Transform: icmp X+Cst2, Cst into: icmp X, Cst-Cst2 when Cst-Cst2 does not overflow, and the add has nsw. llvm-svn: 204912	2014-03-27 11:16:05 +00:00
Daniel Sanders	d897b564ca	[mips] Stop caching the result of hasMips64(), isABI_O32(), isABI_N32(), and isABI_N64() from MipsSubTarget in MipsTargetLowering Summary: The short name is quite convenient so provide an accessor for them instead. No functional change Depends on D3177 Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3178 llvm-svn: 204911	2014-03-27 10:46:12 +00:00
Elena Demikhovsky	bb2f6b72d3	AVX-512: Implemented masking for integer arithmetic & logic instructions. By Robert Khasanov rob.khasanov@gmail.com llvm-svn: 204906	2014-03-27 09:45:08 +00:00
Stepan Dyatkovskiy	e8747e30ef	Rejected r204899 and r204900 due to remaining test failures on cmake-llvm-x86_64-linux buildbot. llvm-svn: 204901	2014-03-27 08:38:18 +00:00
Stepan Dyatkovskiy	3530003008	Fix for pr18931: Crash using integrated assembler with immediate arithmetic Fix description: Expressions like 'cmp r0, #(l1 - l2) >> 3' could not be evaluated on asm parsing stage, since it is impossible to resolve labels on this stage. In the end of stage we still have expression (MCExpr). Then, when we want to encode it, we expect it to be an immediate, but it still an expression. Patch introduces a Fixup (MCFixup instance), that is processed after main encoding stage. llvm-svn: 204899	2014-03-27 07:49:39 +00:00
Jiangning Liu	1d3f2c7c82	ARM: raise error message when complex SO expressions can't really be solved as a constant at compilation time. llvm-svn: 204898	2014-03-27 07:42:58 +00:00
Lang Hames	2768d26a62	Move MCSymbolizer's constructor into header. It's trivial - there's no need for it to be out-of-line. llvm-svn: 204892	2014-03-27 02:42:52 +00:00
Lang Hames	eb37092342	Update MCSymbolizer and its subclasses' constructors to reflect the fact that they take ownership of the RelocationInfo they're constructed with. llvm-svn: 204891	2014-03-27 02:39:01 +00:00
Lang Hames	69247821a2	Remove forward declaration for Target class - Target is already defined here. No functional change. llvm-svn: 204885	2014-03-27 01:05:49 +00:00
Quentin Colombet	3914bf516b	[X86][Vectorizer Cost Model] Correct vectorization cost model for v2i64->v2f64 and v4i64->v4f64. The new costs match what we did for SSE2 and reflect the reality of our codegen. <rdar://problem/16381225> llvm-svn: 204884	2014-03-27 00:52:16 +00:00
Rafael Espindola	a041ef1bd8	Correctly propagates st_size. This also finally removes a bogus call to AliasedSymbol. llvm-svn: 204883	2014-03-27 00:28:24 +00:00
Jim Grosbach	72fbde84b8	X86: Correct vectorization cost model for v8f32->v8i8. Fix the cost model to reflect the reality of our codegen. rdar://16370633 llvm-svn: 204880	2014-03-27 00:04:11 +00:00
Nick Lewycky	77d5fb40c8	Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink! llvm-svn: 204876	2014-03-26 23:45:15 +00:00
Hal Finkel	df3e34d944	[PowerPC] Generate VSX permutations for v2[fi]64 vectors llvm-svn: 204873	2014-03-26 22:58:37 +00:00
Reid Kleckner	23798a9731	CloneFunction: Clone all attributes, including the CC Summary: Tested with a unit test because we don't appear to have any transforms that use this other than ASan, I think. Fixes PR17935. Reviewers: nicholas CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3194 llvm-svn: 204866	2014-03-26 22:26:35 +00:00
Ekaterina Romanova	b9aea9383a	This is a fix for PR# 19051. I noticed code gen differences due to code motion when running tests with and without the debug info at O2. The problem is in branch folding. A loop wanted to skip the debug info, but actually it didn't do so. llvm-svn: 204865	2014-03-26 22:15:28 +00:00
Manman Ren	14aa891976	Add comments. Addressing review comments from Evan on r204690. llvm-svn: 204864	2014-03-26 22:14:09 +00:00
Justin Bogner	95e0a70581	llvm-cov: Handle functions with no line number Functions may in an instrumented binary but not in the original source when they're inserted by the compiler or the runtime. These functions aren't meaningful to the user, so teach llvm-cov to skip over them instead of crashing. llvm-svn: 204863	2014-03-26 22:03:06 +00:00
Kevin Enderby	5611398b6b	Fix a problem with the ARM assembler incorrectly matching a vector list parameter that is using all lanes "{d0[], d2[]}" but can match and instruction with a ”{d0, d2}" parameter. I’m finishing up a fix for proper checking of the unsupported alignments on vld/vst instructions and ran into this. Thus I don’t have a test case at this time. And adding all code that will demonstrate the bug would obscure the very simple one line fix. So if you would indulge me on not having a test case at this time I’ll instead offer up a detailed explanation of what is going on in this commit message. This instruction: vld2.8 {d0[], d2[]}, [r4:64] is not legal as the alignment can only be 16 when the size is 8. Per this documentation: A8.8.325 VLD2 (single 2-element structure to all lanes) <align> The alignment. It can be one of: 16 2-byte alignment, available only if <size> is 8, encoded as a = 1. 32 4-byte alignment, available only if <size> is 16, encoded as a = 1. 64 8-byte alignment, available only if <size> is 32, encoded as a = 1. omitted Standard alignment, see Unaligned data access on page A3-108. So when code is added to the llvm integrated assembler to not match that instruction because of the alignment it then goes on to try to match other instructions and comes across this: vld2.8 {d0, d2}, [r4:64] and and matches it. This is because of the method ARMOperand::isVecListDPairSpaced() is missing the check of the Kind. In this case the Kind is k_VectorListAllLanes . While the name of the method may suggest that this is OK it really should check that the Kind is k_VectorList. As the method ARMOperand::isDoubleSpacedVectorAllLanes() is what was used to match {d0[], d2[]} and correctly checks the Kind: bool isDoubleSpacedVectorAllLanes() const { return Kind == k_VectorListAllLanes && VectorList.isDoubleSpaced; } where the original ARMOperand::isVecListDPairSpaced() does not check the Kind: bool isVecListDPairSpaced() const { if (isSingleSpacedVectorList()) return false; return (ARMMCRegisterClasses[ARM::DPairSpcRegClassID] .contains(VectorList.RegNum)); } Jim Grosbach has reviewed the change and said: Yep, that sounds right. … And by "right" I mean, "wow, that's a nasty latent bug I'm really, really glad to see fixed." :) rdar://16436683 llvm-svn: 204861	2014-03-26 21:54:11 +00:00
Arnold Schwaighofer	1a444489e9	PR15967 Fix in basicaa for faulty returning no alias. This commit consist of two parts. The first part fix the PR15967. The wrong conclusion was made when the MaxLookup limit was reached. The fix introduce a out parameter (MaxLookupReached) to DecomposeGEPExpression that the function aliasGEP can act upon. The second part is introducing the constant MaxLookupSearchDepth to make sure that DecomposeGEPExpression and GetUnderlyingObject use the same search depth. This is a small cleanup to clarify the original algorithm. Patch by Karl-Johan Karlsson! llvm-svn: 204859	2014-03-26 21:30:19 +00:00
Hal Finkel	6e28e6aaaf	[PowerPC] VSX loads and stores support unaligned access I've not yet updated PPCTTI because I'm not sure what the actual relative cost is compared to the aligned uses. llvm-svn: 204848	2014-03-26 19:39:09 +00:00
Kevin Enderby	8108f38437	Fix the ARM VST4 (single 4-element structure from one lane) size 16 double-spaced registers instruction printing. This: vld4.16 {d17[1], d19[1], d21[1], d23[1]}, [r7]! was being printed as: vld4.16 {d17[1], d18[1], d19[1], d20[1]}, [r7]! rdar://16435096 llvm-svn: 204847	2014-03-26 19:35:40 +00:00
Hal Finkel	7279f4b00d	[PowerPC] Use v2f64 <-> v2i64 VSX conversion instructions llvm-svn: 204843	2014-03-26 19:13:54 +00:00
Matt Arsenault	90b733a3cf	R600: Add a testcase for sext_in_reg I missed. This sext_inreg i32 in i64 case was already handled, but not enabled. llvm-svn: 204840	2014-03-26 18:31:06 +00:00
Hal Finkel	ea76a44584	[PowerPC] Remove some dead VSX v4f32 store patterns These patterns are dead (because v4f32 stores are currently promoted to v4i32 and stored using Altivec instructions), and also are likely not correct (because they'd store the vector elements in the opposite order from that assumed by the rest of the Altivec code). llvm-svn: 204839	2014-03-26 18:26:36 +00:00
Hal Finkel	9281c9a38b	[PowerPC] Use VSX vector load/stores for v2[fi]64 These instructions have access to the complete VSX register file. In addition, they "swap" the order of the elements so that element 0 (the scalar part) comes first in memory and element 1 follows at a higher address. llvm-svn: 204838	2014-03-26 18:26:30 +00:00
Juergen Ributzka	6ff29a7b2f	[MCJIT] Check if there have been errors during RuntimeDyld execution. llvm-svn: 204837	2014-03-26 18:19:27 +00:00
Jim Grosbach	ed2cd39b81	Fix for incorrect address sinking in the presence of potential overflows. In some cases it is possible for CGP to attempt to reuse a base address from another basic block. In those cases we have to be sure that all the address math was either done at the same bit width, or that none of it overflowed before it was extended. Patch by Louis Gerbarg <lgg@apple.com> rdar://16307442 llvm-svn: 204833	2014-03-26 17:27:01 +00:00
Hans Wennborg	d683a22dd2	Revert "X86 memcpy lowering: use "rep movs" even when esi is used as base pointer" (r204174) > For functions where esi is used as base pointer, we would previously fall ba > from lowering memcpy with "rep movs" because that clobbers esi. > > With this patch, we just store esi in another physical register, and restore > it afterwards. This adds a little bit of register preassure, but the more > efficient memcpy should be worth it. > > Differential Revision: http://llvm-reviews.chandlerc.com/D2968 This didn't work. I was ending up with code like this: lea edi,[esi+38h] mov ecx,0Fh mov edx,esi mov esi,ebx rep movs dword ptr es:[edi],dword ptr [esi] lea ecx,[esi+74h] <-- Ooops, we're now using esi before restoring it from edx. add ebx,3Ch mov esi,edx I guess if we want to do this we need stronger glue or something, or doing the expansion much later. llvm-svn: 204829	2014-03-26 16:30:54 +00:00
Hal Finkel	a6c8b51212	[PowerPC] Add v2i64 as a legal VSX type v2i64 needs to be a legal VSX type because it is the SetCC result type from v2f64 comparisons. We need to expand all non-arithmetic v2i64 operations. This fixes the lowering for v2f64 VSELECT. llvm-svn: 204828	2014-03-26 16:12:58 +00:00
Matheus Almeida	ea06727f03	[mips] Use TwoOperandAliasConstraint for ArithLogicR instructions. This enables TableGen to generate an additional two operand matcher for our ArithLogicR class of instructions (constituted by 3 register operands). E.g.: and $1, $2 <=> and $1, $1, $2 llvm-svn: 204826	2014-03-26 16:09:43 +00:00
Matheus Almeida	ab5633b70c	[mips] Add support to the '.dword' directive. The '.dword' directive accepts a list of expressions and emits them in 8-byte chunks in successive locations. llvm-svn: 204822	2014-03-26 15:44:18 +00:00
Matheus Almeida	3e2a702aa2	[mips] Rename function in MipsAsmParser. parseDirectiveWord is a generic function that parses an expression which means there's no need for it to have such an specific name. Renaming it to parseDataDirective so that it can also be used to handle .dword directives[1]. [1]To be added in a follow up commit. No functional changes. llvm-svn: 204818	2014-03-26 15:24:36 +00:00
Matheus Almeida	3b9c63d29b	[mips] Add support to '.set mips64'. The '.set mips64' directive enables the feature Mips:FeatureMips64 from assembly. Note that it doesn't modify the ELF header as opposed to the use of -mips64 from the command-line. The reason for this is that we want to be as compatible as possible with existing assemblers like GAS. llvm-svn: 204817	2014-03-26 15:14:32 +00:00
Christian Pirker	99974c7242	AArch64_BE Elf support for MC-JIT runtime dynamic linker llvm-svn: 204816	2014-03-26 14:57:32 +00:00
Matheus Almeida	a2cd009c51	[mips] Add support to '.set mips64r2'. The '.set mips64r2' directive enables the feature Mips:FeatureMips64r2 from assembly. Note that it doesn't modify the ELF header as opposed to the use of -mips64r2 from the command-line. The reason for this is that we want to be as compatible as possible with existing assemblers like GAS. llvm-svn: 204815	2014-03-26 14:52:22 +00:00
Christian Pirker	3aa0e6a1f9	AArch64_BE function argument passing for ARM ABI llvm-svn: 204814	2014-03-26 14:51:22 +00:00
Tim Northover	1ff5f29fb5	ARM: add intrinsics for the v8 ldaex/stlex We've already got versions without the barriers, so this just adds IR-level support for generating the new v8 ones. rdar://problem/16227836 llvm-svn: 204813	2014-03-26 14:39:31 +00:00
Matheus Almeida	fe1e39dcba	[mips] Hoist common functionality into a new function. Given that we support multiple directives that enable a particular feature (e.g. '.set mips16'), it's best to hoist that code into a new function so that we don't repeat the same pattern w.r.t parsing and handling error cases. No functional changes. llvm-svn: 204811	2014-03-26 14:26:27 +00:00
Renato Golin	93010e687f	Change @llvm.clear_cache default to call rt-lib After some discussion on IRC, emitting a call to the library function seems like a better default, since it will move from a compiler internal error to a linker error, that the user can work around until LLVM is fixed. I'm also adding a note on the responsibility of the user to confirm that the cache was cleared on platforms where nothing is done. llvm-svn: 204806	2014-03-26 14:01:32 +00:00
Daniel Sanders	6dd7251599	[mips] The decision to use MO_GOT_PAGE and MO_GOT_OFST depends on the ABI being N32 or N64 not the arch being MIPS64 Summary: No functional change (in supported use cases) Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3177 llvm-svn: 204805	2014-03-26 13:59:42 +00:00
Cameron McInally	4532596b8f	Fix AVX512 Gather and Scatter execution domains. llvm-svn: 204804	2014-03-26 13:50:50 +00:00
Matheus Almeida	f79b281421	[mips] Add support for '.option pic2'. The directive '.option pic2' enables PIC from assembly source. At the moment none of the macros/directives check the PIC bit but that's going to be fixed relatively soon. For example, the expansion of macros like 'la' depend on the relocation model. llvm-svn: 204803	2014-03-26 13:40:29 +00:00
Renato Golin	c0a3c1d66b	Add @llvm.clear_cache builtin Implementing the LLVM part of the call to __builtin___clear_cache which translates into an intrinsic @llvm.clear_cache and is lowered by each target, either to a call to __clear_cache or nothing at all incase the caches are unified. Updating LangRef and adding some tests for the implemented architectures. Other archs will have to implement the method in case this builtin has to be compiled for it, since the default behaviour is to bail unimplemented. A Clang patch is required for the builtin to be lowered into the llvm intrinsic. This will be done next. llvm-svn: 204802	2014-03-26 12:52:28 +00:00
Hal Finkel	732f0f73a7	[PowerPC] Lower VSELECT using xxsel when VSX is available With VSX there is a real vector select instruction, and so we should use it. Note that VSELECT will still scalarize for v2f64 because the corresponding SetCC result type (v2i64) is not currently a legal type. llvm-svn: 204801	2014-03-26 12:49:28 +00:00
Daniel Sanders	a4b0c74765	[mips] The register names depend on the ABI being N32/N64 rather than the arch being mips64 Summary: Added test cases for O32 and N32 on MIPS64. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3175 llvm-svn: 204796	2014-03-26 11:39:07 +00:00
Timur Iskhodzhanov	b5b7a61646	Follow-up to r204790: don't try to emit line tables if there are no functions with DI in the TU llvm-svn: 204795	2014-03-26 11:24:36 +00:00
Daniel Sanders	85f482b02f	[mips] $s8 is an alias for $fp in all ABI's, not just N32/N64. llvm-svn: 204793	2014-03-26 11:05:24 +00:00
Timur Iskhodzhanov	8499a12259	Fix PR19239 - Add support for generating debug info for functions without lexical scopes and/or debug info at all llvm-svn: 204790	2014-03-26 09:50:36 +00:00
Rafael Espindola	65481d7b97	Revert "Prevent alias from pointing to weak aliases." This reverts commit r204781. I will follow up to with msan folks to see what is what they were trying to do with aliases to weak aliases. llvm-svn: 204784	2014-03-26 06:14:40 +00:00
Hal Finkel	bd4de9d478	[PowerPC] Generate logical vector VSX instructions These instructions are essentially the same as their Altivec counterparts, but have access to the larger VSX register file. llvm-svn: 204782	2014-03-26 04:55:40 +00:00
Rafael Espindola	3b712a84a9	Prevent alias from pointing to weak aliases. Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204781	2014-03-26 04:48:47 +00:00
David Blaikie	62dd7df612	DebugInfo: Add fission-related sections to COFF Allows this test to pass on COFF platforms so we don't need to restrict this test to a single target anymore. llvm-svn: 204780	2014-03-26 03:05:10 +00:00
Rafael Espindola	85a8491a93	Correctly detect if a symbol uses a reserved section index or not. The logic was incorrect for variables, causing them to end up in the wrong section if the section had an index >= 0xff00. llvm-svn: 204771	2014-03-26 00:16:43 +00:00
Quentin Colombet	6f12ae0d5c	[X86] Add broadcast instructions to the table used by ExeDepsFix pass. Adds the different broadcast instructions to the ReplaceableInstrsAVX2 table. That way the ExeDepsFix pass can take better decisions when AVX2 broadcasts are across domain (int <-> float). In particular, prior to this patch we were generating: vpbroadcastd LCPI1_0(%rip), %ymm2 vpand %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 ## <- domain change penalty Now, we generate the following nice sequence where everything is in the float domain: vbroadcastss LCPI1_0(%rip), %ymm2 vandps %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 <rdar://problem/16354675> llvm-svn: 204770	2014-03-26 00:10:22 +00:00
Rafael Espindola	10be0837ac	Create .symtab_shndxr only when needed. We need .symtab_shndxr if and only if a symbol references a section with an index >= 0xff00. The old code was trying to figure out if the section was needed ahead of time, making it a fairly dependent on the code actually writing the table. It was also somewhat conservative and would create the section in cases where it was not needed. If I remember correctly, the old structure was there so that the sections were created in the same order gas creates them. That was valuable when MC's support for ELF was new and we tested with elf-dump.py. This patch refactors the symbol table creation to another class and makes it obvious that .symtab_shndxr is really only created when we are about to output a reference to a section index >= 0xff00. While here, also improve the tests to use macros. One file is one section short of needing .symtab_shndxr, the second one has just the right number. llvm-svn: 204769	2014-03-25 23:44:25 +00:00
Hal Finkel	174e590966	[PowerPC] Select between VSX A-type and M-type FMA instructions just before RA The VSX instruction set has two types of FMA instructions: A-type (where the addend is taken from the output register) and M-type (where one of the product operands is taken from the output register). This adds a small pass that runs just after MI scheduling (and, thus, just before register allocation) that mutates A-type instructions (that are created during isel) into M-type instructions when: 1. This will eliminate an otherwise-necessary copy of the addend 2. One of the product operands is killed by the instruction The "right" moment to make this decision is in between scheduling and register allocation, because only there do we know whether or not one of the product operands is killed by any particular instruction. Unfortunately, this also makes the implementation somewhat complicated, because the MIs are not in SSA form and we need to preserve the LiveIntervals analysis. As a simple example, if we have: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 ... %vreg9<def,tied1> = XSMADDADP %vreg9<tied0>, %vreg17, %vreg19, %RM<imp-use>; VSLRC:%vreg9,%vreg17,%vreg19 ... We can eliminate the copy by changing from the A-type to the M-type instruction. This means: %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 is replaced by: %vreg16<def,tied1> = XSMADDMDP %vreg16<tied0>, %vreg18, %vreg9, %RM<imp-use>; VSLRC:%vreg16,%vreg18,%vreg9 and we remove: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 llvm-svn: 204768	2014-03-25 23:29:21 +00:00
Rafael Espindola	0ce0971afa	Use Endian.h to simplify this code a bit. While at it, factor some logic into FragmentWriter. This will allow more code to be factored out of the fairly large ELFObjectWriter. llvm-svn: 204765	2014-03-25 22:43:53 +00:00
Juergen Ributzka	7be410f5d5	[Constant Hoisting] Make the constant candidate map local to the collectConstantCandidates method. llvm-svn: 204758	2014-03-25 21:21:10 +00:00
Hal Finkel	6c32ff31d0	[PowerPC] Correct commutable indices for VSX FMA instructions Although the first two operands are the ones that can be swapped, the tied input operand is listed before them, so we need to adjust for that. I have a test case for this, but it goes along with an upcoming commit (so it will come soon). llvm-svn: 204748	2014-03-25 19:26:43 +00:00
Hal Finkel	25e0454f10	[PowerPC] Add a TableGen relation for A-type and M-type VSX FMA instructions TableGen will create a lookup table for the A-type FMA instructions providing their corresponding M-form opcodes. This will be used by upcoming commits. llvm-svn: 204746	2014-03-25 18:55:11 +00:00
Matt Arsenault	0c274feedf	R600: Move computeMaskedBitsForTargetNode out of AMDILISelLowering.cpp Remove handling of select_cc, since it makes no sense to be there. This now does nothing, but I'll be adding some handling of other target nodes soon. llvm-svn: 204743	2014-03-25 18:18:27 +00:00
Duncan P. N. Exon Smith	3dbe10503a	blockfreq: Implement Pass::releaseMemory() Implement Pass::releaseMemory() in BlockFrequencyInfo and MachineBlockFrequencyInfo. Just delete the private implementation when not in use. Switch to a std::unique_ptr to make the logic more clear. <rdar://problem/14292693> llvm-svn: 204741	2014-03-25 18:01:38 +00:00
Duncan P. N. Exon Smith	936aef9238	blockfreq: Use const in MachineBlockFrequencyInfo <rdar://problem/14292693> llvm-svn: 204740	2014-03-25 18:01:32 +00:00
Juergen Ributzka	631c4914b2	[X86TTI] Make constant base pointers for getElementPtr opaque. If getElementPtr uses a constant as base pointer, then make the constant opaque. This prevents constant folding it with the offset. The offset can usually be encoded in the load/store instruction itself and the base address doesn't have to be rematerialized several times. llvm-svn: 204739	2014-03-25 18:01:25 +00:00
Juergen Ributzka	5eef98cf7a	[Stackmaps][X86TTI] Fix think-o in getIntImmCost calculation. The cost for the first four stackmap operands was always TCC_Free. This is only true for the first two operands. All other operands are TCC_Free if they are within 64bit. llvm-svn: 204738	2014-03-25 18:01:23 +00:00
Juergen Ributzka	e2e16844f5	[DAG] Keep the opaque constant flag when performing unary constant folding operations. Usually opaque constants shouldn't be folded, unless they are simple unary operations that don't create new constants. Although this shouldn't drop the opaque constant flag. This commit fixes this. Related to <rdar://problem/14774662> llvm-svn: 204737	2014-03-25 18:01:20 +00:00
Adam Nemet	4beef4c90d	[X86] Generate VPSHUFB for in-place v16i16 shuffles This used to resort to splitting the 256-bit operation into two 128-bit shuffles and then recombining the results. Fixes <rdar://problem/16167303> llvm-svn: 204735	2014-03-25 17:47:06 +00:00
Adam Nemet	ac6d6383a3	[X86] Factor out new helper getPSHUFB I found three implementations of this. This splits it out into a new function and uses it from the three places. My plan is to add a fourth use when lowering a vector_shuffle:v16i16. Compared the assembly output of test/CodeGen/X86 before and after. The only change is due to how the first PSHUFB was generated in LowerVECTOR_SHUFFLEv8i16. If the shuffle mask specified undef (i.e. -1), the old implementation would write -1 * 2 and -1 * 2 + 1 (254 and 255) in the control mask. Now we write 0x80. These are of course interchangeable since bit 7 decides if a constant zero is written in the result byte. The other instances of this code use 0x80 consistently. Related to <rdar://problem/16167303> llvm-svn: 204734	2014-03-25 17:47:03 +00:00
Richard Osborne	0af4aa9a19	[InstCombine] Don't fold bitcast into store if it would need addrspacecast Summary: Previously the code didn't check if the before and after types for the store were pointers to different address spaces. This resulted in instcombine using a bitcast to convert between pointers to different address spaces, causing an assertion due to the invalid cast. It is not be appropriate to use addrspacecast this case because it is not guaranteed to be a no-op cast. Instead bail out and do not do the transformation. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3117 llvm-svn: 204733	2014-03-25 17:21:41 +00:00
Richard Osborne	9805ec457d	Reuse earlier variables to make it clear the types involved in the cast. No functionality change. llvm-svn: 204732	2014-03-25 17:21:35 +00:00
Benjamin Kramer	e75eaca32f	ScalarEvolution: Compute exit counts for loops with a power-of-2 step. If we have a loop of the form for (unsigned n = 0; n != (k & -32); n += 32) {} then we know that n is always divisible by 32 and the loop must terminate. Even if we have a condition where the loop counter will overflow it'll always hold this invariant. PR19183. Our loop vectorizer creates this pattern and it's also occasionally formed by loop counters derived from pointers. llvm-svn: 204728	2014-03-25 16:25:12 +00:00
Matt Arsenault	b22426c510	Fix creating illegal setcc cond codes. If GT/UGT or LT/ULT were set to expand, a comparison with a constant would replace it with the illegal cond code. There are several more places later in this function that will have the same basic problem. Theoretically R600 should hit this problem for a test, but for some reason it doesn't. llvm-svn: 204727	2014-03-25 16:09:21 +00:00
Evgeniy Stepanov	fc742acc8c	[msan] More precise instrumentation of select IR. Some bits of select result may be initialized even if select condition is not. https://code.google.com/p/memory-sanitizer/issues/detail?id=50 llvm-svn: 204716	2014-03-25 13:08:34 +00:00
Daniel Sanders	71a89d92f6	[mips] '.set at=$0' should be equivalent to '.set noat' Differential Revision: http://llvm-reviews.chandlerc.com/D3171 llvm-svn: 204714	2014-03-25 13:01:06 +00:00
Cameron McInally	45dc489403	Fix AVX2 Gather execution domains. llvm-svn: 204713	2014-03-25 12:36:38 +00:00
Daniel Sanders	b1d7e53a26	[mips] Correct testcase for .set at=$reg and emit the new warnings for numeric registers too. Summary: Remove the XFAIL added in my previous commit and correct the test such that it correctly tests the expansion of the assembler temporary. Also added a test to check that $at is always $1 when written by the user. Corrected the new assembler temporary warnings so that they are emitted for numeric registers too. Differential Revision: http://llvm-reviews.chandlerc.com/D3169 llvm-svn: 204711	2014-03-25 11:16:03 +00:00
Daniel Sanders	e231ae9e3a	[mips] Fix assembler temporary expansion and add associated warnings about the use of $at. Summary: The assembler temporary is normally $at ($1) but can be reassigned using '.set at=$reg'. Regardless of which register is nominated as the assembler temporary, $at remains $1 when written by the user. Adds warnings under the following conditions: * The register nominated as the assembler temporary is used by the user. * '.set noat' is in effect and $at is used by the user. Both of these only work for named registers. I have a follow up commit that makes it work for numeric registers as well. XFAIL set-at-directive.s since it incorrectly tests that $at is redefined by '.set at=$reg'. Testcases will follow in a separate commit. Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3167 llvm-svn: 204710	2014-03-25 10:57:07 +00:00
Erik Verbruggen	e706b88304	Simplify loop that worked around bugs in old GCC/Xcode. GCC 4.0.1 and Xcode 2 are no longer supported for building llvm/clang. llvm-svn: 204705	2014-03-25 09:06:18 +00:00
Yaron Keren	24fdbe5676	Disable Visual C++ warning 4722 about aborting a destructor, it has no value for us. llvm-svn: 204704	2014-03-25 08:42:49 +00:00
David Majnemer	273bff4713	WinCOFF: Add support for -fdata-sections This is a pretty straight forward translation for COFF, we just need to stick the data in a COMDAT section marked as IMAGE_COMDAT_SELECT_NODUPLICATES. N.B. We must be careful to avoid sticking entities with private linkage in COMDAT groups. COFF is pretty hostile to the renaming of entities so we must be careful to disallow GlobalVariables with unstable names. llvm-svn: 204703	2014-03-25 06:14:26 +00:00
David Blaikie	3ffe4dd67f	DebugInfo: Add GNU_addr_base and GNU_ranges_base only when there are addresses or ranges Based on code review feedback from Eric in r204672. llvm-svn: 204702	2014-03-25 05:34:24 +00:00
Andrew Trick	c8ac7ea261	SLP vectorizer: Don't hoist vector extracts of phis. Extracts coming from phis were being hoisted, while all others were sunk to their uses. This was inconsistent and didn't seem to serve a purpose. Changing all extracts to be sunk to uses is a prerequisite for adding block frequency to the SLP vectorizer's cost model. I benchmarked the change in isolation (without block frequency). I only saw noise on x86 and some potentially significant improvements on ARM. No major regressions is good enough for me. llvm-svn: 204699	2014-03-25 02:18:47 +00:00
David Blaikie	9c550ac4e7	DebugInfo: Support debug_loc under fission Implement debug_loc.dwo, as well as llvm-dwarfdump support for dumping this section. Outlined in the DWARF5 spec and http://gcc.gnu.org/wiki/DebugFission the debug_loc.dwo section has more variation than the standard debug_loc, allowing 3 different forms of entry (plus the end of list entry). GCC seems to, and Clang certainly, only use one form, so I've just implemented dumping support for that for now. It wasn't immediately obvious that there was a good refactoring to share the implementation of dumping support between debug_loc and debug_loc.dwo, so they're separate for now - ideas welcome or I may come back to it at some point. As per a comment in the code, we could choose different forms that may reduce the number of debug_addr entries we emit, but that will require further study. llvm-svn: 204697	2014-03-25 01:44:02 +00:00
David Blaikie	2d33d6a4c2	DebugInfo: Remove unnecessary zero-size check This seems excessive - switching section isn't expensive (or if it is we're already being wasteful, since we emitted the debug_loc section symbol earlier anyway) and otherwise there's no work that happens in this function when the list is empty. llvm-svn: 204696	2014-03-25 01:43:56 +00:00
Manman Ren	78cf02a07b	Register Allocator: check other options before using a CSR for the first time. When register allocator's stage is RS_Spill, we choose spill over using the CSR for the first time, if the spill cost is lower than CSRCost. When register allocator's stage is < RS_Split, we choose pre-splitting over using the CSR for the first time, if the cost of splitting is lower than CSRCost. CSRCost is set with command-line option "regalloc-csr-first-time-cost". The default value is 0 to generate the same codes as before this commit. With a value of 15 (1 << 14 is the entry frequency), I measured performance gain of 3% on 253.perlbmk and 1.7% on 197.parser, with instrumented PGO, on an arm device. rdar://16162005 llvm-svn: 204690	2014-03-25 00:16:25 +00:00
Kevin Enderby	89299400ac	Fix crashes when assembler directives are used that are not for Mach-O object files by generating an error instead. rdar://16335232 llvm-svn: 204687	2014-03-25 00:05:50 +00:00
Manman Ren	9db66b3d34	Register Allocator: refactoring (no functionality change). Factor out two functions calculateRegionSplitCost and doRegionSplit from tryRegionSplit. These two functions will be used in coming patches. rdar://16162005 llvm-svn: 204684	2014-03-24 23:23:42 +00:00
David Blaikie	84d8e18f2b	DebugInfo: Simplify debug loc list handling by keeping separate lists Rather than using a flat list with "empty" entries (ala the actual on-disk format), keep separate lists for each variable. llvm-svn: 204680	2014-03-24 22:38:38 +00:00
David Blaikie	34ec5d07e1	DwarfDebug: Simplify debug_loc merging No functional change intended. Merging up-front rather than delaying this task until later. This just seems simpler and more efficient (avoiding growing the debug loc list only to have to skip over those post-merged entries, etc). llvm-svn: 204679	2014-03-24 22:27:06 +00:00
Adrian Prantl	c95ec91e2a	Get rid of an unnecessary use of the * and & operators. llvm-svn: 204673	2014-03-24 21:33:01 +00:00
David Blaikie	96dea0581e	DebugInfo: Add DW_AT_GNU_ranges_base to skeleton CUs This is used to avoid relocations in the dwo file by allowing DW_AT_ranges specified in debug_info.dwo to be relative to this base address. (r204667 implements the base-relative DW_AT_ranges side of this) llvm-svn: 204672	2014-03-24 21:31:35 +00:00
David Blaikie	26b2bd04fd	DebugInfo: Implement relative addressing for DW_AT_ranges under fission This removes the debug_ranges relocations from debug_info.dwo (but doesn't implement the DW_AT_GNU_ranges_base which is also necessary for correct functioning) llvm-svn: 204668	2014-03-24 21:07:27 +00:00
David Blaikie	3c9a3cc495	DebugInfo: Don't emit relocations to abbreviations in debug_info.dwo llvm-svn: 204667	2014-03-24 20:53:02 +00:00
David Blaikie	f72ed5f9ed	DwarfDebug: Remove an unused parameter llvm-svn: 204665	2014-03-24 20:31:01 +00:00
Matt Arsenault	db8b1d5b6c	R600: Don't viewCFG() under DEBUG() except on failure. Having these popping up every time you use -debug is really irritating. llvm-svn: 204664	2014-03-24 20:29:02 +00:00
David Blaikie	d82b237785	Remove unused parameter llvm-svn: 204663	2014-03-24 20:28:10 +00:00
Matt Arsenault	684dc80b6d	R600/SI: Fix extra mov from legalizing 64-bit SALU ops. Check the register class of each operand individually to avoid an extra copy to a vgpr. llvm-svn: 204662	2014-03-24 20:08:13 +00:00
Matt Arsenault	248b7b6ba1	R600/SI: Sub-optimial fix for 64-bit immediates with SALU ops. No longer asserts, but now you get moves loading legal immediates into the split 32-bit operations. llvm-svn: 204661	2014-03-24 20:08:09 +00:00
Matt Arsenault	f35182c783	R600/SI: Fix 64-bit bit ops that require the VALU. Try to match scalar and first like the other instructions. Expand 64-bit ands to a pair of 32-bit ands since that is not available on the VALU. llvm-svn: 204660	2014-03-24 20:08:05 +00:00

1 2 3 4 5 ...

68075 Commits