llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Sanders	da43f0e76f	[mips] Fix a partially initialized member variable that was introduced in r268896. llvm-svn: 268938	2016-05-09 17:42:04 +00:00
Simon Pilgrim	0a81921cdb	Fixed unused but set variable warning llvm-svn: 268931	2016-05-09 16:42:23 +00:00
Matt Arsenault	a949dc619c	AMDGPU: Fold shift into cvt_f32_ubyteN llvm-svn: 268930	2016-05-09 16:29:50 +00:00
Daniel Sanders	108823bc35	[mips] Try to fix 'truncation from FindBestPredicateResult to bool' reported by MSVC llvm-svn: 268928	2016-05-09 15:50:15 +00:00
Daniel Sanders	cc9a2cf7ee	[mips][ias] Attempt to fix 'not all control paths return a value' reported by MSVC. llvm-svn: 268927	2016-05-09 15:37:52 +00:00
Daniel Sanders	e473dc937f	[mips][micromips] Make getPointerRegClass() result depend on the instruction. Summary: Previously, it returned the GPR16MMRegClass for all instructions which was incorrect for instructions like lwsp/lwgp and unnecessarily restricted the permitted registers for instructions like lw32. This fixes quite a few of the -verify-machineinstrs errors reported in PR27458. I've only added -verify-machineinstrs to one test in this change since I understand there is a plan to enable the verifier by default. Reviewers: hvarga, zbuljan, zoran.jovanovic, sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19873 llvm-svn: 268918	2016-05-09 13:38:25 +00:00
Simon Pilgrim	af742d51ad	[X86][SSE] Added TODO comment to add support for AVX512 mask registers to shuffle comments This came up in discussion on D19198 llvm-svn: 268915	2016-05-09 13:30:16 +00:00
Daniel Sanders	d044e49b37	[mips] Fix use after free and an unnecessary copy introduced in r268896. llvm-svn: 268913	2016-05-09 13:10:57 +00:00
Strahinja Petrovic	e682b80b8b	[PowerPC] fix register alignment for long double type This patch fixes register alignment for long double type in soft float mode. Before this patch alignment was 8 and this patch changes it to 4. Differential Revision: http://reviews.llvm.org/D18034 llvm-svn: 268909	2016-05-09 12:27:39 +00:00
Chris Dewhurst	e3b8645a1c	[Sparc][LEON] Add UMAC and SMAC instruction support for Sparc LEON subtargets This change adds SMAC (signed multiply-accumulate) and UMAC (unsigned multiply-accumulate) for LEON subtargets of the Sparc processor. The new files LeonFeatures.td and leon-instructions.ll will both be expanded in future, so I want to leave them separate as small files for this review, to be expanded in future check-ins. Note: The functions are provided only for inline-assembly provision. No DAG selection is provided. Differential Revision: http://reviews.llvm.org/D19911 llvm-svn: 268908	2016-05-09 11:55:15 +00:00
Silviu Baranga	f60be28ed8	[AArch64] Implement lowering of the X constraint on AArch64 Summary: This implements the lowering of the X constraint on AArch64. The default behaviour of the X constraint lowering is to restrict it to "f". This is a problem because the "f" constraint is not implemented on AArch64 and would be too restrictive anyway. Therefore, the AArch64 hook will lower this to "w" (if the operand is a floating point or vector) or "r" otherwise. The implementation is similar with the one added for ARM (r267411). This is the AArch64 side of the fix for http://llvm.org/PR26493 Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits, t.p.northover Differential Revision: http://reviews.llvm.org/D19967 llvm-svn: 268907	2016-05-09 11:10:44 +00:00
Benjamin Kramer	2b68d15d6f	Revert "[Mips] Fix use after free." Fixes use after free but breaks tests. This reverts commit r268901. llvm-svn: 268902	2016-05-09 10:31:17 +00:00
Benjamin Kramer	5e2e8ddb2e	[Mips] Fix use after free. llvm-svn: 268901	2016-05-09 10:21:56 +00:00
Daniel Sanders	3d00056515	[mips][ias] R_MIPS_(GOT|HI|LO|PC)16 and R_MIPS_GPREL32 do not need symbols. Summary: In theory, care must be taken to ensure that pairs of R_MIPS_(GOT|HI|LO)16 make the same decision on both relocs in the reloc pair but in practice this isn't as hard as it sounds and only limits the complexity of the predicate used. We handle all three with the same code to ensure their decisions always agree with each other. Reviewers: sdardis Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19016 llvm-svn: 268900	2016-05-09 10:21:14 +00:00
Zlatko Buljan	ba553a6e0a	[mips][microMIPS] Implement LWP and SWP instructions Differential Revision: http://reviews.llvm.org/D10640 llvm-svn: 268896	2016-05-09 08:07:28 +00:00
Craig Topper	a5d0bf5c36	[X86] Strengthen some type constraints for floating point round and extend. llvm-svn: 268892	2016-05-09 05:34:14 +00:00
Craig Topper	a58abd1cc6	[AVX512] Fix up types for arguments of int_x86_avx512_mask_cvtsd2ss_round and int_x86_avx512_mask_cvtss2sd_round. Only the argument being converted should be a different type. The other 2 arguments should have the same type as the result. llvm-svn: 268891	2016-05-09 05:34:12 +00:00
Craig Topper	707c89c00d	[AVX512] Add non-temporal store patterns for v16i32/v32i16/v64i8. llvm-svn: 268889	2016-05-08 23:43:17 +00:00
Craig Topper	c41320d700	[AVX512] Add missing patterns for non-temporal stores of 128/256-bit vXi8/vXi16/vXi32 when VLX is enabled. The equivalent AVX1/2 patterns are disabled by VLX. This caused regular stores to be emitted instead. llvm-svn: 268886	2016-05-08 23:08:45 +00:00
Craig Topper	906f397137	[AVX512] Change predicates on some vXi16/vXi8 AVX store patterns so they stay enabled unless VLX and BWI instructions are supported. Without this we could fail instruction selection if VLX was enabled, but BWI wasn't. llvm-svn: 268885	2016-05-08 23:08:40 +00:00
Craig Topper	e5ce84a33c	[AVX512] Add VLX 128/256-bit SET0 operations that encode to 128/256-bit EVEX encoded VPXORD so all 32 registers can be used. llvm-svn: 268884	2016-05-08 21:33:53 +00:00
Craig Topper	9d9251b86f	[X86] Remove extra patterns that check for BUILD_VECTOR of all 0s. These are always canonicalized to v4i32/v8i32/v16i32 except for in SSE1 only when only v4f32 is supported. llvm-svn: 268880	2016-05-08 20:10:20 +00:00
David Majnemer	eac58d8f68	[X86] Promote several single precision FP libcalls on Windows A number of libcalls don't exist in any particular lib but are, instead, defined in math.h as inline functions (even in C mode!). Don't rely on their existence when lowering @llvm.{cos,sin,floor,..}.f32, promote them instead. N.B. We had logic to handle FREM but were missing out on a number of others. This change generalizes the FREM handling. llvm-svn: 268875	2016-05-08 08:15:50 +00:00
Craig Topper	d681e23336	[X86] Lower 256-bit vector all-zero constants to v8i32 even with AVX1 only. Either way a 256-bit VXORPS will be used. llvm-svn: 268873	2016-05-08 07:10:54 +00:00
Craig Topper	3d6722910c	[X86] Add patterns for 256-bit non-temporal stores when only AVX1 is supported. While there, add a predicate to the SSE2 patterns to avoid an ordering dependency. llvm-svn: 268872	2016-05-08 07:10:50 +00:00
Craig Topper	d788498411	[X86] No need to avoid selecting AVX_SET0 for 256-bit integer types when only AVX1 is supported. AVX_SET0 just expands to 256-bit VXORPS which is legal in AVX1. llvm-svn: 268871	2016-05-08 07:10:47 +00:00
Weiming Zhao	5b5501e817	[ARM] Fix Scavenger assert due to underestimated stack size (re-apply r268810 as it exposed an uninitialized variable in ARM MFI. Patch 268868 should fix that.) Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: vitalybuka, aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268869	2016-05-08 05:11:54 +00:00
Weiming Zhao	453b79013e	Fix use-of-uninitialized-value of ARMMachineFunctionInfo Summary: Explicitly initialize ArgumentStackSize to prevent the msan failure. Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D20051 llvm-svn: 268868	2016-05-08 05:04:47 +00:00
Craig Topper	6502975cf5	[X86] Fix InstAliases to not allow FARCALL32i/FARCALL16i/FARJMP32i/FARJMP16i in 64-bit mode. llvm-svn: 268863	2016-05-07 19:25:56 +00:00
Simon Pilgrim	96e5307d4e	[X86] Pulled out duplicate mask width calculation. NFCI. llvm-svn: 268861	2016-05-07 18:04:24 +00:00
Sanjay Patel	c2751e7050	[x86, BMI] add TLI hook for 'andn' and use it to simplify comparisons For the sake of minimalism, this patch is x86 only, but I think that at least PPC, ARM, AArch64, and Sparc probably want to do this too. We might want to generalize the hook and pattern recognition for a target like PPC that has a full assortment of negated logic ops (orc, nand). Note that http://reviews.llvm.org/D18842 will cause this transform to trigger more often. For reference, this relates to: https://llvm.org/bugs/show_bug.cgi?id=27105 https://llvm.org/bugs/show_bug.cgi?id=27202 https://llvm.org/bugs/show_bug.cgi?id=27203 https://llvm.org/bugs/show_bug.cgi?id=27328 Differential Revision: http://reviews.llvm.org/D19087 llvm-svn: 268858	2016-05-07 15:03:40 +00:00
NAKAMURA Takumi	77edc2ef9f	MipsELFObjectWriter.cpp: Activate debug printer just for +Asserts. [-Wunused-function] llvm-svn: 268848	2016-05-07 04:51:51 +00:00
Vitaly Buka	e81d96be6f	Revert r268810 because it breaks msan bot. 16802==WARNING: MemorySanitizer: use-of-uninitialized-value lib/Target/ARM/ARMFrameLowering.cpp:1632 llvm-svn: 268833	2016-05-07 01:54:00 +00:00
Ahmed Bougacha	04a8fc2e37	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. This re-applies r268760, reverted in r268794. Fixes http://llvm.org/PR27670 The original imp-defs assertion was way overzealous: forward all implicit operands, except imp-defs of the new super-reg def (r268787 for GR64, but also possible for GR16->GR32), or imp-uses of the new super-reg use. While there, mark the source use as Undef, and add an imp-use of the old source reg: that should cover any case of dead super-regs. At the stage the pass runs, flags are unlikely to matter anyway; still, let's be as correct as possible. Also add MIR tests for the various interesting cases. Original commit message: Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268831	2016-05-07 01:11:17 +00:00
Ahmed Bougacha	068ac4af39	[X86] Register and initialize the FixupBW pass. That lets us use it in MIR tests. llvm-svn: 268830	2016-05-07 01:11:10 +00:00
Weiming Zhao	74f12d31c1	[ARM] Fix Scavenger assert due to underestimated stack size (this is resubmit of r268529 with minor refactoring. r268529 was reverted at r268536 due a memory sanitizer failure. I have not been able to reproduce that failure and I checked all the variable used in my change but I could not spot an issue. I did some refactoring and see if it will give a clearer hint) Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: vitalybuka, aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268810	2016-05-06 22:20:13 +00:00
Quentin Colombet	a09f050dc1	Revert "[X86] Add a new LOW32_ADDR_ACCESS_RBP register class." This reverts commit r268796. I believe it breaks test/CodeGen/X86/asm-mismatched-types.ll with: Cannot emit physreg copy instruction llvm-svn: 268799	2016-05-06 21:21:50 +00:00
Quentin Colombet	2728074e3c	[X86] Add a new LOW32_ADDR_ACCESS_RBP register class. ABIs like NaCl use 32-bit addresses but have a 64-bit frame. The new register class reflects those constraints when choosing a register class for an address access. llvm-svn: 268796	2016-05-06 21:10:53 +00:00
Quentin Colombet	377fc2aa3d	[X86] Rename the X32_ADDR_ACCESS register class into LOW32_ADDR_ACCESS. This register class may be used by any ABIs that uses x86_64 ISA while using 32-bit addresses, not just in X32 cases. Make sure the name reflects that. llvm-svn: 268795	2016-05-06 21:10:43 +00:00
Nico Weber	9b32b4fbee	Revert r268760, it caused PR27670. llvm-svn: 268794	2016-05-06 21:07:02 +00:00
Ahmed Bougacha	505984b466	[X86] Accept imp-defs of GR64 super-registers in FixupBW MOVrr. Testcase will follow shortly. llvm-svn: 268787	2016-05-06 20:03:03 +00:00
Artem Tamazov	f0b6b40fa4	[AMDGPU][llvm-mc] Some refactoring of .td files Some custom Operands and AsmOperandClasses moved to proper place. No functional changes. Differential Revision: http://reviews.llvm.org/D20012 llvm-svn: 268780	2016-05-06 19:32:38 +00:00
Krzysztof Parzyszek	adb7ff0283	[Hexagon] Be careful about anti-dependencies with a call in packetizer In a case like J2_callr <ga:@foo>, %R0<imp-use>, ... R0<def> = ... the anti-dependency on R0 cannot be ignored and the two instructions cannot be packetized together, since if they were, the assignment to R0 would take place before the call. llvm-svn: 268776	2016-05-06 19:13:38 +00:00
Quentin Colombet	a065ac45ee	[X86] Get rid of X32_NOREX_ADDR_ACCESS register class. According to H.J. Lu <hjl.tools@gmail.com>, this register class is never used. llvm-svn: 268771	2016-05-06 18:22:48 +00:00
Artem Tamazov	ebe71ce36a	[AMDGPU][llvm-mc] Add support for sendmsg(...) syntax. Added support for sendmsg(MSG[, OP[, STREAM_ID]]) syntax in s_sendmsg and s_sendmsghalt instructions. The syntax matches the SP3 assembler/disassembler rules. That is why implicit inputs (like M0 and EXEC) are not printed to disassembly output anymore. sendmsg(...) allows only known message types and attributes, even if literals are used instead of symbolic names. However, raw literal (without "sendmsg") still can be used, and that allows for any 16-bit value. Tests updated/added. Differential Revision: http://reviews.llvm.org/D19596 llvm-svn: 268762	2016-05-06 17:48:48 +00:00
Ahmed Bougacha	258426ca7a	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268760	2016-05-06 17:42:57 +00:00
Ahmed Bougacha	04200a7c86	[X86] Remove \brief in FixupBW. NFC. llvm-svn: 268754	2016-05-06 17:28:47 +00:00
Ahmed Bougacha	cfd9e55e90	[X86] Simplify FixupBW sub_8bit_hi-related logic. NFC. Instead of passing around sizes and asking for subregs, we can check the subreg indices we care about: sub_8bit_hi and sub_8bit. Differential Revision: http://reviews.llvm.org/D20006 llvm-svn: 268753	2016-05-06 17:28:42 +00:00
Geoff Berry	a5335647d5	[AArch64] Combine callee-save and local stack SP adjustment instructions. Summary: If a function needs to allocate both callee-save stack memory and local stack memory, we currently decrement/increment the SP in two steps: first for the callee-save area, and then for the local stack area. This changes the code to allocate them both at once at the very beginning/end of the function. This has two benefits: 1) there is one fewer sub/add micro-op in the prologue/epilogue 2) the stack adjustment instructions act as a scheduling barrier, so moving them to the very beginning/end of the function increases post-RA scheduler's ability to move instructions (that only depend on argument registers) before any of the callee-save stores This change can cause an increase in instructions if the original local stack SP decrement could be folded into the first store to the stack. This occurs when the first local stack store is to stack offset 0. In this case we are trading off one more sub instruction for one fewer sub micro-op (along with benefits (2) and (3) above). Reviewers: t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18619 llvm-svn: 268746	2016-05-06 16:34:59 +00:00
Jun Bum Lim	33be4997ed	[AArch64] Decouple zero store promotion from narrow ld merge. NFC. Summary: This change refactors to decouple the zero store promotion from the narrow ld merge and add a flag (enable-narrow-ld-merge=true) to control the narrow ld merge optimization. Reviewers: jmolloy, t.p.northover, mcrosier Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19885 llvm-svn: 268744	2016-05-06 15:08:57 +00:00
Nikolay Haustov	6eb050ea4e	Revert "AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2." This reverts commit 47486d52454d60cdf6becc0b2efe533c73794380. It broke calling OpenCL kernel from another kernel. llvm-svn: 268739	2016-05-06 14:59:04 +00:00
Daniel Sanders	8de3d3cad6	[mips] Fix inconsistent .cprestore behaviour between direct object emission and assembling. Summary: Direct object emission has an initialization order problem where an InitMCObjectFile is called after MipsTargetELFStreamer determines whether PIC is enabled by default or not. There doesn't seem to be point that initializes all cases so split the responsibility between MipsTargetELFStreamer and MipsAsmPrinter. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19728 llvm-svn: 268737	2016-05-06 14:37:24 +00:00
Daniel Sanders	a463d31a64	[mips] Correct the ordering of HI/LO pairs in the relocation table. Summary: There seems to have been a misunderstanding as to the meaning of 'offset' in the rules laid down by our ABI. The previous code believed that 'offset' meant the offset within the section that the relocation is applied to. However, it should have meant the offset from the symbol used in the relocation expression. This patch adds two fields to ELFRelocationEntry and uses them to correct the order of relocations for MIPS. These fields contain: * The original symbol before shouldRelocateWithSymbol() is considered. This ensures that R_MIPS_GOT16 is able to correctly distinguish between local and external symbols, allowing us to tell whether %got() requires a matching %lo() or not (local symbols require one, external symbols don't). It also prevents confusing cases where the fuzzy matching rules cause things like %hi(foo)/%lo(foo+3) and %hi(bar)/%lo(bar+1) to swap their %lo()'s. * The original offset before shouldRelocateWithSymbol() is considered. The existing Addend field is always zero when the object uses in place addends (because it's already moved it to the encoding) but MIPS needs to use the original offset to ensure that the linker correctly calculates the carry-in bit for %hi() and %got(). IAS ensures that unmatchable %hi()/%got() relocations are placed at the end of the table to ensure that the linker rejects the table (we're unable to report such errors directly). The alternatives to this risk accidental matching against inappropriate relocations which may silently compute incorrect values due to an incorrect carry bit between the %lo() and %hi()/%got(). Reviewers: sdardis Subscribers: dsanders, sdardis, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D19718 llvm-svn: 268733	2016-05-06 13:49:25 +00:00
Daniel Sanders	f9d8b8ccc5	[mips][mips16] Use isUnconditionalBranch() in AnalyzeBranch() and constant island pass. Summary: This stops it misidentifying unconditional branches as conditional branches which fixes a -verify-machineinstrs error about exiting a function via fall through. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19864 llvm-svn: 268731	2016-05-06 13:23:51 +00:00
Daniel Sanders	a6cda12179	[mips][fastisel] Conditional moves do not have implicit operands. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19862 llvm-svn: 268730	2016-05-06 12:57:26 +00:00
Sam Kolton	5f10a137d0	[TableGen] AsmMatcher: support for default values for optional operands Summary: This change allows to specify "DefaultMethod" for optional operand (IsOptional = 1) in AsmOperandClass that return default value for operand. This is used in convertToMCInst to set default values in MCInst. Previously if you wanted to set default value for operand you had to create custom converter method. With this change it is possible to use standard converters even when optional operands presented. Reviewers: tstellarAMD, ab, craig.topper Subscribers: jyknight, dsanders, arsenm, nhaustov, llvm-commits Differential Revision: http://reviews.llvm.org/D18242 llvm-svn: 268726	2016-05-06 11:31:17 +00:00
Dylan McKay	6d8078f993	[AVR] Add a majority of the backend code Summary: This adds the majority of the AVR backend. Reviewers: hfinkel, dsanders, vkalintiris, arsenm Subscribers: dylanmckay Differential Revision: http://reviews.llvm.org/D17906 llvm-svn: 268722	2016-05-06 10:12:31 +00:00
Nikolay Haustov	dc1bb79b92	AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2. Summary: Check calling convention in AMDGPUMachineFunction::isKernel This will be used for AMDGPU_HSA_KERNEL symbol type in output ELF. Also, in the future unused non-kernels may be optimized. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19917 llvm-svn: 268719	2016-05-06 09:23:13 +00:00
Zlatko Buljan	31c9ebe281	[mips][microMIPS] Add CodeGen support for MUL* and DMUL* instructions Differential Revision: http://reviews.llvm.org/D15744 llvm-svn: 268714	2016-05-06 08:24:14 +00:00
Justin Bogner	b012699741	SDAG: Rename Select->SelectImpl and repurpose Select as returning void This is a step towards removing the rampant undefined behaviour in SelectionDAG, which is a part of llvm.org/PR26808. We rename SelectionDAGISel::Select to SelectImpl and update targets to match, and then change Select to return void and consolidate the sketchy behaviour we're trying to get away from there. Next, we'll update backends to implement `void Select(...)` instead of SelectImpl and eventually drop the base Select implementation. llvm-svn: 268693	2016-05-05 23:19:08 +00:00
Krzysztof Parzyszek	897574311f	[scan-build] fix warnings emitted on LLVM Hexagon code base Patch by Apelete Seketeli. Differential Revision: http://reviews.llvm.org/D19968 llvm-svn: 268691	2016-05-05 22:00:44 +00:00
Krzysztof Parzyszek	6bd4268302	[Hexagon] Fix the offset ranges for vector memory instructions llvm-svn: 268690	2016-05-05 21:58:02 +00:00
Chad Rosier	777dc513a0	[AArch64] Remove unused MBP headers/dependency. NFC. llvm-svn: 268682	2016-05-05 20:58:38 +00:00
Dan Gohman	450a80754f	[WebAssembly] Don't emit epilogue code in the middle of stackified code. llvm-svn: 268679	2016-05-05 20:41:15 +00:00
Matt Arsenault	539ca882c6	AMDGPU: Simplify control flow / conditions llvm-svn: 268676	2016-05-05 20:27:02 +00:00
NAKAMURA Takumi	2eec13680e	Touch Hexagon/CMakeLists.txt to regenerate build files, since r268641 complains of missing HexagonAlias.td on ninja. FIXME: TableGen.cmake globs *.td(s) with wildcards for deps. It is not good. llvm-svn: 268666	2016-05-05 19:28:01 +00:00
Tim Northover	df43264cf7	ARM: don't attempt to merge litpools referencing different PC-anchors. Given something like: ldr r0, .LCPI0_0 (== pc-rel var) add r0, pc ldr r1, .LCPI0_1 (== pc-rel var) add r1, pc we cannot combine the 2 ldr instructions and litpools because they get added to a different pc to form the correct address. I think the original logic came from a time when we fused the LDRpci/PICADD instructions into one pseudo-instruction so the PC was always immediately at-hand. That's no longer the case. Should fix general-dynamic TLS access on Linux, and quite possibly other -fPIC code that relies on litpools (e.g. v6m and -Oz compilations) though trivial tweaks of the .ll test didn't provoke anything. llvm-svn: 268662	2016-05-05 18:38:53 +00:00
Krzysztof Parzyszek	f7a4bd4068	[Hexagon] Add aliases for vector loads/stores with no explicit offset The mem(r0) instructions are treated as mem(r0+#0). llvm-svn: 268661	2016-05-05 18:38:35 +00:00
Nicolai Haehnle	ffbd56a1c9	AMDGPU: Uniform branch conditions can originate with intrinsics Summary: Discovered by Dave Airlie, fixes an assertion in Khronos OpenGL CTS GL43-CTS.shader_storage_buffer_object.advanced-matrix. In this particular case, the buffer load intrinsic fed into a uniform conditional branch, and led the brcond lowering down the wrong path. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19931 llvm-svn: 268650	2016-05-05 17:36:36 +00:00
Tom Stellard	fcfaea4cff	AMDGPU/SI: Add support for AMD code object version 2. Summary: Version 2 is now the default. If you want to emit version 1, use the amdgcn--amdhsa-amdcov1 triple. Reviewers: arsenm, kzhuravl Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19283 llvm-svn: 268647	2016-05-05 17:03:33 +00:00
Hans Wennborg	501e739d8a	X86CallFrameOptimization: make adjustCallSequence's return type void It always returned the same value (true). No functionality change. llvm-svn: 268645	2016-05-05 16:39:31 +00:00
Krzysztof Parzyszek	8da817d1ca	[Hexagon] Merge HexagonAlias.td into HexagonInstrAlias.td, NFC llvm-svn: 268641	2016-05-05 16:19:36 +00:00
Krzysztof Parzyszek	e57662d5ec	[Hexagon] Handle operand type differences for A2_tfrpi The instruction A2_tfrpi has a 64-bit operand, while the corresponding intrinsic takes a 32-bit value. The actual value has only 8 significant bits, so the difference is only in the type used to represent it. In order to map the intrinsic to the instruction, the operand needs to be extended to the correct type. llvm-svn: 268635	2016-05-05 15:29:47 +00:00
James Y Knight	0c145c0c3a	Remove bit-rotten CppBackend. This backend was supposed to generate C++ code which will re-construct the LLVM IR passed as input. This seems to me to have very marginal usefulness in the first place. However, the code has never been updated to use IRBuilder, which makes its current value negative -- people who look at the output may be steered to use the wrong C++ APIs to construct IR. Furthermore, it's generated code that doesn't compile since at least 2013. Differential Revision: http://reviews.llvm.org/D19942 llvm-svn: 268631	2016-05-05 14:35:40 +00:00
Nirav Dave	996fc133b7	Fix Mips Parser error reporting [mips] On error, ParseDirective should always return false to signify that the directive was understood. Reviewers: dsanders, vkalintiris, sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19929 llvm-svn: 268630	2016-05-05 14:15:46 +00:00
Marcin Koscielnicki	0275fac2c9	[X86] Extend some Linux special cases to cover kFreeBSD. Both Linux and kFreeBSD use glibc, so follow similar code paths. Add isTargetGlibc to check for this, and use it instead of isTargetLinux in a few places. Fixes PR22248 for kFreeBSD. Differential Revision: http://reviews.llvm.org/D19104 llvm-svn: 268624	2016-05-05 11:35:51 +00:00
David Majnemer	911d0e3c21	[X86] Use the right type when folding xor (truncate (shift)) -> setcc The result type of setcc is dependent on whether or not AVX512 is present. We had an X86-specific DAG-combine which assumed that the result type should be i8 when it could be i1. This meant that we would generate illegal setccs which LowerSETCC did not like. Instead, use an appropriate type and zero extend to i8. Also, there were some scenarios where the fold should have fired but didn't because we were overly cautious about the types. This meant that we generated: shrl $31, %edi andl $1, %edi kmovw %edi, %k0 kxnorw %k0, %k0, %k1 kshiftrw $15, %k1, %k1 kxorw %k1, %k0, %k0 kmovw %k0, %eax instead of: testl %edi, %edi setns %al This fixes PR27638. llvm-svn: 268609	2016-05-05 06:00:56 +00:00
Justin Bogner	8752be775c	ARM: Use a Handle to track SDNodes in case they're CSE'd. NFC The code here is recursively Select-ing a new Node to avoid issues where N is CSE'd during replaceDAGValue and stops being valid. We can accomplish the same goal in a more principled way by using a HandleSDNode. This is essentially a less dodgy fix for PR25733 than the original attempt back in r255120. llvm-svn: 268590	2016-05-05 01:43:49 +00:00
Marcin Koscielnicki	ad1482c6f1	[SystemZ] Implement backchain attribute (recommit with fix). This introduces a SystemZ-specific "backchain" attribute on function, which enables writing the frame backchain link as specified by the ABI. This will be used to implement -mbackchain option in clang. Differential Revision: http://reviews.llvm.org/D19889 Fixed in this version: added RegState::Define and RegState::Kill on R1D in prologue. llvm-svn: 268581	2016-05-05 00:37:30 +00:00
Marcin Koscielnicki	12037b4e9d	Revert "[SystemZ] Implement backchain attribute." This reverts commit rL268571. It caused failures in register scavenger. llvm-svn: 268576	2016-05-04 23:54:53 +00:00
Marcin Koscielnicki	9de88d9bbe	[SystemZ] Implement llvm.get.dynamic.area.offset To be used for AddressSanitizer. Differential Revision: http://reviews.llvm.org/D19817 llvm-svn: 268572	2016-05-04 23:31:26 +00:00
Marcin Koscielnicki	835d927938	[SystemZ] Implement backchain attribute. This introduces a SystemZ-specific "backchain" attribute on function, which enables writing the frame backchain link as specified by the ABI. This will be used to implement -mbackchain option in clang. Differential Revision: http://reviews.llvm.org/D19889 llvm-svn: 268571	2016-05-04 23:31:20 +00:00
Quentin Colombet	0c5bfd0514	[X86] Add a few register classes for x32 address accesses. The new register classes allow to tell the machine verifier that it is fine to use RIP for address accesses in x32 mode. Prior to that patch, we would complain that we are using a GR64 in place of GR32, whereas it is actually fine to use GR64 for x32 as long as the 32 high bits are 0s. RIP has this property and is used for RIP-relative addressing. This partially fixes http://llvm.org/PR27481. llvm-svn: 268567	2016-05-04 22:45:31 +00:00
Evandro Menezes	d23324aab1	[AArch64] Add cheap as move instructions for Exynos M1 llvm-svn: 268549	2016-05-04 20:47:25 +00:00
Evandro Menezes	bcb95cd0ed	[AArch64] Use the reciprocal estimation machinery This patch adds support for estimating the square root, its reciprocal and division or reciprocal using the combiner generic reciprocal machinery. llvm-svn: 268539	2016-05-04 20:18:27 +00:00
Vitaly Buka	6b5c89262a	Revert r268529 because it caused use-of-uninitialized-value Summary: This reverts commit d88cc0862bf7da64850b89e9bb5ea9f95e7f1184. #0 0xfed467 in llvm::ARMFrameLowering::determineCalleeSaves(llvm::MachineFunction&, llvm::BitVector&, llvm::RegScavenger) const /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/Target/ARM/ARMFrameLowering.cpp:1625:52 #1 0x330d4cc in (anonymous namespace)::PEI::runOnMachineFunction(llvm::MachineFunction&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/CodeGen/PrologEpilogInserter.cpp:186:3 #2 0x3193e12 in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/CodeGen/MachineFunctionPass.cpp:60:13 #3 0x396237d in llvm::FPPassManager::runOnFunction(llvm::Function&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1526:23 #4 0x3962a23 in llvm::FPPassManager::runOnModule(llvm::Module&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1547:16 #5 0x3963d52 in runOnModule /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1603:23 #6 0x3963d52 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1706 #7 0x6bb910 in compileModule(char*, llvm::LLVMContext&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/tools/llc/llc.cpp:412:5 #8 0x6b3c25 in main /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/tools/llc/llc.cpp:218:22 #9 0x7fd4a7d37ec4 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x21ec4) #10 0x625c93 in _start (/mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm_build_msan/bin/llc+0x625c93) Reviewers: Subscribers: llvm-svn: 268536	2016-05-04 19:44:11 +00:00
Weiming Zhao	2373f769ce	[ARM] Fix Scavenger assert due to underestimated stack size Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268529	2016-05-04 18:19:33 +00:00
Nemanja Ivanovic	1a2b2f03e7	[PowerPC] Generate VSX version of splat word This patch corresponds to review: http://reviews.llvm.org/D18592 It allows the PPC back end to generate the xxspltw instruction where we previously only emitted vspltw. llvm-svn: 268516	2016-05-04 16:04:02 +00:00
Jan Vesely	bbc2231983	AMDGPU/R600: Minor cleanup in InstrInfo Use std::make_pair instead of constructor Use C++11 loop Reuse helper var Reviewers: tstellardAMD Subsribers: arsenm Differential Revision: http://reviews.llvm.org/D19787 llvm-svn: 268503	2016-05-04 14:55:45 +00:00
Daniel Sanders	c07f06aeee	[mips][ias] Only round section sizes when explicitly requested. As requested by Rafael Espindola in his post-commit comments on r268036. This makes the previous behaviour the default while still allowing verification of IAS. llvm-svn: 268496	2016-05-04 13:21:06 +00:00
Chris Dewhurst	8338d90ba3	[Sparc] Allow taking of function address into a register. Modification of previously existing code (variable rename only), with unit test added. Differential Revision: http://reviews.llvm.org/D19368 llvm-svn: 268493	2016-05-04 12:11:05 +00:00
Zlatko Buljan	4807f829b4	[mips][microMIPS] Add CodeGen support for microMIPSr6 ROTR and ROTRV and add tests for LL, SC, SYSCALL, ROTR, ROTRV, LWM32, SWM32 and MOVEP instructions Differential Revision: http://reviews.llvm.org/D19857 llvm-svn: 268491	2016-05-04 12:02:12 +00:00
Chris Dewhurst	69fa1926db	[Sparc] Implement __builtin_setjmp, __builtin_longjmp back-end. This code implements builtin_setjmp and builtin_longjmp exception handling intrinsics for 32-bit Sparc back-ends. The code started as a mash-up of the PowerPC and X86 versions, although there are sufficient differences to both that had to be made for Sparc handling. Note: I have manual tests running. I'll work on a unit test and add that to the rest of this diff in the next day. Also, this implementation is only for 32-bit Sparc. I haven't focussed on a 64-bit version, although I have left the code in a prepared state for implementing this, including detecting pointer size and comments indicating where I suspect there may be differences. Differential Revision: http://reviews.llvm.org/D19798 llvm-svn: 268483	2016-05-04 09:33:30 +00:00
David Majnemer	2c5aeabedd	[X86] Lower zext i1 arguments i1 is now a legal type for X86 with AVX512. There were some paths in X86FastISel which were not quite ready to see an i1 value: they were not quite sure how to deal with sign/zero extends for call arguments. DTRT by extending to i8 for zeroext and bailing out of FastISel for signext. This fixes PR27591. llvm-svn: 268470	2016-05-04 00:22:23 +00:00
Simon Pilgrim	be439d7f1a	[X86] Tidied up SDValue's SDNode referencing. NFCI. llvm-svn: 268445	2016-05-03 21:44:45 +00:00
Tim Northover	d2ecbccf27	X86-Darwin: start emitting data-region directives for jump-tables. The surrounding tools can cope these days, and they were invented for a reason. llvm-svn: 268437	2016-05-03 21:03:41 +00:00
David L Kreitzer	c9fbf1018a	Add an address space for the X86 SS segment. Patch by Michael LeMay (michael.lemay@intel.com) Differential Revision: http://reviews.llvm.org/D17093 llvm-svn: 268431	2016-05-03 20:16:08 +00:00
Tom Stellard	4a304b3886	AMDGPU/SI: Use range loops to simplify some code in the SI Scheduler Reviewers: arsenm, axeldavy Subscribers: MatzeB, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19822 llvm-svn: 268396	2016-05-03 16:30:56 +00:00
Aaron Ballman	3bd56b3b43	Silence unused variable warning; NFC. llvm-svn: 268392	2016-05-03 15:17:25 +00:00
Simon Pilgrim	d2752708a3	[X86][SSE] Added target shuffle combine to MOVQ llvm-svn: 268391	2016-05-03 15:05:13 +00:00

37378 Commits