llvm-project

Commit Graph

Author	SHA1	Message	Date
Joerg Sonnenberger	8ffe7ab7c2	Optimize a printf with a double procent to putchar. llvm-svn: 268922	2016-05-09 14:36:16 +00:00
James Molloy	5c20e27b7f	[VectorUtils] Query number of sign bits to allow more truncations When deciding if a vector calculation can be done in a smaller bitwidth, use sign bit information from ValueTracking to add more information and allow more truncations. llvm-svn: 268921	2016-05-09 14:32:30 +00:00
Daniel Sanders	e473dc937f	[mips][micromips] Make getPointerRegClass() result depend on the instruction. Summary: Previously, it returned the GPR16MMRegClass for all instructions which was incorrect for instructions like lwsp/lwgp and unnecesarily restricted the permitted registers for instructions like lw32. This fixes quite a few of the -verify-machineinstrs errors reported in PR27458. I've only added -verify-machineinstrs to one test in this change since I understand there is a plan to enable the verifier by default. Reviewers: hvarga, zbuljan, zoran.jovanovic, sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19873 llvm-svn: 268918	2016-05-09 13:38:25 +00:00
Strahinja Petrovic	e682b80b8b	[PowerPC] fix register alignment for long double type This patch fixes register alignment for long double type in soft float mode. Before this patch alignment was 8 and this patch changes it to 4. Differential Revision: http://reviews.llvm.org/D18034 llvm-svn: 268909	2016-05-09 12:27:39 +00:00
Chris Dewhurst	e3b8645a1c	[Sparc][LEON] Add UMAC and SMAC instruction support for Sparc LEON subtargets This change adds SMAC (signed multiply-accumulate) and UMAC (unsigned multiply-accumulate) for LEON subtargets of the Sparc processor. The new files LeonFeatures.td and leon-instructions.ll will both be expanded in future, so I want to leave them separate as small files for this review, to be expanded in future check-ins. Note: The functions are provided only for inline-assembly provision. No DAG selection is provided. Differential Revision: http://reviews.llvm.org/D19911 llvm-svn: 268908	2016-05-09 11:55:15 +00:00
Silviu Baranga	f60be28ed8	[AArch64] Implement lowering of the X constraint on AArch64 Summary: This implements the lowering of the X constraint on AArch64. The default behaviour of the X constraint lowering is to restrict it to "f". This is a problem because the "f" constraint is not implemented on AArch64 and would be too restrictive anyway. Therefore, the AArch64 hook will lower this to "w" (if the operand is a floating point or vector) or "r" otherwise. The implementation is similar with the one added for ARM (r267411). This is the AArch64 side of the fix for http://llvm.org/PR26493 Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits, t.p.northover Differential Revision: http://reviews.llvm.org/D19967 llvm-svn: 268907	2016-05-09 11:10:44 +00:00
Simon Pilgrim	bf3a4f552e	[X86][AVX512] Added masked version of combine tests llvm-svn: 268904	2016-05-09 10:43:13 +00:00
Daniel Sanders	3d00056515	[mips][ias] R_MIPS_(GOT\|HI\|LO\|PC)16 and R_MIPS_GPREL32 do not need symbols. Summary: In theory, care must be taken to ensure that pairs of R_MIPS_(GOT\|HI\|LO)16 make the same decision on both relocs in the reloc pair but in practice this isn't as hard as it sounds and only limits the complexity of the predicate used. We handle all three with the same code to ensure their decisions always agree with each other. Reviewers: sdardis Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19016 llvm-svn: 268900	2016-05-09 10:21:14 +00:00
Zlatko Buljan	ba553a6e0a	[mips][microMIPS] Implement LWP and SWP instructions Differential Revision: http://reviews.llvm.org/D10640 llvm-svn: 268896	2016-05-09 08:07:28 +00:00
Frederic Riss	5af2c005eb	[dsymutil] Fix -arch option for thumb variants. r267249 removed the dual ARM/Thumb interface from MachOObjectFile, simplifying llvm-dsymutil's code. This unfortunately also regressed llvm-dsymutil's ability to select thumb slices, because the simplified code was also dealing with the discrepency between the slice arch (eg. armv7m) and the triple arch name (eg. thumbv7m). llvm-svn: 268894	2016-05-09 06:01:12 +00:00
Craig Topper	a58abd1cc6	[AVX512] Fix up types for arguments of int_x86_avx512_mask_cvtsd2ss_round and int_x86_avx512_mask_cvtss2sd_round. Only the argument being converted should be a different type. The other 2 argument should have the same type as the result. llvm-svn: 268891	2016-05-09 05:34:12 +00:00
Craig Topper	707c89c00d	[AVX512] Add non-temporal store patterns for v16i32/v32i16/v64i8. llvm-svn: 268889	2016-05-08 23:43:17 +00:00
Craig Topper	c41320d700	[AVX512] Add missing patterns for non-temporal stores of 128/256-bit vXi8/vXi16/vXi32 when VLX is enabled. The equivalent AVX1/2 patterns are disabled by VLX. This caused regular stores to be emitted instead. llvm-svn: 268886	2016-05-08 23:08:45 +00:00
Craig Topper	e5ce84a33c	[AVX512] Add VLX 128/256-bit SET0 operations that encode to 128/256-bit EVEX encoded VPXORD so all 32 registers can be used. llvm-svn: 268884	2016-05-08 21:33:53 +00:00
Craig Topper	298b6d7493	[X86] Re-generate tests using update_llc_test_checks.py to prepare for a future commit. NFC llvm-svn: 268883	2016-05-08 21:33:47 +00:00
Craig Topper	092794b82a	Remove Windows line endings in some tests to prepare for a future commit. NFC llvm-svn: 268882	2016-05-08 21:33:44 +00:00
Simon Pilgrim	4a9d32c5ba	[CostModel][X86] Extended comparison instruction cost model tests to include SSE2/SSE3/SSSE3/SSE41/SSE42 targets llvm-svn: 268877	2016-05-08 15:24:53 +00:00
Craig Topper	d681e23336	[X86] Lower 256-bit vector all-zero constants to v8i32 even with AVX1 only. Either way a 256-bit VXORPS will be used. llvm-svn: 268873	2016-05-08 07:10:54 +00:00
Craig Topper	3d6722910c	[X86] Add patterns for 256-bit non-temporal stores when only AVX1 is supported. While there, add a predicate to the SSE2 patterns to avoid an ordering dependency. llvm-svn: 268872	2016-05-08 07:10:50 +00:00
Craig Topper	d788498411	[X86] No need to avoid selecting AVX_SET0 for 256-bit integer types when only AVX1 is supported. AVX_SET0 just expands to 256-bit VXORPS which is legal in AVX1. llvm-svn: 268871	2016-05-08 07:10:47 +00:00
Weiming Zhao	5b5501e817	[ARM] Fix Scavenger assert due to underestimated stack size (re-apply r268810 as it exposed an uninitialized variable in ARM MFI. Patch 268868 should fix that.) Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: vitalybuka, aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268869	2016-05-08 05:11:54 +00:00
Simon Pilgrim	b6f82c449a	[SelectionDAG] Added bitreverse(bitreverse(v)) --> v Added bitreverse creation testing llvm-svn: 268865	2016-05-07 20:12:36 +00:00
Simon Pilgrim	8ef046a8ca	[X86] Added BITREVERSE constant folding and identity tests Identity tests are currently failing - this will be fixed soon llvm-svn: 268862	2016-05-07 19:04:00 +00:00
Simon Pilgrim	420852e8d4	[CostModel][X86] Split BSWAP/BITREVERSE cost tests from CTPOP/CTLZ/CTTZ 'bit count' cost tests llvm-svn: 268859	2016-05-07 16:34:16 +00:00
Sanjay Patel	c2751e7050	[x86, BMI] add TLI hook for 'andn' and use it to simplify comparisons For the sake of minimalism, this patch is x86 only, but I think that at least PPC, ARM, AArch64, and Sparc probably want to do this too. We might want to generalize the hook and pattern recognition for a target like PPC that has a full assortment of negated logic ops (orc, nand). Note that http://reviews.llvm.org/D18842 will cause this transform to trigger more often. For reference, this relates to: https://llvm.org/bugs/show_bug.cgi?id=27105 https://llvm.org/bugs/show_bug.cgi?id=27202 https://llvm.org/bugs/show_bug.cgi?id=27203 https://llvm.org/bugs/show_bug.cgi?id=27328 Differential Revision: http://reviews.llvm.org/D19087 llvm-svn: 268858	2016-05-07 15:03:40 +00:00
Mehdi Amini	581f0e1451	Refactor stripDebugInfo(Function) to handle intrinsic This moves the code that handles stripping debug info intrinsic from StripDebugInfo(Module) to StripDebugInfo(Function). The latter is already walking every instructions so it makes sense to do it at the same time. This makes also stripDebugInfo(Function) as an API more useful: it is really dropping every debug info in the Function. Finally the existing code is trigerring an assertion when the Module is not fully materialized. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268847	2016-05-07 04:10:52 +00:00
Vitaly Buka	49bbbd8e7a	Revert r268832 "Refactor stripDebugInfo(Function) to handle intrinsic" It breaks many bots llvm-svn: 268837	2016-05-07 02:10:59 +00:00
Vitaly Buka	e81d96be6f	Revert r268810 becase it brakes msan bot. 16802==WARNING: MemorySanitizer: use-of-uninitialized-value lib/Target/ARM/ARMFrameLowering.cpp:1632 llvm-svn: 268833	2016-05-07 01:54:00 +00:00
Mehdi Amini	6eef08138e	Refactor stripDebugInfo(Function) to handle intrinsic This moves the code that handles stripping debug info intrinsic from StripDebugInfo(Module) to StripDebugInfo(Function). The latter is already walking every instructions so it makes sense to do it at the same time. This makes also stripDebugInfo(Function) as an API more useful: it is really dropping every debug info in the Function. Finally the existing code is trigerring an assertion when the Module is not fully materialized. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268832	2016-05-07 01:42:36 +00:00
Ahmed Bougacha	04a8fc2e37	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. This re-applies r268760, reverted in r268794. Fixes http://llvm.org/PR27670 The original imp-defs assertion was way overzealous: forward all implicit operands, except imp-defs of the new super-reg def (r268787 for GR64, but also possible for GR16->GR32), or imp-uses of the new super-reg use. While there, mark the source use as Undef, and add an imp-use of the old source reg: that should cover any case of dead super-regs. At the stage the pass runs, flags are unlikely to matter anyway; still, let's be as correct as possible. Also add MIR tests for the various interesting cases. Original commit message: Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268831	2016-05-07 01:11:17 +00:00
Matthias Braun	22152acf7b	DetectDeadLanes: Increase precision when detecting undef inputs In case of COPY-like instruction we may be able to deduce that a certain input is unused, based on the used lanes of the register defined by the instruction. This even works accross otherwise incompatible copies (no need to have compatible lanemasks, completely unused operands are still completely unused). It even makes sense to redo the analysis in this case since we gained information for a case we previously stopped at because of the incompatible masks. llvm-svn: 268815	2016-05-06 22:43:50 +00:00
Simon Pilgrim	45964c3742	[SLPVectorizer][X86] Regenerated SEXT/ZEXT cast vectorization tests Added 256-bit vector test as well llvm-svn: 268811	2016-05-06 22:22:18 +00:00
Weiming Zhao	74f12d31c1	[ARM] Fix Scavenger assert due to underestimated stack size (this is resubmit of r268529 with minor refactoring. r268529 was reverted at r268536 due a memory sanitizer failure. I have not been able to reproduce that failure and I checked all the variable used in my change but I could not spot an issue. I did some refactoring and see if it will give a clearer hint) Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: vitalybuka, aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268810	2016-05-06 22:20:13 +00:00
Philip Reames	6f4d0088c6	Reapply 267210 with fix for PR27490 Original Commit Message Extend load/store type canonicalization to handle unordered operations Extend the type canonicalization logic to work for unordered atomic loads and stores. Note that while this change itself is fairly simple and low risk, there's a reasonable chance this will expose problems in the backends by suddenly generating IR they wouldn't have seen before. Anything of this nature will be an existing bug in the backend (you could write an atomic float load), but this will definitely change the frequency with which such cases are encountered. If you see problems, feel free to revert this change, but please make sure you collect a test case. Note that the concern about lowering is now much less likely. PR27490 proved that we already were mucking with the types of ordered atomics and volatiles. As a result, this change doesn't introduce as much new behavior as originally thought. llvm-svn: 268809	2016-05-06 22:17:01 +00:00
Zachary Turner	5a1b5ef9eb	Make llvm-pdbdump print CV type records This reuses the CVTypeDumper from libcodeview to dump full information about type records within a PDB file. Differential Revision: http://reviews.llvm.org/D20022 Reviewed By: rnk llvm-svn: 268808	2016-05-06 22:15:42 +00:00
Philip Reames	4a3c3b66d7	[GVN] PRE of unordered loads Again, fairly simple. Only change is ensuring that we actually copy the property of the load correctly. The aliasing legality constraints were already handled by the FRE patches. There's nothing special about unorder atomics from the perspective of the PRE algorithm itself. llvm-svn: 268804	2016-05-06 21:43:51 +00:00
Simon Pilgrim	2def0a878a	[SLPVectorizer][X86] Added BSWAP/BITREVERSE vectorization tests llvm-svn: 268803	2016-05-06 21:41:55 +00:00
Simon Pilgrim	a2220ea456	[SLPVectorizer][X86] Added CTPOP/CTLZ/CTTZ vectorization tests llvm-svn: 268800	2016-05-06 21:33:01 +00:00
Quentin Colombet	a09f050dc1	Revert "[X86] Add a new LOW32_ADDR_ACCESS_RBP register class." This reverts commit r268796. I believe it breaks test/CodeGen/X86/asm-mismatched-types.ll with: Cannot emit physreg copy instruction llvm-svn: 268799	2016-05-06 21:21:50 +00:00
Zachary Turner	4f84e3bfcf	Fix failing test due to merge conflict. llvm-svn: 268798	2016-05-06 21:19:29 +00:00
Quentin Colombet	2728074e3c	[X86] Add a new LOW32_ADDR_ACCESS_RBP register class. ABIs like NaCl uses 32-bit addresses but have 64-bit frame. The new register class reflects those constraints when choosing a register class for a address access. llvm-svn: 268796	2016-05-06 21:10:53 +00:00
Nico Weber	9b32b4fbee	Revert r268760, it caused PR27670. llvm-svn: 268794	2016-05-06 21:07:02 +00:00
Zachary Turner	819e77d196	Port DebugInfoPDB over to using llvm::Error. Differential Revision: http://reviews.llvm.org/D19940 Reviewed By: rnk llvm-svn: 268791	2016-05-06 20:51:57 +00:00
Marcin Koscielnicki	b088ad1e09	[MSan] [X86] Fix vararg helper for fixed arguments in overflow area. This fixes http://llvm.org/PR27646 on x86_64. Differential Revision: http://reviews.llvm.org/D19997 llvm-svn: 268783	2016-05-06 19:36:56 +00:00
Philip Reames	1fdce639d2	[GVN] Handle unordered atomics in cross block FRE You'll note there are essentially no code changes here. Cross block FRE heavily reuses code from the block local FRE. All of the tricky parts were done as part of the previous patch and the refactoring that removed the original code duplication. llvm-svn: 268775	2016-05-06 18:46:45 +00:00
Eric Christopher	e3f7d3df3c	The associated PR for this test was PR27135 not PR27132. llvm-svn: 268772	2016-05-06 18:23:14 +00:00
Philip Reames	ae8997f496	[GVN] Do local FRE for unordered atomic loads This patch is the first in a small series teaching GVN to optimize unordered loads aggressively. This change just handles block local FRE because that's the simplest thing which lets me test MDA, and the AvailableValue pieces. Somewhat suprisingly, MDA appears fine and only a couple of small changes are needed in GVN. Once this is in, I'll tackle non-local FRE and PRE. The former looks like a natural extension of this, the later will require a couple of minor changes. Differential Revision: http://reviews.llvm.org/D19440 llvm-svn: 268770	2016-05-06 18:17:13 +00:00
Sanjay Patel	1cb6241a89	[SimplifyCFG] propagate branch metadata when creating select (retry r268550 / r268751 with possible fix) Retrying r268550/r268751 which were reverted at r268577/r268765 due a memory sanitizer failure. I have not been able to reproduce that failure, but I've taken another guess at fixing the problem in this version of the patch and will watch for another failure. Original commit message: Unlike earlier similar fixes, we need to recalculate the branch weights in this case. Differential Revision: http://reviews.llvm.org/D19674 llvm-svn: 268767	2016-05-06 18:07:46 +00:00
Sanjay Patel	84a0bf64a8	revert r268751 - caused same failures on msan bot llvm-svn: 268765	2016-05-06 17:51:37 +00:00
Simon Pilgrim	b3f5cb7a65	[CostModel][X86] Tweak 'SSE2-only' test CPU as it was only disabling SSE41 not SSE3/SSSE3 etc. llvm-svn: 268763	2016-05-06 17:50:07 +00:00
Artem Tamazov	ebe71ce36a	[AMDGPU][llvm-mc] Add support for sendmsg(...) syntax. Added support for sendmsg(MSG[, OP[, STREAM_ID]]) syntax in s_sendmsg and s_sendmsghalt instructions. The syntax matches the SP3 assembler/disassembler rules. That is why implicit inputs (like M0 and EXEC) are not printed to disassembly output anymore. sendmsg(...) allows only known message types and attributes, even if literals are used instead of symbolic names. However, raw literal (without "sendmsg") still can be used, and that allows for any 16-bit value. Tests updated/added. Differential Revision: http://reviews.llvm.org/D19596 llvm-svn: 268762	2016-05-06 17:48:48 +00:00
Simon Pilgrim	93d9b96bdb	[CostModel][X86] Added ctlz/cttz undef-zero costmodel tests llvm-svn: 268761	2016-05-06 17:48:35 +00:00
Ahmed Bougacha	258426ca7a	[X86] Teach X86FixupBWInsts to promote MOV8rr/MOV16rr to MOV32rr. Codesize is less (16) or equal (8), and we avoid partial dependencies. Differential Revision: http://reviews.llvm.org/D19999 llvm-svn: 268760	2016-05-06 17:42:57 +00:00
Geoff Berry	f8862968db	[AArch64] Fix test to specify triple and disable post-RA scheduling. This should fix bot breakage caused by r268746: [AArch64] Combine callee-save and local stack SP adjustment instructions. llvm-svn: 268752	2016-05-06 17:12:38 +00:00
Sanjay Patel	6609510c32	[SimplifyCFG] propagate branch metadata when creating select (retry r268550 with possible fix) Retrying r268550 which was reverted at r268577 due a memory sanitizer failure. I have not been able to reproduce that failure, but I've taken a guess at fixing the problem in this version of the patch and will watch for another failure. Original commit message: Unlike earlier similar fixes, we need to recalculate the branch weights in this case. Differential Revision: http://reviews.llvm.org/D19674 llvm-svn: 268751	2016-05-06 17:07:47 +00:00
Geoff Berry	a5335647d5	[AArch64] Combine callee-save and local stack SP adjustment instructions. Summary: If a function needs to allocate both callee-save stack memory and local stack memory, we currently decrement/increment the SP in two steps: first for the callee-save area, and then for the local stack area. This changes the code to allocate them both at once at the very beginning/end of the function. This has two benefits: 1) there is one fewer sub/add micro-op in the prologue/epilogue 2) the stack adjustment instructions act as a scheduling barrier, so moving them to the very beginning/end of the function increases post-RA scheduler's ability to move instructions (that only depend on argument registers) before any of the callee-save stores This change can cause an increase in instructions if the original local stack SP decrement could be folded into the first store to the stack. This occurs when the first local stack store is to stack offset 0. In this case we are trading off one more sub instruction for one fewer sub micro-op (along with benefits (2) and (3) above). Reviewers: t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18619 llvm-svn: 268746	2016-05-06 16:34:59 +00:00
Nikolay Haustov	6eb050ea4e	Revert "AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2." This reverts commit 47486d52454d60cdf6becc0b2efe533c73794380. It broke calling OpenCL kernel from another kernel. llvm-svn: 268739	2016-05-06 14:59:04 +00:00
Simon Pilgrim	5122c64fd8	[CostModel][X86] Added costmodel tests for vector ctpop/ctlz/cttz/bitreverse/bswap llvm-svn: 268738	2016-05-06 14:38:14 +00:00
Daniel Sanders	8de3d3cad6	[mips] Fix inconsistent .cprestore behaviour between direct object emission and assembling. Summary: Direct object emission has an initialization order problem where an InitMCObjectFile is called after MipsTargetELFStreamer determines whether PIC is enabled by default or not. There doesn't seem to be point that initializes all cases so split the responsibility between MipsTargetELFStreamer and MipsAsmPrinter. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19728 llvm-svn: 268737	2016-05-06 14:37:24 +00:00
Chad Rosier	4ab37c0037	[SimplifyCFG] Prefer a simplification based on a dominating condition. Rather than merge two branches with a common destination. Differential Revision: http://reviews.llvm.org/D19743 llvm-svn: 268735	2016-05-06 14:25:14 +00:00
Daniel Sanders	a463d31a64	[mips] Correct the ordering of HI/LO pairs in the relocation table. Summary: There seems to have been a misunderstanding as to the meaning of 'offset' in the rules laid down by our ABI. The previous code believed that 'offset' meant the offset within the section that the relocation is applied to. However, it should have meant the offset from the symbol used in the relocation expression. This patch adds two fields to ELFRelocationEntry and uses them to correct the order of relocations for MIPS. These fields contain: * The original symbol before shouldRelocateWithSymbol() is considered. This ensures that R_MIPS_GOT16 is able to correctly distinguish between local and external symbols, allowing us to tell whether %got() requires a matching %lo() or not (local symbols require one, external symbols don't). It also prevents confusing cases where the fuzzy matching rules cause things like %hi(foo)/%lo(foo+3) and %hi(bar)/%lo(bar+1) to swap their %lo()'s. * The original offset before shouldRelocateWithSymbol() is considered. The existing Addend field is always zero when the object uses in place addends (because it's already moved it to the encoding) but MIPS needs to use the original offset to ensure that the linker correctly calculates the carry-in bit for %hi() and %got(). IAS ensures that unmatchable %hi()/%got() relocations are placed at the end of the table to ensure that the linker rejects the table (we're unable to report such errors directly). The alternatives to this risk accidental matching against inappropriate relocations which may silently compute incorrect values due to an incorrect carry bit between the %lo() and %hi()/%got(). Reviewers: sdardis Subscribers: dsanders, sdardis, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D19718 llvm-svn: 268733	2016-05-06 13:49:25 +00:00
Daniel Sanders	f9d8b8ccc5	[mips][mips16] Use isUnconditionalBranch() in AnalyzeBranch() and constant island pass. Summary: This stops it misidentifying unconditional branches as conditional branches which fixes a -verify-machineinstrs error about exiting a function via fall through. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19864 llvm-svn: 268731	2016-05-06 13:23:51 +00:00
Daniel Sanders	a6cda12179	[mips][fastisel] Conditional moves do not have implicit operands. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19862 llvm-svn: 268730	2016-05-06 12:57:26 +00:00
Ryan Govostes	3f37df0326	[asan] add option to set shadow mapping offset Allowing overriding the default ASAN shadow mapping offset with the -asan-shadow-offset option, and allow zero to be specified for both offset and scale. Patch by Aaron Carroll <aaronc@apple.com>. llvm-svn: 268724	2016-05-06 10:25:22 +00:00
Nikolay Haustov	dc1bb79b92	AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2. Summary: Check calling convention in AMDGPUMachineFunction::isKernel This will be used for AMDGPU_HSA_KERNEL symbol type in output ELF. Also, in the future unused non-kernels may be optimized. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19917 llvm-svn: 268719	2016-05-06 09:23:13 +00:00
Nikolay Haustov	1f7732abfa	AMDGPU/SI: Add amdgpu_kernel calling convention. Part 1. Summary: This will be used for AMDGPU_HSA_KERNEL symbol type in output ELF. Also, in the future unused non-kernels may be optimized. For now, also accept SPIR_KERNEL for HCC frontend. Also, add bitcode compatibility tests for missing calling conventions except AVR_BUILTIN which doesn't have parse code. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, joker.eph, llvm-commits llvm-svn: 268717	2016-05-06 09:07:29 +00:00
Mehdi Amini	3b132e34b0	ThinLTO: fix assertion and refactor check for hidden use from inline ASM in a helper function This test was crashing, and currently it breaks bootstrapping clang with debuginfo Differential Revision: http://reviews.llvm.org/D20008 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268715	2016-05-06 08:25:33 +00:00
Zlatko Buljan	31c9ebe281	[mips][microMIPS] Add CodeGen support for MUL* and DMUL* instructions Differential Revision: http://reviews.llvm.org/D15744 llvm-svn: 268714	2016-05-06 08:24:14 +00:00
Xinliang David Li	8aebf44c97	[PM] port IR based PGO prof-gen pass to new pass manager llvm-svn: 268710	2016-05-06 05:49:19 +00:00
Xinliang David Li	779dd2db95	[profile] Remove another unneeded field in raw profile reader DataValueSize is now removed. The change is consolidated with previous raw version bump. llvm-svn: 268703	2016-05-06 02:13:00 +00:00
Ahmed Bougacha	16547c4e31	[CodeGen] Round [SU]INT_TO_FP result when promoting from f16. If we don't, values that aren't precisely representable in f16 could be used as-is in a promoted f32 operation, which would produce incorrect results. AArch64 had the correct behavior; add a focused test. Fixes http://llvm.org/PR26871 llvm-svn: 268700	2016-05-06 00:58:00 +00:00
Xinliang David Li	28a932742c	[PM] port Branch Frequency Analaysis pass to new PM llvm-svn: 268687	2016-05-05 21:13:27 +00:00
Davide Italiano	f54f2f0893	[PM] Port Interprocedural SCCP to the new pass manager. llvm-svn: 268684	2016-05-05 21:05:36 +00:00
Dehao Chen	f50c67ce7c	Revert http://reviews.llvm.org/D19926 as it breaks tests. llvm-svn: 268681	2016-05-05 20:47:53 +00:00
Dan Gohman	450a80754f	[WebAssembly] Don't emit epilogue code in the middle of stackified code. llvm-svn: 268679	2016-05-05 20:41:15 +00:00
Dehao Chen	e48b4ee98c	Simplify CFG before assigning discriminator. Summary: We need to clean up CFG before assigning discriminator to minimize the impact of optimization on debug info. Reviewers: davidxl, dblaikie, dnovillo Subscribers: dnovillo, danielcdh, llvm-commits Differential Revision: http://reviews.llvm.org/D19926 llvm-svn: 268675	2016-05-05 20:18:49 +00:00
Marcin Koscielnicki	60061c21cb	[MSan] [MIPS64] Fix vararg helper for >1 fixed argument. This fixes http://llvm.org/PR27646 on Mips64. Differential Revision: http://reviews.llvm.org/D19989 llvm-svn: 268673	2016-05-05 20:13:17 +00:00
Matt Arsenault	6689abe632	AMDGPU: Run r600 tests last llvm-svn: 268672	2016-05-05 20:07:37 +00:00
Tim Northover	df43264cf7	ARM: don't attempt to merge litpools referencing different PC-anchors. Given something like: ldr r0, .LCPI0_0 (== pc-rel var) add r0, pc ldr r1, .LCPI0_1 (== pc-rel var) add r1, pc we cannot combine the 2 ldr instructions and litpools because they get added to a different pc to form the correct address. I think the original logic came from a time when we fused the LDRpci/PICADD instructions into one pseudo-instruction so the PC was always immediately at-hand. That's no longer the case. Should fix general-dynamic TLS access on Linux, and quite possibly other -fPIC code that relies on litpools (e.g. v6m and -Oz compilations) though trivial tweaks of the .ll test didn't provoke anything. llvm-svn: 268662	2016-05-05 18:38:53 +00:00
Krzysztof Parzyszek	f7a4bd4068	[Hexagon] Add aliases for vector loads/stores with no explicit offset The mem(r0) instructions are treated as mem(r0+#0). llvm-svn: 268661	2016-05-05 18:38:35 +00:00
Vitaly Buka	1df2338bb6	Revert "[ThinLTO] Emit individual index files for distributed backends" MemorySanitizer: use-of-uninitialized-value in lib/Bitcode/Writer/BitcodeWriter.cpp:364:70 http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/12544/steps/check-llvm%20msan/logs/stdio This reverts commit 0c4a898ea550699d1b2f4fe3767251c8f9a48d52. llvm-svn: 268660	2016-05-05 18:31:00 +00:00
Kevin Enderby	b34e3a1877	Clean up the specific error message for a malformed Mach-O files with bad segment load commands. The existing test case in test/Object/macho-invalid.test for macho-invalid-too-small-segment-load-command has a cmdsize of 55, while being too small also it is not a multiple of 4. So when that check is added this test case will produce a different error. So I constructed a new test case that will trigger the intended error. I also changed the error message to be consistent with the other malformed Mach-O file error messages which prints the load command index. I also removed both object_error::macho_load_segment_too_small and object_error::macho_load_segment_too_many_sections from Object/Error.h as they are not needed and can just use object_error::parse_failed and let the error message string distinguish the specific error. llvm-svn: 268652	2016-05-05 17:43:35 +00:00
Nicolai Haehnle	ffbd56a1c9	AMDGPU: Uniform branch conditions can originate with intrinsics Summary: Discovered by Dave Airlie, fixes an assertion in Khronos OpenGL CTS GL43-CTS.shader_storage_buffer_object.advanced-matrix. In this particular case, the buffer load intrinsic fed into a uniform conditional branch, and led the brcond lowering down the wrong path. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19931 llvm-svn: 268650	2016-05-05 17:36:36 +00:00
Tom Stellard	fcfaea4cff	AMDGPU/SI: Add support for AMD code object version 2. Summary: Version 2 is now the default. If you want to emit version 1, use the amdgcn--amdhsa-amdcov1 triple. Reviewers: arsenm, kzhuravl Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19283 llvm-svn: 268647	2016-05-05 17:03:33 +00:00
Chad Rosier	25cfb7dbd6	[ValueTracking] Improve isImpliedCondition for matching LHS and Imm RHSs. llvm-svn: 268636	2016-05-05 15:39:18 +00:00
Silviu Baranga	c05bab8a9c	[LV] Identify more induction PHIs by coercing expressions to AddRecExprs Summary: Some PHIs can have expressions that are not AddRecExprs due to the presence of sext/zext instructions. In order to prevent the Loop Vectorizer from bailing out when encountering these PHIs, we now coerce the SCEV expressions to AddRecExprs using SCEV predicates (when possible). We only do this when the alternative would be to not vectorize. Reviewers: mzolotukhin, anemet Subscribers: mssimpso, sanjoy, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D17153 llvm-svn: 268633	2016-05-05 15:20:39 +00:00
James Y Knight	0c145c0c3a	Remove bit-rotten CppBackend. This backend was supposed to generate C++ code which will re-construct the LLVM IR passed as input. This seems to me to have very marginal usefulness in the first place. However, the code has never been updated to use IRBuilder, which makes its current value negative -- people who look at the output may be steered to use the wrong C++ APIs to construct IR. Furthermore, it's generated code that doesn't compile since at least 2013. Differential Revision: http://reviews.llvm.org/D19942 llvm-svn: 268631	2016-05-05 14:35:40 +00:00
Nirav Dave	996fc133b7	Fix Mips Parser error reporting [mips] On error, ParseDirective should always return false to signify that the directive was understood. Reviewers: dsanders, vkalintiris, sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19929 llvm-svn: 268630	2016-05-05 14:15:46 +00:00
Teresa Johnson	f8cbd6591f	Fix Windows bot failures from r268627 Remove "/" path separator from expected pattern which should fix a couple of Windows bots that have failed: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/4816 http://bb.pgr.jp/builders/ninja-clang-i686-msc19-R/builds/2610 llvm-svn: 268629	2016-05-05 14:10:57 +00:00
Teresa Johnson	9254ebe3c0	[ThinLTO] Emit individual index files for distributed backends Summary: When launching ThinLTO backends in a distributed build (currently supported in gold via the thinlto-index-only plugin option), emit an individual index file for each backend process as described here: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098272.html The individual index file encodes the summary and module information required for implementing the importing/exporting decisions made for a given module in the thin link step. This is in place of the current mechanism that uses the combined index to make importing decisions in each back end independently. It is an enabler for doing global summary based optimizations in the thin link step (which will be recorded in the individual index files), and reduces the size of the index that must be sent to each backend process, and the amount of work to scan it in the backends. Rather than create entirely new ModuleSummaryIndex structures (and all the included unique_ptrs) for each backend index file, a map is created to record all of the GUID and summary pointers needed for a particular index file. The IndexBitcodeWriter walks this map instead of the full index (hiding the details of managing the appropriate summary iteration in a new iterator subclass). This is more efficient than walking the entire combined index and filtering out just the needed summaries during each backend bitcode index write. Depends on D19481. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19556 llvm-svn: 268627	2016-05-05 13:44:56 +00:00
Marcin Koscielnicki	0275fac2c9	[X86] Extend some Linux special cases to cover kFreeBSD. Both Linux and kFreeBSD use glibc, so follow similiar code paths. Add isTargetGlibc to check for this, and use it instead of isTargetLinux in a few places. Fixes PR22248 for kFreeBSD. Differential Revision: http://reviews.llvm.org/D19104 llvm-svn: 268624	2016-05-05 11:35:51 +00:00
Igor Kudrin	27d8dd39cf	[Coverage] Combine counts of expansion regions if there are no code regions for the same area. Differential Revision: http://reviews.llvm.org/D18831 llvm-svn: 268620	2016-05-05 09:39:45 +00:00
David Majnemer	911d0e3c21	[X86] Use the right type when folding xor (truncate (shift)) -> setcc The result type of setcc is dependent on whether or not AVX512 is present. We had an X86-specific DAG-combine which assumed that the result type should be i8 when it could be i1. This meant that we would generate illegal setccs which LowerSETCC did not like. Instead, use an appropriate type and zero extend to i8. Also, there were some scenarios where the fold should have fired but didn't because we were overly cautious about the types. This meant that we generated: shrl $31, %edi andl $1, %edi kmovw %edi, %k0 kxnorw %k0, %k0, %k1 kshiftrw $15, %k1, %k1 kxorw %k1, %k0, %k0 kmovw %k0, %eax instead of: testl %edi, %edi setns %al This fixes PR27638. llvm-svn: 268609	2016-05-05 06:00:56 +00:00
Mehdi Amini	022b5bcb7a	LTOCodeGenerator: add linkonce(_odr) to "llvm.compiler.used" when present in "MustPreserve" set If the linker requested to preserve a linkonce function, we should honor this even if we drop all uses. We explicitely avoid turning them into weak_odr (unlike the first version of this patch in r267644), because the codegen can be different on Darwin: because of `llvm::canBeOmittedFromSymbolTable()` we may emit the symbol as weak_def_can_be_hidden instead of weak_definition. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268607	2016-05-05 05:14:24 +00:00
Mehdi Amini	752ffe9c5f	Revert "LTOCodeGenerator: turns linkonce(_odr) into weak_(odr) when present "MustPreserve" set" This reverts commit r267644. Turning linkonce_odr into weak_odr is a sementic change on Darwin: because of `llvm::canBeOmittedFromSymbolTable()` we may emit the symbol as weak_def_can_be_hidden instead of weak_definition. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268606	2016-05-05 05:14:20 +00:00
Xinliang David Li	78d61b11e3	[Profile] Raw profile header clean up Remove dead ValueDataBegin field in raw header. llvm-svn: 268602	2016-05-05 04:07:30 +00:00
Xinliang David Li	6e5dd41481	[PM] Port Branch Probability Analysis pass to the new pass manager. Differential Revision: http://reviews.llvm.org/D19839 llvm-svn: 268601	2016-05-05 02:59:57 +00:00
Davide Italiano	344e838fea	[PM] Port EliminateAvailableExternally pass to the new pass manager. llvm-svn: 268599	2016-05-05 02:37:32 +00:00
Ryan Govostes	8c21be6b3e	Revert "[asan] add option to set shadow mapping offset" This reverts commit ba89768f97b1d4326acb5e33c14eb23a05c7bea7. llvm-svn: 268588	2016-05-05 01:27:04 +00:00
Ryan Govostes	097c5b051c	[asan] add option to set shadow mapping offset Allowing overriding the default ASAN shadow mapping offset with the -asan-shadow-offset option, and allow zero to be specified for both offset and scale. llvm-svn: 268586	2016-05-05 01:14:39 +00:00
Davide Italiano	164b9bc6fe	[PM] Port ConstantMerge to the new pass manager. llvm-svn: 268582	2016-05-05 00:51:09 +00:00
Marcin Koscielnicki	ad1482c6f1	[SystemZ] Implement backchain attribute (recommit with fix). This introduces a SystemZ-specific "backchain" attribute on function, which enables writing the frame backchain link as specified by the ABI. This will be used to implement -mbackchain option in clang. Differential Revision: http://reviews.llvm.org/D19889 Fixed in this version: added RegState::Define and RegState::Kill on R1D in prologue. llvm-svn: 268581	2016-05-05 00:37:30 +00:00
Adam Nemet	3c5eabfcbc	[LoopDataPrefetch] Add optimization remark With -Rpass=loop-data-prefetch, show the memory access that got prefetched. llvm-svn: 268578	2016-05-05 00:08:15 +00:00
Vitaly Buka	fdcea9d78a	Revert "[SimplifyCFG] propagate branch metadata when creating select" MemorySanitizer: use-of-uninitialized-value 0x4910e47 in count /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/include/llvm/Support/MathExtras.h:159:12 0x4910e47 in countLeadingZeros<unsigned long> /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/include/llvm/Support/MathExtras.h:183 0x4910e47 in FitWeights /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/Transforms/Utils/SimplifyCFG.cpp:855 0x4910e47 in SimplifyCondBranchToCondBranch /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/Transforms/Utils/SimplifyCFG.cpp:2895 This reverts commit 609f4dd4bf3bc735c8c047a4d4b0a8e9e4d202e2. llvm-svn: 268577	2016-05-04 23:59:33 +00:00
Marcin Koscielnicki	12037b4e9d	Revert "[SystemZ] Implement backchain attribute." This reverts commit rL268571. It caused failures in register scavenger. llvm-svn: 268576	2016-05-04 23:54:53 +00:00
Marcin Koscielnicki	9de88d9bbe	[SystemZ] Implement llvm.get.dynamic.area.offset To be used for AddressSanitizer. Differential Revision: http://reviews.llvm.org/D19817 llvm-svn: 268572	2016-05-04 23:31:26 +00:00
Marcin Koscielnicki	835d927938	[SystemZ] Implement backchain attribute. This introduces a SystemZ-specific "backchain" attribute on function, which enables writing the frame backchain link as specified by the ABI. This will be used to implement -mbackchain option in clang. Differential Revision: http://reviews.llvm.org/D19889 llvm-svn: 268571	2016-05-04 23:31:20 +00:00
Quentin Colombet	0c5bfd0514	[X86] Add a few register classes for x32 address accesses. The new register classes allow to tell the machine verifier that it is fine to use RIP for address accesses in x32 mode. Prior to that patch, we would complain that we are using a GR64 in place of GR32, whereas it is actually fine to use GR64 for x32 as long as the 32 high bits are 0s. RIP has this property and is used for RIP-relative addressing. This partially fixes http://llvm.org/PR27481. llvm-svn: 268567	2016-05-04 22:45:31 +00:00
Simon Pilgrim	1f5ad702f8	[SelectionDAG] BITREVERSE vector legalization of bit operations (REAPPLIED) Some vector bit operations are promoted instead of having custom lowering. This patch changes the isOperationLegalOrCustom tests for vector AND/OR operations to use a new TLI helper isOperationLegalOrCustomOrPromote instead, allowing the SSE implementations to stay on the simd unit. Differential Revision: http://reviews.llvm.org/D19805 llvm-svn: 268561	2016-05-04 22:08:51 +00:00
Balaram Makam	569eaec5f3	"Reapply r268521 "[InstCombine] Canonicalize icmp instructions based on dominating conditions."" This reapplies commit r268521, that was reverted in r268530 due to a test failure in select-implied.ll Modified the test case to reflect the new change. llvm-svn: 268557	2016-05-04 21:32:14 +00:00
Sanjay Patel	7e8c285814	[SimplifyCFG] propagate branch metadata when creating select Unlike earlier similar fixes, we need to recalculate the branch weights in this case. Differential Revision: http://reviews.llvm.org/D19674 llvm-svn: 268550	2016-05-04 20:48:24 +00:00
Evandro Menezes	bcb95cd0ed	[AArch64] Use the reciprocal estimation machinery This patch adds support for estimating the square root, its reciprocal and division or reciprocal using the combiner generic reciprocal machinery. llvm-svn: 268539	2016-05-04 20:18:27 +00:00
Vitaly Buka	6b5c89262a	Revert r268529 because it caused use-of-uninitialized-value Summary: This reverts commit d88cc0862bf7da64850b89e9bb5ea9f95e7f1184. #0 0xfed467 in llvm::ARMFrameLowering::determineCalleeSaves(llvm::MachineFunction&, llvm::BitVector&, llvm::RegScavenger) const /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/Target/ARM/ARMFrameLowering.cpp:1625:52 #1 0x330d4cc in (anonymous namespace)::PEI::runOnMachineFunction(llvm::MachineFunction&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/CodeGen/PrologEpilogInserter.cpp:186:3 #2 0x3193e12 in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/CodeGen/MachineFunctionPass.cpp:60:13 #3 0x396237d in llvm::FPPassManager::runOnFunction(llvm::Function&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1526:23 #4 0x3962a23 in llvm::FPPassManager::runOnModule(llvm::Module&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1547:16 #5 0x3963d52 in runOnModule /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1603:23 #6 0x3963d52 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/lib/IR/LegacyPassManager.cpp:1706 #7 0x6bb910 in compileModule(char*, llvm::LLVMContext&) /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/tools/llc/llc.cpp:412:5 #8 0x6b3c25 in main /mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm/tools/llc/llc.cpp:218:22 #9 0x7fd4a7d37ec4 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x21ec4) #10 0x625c93 in _start (/mnt/b/sanitizer-buildbot2/sanitizer-x86_64-linux-bootstrap/build/llvm_build_msan/bin/llc+0x625c93) Reviewers: Subscribers: llvm-svn: 268536	2016-05-04 19:44:11 +00:00
Hal Finkel	e2b89118bd	[ConstantFold] Don't try to strip fp -> int bitcasts to simplify icmps ConstantFold has logic to take icmp (bitcast x to y), null and strip the bitcast. This makes sense in general, but not if x has floating-point type. In this case, we'd need a fcmp, not an icmp, and the code will assert. We normally don't see this situation because we constant fold fp -> int bitcasts, however, we'll see it for bitcasts of ppc_fp128 -> i128. This is because that bitcast is Endian-dependent, and as a result, we don't simplify it in ConstantFold (we could, but no one has yet added the necessary logic). Regardless, ConstantFold should not depend on that canonicalization for correctness. llvm-svn: 268534	2016-05-04 19:37:08 +00:00
Sanjay Patel	13d57b94bb	[x86] add tests to show current codegen for obscured fneg/fabs llvm-svn: 268533	2016-05-04 19:06:03 +00:00
Marcin Koscielnicki	cc9676a821	[MSan] [Mips64] Add tests for vararg handling. Differential Revision: http://reviews.llvm.org/D19919 llvm-svn: 268531	2016-05-04 18:39:14 +00:00
Balaram Makam	31e7e13789	Revert "[InstCombine] Canonicalize icmp instructions based on dominating conditions." This reverts commit 573a40f79b35cf3e71db331bb00f6a84f03b835d. llvm-svn: 268530	2016-05-04 18:37:35 +00:00
Weiming Zhao	2373f769ce	[ARM] Fix Scavenger assert due to underestimated stack size Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268529	2016-05-04 18:19:33 +00:00
Simon Pilgrim	1a14f0d25c	Revert r268504 llvm-svn: 268526	2016-05-04 17:49:14 +00:00
Marianne Mailhot-Sarrasin	b192670279	Adding test cases showing the behavior of LoopUnrollPass according to optnone and optsize attributes The unroll pass was disabled by clang in /Os. Those new test cases shows that the pass will behave correctly even if it is not fully disabled. This patch is related in some way to the clang commit (http://reviews.llvm.org/D19827), which re-enables the pass in /Os. Differential Revision: http://reviews.llvm.org/D19870 llvm-svn: 268524	2016-05-04 17:45:40 +00:00
Balaram Makam	cf3bcb2625	[InstCombine] Canonicalize icmp instructions based on dominating conditions. Summary: This patch canonicalizes conditions based on the constant range information of the dominating branch condition. For example: %cmp = icmp slt i64 %a, 0 br i1 %cmp, label %land.lhs.true, label %lor.rhs lor.rhs: %cmp2 = icmp sgt i64 %a, 0 Would now be canonicalized into: %cmp = icmp slt i64 %a, 0 br i1 %cmp, label %land.lhs.true, label %lor.rhs lor.rhs: %cmp2 = icmp ne i64 %a, 0 Reviewers: mcrosier, gberry, t.p.northover, llvm-commits, reames, hfinkel, sanjoy, majnemer Subscribers: MatzeB, majnemer, mcrosier Differential Revision: http://reviews.llvm.org/D18841 llvm-svn: 268521	2016-05-04 17:34:20 +00:00
Reid Kleckner	b034526853	Reland "Use ScopedPrinter in llvm-pdbdump" This reverts r268508 and reinstates r268506 with an additional cast from TypeLeafKind to unsigned to allow conversion to HexNumber. llvm-svn: 268517	2016-05-04 16:09:04 +00:00
Nemanja Ivanovic	1a2b2f03e7	[PowerPC] Generate VSX version of splat word This patch corresponds to review: http://reviews.llvm.org/D18592 It allows the PPC back end to generate the xxspltw instruction where we previously only emitted vspltw. llvm-svn: 268516	2016-05-04 16:04:02 +00:00
Simon Pilgrim	bc0e1d7492	[X86][SSE] Regenerate vector bswap tests llvm-svn: 268514	2016-05-04 15:45:48 +00:00
Hans Wennborg	0c3518e84b	[SimplifyCFG] isSafeToSpeculateStore now ignores debug info This patch fixes PR27615. @llvm.dbg.value instructions no longer count towards the maximum number of instructions to look back at in the instruction list when searching for a store instruction. This should make the output consistent between debug and non-debug build. Patch by Henric Karlsson <henric.karlsson@ericsson.com>! Differential Revision: http://reviews.llvm.org/D19912 llvm-svn: 268512	2016-05-04 15:40:57 +00:00
Chad Rosier	20dbbf3542	Revert "Use ScopedPrinter in llvm-pdbdump" This reverts commit r268506 due to build breakage. llvm-svn: 268508	2016-05-04 15:25:06 +00:00
Zachary Turner	cdd313ca19	Use ScopedPrinter in llvm-pdbdump When printing raw PDB file fields, streams, and records, use the ScopedPrinter class so we have consistency with llvm-readobj's output format. For the most part this is pretty mechanical, but I had to fix up the test file to conform to the new YAMLesque output format. i added a few additional helper functions to the ScopedPrinter such as one to print a dotted version, etc. Differential Revision: http://reviews.llvm.org/D19897 Reviewed By: rnk llvm-svn: 268506	2016-05-04 15:05:12 +00:00
Simon Pilgrim	b97c06210b	[SelectionDAG] BITREVERSE vector legalization of bit operations Vector bit operations are typically promoted instead of having custom lowering. This patch changes the isOperationLegalOrCustom tests for vector AND/OR operations to use isOperationLegalOrPromote instead, allowing the SSE implementations to stay on the simd unit. Differential Revision: http://reviews.llvm.org/D19805 llvm-svn: 268504	2016-05-04 15:01:13 +00:00
Igor Laevsky	fb1811d3a0	[RS4GC] Use SetVector/MapVector instead of DenseSet/DenseMap to guarantee stable ordering Goal of this change is to guarantee stable ordering of the statepoint arguments and other newly inserted values such as gc.relocates. Previously we had explicit sorting in a couple of places. However for unnamed values ordering was partial and overall we didn't have any strong invariant regarding it. This change switches all data structures to use SetVector's and MapVector's which provide possibility for deterministic iteration over them. Explicit sorting is now redundant and was removed. Differential Revision: http://reviews.llvm.org/D19669 llvm-svn: 268502	2016-05-04 14:55:36 +00:00
Elena Demikhovsky	24aba1ca38	The test files are auto-generated by update_llc_test_checks.py utility. No functional changes. llvm-svn: 268498	2016-05-04 14:31:18 +00:00
Daniel Sanders	c07f06aeee	[mips][ias] Only round section sizes when explicitly requested. As requested by Rafael Espindola in his post-commit comments on r268036. This makes the previous behaviour the default while still allowing verification of IAS. llvm-svn: 268496	2016-05-04 13:21:06 +00:00
Chris Dewhurst	8338d90ba3	[Sparc] Allow taking of function address into a register. Modification of previously existing code (variable rename only), with unit test added. Differential Revision: http://reviews.llvm.org/D19368 llvm-svn: 268493	2016-05-04 12:11:05 +00:00
Zlatko Buljan	4807f829b4	[mips][microMIPS] Add CodeGen support for microMIPSr6 ROTR and ROTRV and add tests for LL, SC, SYSCALL, ROTR, ROTRV, LWM32, SWM32 and MOVEP instructions Differential Revision: http://reviews.llvm.org/D19857 llvm-svn: 268491	2016-05-04 12:02:12 +00:00
Chris Dewhurst	69fa1926db	[Sparc] Implement __builtin_setjmp, __builtin_longjmp back-end. This code implements builtin_setjmp and builtin_longjmp exception handling intrinsics for 32-bit Sparc back-ends. The code started as a mash-up of the PowerPC and X86 versions, although there are sufficient differences to both that had to be made for Sparc handling. Note: I have manual tests running. I'll work on a unit test and add that to the rest of this diff in the next day. Also, this implementation is only for 32-bit Sparc. I haven't focussed on a 64-bit version, although I have left the code in a prepared state for implementing this, including detecting pointer size and comments indicating where I suspect there may be differences. Differential Revision: http://reviews.llvm.org/D19798 llvm-svn: 268483	2016-05-04 09:33:30 +00:00
Daniel Sanders	04468f2914	[mips] Remove -mattr=+n64 and fix indentation in tailcall.ll RUN lines. NFC. -mattr=+n64 isn't the correct way to specify the ABI and N64 is already the default for the RUN line concerned. llvm-svn: 268482	2016-05-04 09:08:35 +00:00
David Majnemer	3918cdd2a1	[ConstantFolding, ValueTracking] Fold constants involving bitcasts of ConstantVector We assumed that ConstantVectors would be rather uninteresting from the perspective of analysis. However, this is not the case due to a quirk of how LLVM handles vectors of i1. Vectors of i1 are not ConstantDataVectors like vectors of i8, i16, i32 or i64 because i1's SizeInBits differs from it's StoreSizeInBytes. This leads to it being categorized as a ConstantVector instead of a ConstantDataVector. Instead, treat ConstantVector more uniformly. This fixes PR27591. llvm-svn: 268479	2016-05-04 06:13:33 +00:00
Simon Atanasyan	8a71b53ea9	[llvm-readobj] Print MIPS .MIPS.options section content .MIPS.options section specifies miscellaneous options to be applied to an object file. LLVM as well as modern versions of GNU tools emit the only type of the options - ODK_REGINFO. The patch teaches llvm-readobj to print details of the ODK_REGINFO and skip contents of other options. llvm-svn: 268478	2016-05-04 05:58:57 +00:00
David Majnemer	2c5aeabedd	[X86] Lower zext i1 arguments i1 is now a legal type for X86 with AVX512. There were some paths in X86FastISel which were not quite ready to see an i1 value: they were not quite sure how to deal with sign/zero extends for call arguments. DTRT by extending to i8 for zeroext and bailing out of FastISel for signext. This fixes PR27591. llvm-svn: 268470	2016-05-04 00:22:23 +00:00
David Majnemer	95549497ec	[GlobalDCE, Misc] Don't remove functions referenced by ifuncs We forgot to consider the target of ifuncs when considering if a function was alive or dead. N.B. Also update a few auxiliary tools like bugpoint and verify-uselistorder. This fixes PR27593. llvm-svn: 268468	2016-05-04 00:20:48 +00:00
Kevin Enderby	a8e3ab0c56	Produce another specific error message for a malformed Mach-O file when a load command has a size less than 8 bytes. I think the existing test case in test/Object/macho-invalid.test for macho64-invalid-too-small-load-command was trying to test for this but that test case triggered a different error given how it was constructed. So I constructed a new test case that would trigger this specific error. I also changed the error message to be consistent with the other malformed Mach-O file error messages. I also removed object_error::macho_small_load_command from Object/Error.h as it is not needed and can just use object_error::parse_failed and let the error message string distinguish the error. llvm-svn: 268463	2016-05-03 23:13:50 +00:00
Andrew Kaylor	50271f787e	Add opt-bisect support to additional passes that can be skipped Differential Revision: http://reviews.llvm.org/D19882 llvm-svn: 268457	2016-05-03 22:32:30 +00:00
Justin Bogner	d0d2341f30	PM: Port LoopRotation to the new loop pass manager llvm-svn: 268452	2016-05-03 22:02:31 +00:00
Simon Pilgrim	fb1766ad68	[X86][XOP] Add placeholder VPERMIL2 combining tests llvm-svn: 268450	2016-05-03 21:55:37 +00:00
Justin Bogner	ab6a513b4e	PM: Port LoopSimplifyCFG to the new pass manager llvm-svn: 268446	2016-05-03 21:47:32 +00:00
Tim Northover	d2ecbccf27	X86-Darwin: start emitting data-region directives for jump-tables. The surrounding tools can cope these days, and they were invented for a reason. llvm-svn: 268437	2016-05-03 21:03:41 +00:00
Sanjoy Das	8a004551d0	[RS4GC] Add a test case around calling conventions; NFC llvm-svn: 268436	2016-05-03 20:58:10 +00:00
Davide Italiano	66228c4cf1	[IPO/GlobalDCE] Port to the new pass manager. Differential Revision: http://reviews.llvm.org/D19782 llvm-svn: 268425	2016-05-03 19:39:15 +00:00
Jack Liu	f101c0f7a1	[SROA] Function canConvertValue needs to check whether both NewTy and OldTy pointers are pointing to the same addr space. This can prevent SROA from creating a bitcast between pointers with different addr spaces. Differential Revision: http://reviews.llvm.org/D19697 llvm-svn: 268424	2016-05-03 19:30:48 +00:00
Jack Liu	430e2c2140	Revert 268409 due to missing comment. llvm-svn: 268421	2016-05-03 19:15:02 +00:00
Quentin Colombet	26dab3a485	[ImplicitNullChecks] Account for implicit-defs as well when updating the liveness. The replaced load may have implicit-defs and those defs may be used in the block of the original load. Make sure to update the liveness accordingly. This is a generalization of r267817. llvm-svn: 268412	2016-05-03 18:09:06 +00:00
Jack Liu	1ff4a0b7ee	(no commit message) llvm-svn: 268409	2016-05-03 18:01:43 +00:00
Sanjoy Das	4ae3920c5b	[LICM] Kill SCEV loop dispositions if needed SCEV caches whether SCEV expressions are loop invariant, variant or computable. LICM breaks this cache, almost by definition; so clear the SCEV disposition cache if LICM changed anything. llvm-svn: 268408	2016-05-03 17:50:11 +00:00
Sanjoy Das	7e7a5a050a	Use all_of instead of a raw loop; NFC Added some tests despite being NFC, since it looks like nothing was exercising the "all incoming values to exit PHIs are same" logic. llvm-svn: 268407	2016-05-03 17:50:06 +00:00
Sanjoy Das	905fc27ebf	[LoopDeletion] Clear SCEV loop dispositions `Loop::makeLoopInvariant` can hoist instructions out of loops, so loop dispositions for the loop it operated on may need to be cleared. We can be smarter here (especially around how `forgetLoopDispositions` is implemented), but let's be correct first. Fixes PR27570. llvm-svn: 268406	2016-05-03 17:50:02 +00:00
Sanjoy Das	013a4ac4aa	[SCEV] Tweak the output format and content of -analyze In the "LoopDispositions:" section: - Instead of printing out a list, print out a "dictionary" to make it obvious by inspection which disposition is for which loop. This is just a cosmetic change. - Print dispositions for parent _and_ sibling loops. I will use this to write a test case. llvm-svn: 268405	2016-05-03 17:49:57 +00:00
Kevin Enderby	368e714907	Produce another specific error message for a malformed Mach-O file when a load command other than the first one is past the end of the load commands. This is like the test case in test/Object/macho-invalid.test for macho64-invalid-incomplete-load-command but it is the second load command that is past the end of all the load commands instead of the first. The code in the constructor for MachOObjectFile that loops over the load commands used getNextLoadCommandInfo() which was not producing a good error message. So that was fixed and a test case was added. llvm-svn: 268403	2016-05-03 17:16:08 +00:00
Mehdi Amini	7f7d8be518	Move "Eliminate Available Externally" immediately after the inliner This pass is supposed to reduce the size of the IR for compile time purpose. We should run it ASAP, except when we prepare for LTO or ThinLTO, and we want to keep them available for link-time inline. Differential Revision: http://reviews.llvm.org/D19813 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268394	2016-05-03 15:46:00 +00:00
Simon Pilgrim	d2752708a3	[X86][SSE] Added target shuffle combine to MOVQ llvm-svn: 268391	2016-05-03 15:05:13 +00:00
Anna Thomas	43d7e1cbff	Fold compares irrespective of whether allocation can be elided Summary When a non-escaping pointer is compared to a global value, the comparison can be folded even if the corresponding malloc/allocation call cannot be elided. We need to make sure the global value is not null, since comparisons to null cannot be folded. In future, we should also handle cases when the the comparison instruction dominates the pointer escape. Reviewers: sanjoy Subscribers s.egerton, llvm-commits Differential Revision: http://reviews.llvm.org/D19549 llvm-svn: 268390	2016-05-03 14:58:21 +00:00
Daniel Sanders	01bcefd983	[mips][fastisel] ADJCALLSTACKUP has a second immediate operand. Summary: It's always zero for SelectionDAG and is never read by the MIPS backend so do the same for FastISel. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19863 llvm-svn: 268386	2016-05-03 14:19:26 +00:00
Daniel Sanders	fe98b2f54b	[mips] Use MipsMCExpr instead of MCSymbolRefExpr for all relocations. Summary: This is much closer to the way MIPS relocation expressions work (%hi(foo + 2) rather than %hi(foo) + 2) and removes the need for the various bodges in MipsAsmParser::evaluateRelocExpr(). Removing those bodges ensures that the constant stored in MCValue is the full 32 or 64-bit (depending on ABI) offset from the symbol. This will be used to correct the %hi/%lo matching needed to sort the relocation table correctly. As part of this: * Gave MCExpr::print() the ability to omit parenthesis when emitting a symbol reference inside a MipsMCExpr operator like %hi(X). Without this we print things like %lo(($L1)). * %hi(%neg(%gprel(X))) is now three MipsMCExpr's instead of one. Most of the related special cases have been removed or moved to MipsMCExpr. We can remove the rest as we gain support for the less common relocations when they are not part of this specific combination. * Renamed MipsMCExpr::VariantKind and the enum prefix ('VK_') to avoid confusion with MCSymbolRefExpr::VariantKind and its prefix (also 'VK_'). * fixup_Mips_GOT_Local and fixup_Mips_GOT_Global were found to be identical and merged into fixup_Mips_GOT. * MO_GOT16 and MO_GOT turned out to be identical and have been merged into MO_GOT. * VK_Mips_GOT and VK_Mips_GOT16 turned out to be the same thing so they have been merged into MEK_GOT Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19716 llvm-svn: 268379	2016-05-03 13:35:44 +00:00
Simon Pilgrim	32e78c3ff7	[X86][SSSE3] Missing combine opportunity to simplify to a MOVQ shuffle llvm-svn: 268378	2016-05-03 13:12:44 +00:00
Igor Breger	58c07806ae	[AVX512] Add support for commutative MAX/MIN . In general VMAX{PS,PD} and VMIN{PS,PD} instruction are not commutative . In combine pass only if UnsafeFPMath are used VMAX/VMAX are converted to commutative nodes VMAXC/VMAXC. Differential Revision: http://reviews.llvm.org/D19860 llvm-svn: 268375	2016-05-03 11:51:45 +00:00
Kristof Beyls	c08f70588d	Mark that SpeculativeExecution preserves Globals Alias Analysis. A few benchmarks with lots of accesses to global variables in the hot loops regressed a lot since r266399, which added the SpeculativeExecution pass to the default pipeline. The problem is that this pass doesn't mark Globals Alias Analysis as preserved. Globals Alias Analysis is computed in a module pass, whereas SpeculativeExecution is a function pass, and a lot of passes dependent on the Globals Alias Analysis to optimize these benchmarks are also function passes. As such, the Globals Alias Analysis information cannot be recomputed between SpeculativeExecution and the following function passes needing that information. SpeculativeExecution doesn't invalidate Globals Alias Analysis, so mark it as such to fix those performance regressions. Differential Revision: http://reviews.llvm.org/D19806 llvm-svn: 268370	2016-05-03 08:33:26 +00:00
Igor Breger	ab076c683c	[AVX512] Fix lowerV4X128VectorShuffle to select correctly input operands . Differential Revision: http://reviews.llvm.org/D19803 llvm-svn: 268368	2016-05-03 08:08:44 +00:00
Matthias Braun	bb85aef77d	Fix uppercase typo llvm-svn: 268362	2016-05-03 05:21:53 +00:00
Matthias Braun	e25bbd0bb8	AArch64/optimizeCondBranch: Remove earlier kill flag when forming TBZ This fixes -verify-machineinstrs complaints when compiling test-suite/SingleSource/Benchmarks/Shootout-C++/wordfreq.cpp llvm-svn: 268360	2016-05-03 04:54:16 +00:00
Jack Liu	cd777c8b35	test commit llvm-svn: 268358	2016-05-03 04:06:24 +00:00
David Majnemer	3d90bb79c4	[LoopUnroll] Unroll loops which have exit blocks to EH pads We were overly cautious in our analysis of loops which have invokes which unwind to EH pads. The loop unroll transform is safe because it only clones blocks in the loop body, it does not try to split critical edges involving EH pads. Instead, move the necessary safety check to LoopUnswitch. N.B. The safety check for loop unswitch is covered by an existing test which fails without it. llvm-svn: 268357	2016-05-03 03:57:40 +00:00
Zachary Turner	f5c59654f7	Parse the TPI (type information) stream of PDB files. This parses the TPI stream (stream 2) from the PDB file. This stream contains some header information followed by a series of codeview records. There is some additional complexity here in that alongside this stream of codeview records is a serialized hash table in order to efficiently query the types. We parse the necessary bookkeeping information to allow us to reconstruct the hash table, but we do not actually construct it yet as there are still a few things that need to be understood first. Differential Revision: http://reviews.llvm.org/D19840 Reviewed By: ruiu, rnk llvm-svn: 268343	2016-05-03 00:28:21 +00:00
Mehdi Amini	5b85d8d67b	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268341	2016-05-03 00:27:28 +00:00
Wolfgang Pieb	a4e71bd11a	Moved test case for r268323 to DebugInfo/X86 to unbreak aarch64. llvm-svn: 268339	2016-05-03 00:22:09 +00:00
Reid Kleckner	97837b7b09	[MC] Create unique .pdata sections for every .text section Summary: This adds a unique ID to the COFF section uniquing map, similar to the one we have for ELF. The unique id is not currently exposed via the assembler because we don't have a use case for it yet. Users generally create .pdata with the .seh_* family of directives, and the assembler internally needs to produce .pdata and .xdata sections corresponding to the code section. The association between .text sections and the assembler-created .xdata and .pdata sections is maintained as an ID field of MCSectionCOFF. The CFI-related sections are created with the given unique ID, so if more code is added to the same text section, we can find and reuse the CFI sections that were already created. Reviewers: majnemer, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19376 llvm-svn: 268331	2016-05-02 23:22:18 +00:00
Quentin Colombet	776e6de516	[MachineBlockPlacement] Let the target optimize the branches at the end. After the layout of the basic blocks is set, the target may be able to get rid of unconditional branches to fallthrough blocks that the generic code does not catch. This happens any time TargetInstrInfo::AnalyzeBranch is not able to analyze all the branches involved in the terminators sequence, while still understanding a few of them. In such situation, AnalyzeBranch can directly modify the branches if it has been instructed to do so. This patch takes advantage of that. llvm-svn: 268328	2016-05-02 22:58:59 +00:00
Quentin Colombet	4e1d389ac5	[X86] Model FAULTING_LOAD_OP as a terminator and branch. This operation may branch to the handler block and we do not want it to happen anywhere within the basic block. Moreover, by marking it "terminator and branch" the machine verifier does not wrongly assume (because of AnalyzeBranch not knowing better) the branch is analyzable. Indeed, the target was seeing only the unconditional branch and not the faulting load op and thought it was a simple unconditional block. The machine verifier was complaining because of that and moreover, other optimizations could have done wrong transformation! In the process, simplify the representation of the handler block in the faulting load op. Now, we directly reference the handler block instead of using a label. This has the benefits of: 1. MC knows how to issue a label for a BB, so leave that to it. 2. Accessing the target BB from its label is painful, whereas it is direct from a MBB operand. Note: The 2 bytes offset in implicit-null-check.ll comes from the fact the unconditional jumps are not removed anymore, as the whole terminator sequence is not analyzable anymore. Will fix it in a subsequence commit. llvm-svn: 268327	2016-05-02 22:58:54 +00:00
Wolfgang Pieb	56aa4b0629	DebugInfo: Avoid propagating incorrect debug locations in SelectionDAG via CSE. Summary: When SelectionDAG performs CSE it is possible that the context's source location is different from that of the selected node. This can lead to incorrect line number records. We update the debug location to the one that occurs earlier in the instruction sequence. This fixes PR21006. Reviewers: echristo, sdmitrouk Subscribers: jevinskie, asl, llvm-commits Differential Revision: http://reviews.llvm.org/D12094 llvm-svn: 268323	2016-05-02 22:50:51 +00:00
Mehdi Amini	1e918c9cb3	Revert "ThinLTO: do not import function whose linkage prevents inlining." This reverts commit r268315, the tests are not passing. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268317	2016-05-02 22:26:04 +00:00
Mehdi Amini	bda9b2ae9e	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268315	2016-05-02 22:11:27 +00:00
Kevin Enderby	64f7a995b0	Fix llvm-size to exit with non zero when it can’t open a file. rdar://26027819 llvm-svn: 268313	2016-05-02 21:41:03 +00:00
Rafael Espindola	21507a4a5a	Don't try to create thin bsd archives. Not such variant has been specified yet. llvm-svn: 268305	2016-05-02 21:06:57 +00:00
Frederic Riss	bd126df21f	[dsymutil] Create the temporary files in the system temp directory. llvm-dsymutil used to create the temporary files in the output directory. This works fine except when the output directory contains a '%' char, which is then replaced by llvm::sys::fs::createUniqueFile() generating an invalid path. Just use the default temp dir for those files. llvm-svn: 268304	2016-05-02 21:06:14 +00:00
Kevin Enderby	7bd8d99497	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
Matt Arsenault	bcdfee7030	AMDGPU: Custom lower v2i32 loads and stores This will allow us to split up 64-bit private accesses when necessary. llvm-svn: 268296	2016-05-02 20:13:51 +00:00
Tom Stellard	154c9cdd24	AMDGPU/SI: Use v_readfirstlane_b32 when restoring SGPRs spilled to scratch We were using v_readlane_b32 with the lane set to zero, but this won't work if thread 0 is not active. Differential Revision: http://reviews.llvm.org/D19745 llvm-svn: 268295	2016-05-02 20:11:44 +00:00
Matt Arsenault	2b957b5a6f	AMDGPU: Make i64 loads/stores promote to v2i32 Now that unaligned access expansion should not attempt to produce i64 accesses, we can remove the hack in PreprocessISelDAG where this is done. This allows splitting i64 private accesses while allowing the new add nodes indexing the vector components can be folded with the base pointer arithmetic. llvm-svn: 268293	2016-05-02 20:07:26 +00:00
Simon Pilgrim	21b2c5660e	[X86][AVX2] Added 128-bit wide shuffle test Demonstrate missing 128-bit wide shuffle combine support llvm-svn: 268290	2016-05-02 19:46:58 +00:00
Reid Kleckner	bca59d2a43	Revert "[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics" This reverts commit r268254. This change causes assertion failures while building Chromium. Reduced test case coming soon. llvm-svn: 268288	2016-05-02 19:43:22 +00:00
Tim Northover	c08db1840c	ARM: fix handling of SUB immediates in peephole opt. We were negating an immediate that was going to be used in a SUBri form unnecessarily. Since ADD/SUB are very similar we can do that, but we have to change the SUB to an ADD at the same time. This also applies to ADD, and allows us to handle a slightly larger range of immediates for those two operations. rdar://25992245 llvm-svn: 268276	2016-05-02 18:30:08 +00:00
Justin Holewinski	9a6ea2c256	[NVPTX] Fix sign/zero-extending ldg/ldu instruction selection Summary: We don't have sign-/zero-extending ldg/ldu instructions defined, so we need to emulate them with explicit CVTs. We were originally handling the i8 case, but not any other cases. Fixes PR26185 Reviewers: jingyue, jlebar Subscribers: jholewinski Differential Revision: http://reviews.llvm.org/D19615 llvm-svn: 268272	2016-05-02 18:12:02 +00:00
Zachary Turner	0eace0bae5	Parse PDB Name Hash Table PDB has a lot of similar data structures. We already have code for parsing a Name Map, but PDB seems to have a different but very similar structure that is a hash table. This is the beginning of code needed in order to parse the name hash table, but it is not yet complete. It parses the basic metadata of the hash table, the bucket array, and the names buffer, but doesn't use any of these fields yet as the data structure requires a non-trivial amount of work to understand. llvm-svn: 268268	2016-05-02 18:09:14 +00:00
Tom Stellard	1f520e5c98	AMDGPU/SI: Use the hazard recognizer to break SMEM soft clauses Summary: Add support for detecting hazards in SMEM soft clauses, so that we only break the clauses when necessary, either by adding s_nop or re-ordering other alu instructions. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18870 llvm-svn: 268260	2016-05-02 17:39:06 +00:00
Nicolai Haehnle	119d3d80cb	AMDGPU: llvm.SI.fs.constant is a source of divergence Summary: This intrinsic is used to get flat-shaded fragment shader inputs. Those are uniform across a primitive, but a fragment shader wave may process pixels from multiple primitives (as indicated by the prim_mask), and so that's where divergence can arise. Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19747 llvm-svn: 268259	2016-05-02 17:37:01 +00:00
Derek Schuff	31680dd832	[WebAssembly] Rename memory_size intrinsic to current_memory This follows the recent renaming in the wasm spec. llvm-svn: 268255	2016-05-02 17:25:22 +00:00
Hans Wennborg	b7599329fc	[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics Make it possible that TryToSimplifyUncondBranchFromEmptyBlock merges empty basic block including lifetime intrinsics as well as phi nodes and unconditional branch into its successor or predecessor(s). If successor of empty block has single predecessor, all contents including lifetime intrinsics are sinked into the successor. Otherwise, they are hoisted into its predecessor(s) and then merged into the predecessor(s). Patch by Josh Yoon <josh.yoon@samsung.com>! Differential Revision: http://reviews.llvm.org/D19257 llvm-svn: 268254	2016-05-02 17:22:54 +00:00
Mehdi Amini	45c7b3ecb5	Move createReversePostOrderFunctionAttrsPass right after the inliner is done This is where it was originally, until LoopVersioningLICM was inserted before in r259986, I don't believe it was on purpose. Differential Revision: http://reviews.llvm.org/D19809 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268252	2016-05-02 16:53:16 +00:00
Pete Cooper	228b1e9a1f	Add llvm-pdbdump to the tool substitutions list in lit. NFC. This adds llvm-pdbdump to the list of tools which get printed with the full path in verbose mode. This makes it easier to take the whole run line from verbose output and run it again without prepending with the builds bin directory. llvm-svn: 268250	2016-05-02 16:51:26 +00:00
Chad Rosier	84567343bc	Remove extra whitespace. NFC. llvm-svn: 268248	2016-05-02 16:45:00 +00:00
Tom Stellard	a27007eb4f	AMDGPU/SI: Use hazard recognizer to detect DPP hazards Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18603 llvm-svn: 268247	2016-05-02 16:23:09 +00:00
Sanjay Patel	ec41cd2461	remove blank lines llvm-svn: 268246	2016-05-02 15:49:09 +00:00
Sanjay Patel	ebc0faa8d4	[InstCombine] regenerate checks llvm-svn: 268245	2016-05-02 15:32:10 +00:00
Sanjay Patel	0d0181006a	[InstCombine] regenerate checks llvm-svn: 268244	2016-05-02 15:25:49 +00:00
Sanjay Patel	0b75fd81e1	[InstCombine] regenerate checks llvm-svn: 268242	2016-05-02 15:21:41 +00:00
Sanjay Patel	933f9da43d	[InstCombine] regenerate checks llvm-svn: 268241	2016-05-02 15:18:13 +00:00
Sanjay Patel	b193fe943f	[InstCombine] regenerate checks llvm-svn: 268239	2016-05-02 15:06:55 +00:00
Sanjay Patel	1540b19407	[InstCombine] regenerate checks llvm-svn: 268232	2016-05-02 14:21:55 +00:00
David L Kreitzer	0fe4632bd7	Enable the X86 call frame optimization for the 64-bit targets that allow it. Fixes PR27241. Differential Revision: http://reviews.llvm.org/D19688 llvm-svn: 268227	2016-05-02 13:45:25 +00:00
Jonas Paulsson	1eb3486a7a	[SystemZ] Temporarily disable codegen test int-add-12.ll. This checks for AGSI transformation, which is temporarily disabled. llvm-svn: 268219	2016-05-02 10:42:47 +00:00
Davide Italiano	22b3ad8630	[llvm-readobj] Dump hash as part of -version-info. llvm-svn: 268210	2016-05-02 02:30:18 +00:00
Davide Italiano	4f277763cf	[GlobalDCE] Modernize. Use FileCheck instead of grep. llvm-svn: 268207	2016-05-01 22:51:14 +00:00
Simon Pilgrim	ca140b17cb	[InstCombine][SSE] Added support to VPERMD/VPERMPS to shuffle combine to accept UNDEF elements. llvm-svn: 268206	2016-05-01 20:43:02 +00:00
Simon Pilgrim	c590492075	Dropped FIXME comment llvm-svn: 268205	2016-05-01 20:33:25 +00:00
Simon Pilgrim	eeacc40e27	[InstCombine][SSE] Added support to VPERMILVAR to shuffle combine to accept UNDEF elements. llvm-svn: 268204	2016-05-01 20:22:42 +00:00
Simon Pilgrim	cc7f567b6a	[InstCombine][AVX] Fixed PERMILVAR identity tests and added additional decode tests llvm-svn: 268203	2016-05-01 20:06:47 +00:00
Simon Pilgrim	e5e8c2fde0	[InstCombine][SSE] Added support to PSHUFB to shuffle combine to accept UNDEF elements. llvm-svn: 268202	2016-05-01 19:26:21 +00:00
Simon Pilgrim	cae3e70707	[InstCombine][SSE] Regenerate MOVSX/MOVZX tests llvm-svn: 268201	2016-05-01 18:28:45 +00:00
Craig Topper	b6da65403a	[AVX512] VPACKUSWB/VPACKSSWB should not be encoded with EVEX.W=1. While there fix the execution domain for VPACKSSDW/VPACKUSDW. llvm-svn: 268200	2016-05-01 17:38:32 +00:00
Simon Pilgrim	8cddf8b3c6	[InstCombine][AVX2] Combine VPERMD/VPERMPS intrinsics with constant masks to shufflevector. llvm-svn: 268199	2016-05-01 16:41:22 +00:00
Igor Breger	110af565c7	getelementptr instruction, support index vector of EVT. Differential Revision: http://reviews.llvm.org/D19775 llvm-svn: 268195	2016-05-01 13:29:12 +00:00
Igor Breger	131008fbcb	Change AVX512 braodcastsd/ss patterns interaction with spilling . New implementation take a scalar register and generate a vector without COPY_TO_REGCLASS (turn it into a VR128 register ) .The issue is that during register allocation we may spill a scalar value using 128-bit loads and stores, wasting cache bandwidth. Differential Revision: http://reviews.llvm.org/D19579 llvm-svn: 268190	2016-05-01 08:40:00 +00:00
Craig Topper	e430de8be6	[AVX512] Prefer AVX512 VPACK instructions over AVX/AVX2 instructions when VLX and BWI are supported. llvm-svn: 268189	2016-05-01 06:52:19 +00:00
Sanjoy Das	f2f00fb11a	[SCEV] When printing via -analysis, dump loop disposition There are currently some bugs in tree around SCEV caching an incorrect loop disposition. Printing out loop dispositions will let us write whitebox tests as those are fixed. The dispositions are printed as a list in "inside out" order, i.e. innermost loop first. llvm-svn: 268177	2016-05-01 04:51:05 +00:00
Simon Pilgrim	c179435055	[InstCombine][AVX2] Added VPERMD/VPERMPS shuffle combining placeholder tests. For future support for VPERMD/VPERMPS to generic shuffles combines llvm-svn: 268166	2016-04-30 20:41:52 +00:00
Simon Pilgrim	8e38a5439b	[InstCombine][AVX] Split off VPERMILVAR tests and added additional tests for UNDEF mask elements llvm-svn: 268159	2016-04-30 07:32:19 +00:00
Tom Stellard	c51e4468b7	AMDGPU/SI: Remove wait state handling for SMRD in SIInsertWaits This was supposed to be part of r268143. llvm-svn: 268154	2016-04-30 04:04:48 +00:00
Amjad Aboud	72da9391f0	Reverting 268054 & 268063 as they caused PR27579. llvm-svn: 268150	2016-04-30 01:44:07 +00:00
Sanjoy Das	47cf2affbd	[LowerGuardIntrinsics] Keep track of !make.implicit metadata If a guard call being lowered by LowerGuardIntrinsics has the `!make.implicit` metadata attached, then reattach the metadata to the branch in the resulting expanded form of the intrinsic. This allows us to implement null checks as guards and still get the benefit of implicit null checks. llvm-svn: 268148	2016-04-30 00:55:59 +00:00
Lawrence Hu	1befea2bdc	Reroll loops with multiple IV and negative step part 3 support multiple induction variables This patch enable loop reroll for the following case: for(int i=0; i<N; i += 2) { S += a++; S += a++; }; Differential Revision: http://reviews.llvm.org/D16550 llvm-svn: 268147	2016-04-30 00:51:22 +00:00
Tom Stellard	cb6ba62d6f	AMDGPU/SI: Enable the post-ra scheduler Summary: This includes a hazard recognizer implementation to replace some of the hazard handling we had during frame index elimination. Reviewers: arsenm Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18602 llvm-svn: 268143	2016-04-30 00:23:06 +00:00
Sanjoy Das	52c68bb0f5	[LowerGuardIntrinsics] Preserve calling conv when lowering llvm-svn: 268142	2016-04-30 00:17:47 +00:00
Sanjay Patel	bc6fad0bdf	add minimal test to show dropped metadata llvm-svn: 268141	2016-04-30 00:12:54 +00:00
Sanjay Patel	6748ec49e9	remove the metadata added with r267827 We can demonstrate the 'select' bug and fix with a simpler test case. The merged weight values are already tested in another test. llvm-svn: 268139	2016-04-30 00:02:36 +00:00
Sanjoy Das	107aefc2fc	Mark guards on true as "trivially dead" This moves some logic added to EarlyCSE in rL268120 into `llvm::isInstructionTriviallyDead`. Adds a test case for DCE to demonstrate that passes other than EarlyCSE can now pick up on the new information. llvm-svn: 268126	2016-04-29 22:23:16 +00:00
Haicheng Wu	4afe0425db	[MBP] Use Function::optForSize() instead of checking OptimizeForSize directly. Fix a FIXME. Disable loop alignment if compiled with -Oz now. llvm-svn: 268121	2016-04-29 22:01:10 +00:00
Sanjoy Das	ee81b23fe7	[EarlyCSE] Simplify guard intrinsics Summary: This change teaches EarlyCSE some basic properties of guard intrinsics: - Guard intrinsics read all memory, but don't write to any memory - After a guard has executed, the condition it was guarding on can be assumed to be true - Guard intrinsics on a constant `true` are no-ops Reviewers: reames, hfinkel Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19578 llvm-svn: 268120	2016-04-29 21:52:58 +00:00
Matt Arsenault	701c21ea10	AMDGPU: Fix crash with unreachable terminators. If a block has no successors because it ends in unreachable, this was accessing an invalid iterator. Also stop counting instructions that don't emit any real instructions. llvm-svn: 268119	2016-04-29 21:52:13 +00:00
Sriraman Tallam	7da9b445ea	Differential Revision: http://reviews.llvm.org/D19733 llvm-svn: 268106	2016-04-29 21:19:16 +00:00
Matt Arsenault	dc4ebad6d4	AMDGPU: Add kernarg.segment.ptr intrinsic llvm-svn: 268105	2016-04-29 21:16:52 +00:00
Chad Rosier	cd62bf5821	[InstCombine] Determine the result of a select based on a dominating condition. Differential Revision: http://reviews.llvm.org/D19550 llvm-svn: 268104	2016-04-29 21:12:31 +00:00
Matt Arsenault	ab2232cf73	DAGCombiner: Reduce truncated shl width llvm-svn: 268094	2016-04-29 19:53:16 +00:00
David Majnemer	d2a074b1f4	[ValueTracking] matchSelectPattern needs to be more careful around FP matchSelectPattern attempts to see through casts which mask min/max patterns from being more obvious. Under certain circumstances, it would misidentify a sequence of instructions as a min/max because it assumed that folding casts would preserve the result. This is not the case for floating point <-> integer casts. This fixes PR27575. llvm-svn: 268086	2016-04-29 18:40:34 +00:00
Artem Tamazov	5d3ae19bdf	[AMDGPU][llvm-mc] Add some missing testcases to trap.s Differential Revision: http://reviews.llvm.org/D19602 llvm-svn: 268073	2016-04-29 17:41:44 +00:00
Geoff Berry	b92cd5293e	[BasicAA] Treat llvm.assume as not accessing memory in getModRefBehavior(Function) Reviewers: dberlin, chandlerc, hfinkel, reames, sanjoy Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19730 llvm-svn: 268068	2016-04-29 17:18:28 +00:00
Artem Tamazov	38e496b175	Fixed/Recommitted r267733 "[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for SMRD." Previously reverted by r267752. r267733 review: Differential Revision: http://reviews.llvm.org/D19342 llvm-svn: 268066	2016-04-29 17:04:50 +00:00
Guozhi Wei	fa3e04298b	[PPC] Enable shuffling of VSX vectors This patch fixes PR27078 by enabling shuffling of vectors if VSX is available. llvm-svn: 268064	2016-04-29 17:00:54 +00:00
Amjad Aboud	ee04164599	Fixed LIT tests that was broken after change in r268054. llvm-svn: 268063	2016-04-29 16:54:18 +00:00
Sanjay Patel	362dcf9615	auto-generate checks llvm-svn: 268061	2016-04-29 16:39:37 +00:00
Daniel Sanders	7225cd52e7	[mips][ias] Move createCpRestoreMemOp to MipsTargetStreamer. NFC. Summary: This removes the temporary call to isIntegratedAssemblerRequired() which was added recently. It's effect is now acheived directly in the MipsTargetStreamer hierarchy. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19715 llvm-svn: 268058	2016-04-29 16:16:49 +00:00
Amjad Aboud	293ee8bba1	Recommitted r264280 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26942 in r267004. llvm-svn: 268054	2016-04-29 16:07:55 +00:00
Simon Dardis	d8bceb9d3a	[mips][FastISel] A store is not a load. Correct trivial error. One of the failing tests from PR/27458. Reviewers: dsanders, vkalintiris, mcrosier Differential Review: http://reviews.llvm.org/D19726 llvm-svn: 268053	2016-04-29 16:07:47 +00:00
Krzysztof Parzyszek	f5cbac93eb	[Hexagon] Optimize addressing modes for load/store Patch by Jyotsna Verma. llvm-svn: 268051	2016-04-29 15:49:13 +00:00

... 3 4 5 6 7 ...

36498 Commits