llvm-project

Commit Graph

Author	SHA1	Message	Date
Aaron Ballman	fb28a60d6d	Addressing a post-commit review comment suggesting to avoid using direct initialization. llvm-svn: 229512	2015-02-17 16:57:05 +00:00
Sanjay Patel	ab7e86e5be	Canonicalize splats as build_vectors (PR22283) This is a follow-on patch to: http://reviews.llvm.org/D7093 That patch canonicalized constant splats as build_vectors, and this patch removes the constant check so we can canonicalize all splats as build_vectors. This fixes the 2nd test case in PR22283: http://llvm.org/bugs/show_bug.cgi?id=22283 The unfortunate code duplication between SelectionDAG and DAGCombiner is discussed in the earlier patch review. At least this patch is just removing code... This improves an existing x86 AVX test and changes codegen in an ARM test. Differential Revision: http://reviews.llvm.org/D7389 llvm-svn: 229511	2015-02-17 16:54:32 +00:00
Tom Stellard	bc3776803b	R600/SI: Extend private extload pattern to include zext loads llvm-svn: 229507	2015-02-17 16:36:00 +00:00
Aaron Ballman	f4984a961f	I believe we no longer require LLVM_HAS_INITIALIZER_LISTS; it's supported in MSVC 2013 and GCC. Added a trivial test to ensure the ArrayRef initializer list constructor is called and behaves as expected. If any of the bots complain (perhaps due to an antiquated version of an STL implementation), I will revert. llvm-svn: 229502	2015-02-17 15:37:53 +00:00
NAKAMURA Takumi	944511621e	ADT/PointerIntPairTest.cpp: Prune obsolete #if. We don't support msc17 anymore. llvm-svn: 229501	2015-02-17 15:36:01 +00:00
Benjamin Kramer	6cd780ff21	Prefer SmallVector::append/insert over push_back loops. Same functionality, but hoists the vector growth out of the loop. llvm-svn: 229500	2015-02-17 15:29:18 +00:00
Aaron Ballman	1618543e5e	Reverting r229473; it does not compile with MSVC 2013, and I suspect it was meant to be reverted in r229483. llvm-svn: 229496	2015-02-17 13:18:43 +00:00
Elena Demikhovsky	ef035bb974	Fixed a bug in store sinking. The problem was in store-sink barrier check. Store sink barrier should be checked for ModRef (read-write) mode. http://llvm.org/bugs/show_bug.cgi?id=22613 llvm-svn: 229495	2015-02-17 13:10:05 +00:00
NAKAMURA Takumi	cd5a3673c3	OrcJIT: Appease msc18 not to be confused on executeCompileCallback<OrcX86_64>. llvm-svn: 229494	2015-02-17 12:53:16 +00:00
NAKAMURA Takumi	3e087357ce	Reformat. llvm-svn: 229493	2015-02-17 12:53:05 +00:00
NAKAMURA Takumi	4197483fb6	OrcJIT: Try to appease msc18 to add move constructor in FullyPartitionedModule . llvm-svn: 229492	2015-02-17 12:52:58 +00:00
Manuel Klimek	874298aa7f	Fix problem with uninitialized bool found by asan. llvm-svn: 229490	2015-02-17 12:42:14 +00:00
Andrea Di Biagio	b55666f7e3	[X86][FastISel] Add missing flag -fast-isel-abort to run lines in test fast-isel-fptrunc-fpext.ll. Flag -fast-isel-abort is required in order to verify that X86FastISel never fails to select FPExt (float-to-double) and FPTrunc (double-to-float). No Functional change intended. llvm-svn: 229489	2015-02-17 12:25:49 +00:00
Andrea Di Biagio	eb97f92489	[X86] Silence -Wsign-compare warnings. GCC 4.8 reported two new warnings due to comparisons between signed and unsigned integer expressions. The new warnings were accidentally introduced by revision 229480. Added explicit casts to silence the warnings. No functional change intended. llvm-svn: 229488	2015-02-17 11:20:11 +00:00
Justin Bogner	5b3ad88646	Revert "InstrProf: Add unit tests for the profile reader and writer" This added API to the InstrProfWriter to write to a string so I could write unittests without using temp files. This doesn't really work, since the format has tighter alignment requirements than a char. This reverts r229478 and its follow-up, r229481. llvm-svn: 229483	2015-02-17 09:21:43 +00:00
Elena Demikhovsky	ba84672519	AVX-512: changes in intel_ocl_bi calling conventions - added mask types v8i1 and v16i1 to possible function parameters - enabled passing 512-bit vectors in standard CC - added a test for KNL intel_ocl_bi conventions llvm-svn: 229482	2015-02-17 09:20:12 +00:00
Justin Bogner	74f2e94118	InstrProf: Add missing header from r229478 llvm-svn: 229481	2015-02-17 08:26:06 +00:00
Michael Kuperstein	ff5acaf50c	[X86] Combine vector anyext + and into a vector zext Vector zext tends to get legalized into a vector anyext, represented as a vector shuffle with an undef vector + a bitcast, that gets ANDed with a mask that zeroes the undef elements. Combine this into an explicit shuffle with a zero vector instead. This allows shuffle lowering to match it as a zext, instead of matching it as an anyext and emitting an explicit AND. This combine only covers a subset of the cases, but it's a start. Differential Revision: http://reviews.llvm.org/D7666 llvm-svn: 229480	2015-02-17 08:22:51 +00:00
Justin Bogner	218d0689a9	Re-apply "InstrProf: Add unit tests for the profile reader and writer" Add these tests again, but use va_list instead of initializer lists. This reverts r229456, reapplying r229455. llvm-svn: 229478	2015-02-17 07:50:59 +00:00
Jonas Paulsson	31ab1fa2ca	[PBQP] NDEBUG guards added around code needed for assert. wasConservativelyAllocatable() is only called to assert that a conservatively allocatable node wasn't forced to spill. llvm-svn: 229477	2015-02-17 07:45:06 +00:00
Eric Christopher	5c0e009d3a	Make the PowerPC AsmPrinter independent of global subtarget initialization. Initialize the subtarget once per function and migrate EmitStartOfAsmFile to either use attributes on the TargetMachine or get information from all of the various subtargets. llvm-svn: 229475	2015-02-17 07:21:21 +00:00
Justin Bogner	f23604d0de	InstrProf: Use a test fixture in the coverage mapping tests llvm-svn: 229473	2015-02-17 06:56:49 +00:00
Eric Christopher	75dc3904a5	Add a FIXME to move IsLittleEndian to the target machine. llvm-svn: 229472	2015-02-17 06:45:17 +00:00
Eric Christopher	fee6aaf683	Move ABI handling and 64-bitness to the PowerPC target machine. This required changing how the computation of the ABI is handled and how some of the checks for ABI/target are done. llvm-svn: 229471	2015-02-17 06:45:15 +00:00
Lang Hames	9cebeb6666	[Orc][Kaleidoscope] Fix misnumbered steps in comments, plus tidy one explanation up a little. llvm-svn: 229467	2015-02-17 05:53:28 +00:00
Lang Hames	9b4b92bb21	[Orc][Kaleidoscope] Add an example of extreme-laziness in Orc. The version of the tutorial uses the new compile callbacks API to inject stubs that trigger IRGen & Codegen of their respective function bodies when they are first called. llvm-svn: 229466	2015-02-17 05:40:42 +00:00
Lang Hames	31ab495e4e	[Orc][Kaleidoscope] Update the MainLoop code of the orc/kaleidoscope tutorials to get rid of the duplicate prompt. NFC. llvm-svn: 229465	2015-02-17 05:36:59 +00:00
Duncan P. N. Exon Smith	752d6df22d	AsmPrinter: Use DIExpression default constructor, NFC llvm-svn: 229464	2015-02-17 02:42:45 +00:00
Chandler Carruth	55db07016e	[x86] Teach the unpack lowering to try wider element unpacks. This allows it to match still more places where previously we would have to fall back on floating point shuffles or other more complex lowering strategies. I'm hoping to replace some of the hand-rolled unpack matching with this routine is it gets more and more clever. llvm-svn: 229463	2015-02-17 02:12:24 +00:00
Hal Finkel	2bb61ba2fe	[BDCE] Add a bit-tracking DCE pass BDCE is a bit-tracking dead code elimination pass. It is based on ADCE (the "aggressive DCE" pass), with the added capability to track dead bits of integer valued instructions and remove those instructions when all of the bits are dead. Currently, it does not actually do this all-bits-dead removal, but rather replaces the instruction's uses with a constant zero, and lets instcombine (and the later run of ADCE) do the rest. Because we essentially get a run of ADCE "for free" while tracking the dead bits, we also do what ADCE does and removes actually-dead instructions as well (this includes instructions newly trivially dead because all bits were dead, but not all such instructions can be removed). The motivation for this is a case like: int __attribute__((const)) foo(int i); int bar(int x) { x \|= (4 & foo(5)); x \|= (8 & foo(3)); x \|= (16 & foo(2)); x \|= (32 & foo(1)); x \|= (64 & foo(0)); x \|= (128& foo(4)); return x >> 4; } As it turns out, if you order the bit-field insertions so that all of the dead ones come last, then instcombine will remove them. However, if you pick some other order (such as the one above), the fact that some of the calls to foo() are useless is not locally obvious, and we don't remove them (without this pass). I did a quick compile-time overhead check using sqlite from the test suite (Release+Asserts). BDCE took ~0.4% of the compilation time (making it about twice as expensive as ADCE). I've not looked at why yet, but we eliminate instructions due to having all-dead bits in: External/SPEC/CFP2006/447.dealII/447.dealII External/SPEC/CINT2006/400.perlbench/400.perlbench External/SPEC/CINT2006/403.gcc/403.gcc MultiSource/Applications/ClamAV/clamscan MultiSource/Benchmarks/7zip/7zip-benchmark llvm-svn: 229462	2015-02-17 01:36:59 +00:00
Lang Hames	2754714fb9	[Orc] Update the Orc indirection utils and refactor the CompileOnDemand layer. This patch replaces most of the Orc indirection utils API with a new class: JITCompileCallbackManager, which creates and manages JIT callbacks. Exposing this functionality directly allows the user to create callbacks that are associated with user supplied compilation actions. For example, you can create a callback to lazyily IR-gen something from an AST. (A kaleidoscope example demonstrating this will be committed shortly). This patch also refactors the CompileOnDemand layer to use the JITCompileCallbackManager API. llvm-svn: 229461	2015-02-17 01:18:38 +00:00
Hal Finkel	7f957c17a0	Specify arch in test/CodeGen/X86/float-conv-elim.ll This test was failing on non-x86 hosts because it specified a cpu of x86_64, but not an architecture. x86_64 is obviously not a valid cpu on all architectures. llvm-svn: 229460	2015-02-17 00:11:19 +00:00
Duncan P. N. Exon Smith	b474937929	AsmPrinter: Stop creating DebugLocs While looking at a heap profile of a clang LTO bootstrap with -g, I noticed that 2.2% of memory in an `llvm-lto` of clang is from calling `DebugLoc::get()` in `collectVariableInfo()` (accounting for ~40% of memory used for `MDLocation`s). I suspect this was introduced by r226736, whose goal was to prevent uniquing of `DebugLoc`s (goal achieved, if so). There's no reason we need a `DebugLoc` here at all -- it was just being used for (in)convenient API -- so the fix is to pass the scope and inlined-at directly to `LexicalScopes::findInlinedScope()`. llvm-svn: 229459	2015-02-17 00:02:27 +00:00
Hal Finkel	5cedafb8cd	[PowerPC] Support non-direct-sub/superclass VSX copies Our register allocation has become better recently, it seems, and is now starting to generate cross-block copies into inflated register classes. These copies are not transformed into subregister insertions/extractions by the PPCVSXCopy class, and so need to be handled directly by PPCInstrInfo::copyPhysReg. The code to do this was almost there, but not quite (it was unnecessarily restricting itself to only the direct sub/super-register-class case (not copying between, for example, something in VRRC and the lower-half of VSRC which are super-registers of F8RC). Triggering this behavior manually is difficult; I'm including two bugpoint-reduced test cases from the test suite. llvm-svn: 229457	2015-02-16 23:46:30 +00:00
Justin Bogner	fcb2de694a	Revert "InstrProf: Add unit tests for the profile reader and writer" Looks like the bots don't like my initializer lists. This reverts r229455 llvm-svn: 229456	2015-02-16 23:31:07 +00:00
Justin Bogner	f83e895fa7	InstrProf: Add unit tests for the profile reader and writer This required some minor API to be added to these types to avoid needing temp files. Also, I've used initializer lists in the tests, as MSVC 2013 claims to support them. I'll redo this without them if the bots complain. llvm-svn: 229455	2015-02-16 23:27:48 +00:00
Simon Atanasyan	79ba8407d2	[Mips] Add .MIPS.options section descriptor kinds enumeration No functional changes. llvm-svn: 229452	2015-02-16 22:59:29 +00:00
Lang Hames	05fa2b0a14	[Orc] Add an emitAndFinalize method to the ObjectLinkingLayer, IRCompileLayer and LazyEmittingLayer of Orc. This method allows you to immediately emit and finalize a module. It is required by an upcoming refactor of the indirection utils and the compile-on-demand layer. I've filed http://llvm.org/PR22608 to write unit tests for this and other Orc APIs. llvm-svn: 229451	2015-02-16 22:36:25 +00:00
Ahmed Bougacha	bf2b90e92d	[ARM] Remove unused declaration. NFC. GlobalMerge was moved to lib/CodeGen a while ago, and is no longer called "ARMGlobalMerge". llvm-svn: 229448	2015-02-16 22:30:08 +00:00
Cameron McInally	c5764cbe4e	[AVX512] Make 512b vector floating point rounds legal on AVX512. llvm-svn: 229445	2015-02-16 22:15:42 +00:00
Matthias Braun	15635c5f85	RegisterCoalescer: Don't rematerialize subregister definitions. We cannot simply rematerialize instructions which only defining a subregister, as the final value also depends on the previous instructions. This fixes test/CodeGen/R600/subreg-coalescer-bug.ll with subreg liveness enabled. llvm-svn: 229444	2015-02-16 22:05:17 +00:00
Matthias Braun	1b901a4435	RegisterCoalescer: Do not look for regclass of IMPLICIT_DEF. IMPLICIT_DEF is a generic instruction and has no (fixed) output register class defined. The rematerialization code of the register coalescer should not scan the instruction description for a register class. This fixes a problem showing up in test/CodeGen/R600/subreg-coalescer-crash.ll with subregister liveness enabled. llvm-svn: 229443	2015-02-16 22:05:12 +00:00
Simon Pilgrim	b2c00f3286	[X86][SSE] Add SSE MOVQ instructions to SSEPackedInt domain Patch to explicitly add the SSE MOVQ (rr,mr,rm) instructions to SSEPackedInt domain - prevents a number of costly domain switches. Differential Revision: http://reviews.llvm.org/D7600 llvm-svn: 229439	2015-02-16 21:50:56 +00:00
Mehdi Amini	3e0023b8f6	SelectionDAG: fold (fp_to_u/sint (s/uint_to_fp)) here too Update SPARC tests to match. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229438	2015-02-16 21:47:58 +00:00
Mehdi Amini	b9a0fa4822	InstCombine: fold more cases of (fp_to_u/sint (u/sint_to_fp val)) Fixes radar 15486701. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229437	2015-02-16 21:47:54 +00:00
Mehdi Amini	7aab8752ba	Tests: reformat sitofp.ll and use FileCheck From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229436	2015-02-16 21:47:50 +00:00
Justin Bogner	ab89ed7dd5	InstrProf: Use ErrorOr for IndexedInstrProfReader::create (NFC) The other InstrProfReader::create factories were updated to return ErrorOr in r221120, and it's odd for these APIs not to match. llvm-svn: 229433	2015-02-16 21:28:58 +00:00
Craig Topper	49df44e2e2	[X86] Remove the multiply by 8 that goes into the shift constant for X86ISD::VSHLDQ and X86ISD::VSRLDQ. This simplifies the pattern matching in isel and allows these nodes to become the patterns embedded in the instruction. llvm-svn: 229431	2015-02-16 20:52:07 +00:00
Craig Topper	44026efa88	[X86] Remove x86.avx2.psll.dq.bs and x86.avx2.psrl.dq.bs intrinsics. llvm-svn: 229430	2015-02-16 20:51:59 +00:00
Matthias Braun	d6b108e445	ARM: Transfer kill flag when lowering VSTMQIA to VSTMDIA. llvm-svn: 229425	2015-02-16 19:34:30 +00:00

1 2 3 4 5 ...

113468 Commits