llvm-project

Commit Graph

Author	SHA1	Message	Date
Steven Wu	94746694ca	Fix another hang caused by ManagedStatic in SignalHandler Fix two other variables that might cause the same hang fixed in r235914. The hang is caused by constructing ManagedStatic in signalhandler. In this case, if FileToRemove or CallBacksToRun is not constructed, it means there is no work to do. llvm-svn: 236741	2015-05-07 16:20:51 +00:00
Sanjay Patel	a9f6d3505d	[x86] eliminate unnecessary shuffling/moves with unary scalar math ops (PR21507) Finish the job that was abandoned in D6958 following the refactoring in http://reviews.llvm.org/rL230221: 1. Uncomment the intrinsic def for the AVX r_Int instruction. 2. Add missing r_Int entries to the load folding tables; there are already tests that check these in "test/Codegen/X86/fold-load-unops.ll", so I haven't added any more in this patch. 3. Add patterns to solve PR21507 ( https://llvm.org/bugs/show_bug.cgi?id=21507 ). So instead of this: movaps %xmm0, %xmm1 rcpss %xmm1, %xmm1 movss %xmm1, %xmm0 We should now get: rcpss %xmm0, %xmm0 And instead of this: vsqrtss %xmm0, %xmm0, %xmm1 vblendps $1, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm1[0],xmm0[1,2,3] We should now get: vsqrtss %xmm0, %xmm0, %xmm0 Differential Revision: http://reviews.llvm.org/D9504 llvm-svn: 236740	2015-05-07 15:48:53 +00:00
Hans Wennborg	44faaa7aa4	Switch lowering: handle zero-weight branch probabilities After r236617, branch probabilities are no longer guaranteed to be >= 1. This patch makes the switch lowering code handle that correctly, without bumping the branch weights by 1 which might cause overflow and skews the probabilities. Covered by @zero_weight_tree in test/CodeGen/X86/switch.ll. llvm-svn: 236739	2015-05-07 15:47:15 +00:00
Simon Atanasyan	04d9e653ed	[obj2yaml/yaml2obj] Add SHT_MIPS_ABIFLAGS section support This change adds support for the SHT_MIPS_ABIFLAGS section reading/writing to the obj2yaml and yaml2obj tools. llvm-svn: 236738	2015-05-07 15:40:48 +00:00
Simon Atanasyan	fee03b1be8	[MIPS] Move MIPS ABI flags structure constants to the separate header http://reviews.llvm.org/D9517 The separate header file allows to reuse the MIPS ABI flags structure constants in other LLVM tools like the llvm-readobj. No functional changes. llvm-svn: 236732	2015-05-07 14:57:04 +00:00
Simon Atanasyan	67bdc799a7	[llvm-readobj/obj2yaml/yaml2obj] Support more MIPS ELF header flags llvm-svn: 236728	2015-05-07 14:04:44 +00:00
Elena Demikhovsky	29792e9a80	AVX-512: Added all forms of FP compare instructions for KNL and SKX. Added intrinsics for the instructions. CC parameter of the intrinsics was changed from i8 to i32 according to the spec. By Igor Breger (igor.breger@intel.com) llvm-svn: 236714	2015-05-07 11:24:42 +00:00
Toma Tabacu	506cfd0b2b	[mips] Add the SoftFloat MipsSubtarget feature. Summary: This will enable the IAS to reject floating point instructions if soft-float is enabled. Reviewers: dsanders, echristo Reviewed By: dsanders Subscribers: jfb, llvm-commits, mpf Differential Revision: http://reviews.llvm.org/D9053 llvm-svn: 236713	2015-05-07 10:29:52 +00:00
NAKAMURA Takumi	2a5bd54f4e	Scalar/PlaceSafepoints.cpp: Fix a warning introduced in r228090. [-Wunused-variable] llvm-svn: 236711	2015-05-07 10:18:46 +00:00
Mehdi Amini	2668a487a7	Update InstCombine to transform aggregate loads into scalar loads. Summary: One step further getting aggregate loads and store being optimized properly. This will only handle struct with one element at this point. Test Plan: Added unit tests for the new supported cases. Reviewers: chandlerc, joker-eph, joker.eph, majnemer Reviewed By: majnemer Subscribers: pete, llvm-commits Differential Revision: http://reviews.llvm.org/D8339 Patch by Amaury Sechet. From: Amaury Sechet <amaury@fb.com> llvm-svn: 236695	2015-05-07 05:52:40 +00:00
Alexey Samsonov	3514f27456	[SanitizerCoverage] Introduce SanitizerCoverageOptions struct. Summary: This gives frontend more precise control over collected coverage information. User can still override these options by passing -mllvm flags. No functionality change. Test Plan: regression test suite. Reviewers: kcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9539 llvm-svn: 236687	2015-05-07 01:00:31 +00:00
Justin Bogner	5a5c381ba9	InstrProf: Simplify looking up sections for coverage data llvm-svn: 236685	2015-05-07 00:31:58 +00:00
Philip Reames	7a738dd94c	[JumpThreading] Simplify comparisons when simplifying branches If we have recognized that a conditional is constant at a particular location in the code (while trying to decide if we can simplify a conditional branch), we can eagerly replace that condition with a constant if its definition is post dominated by the branch in question. In practice, this ends up being a compile time savings at most. JumpThreading would have visited each using branch anyways. CVP would have visited the cmp itself again. Unless LVI gives up early, we shouldn't gain any additional power by doing this transformation early. What we do gain is simplicity and compile time. Differential Revision: http://reviews.llvm.org/D9312 llvm-svn: 236684	2015-05-07 00:19:14 +00:00
Kostya Serebryany	a407ddef27	[lib/Fuzzer] add dfsan_weak_hook_memcmp, enable the test that uses it, simplify the test runner llvm-svn: 236683	2015-05-07 00:11:33 +00:00
Vince Harron	d528112b41	Added support for building against Android API-9 SDK Created an abstraction for log2, llvm::Log2 in Support/MathExtras.h Hid Android problems inside of it Differential Revision: http://reviews.llvm.org/D9467 llvm-svn: 236680	2015-05-07 00:05:26 +00:00
David Blaikie	567d0e5a90	Revert "[opaque pointer type] Pass explicit pointer type through GEP constant folding" Causes regressions in Clang. Reverting while I investigate. This reverts commit r236670. llvm-svn: 236678	2015-05-06 23:56:21 +00:00
Akira Hatanaka	3058d0f080	Let llc and opt override "-target-cpu" and "-target-features" via command line options. This commit fixes a bug in llc and opt where "-mcpu" and "-mattr" wouldn't override function attributes "-target-cpu" and "-target-features" in the IR. Differential Revision: http://reviews.llvm.org/D9537 llvm-svn: 236677	2015-05-06 23:54:14 +00:00
Sanjoy Das	2e0d29fb09	[X86MCInst] Move LowerSTATEPOINT to inside X86AsmPrinter. NFC. llvm-svn: 236676	2015-05-06 23:53:26 +00:00
Sanjoy Das	80876d5db3	[X86MCInst] Clean up LowerSTATEPOINT: variable names. NFC. llvm-svn: 236675	2015-05-06 23:53:24 +00:00
Sanjoy Das	abf15608a7	[Statepoints] Clean up PlaceSafepoints.cpp: de-duplicate code. Common duplicated code and remove unnecessary code. llvm-svn: 236674	2015-05-06 23:53:21 +00:00
Sanjoy Das	93abd813ec	[Statepoints] Clean up PlaceSafepoints.cpp: variable naming. Use CamelCase. NFC. llvm-svn: 236673	2015-05-06 23:53:19 +00:00
Sanjoy Das	abe1c685ac	[IRBuilder] Add a CreateGCStatepointInvoke. Renames the original CreateGCStatepoint to CreateGCStatepointCall, and moves invoke creating functionality from PlaceSafepoints.cpp to IRBuilder.cpp. This changes the labels generated for PlaceSafepoints/invokes.ll so use a regex there to make the basic block labels more resilient. llvm-svn: 236672	2015-05-06 23:53:09 +00:00
David Blaikie	e66a45fdb4	[opaque pointer type] Pass explicit pointer type through GEP constant folding llvm-svn: 236670	2015-05-06 23:49:14 +00:00
Pete Cooper	2777d88745	Change typeIncompatible to return an AttrBuilder instead of new-ing an AttributeSet. This makes use of the new API which can remove attributes from a set given a builder. This is much faster than creating a temporary set and reduces llc time by about 0.3% which was all spent creating temporary attributes sets on the context. llvm-svn: 236668	2015-05-06 23:19:56 +00:00
Pete Cooper	d2a44619e3	Add remove method to operate on AttrBuilder instead of AttributeSet. Prior to this change we would have to construct a temporary AttributeSet (which isn't temporary at all given that its allocated on the context), just to contain the attributes in the builder, then call remove on that. Now we can just remove any attributes from the (lightweight and really temporary) builder itself. Will be used in a future commit to remove some temporary attributes sets. llvm-svn: 236666	2015-05-06 23:19:43 +00:00
Justin Bogner	367a9f28c1	InstrProf: Give coverage its own errors instead of piggy backing on instrprof Since the coverage mapping reader and the instrprof reader were emitting a shared set of error codes, the error messages you'd get back from llvm-cov were ambiguous about what was actually wrong. Add another error category to fix this. I've also improved the wording on a couple of the instrprof errors, for consistency. llvm-svn: 236665	2015-05-06 23:19:35 +00:00
Justin Bogner	0b13086366	InstrProf: Remove a function that just returns its argument (NFC) llvm-svn: 236664	2015-05-06 23:15:55 +00:00
Alex Lorenz	fe6f1865bc	YAML: Extract the code that skips a comment into a separate method, NFC. This commit extracts the code that skips over a YAML comment from the 'scanToNextToken' method into a separate 'skipComment' method. This refactoring is motivated by a patch that implements parsing of YAML block scalars (http://reviews.llvm.org/D9503), as the method that parses a block scalar reuses the 'skipComment' method. llvm-svn: 236663	2015-05-06 23:00:45 +00:00
Pete Cooper	cc151ccdcf	Remove unnecessary #ifndef NDEBUG guard around assert. NFC. Found by Hal Finkel in the review of AttributeSets. http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150504/275058.html llvm-svn: 236662	2015-05-06 22:55:46 +00:00
Duncan P. N. Exon Smith	538ef562bd	Bitcode: Set LastDL after writing DebugLocs Somehow I dropped this in r233585, and we haven't had `DEBUG_LOC_AGAIN` records since. Add it back. Also tests that the output assembly looks okay. Fixes PR23436. llvm-svn: 236661	2015-05-06 22:51:12 +00:00
Pete Cooper	27483915e8	Handle dead defs in the if converter. We had code such as this: r2 = ... t2Bcc label1: ldr ... r2 label2; return r2<dead, def> The if converter was transforming this to r2<def> = ... return [pred] r2<dead,def> ldr <r2, kill> return which fails the machine verifier because the ldr now reads from a dead def. The fix here detects dead defs in stepForward and passes them back to the caller in the clobbers list. The caller then clears the dead flag from the def if the value is live. llvm-svn: 236660	2015-05-06 22:51:04 +00:00
Kostya Serebryany	3befe94acb	[lib/Fuzzer] remove dfsan_fuzzer_abi.list -- its contents are now moved to dfsan proper llvm-svn: 236659	2015-05-06 22:47:24 +00:00
Quentin Colombet	0ddd315db0	[RegisterCoalescer] Make sure each live-range has only one component, as demanded by the machine verifier. After shrinking a live-range to its uses, it is possible to create several smaller live-ranges. When this happens, shrinkToUses returns true and we need to split the different components into their own live-ranges. The problem does not reproduce on any in-tree target but Jonas Paulsson <jonas.paulsson@ericsson.com>, who reported the problem, checked that this patch fixes the issue. llvm-svn: 236658	2015-05-06 22:41:50 +00:00
Kostya Serebryany	754f55d6f5	[lib/Fuzzer] add a fuzzer test for memcmp (does not work yet) llvm-svn: 236656	2015-05-06 22:36:00 +00:00
Zachary Turner	6d6e947916	Fix link failure on MinGW due to use of CoInitialize. ole32 is considered a default library with MSVC, but apparently not with MinGW. Since we use CoInitialize, we need to explicitly link against it in LLVMSupport for a MinGW build. llvm-svn: 236654	2015-05-06 22:26:51 +00:00
Zachary Turner	c007aa41b6	A few fixes for llvm-symbolizer on Windows. Specifically, this patch correctly respects the -demangle option, and additionally adds a hidden --relative-address option that allows input addresses to be relative to the module load address instead of absolute addresses into the image. llvm-svn: 236653	2015-05-06 22:26:30 +00:00
Kostya Serebryany	566bc5aa8a	[lib/Fuzzer] rename TestOneInput to LLVMFuzzerTestOneInput to make it more unique llvm-svn: 236652	2015-05-06 22:19:00 +00:00
Pete Cooper	54085cdc7b	Fix incorrect kill flags in fastisel. If called twice in the same BB on the same constant, FastISel::fastEmit_ri_ was marking the materialized vreg as killed on each use, instead of only the last use. Change this to only mark the last use as killed by making earlier uses check if the vreg is already used elsewhere. llvm-svn: 236650	2015-05-06 22:09:29 +00:00
Pete Cooper	d31583ddfb	[x86] Fix register class of folded load index reg. When folding a load into another instruction, we need to fix the class of the index register. Otherwise, it could be something like GR64 not GR64_NOSP and would fail the machine verifier. llvm-svn: 236644	2015-05-06 21:37:19 +00:00
Alexey Samsonov	0a648a4bfe	[SanitizerCoverage] Fix a couple of typos. NFC. llvm-svn: 236643	2015-05-06 21:35:25 +00:00
Duncan P. N. Exon Smith	c177fec93f	MC: Skip names of temporary symbols in object streamer Don't create names for temporary symbols when using an object streamer. The names never make it to the output anyway. From the starting point of r236629, my heap profile says this drops peak memory usage from 1100 MB to 1058 MB for CodeGen of `verify-uselistorder`, a savings of almost 4% on peak memory, and removes `StringMap<bool, BumpPtrAllocator...>` from the profile entirely. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 236642	2015-05-06 21:34:34 +00:00
Tim Northover	e4310fe946	CodeGen: move over-zealous assert into actual if statement. It's quite possible to encounter an insertvalue instruction that's more deeply nested than the value we're looking for, but when that happens we really mustn't compare beyond the end of the index array. Since I couldn't see any guarantees about what comparisons std::equal makes, we probably need to directly check the size beforehand. In practice, I suspect most std::equal implementations would probably bail early, which would be OK. But just in case... rdar://20834485 llvm-svn: 236635	2015-05-06 20:07:38 +00:00
Duncan P. N. Exon Smith	653c1099b4	DwarfDebug: Emit number of bytes in .debug_loc entry directly Emit the number of bytes in a `.debug_loc` entry directly. The old code created temp labels (expensive), emitted the difference between them, and then emitted one on each side of the relevant bytes. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc` (the optimized version of ld64's `-save-temps` when linking the `verify-uselistorder` executable in an LTO bootstrap). I've hacked `MCContext::Allocate()` to just call `malloc()` instead of using the `BumpPtrAllocator` so that the heap profile is easier to read. As far as peak memory is concerned, `MCContext::Allocate()` is equivalent to a leak, since it only gets freed at process teardown. In my heap profile, this patch drops memory usage of `DwarfDebug::emitDebugLoc()` from 132.56 MB (11.4%) down to 29.86 MB (2.7%) at peak memory. Some of that must be noise from `SmallVector` (or other) allocations -- peak memory only dropped from 1160 MB down to 1100 MB -- but this nevertheless shaves 5% off the top.) llvm-svn: 236629	2015-05-06 19:11:20 +00:00
Ismail Pazarbasi	56ccf1c9d5	Implement `createSanitizerCtor`, common helper function for all sanitizers Summary: This helper function creates a ctor function, which calls sanitizer's init function with given arguments. This constructor is then expected to be added to module's ctors. The patch helps unifying how sanitizer constructor functions are created, and how init functions are called across all sanitizers. Reviewers: kcc, samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8777 llvm-svn: 236627	2015-05-06 18:48:22 +00:00
Reid Kleckner	d1b38c4b0b	[WinEH] Improve fatal error message about failed demotion llvm-svn: 236626	2015-05-06 18:45:24 +00:00
Sanjoy Das	6c0fe24bd1	[SelectionDAG] Delete SelectionDAGBuilder::removeValue. NFC. SelectionDAGBuilder::removeValue is dead now, after rL236563. llvm-svn: 236618	2015-05-06 18:02:10 +00:00
Diego Novillo	14f94de1ee	Allow 0-weight branches in BranchProbabilityInfo. Summary: When computing branch weights in BPI, we used to disallow branches with weight 0. This is a minor nuisance, because a branch with weight 0 is different to "don't have information". In the context of instrumentation, it may mean "never executed", in the context of sampling, it means "never or seldom executed". In allowing 0 weight branches, I ran into issues with the switch expansion code in selection DAG. It is currently hardwired to not handle branches with weight 0. To maintain the current behaviour, I changed it to use 1 when it finds 0, but perhaps the algorithm needs changes to tolerate branches with weight zero. Reviewers: hansw Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9533 llvm-svn: 236617	2015-05-06 17:55:11 +00:00
Sanjoy Das	06cf33fbea	Add missing dereferenceable_or_null getters Summary: Add missing dereferenceable_or_null getters required for http://reviews.llvm.org/D9253 change. Separated from the D9253 review. Patch by Artur Pilipenko! Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9499 llvm-svn: 236615	2015-05-06 17:41:54 +00:00
Wei Mi	062c74484d	[X86] Disable loop unrolling in loop vectorization pass when VF is 1. The patch disables unrolling in the loop vectorization pass when VF==1 on x86 architecture, by setting MaxInterleaveFactor to 1. Unrolling in the loop vectorization pass may introduce the cost of an overflow check, a memory boundary check and extra prologue/epilogue code when the regular unroller will unroll the loop another time. Disabling it when VF==1 removes the unnecessary cost on x86. The same can be done for other platforms after verifying interleaving/memory bound checking to be not perf critical on those platforms. Differential Revision: http://reviews.llvm.org/D9515 llvm-svn: 236613	2015-05-06 17:12:25 +00:00
Matt Arsenault	633dba4f41	Add ChangeTo* to MachineOperand for symbols llvm-svn: 236612	2015-05-06 17:05:54 +00:00
Pete Cooper	d927c6eaf8	[ARM] Fast-Isel was incorrectly selecting <2 x double> adds. With neon enabled, we reach SelectBinaryFPOp and are able to get registers for a <2 x double> add. However, we shouldn't actually attempt arithmetic on it as ARMIselLowering says "v2f64 is legal so that QR subregs can be extracted as f64 elements, but neither Neon nor VFP support any arithmetic operations on it." This commit disables SelectBinaryFPOp for any vector types. There's already a FIXME to try handle neon. Doing so would require fixing this conditional which isn't safe for vectors 'VT == MVT::f64 || VT == MVT::i64' llvm-svn: 236609	2015-05-06 16:39:17 +00:00
Bill Schmidt	5fe2e25f7c	[PPC64LE] Adjust vector splats during VSX swap optimization The initial code drop for VSX swap optimization permitted the optimization only when all operations in a web of related computation are lane-insensitive. For some lane-sensitive operations, we can still permit the optimization provided that we make adjustments to those operations. This patch adds special handling for vector splats so that their presence doesn't kill the optimization. Vector splats are lane-sensitive since they identify by number a vector element to be used as the source of a splat. When swap optimizations take place, the desired vector element will move to the opposite doubleword of the quadword vector. We thus replace the index I by (I + N/2) % N, where N is the number of elements in the vector. A new test case is added to test that swap optimization succeeds when vector splats are present, and that the proper input element is used as the source of the splat. An ancillary change removes SH_BUILDVEC as one of the kinds of special handling that may be required by VSX swap optimization. From experience with GCC, I had expected to need some modifications for vector build operations, but I did not find that to be the case. llvm-svn: 236606	2015-05-06 15:40:46 +00:00
NAKAMURA Takumi	e452998b4b	Reformat. llvm-svn: 236601	2015-05-06 14:03:22 +00:00
NAKAMURA Takumi	d7c0be9c42	Revert r236546, "propagate IR-level fast-math-flags to DAG nodes (NFC)" It caused undefined behavior. llvm-svn: 236600	2015-05-06 14:03:12 +00:00
Artyom Skrobov	3f8eae92a4	[ARM] generate VMAXNM/VMINNM for a compare followed by a select, in safe math mode too llvm-svn: 236590	2015-05-06 11:44:10 +00:00
Pawel Bylica	9f1fb9d1ef	SelectionDAG: Handle out-of-bounds index in extract vector element Summary: This patch correctly handles undef case of EXTRACT_VECTOR_ELT node where the element index is constant and not less than vector size. Test Plan: CodeGen for X86 test included. Also one incorrect regression test fixed. Reviewers: qcolombet, chandlerc, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D9250 llvm-svn: 236584	2015-05-06 10:19:14 +00:00
Adam Nemet	e340f851a3	[DomTree] verifyDomTree to unconditionally perform DT verification I folded the check for the flag -verify-dom-info into the only caller where I think it is supposed to be checked: verifyAnalysis. (The idea of the flag is to enable this expensive verification in verifyPreservedAnalysis.) I'm assuming that when manually scheduling the verification pass with -passes=verify<domtree>, we do want to perform the verification. llvm-svn: 236575	2015-05-06 08:18:41 +00:00
Ahmed Bougacha	e8d0c4ccea	[ARM][FastISel] Use TST #1 instead of CMP #0 for select. Since r234249, i1 are sext instead of zext; because of that, doing "CMP rN, #0; IT EQ/NE" isn't correct anymore. "TST #1" is the conservatively correct alternative - the tradeoff being that it doesn't have a 16-bit encoding -, so use that instead. llvm-svn: 236569	2015-05-06 04:14:02 +00:00
Sanjoy Das	63245b5d3c	[IRBuilder] Fix indentation. NFC. Whitespace-only change. llvm-svn: 236567	2015-05-06 02:36:34 +00:00
Sanjoy Das	4bfb472072	[Statepoint] Clean up StatepointLowering: symbolic constants. For accessors in the `Statepoint` class, use symbolic constants for offsets into the argument vector instead of literals. This makes the code intent clearer and simpler to change. llvm-svn: 236566	2015-05-06 02:36:31 +00:00
Sanjoy Das	499d703f52	[Statepoint] Clean up Statepoint.h: accessor names. Use getFoo() as accessors consistently and some other naming changes. llvm-svn: 236564	2015-05-06 02:36:26 +00:00
Sanjoy Das	c6bf3e9f12	[StatepointLowering] Don't create temporary instructions. NFCI. Summary: Instead of creating a temporary call instruction and lowering that, use SelectionDAGBuilder::lowerCallOperands. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9480 llvm-svn: 236563	2015-05-06 02:36:20 +00:00
Ahmed Bougacha	ed363c5dcb	[WinEH] Reset WinEHPrepare::SEHExceptionCodeSlot when we're done. This caused a use-after-free on test/CodeGen/X86/win32-eh.ll No functional change intended. llvm-svn: 236561	2015-05-06 01:28:58 +00:00
Justin Bogner	0b4c484fb9	InstrProf: Strip filename prefixes from the names we display for coverage For consumers of coverage data, any filename prefixes we store in the profile data are just noise. Strip this prefix if it exists. llvm-svn: 236558	2015-05-05 23:44:48 +00:00
Pete Cooper	d0dae3e577	[X86 fast-isel] Constrain the index reg class to not include SP. The index reg on instructions with complex address modes is a GPR64_NOSP. Constrain it to appease the machine verifier. llvm-svn: 236557	2015-05-05 23:41:53 +00:00
Sanjoy Das	1194d1e799	[SelectionDAG] Make an argument optional in RFV::getCopyToRegs. NFC. Summary: We default the value argument to nullptr. The only use of the value is in diagnosePossiblyInvalidConstraint and that seems to be resilient to it being nullptr. Reviewers: atrick, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9479 llvm-svn: 236555	2015-05-05 23:06:57 +00:00
Sanjoy Das	3936a97f11	[SelectionDAG] Move RegsForValue into SelectionDAGBuilder.h. NFC. Summary: The exported class will be used in later change, in StatepointLowering.cpp. It is still internal to SelectionDAG (not exported via include/). Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9478 llvm-svn: 236554	2015-05-05 23:06:54 +00:00
Sanjoy Das	84153c450a	[SelectionDAG] Pass explicit type to lowerCallOperands. NFC. Summary: Currently this does not change anything, but change will be used in a later change to StatepointLowering.cpp Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9477 llvm-svn: 236553	2015-05-05 23:06:52 +00:00
Sanjoy Das	3fb91c0a0d	[StatepointLowering] Rename variable, NFC. Rename LoweredArgs to LoweredMetaArgs to clarify intent. llvm-svn: 236552	2015-05-05 23:06:49 +00:00
Pete Cooper	ce9ad757c7	Fix IfConverter to handle regmask machine operands. Note, this is a recommit of r236515 after fixing an error in r236514. The buildbot ran fast enough that it picked up r236514 prior to r236515 and threw an error. r236515 itself ran 'make check' without errors. Original commit message follows: A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks. These are slightly different to a def in that we need to add both an implicit use and def to appease the machine verifier. Otherwise, uses after the if converted call could think they are reading an undefined register. Reviewed by Matthias Braun and Quentin Colombet. llvm-svn: 236550	2015-05-05 22:09:41 +00:00
Kostya Serebryany	ca6a2a2f1c	[lib/Fuzzer] on crash print the contents of the crashy input as base64 llvm-svn: 236548	2015-05-05 21:59:51 +00:00
Sanjay Patel	801caff64d	propagate IR-level fast-math-flags to DAG nodes (NFC) This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. There are 2 structural changes here: 1. The main diff is that we're preparing to extend the optimization flags to affect more than just binary SDNodes. Eg, IR intrinsics ( https://llvm.org/bugs/show_bug.cgi?id=21290 ) or non-binop nodes that don't even exist in IR such as FMA, FNEG, etc. 2. The other change is that we're actually copying the FP fast-math-flags from the IR instructions to SDNodes. Differential Revision: http://reviews.llvm.org/D8900 llvm-svn: 236546	2015-05-05 21:40:38 +00:00
Sanjay Patel	fbca70d767	use range-based for-loop; NFC llvm-svn: 236544	2015-05-05 21:20:52 +00:00
David Majnemer	ac256cfed2	[Inliner] Discard empty COMDAT groups COMDAT groups which have become rendered unused because of inline are discardable if we can prove that we've made the group empty. This fixes PR22285. llvm-svn: 236539	2015-05-05 20:14:22 +00:00
Pete Cooper	7605e37a63	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC Note, this is a reapplication of r236515 with a fix to not assert on non-register operands, but instead only handle them until the subsequent commit. Original commit message follows. The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236538	2015-05-05 20:14:22 +00:00
Peter Collingbourne	85a0e23bc8	Thumb2SizeReduction: Check the correct set of registers for LDMIA. The register set for LDMIA begins at offset 3, not 4. We were previously missing the short encoding of this instruction in the case where the base register was the first register in the register set. Also clean up some dead code: - The isARMLowRegister check is redundant with what VerifyLowRegs does; replace with an assert. - Remove handling of LDMDB instruction, which has no short encoding (and does not appear in ReduceTable). Differential Revision: http://reviews.llvm.org/D9485 llvm-svn: 236535	2015-05-05 20:07:10 +00:00
Ulrich Weigand	9958c489bb	[DAGCombiner] Account for getVectorIdxTy() when narrowing vector load This patch makes ReplaceExtractVectorEltOfLoadWithNarrowedLoad convert the element number from getVectorIdxTy() to PtrTy before doing pointer arithmetic on it. This is needed on z, where element numbers are i32 but pointers are i64. Original patch by Richard Sandiford. llvm-svn: 236530	2015-05-05 19:34:10 +00:00
Ulrich Weigand	af2c618e2b	[DAGCombiner] Fix ReplaceExtractVectorEltOfLoadWithNarrowedLoad for BE For little-endian, the function would convert (extract_vector_elt (load X), Y) to X + Y*sizeof(elt). For big-endian it would instead use X + sizeof(vec) - Y*sizeof(elt). The big-endian case wasn't right since vector index order always follows memory/array order, even for big-endian. (Note that the current handling has to be wrong for Y==0 since it would access beyond the end of the vector.) Original patch by Richard Sandiford. llvm-svn: 236529	2015-05-05 19:33:37 +00:00
Ulrich Weigand	2693c0a491	[LegalizeVectorTypes] Allow single loads and stores for more short vectors When lowering a load or store for TypeWidenVector, the type legalizer would use a single load or store if the associated integer type was legal. E.g. it would load a v4i8 as an i32 if i32 was legal. This patch extends that behavior to promoted integers as well as legal ones. If the integer type for the full vector width is TypePromoteInteger, the element type is going to be TypePromoteInteger too, and it's still better to use a single promoting load or truncating store rather than N individual promoting loads or truncating stores. E.g. if you have a v2i8 on a target where i16 is promoted to i32, it's better to load the v2i8 as an i16 rather than load both i8s individually. Original patch by Richard Sandiford. llvm-svn: 236528	2015-05-05 19:32:57 +00:00
Ulrich Weigand	c1708b2618	[SystemZ] Add vector intrinsics This adds intrinsics to allow access to all of the z13 vector instructions. Note that instructions whose semantics can be described by standard LLVM IR do not get any intrinsics. For each instruction whose semantics cannot (fully) be described, we define an LLVM IR target-specific intrinsic that directly maps to this instruction. For instructions that also set the condition code, the LLVM IR intrinsic returns the post-instruction CC value as a second result. Instruction selection will attempt to detect code that compares that CC value against constants and use the condition code directly instead. Based on a patch by Richard Sandiford. llvm-svn: 236527	2015-05-05 19:31:09 +00:00
Ulrich Weigand	5211f9ff4d	[SystemZ] Mark v1i128 and v1f128 as unsupported The ABI specifies that <1 x i128> and <1 x fp128> are supposed to be passed in vector registers. We do not yet support those types, and some infrastructure is missing before we can do so. In order to prevent accidentally generating code violating the ABI, this patch adds checks to detect those types and error out if user code attempts to use them. llvm-svn: 236526	2015-05-05 19:30:05 +00:00
Ulrich Weigand	cd2a1b5341	[SystemZ] Handle sub-128 vectors The ABI allows sub-128 vectors to be passed and returned in registers, with the vector occupying the upper part of a register. We therefore want to legalize those types by widening the vector rather than promoting the elements. The patch includes some simple tests for sub-128 vectors and also tests that we can recognize various pack sequences, some of which use sub-128 vectors as temporary results. One of these forms is based on the pack sequences generated by llvmpipe when no intrinsics are used. Signed unpacks are recognized as BUILD_VECTORs whose elements are individually sign-extended. Unsigned unpacks can have the equivalent form with zero extension, but they also occur as shuffles in which some elements are zero. Based on a patch by Richard Sandiford. llvm-svn: 236525	2015-05-05 19:29:21 +00:00
Ulrich Weigand	49506d78e7	[SystemZ] Add CodeGen support for scalar f64 ops in vector registers The z13 vector facility includes some instructions that operate only on the high f64 in a v2f64, effectively extending the FP register set from 16 to 32 registers. It's still better to use the old instructions if the operands happen to fit though, since the older instructions have a shorter encoding. Based on a patch by Richard Sandiford. llvm-svn: 236524	2015-05-05 19:28:34 +00:00
Ulrich Weigand	80b3af7ab3	[SystemZ] Add CodeGen support for v4f32 The architecture doesn't really have any native v4f32 operations except v4f32->v2f64 and v2f64->v4f32 conversions, with only half of the v4f32 elements being used. Even so, using vector registers for <4 x float> and scalarising individual operations is much better than generating completely scalar code, since there's much less register pressure. It's also more efficient to do v4f32 comparisons by extending to 2 v2f64s, comparing those, then packing the result. This particularly helps with llvmpipe. Based on a patch by Richard Sandiford. llvm-svn: 236523	2015-05-05 19:27:45 +00:00
Ulrich Weigand	cd808237b2	[SystemZ] Add CodeGen support for v2f64 This adds ABI and CodeGen support for the v2f64 type, which is natively supported by z13 instructions. Based on a patch by Richard Sandiford. llvm-svn: 236522	2015-05-05 19:26:48 +00:00
Ulrich Weigand	ce4c109585	[SystemZ] Add CodeGen support for integer vector types This is the first of a series of patches to add CodeGen support exploiting the instructions of the z13 vector facility. This patch adds support for the native integer vector types (v16i8, v8i16, v4i32, v2i64). When the vector facility is present, we default to the new vector ABI. This is characterized by two major differences: - Vector types are passed/returned in vector registers (except for unnamed arguments of a variable-argument list function). - Vector types are at most 8-byte aligned. The reason for the choice of 8-byte vector alignment is that the hardware is able to efficiently load vectors at 8-byte alignment, and the ABI only guarantees 8-byte alignment of the stack pointer, so requiring any higher alignment for vectors would require dynamic stack re-alignment code. However, for compatibility with old code that may use vector types, when not using the vector facility, the old alignment rules (vector types are naturally aligned) remain in use. These alignment rules are not only implemented at the C language level (implemented in clang), but also at the LLVM IR level. This is done by selecting a different DataLayout string depending on whether the vector ABI is in effect or not. Based on a patch by Richard Sandiford. llvm-svn: 236521	2015-05-05 19:25:42 +00:00
Ulrich Weigand	a8b04e1cbc	[SystemZ] Add z13 vector facility and MC support This patch adds support for the z13 processor type and its vector facility, and adds MC support for all new instructions provided by that facility. Apart from defining the new instructions, the main changes are: - Adding VR128, VR64 and VR32 register classes. - Making FP64 a subclass of VR64 and FP32 a subclass of VR32. - Adding a D(V,B) addressing mode for scatter/gather operations - Adding 1-, 2-, and 3-bit immediate operands for some 4-bit fields. Until now all immediate operands have been the same width as the underlying field (hence the assert->return change in decode[SU]ImmOperand). In addition, sys::getHostCPUName is extended to detect running natively on a z13 machine. Based on a patch by Richard Sandiford. llvm-svn: 236520	2015-05-05 19:23:40 +00:00
Pete Cooper	336d90b61b	Revert "Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC" This reverts commit 963cdbccf6e5578822836fd9b2ebece0ba9a60b7 (i.e. r236514). This is to get the bots green while I investigate. llvm-svn: 236518	2015-05-05 18:49:08 +00:00
Pete Cooper	05b84d4168	Revert "Fix IfConverter to handle regmask machine operands." This reverts commit b27413cbfd78d959c18e713bfa271fb69e6b3303 (i.e. r236515). This is to get the bots green while I investigate the failures. llvm-svn: 236517	2015-05-05 18:49:05 +00:00
Pete Cooper	6ebc207703	Fix IfConverter to handle regmask machine operands. A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks. These are slightly different to a def in that we need to add both an implicit use and def to appease the machine verifier. Otherwise, uses after the if converted call could think they are reading an undefined register. Reviewed by Matthias Braun and Quentin Colombet. llvm-svn: 236515	2015-05-05 18:31:36 +00:00
Pete Cooper	bbd1c727d1	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236514	2015-05-05 18:31:31 +00:00
Daniel Berlin	3459d6ead5	Update BasicAliasAnalysis to understand that nothing aliases with undef values. It got this in some cases (if one of them was an identified object), but not in all cases. This caused stores to undef to block load-forwarding in some cases, etc. Added test to Transforms/GVN to verify optimization occurs as expected. llvm-svn: 236511	2015-05-05 18:10:49 +00:00
David Blaikie	73cf872adb	[opaque pointer type] Track explicit GEP pointee type through in-memory IR llvm-svn: 236510	2015-05-05 18:03:48 +00:00
Reid Kleckner	0738a9c02e	Re-land "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236360. This change exposed a bug in WinEHPrepare by opting win32 code into EH preparation. We already knew that WinEHPrepare has bugs, and is the status quo for x64, so I don't think that's a reason to hold off on this change. I disabled exceptions in the sanitizer tests in r236505 and an earlier revision. llvm-svn: 236508	2015-05-05 17:44:16 +00:00
Quentin Colombet	61b305edfd	[ShrinkWrap] Add (a simplified version) of shrink-wrapping. This patch introduces a new pass that computes the safe point to insert the prologue and epilogue of the function. The interest is to find safe points that are cheaper than the entry and exit blocks. As an example and to avoid regressions being introduced, this patch also implements the required bits to enable the shrink-wrapping pass for AArch64. Context Currently we insert the prologue and epilogue of the method/function in the entry and exit blocks. Although this is correct, we can do a better job when those are not immediately required and insert them at less frequently executed places. The job of the shrink-wrapping pass is to identify such places. Motivating example Let us consider the following function that performs a call in only one branch of an if: define i32 @f(i32 %a, i32 %b) { %tmp = alloca i32, align 4 %tmp2 = icmp slt i32 %a, %b br i1 %tmp2, label %true, label %false true: store i32 %a, i32* %tmp, align 4 %tmp4 = call i32 @doSomething(i32 0, i32* %tmp) br label %false false: %tmp.0 = phi i32 [ %tmp4, %true ], [ %a, %0 ] ret i32 %tmp.0 } On AArch64 this code generates (removing the cfi directives for readability): _f: ; @f ; BB#0: stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething LBB0_2: ; %false mov sp, x29 ldp x29, x30, [sp], #16 ret With shrink-wrapping we could generate: _f: ; @f ; BB#0: cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething add sp, x29, #16 ; =16 ldp x29, x30, [sp], #16 LBB0_2: ; %false ret Therefore, we would pay the overhead of setting up/destroying the frame only if we actually do the call. 
Proposed Solution This patch introduces a new machine pass that performs the shrink-wrapping analysis (See the comments at the beginning of ShrinkWrap.cpp for more details). It then stores the safe save and restore points into the MachineFrameInfo attached to the MachineFunction. This information is then used by the PrologEpilogInserter (PEI) to place the related code at the right place. This pass runs right before the PEI. Unlike the original paper of Chow from PLDI’88, this implementation of shrink-wrapping does not use expensive data-flow analysis and does not need a hack to properly avoid frequently executed points. Instead, it relies on dominance and loop properties. The pass is off by default and each target can opt-in by setting the EnableShrinkWrap boolean to true in their derived class of TargetPassConfig. This setting can also be overridden on the command line by using -enable-shrink-wrap. Before you try out the pass for your target, make sure you properly fix your emitProlog/emitEpilog/adjustForXXX methods to cope with basic blocks that are not necessarily the entry block. Design Decisions 1. ShrinkWrap is its own pass right now. It could frankly be merged into PEI but for debugging and clarity I thought it was best to have its own file. 2. Right now, we only support one save point and one restore point. At some point we can expand this to several save points and restore points; the impacted components would then be: - The pass itself: New algorithm needed. - MachineFrameInfo: Hold a list or set of Save/Restore points instead of one pointer. - PEI: Should loop over the save points and restore points. Anyhow, at least for this first iteration, I do not believe it is worthwhile to support the complex cases. We should revisit that when we have motivating examples. Differential Revision: http://reviews.llvm.org/D9210 <rdar://problem/3201744> llvm-svn: 236507	2015-05-05 17:38:16 +00:00
Lang Hames	cd68eba3b9	[Orc] Reapply r236465 with fixes for the MSVC bots. llvm-svn: 236506	2015-05-05 17:37:18 +00:00
Kit Barton	d4eb73c00e	This patch adds ABI support for v1i128 data type. It adds v1i128 to the appropriate register classes and checks parameter passing and return values. This is related to http://reviews.llvm.org/D9081, which will add instructions that exploit the v1i128 datatype. Phabricator review: http://reviews.llvm.org/D9475 llvm-svn: 236503	2015-05-05 16:10:44 +00:00
Igor Laevsky	2aa8cafaf6	Emit comment for gc.relocate showing base and derived pointers in human readable form. Differential Revision: http://reviews.llvm.org/D9326 llvm-svn: 236497	2015-05-05 13:20:42 +00:00
Daniel Sanders	eda60d217b	[mips] Generate code for insert/extract operations when using the N64 ABI and MSA. Summary: When using the N64 ABI, element-indices use the i64 type instead of i32. In many cases, we can use iPTR to account for this but additional patterns and pseudos are also required. This fixes most (but not quite all) failures in the test-suite when using N64 and MSA together. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9342 llvm-svn: 236494	2015-05-05 10:32:24 +00:00
Ismail Donmez	5eb52b74cb	Fix regression in parsing armv{6,7}hl- triples. These are used by SUSE and Redhat currently. Reviewed by Jonathan Roelofs. llvm-svn: 236492	2015-05-05 09:29:43 +00:00
Daniel Sanders	4160c802d9	[mips][msa] Test basic operations for the N32 ABI too. Summary: This required adding instruction aliases for dneg. N64 will be enabled shortly but requires additional bugfixes. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9341 llvm-svn: 236489	2015-05-05 08:48:35 +00:00
Kostya Serebryany	177582c90f	[lib/Fuzzer] use handle_abort=1 by default so that when assert() fires we save the test case llvm-svn: 236476	2015-05-05 01:42:55 +00:00
Lang Hames	ac31a1f141	[Orc] Revert r236465 - It broke the Windows bots. Looks like the usual missing explicit move-constructor issue with MSVC. I should have a fix shortly. llvm-svn: 236472	2015-05-04 23:30:01 +00:00
Reid Kleckner	9dad227b85	[X86] Fix assertion while DAG combining offsets and ExternalSymbols ExternalSymbol nodes do not contain offsets, unlike GlobalValue nodes. llvm-svn: 236471	2015-05-04 23:22:36 +00:00
Pete Cooper	4dddbcfbb1	[ARM] IT block insertion needs to update kill flags When forming an IT block from the first MOV here: %R2<def> = t2MOVr %R0, pred:1, pred:%CPSR, opt:%noreg %R3<def> = tMOVr %R0<kill>, pred:14, pred:%noreg the move into R3 is moved out of the IT block so that later instructions on the same predicate can be inside this block, and we can share the IT instruction. However, when moving the R3 copy out of the IT block, we need to clear its kill flags for anything in use at this point in time, i.e., R0 here. This appeases the machine verifier which thought that R0 wasn't defined when used. I have a test case, but it's extremely register allocator specific. It would be too fragile to commit a test which depends on the register allocator here. llvm-svn: 236468	2015-05-04 22:44:47 +00:00
Lang Hames	a68970dfd5	[Orc] Refactor the compile-on-demand layer to make module partitioning lazy, and avoid cloning unused decls into every partition. Module partitioning showed up as a source of significant overhead when I profiled some trivial test cases. Avoiding the overhead of partitioning for uncalled functions helps to mitigate this. This change also means that it is no longer necessary to have a LazyEmittingLayer underneath the CompileOnDemand layer, since the CompileOnDemandLayer will not extract or emit function bodies until they are called. llvm-svn: 236465	2015-05-04 22:03:10 +00:00
Tim Northover	851ff69b42	CodeGen: match up correct insertvalue indices when assessing tail calls. When deciding whether a value comes from the aggregate or inserted value of an insertvalue instruction, we compare the indices against those of the location we're interested in. One of the lists needs reversing because the input data is backwards (so that modifications take place at the end of the SmallVector), but we were reversing both before leading to incorrect results. Should fix PR23408 llvm-svn: 236457	2015-05-04 20:41:51 +00:00
Alex Lorenz	b122508501	YAML: Add an optional 'flow' field to the mapping trait to allow flow mapping output. This patch adds an optional 'flow' field to the MappingTrait class so that yaml IO will be able to output flow mappings. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9450 llvm-svn: 236456	2015-05-04 20:11:40 +00:00
Keno Fischer	d71a17710b	Respect object format choice on Darwin Summary: The object format can be set to something other than MachO, e.g. to use ELF-on-Darwin for MCJIT. This already works on Windows, so there's no reason it shouldn't on Darwin. Reviewers: lhames, grosbach Subscribers: rafael, grosbach, t.p.northover, llvm-commits Differential Revision: http://reviews.llvm.org/D6185 llvm-svn: 236455	2015-05-04 20:03:01 +00:00
Reid Kleckner	b61f06c9c2	Fix -Wmicrosoft warning by making enum unsigned llvm-svn: 236436	2015-05-04 18:21:35 +00:00
Davide Italiano	e197d4fbb6	[IR/Diagnostic] Assert that DebugLoc is valid before accessing. PR: 23380 Differential Revision: http://reviews.llvm.org/D9464 Reviewed by: dexonsmith llvm-svn: 236435	2015-05-04 18:08:35 +00:00
Hans Wennborg	8d8146fa73	Option parsing: properly handle flag aliases for joined options (PR23394) A joined option always needs to have an argument, even if it's an empty one. Clang would previously assert when trying to use --extra-warnings, which is a flag alias for -W, which is a joined option. llvm-svn: 236434	2015-05-04 18:00:13 +00:00
Ulrich Weigand	9ac2f9b2d8	[SystemZ] Reclassify f32 subregs of f64 registers At the moment, all subregs defined by the SystemZ target can be modified independently of the wider register. E.g. writing to a GR32 does not change the upper 32 bits of the GR64. Writing to an FP32 does not change the lower 32 bits of the FP64. However, the upcoming support for the vector extension redefines FP64 as one half of a V128. Floating-point operations leave the other half of a V128 in an unpredictable state, so it's no longer the case that writing to an FP32 leaves the bits of the underlying register (the V128) alone. I'd prefer to have separate subreg_ names for this situation, so that it's obvious at a glance whether we're talking about a subreg that leaves the other parts of the register alone. No behavioral change intended. Patch originally by Richard Sandiford. llvm-svn: 236433	2015-05-04 17:41:22 +00:00
Ulrich Weigand	1f698b003c	[SystemZ] Clean up AsmParser isMem() handling We know what MemoryKind an operand has at the time we construct it, so we might as well just record it in an unused part of the structure. This makes it easier to add scatter/gather addresses later. No behavioral change intended. Patch originally by Richard Sandiford. llvm-svn: 236432	2015-05-04 17:40:53 +00:00
Ulrich Weigand	1c6f07d616	[SystemZ] Fix getTargetNodeName It seems SystemZTargetLowering::getTargetNodeName got out of sync with some recent changes to the SystemZISD opcode list. Add back all the missing opcodes (and re-sort to the same order as SystemZISelLowering.h). llvm-svn: 236430	2015-05-04 17:39:40 +00:00
Pete Cooper	300069a019	ScheduleDAGInstrs should toggle kill flags on bundled instrs. ScheduleDAGInstrs wasn't setting or clearing the kill flags on instructions inside bundles. This led to code such as this %R3<def> = t2ANDrr %R0 BUNDLE %ITSTATE<imp-def,dead>, %R0<imp-use,kill> t2IT 1, 24, %ITSTATE<imp-def> R6<def,tied6> = t2ORRrr %R0<kill>, ... being transformed to BUNDLE %ITSTATE<imp-def,dead>, %R0<imp-use> t2IT 1, 24, %ITSTATE<imp-def> R6<def,tied6> = t2ORRrr %R0<kill>, ... %R3<def> = t2ANDrr %R0<kill> where the kill flag was removed from the BUNDLE instruction, but not the t2ORRrr inside it. The verifier then thought that R0 was undefined when read by the AND. This change makes the toggleKillFlags method also check for bundles and toggle flags on bundled instructions. Setting the kill flag is special cased as we only want to set the kill flag on the last instruction in the bundle. llvm-svn: 236428	2015-05-04 16:52:06 +00:00
Tom Stellard	b81f4aa952	R600/SI: Code cleanup This is a follow-up to r236004 llvm-svn: 236427	2015-05-04 16:45:08 +00:00
Elena Demikhovsky	60eb9db7bb	AVX-512: added calling convention for i1 vectors in 32-bit mode. Fixed some bugs in extend/truncate for AVX-512 target. Removed VBROADCASTM (masked broadcast) node, since it is not used any more. llvm-svn: 236420	2015-05-04 12:40:50 +00:00
Elena Demikhovsky	52266388f8	AVX-512: added integer "add" and "sub" instructions with saturation for SKX with intrinsics and tests by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 236418	2015-05-04 12:35:55 +00:00
Elena Demikhovsky	2557a22be7	AVX-512: Added VPACK* instruction forms for KNL and SKX and their intrinsics by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 236414	2015-05-04 09:14:02 +00:00
Yaron Keren	f8e6517591	Replace windows_error calls with mapWindowsError. After r210687, windows_error does nothing but call mapWindowsError. Other Windows/*.inc files directly call mapWindowsError. This patch updates Path.inc and Process.inc to do the same. llvm-svn: 236409	2015-05-04 04:48:10 +00:00
Craig Topper	23fdafe682	[TableGen] Replace 'static_cast' with 'cast'. llvm-svn: 236398	2015-05-04 01:35:42 +00:00
Craig Topper	a9642b4ec8	[TableGen] Formatting cleanup. Mostly removing trailing whitespace and unnecessary curly braces. NFC llvm-svn: 236397	2015-05-04 01:35:39 +00:00
Elena Demikhovsky	1b60ed7069	Masked gather and scatter intrinsics - enabled codegen for KNL. llvm-svn: 236394	2015-05-03 07:12:25 +00:00
Simon Pilgrim	d5e20306cc	[SSE2] Minor tidyup of v16i8 SHL lowering. NFC. Removed code that was replicating v8i16 'shift + mask' implementation that is done more nicely by making use of LowerScalarImmediateShift llvm-svn: 236388	2015-05-02 14:42:43 +00:00
Simon Pilgrim	017ca19384	[DAGCombiner] Enabled vector float/double -> int constant folding llvm-svn: 236387	2015-05-02 13:04:07 +00:00
David Blaikie	72d03efa6d	DebugInfo: Use low_pc relative debug_ranges under fission when the CU has a low_pc Seems we were setting the base address on the wrong DwarfCompileUnit object so it wasn't being used when generating the ranges. llvm-svn: 236377	2015-05-02 02:31:49 +00:00
Jim Grosbach	bfe3a9c318	Fix spelling. llvm-svn: 236367	2015-05-02 00:44:07 +00:00
Reid Kleckner	83d89fa546	Revert "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236359. Things are still broken despite testing. :( llvm-svn: 236360	2015-05-01 22:50:14 +00:00
Reid Kleckner	51476acd77	Re-land "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236340. llvm-svn: 236359	2015-05-01 22:40:25 +00:00
Quentin Colombet	0de2346859	[AArch64][FastISel] Variant of the logical instructions that use two input registers cannot write on SP. rdar://problem/20748715 llvm-svn: 236352	2015-05-01 21:34:57 +00:00
Colin LeMahieu	6efd273a61	[Hexagon] Removing variable unused in release. llvm-svn: 236351	2015-05-01 21:30:22 +00:00
Colin LeMahieu	b662565475	[Hexagon] Adding expression MC emission and removing XFAIL from test that hits this code path. llvm-svn: 236348	2015-05-01 21:14:21 +00:00
Quentin Colombet	9df2fa261b	[AArch64][FastISel] Fix the setting of kill flags for MUL -> UMULH sequences. rdar://problem/20748715 llvm-svn: 236346	2015-05-01 20:57:11 +00:00
Zachary Turner	6799af41fe	Fix build. llvm-svn: 236343	2015-05-01 20:33:10 +00:00
Zachary Turner	e5cb269352	[llvm-pdbdump] Support dynamic load address and external symbols. This patch adds the --load-address command line option to llvm-pdbdump, which dumps all addresses assuming the module has loaded at the specified address. Additionally, this patch adds an option to llvm-pdbdump to support dumping of public symbols (i.e. symbols with external linkage). llvm-svn: 236342	2015-05-01 20:24:26 +00:00
Keno Fischer	e6892c8149	Reapply [RuntimeDyldELF] Fold Placeholder into Addend This reapplies r235060 and r235070, which were reverted because of test failures in LLDB. The failure was caused because at the moment RuntimeDyld is processing relocations for all sections, irrespective of whether we actually load them into memory or not, but RuntimeDyld was not actually remembering where in memory the unrelocated section is. This commit includes a fix for that issue by remembering that pointer, though the longer term fix should be to stop processing unneeded sections. Original Summary: This allows us to get rid of the original unrelocated object file after we're done processing relocations (but before applying them). MachO and COFF already do not require this (currently we have temporary hacks to prevent ownership from being released, but those are brittle and should be removed soon). The placeholder mechanism allowed the relocation resolver to look at the original object file to obtain more information that is required to apply the relocations. This is usually necessary in two cases: - For relocations targeting sub-word memory locations, there may be pieces of the instruction at the target address which we should not override. - Some relocations on some platforms allow an extra addend to be encoded in their immediate fields. The problem is that in the second case the information cannot be recovered after the relocations have been applied once because they will have been overridden. In the first case we also need to be careful to not use any bits that aren't fixed and may have been overridden by applying a first relocation. In the past both have been fixed by just looking at the original object file. This patch attempts to recover the information from the first by looking at the relocated object file, while the extra addend in the second case is read upon relocation processing and added to the regular addend. I have tested this on X86. 
Other platforms represent my best understanding of how those relocations should work, but I may have missed something because I do not have access to those platforms. We will keep the ugly workarounds in place for a couple of days, so this commit can be reverted if it breaks the bots. Differential Revision: http://reviews.llvm.org/D9028 llvm-svn: 236341	2015-05-01 20:21:45 +00:00
Reid Kleckner	2747d3d55a	Revert "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236339, it breaks the win32 clang-cl self-host. llvm-svn: 236340	2015-05-01 20:14:04 +00:00
Reid Kleckner	4856fc61b4	[WinEH] Add an EH registration and state insertion pass for 32-bit x86 This pass is responsible for constructing the EH registration object that gets linked into fs:00, which is all it does in this change. In the future, it will also insert stores to update the EH state number. I considered keeping this functionality in WinEHPrepare, but it's pretty separable and X86 specific. It has conceptually very little to do with the task of WinEHPrepare, which is currently outlining. WinEHPrepare is also in theory useful on ARM, but this logic is pretty x86 specific. Reviewers: andrew.w.kaylor, majnemer Differential Revision: http://reviews.llvm.org/D9422 llvm-svn: 236339	2015-05-01 20:04:54 +00:00
Pete Cooper	f68d5038e6	[ARM] Transfer the internal flag in thumb2 size reduction. Converting from t2LDRs to tLDRr caused the shift argument to drop the internal flag. This would then throw machine verifier errors. Unfortunately I'm having trouble reducing a test case. I'm going to keep trying, but so far it's a scary combination of machine sinking, an 'and i1', loads feeding loads, and a bunch of code which shouldn't change IT block formation, but does. It's not useful to commit a test in that state as we have no way of knowing if it even hits this code reliably in future. rdar://problem/20752113 llvm-svn: 236333	2015-05-01 18:57:32 +00:00
Alex Lorenz	42e91fa312	YAML: Fix the output of sequences that contain flow sequences. This patch fixes a bug where the YAML Output class emitted a sequence of flow sequences without the '-' characters. Before: seq: [ a, b ] [ c, d ] After: seq: - [ a, b ] - [ c, d ] Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9206 llvm-svn: 236329	2015-05-01 18:34:25 +00:00
Peter Collingbourne	d27d3a151f	ARM: Align functions containing Thumb-2 jump tables to 4 bytes. Functions with jump tables need an alignment of 4 because they use the ADR instruction, which aligns the PC to 4 bytes before adding an offset. Differential Revision: http://reviews.llvm.org/D9424 llvm-svn: 236327	2015-05-01 18:05:59 +00:00
James Y Knight	35e04e84fa	[Sparc] Repair fixups in little endian mode. Differential Revision: http://reviews.llvm.org/D9434 llvm-svn: 236324	2015-05-01 17:13:02 +00:00
Douglas Katzman	49e96132ce	Fix omission from adding sparcel (original http://reviews.llvm.org/D9263 ) "obviously" it needs to go in parseArch and getArchTypeForLLVMName. Differential Revision: http://reviews.llvm.org/D9436 llvm-svn: 236322	2015-05-01 16:49:58 +00:00
Toma Tabacu	00e9867988	[mips] [IAS] Fix error messages for using LI with 64-bit immediates. Summary: LI should never accept immediates larger than 32 bits. The additional Is32BitImm boolean also paves the way for unifying the functionality that LA and LI have in common. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9289 llvm-svn: 236313	2015-05-01 12:19:27 +00:00
Toma Tabacu	a2861db834	[mips] [IAS] Slightly improve shift instruction generation in expandLoadImm. Summary: Generate one DSLL32 of 0 instead of two consecutive DSLL of 16. In order to do this I had to change createLShiftOri's template argument from a bool to an unsigned. This also gave me the opportunity to rewrite the mips64-expansions.s test, as it was testing the same cases multiple times and skipping over other cases. It was also somewhat unreadable, as the CHECK lines were grouped in a huge block of text at the beginning of the file. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8974 llvm-svn: 236311	2015-05-01 10:26:47 +00:00
Simon Pilgrim	9fb06bca67	[SelectionDAG] Unary vector constant folding integer legality fixes This patch fixes issues with vector constant folding not correctly handling scalar input operands if they require implicit truncation - this was tested with llvm-stress as recommended by Patrik H Hagglund. The patch ensures that integer input scalars from a build vector are correctly truncated before folding, and that constant integer scalar results are promoted to a legal type before inclusion in the new folded build vector. I have added another crash test case and also a test for UINT_TO_FP / SINT_TO_FP using a non-truncated scalar input, which was failing before this patch. Differential Revision: http://reviews.llvm.org/D9282 llvm-svn: 236308	2015-05-01 08:20:04 +00:00
Tom Stellard	aa798340c3	R600/SI: Add VCC as an implicit def of SI_KILL When SI_KILL has a register operand, its lowered form writes to vcc. llvm-svn: 236307	2015-05-01 03:44:09 +00:00
Tom Stellard	0b7feb1cb7	R600/SI: Fix verifier errors from the SIAnnotateControlFlow pass This pass was generating 'Instruction does not dominate all uses!' errors for programs which had loops with a condition variable that depended on the result of a phi instruction from outside of the loop. The pass was inserting new phi nodes outside of the loop which used values defined inside the loop. http://bugs.freedesktop.org/show_bug.cgi?id=90056 llvm-svn: 236306	2015-05-01 03:44:08 +00:00
Pete Cooper	2127b00cd5	[ARM] optimizeSelect should clear kill flags. If we move an instruction from one block down to a MOVC and predicate it, then the original instruction could be moved into a loop. In this case, it's invalid for any kill flags to remain on there. Fails with -verify-machineinstrs. rdar://problem/20752113 llvm-svn: 236290	2015-04-30 23:57:47 +00:00
Justin Bogner	ba1900cefd	InstrProf: Instrumenter support for setting profile output from command line This change is the second of 3 patches to add support for specifying the profile output from the command line via -fprofile-instr-generate=<path>, where the specified output path/file will be overridden by the LLVM_PROFILE_FILE environment variable. This patch adds the necessary support to the llvm instrumenter, specifically a new member of GCOVOptions for clang to save the specified filename, and support for calling the new compiler-rt interface from __llvm_profile_init. Patch by Teresa Johnson. Thanks! llvm-svn: 236288	2015-04-30 23:49:23 +00:00
Matt Arsenault	59d2ca1cba	Fix typo llvm-svn: 236283	2015-04-30 23:20:56 +00:00
Pete Cooper	451755d370	Commute the internal flag on MachineOperands. When commuting a thumb instruction in the size reduction pass, thumb instructions are represented as a bundle and so some operands may be marked as internal. The internal flag has to move with the operand when commuting. This test is sensitive to register allocation so can't specifically check that this error was happening, but so long as it continues to pass with -verify then hopefully it's still OK. rdar://problem/20752113 llvm-svn: 236282	2015-04-30 23:14:14 +00:00
Richard Trieu	6ae37961a8	Fix -Wpessimizing-move warnings by removing std::move calls. llvm-svn: 236278	2015-04-30 23:07:00 +00:00
Rafael Espindola	327fd4a255	Remove unnecessary break. llvm-svn: 236275	2015-04-30 22:41:12 +00:00
Quentin Colombet	329fa890ba	[AArch64] Fix bad register class constraint in fast-isel for TST instruction. rdar://problem/20748715 llvm-svn: 236273	2015-04-30 22:27:20 +00:00
Pete Cooper	5111881cfc	Don't always apply kill flag in thumb2 ABS pseudo expansion. The expansion for t2ABS was always setting the kill flag on the rsb instruction. It should instead only be set on rsb if it was set on the original ABS instruction. rdar://problem/20752113 llvm-svn: 236272	2015-04-30 22:15:59 +00:00
Reid Kleckner	60d5232be2	[X86] Use 4 byte preferred aggregate alignment on Win32 This helps reduce the frequency of stack realignment prologues in 32-bit X86 Windows code. Before this change and the corresponding clang change, we would take the max of the type preferred alignment and the explicit alignment on the alloca. If you don't override aggregate alignment in datalayout, you get a default of 8. This dates back to 2007 / r34356, and changing it seems prohibitively difficult at this point. llvm-svn: 236270	2015-04-30 22:11:59 +00:00
Matthias Braun	e48484c64f	InstCombineSimplifyDemanded: Remove nsw/nuw flags when optimizing demanded bits When optimizing demanded bits of the operands of an Add we have to remove the nsw/nuw flags as we have no guarantee anymore that we don't wrap. This is legal here because the top bit is not demanded. In fact this operation was already performed but missed in the case of an Add with a constant on the right side. To fix this, this patch refactors the code to unify the code paths in SimplifyDemandedUseBits() handling of Add/Sub: - The transformation of Add->Or is removed from the simplify demand code because the equivalent transformation exists in InstCombiner::visitAdd() - KnownOnes/KnownZero are not adjusted for Add x, C anymore as computeKnownBits() already performs these computations. - The simplification of the operands is unified. In this new version constants on the right side of a Sub are shrunk now as I could not find a reason why not to do so. - The special case for clearing nsw/nuw in ShrinkDemandedConstant() is not necessary anymore as the caller does that already. Differential Revision: http://reviews.llvm.org/D9415 llvm-svn: 236269	2015-04-30 22:05:30 +00:00
Matthias Braun	ec6833420f	InstCombine: Move Sub->Xor rule from SimplifyDemanded to InstCombine The rule that turns a sub to xor if the LHS is 2^n-1 and the remaining bits are known zero, does not use the demanded bits at all: Move it to the normal InstCombine code path. Differential Revision: http://reviews.llvm.org/D9417 llvm-svn: 236268	2015-04-30 22:04:26 +00:00
Rafael Espindola	868b3f47d2	Simplify the creation of compressed debug sections. This is actually fairly simple in the current code layout: Check if we should compress just before writing out and everything else just works. This removes the last case in which the object writer was creating a fragment. llvm-svn: 236267	2015-04-30 21:51:58 +00:00
Rafael Espindola	74ef480f8f	Replace unreachable code with llvm_unreachable. llvm-svn: 236261	2015-04-30 21:20:06 +00:00
Rafael Espindola	b186391e2d	Don't create a MCSectionData when we don't have to. llvm-svn: 236260	2015-04-30 21:10:06 +00:00
Andrea Di Biagio	c84b5bdd69	Fix for PR23103. Correctly propagate the 'IsUndef' flag to the register operands of a commuted instruction. Revision 220239 exposed a latent bug in method 'TargetInstrInfo::commuteInstruction'. When commuting the operands of a machine instruction, method 'commuteInstruction' didn't correctly propagate the 'IsUndef' flag to the register operands of the new (commuted) instruction. Before this patch, the following instruction: %vreg4<def> = VADDSDrr %vreg14, %vreg5<undef>; FR64:%vreg4,%vreg14,%vreg5 was wrongly converted by method 'commuteInstruction' into: %vreg4<def> = VADDSDrr %vreg5, %vreg14<undef>; FR64:%vreg4,%vreg5,%vreg14 The correct instruction should have been: %vreg4<def> = VADDSDrr %vreg5<undef>, %vreg14; FR64:%vreg4,%vreg5,%vreg14 This patch fixes the problem in method 'TargetInstrInfo::commuteInstruction'. When swapping the operands of a machine instruction, we now make sure that 'IsUndef' flags are correctly set. Added test case 'pr23103.ll'. Differential Revision: http://reviews.llvm.org/D9406 llvm-svn: 236258	2015-04-30 21:03:29 +00:00
Rafael Espindola	a001a32c31	Avoid an extra loop over the sections. Add string to the section header string table as we add sections. llvm-svn: 236257	2015-04-30 20:57:14 +00:00
Rafael Espindola	03d7abbbe5	Make the section table a member of ELFObjectWriter. This avoids passing it around and lets us build a small helper to add a section to the table. llvm-svn: 236255	2015-04-30 20:53:27 +00:00
Rafael Espindola	657117f82c	Nothing inherits from this, drop the protected: llvm-svn: 236253	2015-04-30 20:37:36 +00:00
Matt Arsenault	d42e017ee4	Mips: Remove dead declaration llvm-svn: 236250	2015-04-30 19:35:43 +00:00
Matt Arsenault	ee5c2ab734	MachineVerifier: Don't crash if MachineOperand has no parent If you somehow added a MachineOperand to an instruction that did not have the parent set, the verifier would crash since it attempts to use the operand's parent. llvm-svn: 236249	2015-04-30 19:35:41 +00:00
Pete Cooper	4d8d2ec3eb	Don't rewrite jumps to empty BBs to landing pads. In the test case here, the 'unreachable' BB was removed by BranchFolding because it's empty. It then rewrote the jump from 'entry' to jump to its fallthrough, which was a landing pad. This results in 'entry' jumping to 2 different landing pads, which fails the machine verifier. rdar://problem/20750162 llvm-svn: 236248	2015-04-30 18:58:23 +00:00
Quentin Colombet	0a905042cd	[ARM] Do not generate invalid encoding for stack adjust, even if this is just temporary. Because of that: 1. The machine verifier was complaining about such code. 2. The generated code worked just because the thumb size reduction pass fixed the opcode. rdar://problem/20749824 llvm-svn: 236247	2015-04-30 18:52:49 +00:00
Tim Northover	03b99f66d7	AArch64: add BFC alias for the BFI/BFM instructions. Unlike 32-bit ARM, AArch64 can use wzr/xzr to implement this without the need for a separate instruction. rdar://18679590 llvm-svn: 236245	2015-04-30 18:28:58 +00:00
Reid Kleckner	582786b6cc	Add a note about permitting default member initializers Use them in WinEHPrepare so that we can spot any toolchain bugs that come up. llvm-svn: 236244	2015-04-30 18:17:12 +00:00
Jan Vesely	808fff585b	Reinstate revisions r234755, r234759, r234760 changes: Don't apply on hexagon and NVPTX since they no longer claim to support UADDO/USUBO Add location to getConstant Drop comment about the ops being turned into expand llvm-svn: 236240	2015-04-30 17:15:56 +00:00
Rafael Espindola	bda1980917	Write sections mostly in one pass. During ELF writing, there is no need to further relax the sections, so we should not be creating fragments. This patch avoids doing so in all cases but debug section compression (that is next). Also, the ELF format is fairly simple to write. We can do a single pass over the sections to write them out and compute the section header table. llvm-svn: 236235	2015-04-30 14:21:49 +00:00
Aaron Ballman	9cab732161	Silencing an "enumeral and non-enumeral type in conditional expression" warning; NFC. llvm-svn: 236234	2015-04-30 14:03:12 +00:00
Daniel Jasper	232778a7a0	Silence unused warning in non-assert builds. llvm-svn: 236213	2015-04-30 09:01:21 +00:00
Daniel Jasper	0366cd23ac	Inline local variable to silence unused warning. llvm-svn: 236212	2015-04-30 08:51:13 +00:00
Elena Demikhovsky	e1eda8a9e6	Masked gather and scatter - added DAGCombine visitors and AVX-512 instruction selection patterns. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 236211	2015-04-30 08:38:48 +00:00
Simon Pilgrim	ecf5875bd5	[SSE] Fix for MUL v16i8 on pre-SSE41 targets (PR23369). Sign extension of i8 to i16 was placing the unpacked bytes in the lower byte instead of the upper byte. llvm-svn: 236209	2015-04-30 08:23:16 +00:00
Craig Topper	85c07007ea	[TableGen] Cleanup formatting by moving operators from beginning of line to end of previous line. NFC llvm-svn: 236206	2015-04-30 05:54:22 +00:00
Craig Topper	9ed595e41d	[TableGen] Used range-based for loop. NFC. llvm-svn: 236205	2015-04-30 05:54:20 +00:00
Craig Topper	9ef76499b2	[TableGen] Merge a variable assignment and a return to drop curly braces. Fold an assignment into an if. Use auto on the result of a couple dyn_casts. NFC llvm-svn: 236204	2015-04-30 05:12:52 +00:00
Sanjoy Das	08e95b4703	[InstCombine] Add new rule for MIN(MAX(~A, ~B), ~C) et al. Summary: Optimizing these well is especially interesting for IRCE since it "clamps" values by generating this sort of pattern through SCEV expressions. Depends on D9352. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9353 llvm-svn: 236203	2015-04-30 04:56:04 +00:00
Sanjoy Das	a8c178f280	[InstCombine] Add a new formula for SMIN. Summary: After this change `MatchSelectPattern` recognizes the following form of SMIN: Y >s C ? ~Y : ~C == ~Y <s ~C ? ~Y : ~C = SMIN(~Y, ~C) Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9352 llvm-svn: 236202	2015-04-30 04:56:00 +00:00
Filipe Cabecinhas	f8a16a952d	Don't overflow GCTable Summary: Bug found with AFL fuzz. Reviewers: rafael, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9361 llvm-svn: 236200	2015-04-30 04:09:41 +00:00
Owen Anderson	d8a029c81b	Semantically revert r236031, which is not a good idea for in-order targets. At the least it should be guarded by some kind of target hook. It also introduced catastrophic compile time and code quality regressions on some out of tree targets (test case still being reduced/sanitized). Sanjay agreed with reverting this patch until these issues can be resolved. llvm-svn: 236199	2015-04-30 04:06:32 +00:00
Filipe Cabecinhas	9a19e56306	Make sure Op->getType() is a PointerType before we cast<> it. Bug found with AFL fuzz. llvm-svn: 236193	2015-04-30 01:13:31 +00:00
Hans Wennborg	4b828d35fd	Switch lowering: use profile info to build weight-balanced binary search trees This will cause hot nodes to appear closer to the root. The literature says building the tree like this makes it a near-optimal (in terms of search time given key frequencies) binary search tree. In LLVM's case, we can do up to 3 comparisons in each leaf node, so it might be better to opt for lower tree height in some cases; that's something to look into in the future. Differential Revision: http://reviews.llvm.org/D9318 llvm-svn: 236192	2015-04-30 00:57:37 +00:00
Filipe Cabecinhas	bad0779f63	Make sure we don't resize(0) when we get a fwdref with Idx == UINT_MAX Make it an error instead. Bug found with AFL fuzz. llvm-svn: 236190	2015-04-30 00:52:42 +00:00
Rafael Espindola	34948e5e22	Store relocations in a map from MCSectionELF. Saves finding the MCSectionData just to do a map lookup. llvm-svn: 236189	2015-04-30 00:45:46 +00:00
Rafael Espindola	b8cbb2678f	Write relocations directly to the output stream. NFC. llvm-svn: 236187	2015-04-30 00:30:40 +00:00
Pete Cooper	46361a1ea1	Change x86 CMOVE_F to read its source, not write it. This was breaking sqlite with the machine verifier because operand 0 was a def according to tablegen, but didn't have the 'isDef' flag set. Looking at the ISA, it's clear that this operand is a source as writing to st(0) is implicit. So move the operand to the correct place in the td file. rdar://problem/20751584 llvm-svn: 236183	2015-04-29 23:51:33 +00:00
David Blaikie	bf0a42ac09	[opaque pointer type] Store the value type of an alloca llvm-svn: 236175	2015-04-29 23:00:35 +00:00
Reid Kleckner	bcda1cd45a	[WinEH] Start EH preparation for 32-bit x86, it uses no arguments 32-bit x86 MSVC-style exceptions are functionally similar to 64-bit, but they take no arguments. Instead, they implicitly use the value of EBP passed in by the caller as a pointer to the parent's frame. In LLVM, we can represent this as llvm.frameaddress(1), and feed that into all of our calls to llvm.framerecover. The next steps are: - Add an alloca to the fs:00 linked list of handlers - Add something like llvm.sjlj.lsda or generalize it to store in the alloca - Move state number calculation to WinEHPrepare, arrange for FunctionLoweringInfo to call it - Use the state numbers to insert explicit loads and stores in the IR llvm-svn: 236172	2015-04-29 22:49:54 +00:00
Sanjay Patel	04b0e92766	generalize binop reassociation; NFC Move the fold introduced in r236031: http://reviews.llvm.org/rL236031 to its own helper function, so we can use it for other binops. This is a preliminary step before partially solving: https://llvm.org/bugs/show_bug.cgi?id=21768 https://llvm.org/bugs/show_bug.cgi?id=23116 llvm-svn: 236171	2015-04-29 22:30:02 +00:00
Pat Gavlin	022c5acad8	Run StatepointLowering.{cpp,h} through clang-format. llvm-svn: 236166	2015-04-29 21:52:45 +00:00
Chris Bieneman	664294cf37	[NFC] Converting to range-based for. llvm-svn: 236163	2015-04-29 21:45:22 +00:00
David Blaikie	f64246be72	[opaque pointer type] Pass GlobalAlias the actual pointer type rather than decomposing it into pointee type + address space Many of the callers already have the pointer type anyway, and for the couple of callers that don't it's pretty easy to call PointerType::get on the pointee type and address space. This keeps LLParser from using PointerType::getElementType when parsing GlobalAliases from IR. llvm-svn: 236160	2015-04-29 21:22:39 +00:00
Rafael Espindola	59f0e3182d	Inline FragmentWriter into the only user. llvm-svn: 236158	2015-04-29 21:13:30 +00:00
Rafael Espindola	91fd2778b0	Write the symbol table directly to the output file. There is no need to first accumulate it in fragments. llvm-svn: 236157	2015-04-29 21:09:32 +00:00
Sanjay Patel	caf5180ff7	tidy up; NFC llvm-svn: 236156	2015-04-29 21:01:41 +00:00
Rafael Espindola	8c7829b82f	Use pwrite to write the number of sections. This avoids having to compute the number upfront, which will be used in the next patch. llvm-svn: 236153	2015-04-29 20:39:37 +00:00
Sanjay Patel	ee6678119d	too much space again; NFC llvm-svn: 236150	2015-04-29 20:38:02 +00:00
Rafael Espindola	88abc39d70	Write the string table directly to the output file. There is no need to accumulate it in fragments first. llvm-svn: 236148	2015-04-29 20:34:31 +00:00
Sanjay Patel	435efaadff	too much space; NFC llvm-svn: 236147	2015-04-29 20:32:57 +00:00
Douglas Katzman	9160e78ac8	[Sparc] Really add sparcel architecture support. Mostly copy-and-paste from Sparc v8 architecture. Differential Revision: http://reviews.llvm.org/D8741 llvm-svn: 236146	2015-04-29 20:30:57 +00:00
Rafael Espindola	88d1f632cf	Write the section header string table directly to the output stream. Instead of accumulating the content in a fragment first, just write it to the output stream. Also put it first in the section table, so that we never have to worry about its index being >= SHN_LORESERVE. llvm-svn: 236145	2015-04-29 20:25:24 +00:00
Manman Ren	0e20822887	[AArch64] Refactor out codes that depend on specific CS save sequence. No functionality change. llvm-svn: 236143	2015-04-29 20:03:38 +00:00
Rafael Espindola	ae7e4995ca	Avoid a few const_cast. llvm-svn: 236141	2015-04-29 19:20:10 +00:00
Tim Northover	5211715360	ARM: mark branch-like instructions with correct flags. There's probably no way to test BXJ, but if the compiler ever did emit it during CodeGen it would have to be a block terminator so "isBranch" is appropriate. BLX is more tricky. Clearly a call, but it affects surprisingly little. rdar://18719544 llvm-svn: 236140	2015-04-29 19:16:38 +00:00
Douglas Katzman	e0ff282d18	New architecture name - 'sparcel' for Sparc little-endian. Differential Revision: http://reviews.llvm.org/D9263 llvm-svn: 236139	2015-04-29 19:15:08 +00:00
Douglas Katzman	9cb88b73c6	Make Sparc assembler accept parenthesized constant expressions. Differential Revision: http://reviews.llvm.org/D9087 llvm-svn: 236137	2015-04-29 18:48:29 +00:00
Zoran Jovanovic	387ce30685	[mips][microMIPSr6] Implement MUL, MUH, MULU and MUHU instructions Differential Revision: http://reviews.llvm.org/D8894 llvm-svn: 236131	2015-04-29 17:23:22 +00:00
Andrew Kaylor	a33f159056	[WinEH] Fix minor bug in begincatch block splitting llvm-svn: 236129	2015-04-29 17:21:26 +00:00
Adrian Prantl	ba6ec4b70b	Temporarily relax a check in the debug info verifier. The clang frontend helps out GDB by emitting the members of local anonymous unions as artificial local variables with shared storage. When SROA splits the storage for artificial local variables that are smaller than the entire union, the overhang piece will be outside of the allotted space for the variable and this check fails. rdar://problem/20730771 llvm-svn: 236124	2015-04-29 16:52:17 +00:00
Reid Kleckner	c695471365	[X86] Avoid mangling frameescape labels x86 Windows uses the '_' prefix for all global symbols, and this was mistakenly being applied to frameescape labels, which are not externally visible global symbols. They use the private global prefix 'L'. The right way to fix this is probably to stop masquerading this label as an ExternalSymbol and create a new SDNode type. These labels are not "external", and we know they will be resolved by assembly time. Having a custom SDNode type would allow us to do better X86 address mode matching, so it's probably worth doing eventually. llvm-svn: 236123	2015-04-29 16:46:01 +00:00
Duncan P. N. Exon Smith	a9308c49ef	IR: Give 'DI' prefix to debug info metadata Finish off PR23080 by renaming the debug info IR constructs from `MD` to `DI`. The last of the `DIDescriptor` classes were deleted in r235356, and the last of the related typedefs removed in r235413, so this has all baked for about a week. Note: If you have out-of-tree code (like a frontend), I recommend that you get everything compiling and tests passing with the previous commit before updating to this one. It'll be easier to keep track of what code is using the `DIDescriptor` hierarchy and what you've already updated, and I think you're extremely unlikely to insert bugs. YMMV of course. Back to this commit: I did this using the rename-md-di-nodes.sh upgrade script I've attached to PR23080 (both code and testcases) and filtered through clang-format-diff.py. I edited the tests for test/Assembler/invalid-generic-debug-node-*.ll by hand since the columns were off-by-three. It should work on your out-of-tree testcases (and code, if you've followed the advice in the previous paragraph). Some of the tests are in badly named files now (e.g., test/Assembler/invalid-mdcompositetype-missing-tag.ll should be 'dicompositetype'); I'll come back and move the files in a follow-up commit. llvm-svn: 236120	2015-04-29 16:38:44 +00:00
Jan Vesely	7539548738	CodeGen: Default overflow operations to expand so we don't have to assume targets are lying Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: ab Differential Revision: http://reviews.llvm.org/D9265 llvm-svn: 236119	2015-04-29 16:30:46 +00:00
Zoran Jovanovic	cca29e8f6e	[mips][microMIPSr6] Implement SUB and SUBU instructions Differential Revision: http://reviews.llvm.org/D8764 llvm-svn: 236118	2015-04-29 16:22:46 +00:00
Zoran Jovanovic	5f34d44354	[mips][microMIPSr6] Implement ADD, ADDU and ADDIU instructions Differential Revision: http://reviews.llvm.org/D8704 llvm-svn: 236111	2015-04-29 15:11:07 +00:00
James Y Knight	c09bdfa4cb	Sparc: Prefer reg+reg address encoding when only one register used. Reg+%g0 is preferred to Reg+imm0 by the manual, and is what GCC produces. Furthermore, reg+imm is invalid for the (not yet supported) "alternate address space" instructions. Differential Revision: http://reviews.llvm.org/D8753 llvm-svn: 236107	2015-04-29 14:54:44 +00:00
Vasileios Kalintiris	1249e74648	Mips fast-isel - handle functions which return i8 or i16. Summary: Allow Mips fast-isel to handle functions which return i8/i16 signed/unsigned. Test Plan: Make check tests are forthcoming. Already passes test-suite at O0/O2 for Mips 32 r1/r2 Reviewers: dsanders, rkotler Subscribers: llvm-commits, rfuhler Differential Revision: http://reviews.llvm.org/D6765 llvm-svn: 236103	2015-04-29 14:17:14 +00:00
Daniel Sanders	301f937765	[mips] Correct 128-bit shifts on 64-bit targets. Summary: The existing code was correct for 32-bit GPR's but not 64-bit GPR's. It now accounts for both cases. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits, mohit.bhakkad, sagar Differential Revision: http://reviews.llvm.org/D9337 llvm-svn: 236099	2015-04-29 12:28:58 +00:00
Toma Tabacu	79588100d7	[mips] [IAS] Inline assemble-time shifting out of createLShiftOri. NFC. Summary: Do the assemble-time shifts from createLShiftOri at the source, which groups all the shifting together, closer to the main logic path, and store the results in concisely-named variables to improve code clarity. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8973 llvm-svn: 236096	2015-04-29 10:19:56 +00:00
Elena Demikhovsky	a9f20495a2	fixed 80-chars; NFC llvm-svn: 236093	2015-04-29 08:49:57 +00:00
Elena Demikhovsky	ac969012ef	Fixed masked gather/scatter switch-case llvm-svn: 236092	2015-04-29 08:38:53 +00:00
Craig Topper	119998dbf1	[TableGen] Use range-based for loops. NFC. llvm-svn: 236089	2015-04-29 07:13:14 +00:00
Craig Topper	39ba33b8f0	[TableGen] Fold a couple dyn_casts into the ifs that check their results. NFC llvm-svn: 236088	2015-04-29 07:13:12 +00:00
Craig Topper	ed5a950808	[TableGen] Replace some dyn_casts followed by an assert with just a regular cast which asserts internally. NFC llvm-svn: 236087	2015-04-29 07:13:05 +00:00
Elena Demikhovsky	744fe0de33	fixed comments, blanks, nullptr; NFC llvm-svn: 236086	2015-04-29 06:49:50 +00:00
Craig Topper	eb4d7c6b70	[TableGen] Use range-based for loops. NFC llvm-svn: 236083	2015-04-29 04:43:36 +00:00
Filipe Cabecinhas	6fe8aabd45	Use an "early return" idiom for the error case. NFC llvm-svn: 236080	2015-04-29 02:36:08 +00:00
Filipe Cabecinhas	d8a1bcd0ad	Check that we have a valid PointerType element type before calling get() Same as r236073 but for PointerType. Bug found with AFL fuzz. llvm-svn: 236079	2015-04-29 02:27:28 +00:00
Filipe Cabecinhas	52d42c725a	Use the ArrayType member function for array element types. ArrayType and StructType accept the same types, so no test. llvm-svn: 236078	2015-04-29 02:27:21 +00:00
Filipe Cabecinhas	1351cba720	Turn an assert into report_fatal_error since it's reachable based on user input Bug found with AFL fuzz. llvm-svn: 236076	2015-04-29 01:58:31 +00:00
Filipe Cabecinhas	f15fb032ef	Make sure that isValidElementType(Type) before calling {Array,Struct}Type::get(Type) Bug found with AFL fuzz. llvm-svn: 236073	2015-04-29 01:27:01 +00:00
Matthias Braun	5295793bca	RegisterCoalescer: hide terminal rule option by default llvm-svn: 236062	2015-04-28 23:55:11 +00:00
Rafael Espindola	89feff3b76	Map directly from signature symbol to group index. NFC. llvm-svn: 236058	2015-04-28 22:59:58 +00:00
Eric Christopher	0ba41a6841	Reuse a lookup in an assert. llvm-svn: 236054	2015-04-28 22:38:35 +00:00
Rafael Espindola	cf6d5a9f94	Remove redundant temporary std::vector. New sections are added to the end of the list, so the RelSections array was redundant. llvm-svn: 236053	2015-04-28 22:26:19 +00:00
Tim Northover	e18d662201	ARM: fix peephole optimisation of TST We were trying to look through COPY instructions, but only to the next instruction in a BB and incorrectly anyway. The cases where that would actually be a good idea are rare enough (and not even tested!) that it's not worth trying to get right. rdar://20721342 llvm-svn: 236050	2015-04-28 22:03:55 +00:00
Rafael Espindola	41920d0382	Avoid one more walk over all sections. NFC. Set the group section index as they are created. llvm-svn: 236049	2015-04-28 22:03:22 +00:00
Andrew Kaylor	91307434f4	Style updates llvm-svn: 236048	2015-04-28 22:01:51 +00:00
Rafael Espindola	fac3fbc5ff	Use a range loop. NFC. llvm-svn: 236047	2015-04-28 21:58:05 +00:00
Andrew Kaylor	046f7b42f2	[WinEH] Split blocks at calls to llvm.eh.begincatch Differential Revision: http://reviews.llvm.org/D9311 llvm-svn: 236046	2015-04-28 21:54:14 +00:00
Rafael Espindola	8a90d87d76	Avoid an extra walk over the sections just to assign sections to groups. Assign the sections in the same pass we compute the index. llvm-svn: 236045	2015-04-28 21:52:33 +00:00
James Y Knight	e8da8096ec	Sparc: Add alternate aliases for conditional branch instructions. llvm-svn: 236042	2015-04-28 21:27:31 +00:00
Rafael Espindola	55a3afb418	Remove the GroupMapTy DenseMap. NFC. Instead use the Group symbol of MCSectionELF. llvm-svn: 236033	2015-04-28 21:07:28 +00:00
Sanjay Patel	2fbc4e5c49	transform fadd chains to increase parallelism This is a compromise: with this simple patch, we should always handle a chain of exactly 3 operations optimally, but we're not generating the optimal balanced binary tree for a longer sequence. In general, this transform will reduce the dependency chain for a sequence of instructions using N operands from a worst case N-1 dependent operations to N/2 dependent operations. The optimal balanced binary tree would reduce the chain to log2(N). The trade-off for not dealing with longer sequences is: (1) we have less complexity in the compiler, (2) we avoid unknown compile-time blowup calculating a balanced tree, and (3) we don't need to worry about the increased register pressure required to parallelize longer sequences. It also seems unlikely that we would ever encounter really long strings of dependent ops like that in the wild, but I'm not sure how to verify that speculation. FWIW, I see no perf difference for test-suite running on btver2 (x86-64) with -ffast-math and this patch. We can extend this patch to cover other associative operations such as fmul, fmax, fmin, integer add, integer mul. This is a partial fix for: https://llvm.org/bugs/show_bug.cgi?id=17305 and if extended: https://llvm.org/bugs/show_bug.cgi?id=21768 https://llvm.org/bugs/show_bug.cgi?id=23116 The issue also came up in: http://reviews.llvm.org/D8941 Differential Revision: http://reviews.llvm.org/D9232 llvm-svn: 236031	2015-04-28 21:03:22 +00:00
Alexei Starovoitov	659ece9ddb	[bpf] fix build Patch by Brenden Blanco. llvm-svn: 236030	2015-04-28 20:38:56 +00:00
Rafael Espindola	ad3cfaaa20	Use range loops. NFC. llvm-svn: 236028	2015-04-28 20:23:35 +00:00
Filipe Cabecinhas	b435d0f439	Relax an assert when there's a type mismatch in forward references Summary: We don't seem to need to assert here, since this function's callers expect to get a nullptr on error. This way we don't assert on user input. Bug found with AFL fuzz. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9308 llvm-svn: 236027	2015-04-28 20:18:47 +00:00
Rafael Espindola	e3ff9305cd	Avoid adding to SectionIndexMap sections that we never lookup. NFC. llvm-svn: 236026	2015-04-28 20:09:13 +00:00
Daniel Berlin	ec1de3fb19	Make getModRefInfo(Instruction *) not crash on certain types of instructions llvm-svn: 236023	2015-04-28 19:19:14 +00:00
Rafael Espindola	163f672cd5	Use a range loop. NFC. llvm-svn: 236015	2015-04-28 19:07:16 +00:00
Sanjay Patel	f75ee4dc07	[x86] remove RCPPS and RSQRTPS intrinsic instruction definitions We don't need codegen-only intrinsic instructions for the vector forms of these instructions. This makes the reciprocal estimate instruction lowering identical to how we handle normal square roots: (V)SQRTPS / (V)SQRTPD. No existing regression tests fail with this patch. Differential Revision: http://reviews.llvm.org/D9301 llvm-svn: 236013	2015-04-28 18:48:45 +00:00
Eric Christopher	35a8a62125	Add a fixme to resetTargetOptions to explain why it needs to go away. llvm-svn: 236009	2015-04-28 18:09:05 +00:00
Eric Christopher	f4bf3779d8	Fix a [-Werror,-Winconsistent-missing-override] problem in the NVPTX overrides. llvm-svn: 236007	2015-04-28 18:06:27 +00:00
Tom Stellard	96301d2455	R600: Fix up for AsmPrinter's OutStreamer being a unique_ptr Fixes a crash with basically any OpenGL application using the radeonsi driver. Patch by: Michel Dänzer Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=90176 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 236004	2015-04-28 17:37:03 +00:00
Tom Stellard	0a0fa03d5a	R600/SI: Add a lower case alias for subtarget feature: +DumpCode llc converts all feature strings to lower case, while the LLVM C API does not, so we need a lower case alias in order to test this with llc. llvm-svn: 236003	2015-04-28 17:37:00 +00:00
Justin Holewinski	3d2a976197	[NVPTX] Handle addrspacecast constant expressions in aggregate initializers We need to track if an AddrSpaceCast expression was seen when generating an MCExpr for a ConstantExpr. This change introduces a custom lowerConstant method to the NVPTX asm printer that will create NVPTXGenericMCSymbolRefExpr nodes at the appropriate places to encode the information that a given symbol needs to be cast to a generic address. llvm-svn: 236000	2015-04-28 17:18:30 +00:00
David Blaikie	bdb4910202	[opaque pointer type] Encode the allocated type of an alloca rather than its pointer result type. llvm-svn: 235998	2015-04-28 16:51:01 +00:00
Sanjay Patel	ba55804ea3	move IR-level optimization flags into their own struct This is a preliminary step to using the IR-level floating-point fast-math-flags in the SDAG (D8900). In this patch, we introduce the optimization flags as their own struct. As noted in the TODO comment, we should eventually share this data between the IR passes and the backend. We also switch the existing nsw / nuw / exact bit functionality of the BinaryWithFlagsSDNode class to use the new struct. The tradeoff is that instead of using the free but limited space of SDNode's SubclassData, we add a data member to the subclass. This means we don't have to repeat all of the get/set methods per flag, but we're potentially adding size to all nodes of this subclass type. In practice on 64-bit systems (measured on Linux and MacOS X), there is no size difference between an SDNode and BinaryWithFlagsSDNode after this change: they're both 80 bytes. This means that we had at least one free byte to play with due to struct alignment. Differential Revision: http://reviews.llvm.org/D9325 llvm-svn: 235997	2015-04-28 16:39:12 +00:00
Rafael Espindola	a820169711	Use a std::vector to record the offsets of the sections. NFC. llvm-svn: 235995	2015-04-28 15:26:21 +00:00
Rafael Espindola	b64175076f	Avoid an extra loop for computing the section size. NFC. llvm-svn: 235994	2015-04-28 15:04:09 +00:00
Elena Demikhovsky	1f7b3644d3	Fixed crash of variable shift inst on AVX2 https://llvm.org/bugs/show_bug.cgi?id=22955 llvm-svn: 235993	2015-04-28 14:46:35 +00:00
Toma Tabacu	7dea2e3982	[mips] [IAS] Do not generate redundant ORi in createLShiftOri. Summary: If the immediate is 0, the ORi is pointless. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8969 llvm-svn: 235990	2015-04-28 14:06:35 +00:00
Sergey Dmitrouk	842a51bad8	Reapply r235977 "[DebugInfo] Add debug locations to constant SD nodes" [DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not a complete fix as FastISel contains a workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). This is a slightly different issue, though, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235989	2015-04-28 14:05:47 +00:00
Rafael Espindola	effdc7e981	Use CIE version 4 for dwarf4. According to http://www.dwarfstd.org/doc/DWARF4.pdf appendix F the CIE version for dwarf 4 is 4. llvm-svn: 235988	2015-04-28 13:55:31 +00:00
Daniel Jasper	48e93f7181	Revert "[DebugInfo] Add debug locations to constant SD nodes" This breaks a test: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/23870 llvm-svn: 235987	2015-04-28 13:38:35 +00:00
Toma Tabacu	6114565269	[mips] [IAS] Rename the createShiftOr function to createLShiftOri. NFC. Summary: The new name is more accurate with regard to the functionality. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8968 llvm-svn: 235984	2015-04-28 13:16:06 +00:00
Toma Tabacu	137d90ab88	[mips] [IAS] Store the expandLoadImm destination register in a variable. NFC. Summary: This removes multiple calls to getReg() and saves us column space in the source file. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8924 llvm-svn: 235978	2015-04-28 12:04:53 +00:00
Sergey Dmitrouk	adb4c69d5c	[DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not a complete fix as FastISel contains a workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235977	2015-04-28 11:56:37 +00:00
Elena Demikhovsky	ae51853924	AVX-512: Added "pandn" intrinsics set by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 235971	2015-04-28 08:12:42 +00:00
Elena Demikhovsky	584ce378ab	Masked gather and scatter: Added code for SelectionDAG. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 235970	2015-04-28 07:57:37 +00:00
David Blaikie	2a661cd062	[opaque pointer type] Encode the pointee type in the bitcode for 'cmpxchg' As a space optimization, this instruction would just encode the pointer type of the first operand and use the knowledge that the second and third operands would be of the pointee type of the first. When typed pointers go away, this assumption will no longer be available - so encode the type of the second operand explicitly and rely on that for the third. Test case added to demonstrate the backwards compatibility concern, which only comes up when the definition of the second operand comes after the use (hence the weird basic block sequence) - at which point the type needs to be explicitly encoded in the bitcode and the record length changes to accommodate this. llvm-svn: 235966	2015-04-28 04:30:29 +00:00
Ahmed Bougacha	190528703f	[MC] Use LShr for constant evaluation of ">>" on ELF/arm64--darwin. This matches other assemblers and is less unexpected (e.g. PR23227). On ELF, I tried binutils gas v2.24 and nasm 2.10.09, and they both agree on LShr. On COFF, I couldn't get my hands on an assembler yet, so don't change the behavior. For now, don't change it on non-AArch64 Darwin either, as the other assembler is gas v1.38, which does an AShr. llvm-svn: 235963	2015-04-28 01:37:11 +00:00
Duncan P. N. Exon Smith	1ec75ae963	DebugInfo: Support up to 2^16 arguments in a subprogram Support up to 2^16 arguments to a function. If we do hit the limit, assert out rather than restarting at 0 as we've done historically. This fixes PR23332. A clang test will follow. llvm-svn: 235955	2015-04-28 01:07:33 +00:00
Matthias Braun	eec4efcca5	Cleanup, remove unused return value llvm-svn: 235952	2015-04-28 00:37:05 +00:00
Ahmed Bougacha	177c14852b	[MC] Split MCBinaryExpr::Shr into LShr and AShr. Defaulting to AShr without consulting the target MCAsmInfo isn't OK. Add a flag to fix that. Keep it off for now: target migrations will follow in separate commits. llvm-svn: 235951	2015-04-28 00:21:32 +00:00
Ahmed Bougacha	457852f9f7	[MC] Move getBinOpPrecedence into AsmParser. NFC. In preparation for a future patch. llvm-svn: 235950	2015-04-28 00:17:39 +00:00
Hans Wennborg	7bf4d4eee0	Switch lowering: use uint32_t for weights everywhere I previously thought switch clusters would need to use uint64_t in case the weights of multiple cases overflowed a 32-bit int. It turns out that the weights on a terminator instruction are capped to allow for being added together, so using a uint32_t should be safe. llvm-svn: 235945	2015-04-27 23:52:19 +00:00
Duncan P. N. Exon Smith	5a490d0026	LTO: Add API to choose whether to embed uselists Reverse libLTO's default behaviour for preserving use-list order in bitcode, and add API for controlling it. The default setting is now `false` (don't preserve them), which is consistent with `clang`'s default behaviour. Users of libLTO should call `lto_codegen_should_embed_uselists(CG,true)` prior to calling `lto_codegen_write_merged_modules()` whenever the output file isn't part of the production workflow in order to reproduce results with subsequent calls to `llc`. (I haven't added tests since `llvm-lto` (the test tool for LTO) doesn't support bitcode output, and even if it did: there isn't actually a good way to test whether a tool has passed the flag. If the order is already "natural" (if the order will already round-trip) then no use-list directives are emitted at all. At some point I'll circle back to add tests to `llvm-as` (etc.) that they actually respect the flag, at which point I can somehow add a test here as well.) llvm-svn: 235943	2015-04-27 23:38:54 +00:00
Hans Wennborg	67c03759e4	Switch lowering: Take branch weight into account when ordering for fall-through Previously, the code would try to put a fall-through case last, even if that meant moving a case with much higher branch weight further down the chain. Ordering by branch weight is most important, putting a fall-through block last is secondary. llvm-svn: 235942	2015-04-27 23:35:22 +00:00
Duncan P. N. Exon Smith	7832e0a2f0	LTO: Simplify code generator initialization Simplify `LTOCodeGenerator` initialization by initializing simple fields at their definition. llvm-svn: 235939	2015-04-27 23:19:26 +00:00
Sanjay Patel	ca5ad5fb6d	remove obsolete pattern matches for scalar SSE ops The blendi pattern should always replace the insertps pattern after: http://reviews.llvm.org/rL232850 http://reviews.llvm.org/rL235124 llvm-svn: 235930	2015-04-27 22:23:17 +00:00
Rafael Espindola	a7c3163cdf	Use CIE version 1 for .eh_frame. According to http://www.linuxbase.org/betaspecs/lsb/LSB-Core-generic/LSB-Core-generic/ehframechpt.html we should always use 1. llvm-svn: 235923	2015-04-27 22:04:24 +00:00
Ahmed Bougacha	c004c60c0a	[AArch64] Also combine vector selects fed by non-i1 SETCCs. After legalization, scalar SETCC has an i32 result type on AArch64. The i1 requirement seems too conservative, replace it with an assert. This also means that we now can run after legalization. That should also be fine, since the ops legalizer runs again after each combine, and all types created all have the same sizes as the (legal) inputs. Exposed by r235917; while there, robustize its tests (bsl also uses the register it defines). llvm-svn: 235922	2015-04-27 21:43:12 +00:00
Pete Cooper	5b39524313	Add missing library dependency in libPDB. PDB uses COFFObjectFile::getPE32Header which lives in libObject. Make sure that LLVMBuild.txt reflects this dependency. llvm-svn: 235920	2015-04-27 21:23:12 +00:00
Ahmed Bougacha	89bba61c84	[AArch64] Don't assert when combining (v3f32 select (setcc f64)). When the setcc has f64 operands, we can't build a vector setcc mask to feed a vselect, because f64 doesn't divide v3f32 evenly. Just bail out when that happens. llvm-svn: 235917	2015-04-27 21:01:20 +00:00
Chris Bieneman	017ebf09d9	Fixes a hang that can occur if a signal comes in during malloc calls. We need to dereference the signals mutex during handler registration so that we force its construction. This is to prevent the first use being during handling an actual signal because you can't safely allocate memory in a signal handler. llvm-svn: 235914	2015-04-27 20:45:35 +00:00
Bill Schmidt	e71db85bed	Silence unused variable errors for no-asserts builds llvm-svn: 235913	2015-04-27 20:22:35 +00:00
Hans Wennborg	ba6d2568f9	Switch lowering: order bit tests by branch weight. llvm-svn: 235912	2015-04-27 20:21:17 +00:00
David Blaikie	1a848da518	[opaque pointer type] encode the pointee type of global variables Use a few extra bits in the const field (after widening it from a fixed single bit) to stash the address space which is no longer provided by the type (and an extra bit in there to specify that we're using that new encoding). llvm-svn: 235911	2015-04-27 19:58:56 +00:00
Bill Schmidt	fe723b9a6d	[PPC64LE] Remove unnecessary swaps from lane-insensitive vector computations This patch adds a new SSA MI pass that runs on little-endian PPC64 code with VSX enabled. Loads and stores of 4x32 and 2x64 vectors without alignment constraints are accomplished for little-endian using lxvd2x/xxswapd and xxswapd/stxvd2x. The existence of the additional xxswapd instructions hurts performance in comparison with big-endian code, but they are necessary in the general case to support correct semantics. However, the general case does not apply to most vector code. Many vector instructions are lane-insensitive; they do not "care" which lanes the parallel computations are performed within, provided that the resulting data is stored into the correct locations. Thus this pass looks for computations that perform only lane-insensitive operations, and removes the unnecessary swaps from loads and stores in such computations. Future improvements will allow computations using certain lane-sensitive operations to also be optimized in this manner, by modifying the lane-sensitive operations to account for the permuted order of the lanes. However, this patch only adds the infrastructure to permit this; no lane-sensitive operations are optimized at this time. This code is heavily exercised by the various vectorizing applications in the projects/test-suite tree. For the time being, I have only added one simple test case to demonstrate what the pass is doing. Although it is quite simple, it provides coverage for much of the code, including the special case handling of copies and subreg-to-reg operations feeding the swaps. I plan to add additional tests in the future as I fill in more of the "special handling" code. Two existing tests were affected, because they expected the swaps to be present, but they are now removed. llvm-svn: 235910	2015-04-27 19:57:34 +00:00
Sanjay Patel	8fd573e87f	fix 80-cols; NFC llvm-svn: 235902	2015-04-27 17:45:44 +00:00
Zachary Turner	e629530d3e	Fix build broken by incorrect class name. llvm-svn: 235901	2015-04-27 17:22:30 +00:00
Zachary Turner	20dbd0d0de	Make llvm-symbolizer work on Windows. Differential Revision: http://reviews.llvm.org/D9234 Reviewed By: Alexey Samsonov llvm-svn: 235900	2015-04-27 17:19:51 +00:00
Zachary Turner	c205570127	Make an RAII com initializer. Differential Revision: http://reviews.llvm.org/D9267 Reviewed By: Aaron Ballman, David Majnemer llvm-svn: 235898	2015-04-27 17:19:26 +00:00
Sanjay Patel	912315811e	fix typos; NFC llvm-svn: 235896	2015-04-27 17:03:31 +00:00
Toma Tabacu	bda745f532	[mips] Correct bytes to bits in 2 comments. NFC. llvm-svn: 235891	2015-04-27 15:21:38 +00:00
Elena Demikhovsky	a480ef5494	AVX-512: added calling conventions for i1 vectors. Fixed bug: https://llvm.org/bugs/show_bug.cgi?id=20724 llvm-svn: 235889	2015-04-27 15:11:19 +00:00
Brendon Cahoon	55bdeb7bc7	[Hexagon] Use constant extenders to fix up hardware loops Use a loop instruction with a constant extender for a hardware loop instruction that is too far away from the start of the loop. This is cheaper than changing the SA register value. Differential Revision: http://reviews.llvm.org/D9262 llvm-svn: 235882	2015-04-27 14:16:43 +00:00
Toma Tabacu	d9d344b485	[mips] [IAS] Improve warning for using AT with .set noat. Summary: Changed the warning message to show the current value of $at, similar to what clang does for typedef's, and renamed warnIfAssemblerTemporary to a more descriptive name. I also changed the type of variables which store registers from int to unsigned, updated the relevant test and tried to make the related comments clearer. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8479 llvm-svn: 235881	2015-04-27 14:05:04 +00:00
Vasileios Kalintiris	7a6b18783f	Reapply "[mips][FastISel] Implement shift ops for Mips fast-isel." This reapplies r235194, which was reverted in r235495 because it was causing a failure in our out-of-tree buildbots for MIPS. With the sign-extension patch in r235718, this patch doesn't cause any problem any more. llvm-svn: 235878	2015-04-27 13:28:05 +00:00
Toma Tabacu	b19cf2082f	[mips] [IAS] Rename getATRegNum and setATReg to {g,s}etATRegIndex. NFC. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8480 llvm-svn: 235877	2015-04-27 13:12:59 +00:00
Elena Demikhovsky	d1084c5b3f	AVX-512: Extend/Truncate operations for SKX, SETCC for bit-vectors llvm-svn: 235875	2015-04-27 12:57:59 +00:00
Toma Tabacu	217116e684	[MC] [IAS] Add support for the \@ .macro pseudo-variable. Summary: When used, it is substituted with the number of .macro instantiations we've done up to that point in time. So if this is the 1st time we've instantiated a .macro (any .macro, regardless of name), \@ will instantiate to 0, if it's the 2nd .macro instantiation, it will instantiate to 1 etc. It can only be used inside a .macro definition, an .irp definition or an .irpc definition (those last 2 uses are undocumented). Reviewers: echristo, rafael Reviewed By: rafael Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D9197 llvm-svn: 235862	2015-04-27 10:50:29 +00:00
Pawel Bylica	c25918a043	Constfold insertelement to undef when index is out-of-bounds Summary: This patch adds constant folding of insertelement instruction to undef value when index operand is constant and is not less than vector size or is undef. InstCombine does not support this case, but I'm happy to add it there also if this change is accepted. Test Plan: Unittests and regression tests for ConstProp pass. Reviewers: majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9287 llvm-svn: 235854	2015-04-27 09:30:49 +00:00
Simon Pilgrim	4f683c264a	[X86][SSE] Add v16i8/v32i8 multiplication support Patch to allow int8 vectors to be multiplied on the SSE unit instead of being scalarized. The patch sign extends the i8 lanes to i16, uses the SSE2 pmullw multiplication instruction, then packs the lower byte from each result. Differential Revision: http://reviews.llvm.org/D9115 llvm-svn: 235837	2015-04-27 07:55:46 +00:00
Philip Reames	20c24f1da5	Make the message associated with a fatal error slightly more helpful Looking into 23095, my best guess is that the CodeGen library itself isn't getting linked and initialized properly. To make this slightly more obvious to consumers of LLVM, emit a different error message if we can tell that the registry is empty vs you've simply happened to name a collector which hasn't been registered. llvm-svn: 235824	2015-04-26 22:00:34 +00:00
Philip Reames	63294cbb6a	[RewriteStatepointsForGC] Exclude constant values from being considered live at a safepoint There can be various constant pointers in the IR which do not get relocated at a safepoint. One example is the address of a global variable. Another example is a pointer created via inttoptr. Note that the optimizer itself likes to create such inttoptrs when locally propagating constants through dynamically dead code. To deal with this, we need to exclude uses of constants from contributing to the liveness of a safepoint which might reach that use. At some later date, it might be worth exploring what could be done to support the relocation of various special types of "constants", but that's future work. Differential Revision: http://reviews.llvm.org/D9236 llvm-svn: 235821	2015-04-26 19:48:03 +00:00
Philip Reames	2e78fa49a8	Don't Place Entry Safepoints Before the llvm.frameescape() Intrinsic llvm.frameescape() intrinsic is not a real call. The intrinsic can only exist in the entry block. Inserting a gc.statepoint() before llvm.frameescape() may split the entry block, and push the intrinsic out of the entry block. Patch by: Swaroop.Sridhar@microsoft.com Differential Revision: http://reviews.llvm.org/D8910 llvm-svn: 235820	2015-04-26 19:41:23 +00:00
Alexei Starovoitov	f26c748b1b	[bpf] fix build and remove a compiler warning in Release mode Patch by Brenden Blanco. llvm-svn: 235814	2015-04-26 01:58:08 +00:00
Sanjay Patel	c1d20a36fb	[x86] instcombine more cases of insertps into a shufflevector This is a follow-on to D8833 (insertps optimization when the zero mask is not used). In this patch, we check for the case where the zmask is used, but both input vectors to the insertps intrinsic are the same operand or the zmask overrides the destination lane. This lets us replace the 2nd shuffle input operand with the zero vector. Differential Revision: http://reviews.llvm.org/D9257 llvm-svn: 235810	2015-04-25 20:55:25 +00:00
Benjamin Kramer	a44b37e676	[ARM] Simplify code. NFC. llvm-svn: 235803	2015-04-25 17:25:13 +00:00
Benjamin Kramer	6246069c89	[hexagon] Use range-based for loops. No functionality change intended. llvm-svn: 235802	2015-04-25 14:46:53 +00:00
Benjamin Kramer	a37c809ce5	[hexagon] Remove setHexLibcallName, it leaks memory. Just spell out the full names, it's not that much more code. No functional change intended. llvm-svn: 235801	2015-04-25 14:46:46 +00:00
Andrew Kaylor	8c384bbb35	Fix build error from accidental change llvm-svn: 235792	2015-04-24 23:34:46 +00:00
Andrew Kaylor	8c79411203	[WinEH] Find correct cloned entry block for outlined handler functions. llvm-svn: 235791	2015-04-24 23:27:32 +00:00
Andrew Kaylor	5dacfd8b8a	[WinEH] Find correct cloned entry block for outlined handler functions. llvm-svn: 235789	2015-04-24 23:10:38 +00:00
Nico Weber	7d831a5731	Assert that Process::FindInEnvPath() is passed a relative path. It misbehaves with absolute paths. (So does path::append().) Goes with clang r235787. llvm-svn: 235788	2015-04-24 22:18:46 +00:00
Duncan P. N. Exon Smith	c8d987b121	Linker: Copy over function metadata attachments Update `lib/Linker` to handle `Function` metadata attachments. The attachments stick with the function body. llvm-svn: 235786	2015-04-24 22:07:31 +00:00
Duncan P. N. Exon Smith	3d4cd756b6	IR: Add assembly/bitcode support for function metadata attachments Add serialization support for function metadata attachments (added in r235783). The syntax is: define @foo() !attach !0 { Metadata attachments are only allowed on functions with bodies. Since they come before the `{`, they're not really part of the body; since they require a body, they're not really part of the header. In `LLParser` I gave them a separate function called from `ParseDefine()`, `ParseOptionalFunctionMetadata()`. In bitcode, I'm using the same `METADATA_ATTACHMENT` record used by instructions. Instruction metadata attachments are included in a special "attachment" block at the end of a `Function`. The attachment records are laid out like this: InstID (KindID MetadataID)+ Note that these records always have an odd number of fields. The new code takes advantage of this to recognize function attachments (which don't need an instruction ID): (KindID MetadataID)+ This means we can use the same attachment block already used for instructions. This is part of PR23340. llvm-svn: 235785	2015-04-24 22:04:41 +00:00
Duncan P. N. Exon Smith	327e9bd399	Verifier: Function metadata attachments require a body Add a verifier check that only functions with bodies have metadata attachments. This should help catch bugs in frontends and transformation passes. Part of PR23340. llvm-svn: 235784	2015-04-24 21:53:27 +00:00
Duncan P. N. Exon Smith	e2510cdfe8	IR: Add Function metadata attachments Add IR support for `Metadata` attachments. Assembly and bitcode support will follow shortly, but for now we just have unit tests. This is part of PR23340. llvm-svn: 235783	2015-04-24 21:51:02 +00:00
Duncan P. N. Exon Smith	27d702cb8b	LLParser: Simplify ParseInstructionMetadata(), NFC Remove unused `PFS` variable and take the `Instruction` by reference. (Not really related to PR23340, but might as well clean this up while I'm here.) llvm-svn: 235782	2015-04-24 21:29:36 +00:00
Quentin Colombet	8229145961	[DAGCombiner] Fix the type used in canFoldInAddressingMode to account for the right scaling. In the function canFoldInAddressingMode, VT is computed as the type of the destination/source of a LOAD/STORE operation, instead of the memory type of the operation. On targets with a scaling factor on the offset of the LOAD/STORE operations, the function may return false for actually valid cases. This may then prevent the selection of profitable pre or post indexed load/store operations, and instead select pre or post indexed load/store for unprofitable cases. Patch by Francois de Ferriere <francois.de-ferriere@st.com>! Differential Revision: http://reviews.llvm.org/D9146 llvm-svn: 235780	2015-04-24 21:28:00 +00:00
Duncan P. N. Exon Smith	19717ea416	LLParser: Split out ParseMetadataAttachment(), NFC Make the code reusable for `Function` metadata attachments (PR23340). llvm-svn: 235778	2015-04-24 21:21:57 +00:00
David Blaikie	5bacf3722c	[opaque pointer type] Verifier/AutoUpgrade: Remove a few uses of PointerType::getElementType llvm-svn: 235777	2015-04-24 21:16:07 +00:00
Duncan P. N. Exon Smith	d3e4c2a63f	AsmWriter: Parameterize the syntactic separator for attachments Parameterize the separator for attachments, since `Function` metadata attachments (PR23340) aren't going to use a `,` (comma). No real functionality change. llvm-svn: 235775	2015-04-24 21:06:21 +00:00
Duncan P. N. Exon Smith	e30f10e671	AsmWriter: Only collect attachment names once per module Collect metadata names once per `AssemblyWriter` instead of every time we need to print some attachments. Just a drive-by; this caught my eye while I was refactoring the code in r235772. llvm-svn: 235774	2015-04-24 21:03:05 +00:00
Kaelyn Takata	5e5524bc25	Remove an unused variable to prevent -Werror build failures. llvm-svn: 235773	2015-04-24 21:02:18 +00:00
Duncan P. N. Exon Smith	86979275aa	AsmWriter: Split out code for printing Metadata attachments, NFC Refactor the code for printing `Instruction` metadata attachments so it can be reused for `Function`. llvm-svn: 235772	2015-04-24 20:59:52 +00:00
Hans Wennborg	86ac630585	SimplifyCFG: Correctly handle switch lookup tables which fully cover the input type and use bit tests to check for holes When using bit tests for hole checks, we call AddPredecessorToBlock to give the phi node a value from the bit test block. This would break if we've previously called removePredecessor on the default destination because the switch is fully covered. Test case by Mark Lacey. llvm-svn: 235771	2015-04-24 20:57:56 +00:00
Duncan P. N. Exon Smith	4472b776b0	IR: Use a bitmask to access GlobalObject subclass data Make room for more than just `Function::isMaterializable()` in the `GlobalObject` subclass data bitfield. Since we're treating it like a bitfield, change `Function::Function()` to zero-out the whole thing. llvm-svn: 235770	2015-04-24 20:47:23 +00:00
Duncan P. N. Exon Smith	cbc28dc549	IR: Extract set logic from Instruction attachments, NFC Extract the set logic for metadata attachments from `Instruction` so it can be reused for `Function` (PR23340). This data structure makes a `SmallVector<>` look (a little) like a map, just doing the bare minimum to support the `Instruction` (and soon, `Function`) metadata API. llvm-svn: 235769	2015-04-24 20:36:25 +00:00
Reid Kleckner	cfbfe6f29c	[SEH] Implement GetExceptionCode in __except blocks This introduces an intrinsic called llvm.eh.exceptioncode. It is lowered by copying the EAX value live into whatever basic block it is called from. Obviously, this only works if you insert it late during codegen, because otherwise mid-level passes might reschedule it. llvm-svn: 235768	2015-04-24 20:25:05 +00:00
Duncan P. N. Exon Smith	75ef0c09d0	IR: Use remove_if for Instruction::dropUnknownMetadata() Technically the operations are different -- the old logic moved items from the back into the opened-up slots, instead of the usual `remove_if()` logic of a slow and a fast iterator -- but unless a profile tells us otherwise I prefer the simpler logic here. Regardless, there shouldn't be an observable function change. llvm-svn: 235767	2015-04-24 20:23:44 +00:00
Duncan P. N. Exon Smith	14a384bee6	IR: Remove MDMapTy and MDPairTy typedefs, NFC Remove some typedefs in preparation for factoring out attachment logic from `Instruction`. llvm-svn: 235764	2015-04-24 20:19:13 +00:00
Duncan P. N. Exon Smith	391fc56ffd	IR: Rename LLVMContextImpl::MetadataStore to InstructionMetadata Rename `MetadataStore` to the more explicit `InstructionMetadata`. This will make room for `FunctionMetadata` (start of PR23340). llvm-svn: 235763	2015-04-24 20:16:42 +00:00
David Blaikie	445e3fbc54	[opaque pointer type] Add textual IR support for explicit type parameter to the invoke instruction Same as r235145 for the call instruction - the justification, tradeoffs, etc are all the same. The conversion script worked the same without any false negatives (after replacing 'call' with 'invoke'). llvm-svn: 235755	2015-04-24 19:32:54 +00:00
Lang Hames	9ff69c8f4d	[AsmPrinter] Make AsmPrinter's OutStreamer member a unique_ptr. AsmPrinter owns the OutStreamer, so an owning pointer makes sense here. Using a reference for this is crufty. llvm-svn: 235752	2015-04-24 19:11:51 +00:00
David Blaikie	5ea1f7b744	[opaque pointer type] bitcode: add explicit callee type to invoke instructions llvm-svn: 235735	2015-04-24 18:06:06 +00:00
Andrew Kaylor	08c5f1efc1	Fix LoopInterchange/reductions.ll test for debug builds llvm-svn: 235734	2015-04-24 17:39:16 +00:00
Hans Wennborg	ec679a8b3b	Switch lowering: fix APInt overflow causing infinite loop / OOM llvm-svn: 235729	2015-04-24 16:53:55 +00:00
Reid Kleckner	2c3ccaacb7	[WinEH] Split the landingpad BB instead of cloning it This means we don't have to RAUW the landingpad instruction and landingpad BB, which is a nice win. llvm-svn: 235725	2015-04-24 16:22:19 +00:00
Diego Novillo	af9fdb9dcf	Fix typo in comment. llvm-svn: 235723	2015-04-24 15:46:41 +00:00
Yaron Keren	24a86df13e	Use the cleaner syntax value initialization to zero initialize POD structs. Suggestion from David Blaikie! llvm-svn: 235721	2015-04-24 15:39:47 +00:00

79660 Commits