llvm-project

Commit Graph

Author	SHA1	Message	Date
Jan Vesely	452b036697	R600: Make FMIN/MAXNUM legal on all asics v2: Add tests Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> reviewer: arsenm llvm-svn: 234716	2015-04-12 23:45:05 +00:00
Jan Vesely	811ef52db7	R600: remove manual BFE optimization Fixed since r233079 Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> reviewer: arsenm llvm-svn: 234715	2015-04-12 23:45:01 +00:00
Petr Hosek	9e0c890f3e	[MC] Write padding into fragments when -mc-relax-all flag is used Summary: When instruction bundling is enabled and the -mc-relax-all flag is set, we can write bundle padding directly into fragments and avoid creating large number of fragments significantly reducing LLVM MC memory usage. Test Plan: Regression test attached Reviewers: eliben Subscribers: jfb, mseaborn Differential Revision: http://reviews.llvm.org/D8072 llvm-svn: 234714	2015-04-12 23:42:25 +00:00
Lang Hames	c6de458f7c	[Orc] During module partitioning, rename anonymous and asm-private globals. If they're not (re)named, these globals will fail to resolve when the partitioned modules are linked. llvm-svn: 234707	2015-04-12 20:05:51 +00:00
Mark Lacey	274f48b5a8	Fix typo. llvm-svn: 234706	2015-04-12 18:18:51 +00:00
Hal Finkel	5551f2528c	[PowerPC] Really iterate over all loops in PPCLoopDataPrefetch/PPCLoopPreIncPrep When I fixed these a couple of days ago to iterate over all loops, not just depth == 1 loops, I inadvertently made it such that we'd only look at the first top-level loop. Make sure that we really look at all of them. llvm-svn: 234705	2015-04-12 17:18:56 +00:00
Sanjoy Das	71190feca5	[LoopUnrollRuntime] Clean up a predicate. Clean up a predicate I added in r229731, fix the relevant comment and add a test case. The earlier version is confusing to read and was also buggy (probably not a coincidence) till Alexey fixed it in r233881. llvm-svn: 234701	2015-04-12 01:24:01 +00:00
Duncan P. N. Exon Smith	7ad0bd54d3	DebugInfo: Make MDSubprogram::getFunction() return Constant Change `MDSubprogram::getFunction()` and `MDGlobalVariable::getConstant()` to return a `Constant`. Previously, both returned `ConstantAsMetadata`. llvm-svn: 234699	2015-04-11 20:27:40 +00:00
Duncan P. N. Exon Smith	5ad6ff76dc	Verifier: Check for incompatible bit piece expressions Convert an assertion into a `Verifier` check. Bit piece expressions must fit inside the variable, and mustn't be the entire variable. Catching this in the verifier will help us find bugs sooner, and makes `DIVariable::getSizeInBits()` dead code. llvm-svn: 234698	2015-04-11 19:58:35 +00:00
Duncan P. N. Exon Smith	127ea4b616	DebugInfo: Remove dead DIDescriptor::replaceAllUsesWith() r234696 replaced the only use of `DIDescriptor::replaceAllUsesWith()` with `DIBuilder::replaceTemporary()` (added in r234695). Delete the dead code. llvm-svn: 234697	2015-04-11 19:29:09 +00:00
Benjamin Kramer	79de6e6d89	Mark empty default constructors as =default if it makes the type POD NFC llvm-svn: 234694	2015-04-11 18:57:14 +00:00
Duncan P. N. Exon Smith	19f87726d5	DebugInfo: Assume a valid pointer for DISubprogram::getFunction() llvm-svn: 234693	2015-04-11 18:15:48 +00:00
Duncan P. N. Exon Smith	f0d81a50bf	DebugInfo: Move DIScope::getName() and getContext() to MDScope Continue gutting the `DIDescriptor` hierarchy. In this case, move the guts of `DIScope::getName()` and `DIScope::getContext()` to `MDScope::getName()` and `MDScope::getScope()`. llvm-svn: 234691	2015-04-11 17:37:23 +00:00
Benjamin Kramer	dd0ff85701	Remove empty non-virtual destructors or mark them =default when non-public These add no value but can make a class non-trivially copyable. NFC. llvm-svn: 234688	2015-04-11 15:32:26 +00:00
Hal Finkel	58f5f9c393	[PowerPC] Disable part-word atomics on the P7 As it turns out, even though these are part of ISA 2.06, the P7 does not support them (or, at least, not any P7s we're tested so far). llvm-svn: 234686	2015-04-11 13:40:36 +00:00
Nemanja Ivanovic	c38b5311cb	Add direct moves to/from VSR and exploit them for FP/INT conversions This patch corresponds to review: http://reviews.llvm.org/D8928 It adds direct move instructions to/from VSX registers to GPR's. These are exploited for FP <-> INT conversions. llvm-svn: 234682	2015-04-11 10:40:42 +00:00
Alexander Kornienko	f817c1cb9a	Use 'override/final' instead of 'virtual' for overridden methods The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' \ -j=32 -fix -format http://reviews.llvm.org/D8925 llvm-svn: 234679	2015-04-11 02:11:45 +00:00
Duncan P. N. Exon Smith	63ffa21d90	DebugInfo: Rewrite atSameLineAs() as MDLocation::canDiscriminate() Rewrite `DILocation::atSameLineAs()` as `MDLocation::canDiscriminate()` with a doxygen comment explaining its purpose. I've added a few FIXMEs where I think this check is too weak; fixing that is tracked by PR23199. llvm-svn: 234674	2015-04-11 01:00:47 +00:00
Duncan P. N. Exon Smith	50c065bc7d	DebugInfo: Add forwarding getFilename() accessor to new hierarchy Add forwarding `getFilename()` and `getDirectory()` accessors to nodes in the new hierarchy that define a `getFile()`. Use that to re-implement existing functionality in the `DIDescriptor` hierarchy. llvm-svn: 234671	2015-04-11 00:39:43 +00:00
Hal Finkel	ffb460fdf0	[PowerPC] Fix PPCLoopPreIncPrep for depth > 1 loops This pass had the same problem as the data-prefetching pass: it was only checking for depth == 1 loops in practice. Fix that, add some debugging statements, and make sure that, when we grab an AddRec, it is for the loop we expect. llvm-svn: 234670	2015-04-11 00:33:08 +00:00
Lang Hames	eb9bdb5d16	[Orc] Tidy up IndirectionUtils API a little, add some comments. NFC. llvm-svn: 234669	2015-04-11 00:23:49 +00:00
Philip Reames	9638ff9b37	[Statepoints] Fix a release only build failure A function which is used only in Asserts builds needs to be defined only in Asserts builds. llvm-svn: 234667	2015-04-11 00:06:47 +00:00
Ahmed Bougacha	b96444efd1	[CodeGen] Split -enable-global-merge into ARM and AArch64 options. Currently, there's a single flag, checked by the pass itself. It can't force-enable the pass (and is on by default), because it might not even have been created, as that's the targets decision. Instead, have separate explicit flags, so that the decision is consistently made in the target. Keep the flag as a last-resort "force-disable GlobalMerge" for now, for backwards compatibility. llvm-svn: 234666	2015-04-11 00:06:36 +00:00
Duncan P. N. Exon Smith	241982c380	DebugInfo: Remove dead DIDescriptor::getDescriptorField() llvm-svn: 234665	2015-04-10 23:53:44 +00:00
Quentin Colombet	fd7475b5e8	[AArch64] Strengthen the code for the prologue insertion. The spilled registers are pristine and thus, correctly handled by the register scavenger and so on, but the liveness information is strictly speaking wrong at this point. Fix that. llvm-svn: 234664	2015-04-10 23:14:34 +00:00
Reid Kleckner	9405ef0e1f	[WinEH] Recognize SEH finally block inserted by the frontend This allows winehprepare to build sensible llvm.eh.actions calls for SEH finally blocks. The pattern matching in this change is brittle and should be replaced with something more robust soon. In the meantime, this will let us write the code that produces __C_specific_handler xdata tables, which we need regardless of how we decide to get finally blocks through EH preparation. llvm-svn: 234663	2015-04-10 23:12:29 +00:00
Philip Reames	4d80ede538	[RewriteStatepointsForGC] Use a SetVector for a worklist [NFC] Using a SetVector to replace equivelent but more verbose functionality. llvm-svn: 234662	2015-04-10 23:11:26 +00:00
Philip Reames	df1ef08c0c	[RewriteStatepointsForGC] Use an actual liveness algorithm When rewriting statepoints to make relocations explicit, we need to have a conservative but consistent notion of where a particular pointer is live at a particular site. The old code just used dominance, which is correct, but decidedly more conservative then it needed to be. This patch implements a simple dataflow algorithm that's run one per function (well, twice counting fixup after base pointer insertion). There's still lots of room to make this faster, but it's fast enough for all practical purposes today. Differential Revision: http://reviews.llvm.org/D8674 llvm-svn: 234657	2015-04-10 22:53:14 +00:00
Philip Reames	704e78b149	[RewriteStatepointsForGC] clang-format file Format the entire file to reduce diff of change to follow. llvm-svn: 234656	2015-04-10 22:34:56 +00:00
Benjamin Kramer	b4bf14ceaa	[CodeGenPrepare] Report all changes made during instruction sinking r234638 chained another transform below which was tripping over the deleted instruction. Use after free found by asan in many regression tests. llvm-svn: 234654	2015-04-10 22:25:36 +00:00
Philip Reames	f66d73708b	[RewriteStatepointsForGC] Missed review comment from 234651 & build fix After submitting 234651, I noticed I hadn't responded to a review comment by mjacob. This patch addresses that comment and fixes a Release only build problem due to an unused variable. llvm-svn: 234653	2015-04-10 22:16:58 +00:00
Philip Reames	85b36a8157	[RewriteStatepointsForGC] Preprocess the IR to remove unreachable blocks and single entry phis Two related small changes: Various dominance based queries about liveness can get confused if we're talking about unreachable blocks. To avoid reasoning about such cases, just remove them before rewriting statepoints. Remove single entry phis (likely left behind by LCSSA) to reduce the number of live values. Both of these are motivated by http://reviews.llvm.org/D8674 which will be submitted shortly. Differential Revision: http://reviews.llvm.org/D8675 llvm-svn: 234651	2015-04-10 22:07:04 +00:00
Philip Reames	8531d8c491	[RewriteStatepointsForGC] Limited support for vectors of pointers This patch adds limited support for inserting explicit relocations when there's a vector of pointers live over the statepoint. This doesn't handle the case where the vector contains a mix of base and non-base pointers; that's future work. The current implementation just scalarizes the vector over the gc.statepoint before doing the explicit rewrite. An alternate approach would be to plumb the vector all the way though the backend lowering, but doing that appears challenging. In particular, the size of the indirect spill slot is currently assumed to be sizeof(pointer) throughout the backend. In practice, this is enough to allow running the SLP and Loop vectorizers before RewriteStatepointsForGC. Differential Revision: http://reviews.llvm.org/D8671 llvm-svn: 234647	2015-04-10 21:48:25 +00:00
Sanjoy Das	b6c5914308	[InstCombine][CodeGenPrep] Create llvm.uadd.with.overflow in CGP. Summary: This change moves creating calls to `llvm.uadd.with.overflow` from InstCombine to CodeGenPrep. Combining overflow check patterns into calls to the said intrinsic in InstCombine inhibits optimization because it introduces an intrinsic call that not all other transforms and analyses understand. Depends on D8888. Reviewers: majnemer, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8889 llvm-svn: 234638	2015-04-10 21:07:09 +00:00
Rafael Espindola	74d2f617f2	Remember if lseek works in this FD. It will be used in clang in a sec. llvm-svn: 234619	2015-04-10 18:15:51 +00:00
Duncan P. N. Exon Smith	457bfc779a	DebugInfo: Stop leaking temporaries in DIBuilder::createCompileUnit() Stop leaking temporary nodes from `DIBuilder::createCompileUnit()`. `replaceAllUsesWith()` doesn't delete the nodes, so we need to delete them "manually" (well, `TempMDTuple` does that for us). Similarly, stop leaking the temporary nodes used for variables of subprograms. llvm-svn: 234617	2015-04-10 18:01:58 +00:00
Rafael Espindola	a4800720b0	Have one raw_fd_ostream constructor forward to the other. This fixes some odd behavior differences between the two. In particular, the version that takes a FD no longer unconditionally sets stdout to binary. llvm-svn: 234615	2015-04-10 17:52:22 +00:00
Reid Kleckner	d03f9f4016	[FS] Report errors from llvm::sys::fs::rename on Windows Previously we would always report success, which is pretty bogus. I'm too lazy to write a test where rename will portably fail on all platforms. I'm just trying to fix breakage introduced by r234597, which happened to tickle this. llvm-svn: 234611	2015-04-10 17:20:45 +00:00
Reid Kleckner	6e48a826e8	[WinEH] Try to make outlining invokes work a little better WinEH currently turns invokes into calls. Long term, we will reconsider this, but for now, make sure we remap the operands and clone the successors of the new terminator. llvm-svn: 234608	2015-04-10 16:26:42 +00:00
Hal Finkel	a9fceb803d	[PowerPC] Prefetching should also consider depth > 1 loops Iterating over loops from the LoopInfo instance only provides top-level loops. We need to search the whole tree of loops to find the inner ones. llvm-svn: 234603	2015-04-10 15:05:02 +00:00
Benjamin Kramer	3a09ef64ee	[CallSite] Make construction from Value* (or Instruction) explicit. CallSite roughly behaves as a common base CallInst and InvokeInst. Bring the behavior closer to that model by making upcasts explicit. Downcasts remain implicit and work as before. Following dyn_cast as a mental model checking whether a Value V isa CallSite now looks like this: if (auto CS = CallSite(V)) // think dyn_cast instead of: if (CallSite CS = V) This is an extra token but I think it is slightly clearer. Making the ctor explicit has the advantage of not accidentally creating nullptr CallSites, e.g. when you pass a Value * to a function taking a CallSite argument. llvm-svn: 234601	2015-04-10 14:50:08 +00:00
Toma Tabacu	ae47f93b74	[mips] [IAS] Improve comments in MipsAsmParser::expandLoadImm. NFC. llvm-svn: 234595	2015-04-10 13:28:16 +00:00
Chad Rosier	518659d9b4	[AArch64] Changes some SchedAlias to WriteRes for Cortex-A57. Using SchedAliases is convenient and works well for latency and resource lookup for instructions. However, this creates an entry in AArch64WriteLatencyTable with a WriteResourceID of 0, breaking any SchedReadAdvance since the lookup will fail. http://reviews.llvm.org/D8043 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 234594	2015-04-10 13:19:27 +00:00
Chad Rosier	a82c876045	[AArch64] Adjusts Cortex-A57 machine model to handle zero shift. http://reviews.llvm.org/D8043 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 234593	2015-04-10 13:19:21 +00:00
Benjamin Kramer	619c4e57ba	Reduce dyn_cast<> to isa<> or cast<> where possible. No functional change intended. llvm-svn: 234586	2015-04-10 11:24:51 +00:00
Jingyue Wu	5da831cc31	Divergence analysis for GPU programs Summary: Some optimizations such as jump threading and loop unswitching can negatively affect performance when applied to divergent branches. The divergence analysis added in this patch conservatively estimates which branches in a GPU program can diverge. This information can then help LLVM to run certain optimizations selectively. Test Plan: test/Analysis/DivergenceAnalysis/NVPTX/diverge.ll Reviewers: resistor, hfinkel, eliben, meheff, jholewinski Subscribers: broune, bjarke.roune, madhur13490, tstellarAMD, dberlin, echristo, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8576 llvm-svn: 234567	2015-04-10 05:03:50 +00:00
David Majnemer	5c65f58f64	[WinEHPrepare] Don't rely on the order of IR The IPToState table must be emitted after we have generated labels for all functions in the table. Don't rely on the order of the list of globals. Instead, utilize WinEHFuncInfo to tell us how many catch handlers we expect to outline. Once we know we've visited all the catch handlers, emit the cppxdata. llvm-svn: 234566	2015-04-10 04:56:17 +00:00
Hal Finkel	93138503ae	[PowerPC] Don't crash on PPC32 i64 fp_to_uint on modern cores When we have an instruction for this (and, thus, don't generate a runtime call), we need to custom type legalize this (in a trivial way, just as we do for fp_to_sint). Fixes PR23173. llvm-svn: 234561	2015-04-10 03:39:00 +00:00
Ahmed Bougacha	1ffe7c7d36	[AArch64] Promote f16 operations to f32. For the most common ones (such as fadd), we already did the promotion. Do the same thing for all the others. Currently, we'll just crash/assert on all these operations, as there's no hardware or libcall support whatsoever. f16 (half) is specified as an interchange - not arithmetic - format, and is expected to be promoted to single-precision for arithmetic operations. While there, teach the legalizer about promoting some of the (mostly floating-point) operations that we never needed before. Differential Revision: http://reviews.llvm.org/D8648 See related discussion on the thread for: http://reviews.llvm.org/D8755 llvm-svn: 234550	2015-04-10 00:08:48 +00:00
Nemanja Ivanovic	c09047916a	Add LLVM support for remaining integer divide and permute instructions from ISA 2.06 This is the patch corresponding to review: http://reviews.llvm.org/D8406 It adds some missing instructions from ISA 2.06 to the PPC back end. llvm-svn: 234546	2015-04-09 23:54:37 +00:00
Rafael Espindola	5682ce2ceb	Simplify use of formatted_raw_ostream. formatted_raw_ostream is a wrapper over another stream to add column and line number tracking. It is used only for asm printing. This patch moves the its creation down to where we know we are printing assembly. This has the following advantages: * Simpler lifetime management: std::unique_ptr * We don't compute column and line number of object files :-) llvm-svn: 234535	2015-04-09 21:06:08 +00:00
Ahmed Bougacha	df43737782	[CodeGen] Combine concat_vector of trunc'd scalar to scalar_to_vector. We already do: concat_vectors(scalar, undef) -> scalar_to_vector(scalar) When the scalar is legal. When it's not, but is a truncated legal scalar, we can also do: concat_vectors(trunc(scalar), undef) -> scalar_to_vector(scalar) Which is equivalent, since the upper lanes are undef anyway. While there, teach the combine to look at more than 2 operands. Differential Revision: http://reviews.llvm.org/D8883 llvm-svn: 234530	2015-04-09 20:04:47 +00:00
Juergen Ributzka	bd0c7eb4dc	[AArch64][FastISel] Fix integer extend optimization. The integer extend optimization tries to fold the extend into the load instruction. This requires us to identify if the extend has already been emitted or not and act accordingly on it. The check that was originally performed for this was not sufficient. Besides checking the ValueMap for a mapped register we also need to check if the virtual register has already an associated machine instruction that defines it. This fixes rdar://problem/20470788. llvm-svn: 234529	2015-04-09 20:00:46 +00:00
Eric Christopher	fbe80f5f63	Remove duplicated code and consolidate initializers. llvm-svn: 234525	2015-04-09 19:20:37 +00:00
Rafael Espindola	49286e9f4a	clang-format bits of code to make a followup patch easy to read. llvm-svn: 234519	2015-04-09 18:32:58 +00:00
Rafael Espindola	1c84271694	Revert "Refactoring and enhancement to FMA combine." This reverts commit r234513. It was failing on the bots. llvm-svn: 234518	2015-04-09 18:29:32 +00:00
Rafael Espindola	37099d9afd	Define a function with "... llvm::func...". Using this instead of namespace llvm { func... } Has the advantage that the build fails with a compiler error if it gets out of sync with the .h file. llvm-svn: 234515	2015-04-09 18:08:15 +00:00
Olivier Sallenave	53703d0862	Refactoring and enhancement to FMA combine. llvm-svn: 234513	2015-04-09 17:55:26 +00:00
Duncan P. N. Exon Smith	628d6057d0	IR: Preserve use-list order by default in bitcode Pull the `-preserve-*-use-list-order` flags out of "experimental" mode, and preserve use-list order by default when serializing to bitcode. llvm-svn: 234510	2015-04-09 17:41:20 +00:00
Rafael Espindola	f546d0f9c5	Use a raw_svector_ostream instead of a raw_string_ostream. It saves a bit of copying. llvm-svn: 234507	2015-04-09 17:16:25 +00:00
Rafael Espindola	df7305a438	Don't repeat name in comment. NFC. llvm-svn: 234506	2015-04-09 17:10:57 +00:00
Rafael Espindola	62e6ec066e	Misc cleanup. NFC. These were lost when I reverted the raw_ostream changes. llvm-svn: 234504	2015-04-09 16:59:07 +00:00
Rafael Espindola	ee0dd4d289	This reverts commit r234460 and r234461. Revert "Add classof implementations to the raw_ostream classes." Revert "Use the cast machinery to remove dummy uses of formatted_raw_ostream." The underlying issue can be fixed without classof. llvm-svn: 234495	2015-04-09 15:54:59 +00:00
Javed Absar	5c5e3c5e36	[ARM] support for Cortex-R4/R4F Currently, llvm (backend) doesn't know cortex-r4, even though it is the default target for armv7r. Using "--target=armv7r-arm-none-eabi" provokes 'cortex-r4' is not a recognized processor for this target' by llvm. This patch adds support for cortex-r4 and, very closely related, r4f. llvm-svn: 234486	2015-04-09 14:07:28 +00:00
Rafael Espindola	3e608b0f0c	Nothing inherits from the asm streamer. Make that explicit and remove protected: llvm-svn: 234484	2015-04-09 13:04:20 +00:00
Toma Tabacu	be218927f8	[mips] Refactor saved-registers bitmask creation in MipsAsmPrinter::printSavedRegsBitmask. NFC. Summary: Make the code more readable by fusing the for-loops together and explicitly checking for each register class. Also, this version is more straightforward because it doesn't assume that FPU registers always come before CPU registers in the CalleeSavedInfo vector. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8033 llvm-svn: 234475	2015-04-09 10:54:16 +00:00
Kristof Beyls	17cb8982f4	[AArch64] Add support for dynamic stack alignment Differential Revision: http://reviews.llvm.org/D8876 llvm-svn: 234471	2015-04-09 08:49:47 +00:00
Lang Hames	522bf13b83	[AArch64] Remove redundant -march option. Also fix a think-o from r234462. llvm-svn: 234467	2015-04-09 05:34:57 +00:00
Lang Hames	903338511b	[AArch64] Teach AArch64TargetLowering::getOptimalMemOpType to consider alignment restrictions when choosing a type for small-memcpy inlining in SelectionDAGBuilder. This ensures that the loads and stores output for the memcpy won't be further expanded during legalization, which would cause the total number of instructions for the memcpy to exceed (often significantly) the inlining thresholds. <rdar://problem/17829180> llvm-svn: 234462	2015-04-09 03:40:33 +00:00
Rafael Espindola	132381f981	Use the cast machinery to remove dummy uses of formatted_raw_ostream. If we know we are producing an object, we don't need to wrap the stream in a formatted_raw_ostream anymore. llvm-svn: 234461	2015-04-09 02:28:12 +00:00
Rafael Espindola	0a261a3dda	Add classof implementations to the raw_ostream classes. More uses to follow in a another patch. llvm-svn: 234460	2015-04-09 02:10:28 +00:00
Manman Ren	ed6b5fc4b0	[LTO] do not run internalize pass from compileOptimized. The input to compileOptimized is already optimized and internalized, so remove internalize pass from compileOptimized. rdar://20227235 llvm-svn: 234446	2015-04-08 22:02:11 +00:00
Andrew Kaylor	e104d89c8f	Formmatting correction llvm-svn: 234438	2015-04-08 21:22:46 +00:00
Andrew Kaylor	67d3c0359d	[WinEH] Minor bug fixes. Fixed insert point for allocas created for demoted values. Clear the nested landing pad list after it has been processed. llvm-svn: 234433	2015-04-08 20:57:22 +00:00
Akira Hatanaka	c6fab80536	[DAGCombine] Fix a bug in MergeConsecutiveStores. The bug manifests when there are two loads and two stores chained as follows in a DAG, (ld v3f32) -> (st f32) -> (ld v3f32) -> (st f32) and the stores' values are extracted from the preceding vector loads. MergeConsecutiveStores would replace the first store in the chain with the merged vector store, which would create a cycle between the merged store node and the last load node that appears in the chain. This commits fixes the bug by replacing the last store in the chain instead. rdar://problem/20275084 Differential Revision: http://reviews.llvm.org/D8849 llvm-svn: 234430	2015-04-08 20:34:53 +00:00
Rafael Espindola	9a07495d6d	Remove unused variable. llvm-svn: 234426	2015-04-08 20:04:20 +00:00
Cameron Zwarich	b282ef0111	Eliminate O(n^2) worst-case behavior in SSA construction The code uses a priority queue and a worklist, which share the same visited set, but the visited set is only updated when inserting into the priority queue. Instead, switch to using separate visited sets for the priority queue and worklist. llvm-svn: 234425	2015-04-08 18:26:20 +00:00
Adam Nemet	ce48250f11	[LoopAccesses] Allow analysis to complete in the presence of uniform stores (Re-apply r234361 with a fix and a testcase for PR23157) Both run-time pointer checking and the dependence analysis are capable of dealing with uniform addresses. I.e. it's really just an orthogonal property of the loop that the analysis computes. Run-time pointer checking will only try to reason about SCEVAddRec pointers or else gives up. If the uniform pointer turns out the be a SCEVAddRec in an outer loop, the run-time checks generated will be correct (start and end bounds would be equal). In case of the dependence analysis, we work again with SCEVs. When compared against a loop-dependent address of the same underlying object, the difference of the two SCEVs won't be constant. This will result in returning an Unknown dependence for the pair. When compared against another uniform access, the difference would be constant and we should return the right type of dependence (forward/backward/etc). The changes also adds support to query this property of the loop and modify the vectorizer to use this. Patch by Ashutosh Nema! llvm-svn: 234424	2015-04-08 17:48:40 +00:00
Scott Douglass	7ad7792088	[ARM] make vminnm/vmaxnm work with ?le, ?ge and no-nans-fp-math Because -menable-no-nans causes fcmp conditions to be rewritten without 'o' or 'u' the recognition code in needs to cope. Also extended it to handle 'le' and 'ge. Differential Revision: http://reviews.llvm.org/D8725 llvm-svn: 234421	2015-04-08 17:18:28 +00:00
Toma Tabacu	c6ce0749ad	[mips] [IAS] Do not generate redundant move when expanding lw/sw with symbol. Summary: Even though there is no 2nd register operand in the "lw/sw $8, symbol" case, we still try to find one, and we end up with $0, which makes us generate an unnecessary "addu $8, $8, $0" (a.k.a. "move $8, $8"). We can avoid this by checking if the 2nd register operand is different from $0, before generating the addu. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8055 llvm-svn: 234406	2015-04-08 13:52:41 +00:00
Benjamin Kramer	e9929edd8c	[jitlistener] Remove unused code llvm-svn: 234404	2015-04-08 13:17:48 +00:00
Toma Tabacu	91fc0b3c10	[mips] [IAS] Add support for the BNEZL and BEQZL pseudo-instructions. Summary: They are of the form "bnezl/beqzl $rs, offset" and expand to "bnel/beql $rs, $zero, offset". These instructions are used in Linux inline assembly. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8540 llvm-svn: 234401	2015-04-08 12:15:05 +00:00
Rafael Espindola	7230f80e3b	Write the section header in the end. One could make the argument for writing it immediately after the ELF header, but writing it in the middle of the sections like we were doing just makes it harder for no reason. llvm-svn: 234400	2015-04-08 11:41:24 +00:00
Sergey Dmitrouk	3cc62b3715	[ARM][Debug Info] Restore emitting of .cfi_def_cfa_offset for functions without stack frame Summary: Looks like new code from [[ http://reviews.llvm.org/rL222057 \| rL222057 ]] doesn't account for early `return` in `ARMFrameLowering::emitPrologue`, which leads to loosing `.cfi_def_cfa_offset` directive for functions without stack frame. Reviewers: echristo, rengolin, asl, t.p.northover Reviewed By: t.p.northover Subscribers: llvm-commits, rengolin, aemerson Differential Revision: http://reviews.llvm.org/D8606 llvm-svn: 234399	2015-04-08 10:10:12 +00:00
Toma Tabacu	7567a10c47	[mips] [IAS] Remove AssemblerPredicate's from RelocPIC and RelocStatic. Summary: These AssemblerPredicate's are unnecessary and actually make some instructions unusable when assembling pre-MIPS32 ISAs. For example, this was causing the IAS to reject the 'j' instruction for MIPS I-V. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8300 llvm-svn: 234398	2015-04-08 10:06:45 +00:00
Daniel Jasper	018070c4c1	[MachineLICM] Cleanup, remove unused parameters. NFC. llvm-svn: 234392	2015-04-08 07:10:30 +00:00
Sanjoy Das	b098447128	[InstCombine] Refactor out OptimizeOverflowCheck. NFCI. Summary: This patch adds an enum `OverflowCheckFlavor` and a function `OptimizeOverflowCheck`. This will allow InstCombine to optimize overflow checks without directly introducing an intermediate call to the `llvm.$op.with.overflow` instrinsics. This specific change is a refactoring and does not intend to change behavior. Reviewers: majnemer, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8888 llvm-svn: 234388	2015-04-08 04:27:22 +00:00
Adam Nemet	e09a928c80	Revert "[LoopAccesses] Allow analysis to complete in the presence of uniform stores" This reverts commit r234361. It caused PR23157. llvm-svn: 234387	2015-04-08 04:16:55 +00:00
Alexei Starovoitov	f28ede810c	[bpf] support BPF backend as shared library dependencies were not set correctly for shared library build. static was ok Patch by Brenden Blanco. llvm-svn: 234386	2015-04-08 03:46:51 +00:00
Matthias Braun	9b9210264f	Oops, didn't mean to commit my debug fprintfs llvm-svn: 234385	2015-04-08 02:10:01 +00:00
Tom Stellard	80e169a435	R600/SI: Add some missing overrides llvm-svn: 234384	2015-04-08 02:07:05 +00:00
Matthias Braun	1e61bbf022	LiveInterval: Fix computeFromMainRange() producing adjacent segments with same valno If two livesegments from different subranges happened to have the same definition they could possibly end up as two adjacent segments in the main liverange with the same value number which is not allowed. Detect such cases and fix them in the 2nd pass of computeFromMainRange() if necessary. No testcase as there is only an out-of-tree target where I can sensibly come up with one. llvm-svn: 234382	2015-04-08 01:41:10 +00:00
Tom Stellard	d7e6f13671	R600/SI: Initial support for assembler and inline assembly This is currently considered experimental, but most of the more commonly used instructions should work. So far only SI has been extensively tested, CI and VI probably work too, but may be buggy. The current set of tests cases do not give complete coverage, but I think it is sufficient for an experimental assembler. See the documentation in R600Usage for more information. llvm-svn: 234381	2015-04-08 01:09:26 +00:00
Tom Stellard	8980dc323b	R600/SI: Add missing SOPK instructions llvm-svn: 234380	2015-04-08 01:09:22 +00:00
Tom Stellard	1f3416a63d	R600/SI: Don't print offset0/offset1 DS operands when they are 0 llvm-svn: 234379	2015-04-08 01:09:19 +00:00
NAKAMURA Takumi	1644e15c58	ELFObjectWriter.cpp: Prune obsolete \param since r234342. [-Wdocumentation] llvm-svn: 234377	2015-04-08 00:38:50 +00:00
Tim Northover	5b44f1ba19	AArch64: disallow "fmov sD, #-0.0" during assembly. We weren't checking the sign of the floating point immediate before translating it to "fmov sD, wzr". Similarly for D-regs. Technically "movi vD.2s, #0x80, lsl #24" would work most of the time, but it's not a blessed alias (and I don't think it should be since people expect writing sD to zero out the high lanes, and there's no dD equivalent). So an error it is. rdar://20455398 llvm-svn: 234372	2015-04-07 22:49:47 +00:00
Rafael Espindola	6cd2180b31	Delete commented code. Don't repeat name in comment. llvm-svn: 234370	2015-04-07 22:35:40 +00:00
Rafael Espindola	01f4c6cda6	Don't subtract the header size just to add it back. llvm-svn: 234362	2015-04-07 21:51:41 +00:00
Adam Nemet	0515c33b70	[LoopAccesses] Allow analysis to complete in the presence of uniform stores Both run-time pointer checking and the dependence analysis are capable of dealing with uniform addresses. I.e. it's really just an orthogonal property of the loop that the analysis computes. Run-time pointer checking will only try to reason about SCEVAddRec pointers or else gives up. If the uniform pointer turns out the be a SCEVAddRec in an outer loop, the run-time checks generated will be correct (start and end bounds would be equal). In case of the dependence analysis, we work again with SCEVs. When compared against a loop-dependent address of the same underlying object, the difference of the two SCEVs won't be constant. This will result in returning an Unknown dependence for the pair. When compared against another uniform access, the difference would be constant and we should return the right type of dependence (forward/backward/etc). The changes also adds support to query this property of the loop and modify the vectorizer to use this. Patch by Ashutosh Nema! llvm-svn: 234361	2015-04-07 21:46:16 +00:00

1 2 3 4 5 ...

78691 Commits