llvm-project

Commit Graph

Author	SHA1	Message	Date
Philip Reames	6628713f4f	[RewriteStatepointsForGC] Extend base pointer inference to handle insertelement This change is simply enhancing the existing inference algorithm to handle insertelement instructions by conservatively inserting a new instruction to propagate the vector of associated base pointers. In the process, I'm ripping out the peephole optimizations which mostly helped cover the fact this hadn't been done. Note that most of the newly inserted nodes will be nearly immediately removed by the post insertion optimization pass introduced in 246718. Arguably, we should be trying harder to avoid the malloc traffic here, but I'd rather get the code correct, then worry about compile time. Unlike previous extensions of the algorithm to handle more case, I discovered the existing code was causing miscompiles in some cases. In particular, we had an implicit assumption that the peephole covered all insert element instructions, so if we had a value directly based on a insert element the peephole didn't cover, we proceeded as if it were a base anyways. Not good. I believe we had the same issue with shufflevector which is why I adjusted the predicate for them as well. Differential Revision: http://reviews.llvm.org/D12583 llvm-svn: 247210	2015-09-09 23:40:12 +00:00
Philip Reames	15d5563cea	[RewriteStatepointsForGC] Make base pointer inference deterministic Previously, the base pointer algorithm wasn't deterministic. The core fixed point was (of course), but we were inserting new nodes and optimizing them in an order which was unspecified and variable. We'd somewhat hacked around this for testing by sorting by value name, but that doesn't solve the general determinism problem. Instead, we can use the order of traversal over the def/use graph to give us a single consistent ordering. Today, this is a DFS order, but the exact order doesn't mater provided it's deterministic for a given input. (Q: It is safe to rely on a deterministic order of operands right?) Note that this only fixes the determinism within a single inference step. The inference step is currently invoked many times in a non-deterministic order. That's a future change in the sequence. :) Differential Revision: http://reviews.llvm.org/D12640 llvm-svn: 247208	2015-09-09 23:26:08 +00:00
Peter Collingbourne	1cbc91eccf	LowerBitSets: Fix non-determinism bug. Visit disjoint sets in a deterministic order based on the maximum BitSetNM index, otherwise the order in which we visit them will depend on pointer comparisons. This was being exposed by MSan. llvm-svn: 247201	2015-09-09 22:30:32 +00:00
Reid Kleckner	94b704c469	[SEH] Emit 32-bit SEH tables for the new EH IR The 32-bit tables don't actually contain PC range data, so emitting them is incredibly simple. The 64-bit tables, on the other hand, use the same table for state numbering as well as label ranges. This makes things more difficult, so it will be implemented later. llvm-svn: 247192	2015-09-09 21:10:03 +00:00
Dan Gohman	5e0668426c	[WebAssembly] Update target datalayout strings. llvm-svn: 247187	2015-09-09 20:54:31 +00:00
Teresa Johnson	0f251a1cc6	Change EmitRecordWithAbbrevImpl to take Optional record code. NFC. This change enables EmitRecord to pass the supplied record Code to EmitRecordWithAbbrevImpl, rather than insert it into the Vals array. It is an enabler for changing EmitRecord to take an ArrayRef<uintty> instead of a SmallVectorImpl<uintty>& Patch suggested by Duncan P. N. Exon Smith, modified by myself a bit to get correct assertion checking. llvm-svn: 247186	2015-09-09 20:53:31 +00:00
Piotr Padlewski	0dde00d239	ScalarEvolution assume hanging bugfix http://reviews.llvm.org/D12719 llvm-svn: 247184	2015-09-09 20:47:30 +00:00
Mehdi Amini	c9a85abc6c	Revert "Bitcode Writer: EmitRecordWith* takes an ArrayRef instead of a SmallVector (NFC)" This reverts commit r247178. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 247182	2015-09-09 20:35:15 +00:00
David Majnemer	d34dbf07bd	Revert trunc(lshr (sext A), Cst) to ashr A, Cst This reverts commit r246997, it introduced a regression (PR24763). llvm-svn: 247180	2015-09-09 20:20:08 +00:00
Mehdi Amini	7d2bf53ed1	Bitcode Writer: EmitRecordWith* takes an ArrayRef instead of a SmallVector (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 247178	2015-09-09 20:08:39 +00:00
Renato Golin	db7ea86bf4	Revert "AVX512: Implemented encoding and intrinsics for vextracti64x4 ,vextracti64x2, vextracti32x8, vextracti32x4, vextractf64x4, vextractf64x2, vextractf32x8, vextractf32x4 Added tests for intrinsics and encoding." This reverts commit r247149, as it was breaking numerous buildbots of varied architectures. llvm-svn: 247177	2015-09-09 19:44:40 +00:00
Sanjay Patel	66dcafc3d6	allow unpredictable metadata on switch statements llvm-svn: 247174	2015-09-09 18:38:30 +00:00
Matthias Braun	d9da162789	Save LaneMask with livein registers With subregister liveness enabled we can detect the case where only parts of a register are live in, this is expressed as a 32bit lanemask. The current code only keeps registers in the live-in list and therefore enumerated all subregisters affected by the lanemask. This turned out to be too conservative as the subregister may also cover additional parts of the lanemask which are not live. Expressing a given lanemask by enumerating a minimum set of subregisters is computationally expensive so the best solution is to simply change the live-in list to store the lanemasks as well. This will reduce memory usage for targets using subregister liveness and slightly increase it for other targets Differential Revision: http://reviews.llvm.org/D12442 llvm-svn: 247171	2015-09-09 18:08:03 +00:00
Matthias Braun	cc58005885	VirtRegMap: Improve addMBBLiveIns() using SlotIndex::MBBIndexIterator; NFC Now that we have an explicit iterator over the idx2MBBMap in SlotIndices we can use the fact that segments and the idx2MBBMap is sorted by SlotIndex position so can advance both simultaneously instead of starting from the beginning for each segment. This complicates the code for the subregister case somewhat but should be more efficient and has the advantage that we get the final lanemask for each block immediately which will be important for a subsequent change. Removes the now unused SlotIndexes::findMBBLiveIns function. Differential Revision: http://reviews.llvm.org/D12443 llvm-svn: 247170	2015-09-09 18:07:54 +00:00
Chandler Carruth	7b560d40bd	[PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible with the new pass manager, and no longer relying on analysis groups. This builds essentially a ground-up new AA infrastructure stack for LLVM. The core ideas are the same that are used throughout the new pass manager: type erased polymorphism and direct composition. The design is as follows: - FunctionAAResults is a type-erasing alias analysis results aggregation interface to walk a single query across a range of results from different alias analyses. Currently this is function-specific as we always assume that aliasing queries are within a function. - AAResultBase is a CRTP utility providing stub implementations of various parts of the alias analysis result concept, notably in several cases in terms of other more general parts of the interface. This can be used to implement only a narrow part of the interface rather than the entire interface. This isn't really ideal, this logic should be hoisted into FunctionAAResults as currently it will cause a significant amount of redundant work, but it faithfully models the behavior of the prior infrastructure. - All the alias analysis passes are ported to be wrapper passes for the legacy PM and new-style analysis passes for the new PM with a shared result object. In some cases (most notably CFL), this is an extremely naive approach that we should revisit when we can specialize for the new pass manager. - BasicAA has been restructured to reflect that it is much more fundamentally a function analysis because it uses dominator trees and loop info that need to be constructed for each function. All of the references to getting alias analysis results have been updated to use the new aggregation interface. All the preservation and other pass management code has been updated accordingly. The way the FunctionAAResultsWrapperPass works is to detect the available alias analyses when run, and add them to the results object. This means that we should be able to continue to respect when various passes are added to the pipeline, for example adding CFL or adding TBAA passes should just cause their results to be available and to get folded into this. The exception to this rule is BasicAA which really needs to be a function pass due to using dominator trees and loop info. As a consequence, the FunctionAAResultsWrapperPass directly depends on BasicAA and always includes it in the aggregation. This has significant implications for preserving analyses. Generally, most passes shouldn't bother preserving FunctionAAResultsWrapperPass because rebuilding the results just updates the set of known AA passes. The exception to this rule are LoopPass instances which need to preserve all the function analyses that the loop pass manager will end up needing. This means preserving both BasicAAWrapperPass and the aggregating FunctionAAResultsWrapperPass. Now, when preserving an alias analysis, you do so by directly preserving that analysis. This is only necessary for non-immutable-pass-provided alias analyses though, and there are only three of interest: BasicAA, GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is preserved when needed because it (like DominatorTree and LoopInfo) is marked as a CFG-only pass. I've expanded GlobalsAA into the preserved set everywhere we previously were preserving all of AliasAnalysis, and I've added SCEVAA in the intersection of that with where we preserve SCEV itself. One significant challenge to all of this is that the CGSCC passes were actually using the alias analysis implementations by taking advantage of a pretty amazing set of loop holes in the old pass manager's analysis management code which allowed analysis groups to slide through in many cases. Moving away from analysis groups makes this problem much more obvious. To fix it, I've leveraged the flexibility the design of the new PM components provides to just directly construct the relevant alias analyses for the relevant functions in the IPO passes that need them. This is a bit hacky, but should go away with the new pass manager, and is already in many ways cleaner than the prior state. Another significant challenge is that various facilities of the old alias analysis infrastructure just don't fit any more. The most significant of these is the alias analysis 'counter' pass. That pass relied on the ability to snoop on AA queries at different points in the analysis group chain. Instead, I'm planning to build printing functionality directly into the aggregation layer. I've not included that in this patch merely to keep it smaller. Note that all of this needs a nearly complete rewrite of the AA documentation. I'm planning to do that, but I'd like to make sure the new design settles, and to flesh out a bit more of what it looks like in the new pass manager first. Differential Revision: http://reviews.llvm.org/D12080 llvm-svn: 247167	2015-09-09 17:55:00 +00:00
Matthias Braun	80595460d8	MachineVerifier: Check that SlotIndex MBBIndexList is sorted. This introduces a check that the MBBIndexList is sorted as proposed in http://reviews.llvm.org/D12443 but split up into a separate commit. llvm-svn: 247166	2015-09-09 17:49:46 +00:00
Matt Arsenault	ef67d76869	AMDGPU: Extract full 64-bit subregister and use subregs Instead of extracting both 32-bit components from the 128-bit register. This produces fewer copies and is easier for the copy peephole optimizer to understand and see the actual uses as extracts from a reg_sequence. This avoids needing to handle subregister composing in the PeepholeOptimizer's ValueTracker for this case. llvm-svn: 247162	2015-09-09 17:03:29 +00:00
Matt Arsenault	b5541fb098	AMDGPU: Remove unused multiclass argument llvm-svn: 247161	2015-09-09 17:03:18 +00:00
Tom Stellard	5268c17e52	llvm-config: Add --build-system option Summary: This can be used for distinguishing between cmake and autoconf builds. Users may need this in order to handle inconsistencies between the outputs of the two build systems. Reviewers: echristo, chandlerc, beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11838 llvm-svn: 247159	2015-09-09 16:39:30 +00:00
Dan Gohman	f71abef701	[WebAssembly] Implement calls with void return types. llvm-svn: 247158	2015-09-09 16:13:47 +00:00
Tom Stellard	9a197676b1	AMDGPU/SI: Fold operands through REG_SEQUENCE instructions Summary: This helps mostly when we use add instructions for address calculations that contain immediates. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12256 llvm-svn: 247157	2015-09-09 15:43:26 +00:00
Silviu Baranga	a3e27edb5d	[CostModel][AArch64] Remove amortization factor for some of the vector select instructions Summary: We are not scalarizing the wide selects in codegen for i16 and i32 and therefore we can remove the amortization factor. We still have issues with i64 vectors in codegen though. Reviewers: mcrosier Subscribers: mcrosier, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D12724 llvm-svn: 247156	2015-09-09 15:35:02 +00:00
Sanjay Patel	6eccf487c9	don't repeat function names in comments; NFC llvm-svn: 247154	2015-09-09 15:24:36 +00:00
Dan Gohman	1ce7ba5fe0	[WebAssembly] Tidy up some unneeded newline characters. llvm-svn: 247152	2015-09-09 15:13:36 +00:00
Joseph Tremoulet	e5e75afe8f	[CMake] Flag recursive cmake invocations for cross-compile Summary: Cross-compilation uses recursive cmake invocations to build native host tools. These recursive invocations only forward a fixed set of variables/options, since the native environment is generally the default. This change adds -DLLVM_TARGET_IS_CROSSCOMPILE_HOST=TRUE to the recursive cmake invocations, so that cmake files can distinguish these recursive invocations from top-level ones, which can explain why expected options are unset. LLILC will use this to avoid trying to generate its build rules in the crosscompile native host target (where it is not needed), which would fail if attempted because LLILC requires a cmake variable passed on the command line, which is not forwarded in the recursive invocation. Reviewers: rnk, beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12679 llvm-svn: 247151	2015-09-09 14:57:06 +00:00
Sanjay Patel	e283441836	function names start with a lower case letter; NFC llvm-svn: 247150	2015-09-09 14:54:29 +00:00
Igor Breger	ac29a82921	AVX512: Implemented encoding and intrinsics for vextracti64x4 ,vextracti64x2, vextracti32x8, vextracti32x4, vextractf64x4, vextractf64x2, vextractf32x8, vextractf32x4 Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D11802 llvm-svn: 247149	2015-09-09 14:35:09 +00:00
Sanjay Patel	2fbab9d893	don't repeat function names in comments; NFC llvm-svn: 247148	2015-09-09 14:34:26 +00:00
Zoran Jovanovic	6b28f09d67	[mips][microMIPS] Implement ADDU16, AND16, ANDI16, NOT16, OR16, SLL16 and SRL16 instructions Differential Revision: http://reviews.llvm.org/D11178 llvm-svn: 247146	2015-09-09 13:55:45 +00:00
Alex Lorenz	b9a68dbcae	Fix PR 24633 - Handle undef values when parsing standalone constants. llvm-svn: 247145	2015-09-09 13:44:33 +00:00
James Molloy	520838977b	Rename ExitCount to BackedgeTakenCount, because that's what it is. We called a variable ExitCount, stored the backedge count in it, then redefined it to be the exit count again. llvm-svn: 247140	2015-09-09 12:51:10 +00:00
James Molloy	89eccee4db	Delay predication of stores until near the end of vector code generation Predicating stores requires creating extra blocks. It's much cleaner if we do this in one pass instead of mutating the CFG while writing vector instructions. Besides which we can make use of helper functions to update domtree for us, reducing the work we need to do. llvm-svn: 247139	2015-09-09 12:51:06 +00:00
Alexandros Lamprineas	712099ccfd	LLVM does not distinguish Cortex-M4 from Cortex-M4F neither Cortex-R5 from R5F. Removed "cortex-r5f" and "cortex-m4f" from Target Parser, sinced they are unknown cpu names for llvm and clang. Also updated default FPUs for R5 and M4 accordingly. Differential Revision: http://reviews.llvm.org/D12692 Change-Id: Ib81c7216521a361d8ee1296e4b6a2aa00bd479c5 llvm-svn: 247136	2015-09-09 11:20:48 +00:00
Daniel Sanders	2038747fce	Fix vector splitting for extract_vector_elt and vector elements of <8-bits. Summary: One of the vector splitting paths for extract_vector_elt tries to lower: define i1 @via_stack_bug(i8 signext %idx) { %1 = extractelement <2 x i1> <i1 false, i1 true>, i8 %idx ret i1 %1 } to: define i1 @via_stack_bug(i8 signext %idx) { %base = alloca <2 x i1> store <2 x i1> <i1 false, i1 true>, <2 x i1>* %base %2 = getelementptr <2 x i1>, <2 x i1>* %base, i32 %idx %3 = load i1, i1* %2 ret i1 %3 } However, the elements of <2 x i1> are not byte-addressible. The result of this is that the getelementptr expands to '%base + %idx * (1 / 8)' which simplifies to '%base + %idx * 0', and then simply '%base' causing all values of %idx to extract element zero. This commit fixes this by promoting the vector elements of <8-bits to i8 before splitting the vector. This fixes a number of test failures in pocl. Reviewers: pekka.jaaskelainen Subscribers: pekka.jaaskelainen, llvm-commits Differential Revision: http://reviews.llvm.org/D12591 llvm-svn: 247128	2015-09-09 09:53:20 +00:00
Chandler Carruth	1688a772fc	Fix a typo I spotted when hacking on SROA. Somewhat alarming that nothing broke. llvm-svn: 247127	2015-09-09 09:46:16 +00:00
Zoran Jovanovic	d9790793d6	[mips][microMIPS] Implement CACHEE and PREFE instructions Differential Revision: http://reviews.llvm.org/D11628 llvm-svn: 247125	2015-09-09 09:10:46 +00:00
Matt Arsenault	d768737454	AMDGPU: Fix not encoding src2 of VOP3b instructions Broken by r247074. Should include an assembler test, but the assembler is currently broken for VOP3b apparently. llvm-svn: 247123	2015-09-09 08:39:49 +00:00
Sanjoy Das	da0d79e0a0	[IRCE] Add INITIALIZE_PASS_DEPENDENCY invocations. IRCE was just using INITIALIZE_PASS(), which is incorrect. llvm-svn: 247122	2015-09-09 03:47:18 +00:00
Lang Hames	856e4767ff	[RuntimeDyld] Add support for MachO x86_64 SUBTRACTOR relocation. llvm-svn: 247119	2015-09-09 03:14:29 +00:00
Dan Gohman	e590b33bf8	[WebAssembly] Fix lowering of calls with more than one argument. llvm-svn: 247118	2015-09-09 01:52:45 +00:00
Matt Arsenault	acd68b58ae	SelectionDAG: Support Expand of f16 extloads Currently this hits an assert that extload should always be supported, which assumes integer extloads. This moves a hack out of SI's argument lowering and is covered by existing tests. llvm-svn: 247113	2015-09-09 01:12:27 +00:00
Dan Gohman	4f52e00ecb	[WebAssembly] Implement WebAssemblyInstrInfo::copyPhysReg llvm-svn: 247110	2015-09-09 00:52:47 +00:00
Matt Arsenault	3099156261	Fix typos / grammar llvm-svn: 247109	2015-09-09 00:38:33 +00:00
Duncan P. N. Exon Smith	78b66ecd70	Revert "Bitcode: ArrayRef-ize EmitRecordWithAbbrev(), NFC" This reverts commit r247107. Turns out clang calls these functions directly, and `ArrayRef<T>` doesn't have a working implicit conversion from `SmallVector<T>`. http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_build/14247 llvm-svn: 247108	2015-09-09 00:37:52 +00:00
Duncan P. N. Exon Smith	98b3cd9280	Bitcode: ArrayRef-ize EmitRecordWithAbbrev(), NFC Change `EmitRecordWithAbbrev()` and friends to take an `ArrayRef<T>` instead of requiring a `SmallVectorImpl<T>`. No functionality change intended. llvm-svn: 247107	2015-09-09 00:34:25 +00:00
Davide Italiano	9a429b766f	[llvm-readobj] MachO -- dump LinkerOptions load command. Example output: Linker Options { Size: 32 Count: 2 Strings [ Value: -framework Value: Cocoa ] } There were only two tests using this -- so I converted them as part of this commit rather than separately. Differential Revision: http://reviews.llvm.org/D12702 llvm-svn: 247106	2015-09-09 00:21:18 +00:00
Reid Kleckner	51189f0a1d	[WinEH] Avoid creating MBBs for LLVM BBs that cannot contain code Typically these are catchpads, which hold data used to decide whether to catch the exception or continue unwinding. We also shouldn't create MBBs for catchendpads, cleanupendpads, or terminatepads, since no real code can live in them. This fixes a problem where MI passes (like the register allocator) would try to put code into catchpad blocks, which are not executed by the runtime. In the new world, blocks ending in invokes now have many possible successors. llvm-svn: 247102	2015-09-08 23:28:38 +00:00
Peter Collingbourne	8d24ae9441	Re-apply r247080 with order of evaluation fix. llvm-svn: 247095	2015-09-08 22:49:35 +00:00
Reid Kleckner	df1295173f	[WinEH] Emit prologues and epilogues for funclets Summary: 32-bit funclets have short prologues that allocate enough stack for the largest call in the whole function. The runtime saves CSRs for the funclet. It doesn't restore CSRs after we finally transfer control back to the parent funciton via a CATCHRET, but that's a separate issue. 32-bit funclets also have to adjust the incoming EBP value, which is what llvm.x86.seh.recoverframe does in the old model. 64-bit funclets need to spill CSRs as normal. For simplicity, this just spills the same set of CSRs as the parent function, rather than trying to compute different CSR sets for the parent function and each funclet. 64-bit funclets also allocate enough stack space for the largest outgoing call frame, like 32-bit. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12546 llvm-svn: 247092	2015-09-08 22:44:41 +00:00
Peter Collingbourne	07f3af2e82	Revert r247080, "LowerBitSets: Extend pass to support functions as bitset members." as it causes test failures on a number of bots. llvm-svn: 247088	2015-09-08 22:33:23 +00:00
Vedant Kumar	9ebd49a4cf	[Bitcode] Add compatibility tests for new instructions Adds basic compatibility tests for the following instructions: catchpad, catchendpad, cleanuppad, cleanupendpad, terminatepad, cleanupret, catchret llvm-svn: 247087	2015-09-08 22:33:23 +00:00
Vedant Kumar	ee6110cd39	[docs] Fix typo in catchret example An example usage of catchret omitted the "to" in "to label" in ExceptionHandling.rst. llvm-svn: 247086	2015-09-08 22:28:38 +00:00
Eric Christopher	71f6e2f568	Fix the PPC CTR Loop pass to look for calls to the intrinsics that read CTR and count them as reading the CTR. llvm-svn: 247083	2015-09-08 22:14:58 +00:00
Peter Collingbourne	c634ed0b1a	LowerBitSets: Extend pass to support functions as bitset members. This change extends the bitset lowering pass to support bitsets that may contain either functions or global variables. A function bitset is lowered to a jump table that is laid out before one of the functions in the bitset. Also add support for non-string bitset identifier names. This allows for distinct metadata nodes to stand in for names with internal linkage, as done in D11857. Differential Revision: http://reviews.llvm.org/D11856 llvm-svn: 247080	2015-09-08 21:57:45 +00:00
Ivan Krasin	a610cb5ba0	[libFuzzer]Add a test for defeating a hash sum. Summary: Add a test for a data followed by 4-byte hash value. I use a slightly modified Jenkins hash function, as described in https://en.wikipedia.org/wiki/Jenkins_hash_function The modification is to ensure that hash(zeros) != 0. Reviewers: kcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12648 llvm-svn: 247076	2015-09-08 21:22:52 +00:00
Matt Arsenault	86d336e91b	AMDGPU/SI: Fix input vcc operand for VOP2b instructions Adds vcc to output string input for e32. Allows option of using e64 encoding with assembler. Also fixes these instructions not implicitly reading exec. llvm-svn: 247074	2015-09-08 21:15:00 +00:00
Artem Belevich	0127d80986	[NVPTX] Added run NVVMReflect pass to NVPTX back-end. The pass is needed to remove __nvvm_reflect calls when we link in libdevice bitcode that comes with CUDA. Differential Revision: http://reviews.llvm.org/D11663 llvm-svn: 247072	2015-09-08 21:04:55 +00:00
Derek Schuff	45c832c5d8	Fix comments and RUN line in x86-64 stdarg test leftover from last commit From http://reviews.llvm.org/D12346 llvm-svn: 247070	2015-09-08 20:58:41 +00:00
Derek Schuff	eef533f422	x32. Fixes a bug in how struct va_list is initialized in x32 Summary: This patch modifies X86TargetLowering::LowerVASTART so that struct va_list is initialized with 32 bit pointers in x32. It also includes tests that call @llvm.va_start() for x32. Patch by João Porto Subscribers: llvm-commits, hjl.tools Differential Revision: http://reviews.llvm.org/D12346 llvm-svn: 247069	2015-09-08 20:51:31 +00:00
Kostya Serebryany	4b82de2e47	[libFuzzer] remove a piece of stale code llvm-svn: 247067	2015-09-08 20:40:10 +00:00
Kostya Serebryany	9cdea94f66	[libFuzzer] be more robust when dealing with files on disk (e.g. don't crash if a file was there but disappeared) llvm-svn: 247066	2015-09-08 20:36:33 +00:00
Dan Gohman	e32c57443f	[WebAssembly] Support running without a register allocator in the default CodeGen passes This allows backends which don't use a traditional register allocator, but do need PHI lowering and other passes, to use the default TargetPassConfig::addFastRegAlloc and TargetPassConfig::addOptimizedRegAlloc implementations. Differential Revision: http://reviews.llvm.org/D12691 llvm-svn: 247065	2015-09-08 20:36:33 +00:00
Matt Arsenault	90e0340850	Add const overload of findRegisterUseOperand llvm-svn: 247063	2015-09-08 20:21:29 +00:00
Vedant Kumar	9e1998e198	[docs] Update documentation for the landingpad instruction llvm-svn: 247062	2015-09-08 20:16:35 +00:00
Sanjay Patel	b54e62fe17	refactor matches for De Morgan's Laws; NFCI llvm-svn: 247061	2015-09-08 20:14:13 +00:00
Matt Arsenault	8ac35cd031	AMDGPU: Mark s_barrier as a high latency instruction These were marked as WriteSALU, which is low latency. I'm guessing at the value to use, but it should probably be considered the highest latency instruction. I'm not sure this has any actual effect since hasSideEffects probably is preventing any moving of these. llvm-svn: 247060	2015-09-08 19:54:32 +00:00
Matt Arsenault	8fb810a1d2	AMDGPU: Fix s_barrier flags This should be convergent. This is not a barrier in the isBarrier sense, nor hasCtrlDep. llvm-svn: 247059	2015-09-08 19:54:25 +00:00
Derek Schuff	ee4e947e23	x32. Fixes a bug in i8mem_NOREX declaration. The old implementation assumed LP64 which is broken for x32. Specifically, the MOVE8rm_NOREX and MOVE8mr_NOREX, when selected, would cause a 'Cannot emit physreg copy instruction' error message to be reported. This patch also enable the h-register*ll tests for x32. Differential Revision: http://reviews.llvm.org/D12336 Patch by João Porto llvm-svn: 247058	2015-09-08 19:47:15 +00:00
Matt Arsenault	966a94f861	AMDGPU: Handle sub of constant for DS offset folding sub C, x - > add (sub 0, x), C for DS offsets. This is mostly to fix regressions that show up when SeparateConstOffsetFromGEP is enabled. llvm-svn: 247054	2015-09-08 19:34:22 +00:00
Diego Novillo	f9aa39b0cf	Fix PR 24723 - Handle 0-mass backedges in irreducible loops This corner case happens when we have an irreducible SCC that is deeply nested. As we work down the tree, the backedge masses start getting smaller and smaller until we reach one that is down to 0. Since we distribute the incoming mass using the backedge masses as weight, the distributor does not allow zero weights. So, we simply ignore them (which will just use the weights of the non-zero nodes). llvm-svn: 247050	2015-09-08 19:22:17 +00:00
Davide Italiano	cb2da7166a	[MC/ELF] Accept zero for .align directive .align directive refuses alignment 0 -- a comment in the code hints this is done for GNU as compatibility, but it seems GNU as accepts .align 0 (and silently rounds up alignment to 1). Differential Revision: http://reviews.llvm.org/D12682 llvm-svn: 247048	2015-09-08 18:59:47 +00:00
David Blaikie	12dd3c4ebb	Fix CPP Backend for GEP API changes for opaque pointer types Based on a patch by Jerome Witmann. llvm-svn: 247047	2015-09-08 18:42:29 +00:00
Benjamin Kramer	c3c183554b	Merge or combine tests and convert to FileCheck. - Move tests only exercising instsimplify to instsimplify's apint-or.ll - Actually test the CHECK lines in instsimplify's apint-or.ll - Merge the remaining tests in apint-or1.ll and apint-or2.ll, use FileCheck llvm-svn: 247045	2015-09-08 18:36:56 +00:00
Evgeniy Stepanov	80d5569dba	Fix isDiscardableIfUnused to include available_externally linkage. AvailableExternally functions are discardable. llvm-svn: 247044	2015-09-08 18:25:20 +00:00
Sanjay Patel	1854927556	remove function names from comments; NFC llvm-svn: 247043	2015-09-08 18:24:36 +00:00
Andrew Kaylor	e2ea93c6c0	Fix for bz24500: Avoid non-deterministic code generation triggered by the x86 call frame optimization Patch by Dave Kreitzer Differential Revision: http://reviews.llvm.org/D12620 llvm-svn: 247042	2015-09-08 18:18:46 +00:00
Sanjay Patel	21a145c341	add tests for De Morgan instcombines based on PR22723 llvm-svn: 247040	2015-09-08 18:13:03 +00:00
Sanjay Patel	35f673ef08	fix typos, remove noise; NFCI llvm-svn: 247035	2015-09-08 17:58:22 +00:00
Kostya Serebryany	b06fae5ede	[libFuzzer] better documentatio for -save_minimized_corpus=1 llvm-svn: 247033	2015-09-08 17:43:51 +00:00
Vedant Kumar	52cd4eecac	[Bitcode] Add compatibility test for llvm 3.7.0 This patch adds llvm-3.7 IR and generated bitcode for our compatibility test (in accordance with the developer policy). llvm-svn: 247031	2015-09-08 17:39:21 +00:00
Kostya Serebryany	468ed78434	[libFuzzer] remove -iterations as redundant (there is also -num_runs) llvm-svn: 247030	2015-09-08 17:30:35 +00:00
JF Bastien	749ed88aa5	WebAssembly: NFC rename shr/sar Renamed from: https://github.com/WebAssembly/design/pull/332 llvm-svn: 247028	2015-09-08 17:21:21 +00:00
Kostya Serebryany	25425ad920	[libFuzzer] add one more mutator: Mutate_ChangeASCIIInteger llvm-svn: 247027	2015-09-08 17:19:31 +00:00
Jun Bum Lim	4d3c5986f2	Remove white space (test commit) llvm-svn: 247021	2015-09-08 16:11:22 +00:00
Zoran Jovanovic	2da1437d62	[mips][microMIPS] Implement LLE, LUI, LW and LWE instructions Differential Revision: http://reviews.llvm.org/D1179 llvm-svn: 247017	2015-09-08 15:02:50 +00:00
Dan Gohman	d4a12d25ff	[WebAssembly] Temporarily disable this test, as it depends on additional patches that aren't yet checked in. llvm-svn: 247011	2015-09-08 13:21:12 +00:00
Igor Breger	a54a1a84dd	AVX512: kunpck encoding implementation Added tests for encoding. Differential Revision: http://reviews.llvm.org/D12061 llvm-svn: 247010	2015-09-08 13:10:00 +00:00
Dan Gohman	25d2a0dda4	[WebAssembly] Enable SSA lowering and other pre-regalloc passes llvm-svn: 247008	2015-09-08 12:39:25 +00:00
Elena Demikhovsky	ddf715ef77	Removed an old comment, NFC llvm-svn: 247006	2015-09-08 12:22:22 +00:00
Alex Lorenz	37e026260b	MIRLangRef: Add documentation for the subregister indices. llvm-svn: 247005	2015-09-08 11:39:47 +00:00
Alex Lorenz	d4990ebd02	MIRLangRef: Add documentation for the global value machine operands. llvm-svn: 247004	2015-09-08 11:38:16 +00:00
Zoran Jovanovic	9eaa30d2bf	[mips][microMIPS] Implement SB, SBE, SCE, SH and SHE instructions Differential Revision: http://reviews.llvm.org/D11801 llvm-svn: 246999	2015-09-08 10:18:38 +00:00
Jakub Kuderski	7cd4810021	There is a trunc(lshr (zext A), Cst) optimization in InstCombineCasts that removes cast by performing the lshr on smaller types. However, currently there is no trunc(lshr (sext A), Cst) variant. This patch add such optimization by transforming trunc(lshr (sext A), Cst) to ashr A, Cst. Differential Revision: http://reviews.llvm.org/D12520 llvm-svn: 246997	2015-09-08 10:03:17 +00:00
Daniel Sanders	808dfb8ba7	[mips] Reserve address spaces 1-255 for software use. Summary: And define them to have noop casts with address spaces 0-255. Reviewers: pekka.jaaskelainen Subscribers: pekka.jaaskelainen, llvm-commits Differential Revision: http://reviews.llvm.org/D12678 llvm-svn: 246990	2015-09-08 09:07:03 +00:00
Zoran Jovanovic	68be5f21a9	[mips][microMIPS] Add microMIPS32r6 and microMIPS64r6 tests for existing 16-bit LBU16, LHU16, LW16, LWGP and LWSP instructions Differential Revision: http://reviews.llvm.org/D10956 llvm-svn: 246987	2015-09-08 08:25:34 +00:00
NAKAMURA Takumi	bb7483dd77	[CMake][CMP0051] Avoid for user of objlib to use llvm_update_compile_flags(). $<TARGET_OBJECTS> shouldn't require compile flags. Flags are set in obj.${name}. llvm-svn: 246984	2015-09-08 07:42:06 +00:00
Elena Demikhovsky	dec0f0885f	compilation issue, NFC llvm-svn: 246983	2015-09-08 07:34:06 +00:00
Elena Demikhovsky	d240d778b3	fixed compilation issue, NFC. llvm-svn: 246982	2015-09-08 07:10:08 +00:00
Elena Demikhovsky	e88038f235	AVX-512: Lowering for 512-bit vector shuffles. Vector types: <8 x 64>, <16 x 32>, <32 x 16> float and integer. Differential Revision: http://reviews.llvm.org/D10683 llvm-svn: 246981	2015-09-08 06:38:21 +00:00
Davide Italiano	bb9a6ccfa8	[llvm-readobj] Shrink code a little bit. No functional change. llvm-svn: 246976	2015-09-07 20:47:03 +00:00

1 2 3 4 5 ...

121486 Commits