llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	30ffc4ce45	[SROA] Don't falsely report that changes have occured We would report that the function changed despite creating no new allocas or performing any promotion. This fixes PR27316. llvm-svn: 267507	2016-04-26 01:05:00 +00:00
Andrew Kaylor	1aa3cf7d18	Reverting Thumb2SizeReduction opt bisect change to fix failing buildbots. llvm-svn: 267506	2016-04-26 00:56:36 +00:00
Sanjay Patel	a31b0c0ece	[CodeGenPrepare] don't convert an unpredictable select into control flow Suggested in the review of D19488: http://reviews.llvm.org/D19488 llvm-svn: 267504	2016-04-26 00:47:39 +00:00
Junmo Park	3c65acf87e	Remove MinLatency in SchedMachineModel. NFC. Summary: We don't use MinLatency any more since r184032. Reviewers: atrick, hfinkel, mcrosier Differential Revision: http://reviews.llvm.org/D19474 llvm-svn: 267502	2016-04-26 00:37:46 +00:00
Justin Bogner	1a07501379	PM: Port GlobalOpt to the new pass manager llvm-svn: 267499	2016-04-26 00:28:01 +00:00
Justin Bogner	d2f3d0a79d	PM: Convert the logic for GlobalOpt into static functions. NFC Pass all of the state we need around as arguments, so that these functions are easier to reuse. There is one part of this that is unusual: we pass around a functor to look up a DomTree for a function. This will be a necessary abstraction when we try to use this code in both the legacy and the new pass manager. llvm-svn: 267498	2016-04-26 00:27:56 +00:00
Ahmed Bougacha	5cf735a5b1	[X86] Use LivePhysRegs in X86FixupBWInsts. Kill-flags, which computeRegisterLiveness uses, are not reliable. LivePhysRegs is. Differential Revision: http://reviews.llvm.org/D19472 llvm-svn: 267495	2016-04-26 00:00:48 +00:00
Sanjay Patel	82059090d3	Add check for "branch_weights" with prof metadata While we're here, fix the comment and variable names to make it clear that these are raw weights, not percentages. llvm-svn: 267491	2016-04-25 23:15:16 +00:00
James Y Knight	51208eaccc	[Sparc] Fix double-float fabs and fneg on little endian CPUs. The SparcV8 fneg and fabs instructions interestingly come only in a single-float variant. Since the sign bit is always the topmost bit no matter what size float it is, you simply operate on the high subregister, as if it were a single float. However, the layout of double-floats in the float registers is reversed on little-endian CPUs, so that the high bits are in the second subregister, rather than the first. Thus, this expansion must check the endianness to use the correct subregister. llvm-svn: 267489	2016-04-25 22:54:09 +00:00
Tim Northover	cbba0aba16	ARM: put correct symbol index on indirect pointers in __thread_ptr. Otherwise the linker has no idea what should be resolved. llvm-svn: 267488	2016-04-25 22:36:07 +00:00
Andrew Kaylor	736efc894d	Fix build warning llvm-svn: 267487	2016-04-25 22:27:30 +00:00
Andrew Kaylor	7de74af929	Add optimization bisect opt-in calls for AMDGPU passes Differential Revision: http://reviews.llvm.org/D19450 llvm-svn: 267485	2016-04-25 22:23:44 +00:00
Amaury Sechet	9bbda191ba	Reformat LLVMConstPointerNull. NFC llvm-svn: 267484	2016-04-25 22:23:35 +00:00
Arch D. Robison	be0490a6e8	Optimize store of "bitcast" from vector to aggregate. This patch is what was the "instcombine" portion of D14185, with an additional test added (see julia_pseudovec in test/Transforms/InstCombine/insert-val-extract-elem.ll). The patch causes instcombine to replace sequences of extractelement-insertvalue-store that act essentially like a bitcast followed by a store. Differential review: http://reviews.llvm.org/D14260 llvm-svn: 267482	2016-04-25 22:22:39 +00:00
Philip Reames	1918384155	[LVI] Make a precondition explicit rather than handling a case which never happens [NFC] llvm-svn: 267481	2016-04-25 22:21:24 +00:00
Andrew Kaylor	a2b9111ef7	Add optimization bisect opt-in calls for ARM passes Differential Revision: http://reviews.llvm.org/D19449 llvm-svn: 267480	2016-04-25 22:01:04 +00:00
Andrew Kaylor	1ac98bb088	Add optimization bisect opt-in calls for AArch64 passes Differential Revision: http://reviews.llvm.org/D19394 llvm-svn: 267479	2016-04-25 21:58:52 +00:00
Krzysztof Parzyszek	1711f2d8bd	Add accidentally deleted "break" llvm-svn: 267476	2016-04-25 21:28:52 +00:00
Lang Hames	1fa0e0e006	[ORC] clang-format code that was touched in r267457. NFC. Commit r267457 made a lot of type-substitutions threw off code formatting and alignment. This patch should tidy those changes up. llvm-svn: 267475	2016-04-25 21:21:20 +00:00
Tim Northover	5c3140f745	ARM: put extern __thread stubs in a special section. The linker needs to know that the symbols are thread-local to do its job properly. llvm-svn: 267473	2016-04-25 21:12:04 +00:00
Teresa Johnson	c851d216e2	[ThinLTO] Introduce typedef for commonly-used map type (NFC) Add a typedef for the std::map<GlobalValue::GUID, GlobalValueSummary *> map that is passed around to identify summaries for values defined in a particular module. This shortens up declarations in a variety of places. llvm-svn: 267471	2016-04-25 21:09:51 +00:00
Krzysztof Parzyszek	3e28229000	[Hexagon] Few fixes for exception handling llvm-svn: 267469	2016-04-25 21:05:19 +00:00
Quentin Colombet	abe2d016cf	Re-apply r267206 with a fix for the encoding problem: when the immediate of log2(Mask) is smaller than 32, we must use the 32-bit variant because the 64-bit variant cannot encode it. Therefore, set the subreg part accordingly. [AArch64] Fix optimizeCondBranch logic. The opcode for the optimized branch does not depend on the size of the activate bits in the AND masks, but the AND opcode itself. Indeed, we need to use a X or W variant based on the AND variant not based on whether the mask fits into the related variant. Otherwise, we may end up using the W variant of the optimized branch for 64-bit register inputs! This fixes the last make check verifier issues for AArch64: PR27479. llvm-svn: 267465	2016-04-25 20:54:08 +00:00
Etienne Bergeron	50f02aa3fa	Cleanup redundant expression in InstCombineAndOrXor. Summary: The expression is redundant on both side of operator \|. detected by : http://reviews.llvm.org/D19451 Reviewers: rnk, majnemer Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D19459 llvm-svn: 267458	2016-04-25 20:15:33 +00:00
Lang Hames	ef5a0ee2c3	[ORC] Thread Error/Expected through the RPC library. This replaces use of std::error_code and ErrorOr in the ORC RPC support library with Error and Expected. This required updating the OrcRemoteTarget API, Client, and server code, as well as updating the Orc C API. This patch also fixes several instances where Errors were dropped. llvm-svn: 267457	2016-04-25 19:56:45 +00:00
Matt Arsenault	074ea2851c	AMDGPU/SI: Optimize adjacent s_nop instructions Use the operand for how long to wait. This is somewhat distasteful, since it would be better to just emit s_nop with the right argument in the first place. This would require changing TII::insertNoop to emit N operands, which would be easy. Slightly more problematic is the post-RA scheduler and hazard recognizer represent nops as a single null node, and would require inventing another way of representing N nops. llvm-svn: 267456	2016-04-25 19:53:22 +00:00
Kostya Serebryany	9ba19182be	[libFuzzer] remove dead code llvm-svn: 267455	2016-04-25 19:41:45 +00:00
Matt Arsenault	99c14524ec	AMDGPU: Implement addrspacecast llvm-svn: 267452	2016-04-25 19:27:24 +00:00
Matt Arsenault	48ab526f12	AMDGPU: Add queue ptr intrinsic llvm-svn: 267451	2016-04-25 19:27:18 +00:00
Matt Arsenault	dfaf4261ab	AMDGPU: Add DAG to debug dump Also reorder case to match enum order llvm-svn: 267449	2016-04-25 19:27:09 +00:00
Philip Reames	3bb2832900	[LVI] Clarify comments describing the lattice values There has been much recent confusion about the partition in the lattice between constant and non-constant values. Hopefully, documenting this will prevent confusion going forward. llvm-svn: 267440	2016-04-25 18:48:43 +00:00
Philip Reames	6671577eb3	[LVI] Split solveBlockValueConstantRange into two [NFC] This function handled both unary and binary operators. Cloning and specializing leads to much easier to follow code with minimal duplicatation. llvm-svn: 267438	2016-04-25 18:30:31 +00:00
Krzysztof Parzyszek	e8e754da74	[Hexagon] Register save/restore functions do not follow regular conventions Do not mark them as modifying any of the volatile registers by default. llvm-svn: 267433	2016-04-25 17:49:44 +00:00
Zachary Turner	0a43efea95	Resubmit "Refactor raw pdb dumper into library" This fixes a number of endianness issues as well as an ODR violation that hopefully causes everything to be happy. llvm-svn: 267431	2016-04-25 17:38:08 +00:00
Chad Rosier	e2cbd13e56	[ValueTracking] Improve isImpliedCondition when the dominating cond is false. llvm-svn: 267430	2016-04-25 17:23:36 +00:00
Jacques Pienaar	522031bd05	[lanai] Expand findClosestSuitableAluInstr check to consider offset register. Previously findClosestSuitableAluInstr was only considering the base register when checking the current instruction for suitability. Expand check to consider the offset if the offset is a register. llvm-svn: 267424	2016-04-25 16:41:21 +00:00
Marcin Koscielnicki	1c1af6ef77	[PR27390] [CodeGen] Reject indexed loads in CombinerDAG. visitAND, when folding and (load) forgets to check which output of an indexed load is involved, happily folding the updated address output on the following testcase: target datalayout = "e-m:e-i64:64-n32:64" target triple = "powerpc64le-unknown-linux-gnu" %typ = type { i32, i32 } define signext i32 @_Z8access_pP1Tc(%typ* %p, i8 zeroext %type) { %b = getelementptr inbounds %typ, %typ* %p, i64 0, i32 1 %1 = load i32, i32* %b, align 4 %2 = ptrtoint i32* %b to i64 %3 = and i64 %2, -35184372088833 %4 = inttoptr i64 %3 to i32* %_msld = load i32, i32* %4, align 4 %zzz = add i32 %1, %_msld ret i32 %zzz } Fix this by checking ResNo. I've found a few more places that currently neglect to check for indexed load, and tightened them up as well, but I don't have test cases for them. In fact, they might not be triggerable at all, at least with current targets. Still, better safe than sorry. Differential Revision: http://reviews.llvm.org/D19202 llvm-svn: 267420	2016-04-25 15:43:44 +00:00
Hrvoje Varga	c2dd5d223a	[mips][microMIPS] Revert commit r267137 Commit r267137 was the reason for failing tests in LLVM test suite. llvm-svn: 267419	2016-04-25 15:40:08 +00:00
Zlatko Buljan	b43d4bcbd5	[mips][microMIPS] Revert commit r266977 Commit r266977 was reason for failing LLVM test suite with error message: fatal error: error in backend: Cannot select: t17: i32 = rotr t2, t11 ... llvm-svn: 267418	2016-04-25 15:34:57 +00:00
Etienne Bergeron	06c14ec31e	Fix incorrect redundant expression in target AMDGPU. Summary: The expression is detected as a redundant expression. Turn out, this is probably a bug. ``` /home/etienneb/llvm/llvm/lib/Target/AMDGPU/SIInstrInfo.cpp:306:26: warning: both side of operator are equivalent [misc-redundant-expression] if (isSMRD(FirstLdSt) && isSMRD(FirstLdSt)) { ``` Reviewers: rnk, tstellarAMD Subscribers: arsenm, cfe-commits Differential Revision: http://reviews.llvm.org/D19460 llvm-svn: 267415	2016-04-25 15:06:33 +00:00
David Majnemer	dd21523653	[WinEH] Update SplitAnalysis::computeLastSplitPoint to cope with multiple EH successors We didn't have logic to correctly handle CFGs where there was more than one EH-pad successor (these are novel with WinEH). There were situations where a register was live in one exceptional successor but not another but the code as written would only consider the first exceptional successor it found. This resulted in split points which were insufficiently early if an invoke was present. This fixes PR27501. N.B. This removes getLandingPadSuccessor. llvm-svn: 267412	2016-04-25 14:31:32 +00:00
Silviu Baranga	82d04260b7	[ARM] Add support for the X asm constraint Summary: This patch adds support for the X asm constraint. To do this, we lower the constraint to either a "w" or "r" constraint depending on the operand type (both constraints are supported on ARM). Fixes PR26493 Reviewers: t.p.northover, echristo, rengolin Subscribers: joker.eph, jgreenhalgh, aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D19061 llvm-svn: 267411	2016-04-25 14:29:18 +00:00
Artem Tamazov	d6468666b5	[AMDGPU][llvm-mc] s_getreg/setreg* - Add hwreg(...) syntax. Added hwreg(reg[,offset,width]) syntax. Default offset = 0, default width = 32. Possibility to specify 16-bit immediate kept. Added out-of-range checks. Disassembling is always to hwreg(...) format. Tests updated/added. Differential Revision: http://reviews.llvm.org/D19329 llvm-svn: 267410	2016-04-25 14:13:51 +00:00
Anna Thomas	95f68aa7eb	Test commit: modified comment. NFC llvm-svn: 267406	2016-04-25 13:58:05 +00:00
Chad Rosier	3d75f8ce9e	Typo. NFC. llvm-svn: 267399	2016-04-25 13:25:14 +00:00
Krzysztof Parzyszek	e6ee481bdf	[Hexagon] Correctly set "Flags" in ELF header llvm-svn: 267397	2016-04-25 12:49:47 +00:00
James Molloy	eb040cc55f	[GlobalOpt] Allow constant globals to be SRA'd The current logic assumes that any constant global will never be SRA'd. I presume this is because normally constant globals can be pushed into their uses and deleted. However, that sometimes can't happen (which is where you really want SRA, so the elements that can be eliminated, are!). There seems to be no reason why we can't SRA constants too, so let's do it. llvm-svn: 267393	2016-04-25 10:48:29 +00:00
Igor Kudrin	ed99a96f06	[Coverage] Restore the correct count value after processing a nested region in case of combined regions. If several regions cover the same area of code, we have to restore the combined value for that area when return from a nested region. This patch achieves that by combining regions before calling buildSegments. Differential Revision: http://reviews.llvm.org/D18610 llvm-svn: 267390	2016-04-25 09:43:37 +00:00
Silviu Baranga	795c629ec9	[SCEV] Improve the run-time checking of the NoWrap predicate Summary: This implements a new method of run-time checking the NoWrap SCEV predicates, which should be easier to optimize and nicer for targets that don't correctly handle multiplication/addition of large integer types (like i128). If the AddRec is {a,+,b} and the backedge taken count is c, the idea is to check that \|b\| * c doesn't have unsigned overflow, and depending on the sign of b, that: a + \|b\| * c >= a (b >= 0) or a - \|b\| * c <= a (b <= 0) where the comparisons above are signed or unsigned, depending on the flag that we're checking. The advantage of doing this is that we avoid extending to a larger type and we avoid the multiplication of large types (multiplying i128 can be expensive). Reviewers: sanjoy Subscribers: llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D19266 llvm-svn: 267389	2016-04-25 09:27:16 +00:00
Marcin Koscielnicki	a44d44cb2e	[PowerPC] [PR27387] Disallow r0 for ADD8TLS. ADD8TLS, a variant of add instruction used for initial-exec TLS, currently accepts r0 as a source register. While add itself supports r0 just fine, linker can relax it to a local-exec sequence, converting it to addi - which doesn't support r0. Differential Revision: http://reviews.llvm.org/D19193 llvm-svn: 267388	2016-04-25 09:24:34 +00:00

1 2 3 4 5 ...

89469 Commits