llvm-project

Commit Graph

Author	SHA1	Message	Date
James Molloy	0f41227b21	[Thumb1] Teach optimizeCompareInstr about thumb1 compares This avoids us doing a completely unneeded "cmp r0, #0" after a flag-setting instruction if we only care about the Z or C flags. Add LSL/LSR to the whitelist while we're here and add testing. This code could really do with a spring clean. llvm-svn: 281027	2016-09-09 09:51:06 +00:00
Sam Kolton	d63d8a7c05	[AMDGPU] Assembler: match e32 VOP instructions before e64. Summary: Split assembler match table in 4 tables with assembler variants: Default - all instructions except VOP3, SDWA and DPP - VOP3 - SDWA - DPP First match Default table then VOP3, SDWA and DPP. Reviewers: tstellarAMD, artem.tamazov, vpykhtin Subscribers: arsenm, wdng, nhaehnle, AMDGPU Differential Revision: https://reviews.llvm.org/D24252 llvm-svn: 281023	2016-09-09 09:37:51 +00:00
Simon Dardis	8efa979029	[mips] Fix c.<cc>.<fmt> instruction definition. As part of this effort, remove MipsFCmp nodes and use tablegen patterns rather than custom lowering through C++. Unexpectedly, this improves codesize for microMIPS as previous floating point setcc expansions would materialize 0 and 1 into GPRs before using the relevant mov[tf].[sd] instruction. Now $zero is used directly. Reviewers: dsanders, vkalintiris, zoran.jovanovic Differential Review: https://reviews.llvm.org/D23118 llvm-svn: 281022	2016-09-09 09:22:52 +00:00
Gor Nishanov	faf36c2e0b	[Coroutines] Part13: Handle single edge PHINodes across suspends Summary: If one of the uses of the value is a single edge PHINode, handle it. Original: %val = something <suspend> %p = PHINode [%val] After Spill + Part13: %val = something %slot = gep val.spill.slot store %val, %slot <suspend> %p = load %slot Plus tiny fixes/changes: * use correct index for coro.free in CoroCleanup * fixup id parameter in coro.free to allow authoring coroutine in plain C with __builtins Reviewers: majnemer Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D24242 llvm-svn: 281020	2016-09-09 05:39:00 +00:00
Amaury Sechet	5f04d819a5	Rationalise the attribute getter/setter methods on Function and CallSite. Summary: While woring on mapping attributes in the C API, it clearly appeared that the recent changes in the API on the C++ side left Function and Call/Invoke with an attribute API that grew in an ad hoc manner. This makes it difficult to work with it, because one doesn't know which overloads exists and which do not. Make sure that getter/setter function exists for both enum and string version. Remove inconsistent getter/setter, unless they have many callsites. This should make it easier to work with attributes in the future. This doesn't change how attribute works. Reviewers: bkramer, whitequark, mehdi_amini, void Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21514 llvm-svn: 281019	2016-09-09 04:50:38 +00:00
Kostya Serebryany	b76a2a5503	[libFuzzer] improve -print_pcs to not print new PCs coming from libFuzzer itself llvm-svn: 281016	2016-09-09 02:38:28 +00:00
Kostya Serebryany	8ea4f9873b	[libFuzzer] remove unneeded call llvm-svn: 281014	2016-09-09 01:57:38 +00:00
Craig Topper	149e6bdc16	[AVX-512] Add VPCMP instructions to the load folding tables and make them commutable. llvm-svn: 281013	2016-09-09 01:36:10 +00:00
Kostya Serebryany	5c04bd250e	[libFuzzer] remove use_traces=1 since use_value_profile seems to be strictly better llvm-svn: 281007	2016-09-09 01:17:03 +00:00
David Majnemer	2c3ea55498	[X86] Tighten up a comment which confused x64 ABI terminology. The x64 ABI has two major function types: - frame functions - leaf functions A frame function is one which requires a stack frame. A leaf function is one which does not. A frame function may or may not have a frame pointer. A leaf function does not require a stack frame and may never modify SP except via a return (RET, tail call via JMP). A frame function which has a frame pointer is permitted to use the LEA instruction in the epilogue, a frame function without which doesn't establish a frame pointer must use ADD to adjust the stack pointer epilogue. Fun fact: Leaf functions don't require a function table entry (associated PDATA/XDATA). llvm-svn: 281006	2016-09-09 01:07:01 +00:00
Hans Wennborg	c39ef776fc	Win64: Don't use REX prefix for direct tail calls The REX prefix should be used on indirect jmps, but not direct ones. For direct jumps, the unwinder looks at the offset to determine if it's inside the current function. Differential Revision: https://reviews.llvm.org/D24359 llvm-svn: 281003	2016-09-08 23:35:10 +00:00
Dehao Chen	87823f8e4d	Remove debug info when hoisting instruction from then/else branch. Summary: The hoisted instruction is executed speculatively. It could affect the debugging experience as user would see gdb go into code that may not be expected to execute. It will also affect sample profile accuracy by assigning incorrect frequency to source within then/else branch. Reviewers: davidxl, dblaikie, chandlerc, kcc, echristo Subscribers: mehdi_amini, probinson, eric_niebler, andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D24164 llvm-svn: 280995	2016-09-08 21:53:33 +00:00
Matthew Simpson	bfe5e1817b	[LV] Ensure proper handling of multi-use case when collecting uniforms The test case included in r280979 wasn't checking what it was supposed to be checking for the predicated store case. Fixing the test revealed that the multi-use case (when a pointer is used by both vectorized and scalarized memory accesses) wasn't being handled properly. We can't skip over non-consecutive-like pointers since they may have looked consecutive-like with a different memory access. llvm-svn: 280992	2016-09-08 21:38:26 +00:00
Krzysztof Parzyszek	a1218728d3	[RDF] Further improve handling of multiple phis reached from shadows llvm-svn: 280987	2016-09-08 20:48:42 +00:00
Matthew Simpson	408a3abcfe	[LV] Don't mark pointers used by scalarized memory accesses uniform Previously, all consecutive pointers were marked uniform after vectorization. However, if a consecutive pointer is used by a memory access that is eventually scalarized, the pointer won't remain uniform after all. An example is predicated stores. Even though a predicated store may be consecutive, it will still be scalarized, making it's pointer operand non-uniform. This patch updates the logic in collectLoopUniforms to consider the cases where a memory access may be scalarized. If a memory access may be scalarized, its pointer operand is not marked uniform. The determination of whether a given memory instruction will be scalarized or not has been moved into a common function that is used by the vectorizer, cost model, and legality analysis. Differential Revision: https://reviews.llvm.org/D24271 llvm-svn: 280979	2016-09-08 19:11:07 +00:00
Zachary Turner	35377f88f5	[YAMLIO] Add the ability to map with context. mapping a yaml field to an object in code has always been a stateless operation. You could still pass state by using the `setContext` function of the YAMLIO object, but this represented global state for the entire yaml input. In order to have context-sensitive state, it is necessary to pass this state in at the granularity of an individual mapping. This patch adds support for this type of context-sensitive state. You simply pass an additional argument of type T to the `mapRequired` or `mapOptional` functions, and provided you have specialized a `MappingContextTraits<U, T>` class with the appropriate mapping function, you can pass this context into the mapping function. Reviewed By: chandlerc Differential Revision: https://reviews.llvm.org/D24162 llvm-svn: 280977	2016-09-08 18:22:44 +00:00
Matt Arsenault	d745c28945	AMDGPU: Sign extend constants when splitting them This will confuse later passes which try to look at the immediate value and don't truncate first. llvm-svn: 280974	2016-09-08 17:44:36 +00:00
Krzysztof Parzyszek	a696b1b641	[Hexagon] Expand sext- and zextloads of vector types, not just extloads Recent change exposed this issue, breaking the Hexagon buildbots. llvm-svn: 280973	2016-09-08 17:42:14 +00:00
Matt Arsenault	be90f70d3a	AMDGPU: Try to commute when selecting s_addk_i32/s_mulk_i32 llvm-svn: 280972	2016-09-08 17:35:41 +00:00
Eric Christopher	98ddbdb563	AArch64 .arch directive - Include default arch attributes with extensions. Fix the .arch asm parser to use the full set of features for the architecture and any extensions on the command line. Add and update testcases accordingly as well as add an extension that was used but not supported. llvm-svn: 280971	2016-09-08 17:27:03 +00:00
Matt Arsenault	bbb47da8a1	AMDGPU: Support commuting with immediate in src0 llvm-svn: 280970	2016-09-08 17:19:29 +00:00
Renato Golin	049f387112	Revert "[XRay] ARM 32-bit no-Thumb support in LLVM" And associated commits, as they broke the Thumb bots. This reverts commit r280935. This reverts commit r280891. This reverts commit r280888. llvm-svn: 280967	2016-09-08 17:10:39 +00:00
Balaram Makam	c6cebf727c	[LoopDataPrefetch] Use range based for loop; NFCI Switch to range based for loop. No functional change, but more readable code. llvm-svn: 280966	2016-09-08 17:08:20 +00:00
Sanjay Patel	1c608f4323	[InstCombine] return a vector-safe true/false constant I introduced this potential bug by missing this diff in: https://reviews.llvm.org/rL280873 ...however, I'm not sure how to reach this code path with a regression test. We may be able to remove this code and assume that the transform to a constant is always handled by InstSimplify? llvm-svn: 280964	2016-09-08 16:54:02 +00:00
Dehao Chen	db3810771e	revert r280427 Refactor replaceDominatedUsesWith to have a flag to control whether to replace uses in BB itself. Summary: This is in preparation for LoopSink pass which calls replaceDominatedUsesWith to update after sinking. llvm-svn: 280949	2016-09-08 15:25:12 +00:00
Renato Golin	d257373887	[ARM XRay] Try to fix Thumb-only failure I mised the check that it had to support ARM to work. This commit tries to fix that, to make sure we don't emit ARM code in Thumb-only mode. llvm-svn: 280935	2016-09-08 13:45:10 +00:00
James Molloy	c6a6144966	[SDAGBuilder] Don't create a binary tree for switches in minsize mode This bloats codesize - all of the non-leaf nodes are extra code. llvm-svn: 280932	2016-09-08 13:12:22 +00:00
James Molloy	753c18f5c0	[Thumb1] AND with a constant operand can be converted into BIC So model the cost of materializing the constant operand C as the minimum of C and ~C. llvm-svn: 280929	2016-09-08 12:58:12 +00:00
James Molloy	7c7255e40b	[Thumb1] Fix cost calculation for complemented immediates Materializing something like "-3" can be done as 2 instructions: MOV r0, #3 MVN r0, r0 This has a cost of 2, not 3. It looks like we were already trying to detect this pattern in TII::getIntImmCost(), but were taking the complement of the zero-extended value instead of the sign-extended value which is unlikely to ever produce a number < 256. There were no tests failing after changing this... :/ llvm-svn: 280928	2016-09-08 12:58:04 +00:00
Simon Pilgrim	cc7b4b511b	[SelectionDAG] Add BUILD_VECTOR support to computeKnownBits and SimplifyDemandedBits Add the ability to computeKnownBits and SimplifyDemandedBits to extract the known zero/one bits from BUILD_VECTOR, returning the known bits that are shared by every vector element. This is an initial step towards determining the sign bits of a vector (PR29079). Differential Revision: https://reviews.llvm.org/D24253 llvm-svn: 280927	2016-09-08 12:57:51 +00:00
Simon Pilgrim	a01ee07a19	[DAGCombiner] Enable AND combines of splatted constant vectors Allow AND combines to use a vector splatted constant as well as a constant scalar. Preliminary part of D24253. llvm-svn: 280926	2016-09-08 12:36:39 +00:00
Pablo Barrio	2b7ed1339c	Revert "[ARM] Lower UDIV+UREM to UDIV+MLS (and the same for SREM)" This reverts commit r280808. It is possible that this change results in an infinite loop. This is causing timeouts in some tests on ARM, and a Chromebook bot is failing. llvm-svn: 280918	2016-09-08 10:05:57 +00:00
Hrvoje Varga	dbe4d96b4f	[mips][microMIPS] Implement DBITSWAP, DLSA and LWUPC and add tests for AUI instructions Differential Revision: https://reviews.llvm.org/D16452 llvm-svn: 280909	2016-09-08 07:41:43 +00:00
Vitaly Buka	58a81c6540	[asan] Avoid lifetime analysis for allocas with can be in ambiguous state Summary: C allows to jump over variables declaration so lifetime.start can be avoid before variable usage. To avoid false-positives on such rare cases we detect them and remove from lifetime analysis. PR27453 PR28267 Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24321 llvm-svn: 280907	2016-09-08 06:27:58 +00:00
Michael Zolotukhin	e72997a524	Revert "[LoopUnroll] Properly update loop-info when cloning prologues and epilogues." This reverts commit r280901. This caused a bunch of failures, reverting it until I investigate them. llvm-svn: 280905	2016-09-08 03:51:30 +00:00
Michael Zolotukhin	5e0a20697e	[LoopUnroll] Properly update loop-info when cloning prologues and epilogues. Summary: When cloning blocks for prologue/epilogue we need to replicate the loop structure from the original loop. It wasn't a problem for the innermost loops, but it led to an incorrect loop info when we unrolled a loop with a child loop - in this case created prologue-loop had a child loop, but loop info didn't reflect that. This fixes PR28888. Reviewers: chandlerc, sanjoy, hfinkel Subscribers: llvm-commits, silvas Differential Revision: https://reviews.llvm.org/D24203 llvm-svn: 280901	2016-09-08 01:52:26 +00:00
Michael Kuperstein	f79af6f8c4	[CGP] Be less conservative about tail-duplicating a ret to allow tail calls CGP tail-duplicates rets into blocks that end with a call that feed the ret. This puts the call in tail position, potentially allowing the DAG builder to lower it as a tail call. To avoid tail duplication in cases where we won't form the tail call, CGP tried to predict whether this is going to be possible, and avoids doing it when lowering as a tail call will definitely fail. However, it was being too conservative by always throwing away calls to functions with a signext/zeroext attribute on the return type. Instead, we can use the same logic the builder uses to determine whether the attributes work out. Differential Revision: https://reviews.llvm.org/D24315 llvm-svn: 280894	2016-09-08 00:48:37 +00:00
Dean Michael Berris	cf3801eee8	[XRay] Remove unused variable llvm-svn: 280891	2016-09-08 00:38:22 +00:00
Dean Michael Berris	17d94e279e	[XRay] ARM 32-bit no-Thumb support in LLVM This is a port of XRay to ARM 32-bit, without Thumb support yet. The XRay instrumentation support is moving up to AsmPrinter. This is one of 3 commits to different repositories of XRay ARM port. The other 2 are: 1. https://reviews.llvm.org/D23932 (Clang test) 2. https://reviews.llvm.org/D23933 (compiler-rt) Differential Revision: https://reviews.llvm.org/D23931 llvm-svn: 280888	2016-09-08 00:19:04 +00:00
Peter Collingbourne	8f1dd5c41e	IR: Remove Value::intersectOptionalDataWith, replace all calls with calls to Instruction::andIRFlags. The two functions are functionally equivalent. Differential Revision: https://reviews.llvm.org/D22830 llvm-svn: 280884	2016-09-07 23:39:04 +00:00
Vitaly Buka	c5e53b2a53	Revert "[asan] Avoid lifetime analysis for allocas with can be in ambiguous state" Fails on Windows. This reverts commit r280880. llvm-svn: 280883	2016-09-07 23:37:15 +00:00
Vitaly Buka	2ca05b07d6	[asan] Avoid lifetime analysis for allocas with can be in ambiguous state Summary: C allows to jump over variables declaration so lifetime.start can be avoid before variable usage. To avoid false-positives on such rare cases we detect them and remove from lifetime analysis. PR27453 PR28267 Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24321 llvm-svn: 280880	2016-09-07 23:18:23 +00:00
Sanjay Patel	9b40f98357	[InstCombine] use m_APInt to allow icmp (and (sh X, Y), C2), C1 folds for splat constant vectors llvm-svn: 280873	2016-09-07 22:33:03 +00:00
Hal Finkel	ac5803ba91	[SimplifyCFG] Don't try to create metadata-valued PHIs We can't create metadata-valued PHIs; don't try to do so when sinking. I created a test case for this using the @llvm.type.test intrinsic, because it takes a metadata parameter and does not have severe side effects (thus SimplifyCFG is willing to otherwise sink it). Previously, running the test case would crash with: Invalid use of metadata! %.sink = select i1 %flag, metadata <...>, metadata <0x4e45dc0> LLVM ERROR: Broken function found, compilation aborted! llvm-svn: 280866	2016-09-07 21:38:22 +00:00
Haicheng Wu	109f4f3509	[LoopUnroll] Correct a debug message. NFC. Differential Revision: https://reviews.llvm.org/D24299 llvm-svn: 280865	2016-09-07 21:30:16 +00:00
Elena Demikhovsky	dcc86d5bb6	Shift-left (ISD::SHL) operation crashes on "DAG Legalization" phase. https://llvm.org/bugs/show_bug.cgi?id=29058. While node legalization we tried to legalize its operands. If an operand node is replaced during legalization the user node may be destroyed. Differential Revision: https://reviews.llvm.org/D24244 llvm-svn: 280862	2016-09-07 20:54:33 +00:00
Sanjay Patel	def931e76a	[InstCombine] allow icmp (and X, C2), C1 folds for splat constant vectors This is a revert of r280676 which was a revert of r280637; ie, this is r280637 again. It was speculatively reverted to help debug buildbot failures. llvm-svn: 280861	2016-09-07 20:50:44 +00:00
Krzysztof Parzyszek	2db0c8b75f	[RDF] Fix liveness analysis for phi nodes with shadow uses Shadow uses need to be analyzed together, since each individual shadow will only have a partial reaching def. All shadows together may cover a given register ref, while each individual shadow may not. llvm-svn: 280855	2016-09-07 20:37:05 +00:00
Michael Kuperstein	71321563de	Don't reuse a variable name in a nested scope. NFC. llvm-svn: 280853	2016-09-07 20:29:49 +00:00
Krzysztof Parzyszek	1ff99525f7	[RDF] Introduce "undef" flag for ref nodes llvm-svn: 280851	2016-09-07 20:10:56 +00:00

1 2 3 4 5 ...

94713 Commits