llvm-project

Commit Graph

Author	SHA1	Message	Date
Nirav Dave	f8556ad48f	Revert "[DAG] Cleanup unused nodes after store merge. NFCI." This reverts commit r310648 which causes an unexpected assertion failure llvm-svn: 310659	2017-08-10 21:03:36 +00:00
Nirav Dave	4d28c0ff4f	[DAG] Relax type restriction for store merge Summary: Allow stores of bitcastable types to be merged by peeking through BITCAST nodes and recasting stored values constant and vector extract nodes as necessary. Reviewers: jyknight, hfinkel, efriedma, RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34569 llvm-svn: 310655	2017-08-10 19:52:45 +00:00
Nirav Dave	99d9d24553	[DAG] Cleanup unused nodes after store merge. NFCI. llvm-svn: 310648	2017-08-10 18:53:14 +00:00
Taewook Oh	f5040b9685	Make .file directive to have basename only Summary: Currently LLVM puts directory along with the filename in .file directive, but this behavior doesn't match gcc. There's a no clear description about which one is right (https://sourceware.org/binutils/docs/as/File.html#File), but one document (https://sourceware.org/gdb/current/onlinedocs/stabs/ELF-Linker-Relocation.html) suggests that STT_FILE symbol in elf file is expected to have basename only, which should have a same sting file .file directive according to (https://docs.oracle.com/cd/E26502_01/html/E28388/eoiyg.html). This also affects badly on the build system that uses hashing, as the directory info could be differnt from developer to developer even when they're working on same file. Reviewers: pcc, mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36018 llvm-svn: 310642	2017-08-10 18:17:11 +00:00
Krzysztof Parzyszek	bea30c6286	Add "Restored" flag to CalleeSavedInfo The liveness-tracking code assumes that the registers that were saved in the function's prolog are live outside of the function. Specifically, that registers that were saved are also live-on-exit from the function. This isn't always the case as illustrated by the LR register on ARM. Differential Revision: https://reviews.llvm.org/D36160 llvm-svn: 310619	2017-08-10 16:17:32 +00:00
Nirav Dave	06242a99ce	[DAG] Rewrite expression. NFC. llvm-svn: 310608	2017-08-10 15:29:33 +00:00
Nirav Dave	926e2d39bf	[X86] Keep dependencies when constructing loads in combineStore Summary: Preserve chain dependecies between old and new loads constructed to prevent loads from reordering below later stores. Fixes PR34088. Reviewers: craig.topper, spatel, RKSimon, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36528 llvm-svn: 310604	2017-08-10 15:12:32 +00:00
Guy Blank	136b543745	[SelectionDAG] Allow constant folding for implicitly truncating BUILD_VECTOR nodes. In FoldConstantArithmetic, handle BUILD_VECTOR nodes that do implicit truncation on the elements. This is similar to what is done in FoldConstantVectorArithmetic. Differential Revision: https://reviews.llvm.org/D36506 llvm-svn: 310593	2017-08-10 14:09:50 +00:00
Elad Cohen	22ba97a0a6	[SelectionDAG] When scalarizing vselect, don't assert on a legal cond operand. When scalarizing the result of a vselect, the legalizer currently expects to already have scalarized the operands. While this is true for the true/false operands (which have the same type as the result), it is not case for the condition operand. On X86 AVX512, v1i1 is legal - this leads to operations such as '< N x type> vselect < N x i1> < N x type> < N x type>' where < N x type > is illegal to hit an assertion during the scalarization. The handling is similar to r205625. This also exposes the fact that (v1i1 extract_subvector) should be legal and selectable on AVX512 - We do this by custom lowering to vector_extract_elt. This still leaves us in some cases with redundant dag nodes which will be combined in a separate soon to come patch. This fixes pr33349. Differential revision: https://reviews.llvm.org/D36511 llvm-svn: 310552	2017-08-10 07:44:23 +00:00
David Blaikie	76fb649b0d	Reduce variable scope by moving declaration into if clause llvm-svn: 310506	2017-08-09 18:34:18 +00:00
Nirav Dave	6110d3ad00	[DAG] Explicitly cleanup merged load values during store merge. NFCI. llvm-svn: 310474	2017-08-09 13:37:07 +00:00
Serguei Katkov	6ea2e81cf6	[ImplicitNullCheck] Fix the bug when dependent instruction accesses memory It is possible that dependent instruction may access memory. In this case we must reject optimization because the memory change will be visible in null handler basic block. So we will execute an instruction which we must not execute if check fails. Reviewers: sanjoy, reames Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36392 llvm-svn: 310443	2017-08-09 05:17:02 +00:00
Reid Kleckner	e2e82061f9	[codeview] Emit nested enums and typedefs from classes Previously we limited ourselves to only emitting nested classes, but we need other kinds of types as well. This fixes the Visual Studio STL visualizers, so that users can visualize std::string and other objects. llvm-svn: 310410	2017-08-08 20:30:14 +00:00
Nirav Dave	8a813cf646	[DAG] Introduce peekThroughBitcast function. NFCI. llvm-svn: 310405	2017-08-08 20:01:18 +00:00
Nirav Dave	515116d7c2	[DAG] Update comments. NFC. llvm-svn: 310404	2017-08-08 19:52:19 +00:00
Simon Pilgrim	91b7b991d4	[DAGCombiner] simplifyShuffleMask - handle UNDEF inputs from shuffles as well as BUILD_VECTOR Minor extension to D36393 llvm-svn: 310372	2017-08-08 16:10:33 +00:00
Simon Pilgrim	ef44228acb	[DAGCombiner] Simplify shuffle mask index if the referenced input element is UNDEF Fixes one of the cases in PR34041. Differential Revision: https://reviews.llvm.org/D36393 llvm-svn: 310344	2017-08-08 11:03:30 +00:00
Sanjay Patel	807f92b8ff	[x86] revert r310208 to investigate test-suite failures (PR34105 / PR34097) llvm-svn: 310264	2017-08-07 15:47:48 +00:00
Nirav Dave	3d3bde7682	[DAG] Extend visitSCALAR_TO_VECTOR optimization to truncated vector. Relanding after case to insert explicit truncation as necessary. Allow SCALAR_TO_VECTOR of EXTRACT_VECTOR_ELT to reduce to EXTRACT_SUBVECTOR of vector shuffle when output is smaller. Marginally improves vector shuffle computations. Reviewers: efriedma, RKSimon, spatel Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D35566 llvm-svn: 310256	2017-08-07 14:07:49 +00:00
Guy Blank	5ca01695f7	[SelectionDAG] reset NewNodesMustHaveLegalTypes flag between basic blocks The NewNodesMustHaveLegalTypes flag is set to false at the beginning of CodeGenAndEmitDAG, and set to true after legalizing types. But before calling CodeGenAndEmitDAG we build the DAG for the basic block. So for the first basic block NewNodesMustHaveLegalTypes would be 'false' during the SDAG building, and for all other basic blocks it would be 'true'. This patch sets the flag to false before SDAG building each basic block. Differential Revision: https://reviews.llvm.org/D33435 llvm-svn: 310239	2017-08-07 05:51:14 +00:00
Sanjay Patel	a923c2ee95	[x86] use more shift or LEA for select-of-constants We can convert any select-of-constants to math ops: http://rise4fun.com/Alive/d7d For this patch, I'm enhancing an existing x86 transform that uses fake multiplies (they always become shl/lea) to avoid cmov or branching. The current code misses cases where we have a negative constant and a positive constant, so this is just trying to plug that hole. The DAGCombiner diff prevents us from hitting a terrible inefficiency: we can start with a select in IR, create a select DAG node, convert it into a sext, convert it back into a select, and then lower it to sext machine code. Some notes about the test diffs: 1. 2010-08-04-MaskedSignedCompare.ll - We were creating control flow that didn't exist in the IR. 2. memcmp.ll - Choose -1 or 1 is the case that got me looking at this again. I think we could avoid the push/pop in some cases if we used 'movzbl %al' instead of an xor on a different reg? That's a post-DAG problem though. 3. mul-constant-result.ll - The trade-off between sbb+not vs. setne+neg could be addressed if that's a regression, but I think those would always be nearly equivalent. 4. pr22338.ll and sext-i1.ll - These tests have undef operands, so I don't think we actually care about these diffs. 5. sbb.ll - This shows a win for what I think is a common case: choose -1 or 0. 6. select.ll - There's another borderline case here: cmp+sbb+or vs. test+set+lea? Also, sbb+not vs. setae+neg shows up again. 7. select_const.ll - These are motivating cases for the enhancement; replace cmov with cheaper ops. Assembly differences between movzbl and xor to avoid a partial reg stall are caused later by the X86 Fixup SetCC pass. Differential Revision: https://reviews.llvm.org/D35340 llvm-svn: 310208	2017-08-06 16:27:07 +00:00
Matt Arsenault	b94972cb82	IPRA: Don't crash on null getCallPreservedMask Kernels aren't callable, so they don't have a call preserved mask. llvm-svn: 310172	2017-08-05 07:50:18 +00:00
Kyle Butt	74f61dd8ef	BlockPlacement: add a flag to force cold block outlining w/o a profile. NFC. llvm-svn: 310129	2017-08-04 21:13:41 +00:00
Nico Weber	b24df62bb6	Revert r310058, it caused PR34073. llvm-svn: 310118	2017-08-04 20:24:13 +00:00
Quentin Colombet	c7652e22d7	[GlobalISel] Remove a stall comment in CMake. Thanks to Diana Picus <diana.picus@linaro.org> for noticing. NFC llvm-svn: 310114	2017-08-04 20:15:41 +00:00
Marcello Maggioni	8de4bbdaa5	[MachineOperand] Add ChangeToTargetIndex method. NFC Differential Revision: https://reviews.llvm.org/D36301 llvm-svn: 310083	2017-08-04 18:24:09 +00:00
Simon Pilgrim	5c63586489	[DAGCombiner] Extending pattern detection for vector shuffle. If all the operands of a BUILD_VECTOR extract elements from same vector then split the vector efficiently based on the maximum vector access index. Committed on behalf of @jbhateja (Jatin Bhateja) Differential Revision: https://reviews.llvm.org/D35788 llvm-svn: 310058	2017-08-04 12:46:35 +00:00
Eric Christopher	52854dcd34	Fix typo. llvm-svn: 309997	2017-08-03 22:41:12 +00:00
Matt Arsenault	a52391f2db	DAG: Provide access to Pass instance from SelectionDAG This allows accessing an analysis pass during lowering. llvm-svn: 309991	2017-08-03 21:54:00 +00:00
Quentin Colombet	250e050a50	[GlobalISel] Make GlobalISel a non-optional library. With this change, the GlobalISel library gets always built. In particular, this is not possible to opt GlobalISel out of the build using the LLVM_BUILD_GLOBAL_ISEL variable any more. llvm-svn: 309990	2017-08-03 21:52:25 +00:00
Nirav Dave	3fc1c2365c	[DAG] Allow merging of stores of vector loads Remove restriction disallowing merging of stores vector loads into larger store of larger vector load. Reviewers: RKSimon, efriedma, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36158 llvm-svn: 309951	2017-08-03 15:51:20 +00:00
Robert Lougher	10f740df4d	[LiveDebugVariables] Use lexical scope to trim debug value live intervals The debug value live intervals computed by Live Debug Variables may extend beyond the range of the debug location's lexical scope. In this case, splitting of an interval can result in an interval outside of the scope being created, causing extra unnecessary DBG_VALUEs to be emitted. To prevent this, trim the intervals to the lexical scope. This resolves PR33730. Reviewers: aprantl Differential Revision: https://reviews.llvm.org/D35953 llvm-svn: 309933	2017-08-03 11:54:02 +00:00
Simon Dardis	51296593a8	[SelectionDAG] Resolve PR33978. rL306209 taught SelectionDAG how to add the dereferenceable flag when expanding memcpy and memmove. The fix however contained a nit where the offset + size was constructed as an APInt of PointerSize rather than PointerSizeInBits. This lead to isDereferenceableAndAlignedPointer() get truncated values or values which would be sign extended within that function leading to incorrect results. Thanks to Alex Crichton for reporting the issue! This resolves PR33978. Reviewers: inouehrs Differential Revision: https://reviews.llvm.org/D36236 llvm-svn: 309930	2017-08-03 09:38:46 +00:00
Sameer AbuAsal	c8befe687f	[RegisterCoalescer] Add wrapper for Erasing Instructions Summary: To delete an instruction the coalescer needs to call eraseFromParent() on the MachineInstr, insert it in the ErasedInstrs list and update the Live Ranges structure. This patch re-factors the code to do all that in one function. This will also fix cases where previous code wasn't inserting deleted instructions in the ErasedList. Reviewers: qcolombet, kparzysz Reviewed By: qcolombet Subscribers: MatzeB, llvm-commits, qcolombet Differential Revision: https://reviews.llvm.org/D36204 llvm-svn: 309915	2017-08-03 02:41:17 +00:00
Rafael Espindola	79e238afee	Delete Default and JITDefault code models IMHO it is an antipattern to have a enum value that is Default. At any given piece of code it is not clear if we have to handle Default or if has already been mapped to a concrete value. In this case in particular, only the target can do the mapping and it is nice to make sure it is always done. This deletes the two default enum values of CodeModel and uses an explicit Optional<CodeModel> when it is possible that it is unspecified. llvm-svn: 309911	2017-08-03 02:16:21 +00:00
Hiroshi Inoue	0bd906ec8f	[StackColoring] Update AliasAnalysis information in stack coloring pass (part 2) This patch is update after the first patch (https://reviews.llvm.org/rL309651) based on the post-commit comments. Stack coloring pass need to maintain AliasAnalysis information when merging stack slots of different types. Actually, there is a FIXME comment in StackColoring.cpp // FIXME: In order to enable the use of TBAA when using AA in CodeGen, // we'll also need to update the TBAA nodes in MMOs with values // derived from the merged allocas. But, TBAA has been already enabled in CodeGen without fixing this pass. The incorrect TBAA metadata results in recent failures in bootstrap test on ppc64le (PR33928) by allowing unsafe instruction scheduling. Although we observed the problem on ppc64le, this is a platform neutral issue. This patch makes the stack coloring pass maintains AliasAnalysis information when merging multiple stack slots. This patch fixes PR33928. llvm-svn: 309849	2017-08-02 18:16:32 +00:00
Adrian Prantl	bd6d291c59	Assert that the offset of a DBG_VALUE is always 0. (NFC) llvm-svn: 309834	2017-08-02 17:19:13 +00:00
Adrian Prantl	5577363a2c	Remove the unused Offset field from MachineLocation (NFC) rdar://problem/33580047 llvm-svn: 309831	2017-08-02 17:07:38 +00:00
Nirav Dave	e44032f7e7	[DAG] Improve candidate pruning in store merge failure case. NFCI During store merge we construct a sorted list of consecutive store candidates and consider subsequences for merging into a single store. For each subsequence we check if the stored value type is legal the merged store would have valid and fast and if the constructed value to be stored is valid. The only properties that affect this check between subsequences is the size of the subsequence, the alignment of the first store, the alignment of the stored load value (when merging stores-of-loads), and whether the merged value is a constant zero. If we do not find a viable mergeable subsequence starting from the first store of length N, we know that a subsequence starting at a later store of length N will also fail unless the new store's alignment, the new load's alignment (if we're merging store-of-loads), or we've dropped stores of nonzero value and could construct a merged stores of zero (for merging constants). As a result if we fail to find a valid subsequence starting from the first store we can safely skip considering subsequences that start with subsequent stores unless one of the above properties is true. This significantly (2x) improves compile time in some pathological cases. Reviewers: RKSimon, efriedma, zvi, spatel, waltl Subscribers: grandinj, llvm-commits Differential Revision: https://reviews.llvm.org/D35901 llvm-svn: 309830	2017-08-02 16:35:58 +00:00
Adrian Prantl	61b39b2aec	Remove unused includes of MachineLocation.h (NFC) llvm-svn: 309824	2017-08-02 15:32:18 +00:00
Adrian Prantl	2049c0d734	Remove unreachable code. (NFC) MachineLocation::getOffset() always returns 0. rdar://problem/33580047 llvm-svn: 309823	2017-08-02 15:22:17 +00:00
Diana Picus	d5a00b0ff6	[MIR] Print target-specific constant pools This should enable us to test the generation of target-specific constant pools, e.g. for ARM: constants: - id: 0 value: 'g(GOT_PREL)-(LPC0+8-.)' alignment: 4 isTargetSpecific: true I intend to use this to test PIC support in GlobalISel for ARM. This is difficult to test outside of that context, since the existing MIR tests usually rely on parser support as well, and that seems a bit trickier to add. We could try to add a unit test, but the setup for that seems rather convoluted and overkill. We do test however that the parser reports a nice error when encountering a target-specific constant pool. Differential Revision: https://reviews.llvm.org/D36092 llvm-svn: 309806	2017-08-02 11:09:30 +00:00
Nirav Dave	15894f8dec	[DAG] Refactor store merge subexpressions. NFC. Distribute various expressions across ifs. llvm-svn: 309777	2017-08-02 01:08:38 +00:00
Matt Arsenault	acc5e82b0e	DAG: Undo and->or combine with FrameIndexes This pattern shows up when lowering byval copies on AMDGPU. The byval object access is split into 4-byte chunks, adding a constant offset to the FixedStack base. When some of the offsets turn into ors, this prevents combining the constant offsets. This makes it not apparent that the object is there when matching addressing modes, so it ends up using a scratch wave offset relative access and the lengthy frame index expansion for that. llvm-svn: 309775	2017-08-02 00:43:42 +00:00
Adrian Prantl	83ca4fc7bc	Update LiveDebugValues to generate DIExpressions for spill offsets instead of using the deprecated offset field of DBG_VALUE. This has no observable effect on the generated DWARF, but the assembler comments will look different. rdar://problem/33580047 llvm-svn: 309773	2017-08-02 00:16:56 +00:00
Adrian Prantl	aac78ce47e	Use helper function instead of manually constructing DBG_VALUEs (NFC) rdar://problem/33580047 llvm-svn: 309757	2017-08-01 22:37:35 +00:00
Adrian Prantl	032d2381bf	Remove PrologEpilogInserter's usage of DBG_VALUE's offset field In the last half-dozen commits to LLVM I removed code that became dead after removing the offset parameter from llvm.dbg.value gradually proceeding from IR towards the backend. Before I can move on to DwarfDebug and friends there is one last side-called offset I need to remove: This patch modifies PrologEpilogInserter's use of the DBG_VALUE's offset argument to use a DIExpression instead. Because the PrologEpilogInserter runs at the Machine level I had to play a little trick with a named llvm.dbg.mir node to get the DIExpressions to print in MIR dumps (which print the llvm::Module followed by the MachineFunction dump). I also had to add rudimentary DwarfExpression support to CodeView and as a side-effect also fixed a bug (CodeViewDebug::collectVariableInfo was supposed to give up on variables with complex DIExpressions, but would fail to do so for fragments, which are also modeled as DIExpressions). With this last holdover removed we will have only one canonical way of representing offsets to debug locations which will simplify the code in DwarfDebug (and future versions of CodeViewDebug once it starts handling more complex expressions) and make it easier to reason about. This patch is NFC-ish: All test case changes are for assembler comments and the binary output does not change. rdar://problem/33580047 Differential Revision: https://reviews.llvm.org/D36125 llvm-svn: 309751	2017-08-01 21:45:24 +00:00
Nirav Dave	dcc5afaad9	[DAG] Factor out common expressions. NFC. llvm-svn: 309740	2017-08-01 20:30:52 +00:00
Reid Kleckner	29c675247d	[DebugInfo] Don't turn dbg.declare into DBG_VALUE for static allocas Summary: We already have information about static alloca stack locations in our side table. Emitting instructions for them is inefficient, and it only happens when the address of the alloca has been materialized within the current block, which isn't often. Reviewers: aprantl, probinson, dblaikie Subscribers: jfb, dschuff, sbc100, jgravelle-google, hiraditya, llvm-commits, aheejin Differential Revision: https://reviews.llvm.org/D36117 llvm-svn: 309729	2017-08-01 19:45:09 +00:00
Nirav Dave	35dd1ac29c	Pull out VectorNumElements value. NFC. llvm-svn: 309719	2017-08-01 18:19:56 +00:00

1 2 3 4 5 ...

23111 Commits