llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Jasper	84b3cc394d	Revert "[DAGCombiner] (add X, (adde Y, 0, Carry)) -> (adde X, Y, Carry)" This reverts commit r294186. On an internal test, this triggers an out-of-memory error on PPC, presumably because there is another dagcombine that does the exact opposite triggering and endless loop consuming more and more memory. Chandler has started at creating a reduced test case and we'll attach it as soon as possible. llvm-svn: 294288	2017-02-07 08:57:50 +00:00
Craig Topper	9191c3324a	[AVX-512] Add masked and unmasked shift by immediate instructions to load folding tables. llvm-svn: 294287	2017-02-07 07:31:00 +00:00
Craig Topper	62304d80e3	[AVX-512] Add masked shift instructions to load folding tables. This adds the masked versions of everything, but the shift by immediate instructions. llvm-svn: 294286	2017-02-07 07:30:57 +00:00
Craig Topper	45d9ddc687	[AVX-512] Add some of the shift instructions to the load folding tables. This includes unmasked forms of variable shift and shifting by the lower element of a register. Still need to do shift by immediate which was not foldable prior to avx512 and all the masked forms. llvm-svn: 294285	2017-02-07 07:30:54 +00:00
Matt Arsenault	f48169862d	AMDGPU: Fix missing static llvm-svn: 294281	2017-02-07 04:37:59 +00:00
Craig Topper	39d86bb688	[X86] Change the Defs list for VZEROALL/VZEROUPPER back to not including YMM16-31. llvm-svn: 294277	2017-02-07 04:10:57 +00:00
Peter Collingbourne	1ea1fd893c	LowerTypeTests: Simplify. NFC. llvm-svn: 294273	2017-02-07 03:20:58 +00:00
Krzysztof Parzyszek	0605f3fddd	[Hexagon] Address ASAN and UBSAN failures after r294226 Reinstate r294256 with a fix. llvm-svn: 294269	2017-02-07 02:31:53 +00:00
Matthias Braun	f80403d8ee	RegisterCoalescer: Fix joinReservedPhysReg() joinReservedPhysReg() can only deal with a liverange in a single basic block when copying from a vreg into a physreg. See also rdar://30306405 Differential Revision: https://reviews.llvm.org/D29436 llvm-svn: 294268	2017-02-07 01:59:39 +00:00
Chandler Carruth	346542b769	Revert r293017 and fix the actual underlying issue. The patch committed in r293017, as discussed on the list, doesn't really make sense but was causing an actual issue to go away. The issue turns out to be that in one place the extra template arguments were dropped from the OuterAnalysisManagerProxy. This in turn caused the types used in one set of places to access the key to be completely different from the types used in another set of places for both Loop and CGSCC cases where there are extra arguments. I have literally no idea how anything seemed to work with this bug in place. It blows my mind. But it did except for mingw64 in a DLL build. I've added a really handy static assert that helps ensure we don't break this in the future. It immediately diagnoses the issue with a compile failure and a very clear error message. Much better that staring at backtraces on a build bot. =] llvm-svn: 294267	2017-02-07 01:50:48 +00:00
Yaxun Liu	8f844f3960	[AMDGPU] Lower null pointers in static variable initializer For amdgcn target Clang generates addrspacecast to represent null pointers in private and local address spaces. In LLVM codegen, the static variable initializer is lowered by virtual function AsmPrinter::lowerConstant which is target generic. Since addrspacecast is target specific, AsmPrinter::lowerConst This patch overrides AsmPrinter::lowerConstant with AMDGPUAsmPrinter::lowerConstant, which is able to lower the target-specific addrspacecast in the null pointer representation so that -1 is co Differential Revision: https://reviews.llvm.org/D29284 llvm-svn: 294265	2017-02-07 00:43:21 +00:00
Philip Reames	c80bd0486d	[LVI] Switch from BFS to DFS exploration order This patch changes the order in which LVI explores previously unexplored paths. Previously, the code used an BFS strategy where each unexplored input was added to the search queue before any of them were explored. This has the effect of causing all inputs to be explored before returning to re-evaluate the merge point (non-local or phi node). This has the unfortunate property of doing redundant work if one of the inputs to the merge is found to be overdefined (i.e. unanalysable). If any input is overdefined, the result of the merge will be too; regardless of the values of other inputs. The new code uses a DFS strategy where we re-evaluate the merge after evaluating each input. If we discover an overdefined input, we immediately return without exploring other inputs. We have reports of large (4-10x) improvements of compile time with this patch and some reports of more precise analysis results as well. See the review discussion for details. The original motivating case was pr10584. Differential Revision: https://reviews.llvm.org/D28190 llvm-svn: 294264	2017-02-07 00:25:24 +00:00
Tim Northover	868332d6bf	GlobalISel: legalize narrow G_SELECTS on AArch64. Otherwise there aren't any patterns to select them. llvm-svn: 294261	2017-02-06 23:41:27 +00:00
Dehao Chen	4a9dd70213	Fix the samplepgo indirect call promotion bug: we should not promote a direct call. Summary: Checking CS.getCalledFunction() == nullptr does not necessary indicate indirect call. We also need to check if CS.getCalledValue() is not a constant. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29570 llvm-svn: 294260	2017-02-06 23:33:15 +00:00
Krzysztof Parzyszek	becf0a362a	Revert "[Hexagon] Address ASAN and UBSAN failures after r294226" This reverts commit r294256. It seems to be causing more problems instead of solving them. llvm-svn: 294259	2017-02-06 23:30:17 +00:00
Krzysztof Parzyszek	5b4a6b67c5	[Hexagon] Adding gp+ to the syntax of gp-relative instructions Patch by Colin LeMahieu. llvm-svn: 294258	2017-02-06 23:18:57 +00:00
Stanislav Mekhanoshin	99be1aff31	[AMDGPU] Fix GCNSchedStrategy.cpp debug output There is typo in the debug output: top and bottom candidates are switched. Differential Revision: https://reviews.llvm.org/D29608 llvm-svn: 294257	2017-02-06 23:16:51 +00:00
Krzysztof Parzyszek	35a7838954	[Hexagon] Address ASAN and UBSAN failures after r294226 llvm-svn: 294256	2017-02-06 23:11:32 +00:00
Paul Robinson	383c5c228f	Merge DebugLoc on combined stores; in this case, when combining stores from the end of two blocks, merge instead of arbitrarily picking one. Differential Revision: http://reviews.llvm.org/D29504 llvm-svn: 294251	2017-02-06 22:19:04 +00:00
Taewook Oh	44a856f7d5	[GVNHoist] Merge DebugLoc metadata on hoisted instructions Summary: When instructions are hoisted, current implementation keeps DebugLoc metadata of the instruction that chosen as Repl (and its GEP operand if Repl is a load or a store). However, DebugLoc metadata should be updated to the 'merged' location across all hoisted instructions. See the following example code: ``` 1: typedef struct { 2: int a[10]; 3: } S1; 4: 5: extern S1 s1[10]; 6: 7: void foo(int x, int y, int i) { 8: if (y) 9: s1[i]->a[i] = x + y; 10: else 11: s1[i]->a[i] = x; 12: } ``` Below is LLVM IR representation of the program before gvn-hoist: ``` %struct.S1 = type { [10 x i32] } @s1 = external local_unnamed_addr global [10 x %struct.S1], align 16 define void @foo(i32 %x, i32 %y, i32 %i) !dbg !4 { entry: %tobool = icmp ne i32 %y, 0, !dbg !8 br i1 %tobool, label %if.then, label %if.else, !dbg !10 if.then: ; preds = %entry %add = add nsw i32 %x, %y, !dbg !11 %idxprom = sext i32 %i to i64, !dbg !12 %arrayidx = getelementptr inbounds [10 x %struct.S1], [10 x %struct.S1]* @s1, i64 0, i64 %idxprom, !dbg !12 %0 = load %struct.S1, %struct.S1* %arrayidx, align 8, !dbg !12, !tbaa !13 %a = getelementptr inbounds %struct.S1, %struct.S1* %0, i32 0, i32 0, !dbg !17 br label %if.end, !dbg !12 if.else: ; preds = %entry %idxprom3 = sext i32 %i to i64, !dbg !18 %arrayidx4 = getelementptr inbounds [10 x %struct.S1], [10 x %struct.S1]* @s1, i64 0, i64 %idxprom3, !dbg !18 %1 = load %struct.S1, %struct.S1* %arrayidx4, align 8, !dbg !18, !tbaa !13 %a5 = getelementptr inbounds %struct.S1, %struct.S1* %1, i32 0, i32 0, !dbg !19 br label %if.end if.end: ; preds = %if.else, %if.then %a5.sink = phi [10 x i32]* [ %a5, %if.else ], [ %a, %if.then ] %.sink = phi i32 [ %x, %if.else ], [ %add, %if.then ] %idxprom6 = sext i32 %i to i64 %arrayidx7 = getelementptr inbounds [10 x i32], [10 x i32]* %a5.sink, i64 0, i64 %idxprom6 store i32 %.sink, i32* %arrayidx7, align 4, !tbaa !20 ret void, !dbg !22 } ``` where ``` !11 = !DILocation(line: 9, column: 18, scope: !9) !12 = !DILocation(line: 9, column: 5, scope: !9) !18 = !DILocation(line: 11, column: 5, scope: !9) !19 = !DILocation(line: 11, column: 9, scope: !9) ``` . And below is after gvn-hoist: ``` define void @foo(i32 %x, i32 %y, i32 %i) !dbg !4 { entry: %tobool = icmp ne i32 %y, 0, !dbg !8 %idxprom = sext i32 %i to i64, !dbg !10 %0 = getelementptr inbounds [10 x %struct.S1], [10 x %struct.S1]* @s1, i64 0, i64 %idxprom, !dbg !10 %1 = load %struct.S1, %struct.S1* %0, align 8, !dbg !10, !tbaa !11 br i1 %tobool, label %if.then, label %if.else, !dbg !15 if.then: ; preds = %entry %add = add nsw i32 %x, %y, !dbg !16 %arrayidx = getelementptr inbounds [10 x %struct.S1], [10 x %struct.S1]* @s1, i64 0, i64 %idxprom, !dbg !10 %a = getelementptr inbounds %struct.S1, %struct.S1* %1, i32 0, i32 0, !dbg !17 br label %if.end, !dbg !10 if.else: ; preds = %entry %arrayidx4 = getelementptr inbounds [10 x %struct.S1], [10 x %struct.S1]* @s1, i64 0, i64 %idxprom, !dbg !18 %a5 = getelementptr inbounds %struct.S1, %struct.S1* %1, i32 0, i32 0, !dbg !19 br label %if.end if.end: ; preds = %if.else, %if.then %a5.sink = phi [10 x i32]* [ %a5, %if.else ], [ %a, %if.then ] %.sink = phi i32 [ %x, %if.else ], [ %add, %if.then ] %arrayidx7 = getelementptr inbounds [10 x i32], [10 x i32]* %a5.sink, i64 0, i64 %idxprom store i32 %.sink, i32* %arrayidx7, align 4, !tbaa !20 ret void, !dbg !22 } ``` As you see, loads and their GEPs have been hosited from if.then/if.else block to entry block. However, DebugLoc metadata of these new instructions are still same as the instructions in if.then block, as they are moved/cloned from if.then block. This may result incorrect stepping and imprecise sample profile result. Reviewers: majnemer, pcc, sebpop Reviewed By: sebpop Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29377 llvm-svn: 294250	2017-02-06 22:05:04 +00:00
Tim Northover	6f2db57dae	GlobalISel: fall back gracefully when we can't map an operand's size. AArch64 was asserting when it was asked to provide a register-bank of a size it couldn't deal with (in this case an s128 IMPLICIT_DEF). But we want a robust fallback path so this isn't allowed. llvm-svn: 294248	2017-02-06 21:57:06 +00:00
Tim Northover	0e6afbdd77	GlobalISel: legalize G_INSERT instructions We don't handle all cases yet (see arm64-fallback.ll for an example), but this is enough to cover most common C++ code so it's a good place to start. llvm-svn: 294247	2017-02-06 21:56:47 +00:00
Eugene Zelenko	90562dfb50	[X86] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies.(vlsj-clangbuild)[622] llvm-svn: 294246	2017-02-06 21:55:43 +00:00
Michael Kuperstein	7a86bb2589	[SLP] Revert "Allow using of extra values in horizontal reductions." This breaks when one of the extra values is also a scalar that participates in the same vectorization tree which we'll end up reducing. llvm-svn: 294245	2017-02-06 21:50:59 +00:00
Peter Collingbourne	e69e73c7b8	IR: Consider two DISubprograms to be odr-equal if they have the same template parameters. In ValueMapper we create new operands for MDNodes and rely on MDNode::replaceWithUniqued to create a new MDNode with the specified operands. However this doesn't always actually happen correctly for DISubprograms because when we uniquify the new node, we only odr-compare it with existing nodes (MDNodeSubsetEqualImpl<DISubprogram>::isDeclarationOfODRMember). Although the TemplateParameters field can refer to a distinct DICompileUnit via DITemplateTypeParameter::type -> DICompositeType::scope -> DISubprogram::unit, it is not currently included in the odr comparison. As a result, we can end up getting our original DISubprogram back, which means we will have a cloned module referring to the DICompileUnit in the original module, which causes a verification error. The fix I implemented was to consider TemplateParameters to be one of the odr-equal properties. But I'm a little uncomfortable with this. In general it seems unsound to rely on distinct MDNodes never being reachable from nodes which we only check odr-equality of. My only long term suggestion would be to separate odr-uniquing from full uniquing. Differential Revision: https://reviews.llvm.org/D29240 llvm-svn: 294240	2017-02-06 21:23:03 +00:00
Kostya Serebryany	c24ec32370	[libFuzzer] make code less clever to avoid fallthrough in switch (and in turn avoid compiler warnings). NFC. Suggested by Christian Holler. llvm-svn: 294239	2017-02-06 21:21:37 +00:00
Chandler Carruth	a80cfb3063	[PM/LCG] Fix the no-asserts build after r294227. Sorry for the noise. llvm-svn: 294235	2017-02-06 20:59:07 +00:00
David Blaikie	efc4eba816	Get function start line number from DWARF info DWARF info contains info about the line number at which a function starts (DW_AT_decl_line). This patch creates a function to look up the start line number for a function, and returns it in DILineInfo when looking up debug info for a particular address. Patch by Simon Que! Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D27962 llvm-svn: 294231	2017-02-06 20:19:02 +00:00
Chandler Carruth	2e0fe3e65b	[PM/LCG] Remove the lazy RefSCC formation from the LazyCallGraph during iteration. The lazy formation of RefSCCs isn't really the most important part of the laziness here -- that has to do with walking the functions themselves -- and isn't essential to maintain. Originally, there were incremental update algorithms that relied on updates happening predominantly near the most recent RefSCC formed, but those have been replaced with ones that have much tighter general case bounds at this point. We do still perform asserts that only scale well due to this incrementality, but those are easy to place behind EXPENSIVE_CHECKS. Removing this simplifies the entire analysis by having a single up-front step that builds all of the RefSCCs in a direct Tarjan walk. We can even easily replace this with other or better algorithms at will and with much less confusion now that there is no iterator-based incremental logic involved. This removes a lot of complexity from LCG. Another advantage of moving in this direction is that it simplifies testing the system substantially as we no longer have to worry about observing and mutating the graph half-way through the RefSCC formation. We still need a somewhat special iterator for RefSCCs because we want the iterator to remain stable in the face of graph updates. However, this now merely involves relative indexing to the current RefSCC's position in the sequence which isn't too hard. Differential Revision: https://reviews.llvm.org/D29381 llvm-svn: 294227	2017-02-06 19:38:06 +00:00
Krzysztof Parzyszek	8cdfe8ecf3	[Hexagon] Update MCTargetDesc Changes include: - Updates to the instruction descriptor flags. - Improvements to the packet shuffler and checker. - Updates to the handling of certain relocations. - Better handling of duplex instructions. llvm-svn: 294226	2017-02-06 19:35:46 +00:00
Sanjay Patel	54656ca7db	[ValueTracking] emit a remark when we detect a conflicting assumption (PR31809) This is a follow-up to D29395 where we try to be good citizens and let the user know that we've probably gone off the rails. This should allow us to resolve: https://llvm.org/bugs/show_bug.cgi?id=31809 Differential Revision: https://reviews.llvm.org/D29404 llvm-svn: 294208	2017-02-06 18:26:06 +00:00
Dehao Chen	c81483d79c	Fix the bug of samplepgo indirect call promption when type casting of the return value is needed. Summary: When type casting of the return value is needed, promoteIndirectCall will return the type casting instruction instead of the direct call. This patch changed to return the direct call instruction instead. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29569 llvm-svn: 294205	2017-02-06 18:10:36 +00:00
John Brawn	3a9c842a9d	[AArch64] Fix incorrect MachinePointerInfo in splitStoreSplat When splitting up one store into several in splitStoreSplat we have to make sure we get the MachinePointerInfo right, otherwise alias analysis thinks they all store to the same location. This can then cause invalid scheduling later on. Differential Revision: https://reviews.llvm.org/D29446 llvm-svn: 294203	2017-02-06 18:07:20 +00:00
Artur Pilipenko	d3464bf9ad	[DAGCombiner] Support bswap as a part of load combine patterns Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D29397 llvm-svn: 294201	2017-02-06 17:48:08 +00:00
Sanjay Patel	cf4c90f3d3	[InstCombine] simplify dyn_cast + isa; NFCI llvm-svn: 294198	2017-02-06 17:16:16 +00:00
Eugene Leviant	3e582c8855	RuntimeDyldELF/AArch64: Implement basic GOT support This patch implements two GOT relocations: R_AARCH64_ADR_GOT_PAGE and R_AARCH64_LD64_GOT_LO12_NC Differential revision: https://reviews.llvm.org/D28571 llvm-svn: 294191	2017-02-06 15:31:28 +00:00
Amaury Sechet	e674f5c758	Add ADDC to SelectionDAG::computeKnownBits and ComputeNumSignBits. Summary: As per title. Reviewers: bkramer, sunfish, lattner, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29521 llvm-svn: 294188	2017-02-06 14:59:06 +00:00
Amaury Sechet	8a3b32941d	[DAGCombiner] Make DAGCombiner smarter about overflow Summary: Leverage it to transform addc into add. Reviewers: mkuper, spatel, RKSimon, zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29524 llvm-svn: 294187	2017-02-06 14:54:49 +00:00
Amaury Sechet	1d466f598e	[DAGCombiner] (add X, (adde Y, 0, Carry)) -> (adde X, Y, Carry) Summary: This is extracted from D29443 . Reviewers: mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29564 llvm-svn: 294186	2017-02-06 14:28:39 +00:00
Simon Pilgrim	bfd4495512	[X86][SSE] Combine shuffle nodes with multiple uses if all the users are being combined. Currently we only combine shuffle nodes if they have a single user to prevent us from causing code bloat by splitting the shuffles into several different combines. We don't take into account that in some cases we will already have combined all the users during recursively calling up the shuffle tree. This patch keeps a list of all the shuffle nodes that have been combined so far and permits combining of further shuffle nodes if all its users are in that list. Differential Revision: https://reviews.llvm.org/D29399 llvm-svn: 294183	2017-02-06 13:44:45 +00:00
Simon Dardis	3aa8a90eff	[mips] dla expansion without the at register Previously only the superscalar scheduled expansion of the dla macro for MIPS64 was implemented. If assembler temporary register is not available and the optional source register is not the destination register, synthesize the address using the naive solution of adds and shifts. This partially resolves PR/30383. Thanks to Sean Bruno for reporting the issue! Reviewers: slthakur, seanbruno Differential Revision: https://reviews.llvm.org/D29328 llvm-svn: 294182	2017-02-06 12:43:46 +00:00
Daniil Fukalov	6378bdb2dd	[SCEV] limit recursion depth and operands number in getAddExpr for a quite big function with source like %add = add nsw i32 %mul, %conv %mul1 = mul nsw i32 %add, %conv %add2 = add nsw i32 %mul1, %add %mul3 = mul nsw i32 %add2, %add ; repeat couple of thousands times that can be produced by loop unroll, getAddExpr() tries to recursively construct SCEV and runs almost infinite time. Added recursion depth restriction (with new parameter to set it) Reviewers: sanjoy Subscribers: hfinkel, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D28158 llvm-svn: 294181	2017-02-06 12:38:06 +00:00
Dylan McKay	0acfafdbd6	[AVR] Use 'print' instead of 'dump' This should fix an undefined reference on the AVR buildbot. llvm-svn: 294175	2017-02-06 08:43:30 +00:00
Igor Breger	5c31a4c9a3	[X86][GlobalISel] Add limited ret lowering support to the IRTranslator. Summary: Support return lowering for i8/i16/i32/i64/float/double, vector type supported for 64bit platform only. Support argument lowering for float/double types. Reviewers: t.p.northover, zvi, ab, rovka Reviewed By: zvi Subscribers: dberris, kristof.beyls, delena, llvm-commits Differential Revision: https://reviews.llvm.org/D29261 llvm-svn: 294173	2017-02-06 08:37:41 +00:00
Craig Topper	5d9ecd23e8	[AVX-512] Add VPSLLDQ/VPSRLDQ to load folding tables. llvm-svn: 294170	2017-02-06 05:12:14 +00:00
Craig Topper	f0eb60a6f3	[AVX-512] Add VPABSB/D/Q/W to load folding tables. llvm-svn: 294169	2017-02-06 03:18:01 +00:00
Craig Topper	864b1a5376	[AVX-512] Add VSHUFPS/PD to load folding tables. llvm-svn: 294168	2017-02-06 03:17:58 +00:00
Craig Topper	75218fb6b1	[AVX-512] Add VPMULLD/Q/W instructions to load folding tables. llvm-svn: 294164	2017-02-06 01:19:26 +00:00
Craig Topper	452a7770e6	[AVX-512] Add all masked and unmasked versions of VPMULDQ and VPMULUDQ to load folding tables. llvm-svn: 294163	2017-02-05 23:31:48 +00:00
Simon Pilgrim	380ce75687	[X86][SSE] Replace insert_vector_elt(vec, -1, idx) with shuffle Similar to what we already do for zero elt insertion, we can quickly rematerialize 'allbits' vectors so to avoid a unnecessary gpr value and insertion into a vector llvm-svn: 294162	2017-02-05 22:50:29 +00:00
Craig Topper	8eb1f315ac	[AVX-512] Add scalar masked max/min intrinsic instructions to the load folding tables. llvm-svn: 294153	2017-02-05 22:25:46 +00:00
Craig Topper	cb4bc8be5b	[AVX-512] Add scalar masked add/sub/mul/div intrinsic instructions to the load folding tables. llvm-svn: 294152	2017-02-05 22:25:42 +00:00
Craig Topper	59af67206d	[AVX-512] Add masked scalar FMA intrinsics to isNonFoldablePartialRegisterLoad to improve load folding of scalar loads. llvm-svn: 294151	2017-02-05 22:25:40 +00:00
Dylan McKay	ccd819ad94	[AVR] Implement stacksave/stackrestore by expanding (PR31342) Summary: Authored by Florian Zeitz. This implements the missing stacksave/stackrestore intrinsics via expansion. Output of `llc -O0 -march=avr ~/devel/llvm/test/CodeGen/Generic/stacksave-restore.ll` for sanity checking (comments mine): ``` .text .file ".../llvm/test/CodeGen/Generic/stacksave-restore.ll" .globl test .p2align 1 .type test,@function test: ; @test ; BB#0: push r28 push r29 in r28, 61 in r29, 62 sbiw r28, 4 in r0, 63 cli out 62, r29 out 63, r0 out 61, r28 in r18, 61 in r19, 62 mov r20, r22 mov r21, r23 in r30, 61 in r31, 62 lsl r22 rol r23 lsl r22 rol r23 in r26, 61 in r27, 62 sub r26, r22 sbc r27, r23 andi r26, 252 in r0, 63 cli out 62, r27 out 63, r0 out 61, r26 in r0, 63 cli out 62, r31 out 63, r0 out 61, r30 in r30, 61 in r31, 62 sub r30, r22 sbc r31, r23 andi r30, 252 in r0, 63 cli out 62, r31 out 63, r0 out 61, r30 std Y+3, r24 ; 2-byte Folded Spill std Y+4, r25 ; 2-byte Folded Spill mov r24, r26 mov r25, r27 in r0, 63 cli out 62, r19 out 63, r0 out 61, r18 std Y+1, r20 ; 2-byte Folded Spill std Y+2, r21 ; 2-byte Folded Spill adiw r28, 4 in r0, 63 cli out 62, r29 out 63, r0 out 61, r28 pop r29 pop r28 ret .Lfunc_end0: .size test, .Lfunc_end0-test ``` Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29553 llvm-svn: 294146	2017-02-05 21:35:45 +00:00
Kamil Rytarowski	5d2bd8dd54	Revamp llvm::once_flag to be closer to std::once_flag Summary: Make this interface reusable similarly to std::call_once and std::once_flag interface. This makes porting LLDB to NetBSD easier as there was in the original approach a portable way to specify a non-static once_flag. With this change translating std::once_flag to llvm::once_flag is mechanical. Sponsored by <The NetBSD Foundation> Reviewers: mehdi_amini, labath, joerg Reviewed By: mehdi_amini Subscribers: emaste, clayborg Differential Revision: https://reviews.llvm.org/D29566 llvm-svn: 294143	2017-02-05 21:13:06 +00:00
Craig Topper	cac328f25e	[X86] Fix printing of sha256rnds2 to include the implicit %xmm0 argument. llvm-svn: 294132	2017-02-05 18:33:31 +00:00
Craig Topper	d7ae9ab1fa	[X86] Fix printing of blendvpd/blendvps/pblendvb to include the implicit %xmm0 argument. This makes codegen output more obvious about the %xmm0 usage. llvm-svn: 294131	2017-02-05 18:33:24 +00:00
Craig Topper	6a35a81fc5	[X86] In LowerTRUNCATE, create an ISD::VECTOR_SHUFFLE instead of explicitly creating a PSHUFB. This will be lowered by regular shuffle lowering to a PSHUFB later. Similar was already done for several other shuffles in this function. The test changes are because the old code used explicity zeroing for elements that could have been undef. While I was here I also changed other shuffle vectors in the same function to use the same input twice instead of creating UNDEF nodes. getVectorShuffle can create the UNDEF for us. llvm-svn: 294130	2017-02-05 18:33:14 +00:00
Geoff Berry	76ca8c2b34	[SelectionDAG] In InstrEmitter, handle EXTRACT_SUBREG of a physical register. Summary: Without this change, the getVR() call would hit an assert since it was being passed a physical register. Update the AArch64/ldst-opt.ll test with a case that triggers this behavior by adding a run with strict-align, which causes an unaligned STR XZR instruction to be split into byte stores, creating an EXTRACT_SUBREG of XZR that triggers the original problem. Reviewers: bogner, qcolombet, MatzeB, atrick Subscribers: aemerson, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D29495 llvm-svn: 294129	2017-02-05 18:28:14 +00:00
Amaury Sechet	143902c29f	[DAGCombiner] Leverage add's commutativity Summary: This avoid the need to duplicate all pattern and actually end up exposing some opportunity to optimize existing pattern that did not exists in both directions on an existing test case. Reviewers: mkuper, spatel, bkramer, RKSimon, zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29541 llvm-svn: 294125	2017-02-05 14:22:20 +00:00
Daniel Sanders	c3ac566754	[globalisel][arm] Tablegen-erate current Register Bank Information. Summary: This patch tablegen-erates the ARM register bank information so that the static tables added in D27807 no longer need to be maintained. Depends on D27338 Reviewers: t.p.northover, rovka, ab, qcolombet, aditya_nandakumar Reviewed By: rovka Subscribers: aemerson, rengolin, mgorny, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D28567 llvm-svn: 294124	2017-02-05 12:07:55 +00:00
Dylan McKay	b78f36657e	[AVR] Fix a bug where asm operands are printed twice We would unconditionally call printOperand, even if PrintAsmOperand already printed the immediate. llvm-svn: 294121	2017-02-05 10:42:49 +00:00
Dylan McKay	7a3eb290ef	[AVR] Support zero-sized arguments in defined methods It is sufficient to skip emission of these arguments as we have nothing to actually pass through the function call. The AVR-GCC reference has nothing to say about zero-sized arguments, presumably because C/C++ doesn't support them. This means we don't have to worry about ABI differences. llvm-svn: 294119	2017-02-05 09:53:45 +00:00
Dehao Chen	448b5790f6	Refactor SampleProfile.cpp to make it cleaner. (NFC) llvm-svn: 294118	2017-02-05 07:32:17 +00:00
Craig Topper	978fdb75a4	[X86] Add support for folding (insert_subvector vec1, (extract_subvector vec2, idx1), idx1) -> (blendi vec2, vec1). llvm-svn: 294112	2017-02-04 23:26:46 +00:00
Craig Topper	3d95228dbe	[X86] Simplify the code that turns INSERT_SUBVECTOR into BLENDI. NFCI llvm-svn: 294111	2017-02-04 23:26:42 +00:00
Craig Topper	42b83f8d6e	[DAGCombiner] Canonicalize the order of a chain of INSERT_SUBVECTORs. Based on similar code for INSERT_VECTOR_ELT. llvm-svn: 294110	2017-02-04 23:26:39 +00:00
Craig Topper	04dce84ead	[DAGCombiner] Use DAG.getAnyExtOrTrunc to simplify some code. NFC llvm-svn: 294109	2017-02-04 23:26:37 +00:00
Craig Topper	ceaf9c1633	[DAGCombiner] In visitINSERT_VECTOR_ELT, move check for BUILD_VECTOR being legal below code that just canonicalizes INSERT_VECTOR_ELT without creating BUILD_VECTORS. llvm-svn: 294108	2017-02-04 23:26:34 +00:00
Davide Italiano	ec49313b11	[IPCP] Don't propagate return value for naked functions. This is pretty much the same change made in SCCP. llvm-svn: 294098	2017-02-04 19:44:14 +00:00
Amaury Sechet	6e2d8e49ec	Formatting in DAGCombiner. NFC llvm-svn: 294091	2017-02-04 13:01:53 +00:00
Xinliang David Li	c7db0d0e75	Fix variable name /NFC llvm-svn: 294090	2017-02-04 07:40:43 +00:00
Matthias Braun	82e7f4d877	MachineCopyPropagation: Respect implicit operands of COPY The code missed to check implicit operands of COPY instructions for defs/uses. Differential Revision: https://reviews.llvm.org/D29522 llvm-svn: 294088	2017-02-04 02:27:20 +00:00
Matthias Braun	776a1d7ecb	MachineCopyPropagation: Do not consider undef operands as clobbers This was originally introduced in r278321 to work around correctness problems in the ExecutionDepsFix pass; Probably also to keep the performance benefits of breaking the false dependencies which of course also affect undef operands. ExecutionDepsFix has been improved here recently (see for example r278321) so we should not need this exception any longer. Differential Revision: https://reviews.llvm.org/D29525 llvm-svn: 294087	2017-02-04 02:27:13 +00:00
Kyle Butt	c7d67eef5a	[CodeGen]: BlockPlacement: Skip extraneous logging. Move a check for blocks that are not candidates for tail duplication up before the logging. Reduces logging noise. No non-logging changes intended. llvm-svn: 294086	2017-02-04 02:26:34 +00:00
Kyle Butt	e9425c4ff8	[CodeGen]: BlockPlacement: Apply const liberally. NFC Anything that needs to be passed to AnalyzeBranch unfortunately can't be const, or more would be const. Added const_iterator to BlockChain to allow BlockChain to be const when we don't expect to change it. llvm-svn: 294085	2017-02-04 02:26:32 +00:00
Eugene Zelenko	502d0bc28e	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce TargetInstrInfo.h dependencies. llvm-svn: 294084	2017-02-04 02:00:53 +00:00
Craig Topper	aa741abfc5	[TwoAddressInstruction] Fix typo in comment. NFC llvm-svn: 294083	2017-02-04 01:58:10 +00:00
Eric Christopher	b128abcf7a	Remove a bunch of unnecessary casts to a target specific version of TII and TRI as we're working from a target specific STI. llvm-svn: 294081	2017-02-04 01:52:17 +00:00
Bob Haarman	e2fe59fddd	fix nullptr Mangler in LTOModule Reviewers: kcc, pcc Subscribers: mehdi_amini Differential Revision: https://reviews.llvm.org/D29523 llvm-svn: 294079	2017-02-04 01:28:44 +00:00
Eugene Zelenko	3f37f07c7f	[Sparc] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294072	2017-02-04 00:36:49 +00:00
Brendon Cahoon	9809b10586	[RegisterCoalescer] Do not call getInstructionIndex with DBG_VALUE An assert occurs when calling SlotIndexes::getInstructionIndex with a DBG_VALUE instruction because the function expects an instruction with a slot index. However, there is no slot index for a DBG_VALUE instruction. Differential Revision: https://reviews.llvm.org/D29048 llvm-svn: 294070	2017-02-04 00:10:22 +00:00
Eugene Zelenko	cd8ea02b4a	[Mips] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294069	2017-02-03 23:39:33 +00:00
Eugene Zelenko	06869c04f3	[SystemZ] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294068	2017-02-03 23:39:06 +00:00
Eugene Zelenko	e894b4dc59	[AMDGPU] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294067	2017-02-03 23:38:40 +00:00
Sanjay Patel	0fe32ac256	[InstCombine] treat i1 as a special type in shouldChangeType() This patch is based on the llvm-dev discussion here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/109631.html Folding to i1 should always be desirable because that's better for value tracking and we have special folds for i1 types. I checked for other users of shouldChangeType() where this might have an effect, but we already handle the i1 case differently than other types in all of those cases. Side note: the default datalayout includes i1, so it seems we only find this gap in shouldChangeType + phi folding for the case when there is (1) an explicit datalayout without i1, (2) casting to i1 from a legal type, and (3) a phi with exactly 2 incoming casted operands (as Björn mentioned). Differential Revision: https://reviews.llvm.org/D29336 llvm-svn: 294066	2017-02-03 23:13:11 +00:00
Amaury Sechet	fb1756b35b	[APInt] Add integer API bor bitwise operations. Summary: As per title. I ran into that limitation of the API doing some other work, so I though that'd be a nice addition. Reviewers: jroelofs, compnerd, majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29503 llvm-svn: 294063	2017-02-03 22:54:41 +00:00
Kostya Serebryany	9f8e47b28c	[libFuzzer] properly hide the memcmp interceptor from msan llvm-svn: 294061	2017-02-03 22:51:38 +00:00
Xinliang David Li	6144a59b7f	[PGO] Add select instr profile in graph dump Differential Revision: http://reviews.llvm.org/D29474 llvm-svn: 294055	2017-02-03 21:57:51 +00:00
Eugene Zelenko	939f6b0167	[AArch64] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294053	2017-02-03 21:49:13 +00:00
Eugene Zelenko	07dc38f67a	[ARM] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294052	2017-02-03 21:48:12 +00:00
Eugene Zelenko	c3164b9c2f	[XCore] Fix some Include What You Use warnings; other minor fixes (NFC). This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294051	2017-02-03 21:46:55 +00:00
Sanjay Patel	73fc8ddb06	[InstCombine] fix operand-complexity-based canonicalization (PR28296) The code comments didn't match the code logic, and we didn't actually distinguish the fake unary (not/neg/fneg) operators from arguments. Adding another level to the weighting scheme provides more structure and can help simplify the pattern matching in InstCombine and other places. I fixed regressions that would have shown up from this change in: rL290067 rL290127 But that doesn't mean there are no pattern-matching logic holes left; some combines may just be missing regression tests. Should fix: https://llvm.org/bugs/show_bug.cgi?id=28296 Differential Revision: https://reviews.llvm.org/D27933 llvm-svn: 294049	2017-02-03 21:43:34 +00:00
Zachary Turner	5ce0f4a9de	Properly parse the TypeServer2 record. llvm-svn: 294046	2017-02-03 21:22:27 +00:00
Matt Arsenault	f15da6c419	AMDGPU: AsmParser cleanups Use typedef, remove unnecessary enum, line wraps. llvm-svn: 294039	2017-02-03 20:49:51 +00:00
Mike Aizatsky	1b65812267	[libfuzzer] chromium-related compilation fixes Reviewers: kcc Differential Revision: https://reviews.llvm.org/D29502 llvm-svn: 294035	2017-02-03 20:26:44 +00:00
Stanislav Mekhanoshin	81db53109d	[AMDGPU] Bump -amdgpu-unroll-threshold-private to 2000 This has quite positive performance impact according to measurements. Before previous fixes to limit the optimization that was too high and blowed compile time and scratch usage, but now this is gone and we can bump the threshold. Differential Revision: https://reviews.llvm.org/D29505 llvm-svn: 294032	2017-02-03 20:08:29 +00:00
Matt Arsenault	1fa5eacf9d	AMDGPU: Set MCAsmInfo::PointerSize llvm-svn: 294031	2017-02-03 20:02:23 +00:00
Matt Arsenault	d9cd736585	AMDGPU: Don't unroll for private with dynamic allocas This won't be elimnated, so this will just bloat code if/when these are ever used/supported. llvm-svn: 294030	2017-02-03 19:36:00 +00:00
Michael Kuperstein	2a735b71b6	[SLP] Make sortMemAccesses explicitly return an error. NFC. llvm-svn: 294029	2017-02-03 19:32:50 +00:00
Ahmed Bougacha	9677cc6fb7	[TLI] Robustize SDAG LibFunc proto checking by merging it into TLI. This re-applies commit r292189, reverted in r292191. SelectionDAGBuilder recognizes libfuncs using some homegrown parameter type-checking. Use TLI instead, removing another heap of redundant code. This isn't strictly NFC, as the SDAG code was too lax. Concretely, this means changes are required to a few tests: - calling a non-variadic function via a variadic prototype isn't OK; it just happens to work on x86_64 (but not on, e.g., aarch64). - mempcpy has a size_t parameter; the SDAG code accepts any integer type, which meant using i32 on x86_64 worked. - a handful of SystemZ tests check the SDAG support for lax prototype checking: Ulrich agrees on removing them. I don't think it's worth supporting any of these (IMO) invalid testcases. Instead, fix them to be more meaningful. llvm-svn: 294028	2017-02-03 19:11:19 +00:00
Michael Kuperstein	723999d4aa	[SLP] Use SCEV to sort memory accesses. This generalizes memory access sorting to use differences between SCEVs, instead of relying on constant offsets. That allows us to properly do SLP vectorization of non-sequentially ordered loads within loops bodies. Differential Revision: https://reviews.llvm.org/D29425 llvm-svn: 294027	2017-02-03 19:09:45 +00:00
Tim Northover	c3e3f59d12	GlobalISel: translate dynamic alloca instructions. llvm-svn: 294022	2017-02-03 18:22:45 +00:00
Simon Pilgrim	034c1bd32c	[X86][SSE] Add support for combining scalar_to_vector(extract_vector_elt) into a target shuffle. Correctly flagging upper elements as undef. llvm-svn: 294020	2017-02-03 17:59:58 +00:00
Anna Thomas	b555cc8cb6	NFC: [LoopUnroll] More meaningful message in tracing llvm-svn: 294017	2017-02-03 17:12:43 +00:00
Peter Collingbourne	e6fd9ff96a	IRMover: Merge flags LinkModuleInlineAsm and IsPerformingImport. Currently these flags are always the inverse of each other, so there is no need to keep them separate. Differential Revision: https://reviews.llvm.org/D29471 llvm-svn: 294016	2017-02-03 17:01:14 +00:00
Peter Collingbourne	7c70211653	ModuleLinker: Remove importing support. NFCI. Differential Revision: https://reviews.llvm.org/D29470 llvm-svn: 294015	2017-02-03 16:58:19 +00:00
Peter Collingbourne	6d8f817f8b	FunctionImport: Use IRMover directly. The importer was previously using ModuleLinker in a sort of "IRMover mode". Use IRMover directly instead in order to remove a level of indirection. I will remove all importing support from ModuleLinker in a separate change. Differential Revision: https://reviews.llvm.org/D29468 llvm-svn: 294014	2017-02-03 16:56:27 +00:00
Simon Dardis	68e9d94055	[mips] Remove absolute size assertion for end directive The .end <symbol> directive for MIPS marks the end of a symbol and sets the symbol's size. Previously, the corresponding emitDirective handler asserted that a function's size could be evaluated to an absolute value at that point in time. This cannot be done with when directives like .align have been encountered, instead set the function's size to the corresponding symbolic expression and let ELFObjectWriter resolve the expression to an absolute value. This avoids a redundant call to evaluateAsAbsolute. llvm-svn: 294012	2017-02-03 15:48:53 +00:00
Justin Lebar	e90c468444	[NVPTX] Enable combineRepeatedFPDivisors for NVPTX. Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D29477 llvm-svn: 294011	2017-02-03 15:13:50 +00:00
Artem Tamazov	43b61561b0	[AMDGPU][mc] Fix AddressSanitizer leftover issue in gfx7_asm_all test Issue occurs when assembling "ds_ordered_count v0, v0 gds". llvm-svn: 294004	2017-02-03 12:47:30 +00:00
Alexey Bataev	a0d9f2582b	[SelectionDAG] Fix for PR30775: Assertion `NodeToMatch->getOpcode() != ISD::DELETED_NODE && "NodeToMatch was removed partway through selection"' failed. NodeToMatch can be modified during matching, but code does not handle this situation. Differential Revision: https://reviews.llvm.org/D29292 llvm-svn: 294003	2017-02-03 12:28:40 +00:00
Sanne Wouda	a994185757	[ARM] Change TCReturn to tBL if tailcall optimization fails. Summary: The tail call optimisation is performed before register allocation, so at that point we don't know if LR is being spilt or not. If LR was spilt to the stack, then we cannot do a tail call optimisation. That would involve popping back into LR which is not possible in Thumb1 code. Reviewers: rengolin, jmolloy, rovka, olista01 Reviewed By: olista01 Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D29020 llvm-svn: 294000	2017-02-03 11:15:53 +00:00
Alexey Bataev	a16cfe6fa9	[SLP] Fix for PR31690: Allow using of extra values in horizontal reductions. Currently LLVM supports vectorization of horizontal reduction instructions with initial value set to 0. Patch supports vectorization of reduction with non-zero initial values. Also it supports a vectorization of instructions with some extra arguments, like: float f(float x[], int a, int b) { float p = a % b; p += x[0] + 3; for (int i = 1; i < 32; i++) p += x[i]; return p; } Patch allows vectorization of this kind of horizontal reductions. Differential Revision: https://reviews.llvm.org/D28961 llvm-svn: 293994	2017-02-03 08:08:50 +00:00
Mehdi Amini	1380edf4ef	Revert "[ThinLTO] Add an auto-hide feature" This reverts commit r293970. After more discussion, this belongs to the linker side and there is no added value to do it at this level. llvm-svn: 293993	2017-02-03 07:41:43 +00:00
Stanislav Mekhanoshin	f29602df65	[AMDGPU] Unroll preferences improvements Exit loop analysis early if suitable private access found. Do not account for GEPs which are invariant to loop induction variable. Do not account for Allocas which are too big to fit into register file anyway. Add option for tuning: -amdgpu-unroll-threshold-private. Differential Revision: https://reviews.llvm.org/D29473 llvm-svn: 293991	2017-02-03 02:20:05 +00:00
Marcos Pividori	db5a565514	[sanitizer coverage] Fix Instrumentation to work on Windows. On Windows, the symbols "___stop___sancov_guards" and "___start___sancov_guards" are not defined automatically. So, we need to take a different approach. We define 3 sections: Section ".SCOV$A" will only hold a variable ___start___sancov_guard. Section ".SCOV$M" will hold the main data. Section ".SCOV$Z" will only hold a variable ___stop___sancov_guards. When linking, they will be merged sorted by the characters after the $, so we can use the pointers of the variables ___[start\|stop]___sancov_guard to know the actual range of addresses of that section. In this diff, I updated instrumentation to include all the guard arrays in section ".SCOV$M". Differential Revision: https://reviews.llvm.org/D28434 llvm-svn: 293987	2017-02-03 01:08:06 +00:00
Matt Arsenault	e1b595306d	AMDGPU: Fold fneg into fmin/fmax_legacy llvm-svn: 293972	2017-02-03 00:51:50 +00:00
David Blaikie	a0e3c75187	DebugInfo: ensure type and namespace names are included in pubnames/pubtypes even when they are only present in type units While looking to add support for placing singular types (types that will only be emitted in one place (such as attached to a strong vtable or explicit template instantiation definition)) not in type units (since type units have overhead) I stumbled across that change causing an increase in pubtypes. Turns out we were missing some types from type units if they were only referenced from other type units and not from the debug_info section. This fixes that, following GCC's line of describing the offset of such entities as the CU die (since there's no compile unit-relative offset that would describe such an entity - they aren't in the CU). Also like GCC, this change prefers to describe the type stub within the CU rather than the "just use the CU offset" fallback where possible. This may give the DWARF consumer some opportunity to find the extra info in the type stub - though I'm not sure GDB does anything with this currently. The size of the pubnames/pubtypes sections now match exactly with or without type units enabled. This nearly triples (+189%) the pubtypes section for a clang self-host and grows pubnames by 0.07% (without compression). For a total of 8% increase in debug info sections of the objects of a Split DWARF build when using type units. llvm-svn: 293971	2017-02-03 00:44:18 +00:00
Mehdi Amini	b0a8ff71e5	[ThinLTO] Add an auto-hide feature When a symbol is not exported outside of the DSO, it is can be hidden. Usually we try to internalize as much as possible, but it is not always possible, for instance a symbol can be referenced outside of the LTO unit, or there can be cross-module reference in ThinLTO. This is a recommit of r293912 after fixing build failures, and a recommit of r293918 after fixing LLD tests. Differential Revision: https://reviews.llvm.org/D28978 llvm-svn: 293970	2017-02-03 00:32:38 +00:00
Craig Topper	bbb2b95ce5	[X86] Mark 256-bit and 512-bit INSERT_SUBVECTOR operations as legal and remove the custom lowering. llvm-svn: 293969	2017-02-03 00:24:49 +00:00
Matt Arsenault	2511c031de	AMDGPU: Fold fneg into fminnum/fmaxnum llvm-svn: 293968	2017-02-03 00:23:15 +00:00
Matt Arsenault	a8fcfadf46	AMDGPU: Check if users of fneg can fold mods In multi-use cases this can save a few instructions. llvm-svn: 293962	2017-02-02 23:21:23 +00:00
Mehdi Amini	21c89dc920	Revert "[ThinLTO] Add an auto-hide feature" This reverts commit r293918, one lld test does not pass. llvm-svn: 293961	2017-02-02 23:20:36 +00:00
Bob Haarman	dd4ebc1d3b	[lto] add getLinkerOpts() Summary: Some compilers, including MSVC and Clang, allow linker options to be specified in source files. In the legacy LTO API, there is a getLinkerOpts() method that returns linker options for the bitcode module being processed. This change adds that method to the new API, so that the COFF linker can get the right linker options when using the new LTO API. Reviewers: pcc, ruiu, mehdi_amini, tejohnson Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D29207 llvm-svn: 293950	2017-02-02 23:00:49 +00:00
Eugene Zelenko	fbd13c5c12	[X86] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 293949	2017-02-02 22:55:55 +00:00
Reid Kleckner	3c467e225e	[X86] Avoid sorted order check in release builds Effectively reverts r290248 and fixes the unused function warning with ifndef NDEBUG. llvm-svn: 293945	2017-02-02 22:06:30 +00:00
Craig Topper	c45657375b	[X86] Move turning 256-bit INSERT_SUBVECTORS into BLENDI from legalize to DAG combine. On one test this seems to have given more chance for DAG combine to do other INSERT_SUBVECTOR/EXTRACT_SUBVECTOR combines before the BLENDI was created. Looks like we can still improve more by teaching DAG combine to optimize INSERT_SUBVECTOR/EXTRACT_SUBVECTOR with BLENDI. llvm-svn: 293944	2017-02-02 22:02:57 +00:00
Reid Kleckner	c35139ec0d	[CodeGen] Remove dead call-or-prologue enum from CCState This enum has been dead since Olivier Stannard re-implemented ARM byval handling in r202985 (2014). llvm-svn: 293943	2017-02-02 21:58:22 +00:00
Xinliang David Li	58fcc9bdce	[PGO] internal option cleanups 1. Added comments for options 2. Added missing option cl::desc field 3. Uniified function filter option for graph viewing. Now PGO count/raw-counts share the same filter option: -view-bfi-func-name=. llvm-svn: 293938	2017-02-02 21:29:17 +00:00
Rafael Espindola	13a79bbfe5	Change how we handle section symbols on ELF. On ELF every section can have a corresponding section symbol. When in an assembly file we have .quad .text the '.text' refers to that symbol. The way we used to handle them is to leave .text an undefined symbol until the very end when the object writer would map them to the actual section symbol. The problem with that is that anything before the end would see an undefined symbol. This could result in bad diagnostics (test/MC/AArch64/label-arithmetic-diags-elf.s), or incorrect results when using the asm streamer (est/MC/Mips/expansion-jal-sym-pic.s). Fixing this will also allow using the section symbol earlier for setting sh_link of SHF_METADATA sections. This patch includes a few hacks to avoid changing our behaviour when handling conflicts between section symbols and other symbols. I reported pr31850 to track that. llvm-svn: 293936	2017-02-02 21:26:06 +00:00
Javed Absar	bb8dcc6aec	[ARM] Classification Improvements to ARM Sched-Model. NFCI. This is the second in the series of patches to enable adding of machine sched-models for ARM processors easier and compact. This patch focuses on integer instructions and adds missing sched definitions. Reviewers: rovka, rengolin Differential Revision: https://reviews.llvm.org/D29127 llvm-svn: 293935	2017-02-02 21:08:12 +00:00
Quentin Colombet	5725f56bb0	[LiveRangeEdit] Don't mess up with LiveInterval when a new vreg is created. In r283838, we added the capability of splitting unspillable register. When doing so we had to make sure the split live-ranges were also unspillable and we did that by marking the related live-ranges in the delegate method that is called when a new vreg is created. However, by accessing the live-range there, we also triggered their lazy computation (LiveIntervalAnalysis::getInterval) which is not what we want in general. Indeed, later code in LiveRangeEdit is going to build the live-ranges this lazy computation may mess up that computation resulting in assertion failures. Namely, the createEmptyIntervalFrom method expect that the live-range is going to be empty, not computed. Thanks to Mikael Holmén <mikael.holmen@ericsson.com> for noticing and reporting the problem. llvm-svn: 293934	2017-02-02 20:44:36 +00:00
Krzysztof Parzyszek	d0d42f0ec8	[Hexagon] Adding opExtentBits and opExtentAlign to GPrel instructions Patch by Colin LeMahieu. llvm-svn: 293933	2017-02-02 20:35:12 +00:00
Michael Kuperstein	e6d59fdca5	[X86] Add costs for non-AVX512 single-source permutation integer shuffles Differential Revision: https://reviews.llvm.org/D29416 llvm-svn: 293932	2017-02-02 20:27:13 +00:00
Krzysztof Parzyszek	e17b0bfb24	[Hexagon] Fix relocation kind for extended predicated calls Patch by Sid Manning. llvm-svn: 293931	2017-02-02 20:21:56 +00:00
Krzysztof Parzyszek	357b048666	[Hexagon] Remove A4_ext_* pseudo instructions Patch by Colin LeMahieu. llvm-svn: 293929	2017-02-02 19:58:22 +00:00
Kostya Serebryany	68382d0900	[libFuzzer] reorganize the tracing code to make it easier to experiment with inlined coverage instrumentation. NFC llvm-svn: 293928	2017-02-02 19:56:01 +00:00
Krzysztof Parzyszek	d67ab623f6	[Hexagon] Fix insertBranch for loops with multiple ENDLOOP instructions llvm-svn: 293925	2017-02-02 19:36:37 +00:00
Dan Gohman	b89f2d3d92	[WebAssembly] Add instruction definitions for drop and get/set_global. llvm-svn: 293922	2017-02-02 19:29:44 +00:00
Xinliang David Li	1eb4ec6a2e	[PGO] make graph view internal options available for all builds Differential Revision: https://reviews.llvm.org/D29259 llvm-svn: 293921	2017-02-02 19:18:56 +00:00
Marcos Pividori	d64360d935	[libFuzzer] Properly handle exceptions with UnhandledExceptionFilter. Use SetUnhandledExceptionFilter instead of AddVectoredExceptionHandler. According to the documentation on Structured Exception Handling, this is the order for the Exception Dispatching: + If the process is being debugged, the system notifies the debugger. + The Vectored Exception Handler is called. + The system attempts to locate a frame-based exception handler by searching the stack frames of the thread in which the exception occurred. + If no frame-based handler can be found, the UnhandledExceptionFilter filter is called. + Default handling based on the exception type. So, similar to what we do for asan, we should use SetUnhandledExceptionFilter instead of AddVectoredExceptionHandler, so user's code that is being fuzzed can execute frame-based exception handlers before we catch them . We want to catch unhandled exceptions, not all the exceptions. Differential Revision: https://reviews.llvm.org/D29462 llvm-svn: 293920	2017-02-02 19:07:53 +00:00
Peter Collingbourne	37e2459186	FunctionImport: Remove the -disable-force-link-odr flag and change importFunctions to never force link. This removes some functionality that was only being used by tests. Differential Revision: https://reviews.llvm.org/D29439 llvm-svn: 293919	2017-02-02 18:42:25 +00:00
Mehdi Amini	97624fb1ec	[ThinLTO] Add an auto-hide feature When a symbol is not exported outside of the DSO, it is can be hidden. Usually we try to internalize as much as possible, but it is not always possible, for instance a symbol can be referenced outside of the LTO unit, or there can be cross-module reference in ThinLTO. This is a recommit of r293912 after fixing build failures. Differential Revision: https://reviews.llvm.org/D28978 llvm-svn: 293918	2017-02-02 18:31:35 +00:00
Nirav Dave	93f9d5ce04	Revert "In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled." This reverts commit r293893 which is miscompiling lua on ARM and bootstrapping for x86-windows. llvm-svn: 293915	2017-02-02 18:24:55 +00:00
Mehdi Amini	827600deaf	Revert "[ThinLTO] Add an auto-hide feature" This reverts r293912, bots are broken. llvm-svn: 293914	2017-02-02 18:24:37 +00:00
Mehdi Amini	dc5a7444f0	[ThinLTO] Add an auto-hide feature When a symbol is not exported outside of the DSO, it is can be hidden. Usually we try to internalize as much as possible, but it is not always possible, for instance a symbol can be referenced outside of the LTO unit, or there can be cross-module reference in ThinLTO. Differential Revision: https://reviews.llvm.org/D28978 llvm-svn: 293912	2017-02-02 18:13:46 +00:00
Simon Dardis	08ce5fb66b	[mips] Expansion of BEQL and BNEL with immediate operands Adds support for BEQL and BNEL macros with immediate operands. Patch by: Srdjan Obucina Reviewers: dsanders, zoran.jovanovic, vkalintiris, sdardis, obucina, seanbruno Differential Revision: https://reviews.llvm.org/D17040 llvm-svn: 293905	2017-02-02 16:13:49 +00:00
Amaury Sechet	f3e421d6e9	Use N0 instead of N->getOperand(0) in DagCombiner::visitAdd. NFC llvm-svn: 293903	2017-02-02 16:07:44 +00:00
Jonas Paulsson	b7a2ef8375	[SystemZ] Add comment for ISD::FP_TO_UINT expansion. (Copied from the fp-conv-10.ll test to SystemZISelLowering.cpp) Review: Ulrich Weigand llvm-svn: 293900	2017-02-02 15:42:14 +00:00
Krzysztof Parzyszek	bc4dc9b4b9	[Hexagon] Emitting individual instructions without copying them Patch by Colin LeMahieu. llvm-svn: 293899	2017-02-02 15:32:26 +00:00
Jun Bum Lim	180bc5a021	[JumpThread] Enhance finding partial redundant loads by continuing scanning single predecessor Summary: While scanning predecessors to find an available loaded value, if the predecessor has a single predecessor, we can continue scanning through the single predecessor. Reviewers: mcrosier, rengolin, reames, davidxl, haicheng Reviewed By: rengolin Subscribers: zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D29200 llvm-svn: 293896	2017-02-02 15:12:34 +00:00
Krzysztof Parzyszek	f65b8f14f4	[Hexagon] Rename TypeCOMPOUND to TypeCJ llvm-svn: 293894	2017-02-02 15:03:30 +00:00
Nirav Dave	4442667fc5	In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled. Recommiting after fixing X86 inc/dec chain bug. * Simplify Consecutive Merge Store Candidate Search Now that address aliasing is much less conservative, push through simplified store merging search and chain alias analysis which only checks for parallel stores through the chain subgraph. This is cleaner as the separation of non-interfering loads/stores from the store-merging logic. When merging stores search up the chain through a single load, and finds all possible stores by looking down from through a load and a TokenFactor to all stores visited. This improves the quality of the output SelectionDAG and the output Codegen (save perhaps for some ARM cases where we correctly constructs wider loads, but then promotes them to float operations which appear but requires more expensive constant generation). Some minor peephole optimizations to deal with improved SubDAG shapes (listed below) Additional Minor Changes: 1. Finishes removing unused AliasLoad code 2. Unifies the chain aggregation in the merged stores across code paths 3. Re-add the Store node to the worklist after calling SimplifyDemandedBits. 4. Increase GatherAllAliasesMaxDepth from 6 to 18. That number is arbitrary, but seems sufficient to not cause regressions in tests. 5. Remove Chain dependencies of Memory operations on CopyfromReg nodes as these are captured by data dependence 6. Forward loads-store values through tokenfactors containing {CopyToReg,CopyFromReg} Values. 7. Peephole to convert buildvector of extract_vector_elt to extract_subvector if possible (see CodeGen/AArch64/store-merge.ll) 8. Store merging for the ARM target is restricted to 32-bit as some in some contexts invalid 64-bit operations are being generated. This can be removed once appropriate checks are added. This finishes the change Matt Arsenault started in r246307 and jyknight's original patch. Many tests required some changes as memory operations are now reorderable, improving load-store forwarding. One test in particular is worth noting: CodeGen/PowerPC/ppc64-align-long-double.ll - Improved load-store forwarding converts a load-store pair into a parallel store and a memory-realized bitcast of the same value. However, because we lose the sharing of the explicit and implicit store values we must create another local store. A similar transformation happens before SelectionDAG as well. Reviewers: arsenm, hfinkel, tstellarAMD, jyknight, nhaehnle llvm-svn: 293893	2017-02-02 14:39:42 +00:00
Nirav Dave	e14300e270	[X86,ISEL] Fix X86 increment chain dependence calculation Merging Load-add-store pattern into a increment op previously dropped the load's chain from the instructions dependence if the store is chained to a TokenFactor. llvm-svn: 293892	2017-02-02 14:39:26 +00:00
Diana Picus	32cd9b434c	[ARM] GlobalISel: Lower pointer args and returns It is important to change the ArgInfo's type from pointer to integer, otherwise the CC assign function won't know what to do. Instead of hacking it up, we use ComputeValueVTs and introduce some of the helpers that we will need later on for lowering more complex types. llvm-svn: 293889	2017-02-02 14:01:00 +00:00
Diana Picus	0c11c7b5c7	[ARM] GlobalISel: Error out instead of asserting Allow unknown types in TLI.getValueType, otherwise we get asserts for certain types that we do not support yet (instead of returning that we don't support them and falling through the normal error path). llvm-svn: 293888	2017-02-02 14:00:54 +00:00
Anna Thomas	7f4b26e189	[LICM] Hoist loads that are dominated by invariant.start intrinsic, and are invariant in the loop. Summary: We can hoist out loads that are dominated by invariant.start, to the preheader. We conservatively assume the load is variant, if we see a corresponding use of invariant.start (it could be an invariant.end or an escaping call). Reviewers: mkuper, sanjoy, reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29331 llvm-svn: 293887	2017-02-02 13:22:03 +00:00
Diana Picus	fc19a8ff07	[ARM] GlobalISel: Legalize loading pointers Make it legal to load pointer values. Also check that pointers are assigned to the GPR reg bank by default. llvm-svn: 293886	2017-02-02 13:20:49 +00:00
Simon Pilgrim	20ab6b875a	[X86][SSE] Use MOVMSK for all_of/any_of reduction patterns This is a first attempt at using the MOVMSK instructions to replace all_of/any_of reduction patterns (i.e. an and/or + shuffle chain). So far this only matches patterns where we are reducing an all/none bits source vector (i.e. a comparison result) but we should be able to expand on this in conjunction with improvements to 'bool vector' handling both in the x86 backend as well as the vectorizers etc. Differential Revision: https://reviews.llvm.org/D28810 llvm-svn: 293880	2017-02-02 11:52:33 +00:00
Craig Topper	047a8be18a	[X86] Remove some unused DAGCombinerInfo parameters. NFC llvm-svn: 293873	2017-02-02 08:03:23 +00:00
Craig Topper	94ed54b49a	[X86] Move some INSERT_SUBVECTOR optimizations from legalize to DAG combine. This moves creation of SUBV_BROADCAST and merging of adjacent loads that are being inserted together. This is a step towards removing legalizing of INSERT_SUBVECTOR except for vXi1 cases. llvm-svn: 293872	2017-02-02 08:03:20 +00:00
Adam Nemet	0bf1b863b9	[LV] Also port failure remarks to new OptimizationRemarkEmitter API llvm-svn: 293866	2017-02-02 05:41:51 +00:00
Peter Collingbourne	4613626d49	LTO: Link non-prevailing weak_odr or linkonce_odr globals into the combined module with available_externally linkage. These linkages mean that the ultimately prevailing symbol will have the same semantics as any non-prevailing copy of the symbol, so we are free to ignore the linker's resolution. Differential Revision: https://reviews.llvm.org/D29367 llvm-svn: 293865	2017-02-02 05:22:42 +00:00
Peter Collingbourne	c387e70c69	Linker: Move special casing for available_externally in IRMover to clients. NFCI. The goal is to simplify the semantic model for clients of IRMover. Differential Revision: https://reviews.llvm.org/D29435 llvm-svn: 293864	2017-02-02 05:12:15 +00:00
Craig Topper	b81e6c48f8	[AVX-512] Fix the implicit defs for VZEROALL/VZEROUPPER to include YMM16-YMM31. llvm-svn: 293862	2017-02-02 04:17:18 +00:00
Matt Arsenault	300836098f	InferAddressSpaces: Handle more cases with constant select operands llvm-svn: 293859	2017-02-02 03:37:22 +00:00
Matt Arsenault	9dba9bd4cf	AMDGPU: Use source modifiers with f16->f32 conversions The operand types were defined to fit the fp16_to_fp node, which has the half as an integer type. v_cvt_f32_f16 does support source modifiers, so change this to have an FP type and modifiers. For targets without legal f16, this requires recognizing the bit operations and trying to produce them. llvm-svn: 293857	2017-02-02 02:27:04 +00:00
Matthias Braun	9dc3b5ff89	RegisterCoalescer: Cleanup joinReservedPhysReg(); NFC - Factor out a common subexpression - Add some helpful comments - Fix printing of a register in a debug message llvm-svn: 293856	2017-02-02 02:23:27 +00:00
Matthias Braun	5b49f95592	AArch64RegisterInfo: Simplify getReservedReg(); NFC After marking a 32bit register and all its super registers the 64bit register does not need to be marked again. llvm-svn: 293855	2017-02-02 02:23:25 +00:00
Matt Arsenault	8e190b2f23	NVPTX: Fix not preserving volatile when expanding memset llvm-svn: 293851	2017-02-02 01:20:34 +00:00
Omair Javaid	f5d560bc84	Fix LLDB Android AArch64 GCC debug info build Committing after fixing suggested changes and tested release/debug builds on x86_64-linux and arm/aarch64 builds. Differential revision: https://reviews.llvm.org/D29042 llvm-svn: 293850	2017-02-02 01:17:49 +00:00
Rui Ueyama	a9b29615fb	Re-submit r293820: Return Error instead of bool from mergeTypeStreams(). llvm-svn: 293847	2017-02-02 00:47:10 +00:00
Davide Italiano	cb68f37184	[IPSCCP] Restore the old behaviour (pre r293799). It's not clear the change I made a good idea, and it definitely needs further discussion. Thanks to Eli for pointing out. llvm-svn: 293846	2017-02-02 00:46:54 +00:00
Peter Collingbourne	dc5e583687	X86: Produce @ABS8 symbol modifiers for absolute symbols in range [0,128). Differential Revision: https://reviews.llvm.org/D28689 llvm-svn: 293844	2017-02-02 00:32:03 +00:00
Matt Arsenault	db6e9e89a9	InferAddressSpaces: clang-format some things llvm-svn: 293843	2017-02-02 00:28:25 +00:00
Paul Robinson	5362216c36	Remove an assertion that doesn't hold when mixing -g and -gmlt through LTO. Replace it with a related assertion, ensuring that abstract variables appear only in abstract scopes. Part of PR31437. Differential Revision: http://reviews.llvm.org/D29430 llvm-svn: 293841	2017-02-01 23:51:56 +00:00
Stanislav Mekhanoshin	2b913b1f49	[AMDGPU] Account workgroup size in LDS occupancy limits Functions matching LDS use to occupancy return results for a workgroup of 64 workitems. The numbers has to be adjusted for bigger workgroups. For example a workgroup of size 256 already occupies 4 waves just by itself. Given that all numbers of LDS use in the compiler are per workgroup, occupancy shall be multiplied by 4 in this case. Each 64 workitems still limited by the same number, but 4 subrgoups 64 workitems each can afford 4 times more LDS to get the same occupancy. In addition change initializes LDS size in the subtarget to a real value for SI+ targets. This is required since LDS size is a variable in these calculations. Differential Revision: https://reviews.llvm.org/D29423 llvm-svn: 293837	2017-02-01 22:59:50 +00:00
Eugene Zelenko	c5eb8e29d0	[AArch64] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 293836	2017-02-01 22:56:06 +00:00
Dehao Chen	0944a8c2ec	Change debug-info-for-profiling from a TargetOption to a function attribute. Summary: LTO requires the debug-info-for-profiling to be a function attribute. Reviewers: echristo, mehdi_amini, dblaikie, probinson, aprantl Reviewed By: mehdi_amini, dblaikie, aprantl Subscribers: aprantl, probinson, ahatanak, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D29203 llvm-svn: 293833	2017-02-01 22:45:09 +00:00
Marcos Pividori	ba03abebfe	[libFuzzer] Disable afl tests on non-posix systems. AflDriver is not supported on non posix systems. Differential Revision: https://reviews.llvm.org/D29422 llvm-svn: 293830	2017-02-01 22:40:50 +00:00
Marcos Pividori	36464dd6a5	[libFuzzer] Disable equivalence tests on non posix systems. We can not run this test until we implement shared memory on Windows. Differential Revision: https://reviews.llvm.org/D29421 llvm-svn: 293829	2017-02-01 22:40:45 +00:00
Marcos Pividori	b056879700	[libFuzzer] Isolate merge tests that require posix. Differential Revision: https://reviews.llvm.org/D29420 llvm-svn: 293828	2017-02-01 22:40:40 +00:00
Marcos Pividori	9c0244c1eb	[libFuzzer] Add features `windows` and `posix` for lit tests. Add 2 features: posix and windows. Sometimes we want some specific tests only for posix and we use: REQUIRES: posix Sometimes we want some specific tests only for windows and we use: REQUIRES: windows Differential Revision: https://reviews.llvm.org/D29418 llvm-svn: 293827	2017-02-01 22:40:34 +00:00
Marcos Pividori	477d153045	[libFuzzer] Accept different extensions. Differential Revision: https://reviews.llvm.org/D29417 llvm-svn: 293826	2017-02-01 22:40:29 +00:00
Marcos Pividori	b340471ff5	[libFuzzer] Fix test because cmd prompt does not expand wildcard. Commands should expand the wildcards on Windows, the cmd prompt doesn't. Because of that sancov was not finding the needed file. To deal with this, we use ls and xargs from gnu win utils. Differential Revision: https://reviews.llvm.org/D29374 llvm-svn: 293825	2017-02-01 22:39:55 +00:00
Rui Ueyama	7d07a1652d	Revert r293820: Return Error instead of bool from mergeTypeStreams(). It broke buildbots. llvm-svn: 293824	2017-02-01 22:28:43 +00:00
Sanjay Patel	52e4e6594e	[ValueTracking] remove a FIXME for something we don't want to do; NFC The comment was added with: https://reviews.llvm.org/rL293773 ...but there would be a cost to implement this and possibly no payoff. llvm-svn: 293823	2017-02-01 22:27:34 +00:00
Rui Ueyama	00d4f49717	Return Error instead of bool from mergeTypeStreams(). Previously, mergeTypeStreams returns only true or false, so it was impossible to know the reason if it failed. This patch changes the function signature so that it returns an Error object. Differential Revision: https://reviews.llvm.org/D29362 llvm-svn: 293820	2017-02-01 22:09:34 +00:00
Paul Robinson	a380e613f1	Remove an assertion that doesn't hold when mixing -g and -gmlt through LTO. Part of PR31437. Differential Revision: http://reviews.llvm.org/D29310 llvm-svn: 293818	2017-02-01 21:54:50 +00:00
Sanjay Patel	c56d1ccd79	[InstCombine] move folds for shift-shift pairs; NFCI Although this is 'no-functional-change-intended', I'm adding tests for shl-shl and lshr-lshr pairs because there is no existing test coverage for those folds. It seems like we should be able to remove some code from foldShiftedShift() at this point because we're handling those patterns on the general path. llvm-svn: 293814	2017-02-01 21:31:34 +00:00
Michael Kuperstein	3c6b3ba258	Shut up another GCC warning about operator precedence. NFC. llvm-svn: 293812	2017-02-01 21:06:33 +00:00
Matt Arsenault	74f64833bc	AMDGPU: Allow clustering flat memory operations llvm-svn: 293809	2017-02-01 20:22:51 +00:00
Jun Bum Lim	423406fdcb	[JumpThread] No need to erase BB from LoopHeaders. NFC. Summary: No need to try to ease BB from LoopHeaders as we already know that BB is not in LoopHeaders. Reviewers: hsung, majnemer, mcrosier, haicheng, rengolin Reviewed By: rengolin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29232 llvm-svn: 293802	2017-02-01 19:06:55 +00:00
Davide Italiano	6849f20d85	[IPSCCP] Don't propagate return values of functions marked as noinline. This tries to address what Hal defined (in the post-commit review of r293727) a long-standing problem with noinline, where we end up de facto inlining trivial functions e.g. __attribute__((noinline)) int patatino(void) { return 5; } because of return value propagation. llvm-svn: 293799	2017-02-01 18:52:20 +00:00
Simon Dardis	ac9c30c37f	[mips] Parse the 'bopt' and 'nobopt' directives in IAS. The GAS assembler supports the ".set bopt" directive but according to the sources it doesn't do anything. It's supposed to optimize branches by filling the delay slot of a branch with it's target. This patch teaches the MIPS asm parser to accept both and warn in the case of 'bopt' that the bopt directive is unsupported. This resolves PR/31841. Thanks to Sean Bruno for reporting the issue! llvm-svn: 293798	2017-02-01 18:50:24 +00:00
Zachary Turner	d50c01308e	[pdb] Add a new command for analyzing hash collisions. This introduces the `analyze` subcommand. For now there is only one option, to analyze hash collisions in the type streams. In the future, however, we could add many more things here, such as performing size analyses, compacting, and statistics about the type of records etc. llvm-svn: 293795	2017-02-01 18:30:22 +00:00
Marcos Pividori	460886e3cf	[libFuzzer] Do not use llvm-objdump for disassembling a DSO. When disassembling a DSO, for calls to functions from the PLT, llvm-objdump only prints the offset from the PLT, like: <.plt+0x30>. While objdump and dumpbin print the function name, like: <__sanitizer_cov_trace_pc_guard@plt> When analyzing the coverage in libFuzzer we dissasemble and look for the calls to __sanitizer_cov_trace_pc_guard. So, this fails when using llvm-objdump on a DSO. Differential Revision: https://reviews.llvm.org/D29372 llvm-svn: 293791	2017-02-01 17:59:23 +00:00
Marcos Pividori	7a3a390afb	[libFuzzer] Properly check if we can use dumpbin. The flag "/sumary" is necessary, otherwise it returns a non-zero value. Differential Revision: https://reviews.llvm.org/D29371 llvm-svn: 293790	2017-02-01 17:59:19 +00:00
Matthew Simpson	ba5cf9dfee	[LV] Move interleaved access helper functions to VectorUtils (NFC) This patch moves some helper functions related to interleaved access vectorization out of LoopVectorize.cpp and into VectorUtils.cpp. We would like to use these functions in a follow-on patch that improves interleaved load and store lowering in (ARM/AArch64)ISelLowering.cpp. One of the functions was already duplicated there and has been removed. Differential Revision: https://reviews.llvm.org/D29398 llvm-svn: 293788	2017-02-01 17:45:46 +00:00

... 2 3 4 5 6 ...

99511 Commits