llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Bataev	03ca396b95	Revert "[SLP] Improve comments and naming of functions/variables/members, NFC." This reverts commit 6e311de8b907aa20da9a1a13ab07c3ce2ef4068a. llvm-svn: 304609	2017-06-02 23:09:15 +00:00
Quentin Colombet	60c9e88e1d	Change code formatting to look like the surrounding code clang-format decided differently and Matthias pointed out the difference. llvm-svn: 304608	2017-06-02 23:07:58 +00:00
Philip Reames	b70cecd60a	[Statepoint] Be consistent about using deopt naming [NFCI] We'd called this "vm state" in the early days, but have long since standardized on calling it "deopt" in line with the operand bundle tag. Fix a few cases we'd missed. llvm-svn: 304607	2017-06-02 23:03:26 +00:00
Matthias Braun	0021d46a1c	RegisterScavenging: Add ScavengerTest pass This pass allows running the register scavenging independently of PrologEpilogInserter to allow targeted testing. Also adds some basic register scavenging tests. llvm-svn: 304606	2017-06-02 23:01:42 +00:00
Matthias Braun	3e95165b70	InitializePasses: Sort initializer list (by ASCII) llvm-svn: 304605	2017-06-02 23:01:38 +00:00
Quentin Colombet	2145cf3f07	[RABasic] Properly update the LiveRegMatrix when LR splitting occurs Prior to this patch we used to not touch the LiveRegMatrix while doing live-range splitting. In other words, when live-range splitting was occurring, the LiveRegMatrix was not reflecting the changes. This is generally fine because it means the query to the LiveRegMatrix will be conservatively correct. However, when decisions are taken based on what is going to happen on the interferences (e.g., when we spill a register and know that it is going to be available for another one), we might hit an assertion that the color used for the assignment is still in use. This patch makes sure the changes on the live-ranges are properly reflected in the LiveRegMatrix, so the assertions don't break. An alternative could have been to remove the assertion, but it would make the invariants of the code and the general reasoning more complicated in my opinion. http://llvm.org/PR33057 llvm-svn: 304603	2017-06-02 22:46:31 +00:00
Quentin Colombet	ebbaed6d3c	[RABasic] Properly initialize the pass Use the initializeXXX method to initialize the RABasic pass in the pipeline. This enables us to take advantage of the .mir infrastructure. llvm-svn: 304602	2017-06-02 22:46:26 +00:00
Xinliang David Li	5fdc75aea1	Fix debug build test failure llvm-svn: 304600	2017-06-02 22:38:48 +00:00
Xinliang David Li	0b7d858fa3	[PartialInlining] Minor cost analysis tuning Also added a test option and 2 cost analysis related tests. llvm-svn: 304599	2017-06-02 22:08:04 +00:00
David Blaikie	6aeacaa527	FunctionAttrs: Skip it if the effective SCC (ignoring optnone functions) is empty Minor optimization but mostly simplifies my debugging so I'm not dealing with empty SCCNodeSets while investigating issues in this optimization. llvm-svn: 304597	2017-06-02 21:24:17 +00:00
Matthias Braun	dfa892139c	RegisterScavenging: Move scavenging logic from PEI to RegisterScavenging; NFC These parts do not depend on any PrologEpilogInserter logic and therefore better fit RegisterScavenging.cpp. llvm-svn: 304596	2017-06-02 21:02:03 +00:00
Zachary Turner	64726f2269	Fix build error on gcc. llvm-svn: 304595	2017-06-02 21:00:22 +00:00
Jun Bum Lim	2960d41e68	[InlineCost] Enable the new switch cost heuristic Summary: This is to enable the new switch inline cost heuristic (r301649) by removing the old heuristic as well as the flag itself. In my experiment for LLVM test suite and spec2000/2006, +17.82% performance and 8% code size reduction was observed in spec2000/vertex with O3 LTO in AArch64. No significant code size / performance regression was found in O3/O2/Os. No significant complaint was reported from the llvm-dev thread. Reviewers: hans, chandlerc, eraman, haicheng, mcrosier, bmakam, eastig, ddibyend, echristo Reviewed By: echristo Subscribers: javed.absar, kristof.beyls, echristo, aemerson, rengolin, mehdi_amini Differential Revision: https://reviews.llvm.org/D32653 llvm-svn: 304594	2017-06-02 20:42:54 +00:00
Alexey Bataev	2c08fde9e5	[SLP] Improve comments and naming of functions/variables/members, NFC. Summary: Fixed some comments, added an additional description of the algorithms, improved readability of the code. Reviewers: anemet Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D33320 llvm-svn: 304593	2017-06-02 20:39:27 +00:00
Ahmed Bougacha	018a68f9e4	[X86] Correctly broadcast NaN-like integers as float on AVX. Since r288804, we try to lower build_vectors on AVX using broadcasts of float/double. However, when we broadcast integer values that happen to have a NaN float bitpattern, we lose the NaN payload, thereby changing the integer value being broadcast. This is caused by ConstantFP::get, to which we pass the splat i32 as a float (by bitcasting it using bitsToFloat). ConstantFP::get takes a double parameter, so we end up lossily converting a single-precision NaN to double-precision. Instead, avoid any kinds of conversions by directly building an APFloat from the splatted APInt. Note that this also fixes another piece of code (broadcast of subvectors), that currently isn't susceptible to the same problem. Also note that we could really just use APInt and ConstantInt throughout: the constant pool type doesn't matter much. Still, for consistency, use the appropriate type. llvm-svn: 304590	2017-06-02 20:02:59 +00:00
Zachary Turner	4bedb5fd00	Fix build error with clang and gcc. llvm-svn: 304589	2017-06-02 20:00:10 +00:00
Zachary Turner	92dcdda623	[CodeView] Support CodeView subsections in any order. Previously we would expect certain subsections to appear in a certain order because some subsections would reference other subsections, but in practice we need to support arbitrary orderings since some object file and PDB file producers generate them this way. This also paves the way for supporting Yaml <-> Object File conversion of CodeView, since Object Files typically have quite a large number of subsections in their debug info. Differential Revision: https://reviews.llvm.org/D33807 llvm-svn: 304588	2017-06-02 19:49:14 +00:00
Petr Hosek	3440bc37ff	[CMake][runtimes] Add install target for runtimes builtins This adds an install-builtins target to avoid having to list all builtins targets explicitly. Differential Revision: https://reviews.llvm.org/D32710 llvm-svn: 304587	2017-06-02 19:38:11 +00:00
Amaury Sechet	04ffaca604	Regenerate expectation for wide-fma-contraction.ll . NFC llvm-svn: 304586	2017-06-02 19:15:04 +00:00
Keno Fischer	514a6a54e7	[SROA] Fix crash due to bad bitcast Summary: As shown in the test case, SROA was crashing when trying to split stores (to the alloca) of loads (from anywhere), because it assumed the pointer operand to the loads and stores had to have the same address space. This isn't the case. Make sure to use the correct pointer type for both the load and the store. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D32593 llvm-svn: 304585	2017-06-02 19:04:17 +00:00
Evgeniy Stepanov	63f056327d	[CFI] Remove LinkerSubsectionsViaSymbols. Since D17854 LinkerSubsectionsViaSymbols is unnecessary. It is interfering with ThinLTO implementation of CFI-ICall, where the aliases used on the !LinkerSubsectionsViaSymbols branch are needed to export jump tables to ThinLTO backends. llvm-svn: 304582	2017-06-02 18:45:14 +00:00
David Blaikie	358c012db2	BitcodeWriter: Removing unnecessary std::function in favor of template More cleanup from post-commit discussion on r304516 llvm-svn: 304579	2017-06-02 18:25:29 +00:00
Evgeniy Stepanov	b933ad3a77	Skip CFI for dead functions. Differential Revision: https://reviews.llvm.org/D33805 llvm-svn: 304578	2017-06-02 18:24:23 +00:00
Evgeniy Stepanov	659b3bc77d	Move summary dead stripping before regular LTO. This way dead stripping results are recorded in combined summary and can be used in regular LTO passes. Differential Revision: https://reviews.llvm.org/D33615 llvm-svn: 304577	2017-06-02 18:24:17 +00:00
Sanjay Patel	469014ada4	[x86] fix formatting; NFCI llvm-svn: 304576	2017-06-02 18:14:31 +00:00
Matt Arsenault	746e065716	AMDGPU: Register AMDGPUAlwaysInline llvm-svn: 304574	2017-06-02 18:02:42 +00:00
Reid Kleckner	146eb7a65f	Re-land "COFF: migrate def parser from LLD to LLVM" This reverts commit r304561 and re-lands r303490 & co. The fix was to use "SymbolName" when translating LLD's internal export list to lib/Object's short export struct. The SymbolName reflects the actual symbol name, which may include fastcall and stdcall mangling bits not included in the /EXPORT or .def file EXPORTS name: @@ -434,8 +434,7 @@ std::vector<COFFShortExport> createCOFFShortExportFromConfig() { std::vector<COFFShortExport> Exports; for (Export &E1 : Config->Exports) { COFFShortExport E2; - E2.Name = E1.Name; + // Use SymbolName, which will have any stdcall or fastcall qualifiers. + E2.Name = E1.SymbolName; E2.ExtName = E1.ExtName; E2.Ordinal = E1.Ordinal; E2.Noname = E1.Noname; llvm-svn: 304573	2017-06-02 17:53:06 +00:00
Konstantin Zhuravlyov	be6c0ca5e2	AMDGPU: Make auto waitcnt before barrier a feature Differential Revision: https://reviews.llvm.org/D33793 llvm-svn: 304571	2017-06-02 17:40:26 +00:00
Sanjay Patel	cdb5dad4cc	[TargetLowering] fix formatting; NFC llvm-svn: 304569	2017-06-02 17:35:02 +00:00
Craig Topper	9277a86f03	[LazyValueInfo] Fix formatting NFC. llvm-svn: 304567	2017-06-02 17:28:12 +00:00
David Blaikie	b6b42e018a	Tidy up a bit of r304516, use SmallVector::assign rather than for loop This might give a few better opportunities to optimize these to memcpy rather than loops - also a few minor cleanups (StringRef-izing, templating (to avoid std::function indirection), etc). The SmallVector::assign(iter, iter) could be improved with the use of SFINAE, but the (iter, iter) ctor and append(iter, iter) need it too and don't have it - so, work around it for now rather than bothering with the added complexity. (also, as noted in the added FIXME, these assign ops could potentially be optimized better at least for non-trivially-copyable types) llvm-svn: 304566	2017-06-02 17:24:26 +00:00
Philip Reames	0f02bbc6f4	Verify a couple more fields in STATEPOINT instructions While doing so, clarify the comments and update them to reflect current reality. Note: I'm going to let this sit for a week or so before adding further verification. I want to give this time to cycle through bots and merge it into our downstream tree before pushing this further. llvm-svn: 304565	2017-06-02 17:02:33 +00:00
Philip Reames	94cc4a29ed	Add placeholder for more extensive verification of pseudo ops This initial patch doesn't actually do much useful. It's just to show where the new code goes. Once this is in, I'll extend the verification logic to check more useful properties. For those curious, the more complicated version of this patch already found one very suspicious thing. Differential Revision: https://reviews.llvm.org/D33819 llvm-svn: 304564	2017-06-02 16:36:37 +00:00
Craig Topper	3778c8943b	[LazyValueInfo] Make solveBlockValueBinaryOp take a BinaryOperator* instead of Instruction*. This removes a cast of getOpcode to BinaryOps. llvm-svn: 304563	2017-06-02 16:33:13 +00:00
Sanjay Patel	ce241f48c5	[InstCombine] fix icmp with not op and constant to work with splat vector constant llvm-svn: 304562	2017-06-02 16:29:41 +00:00
Reid Kleckner	d249e4a188	Revert "COFF: migrate def parser from LLD to LLVM" This reverts commits r303490, r303491, r303493, and r303494. This caused http://crbug.com/728726. Essentially, exporting stdcall functions doesn't appear to work after this change. Reduced test case soon. llvm-svn: 304561	2017-06-02 16:26:24 +00:00
Craig Topper	84a9f168f1	[LazyValueInfo] Fix typo in comment. NFC llvm-svn: 304560	2017-06-02 16:21:13 +00:00
Craig Topper	b23e7c78a5	[InstSimplify][ConstantFolding] Teach constant folding how to handle icmp null, (inttoptr x) as well as it handles icmp (inttoptr x), null Summary: The constant folding code currently assumes that the constant expression will always be on the left and the simple null will be on the right. But that's not true at least on the path from InstSimplify. This patch adds support to ConstantFolding to detect the reversed case. Reviewers: spatel, dberlin, majnemer, davide, joey Reviewed By: joey Subscribers: joey, llvm-commits Differential Revision: https://reviews.llvm.org/D33801 llvm-svn: 304559	2017-06-02 16:17:32 +00:00
Sanjay Patel	4dc85eb75a	[InstCombine] improve perf by not creating a known non-canonical instruction Op1 (RHS) is a constant, so putting it on the LHS makes us churn through visitICmp an extra time to canonicalize it: INSTCOMBINE ITERATION #1 on cmpnot IC: ADDING: 3 instrs to worklist IC: Visiting: %notx = xor i8 %x, -1 IC: Visiting: %cmp = icmp sgt i8 %notx, 42 IC: Old = %cmp = icmp sgt i8 %notx, 42 New = <badref> = icmp sgt i8 -43, %x IC: ADD: %cmp = icmp sgt i8 -43, %x IC: ERASE %1 = icmp sgt i8 %notx, 42 IC: ADD: %notx = xor i8 %x, -1 IC: DCE: %notx = xor i8 %x, -1 IC: ERASE %notx = xor i8 %x, -1 IC: Visiting: %cmp = icmp sgt i8 -43, %x IC: Mod = %cmp = icmp sgt i8 -43, %x New = %cmp = icmp slt i8 %x, -43 IC: ADD: %cmp = icmp slt i8 %x, -43 IC: Visiting: %cmp = icmp slt i8 %x, -43 IC: Visiting: ret i1 %cmp If we create the swapped ICmp directly, we go faster: INSTCOMBINE ITERATION #1 on cmpnot IC: ADDING: 3 instrs to worklist IC: Visiting: %notx = xor i8 %x, -1 IC: Visiting: %cmp = icmp sgt i8 %notx, 42 IC: Old = %cmp = icmp sgt i8 %notx, 42 New = <badref> = icmp slt i8 %x, -43 IC: ADD: %cmp = icmp slt i8 %x, -43 IC: ERASE %1 = icmp sgt i8 %notx, 42 IC: ADD: %notx = xor i8 %x, -1 IC: DCE: %notx = xor i8 %x, -1 IC: ERASE %notx = xor i8 %x, -1 IC: Visiting: %cmp = icmp slt i8 %x, -43 IC: Visiting: ret i1 %cmp llvm-svn: 304558	2017-06-02 16:11:14 +00:00
Amaury Sechet	5746e7356a	Update select.ll expected results. NFC llvm-svn: 304557	2017-06-02 16:07:43 +00:00
Sanjay Patel	630a524e8d	[InstCombine] fix/add tests for icmp with not ops; NFC The existing test was not minimal, and there was no coverage for the variants with a constant or vector types. llvm-svn: 304555	2017-06-02 15:35:45 +00:00
Alexander Timofeev	3f70b619a9	AMDGPUAnnotateUniformValue should always treat volatile loads as divergent llvm-svn: 304554	2017-06-02 15:25:52 +00:00
Geoff Berry	57d8a417e7	[AArch64][Falkor] Model immediate forwarding. llvm-svn: 304552	2017-06-02 14:27:41 +00:00
Mark Searles	70359ac60d	[AMDGPU] Turn on the new waitcnt insertion pass. Adjust tests. -enable-si-insert-waitcnts=1 becomes the default -enable-si-insert-waitcnts=0 to use old pass Differential Revision: https://reviews.llvm.org/D33730 llvm-svn: 304551	2017-06-02 14:19:25 +00:00
Zoran Jovanovic	2aae0649a1	[mips][microMIPS] Extending size reduction pass with LBU16, LHU16, SB16 and SH16 Author: milena.vujosevic.janicic Reviewers: sdardis The patch extends size reduction pass for MicroMIPS. The following instructions are examined and transformed, if possible: LBU instruction is transformed into 16-bit instruction LBU16 LHU instruction is transformed into 16-bit instruction LHU16 SB instruction is transformed into 16-bit instruction SB16 SH instruction is transformed into 16-bit instruction SH16 Differential Revision: https://reviews.llvm.org/D33091 llvm-svn: 304550	2017-06-02 14:14:21 +00:00
Krzysztof Parzyszek	066e8b56a0	[Hexagon] Return 0 from getDotNewPredOp when .new opcode does not exist This allows using this function to test if an instruction can be converted to a .new form. llvm-svn: 304549	2017-06-02 14:07:06 +00:00
Amaury Sechet	2e1fed9ef8	Regenerate sse3.ll test results. NFC llvm-svn: 304548	2017-06-02 14:02:49 +00:00
Amaury Sechet	8e370f14cb	Regenerate and-sink.ll test results. NFC llvm-svn: 304547	2017-06-02 14:02:46 +00:00
Amaury Sechet	f0c066f140	Regenerate shrink-compare.ll test results. NFC llvm-svn: 304546	2017-06-02 14:02:43 +00:00
Benjamin Kramer	c1f5ae236c	[OrderedBasicBlock] Return false for comesBefore(A, A) So far it would return true for the first uncached query, then cached queries return false. llvm-svn: 304545	2017-06-02 13:10:31 +00:00
Alex Lorenz	9e39013941	[lit][macOS] Add a utility function to find the platform SDK version on macOS This function will be used to tie Clang's Integration tests to a particular SDK version. See https://reviews.llvm.org/D32178 for more context. llvm-svn: 304541	2017-06-02 11:21:37 +00:00
Benjamin Kramer	19092d783c	[X86] Don't fold memory operands into insertps in the generated folding tables. insertps behaves differently, the register form selects from an input register based on the immediate operand while the memory form just loads the given address. We have custom code to change the immediate in cases where that's legal, so completely remove insertps from the generated tables. llvm-svn: 304540	2017-06-02 10:50:22 +00:00
John Brawn	6671616cde	[GlobalMerge] Don't merge globals that may be preempted When a global may be preempted it needs to be accessed directly, instead of indirectly through a MergedGlobals symbol, for the preemption to work. This fixes PR33136. Differential Revision: https://reviews.llvm.org/D33727 llvm-svn: 304537	2017-06-02 10:24:14 +00:00
Diana Picus	e7aa90987d	[ARM] GlobalISel: Support struct params/returns Very very similar to the support for arrays. As with arrays, we don't support returning large structs that wouldn't fit in R0-R3. Most front-ends would likely use sret arguments for that anyway. The only significant difference is that when splitting a struct, we need to make sure we set the correct original alignment on each member, otherwise it may get split incorrectly between stack and registers. llvm-svn: 304536	2017-06-02 10:16:48 +00:00
Amaury Sechet	437f7060fe	nits in TargetLowering.cpp . NFC llvm-svn: 304532	2017-06-02 09:18:18 +00:00
Javed Absar	4ae7e81233	[ARM] Cortex-A57 scheduling model for ARM backend (AArch32) This patch implements the Cortex-A57 scheduling model. The main code is in ARMScheduleA57.td, ARMScheduleA57WriteRes.td. Small changes in .cpp and .h files to support required scheduling predicates. Scheduling model implemented according to: http://infocenter.arm.com/help/topic/com.arm.doc.uan0015b/Cortex_A57_Software_Optimization_Guide_external.pdf. Patch by: Andrew Zhogin (submitted on his behalf, as requested). Reviewed by: Renato Golin, Diana Picus, Javed Absar, Kristof Beyls. Differential Revision: https://reviews.llvm.org/D28152 llvm-svn: 304530	2017-06-02 08:53:19 +00:00
Amaury Sechet	9a6fdc0bd5	Specify triple for xor-icmp.ll . llvm-svn: 304526	2017-06-02 07:45:22 +00:00
Amaury Sechet	968dda7f81	Regenerate expectations for xor-icmp.ll . NFC llvm-svn: 304525	2017-06-02 07:25:02 +00:00
Max Kazantsev	4d8748a987	[SelectionDAG] Get rid of recursion in findNonImmUse The recursive implementation of findNonImmUse may overflow stack on extremely long use chains. This patch replaces it with an equivalent iterative implementation. Reviewed By: bogner Differential Revision: https://reviews.llvm.org/D33775 llvm-svn: 304522	2017-06-02 07:11:00 +00:00
Craig Topper	694e8a0d2f	[TableGen] Remove a couple unused methods from Record that take a StringRef argument. NFC We also have a version that takes an Init* that are used. llvm-svn: 304521	2017-06-02 05:56:47 +00:00
Gor Nishanov	053d2d24f7	[coroutines] PR33271: Remove stray coro.save intrinsics during CoroSplit Summary: Optimization passes may remove llvm.coro.suspend intrinsic while leaving matching llvm.coro.save intrinsic orphaned. Make sure we clean up orphaned coro.saves. The bug manifested with a crash similar to this: ``` llvm_unreachable("Unknown type!"); llvm::MVT::getVT (Ty=0x489518, HandleUnknown=false) llvm::EVT::getEVT llvm::TargetLoweringBase::getValueType llvm::ComputeValueVTs llvm::SelectionDAGBuilder::visitTargetIntrinsic ``` Reviewers: GorNishanov Subscribers: EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D33817 llvm-svn: 304518	2017-06-02 02:18:36 +00:00
Xinliang David Li	621e8dcf1f	[Profile] Enhance expect lowering to handle correlated branches builtin_expect applied on && or || expressions were not handled properly before. With this patch, the problem is fixed. Differential Revision: http://reviews.llvm.org/D33164 llvm-svn: 304517	2017-06-02 02:09:31 +00:00
Teresa Johnson	7a27b132a8	[ThinLTO] Efficiency improvement when writing module path string table Summary: When writing the combined index, we are walking the entire module path StringMap in the full index, and checking whether each one should be included in the index being written. For distributed backends, where we write an individual combined index for each file, each with only a few module paths, this is incredibly inefficient. Add a method that takes a callback and hides the details of whether we are writing the full combined index, or just a slice, and in the latter case it walks the set of modules to include instead of the entire index. For a huge application with around 23K files (i.e. where we were iterating through the 23K-entry modulePath StringMap 23K times), this change improved the thin link time by a whopping 48%. Reviewers: pcc Subscribers: Prazek, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D33813 llvm-svn: 304516	2017-06-02 01:56:02 +00:00
Philip Reames	ae80045deb	[RS4GC] Comment clarification llvm-svn: 304514	2017-06-02 01:52:06 +00:00
Jacob Gravelle	26115924a2	Revert r304117 - WebAssembly object format isn't ready to be the default Summary: Wasm object format has some functionality regressions from the ELF format, and doesn't play nicely with the rest of the toolchain. It should eventually be the default, but not yet. Reviewers: sunfish, sbc100 Subscribers: jfb, dschuff, llvm-commits Differential Revision: https://reviews.llvm.org/D33811 llvm-svn: 304512	2017-06-02 01:26:17 +00:00
Sam Clegg	c38e947e50	[WebAssembly] MC: Fix references to undefined externals in data section Undefined externals don't need to have a size or an offset. This was broken by r303915. Added a test for this case. This fixes the "Compile LLVM Torture (o)" step on the wasm waterfall. Differential Revision: https://reviews.llvm.org/D33803 llvm-svn: 304505	2017-06-02 01:05:24 +00:00
Mandeep Singh Grang	fce1f464ac	[PredicateInfo] Enable -reverse-iterate tests only for +Asserts builds Summary: The flag -reverse-iterate is present only on +Asserts builds. Reviewers: dberlin, davide, RKSimon, efriedma, chapuni Reviewed By: efriedma, chapuni Subscribers: chapuni, llvm-commits Differential Revision: https://reviews.llvm.org/D33795 llvm-svn: 304498	2017-06-01 23:52:59 +00:00
Davide Italiano	1dd5558e52	[PM] GVNSink is off by default, fix an obvious typo. llvm-svn: 304497	2017-06-01 23:47:53 +00:00
Eugene Zelenko	7ea692373c	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 304495	2017-06-01 23:25:02 +00:00
Zachary Turner	afb81a83a9	Fix 2 more -Wreorder warnings. llvm-svn: 304494	2017-06-01 23:24:50 +00:00
Tim Shen	4e912aa5af	[ThinLTO] Move -lto-use-new-pm to llvm-lto2, and change it to -use-new-pm. Summary: As we teach Clang to use ThinLTO + new PM, it's good for the users to inject through Config, instead of setting a flag in the LTOBackend library. Move the flag to llvm-lto2. As it moves to llvm-lto2, a new name -use-new-pm seems simpler and as clear. Reviewers: davide, tejohnson Subscribers: mehdi_amini, Prazek, inglorion, eraman, chandlerc, llvm-commits Differential Revision: https://reviews.llvm.org/D33799 llvm-svn: 304492	2017-06-01 23:13:44 +00:00
Davide Italiano	c368831580	Move GVNHoist to the right position in the new pass manager pipeline. GVNHoist was moved as part of simplification passes for the current pass manager (but not for the new), so they're out-of-sync. Differential Revision: https://reviews.llvm.org/D33806 llvm-svn: 304490	2017-06-01 23:08:14 +00:00
Xinliang David Li	d6cfba2a02	Fix compiler_rt buildbot failure llvm-svn: 304489	2017-06-01 23:05:11 +00:00
Keno Fischer	fa635d730f	Reapply "[Cloning] Take another pass at properly cloning debug info" This was rL304226, reverted in 304228 due to a clang assertion failure on the build bots. That problem should have been addressed by clang commit rL304470. llvm-svn: 304488	2017-06-01 23:02:12 +00:00
Zachary Turner	86d25b12f8	Fix -Wreorder warnings. llvm-svn: 304485	2017-06-01 22:03:17 +00:00
Zachary Turner	ebd3ae8371	[CodeView] Properly align symbol records on read/write. Object files have symbol records not aligned to any particular boundary (e.g. 1-byte aligned), while PDB files have symbol records padded to 4-byte aligned boundaries. Since they share the same reading / writing code, we have to provide an option to specify the alignment and propagate it up to the producer or consumer who knows what the alignment is supposed to be for the given container type. Added a test for this by modifying the existing PDB -> YAML -> PDB round-tripping code to round trip symbol records as well as types. Differential Revision: https://reviews.llvm.org/D33785 llvm-svn: 304484	2017-06-01 21:52:41 +00:00
Yaxun Liu	a618acf923	[AMDGPU] Fix kernel arg segment size for amdgizcl Differential Revision: https://reviews.llvm.org/D33307 llvm-svn: 304482	2017-06-01 21:31:53 +00:00
Eli Friedman	0d823d610d	Add opt-bisect support for region passes. This is necessary to get opt-bisect working with polly. Differential Revision: https://reviews.llvm.org/D33751 llvm-svn: 304476	2017-06-01 21:22:26 +00:00
Craig Topper	5ea2d55e1c	[InstSimplify][ConstantFolding] Add test demonstrating failure to simplify (icmp eq null, inttoptr x) when the null is on the left hand side. NFC llvm-svn: 304474	2017-06-01 21:20:07 +00:00
Adrian Prantl	d9cd4d52e3	DbgValueHistoryCalculator: Ignore call instructions that claim to clobber SP. The AArch64 backend marks calls that involve aggregate function arguments as having an implicit def of SP. We already have the same workaround in LiveDebugValues and in DbgValueHistoryCalculator for SP clobbers in register masks. This adds register defs to the list. Fixes rdar://problem/30361929 and Swift SR-3851. llvm-svn: 304471	2017-06-01 21:14:58 +00:00
Teresa Johnson	596b2e7ab2	[PGO] Adjust indirect call promotion threshold Summary: Reduce min percent required for indirect call promotion from 33% to 30%, which matches gcc's threshold and catches the same hot opportunities. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33798 llvm-svn: 304469	2017-06-01 21:10:10 +00:00
Keno Fischer	189a811a8e	[llvm-config] Don't use PATH_MAX It doesn't exist on Windows. The number we use here doesn't really matter, the storage will expand automatically but 256 seems like a reasonable default. Should fix windows buildbots that complained about rL304458. llvm-svn: 304468	2017-06-01 20:51:55 +00:00
Keno Fischer	3cdd4935cd	[DIBuilder] Add a more fine-grained finalization method Summary: Clang wants to clone a function before it is done building the entire compilation unit. As of now, there is no good way to do that, because CloneFunction doesn't like dealing with temporary metadata. However, as long as clang doesn't want to add any variables to this SP, it should be fine to just prematurely finalize it. Add an API to allow this. This is done in preparation of a clang commit to fix the assertion that necessitated the revert of D33655. Reviewers: aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33704 llvm-svn: 304467	2017-06-01 20:42:44 +00:00
Evgeniy Stepanov	56584bbf16	(NFC) Track global summary liveness in GVFlags. Replace GVFlags::LiveRoot with GVFlags::Live and use that instead of all the DeadSymbols sets. This is refactoring in order to make liveness information available in the RegularLTO pipeline. llvm-svn: 304466	2017-06-01 20:30:06 +00:00
Nirav Dave	4952871630	[SDAG] Fix CombineTo ordering in visitZERO_EXTEND and visitSIGN_EXTEND Reorder CombineTo Calls to prevent references to stale/deleted SDNodes which caused undue assertions. Reviewers: dbabokin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D31625 llvm-svn: 304460	2017-06-01 19:33:50 +00:00
Keno Fischer	532a9e888a	[llvm-config] Report --bindir based on LLVM_TOOLS_INSTALL_DIR Summary: `LLVM_TOOLS_INSTALL_DIR` was introduced in r272200 in order to override the directory name into which to install LLVM's executable. However, `llvm-config --bindir` still reported `$PREFIX/bin` independent of what LLVM_TOOLS_INSTALL_DIR was set to. This fixes the out-of-tree clang standalone build for me. Reviewers: beanz, tstellar Reviewed By: tstellar Subscribers: chapuni, tstellar, llvm-commits Differential Revision: https://reviews.llvm.org/D22499 llvm-svn: 304458	2017-06-01 19:20:33 +00:00
David Blaikie	b762f689b9	Prefer static namespace-scoped variables over anon namespacing per style guide Also for consistency with the immediately preceding variable definition. llvm-svn: 304457	2017-06-01 19:20:26 +00:00
Haicheng Wu	bf277f38ad	[InlineCost] Add a test case for GEP cost The added test case is to check whether the simplified value is passed to getGEPCost(). Differential Revision: https://reviews.llvm.org/D33779 llvm-svn: 304454	2017-06-01 19:06:07 +00:00
Xinliang David Li	ee8d6acb1f	[Profile] Fix builtin_expect lowering bug The lowerer wrongly assumes the ICMP instruction 1) always has a constant operand; 2) the operand has value 0. It also assumes the expected value can only be one, thus other values other than one will be considered 'zero'. This leads to wrong profile annotation when other integer values are used other than 0, 1 in the comparison or in the expect intrinsic. Also missing is handling of equal predicate. This patch fixes all the above problems. Differential Revision: http://reviews.llvm.org/D33757 llvm-svn: 304453	2017-06-01 19:05:55 +00:00
Xinliang David Li	0a0acbcf78	[PartialInlining] Emit branch info and profile data as remarks This allows us to collect profile statistics to tune static branch prediction. Differential Revision: http://reviews.llvm.org/D33746 llvm-svn: 304452	2017-06-01 18:58:50 +00:00
Mandeep Singh Grang	33a1b73600	[PredicateInfo] Fix non-determinism in codegen uncovered by reverse iterating SmallPtrSet Summary: Sort OpsToRename before iterating to make iteration order deterministic. Thanks to Daniel Berlin for the sorting logic. Reviewers: dberlin, RKSimon, efriedma, davide Reviewed By: dberlin, davide Subscribers: sanjoy, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D33265 llvm-svn: 304447	2017-06-01 18:36:24 +00:00
Adrian Prantl	f4bc1f77b7	[DWARF] Introduce Dump Options This commit introduces a structure that holds all the flags that control the pretty printing of dwarf output. Patch by Spyridoula Gravani! Differential Revision: https://reviews.llvm.org/D33749 llvm-svn: 304446	2017-06-01 18:18:23 +00:00
Krzysztof Parzyszek	3cf16576d5	[Hexagon] Fix dependence check in the packetizer An incorrect check in the packetizer led to an attempt to convert an unconditional branch to a .new (conditional) form. llvm-svn: 304442	2017-06-01 18:02:40 +00:00
Krzysztof Parzyszek	51fd5405d5	[Hexagon] Handle long-running simplification loop in idiom recognition The initial assumption was that the simplification would converge to a fixed point relatively quickly. Turns out that there are legitimate situations where the complexity of the code causes it to take a large number of iterations. Two main changes: - Instead of aborting upon hitting the limit, simply return nullptr. - Reduce the limit to 10,000 from 100,000. llvm-svn: 304441	2017-06-01 18:00:47 +00:00
Amaury Sechet	2adb7bdbca	Remove ADDC, ADDE, SUBC, SUBE and SETCCE support from the X86 backend, use the CARRY ops instead. Summary: As per title. This cleans up some technical debt. Depends on D33374 Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33390 llvm-svn: 304435	2017-06-01 16:33:08 +00:00
Matt Arsenault	3416b8c874	AMDGPU: Remove error on call in AsmPrinter Partial revert of r301938 which is making it harder to split patches up. llvm-svn: 304418	2017-06-01 15:05:15 +00:00
Matt Arsenault	b083570532	DAG: Remove pointless type check These are only integer operations. llvm-svn: 304417	2017-06-01 14:49:46 +00:00
Matt Arsenault	50f43e4168	AMDGPU: Set high getCSRFirstUseCost llvm-svn: 304416	2017-06-01 14:38:02 +00:00
Amaury Sechet	94eb633dd2	Fix addcarry-crash.ll llvm-svn: 304415	2017-06-01 14:24:31 +00:00
Amaury Sechet	b761959993	Add regression test for the addcarry crash. See D33770 for context. llvm-svn: 304414	2017-06-01 14:09:56 +00:00
Florian Hahn	fca7b8348f	[ARM] Create relocations for Thumb functions calling ARM fns in ELF. Summary: Without using a fixup in this case, BL will be used instead of BLX to call internal ARM functions from Thumb functions. Reviewers: rafael, t.p.northover, peter.smith, kristof.beyls Reviewed By: peter.smith Subscribers: srhines, echristo, aemerson, rengolin, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33436 llvm-svn: 304413	2017-06-01 13:50:57 +00:00
Kamil Rytarowski	07c81b1856	[Solaris] Fix PR33228 - llvm::sys::fs::is_local_impl done right Summary: Solaris-specific implementation for llvm::sys::fs::is_local_impl. FStype pattern matching might be a bit unreliable, but at least it fixes the build failure. Reviewers: mgorny, nlopes, llvm-commits, krytarowski Reviewed By: krytarowski Subscribers: voskresensky.vladimir, krytarowski Differential Revision: https://reviews.llvm.org/D33695 llvm-svn: 304412	2017-06-01 12:57:00 +00:00
Amaury Sechet	c84cc230b3	Only generate addcarry node when it is legal. Summary: This is a problem uncovered by stage2 testing. ADDCARRY ends up being generated on targets that do not support it. The patch that introduced the problem has other patches layered on top of it, so we want to fix the issue rather than revert it to avoid creating a lot of churn. A regression test will be added shortly, but this is committed as is in order to get the build back to green promptly. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33770 llvm-svn: 304409	2017-06-01 12:03:16 +00:00
Chandler Carruth	8b3be4e59d	[PM/ThinLTO] Port the ThinLTO pipeline (both components) to the new PM. Based on the original patch by Davide, but I've adjusted the API exposed to just be different entry points rather than exposing more state parameters. I've factored all the common logic out so that we don't have any duplicate pipelines, we just stitch them together in different ways. I think this makes the build easier to reason about and understand. This adds a direct method for getting the module simplification pipeline as well as a method to get the optimization pipeline. While not my express goal, this seems nice and gives a good place to comment about the restrictions that are imposed on them. I did make some minor changes to the way the pipelines are structured here, but hopefully not ones that are significant or controversial: 1) I sunk the PGO indirect call promotion to only be run when we have PGO enabled (or as part of the special ThinLTO pipeline). 2) I made the extra GlobalOpt run in ThinLTO just happen all the time and at a slightly more powerful place (before we remove available externally functions). This seems like general goodness and not a big compile time sink, so it didn't make sense to only use it in ThinLTO. Fewer differences in the pipeline makes everything simpler IMO. 3) I hoisted the ThinLTO stop point pre-link above the RPO function attr inference. The RPO inference won't infer anything terribly meaningful pre-link (recursiveness?) so it didn't make a lot of sense. But if the placement of RPO inference starts to matter, we should move it to the canonicalization phase anyways which seems like a better place for it (and there is a FIXME to this effect!). But that seemed a bridge too far for this patch. If we ever need to parameterize these pipelines more heavily, we can always sink the logic to helper functions with parameters to keep those parameters out of the public API. 
But the changes above seemed minor enough that we could possibly get away without the parameters entirely. I added support for parsing 'thinlto' and 'thinlto-pre-link' names in pass pipelines to make it easy to test these routines and play with them in larger pipelines. I also added a really basic manifest of passes test that will show exactly how the pipelines behave and work as well as making updates to them clear. Lastly, this factoring does introduce a nesting layer of module pass managers in the default pipeline. I don't think this is a big deal and the flexibility of decoupling the pipelines seems easily worth it. Differential Revision: https://reviews.llvm.org/D33540 llvm-svn: 304407	2017-06-01 11:39:39 +00:00
Zvi Rackover	7693733e80	[X86] Match bitcast of vxi1 to pmovmsk Summary: Add an early combine to match patterns such as: (i16 bitcast (v16i1 x)) -> (i16 movmsk (v16i8 sext (v16i1 x))) This combine needs to happen early enough before type-legalization scalarizes the result of the setcc. Reviewers: igorb, craig.topper, RKSimon Subscribers: delena, llvm-commits Differential Revision: https://reviews.llvm.org/D33311 llvm-svn: 304406	2017-06-01 11:27:57 +00:00
Amaury Sechet	251ea8a4f8	Do not legalize large setcc with setcce, introduce setcccarry and do it with usubo/setcccarry. Summary: This is a continuation of the work started in D29872. Passing the carry down as a value rather than as a glue allows for further optimizations. Introducing setcccarry makes the use of addc/subc unnecessary and we can start the removal process. This patch only introduces the optimizations strictly required to get the same level of optimization as was available before, nothing more. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33374 llvm-svn: 304404	2017-06-01 11:14:17 +00:00
Amaury Sechet	6506a90a70	Remove ISD::SETCC match from combineX86ADD. It's done improperly and doesn't work. llvm-svn: 304403	2017-06-01 11:13:10 +00:00
Amaury Sechet	9c5d1e966b	[DAGCombine] Refactor common addcarry pattern. Summary: This pattern is not very useful per se, but it exposes optimizations for other patterns that wouldn't kick in otherwise. It's very common and worth optimizing for. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32756 llvm-svn: 304402	2017-06-01 10:48:04 +00:00
Amaury Sechet	2e43cb6d03	[DAGCombine] (add/uaddo X, Carry) -> (addcarry X, 0, Carry) Summary: This enables further transforms. Depends on D32916 Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32925 llvm-svn: 304401	2017-06-01 10:42:39 +00:00
Kristof Beyls	955cc13bd1	Make mcpu=generic the default for armv7-a and armv8-a. As discussed in http://lists.llvm.org/pipermail/llvm-dev/2017-May/113525.html llvm-svn: 304390	2017-06-01 07:31:43 +00:00
Craig Topper	f441226085	[TableGen] Remove RecordVal constructor that takes a StringRef and Record::setName(StringRef). Leave just the versions that take an Init. They weren't used often enough to justify having two different interfaces. Push the responsibility of creating a StringInit up to the caller. llvm-svn: 304388	2017-06-01 06:56:16 +00:00
Craig Topper	c05a1032e9	[TableGen] Remove code for renaming anonymous register classes as it can never execute. It tried to detect 9 letters (the length of anonymous) followed by a period. But anonymous classes start with "anonymous_" rather than "anonymous." these days. llvm-svn: 304387	2017-06-01 06:56:13 +00:00
Craig Topper	ebe46f6c6f	[TableGen] Use StringRef to capture getValueAsString in a couple more places. NFC llvm-svn: 304386	2017-06-01 06:56:11 +00:00
Tim Shen	6b41141863	[ThinLTO] Migrate ThinLTOBitcodeWriter to the new PM. Summary: Also see D33429 for other ThinLTO + New PM related changes. Reviewers: davide, chandlerc, tejohnson Subscribers: mehdi_amini, Prazek, cfe-commits, inglorion, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D33525 llvm-svn: 304378	2017-06-01 01:02:12 +00:00
Xinliang David Li	32c5e809be	[PartialInlining] Reduce outlining overhead by removing unneeded live-out(s) Differential Revision: http://reviews.llvm.org/D33694 llvm-svn: 304375	2017-06-01 00:12:41 +00:00
Dehao Chen	6b737ddce7	Add LiveRangeShrink pass to shrink live range within BB. Summary: LiveRangeShrink pass moves instruction right after the definition with the same BB if the instruction and its operands all have more than one use. This pass is inexpensive and guarantees optimal live-range within BB. Reviewers: davidxl, wmi, hfinkel, MatzeB, andreadb Reviewed By: MatzeB, andreadb Subscribers: hiraditya, jyknight, sanjoy, skatkov, gberry, jholewinski, qcolombet, javed.absar, krytarowski, atrick, spatel, RKSimon, andreadb, MatzeB, mehdi_amini, mgorny, efriedma, davide, dberlin, llvm-commits Differential Revision: https://reviews.llvm.org/D32563 llvm-svn: 304371	2017-05-31 23:25:25 +00:00
Eli Friedman	157dc73479	[docs] Update name of vectorization interleave flag. llvm-svn: 304370	2017-05-31 23:02:55 +00:00
Reid Kleckner	fc7ba565ed	[EH] Recognize __(gxx|gcc)_personality_seh0 as the GNU EH personalities These are no-ops when there are no invokes. We don't need to emit LSDAs for them. Fixes PR33220. llvm-svn: 304367	2017-05-31 22:35:52 +00:00
Matthias Braun	605f779516	ImplicitNullChecks: Clear kill/dead flags when moving instructions around The values are marked as livein in the successor blocks so marking them as killed or dead was wrong. llvm-svn: 304366	2017-05-31 22:23:08 +00:00
Reid Kleckner	57ac61e005	Check hasPersonalityFn before calling getPersonalityFn llvm-svn: 304365	2017-05-31 22:21:20 +00:00
Reid Kleckner	c2f1bbfe4f	[EH] Fix the LSDA that we emit for unknown EH personalities We should have a single call site entry with no landing pad. This indicates that no EH action should be taken and the unwinder should unwind to the next frame. We currently don't recognize __gxx_personality_seh0 as a known personality, so we forcibly emit a table, and that table was wrong. This was filed as PR33220. Now we emit a correct table for that personality. The next step is to recognize that we can completely skip the table for this personality. llvm-svn: 304363	2017-05-31 22:18:49 +00:00
Steven Wu	97e2cf87e1	[MachOObject] Fix bind opcode parser error on valid opcode sequence BIND_OPCODE_SET_DYLIB_SPECIAL_IMM(0) is a valid way to set up the library ordinal. MachOObject should set LibraryOrdinalSet even when IMM is zero. llvm-svn: 304362	2017-05-31 22:17:43 +00:00
Galina Kistanova	244621faad	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304361	2017-05-31 22:16:24 +00:00
Galina Kistanova	8514dd540d	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304358	2017-05-31 22:09:46 +00:00
Galina Kistanova	0b69e363f6	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304356	2017-05-31 22:02:05 +00:00
Galina Kistanova	c752c4bf56	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304355	2017-05-31 21:50:45 +00:00
Vedant Kumar	877e3cefb8	Avoid a UB pointer overflow in the ArrayRef unit test The intent of the test is to check that array lengths greater than UINT_MAX work properly. Change the test to stress that scenario, without triggering pointer overflow UB. Caught by a WIP pointer overflow checker in clang. Differential Revision: https://reviews.llvm.org/D33149 llvm-svn: 304353	2017-05-31 21:47:52 +00:00
Wei Mi	0bd3f41588	Revert rL304050. It may break sanitizer bootstrap. Revert it for now while investigating. llvm-svn: 304350	2017-05-31 21:29:33 +00:00
Matthias Braun	e2e65911a2	Try to fix buildbots It seems not all of our bots have a std::vector::erase() taking a const_iterator (even though that seems to be part of C++11); attempt to work around it. llvm-svn: 304349	2017-05-31 21:25:03 +00:00
Craig Topper	bcd3c37f4a	[TableGen] Adapt more places to getValueAsString now returning a StringRef instead of a std::string. llvm-svn: 304347	2017-05-31 21:12:46 +00:00
Matthias Braun	ac4beccaca	X86FloatingPoint: Fix livein lists After transforming FP to ST registers: - Do not add the ST register to the livein lists, they are reserved so we do not need to track their liveness. - Remove the FP registers from the livein lists, they don't have defs or uses anymore and so are not live. - (The setKillFlags() call is moved to an earlier place as it relies on the FP registers still being present in the livein list.) llvm-svn: 304342	2017-05-31 20:30:22 +00:00
Matthias Braun	43692a2245	X86FloatingPoint: Add some static assert, cleanup; NFC llvm-svn: 304341	2017-05-31 20:30:17 +00:00
Galina Kistanova	c2b642d009	Added missing break; added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304340	2017-05-31 20:25:13 +00:00
Kostya Serebryany	2e98c045cb	[libFuzzer] fix a test to match the new sanitizer run-time llvm-svn: 304333	2017-05-31 19:47:11 +00:00
Galina Kistanova	b2c0116e71	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304332	2017-05-31 19:41:33 +00:00
Reid Kleckner	5fbdd17714	[IR] Add additional addParamAttr/removeParamAttr to AttributeList API Summary: Fairly straightforward patch to fill in some of the holes in the attributes API with respect to accessing parameter/argument attributes. The patch aims to step further towards encapsulating the idx+FirstArgIndex pattern to access these attributes to within the AttributeList. Patch by Daniel Neilson! Reviewers: rnk, chandlerc, pete, javed.absar, reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33355 llvm-svn: 304329	2017-05-31 19:23:09 +00:00
Craig Topper	2b8419a22d	[TableGen] Make Record::getValueAsString and getValueAsListOfStrings return StringRefs instead of std::string Internally both these methods just return the result of getValue on either a StringInit or a CodeInit object. In both cases this returns a StringRef pointing to a string allocated in the BumpPtrAllocator so it's not going anywhere. So we can just pass that StringRef along. This is a fairly naive patch that targets just the build failures caused by this change. There's additional work that can be done to avoid creating std::string at call sites that still think getValueAsString returns a std::string. I'll try to clean those up in future patches. Differential Revision: https://reviews.llvm.org/D33710 llvm-svn: 304325	2017-05-31 19:01:11 +00:00
Craig Topper	fa5dc09292	[BPF] Correct the file name of the -gen-asm-matcher output file to not start with X86. llvm-svn: 304324	2017-05-31 19:01:05 +00:00
Teresa Johnson	a6a3fb57a1	[ThinLTO] Reduce unnecessary map lookups during combined summary write Summary: Don't assign values to undefined references, simply don't emit those reference edges as they are not useful (we were already not emitting call edges to undefined refs). Also, streamline the later lookup of value ids when writing the summaries, by combining the check for value id existence with the access of that value id. Reviewers: pcc Subscribers: Prazek, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D33634 llvm-svn: 304323	2017-05-31 18:58:11 +00:00
Nirav Dave	3424373f30	[ScheduleDAG] Deal with already scheduled loads in ScheduleDAG. Summary: If we attempt to unfold an SUnit in ScheduleDAG that results in finding an already scheduled load, we should abort the unfold as it will not improve scheduling. This fixes PR32610. Reviewers: jmolloy, sunfish, bogner, spatel Subscribers: llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D32911 llvm-svn: 304321	2017-05-31 18:43:17 +00:00
Matthias Braun	d6a36ae282	TargetMachine: Indicate whether machine verifier passes. This adds a callback to the LLVMTargetMachine that lets targets indicate that they do not pass the machine verifier checks in all cases yet. This is intended to be a temporary measure while the targets are fixed allowing us to enable the machine verifier by default with EXPENSIVE_CHECKS enabled! Differential Revision: https://reviews.llvm.org/D33696 llvm-svn: 304320	2017-05-31 18:41:23 +00:00
Kostya Serebryany	53b34c8443	[sanitizer-coverage] remove stale code (old coverage); llvm part llvm-svn: 304319	2017-05-31 18:27:33 +00:00
Sean Fertile	457ddd311a	[PowerPC] Correctly specify the cache line size for Power 7, 8 and 9. Fixes PPCTTIImpl::getCacheLineSize() returning the wrong cache line size for newer ppc processors. Committing on behalf of Stefan Pintilie. Differential Revision: https://reviews.llvm.org/D33656 llvm-svn: 304317	2017-05-31 18:20:17 +00:00
Anna Thomas	777bb90bdc	Revert "[Atomics][LoopIdiom] Recognize unordered atomic memcpy" This reverts commit r304310. It caused build failures in polly and mingw due to undefined reference to llvm::RTLIB::getMEMCPY_ELEMENT_ATOMIC. llvm-svn: 304315	2017-05-31 17:20:51 +00:00
Zaara Syeda	3a7578c658	[PPC] Inline expansion of memcmp This patch does an inline expansion of memcmp. It changes the memcmp library call into an inline expansion when the size is known at compile time and is under a target specified threshold. This expansion is implemented in CodeGenPrepare and expands into straight line code. The target specifies a maximum load size and the expansion works by using this size to load the two sources, compare, and exit early if a difference is found. It also has a special case when the memcmp result is used in a compare to zero equality. Differential Revision: https://reviews.llvm.org/D28637 llvm-svn: 304313	2017-05-31 17:12:38 +00:00
Galina Kistanova	6ad77845e2	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304312	2017-05-31 17:10:03 +00:00
Mark Searles	11d0a04050	[AMDGPU] Fix bugs in new waitcnt pass. Add test. - new waitcnt pass remains off by default; -enable-si-insert-waitcnts=1 to enable it - fix handling of PERMUTE ops - fix insertion of waitcnt instrs at function begin/end ( port of analogous code that was added to old waitcnt pass ) - add new test Differential Revision: https://reviews.llvm.org/D33114 llvm-svn: 304311	2017-05-31 16:44:23 +00:00
Anna Thomas	056c009f1b	[Atomics][LoopIdiom] Recognize unordered atomic memcpy Summary: Expanding the loop idiom test for memcpy to also recognize unordered atomic memcpy. The only difference for recognizing an unordered atomic memcpy instead of a normal memcpy is that the loads and/or stores involved are unordered atomic operations. Background: http://lists.llvm.org/pipermail/llvm-dev/2017-May/112779.html Patch by Daniel Neilson! Reviewers: reames, anna, skatkov Reviewed By: reames Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D33243 llvm-svn: 304310	2017-05-31 16:39:52 +00:00
Dmitry Preobrazhensky	793c592652	[AMDGPU][MC] New syntax for ds_swizzle_b32 offset See Bug 28601: https://bugs.llvm.org//show_bug.cgi?id=28601 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D33542 llvm-svn: 304309	2017-05-31 16:26:47 +00:00
Florian Hahn	ff25b6d8f6	[AArch64] Enable FeatureFuseAES on Cortex-A53. It improves performance on Cortex-A53. llvm-svn: 304307	2017-05-31 15:50:03 +00:00
Florian Hahn	064a2f9222	[AArch64] Enable FeatureFuseAES on Cortex-A73. It improves performance on Cortex-A73. llvm-svn: 304304	2017-05-31 15:25:25 +00:00
Reid Kleckner	1d7cbdfc3d	Fix assertion when merging multiple empty AttributeLists Patch by Nicholas Wilson! Differential Revision: https://reviews.llvm.org/D33627 llvm-svn: 304300	2017-05-31 14:24:06 +00:00
Nirav Dave	7c70fddba6	[DAG] Avoid use of stale store. Correct references to alignment of store which may be deleted in a previous iteration of merge. Instead use first store that would be merged. Corrects pr33172's use-after-poison caught by ASan. Reviewers: spatel, hfinkel, RKSimon Reviewed By: RKSimon Subscribers: thegameg, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33686 llvm-svn: 304299	2017-05-31 13:36:17 +00:00
Tony Jiang	60c247de18	[PowerPC] Fix a performance bug for PPC::XXPERMDI. There are some VectorShuffle Nodes in SDAG which can be selected to XXPERMDI Instruction, this patch recognizes them and does the selection to improve the PPC performance. Differential Revision: https://reviews.llvm.org/D33404 llvm-svn: 304298	2017-05-31 13:09:57 +00:00
Amaury Sechet	6a303a4e73	Regenerate xchg-nofold.ll expected results. NFC. llvm-svn: 304291	2017-05-31 09:44:08 +00:00
Nemanja Ivanovic	accab033c9	[PowerPC] Eliminate integer compare instructions - vol. 3 This patch builds upon https://reviews.llvm.org/rL302810 to add handling for the 64-bit SETEQ patterns. Differential Revision: https://reviews.llvm.org/D33369 llvm-svn: 304286	2017-05-31 08:04:07 +00:00
Dylan McKay	043fa4b3d6	[AVR] Fix a bug in shift operator lowering; Authored by Dr. Gergo Erdi When generating code for a shift loop, check the shift amount against the literal value 0, not R0 llvm-svn: 304284	2017-05-31 06:27:46 +00:00
Dylan McKay	48614d4a2c	[AVR] CPIRdK can only work with r16..r31; Authored by Dr. Gergo Erdi (https://github.com/avr-rust/rust/issues/50) llvm-svn: 304283	2017-05-31 06:10:59 +00:00
Nemanja Ivanovic	e597bd8230	[PowerPC] Eliminate integer compare instructions - vol. 2 This patch builds upon https://reviews.llvm.org/rL302810 to add handling for bitwise logical operations in general purpose registers. The idea is to keep the values in GPRs as long as possible - only extracting them to a condition register bit when no further operations are to be done. Differential Revision: https://reviews.llvm.org/D31851 llvm-svn: 304282	2017-05-31 05:40:25 +00:00
Craig Topper	16942c2cb2	[TableGen] Implement non-const versions of Record::getValue by delegating to the const versions to avoid duplicate code. NFC llvm-svn: 304281	2017-05-31 05:12:36 +00:00
Craig Topper	01197f686f	[TableGen] Make one of RecordVal's constructors delegate to the other to reduce duplicate code. llvm-svn: 304280	2017-05-31 05:12:33 +00:00
Zachary Turner	1b88f4f33a	[ObjectYAML] Split CodeViewYAML into 3 pieces. The code was a mess and disorganized due to the sheer amount of it being in one file. So I'm splitting this into three files. One for CodeView types, one for CodeView symbols, and one for CodeView debug subsections. NFC. llvm-svn: 304278	2017-05-31 04:17:13 +00:00
Gor Nishanov	2bc782d8da	[coroutines] Call initializePass in coroutine pass constructors Summary: Fixes: https://bugs.llvm.org/show_bug.cgi?id=33226 Reviewers: chandlerc, davide, majnemer, dblaikie Reviewed By: chandlerc Subscribers: EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D33701 llvm-svn: 304277	2017-05-31 03:12:42 +00:00
George Burgess IV	0a7b989036	[CFLAA] Add missing break; note things are broken. Thanks to Galina Kistanova for finding the missing break! When trying to make a test for this, I realized our logic for handling extractvalue/insertvalue/... is somewhat broken. This makes constructing a test-case for this missing break nontrivial. llvm-svn: 304275	2017-05-31 02:35:26 +00:00
Matthias Braun	bcd4c68233	X86FrameLowering: No need to mark FP as live-in everywhere The frame pointer (when used as frame pointer) is a reserved register. We do not track liveness of reserved registers and hence do not need to add them to the basic block livein lists. llvm-svn: 304274	2017-05-31 02:11:10 +00:00
Galina Kistanova	49b6023095	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304273	2017-05-31 01:54:18 +00:00
Daniel Berlin	be3e7ba45e	NewGVN: Fix PR 33185 by checking whether we need to recursively generate a phi of ops, which we don't currently support. llvm-svn: 304272	2017-05-31 01:47:32 +00:00
Daniel Berlin	9ceafe267b	Fix test that wasn't update_test_check'd llvm-svn: 304271	2017-05-31 01:47:29 +00:00
Daniel Berlin	71ff663e1b	InstructionSimplify: Remove now-redundant reachability tests, as dominates() already does them llvm-svn: 304270	2017-05-31 01:47:24 +00:00
Vedant Kumar	b745804bb1	Mark a test as requiring a default triple This test assumes that llc can infer a default triple. I'm not sure why exactly, but the Verify MachineInstrs bot requires tests to be explicit about this dependency. This commit follows the lead from r248452 and adds in 'REQUIRES: default_triple' to omit-empty.ll. Bot URL: http://lab.llvm.org:8080/green/job/Verify-Machineinstrs_AArch64/7500 llvm-svn: 304269	2017-05-31 01:42:55 +00:00
Galina Kistanova	9ee35cf57b	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304268	2017-05-31 01:33:39 +00:00
Matthias Braun	05eeadbfd1	ARM: Fix cmpxchg O0 expansion This is the equivalent of r304048 for ARM: - Rewrite livein calculation to use the computeLiveIns() helper function. This is slightly less efficient but easier to reason about and doesn't unnecessarily add pristine and reserved registers[1] - Zero the status register at the beginning of the loop to make sure it has a defined value. - Remove kill flags of values that need to stay alive throughout the loop. [1] An upcoming commit of mine will tighten the MachineVerifier to catch these. llvm-svn: 304267	2017-05-31 01:21:35 +00:00
Matthias Braun	0dba4e3509	ARM: Do not add reserved registers to block livein lists; NFC llvm-svn: 304266	2017-05-31 01:21:30 +00:00
Eugene Zelenko	4e9736b1c9	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 304265	2017-05-31 01:10:10 +00:00
Vedant Kumar	bf154dd27f	Fix CodeView-related modules build failures post-r304248 llvm-svn: 304264	2017-05-31 01:08:43 +00:00
Zachary Turner	083342bd34	[ObjectYAML] Clean up the CodeView headers a bit. CodeViewYAML.h attempts to hide the details of many of the CodeView yaml structures and types, but at the same time it exposes the mapping traits for them to external users of the header. This patch just hides these in the implementation files so that the interface is kept as simple as possible. llvm-svn: 304263	2017-05-31 01:08:36 +00:00
Alina Sbirlea	032b5bdf2b	Fix misspelling llvm-svn: 304262	2017-05-31 01:00:51 +00:00
Abderrazek Zaafrani	855411566b	Add latency info for Exynos interleaved Load/Store instructions. llvm-svn: 304259	2017-05-31 00:20:55 +00:00
Zachary Turner	7a75bc05b7	Try to fix build again. llvm-svn: 304257	2017-05-30 23:57:46 +00:00
Zachary Turner	1e4d3693c4	[CodeView] Move CodeView symbol yaml logic to ObjectYAML. This continues the effort to get the CodeView YAML parsing logic into ObjectYAML. After this patch, the only missing piece will be the CodeView debug symbol subsections. llvm-svn: 304256	2017-05-30 23:50:44 +00:00
Eric Beckmann	025e82bac1	Fix bug on Big-Endian system, due to reference to vector out of scope. llvm-svn: 304255	2017-05-30 23:10:57 +00:00
Matthias Braun	bc09894d6a	MachineInstr: Do not skip dead def operands when printing. This was introduced a long time ago in r86583 when regmask operands didn't exist. Nowadays the behavior hurts more than it helps. This removes it. llvm-svn: 304254	2017-05-30 23:09:21 +00:00
Eric Beckmann	ba395ef491	This patch should fix various clang warnings and a use of to_string which isn't supported before C++11. llvm-svn: 304252	2017-05-30 22:29:06 +00:00
Tim Shen	0bd0aa8f07	[AntiDepBreaker] Revert r299124 and add a test. Summary: AntiDepBreaker intends to add all live-outs, including the implicit CSRs, in StartBlock. r299124 was done without understanding that intention. Now with the live-ins propagated correctly (D32464), we can revert this change. Reviewers: MatzeB, qcolombet Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D33697 llvm-svn: 304251	2017-05-30 22:26:52 +00:00
Tim Northover	d276d85309	MIR: update test for noVRegs removal. I think I hadn't git pulled recently enough to bring it in. llvm-svn: 304250	2017-05-30 22:02:19 +00:00
Zachary Turner	9c1ba225a9	Try to fix build. llvm-svn: 304249	2017-05-30 22:00:37 +00:00
Zachary Turner	d427383cb8	[CodeView] Move CodeView YAML code to ObjectYAML. This is the beginning of an effort to move the codeview yaml reader / writer into ObjectYAML so that it can be shared. Currently the only consumer / producer of CodeView YAML is llvm-pdbdump, but CodeView can exist outside of PDB files, and indeed is put into object files and passed to the linker to produce PDB files. Furthermore, there are subtle differences in the types of records that show up in object file CodeView vs PDB file CodeView, but they are otherwise 99% the same. By having this code in ObjectYAML, we can have llvm-pdbdump reuse this code, while teaching obj2yaml and yaml2obj to use this syntax for dealing with object files that can contain CodeView. This patch only adds support for CodeView type information to ObjectYAML. Subsequent patches will add support for CodeView symbol information. llvm-svn: 304248	2017-05-30 21:53:05 +00:00
Matthias Braun	5e394c3d6f	TargetPassConfig: Keep a reference to an LLVMTargetMachine; NFC TargetPassConfig is not useful for targets that do not use the CodeGen library, so we may just as well store a pointer to an LLVMTargetMachine instead of just to a TargetMachine. While at it, also change the constructor to take a reference instead of a pointer as the TM must not be nullptr. llvm-svn: 304247	2017-05-30 21:36:41 +00:00
Tim Northover	fb26d9a286	MIR: remove explicit "noVRegs" property. We can infer this from the incoming MIR, so there's no reason to represent it with a special flag. llvm-svn: 304246	2017-05-30 21:28:57 +00:00
Xinliang David Li	74480adafd	[PartialInlining] Shrinkwrap allocas with live range contained in outline region. Differential Revision: http://reviews.llvm.org/D33618 llvm-svn: 304245	2017-05-30 21:22:18 +00:00
Quentin Colombet	73141d5b4b	[Localizer] Don't try to be smart for the insertion point There is no guarantee that the first use of a constant that is traversed is actually the first in the related basic block. Thus, if we use that as the insertion point we may end up with definitions that don't dominate their use. llvm-svn: 304244	2017-05-30 20:53:06 +00:00
Ben Langmuir	a8217afe16	[llvm-config] Fix cflags test looking for "warning" This will fail if you configure with e.g. -Wno-unknown-warning-option. Change it to check for 'warning:' just like we did for 'error:' in r289484. llvm-svn: 304239	2017-05-30 20:21:47 +00:00
Matthew Simpson	646475a9bc	[LV] Reapply r303763 with fix for PR33193 r303763 caused build failures in some out-of-tree tests due to an assertion in TTI. The original patch updated cost estimates for induction variable update instructions marked for scalarization. However, it didn't consider that the incoming value of an induction variable phi node could be a cast instruction. This caused queries for cast instruction costs with a mix of vector and scalar types. This patch includes a fix for cast instructions and the test case from PR33193. The fix was suggested by Jonas Paulsson <paulsson@linux.vnet.ibm.com>. Reference: https://bugs.llvm.org/show_bug.cgi?id=33193 Original Differential Revision: https://reviews.llvm.org/D33457 llvm-svn: 304235	2017-05-30 19:55:57 +00:00
Benjamin Kramer	c69fe9cc62	[Object] Remove unused field + constructor. llvm-svn: 304233	2017-05-30 19:37:02 +00:00
Benjamin Kramer	14ea122e6e	[Object] Fix pessimizing move. Returning the Error by value triggers copy elision, the move is more expensive. Clang rightfully warns about it. llvm-svn: 304232	2017-05-30 19:36:58 +00:00
Vedant Kumar	87aefe9042	Revert "This patch closes PR28513: an optimization of multiplication by different constants. It's implemented on DAG combiner level." This reverts commit r304209. I think this change is responsible for a tablegen failure in stage2 builds: http://green.lab.llvm.org/green/job/clang-stage2-configure-Rthinlto_build/2171/ I reproduced the failure locally (without ThinLTO), reverted the commit, rebuilt the stage1 clang, rebuilt the stage2 llvm-tblgen tool, and found that the crash disappears when the commit is reverted. Here is the stack trace: FAILED: lib/Target/ARM/ARMGenRegisterBank.inc.tmp cd /Volumes/Builds/pz-master-stage2-RA/lib/Target/ARM && /Volumes/Builds/pz-master-stage2-RA/bin/llvm-tblgen -gen-register-bank -I /Users/vk/llvm/lib/Target/ARM -I /Users/vk/llvm/include -I /Users/vk/llvm/lib/Target /Users/vk/llvm/lib/Target/ARM/ARM.td -o /Volumes/Builds/pz-master-stage2-RA/lib/Target/ARM/ARMGenRegisterBank.inc.tmp 0 llvm-tblgen 0x0000000106fc9568 llvm::sys::PrintStackTrace(llvm::raw_ostream&) + 40 1 llvm-tblgen 0x0000000106fc9be6 SignalHandler(int) + 422 2 libsystem_platform.dylib 0x00000001076a7fba _sigtramp + 26 3 libsystem_platform.dylib 0x00007fff58deb468 _sigtramp + 1366570184 4 llvm-tblgen 0x0000000106e89cc7 llvm::CodeGenRegBank::getCompositeSubRegIndex(llvm::CodeGenSubRegIndex, llvm::CodeGenSubRegIndex) + 615 5 llvm-tblgen 0x0000000106e88be6 llvm::CodeGenRegister::computeSubRegs(llvm::CodeGenRegBank&) + 2182 6 llvm-tblgen 0x0000000106e8e9f0 llvm::CodeGenRegBank::CodeGenRegBank(llvm::RecordKeeper&) + 2192 7 llvm-tblgen 0x0000000106f384a1 llvm::EmitRegisterBank(llvm::RecordKeeper&, llvm::raw_ostream&) + 65 8 llvm-tblgen 0x0000000106f72c64 (anonymous namespace)::LLVMTableGenMain(llvm::raw_ostream&, llvm::RecordKeeper&) + 1172 9 llvm-tblgen 0x0000000106fcb15f llvm::TableGenMain(char, bool ()(llvm::raw_ostream&, llvm::RecordKeeper&)) + 3599 10 llvm-tblgen 0x0000000106f727a6 main + 134 11 libdyld.dylib 0x000000010733c6a5 start + 1 Stack 
dump: 0. Program arguments: /Volumes/Builds/pz-master-stage2-RA/bin/llvm-tblgen -gen-register-bank -I /Users/vk/llvm/lib/Target/ARM -I /Users/vk/llvm/include -I /Users/vk/llvm/lib/Target /Users/vk/llvm/lib/Target/ARM/ARM.td -o /Volumes/Builds/pz-master-stage2-RA/lib/Target/ARM/ARMGenRegisterBank.inc.tmp /bin/sh: line 1: 41986 Segmentation fault: 11 /Volumes/Builds/pz-master-stage2-RA/bin/llvm-tblgen -gen-register-bank -I /Users/vk/llvm/lib/Target/ARM -I /Users/vk/llvm/include -I /Users/vk/llvm/lib/Target /Users/vk/llvm/lib/Target/ARM/ARM.td -o /Volumes/Builds/pz -master-stage2-RA/lib/Target/ARM/ARMGenRegisterBank.inc.tmp llvm-svn: 304231	2017-05-30 19:25:22 +00:00
Galina Kistanova	8c1e2f9108	Added missing break. llvm-svn: 304230	2017-05-30 19:02:49 +00:00
Keno Fischer	3fa5db4c04	Revert "[Cloning] Take another pass at properly cloning debug info" At least one build bot is complaining. Will investigate after lunch. llvm-svn: 304228	2017-05-30 18:56:26 +00:00
Matthias Braun	700603555a	ARM: Add missing flags to TBB_[JH]T pseudo instructions NFC except for calming down the machine verifier in some cases. llvm-svn: 304227	2017-05-30 18:52:33 +00:00
Keno Fischer	945dc1d2d1	[Cloning] Take another pass at properly cloning debug info Summary: In rL302576, DISubprograms gained the constraint that !dbg attachments to functions must have a 1:1 mapping to DISubprograms. As part of that change, the function cloning support was adjusted to attempt to enforce this invariant during cloning. However, there were several problems with the implementation. Some of these were fixed in rL304079. However, there was a more fundamental problem with these changes, namely that they bypass the metadata value map, causing the cloned metadata to be a mix of metadata pointing to the new subprogram (where manual code was added to fix those up) and the old subprogram (where this was not the case). This mismatch could cause a number of different assertion failures in the DWARF emitter. Some of these are given at https://github.com/JuliaLang/julia/issues/22069, but some others have been observed as well. Attempt to rectify this by partially reverting the manual DI metadata fixup, and instead using the standard value map approach. To retain the desired semantics of not duplicating the compilation unit and inlined subprograms, explicitly freeze these in the value map. Reviewers: dblaikie, aprantl, GorNishanov, echristo Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33655 llvm-svn: 304226	2017-05-30 18:28:30 +00:00
Eric Beckmann	72fb6a87fb	Adding parsing ability for .res file. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33566 llvm-svn: 304225	2017-05-30 18:19:06 +00:00
Craig Topper	4cec434a1b	[InstCombine] Add test cases to show missed opportunities to remove compare instructions after cttz/ctlz/ctpop where some bits of the input are known. llvm-svn: 304224	2017-05-30 17:47:59 +00:00
Krzysztof Parzyszek	ef58017b35	[Hexagon] Improve code generation for 32x32-bit multiplication For multiplications of 64-bit values (giving 64-bit result), detect cases where the arguments are sign-extended 32-bit values, on a per-operand basis. This will allow few patterns to match a wider variety of combinations in which extensions can occur. llvm-svn: 304223	2017-05-30 17:47:51 +00:00
Zachary Turner	591312c5c1	[CodeView] Add more DebugSubsection implementations. This adds implementations for Symbols and FrameData, and renames the existing codeview::StringTable class to conform to the DebugSectionStringTable convention. llvm-svn: 304222	2017-05-30 17:13:33 +00:00
Craig Topper	5fd588be34	[SelectionDAG] Remove special case for ISD::FPOWI from the strict FP intrinsic handling. This code was compensating for FPOWI defaulting to Legal and many targets not changing it to Expand. This was fixed in r304215 to default to Expand so this special handling should no longer be necessary. llvm-svn: 304221	2017-05-30 17:12:18 +00:00
Stanislav Mekhanoshin	56ea488d8b	[AMDGPU] Allow SDWA in instructions with immediates and SGPRs The encoding does not allow SDWA in an instruction with scalar operands, either literals or SGPRs. It is, however, possible to copy these operands into a VGPR first. Several copies of the value are produced if multiple SDWA conversions are done. To clean up, MachineLICM (to hoist copies out of loops), MachineCSE (to remove duplicate copies) and SIFoldOperands (to replace an SGPR-to-VGPR copy with an immediate copy directly into the VGPR) runs are added after the SDWA pass. Differential Revision: https://reviews.llvm.org/D33583 llvm-svn: 304219	2017-05-30 16:49:24 +00:00
Zachary Turner	8c099fe06e	[CodeView] Rename ModuleDebugFragment -> DebugSubsection. This is more concise, and matches the terminology used in other parts of the codebase more closely. llvm-svn: 304218	2017-05-30 16:36:15 +00:00
Mark Searles	00ce96f6ee	[AMDGPU] Require waitcnt before barrier for all targets; adjust tests. Differential Revision: https://reviews.llvm.org/D33576 llvm-svn: 304217	2017-05-30 16:22:43 +00:00
Craig Topper	f6d4dc5b4a	[SelectionDAG] Set ISD::FPOWI to Expand by default Summary: Currently FPOWI defaults to Legal and LegalizeDAG.cpp turns Legal into Expand for this opcode because Legal is a "lie". This patch changes the default for this opcode to Expand and removes the hack from LegalizeDAG.cpp. It also removes all the code in the targets that set this opcode to Expand themselves since they can just rely on the default. Reviewers: spatel, RKSimon, efriedma Reviewed By: RKSimon Subscribers: jfb, dschuff, sbc100, jgravelle-google, nemanjai, javed.absar, andrew.w.kaylor, llvm-commits Differential Revision: https://reviews.llvm.org/D33530 llvm-svn: 304215	2017-05-30 15:27:55 +00:00
Andrew V. Tischenko	8b04826663	This patch closes PR28513: an optimization of multiplication by different constants. It's implemented on DAG combiner level. llvm-svn: 304209	2017-05-30 13:00:44 +00:00
Max Kazantsev	d8fe3eb9cb	[SCEV][NFC] Remove redundant params from isAvailableAtLoopEntry Params DT and LI are redundant, because these values are contained in fields anyways. Differential Revision: https://reviews.llvm.org/D33668 llvm-svn: 304204	2017-05-30 10:54:58 +00:00
Ulrich Weigand	3f484e68cc	[SystemZ] Add decimal floating-point instructions This adds assembler / disassembler support for the decimal floating-point instructions. Since LLVM does not yet have support for decimal float types, these cannot be used for codegen at this point. llvm-svn: 304203	2017-05-30 10:15:16 +00:00
Ulrich Weigand	f32adf6944	[SystemZ] Add hexadecimal floating-point instructions This adds assembler / disassembler support for the hexadecimal floating-point instructions. Since the Linux ABI does not use any hex float data types, these are not useful for codegen. llvm-svn: 304202	2017-05-30 10:13:23 +00:00
Ulrich Weigand	6ceea9a4d3	[SystemZ] Add missing assembler/disassembler tests A few instructions that are actually correctly supported in the assembler and disassembler did not have any tests. llvm-svn: 304200	2017-05-30 10:11:13 +00:00
Oliver Stannard	3d0f9507d5	[MC] Fix constant pools with DenseMap sentinel values The MC ConstantPool class uses a DenseMap to track generated constants, with the int64_t value of the constant as the key. This fails when values of 0x7fffffffffffffff or 0x7ffffffffffffffe are inserted into the constant pool, as these are sentinel values for DenseMap. The fix is to use std::map instead, which doesn't use sentinel values. Differential revision: https://reviews.llvm.org/D33667 llvm-svn: 304199	2017-05-30 09:37:11 +00:00
Zoran Jovanovic	375b60de74	[mips] Expansion of LI.S and LI.D Author: smaksimovic Reviewers: dsanders sdardis Introduces LI.S and LI.D pseudo instructions with floating point operands. Differential Revision: https://reviews.llvm.org/D14390 llvm-svn: 304198	2017-05-30 09:33:43 +00:00
Kristof Beyls	2af1e90eb2	Fix PR33031: correct the estimate of maximum offset for instructions spilling/filling the stack. llvm-svn: 304196	2017-05-30 06:58:41 +00:00
Daniel Berlin	2aa5dc1589	NewGVN: Compute hash value of expression on demand and use it in inequality testing. llvm-svn: 304195	2017-05-30 06:58:18 +00:00
Daniel Berlin	c8ed40400c	NewGVN: Fix PR33194, memory corruption by putting temporary instructions in tables sometimes. llvm-svn: 304194	2017-05-30 06:42:29 +00:00
Galina Kistanova	5c4f1a9b02	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304187	2017-05-30 03:30:34 +00:00
Galina Kistanova	624b9c9a81	Added missing line continuation to HANDLE_DIEVALUE_SMALL and HANDLE_DIEVALUE_LARGE macros. llvm-svn: 304186	2017-05-30 03:21:12 +00:00
Galina Kistanova	0570841806	Reverted r303602, as it will be fixed in gtest. llvm-svn: 304184	2017-05-30 03:17:10 +00:00
Joerg Sonnenberger	9375a25342	Revert r303763, results in asserts i.e. while building Ruby. llvm-svn: 304179	2017-05-29 22:52:17 +00:00
Craig Topper	638b1021bf	[TableGen] Use StringMap instead of DenseMap<StringRef> to unique CodeInit and StringInit objects. Override the allocator to keep using the BumpPtrAllocator. NFCI StringMap is better suited to mapping strings than a DenseMap. llvm-svn: 304178	2017-05-29 21:49:37 +00:00
Craig Topper	481ff7087f	[TableGen] Introduce DagInit::getArgs that returns an ArrayRef. Use it to fix 80 column violations in arg_begin/arg_end. Remove DagInit::args and use getArgs instead. NFC llvm-svn: 304177	2017-05-29 21:49:34 +00:00
Benjamin Kramer	74de08031f	[ManagedStatic] Avoid putting function pointers in template args. This is super awkward, but GCC doesn't let us have template visible when an argument is an inline function and -fvisibility-inlines-hidden is used. llvm-svn: 304175	2017-05-29 20:56:27 +00:00
Davide Italiano	af66659d6b	[GlobalIsel] Fix a warning with GCC 7 -Wpedantic. NFCI. llvm-svn: 304174	2017-05-29 20:13:22 +00:00
Zvi Rackover	c7bf2a1fae	[X86] Add tests for (ix bitcast (vxi1 and ...)). NFC. To be improved by D33311. llvm-svn: 304171	2017-05-29 19:00:57 +00:00
Zvi Rackover	41e01b3c98	[X86] Replace undef value in flaky test D33311 exposes the flakiness in this test. Replacing the undef placed by bugpoint, makes it more interesting and robust. llvm-svn: 304168	2017-05-29 18:27:00 +00:00
Benjamin Kramer	94b72bfd35	[ManagedStatic] Make object_creator/object_deleter visible again. They're now exposed as template args, which creates complications when ManagedStatics are used across .so boundaries. llvm-svn: 304166	2017-05-29 18:00:33 +00:00
Benjamin Kramer	ca693ed79b	Don't destroy ManagedStatics in a unit test. Turns out this is very hostile towards other unit tests running in the same process, it unregisters all flags. llvm-svn: 304165	2017-05-29 17:25:37 +00:00
Benjamin Kramer	41b61242a4	[wasm] Fix test after r304117. llvm-svn: 304164	2017-05-29 16:32:52 +00:00
Benjamin Kramer	fd1952761e	[X86] Don't fold away the memory operand of an xchg. xchg with a mem operand has different locking semantics. If we unfold it into a xchg r,r we will lose the implicit lock. Likewise we never want to fold a register xchg into a memory one as it would be a lot slower. This triggers during LLVM selfhost. llvm-svn: 304163	2017-05-29 16:25:20 +00:00
Ayal Zaks	4c4baf5093	[Docs] Add VectorizationPlan to docs/Proposals. Following the request made in https://reviews.llvm.org/D32871, the general documentation of the Vectorization Plan is hereby placed under docs/Proposals. llvm-svn: 304161	2017-05-29 15:36:23 +00:00
Benjamin Kramer	2a441a52df	Try to work around MSVC being buggy. Attempt #1 . error C2971: 'llvm::ManagedStatic': template parameter 'Creator': 'CreateDefaultTimerGroup': a variable with non-static storage duration cannot be used as a non-type argument llvm-svn: 304157	2017-05-29 14:28:04 +00:00
Benjamin Kramer	351779e972	[Timer] Move DefaultTimerGroup into a ManagedStatic. This used to be just leaked. r295370 made it use magic statics. This adds a global destructor, which is something we'd like to avoid. It also creates a weird situation where the mutex used by TimerGroup is re-created during global shutdown and leaked. Using a ManagedStatic here is also subtle as it relies on the mutex inside of ManagedStatic to be recursive. I've added a test for that in a previous change. llvm-svn: 304156	2017-05-29 14:05:29 +00:00
Benjamin Kramer	1533eda111	[ManagedStatic] Add a way to pass custom creators/deleters. Also add a test case verifying that nested ManagedStatics work correctly. llvm-svn: 304155	2017-05-29 14:05:26 +00:00
Sanjay Patel	51152a3727	[DAGCombiner] fix load narrowing transform to exclude loads with extension The extending load possibility was missed in: https://reviews.llvm.org/rL304072 We might want to handle this case as a follow-up, but bailing out for now to avoid miscompiling. llvm-svn: 304153	2017-05-29 13:24:58 +00:00
Jonas Paulsson	fe0c0935c8	[SystemZ] Improve buildVector() in SystemZISelLowering.cpp. Use VLREP when inserting one or more loads into a vector. This is more efficient than to first load and then use a VLVGP. Review: Ulrich Weigand llvm-svn: 304152	2017-05-29 13:22:23 +00:00
Mattias Eriksson	b808e99a55	Test commit: fix typos Just fixing a few typos in comments to test commit access. llvm-svn: 304149	2017-05-29 11:46:44 +00:00
Nikolai Bozhenov	82f0801c1b	[Nios2] Target registration Reviewers: craig.topper, hfinkel, joerg, lattner, zvi Reviewed By: craig.topper Subscribers: oren_ben_simhon, igorb, belickim, tvvikram, mgorny, llvm-commits, pavel.v.chupin, DavidKreitzer Differential Revision: https://reviews.llvm.org/D32669 Patch by AndreiGrischenko <andrei.l.grischenko@intel.com> llvm-svn: 304144	2017-05-29 09:48:30 +00:00
Diana Picus	0c05cce4e0	[ARM] GlobalISel: Extract helper. NFCI. Create a helper to deal with the common code for merging incoming values together after they've been split during call lowering. There's likely more stuff that can be commoned up here, but we'll leave that for later. llvm-svn: 304143	2017-05-29 09:09:54 +00:00
Hiroshi Inoue	ac9cd3080d	[trivial] fix a typo in comment, NFC llvm-svn: 304139	2017-05-29 08:37:42 +00:00
Diana Picus	bf4aed2c38	[ARM] GlobalISel: Support array returns These are a bit rare in practice, but they don't require anything special compared to array parameters, so support them as well. llvm-svn: 304137	2017-05-29 08:19:19 +00:00
Hiroshi Inoue	e3c14ebbfa	[PPC] Fix assertion failure during binary encoding with -mcpu=pwr9 Summary: clang -c -mcpu=pwr9 test/CodeGen/PowerPC/build-vector-tests.ll causes an assertion failure during the binary encoding. The failure occurs when a D-form load instruction takes two register operands instead of a register + an immediate. This patch fixes the problem and also adds an assertion to catch this failure earlier, before the binary encoding (i.e. during lit test). The fix is from Nemanja Ivanovic @nemanjai. Differential Revision: https://reviews.llvm.org/D33482 llvm-svn: 304133	2017-05-29 07:12:39 +00:00
Diana Picus	8cca8cb0ce	[ARM] GlobalISel: Support array parameters/arguments Clang coerces structs into arrays, so it's a good idea to support them. Most of the support boils down to getting the splitToValueTypes helper to actually split types. We then use G_INSERT/G_EXTRACT to deal with the parts. llvm-svn: 304132	2017-05-29 07:01:52 +00:00
Mehdi Amini	96ab48f9da	DebugInfo: Include .dwo file name when hashing multiple CUs in a single file This is really a workaround for ThinLTO in particular - since it can import partial CUs that may end up looking very similar/the same as the same partial import in another ThinLTO compile. An alternative fix would be to change the DICompileUnit metadata to include a "primary file" or the like - and when importing for ThinLTO set the primary file to the name of the DICompileUnit that is being imported into. This involves changing the schema and would reduce the excessive uniqueness in the hash that this change creates - allowing diagnosing of more duplicate CUs than will be caught with this change. But duplicate CUs can still be caught in non-ThinLTO builds & are mostly a nuisance rather than a particularly deliberate/effective tool for finding broken code. (arguably the hash could always include the dwo file and nothing in fission would break, I think..) Reapply of r304119 after adding a triple to the test and moving it to the X86 directory. llvm-svn: 304130	2017-05-29 06:32:34 +00:00
Mehdi Amini	4181205563	DebugInfo: Omit an empty CU when a subprogram was moved into its use When the only use of a CU is for a subprogram that's only emitted into the using CU (to avoid cross-CU references in DWO files), avoid creating that CU at all. Reapply of r304111 after adding a triple to the test and moving it to the X86 directory. llvm-svn: 304129	2017-05-29 06:25:30 +00:00
Tobias Grosser	8cf785f6b1	Revert "[IfConversion] Keep the CFG updated incrementally in IfConvertTriangle" The reverted change introduced assertions à la: "MachineBasicBlock::succ_iterator llvm::MachineBasicBlock::removeSuccessor(succ_iterator, bool): Assertion `I != Successors.end() && "Not a current successor!"' Mikael, the original committer, wrote me that he is working on a fix, but that it likely will take some time to get this resolved. As this bug is one of the last two issues that keep the AOSP buildbot from turning green, I revert the original commit r302876. I am looking forward to seeing this recommitted after the assertion has been resolved. llvm-svn: 304128	2017-05-29 06:12:18 +00:00
Mehdi Amini	e161ced16a	Revert "DebugInfo: Omit an empty CU when a subprogram was moved into its use" This reverts commit r304111. GreenDragon is broken. llvm-svn: 304126	2017-05-29 05:17:57 +00:00
Mehdi Amini	d8056bb7d8	Revert "DebugInfo: Include .dwo file name when hashing multiple CUs in a single file" This reverts commit r304119 and r304118. GreenDragon is broken. llvm-svn: 304125	2017-05-29 05:17:54 +00:00
Zachary Turner	eaacd07079	Don't capture a temporary std::string in a StringRef. This fixes the breakages in llvm-tblgen. llvm-svn: 304123	2017-05-29 02:20:12 +00:00
Zachary Turner	df1832cf86	Resubmit "[X86] Adding new LLVM TableGen backend that generates the X86 backend memory folding tables." This was reverted due to buildbot breakages and I was not familiar with this code to investigate it. But while trying to get a useful backtrace for the author, it turns out the fix was very obvious. Resubmitting this patch as is, and will submit the fix in a followup so that the fix is not hidden in the larger CL. llvm-svn: 304122	2017-05-29 02:19:37 +00:00
Zachary Turner	5b199be769	Revert "[X86] Adding new LLVM TableGen backend that generates the X86 backend memory folding tables." This reverts commit 28cb1003507f287726f43c771024a1dc102c45fe as well as all subsequent followups. llvm-tblgen currently segfaults with this change, and it seems it has been broken on the bots all day with no fixes in preparation. See, for example: http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/ llvm-svn: 304121	2017-05-29 01:48:53 +00:00
Galina Kistanova	229c9c1159	Disabled implicit-fallthrough warnings for ConvertUTF.cpp. ConvertUTF.cpp has a little dependency on LLVM, and since the code extensively uses fall-through switches, I prefer disabling the warning for the whole file, rather than adding attributes for each case. llvm-svn: 304120	2017-05-29 01:34:26 +00:00
David Blaikie	ce0c205813	DebugInfo: Include .dwo file name when hashing multiple CUs in a single file This is really a workaround for ThinLTO in particular - since it can import partial CUs that may end up looking very similar/the same as the same partial import in another ThinLTO compile. An alternative fix would be to change the DICompileUnit metadata to include a "primary file" or the like - and when importing for ThinLTO set the primary file to the name of the DICompileUnit that is being imported into. This involves changing the schema and would reduce the excessive uniqueness in the hash that this change creates - allowing diagnosing of more duplicate CUs than will be caught with this change. But duplicate CUs can still be caught in non-ThinLTO builds & are mostly a nuisance rather than a particularly deliberate/effective tool for finding broken code. (arguably the hash could always include the dwo file and nothing in fission would break, I think..) llvm-svn: 304119	2017-05-29 00:48:45 +00:00
David Blaikie	02f8a07689	Attempt to fix buildbots... llvm-svn: 304118	2017-05-29 00:24:01 +00:00
Saleem Abdulrasool	f122423ace	Support: adjust the default obj format for wasm WebAssembly uses a custom object file format. For the wasm targets, default to the `Wasm` object file format. llvm-svn: 304117	2017-05-29 00:14:57 +00:00
Dylan McKay	74fc1ce0c2	[AVR] Remove SREG from CPI's Uses; authored by Florian Zeitz Summary: CPI does not read the status register, but only writes it. Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33223 llvm-svn: 304116	2017-05-29 00:10:14 +00:00
Craig Topper	251cdbef1d	[TableGen][X86] Fix formatting I accidentally messed up in r304099. NFC llvm-svn: 304115	2017-05-28 23:47:17 +00:00
Erik Pilkington	de83eea576	[ItaniumDemangle] Fix an exponential string copying bug This is a port of libcxxabi's r304113. llvm-svn: 304114	2017-05-28 23:24:52 +00:00
NAKAMURA Takumi	a288ec412f	Prune trailing whitespace. (To regenerate makefiles) llvm-svn: 304112	2017-05-28 22:54:25 +00:00
David Blaikie	f2f898a044	DebugInfo: Omit an empty CU when a subprogram was moved into its use When the only use of a CU is for a subprogram that's only emitted into the using CU (to avoid cross-CU references in DWO files), avoid creating that CU at all. llvm-svn: 304111	2017-05-28 22:51:37 +00:00
Geoff Berry	2739ebafb6	[AArch64][Falkor] Combine sched details files into one. NFC. llvm-svn: 304109	2017-05-28 22:20:44 +00:00
Geoff Berry	b542fb3817	[AArch64][Falkor] Fix some sched details. - Remove all uses of base sched model entries and set them all to Unsupported so all the opcodes are described in AArch64SchedFalkorDetails.td. - Remove entries for unsupported half-float opcodes. - Remove entries for unsupported LSE extension opcodes. - Add entry for MOVbaseTLS (and set Sched in base td file entry to WriteSys) and a few other pseudo ops. - Fix a few FP load/store with reg offset entries to use the LSLfast predicates. - Add Q size BIF/BIT/BSL entries. - Fix swapped Q/D sized CLS/CLZ/CNT/RBIT entries. - Fix pre/post increment address register latency (this operand is always dest 0). - Fix swapped FCVTHD/FCVTHS/FCVTDH/FCVTDS entries. - Fix XYZ resource over usage on LD[1-4] opcodes. llvm-svn: 304108	2017-05-28 21:48:31 +00:00
Craig Topper	cf09175de8	[TableGen][X86] Use CHAR_BIT with sizeof instead of hardcoded 8. NFC llvm-svn: 304100	2017-05-28 18:24:43 +00:00
Craig Topper	8351075181	[TableGen][X86] Mark a couple global tables as const. NFC llvm-svn: 304099	2017-05-28 18:24:41 +00:00
Craig Topper	a38a80108b	[TableGen][X86] Improve formatting of the fold table output by indenting the body of the table and adding blank lines between tables. NFC llvm-svn: 304098	2017-05-28 18:24:39 +00:00
Craig Topper	2bf3152325	[TableGen][X86] Add an llvm_unreachable to a switch so we get an error if we need expansion in the future. llvm-svn: 304097	2017-05-28 18:24:37 +00:00
Craig Topper	8eaf0edb41	[TableGen][X86] Remove unnecessary std::string creations. NFC llvm-svn: 304096	2017-05-28 18:24:35 +00:00
Craig Topper	f62d4e240d	[TableGen][X86] Replace a global std::vector with a regular array. llvm::find works on arrays, just need to use std::end to check the result. llvm-svn: 304095	2017-05-28 18:24:32 +00:00
Craig Topper	5a4ec21461	[TableGen][X86] getValueAsString returns a std::string not a StringRef. Capture it that way to avoid a StringRef to a temporary. llvm-svn: 304093	2017-05-28 17:48:41 +00:00
Sanjay Patel	bb9fe3b409	[x86] auto-generate better checks; NFC llvm-svn: 304090	2017-05-28 13:57:59 +00:00
Benjamin Kramer	9d8ed2653f	[InstrProf] Use more ArrayRef/StringRef. No functional change intended. llvm-svn: 304089	2017-05-28 13:23:02 +00:00
Ayman Musa	d9f1fe43a8	[X86] Adding new LLVM TableGen backend that generates the X86 backend memory folding tables. X86 backend holds huge tables in order to map between the register and memory forms of each instruction. This TableGen Backend automatically generated all these tables with the appropriate flags for each entry. Differential Revision: https://reviews.llvm.org/D32684 llvm-svn: 304088	2017-05-28 12:55:36 +00:00
Ayman Musa	0b4f97d5e9	[X86] Adding FoldGenRegForm helper field (for memory folding tables tableGen backend) to X86Inst class and set its value for the relevant instructions. Some register-register instructions can be encoded in 2 different ways; this happens when 2 register operands can be folded (separately). For example, if we look at the MOV8rr and MOV8rr_REV, both instructions perform exactly the same operation, but are encoded differently. Here is the relevant information about these instructions from Intel's 64-ia-32-architectures-software-developer-manual: Opcode Instruction Op/En 64-Bit Mode Compat/Leg Mode Description 8A /r MOV r8,r/m8 RM Valid Valid Move r/m8 to r8. 88 /r MOV r/m8,r8 MR Valid Valid Move r8 to r/m8. Here we can see that in order to enable the folding of the output and input registers, we had to define 2 "encodings", and as a result we got 2 move 8-bit register-register instructions. In the X86 backend, we define both of these instructions; usually one has a regular name (MOV8rr) while the other has a "_REV" suffix (MOV8rr_REV), must be marked with the isCodeGenOnly flag and is not emitted from CodeGen. Automatically generating the memory folding tables relies on matching encodings of instructions, but in these cases where we want to map both memory forms of the mov 8-bit (MOV8rm & MOV8mr) to MOV8rr (not to MOV8rr_REV) we have to somehow point from the MOV8rr_REV to the "regular" appropriate instruction, which in this case is MOV8rr. This field enables this "pointing" mechanism, which is used in the TableGen backend for generating memory folding tables. Differential Revision: https://reviews.llvm.org/D32683 llvm-svn: 304087	2017-05-28 12:39:37 +00:00
Oren Ben Simhon	f3aab2fa33	[X86] Fixing VPOPCNTDQ feature set lookup. llvm-svn: 304086	2017-05-28 11:26:11 +00:00
Galina Kistanova	3642e5132b	Reverted r304083 as it seems there is a desire to address this in the googletest. llvm-svn: 304084	2017-05-28 05:50:22 +00:00
Galina Kistanova	6cb62f7260	Added braces to address gcc warning: suggest explicit braces to avoid ambiguous 'else' [-Wdangling-else]. NFC. llvm-svn: 304083	2017-05-28 03:50:52 +00:00
David Blaikie	7b91deb68d	DebugInfo: Add source code/build instructions for split-dwarf-dwp symbolizer test Addressing post-commit code review feedback from Paul Robinson on r303609. llvm-svn: 304080	2017-05-27 19:52:20 +00:00
Gor Nishanov	ffbeb22b6f	Cloning: Fix debug info cloning Summary: I believe https://reviews.llvm.org/rL302576 introduced two bugs: 1) it produces duplicate distinct variables for every dbg.value describing the same variable. To fix the problem I switched from getDistinct() to get() in DebugLoc.cpp: auto reparentVar = [&](DILocalVariable Var) { return DILocalVariable::getDistinct( 2) It passes NewFunction plain name as a linkagename parameter to Subprogram constructor. Breaks assert in: || DeclLinkageName.empty()) || LinkageName == DeclLinkageName) && "decl has a linkage name and it is different"' failed. #9 0x00007f5010261b75 llvm::DwarfUnit::applySubprogramDefinitionAttributes(llvm::DISubprogram const*, llvm::DIE&) /home/gor/llvm/lib/CodeGen/AsmPrinter/DwarfUnit.cpp:1173:3 # (Edit: reproducer added) Here is how https://reviews.llvm.org/rL302576 broke coroutine debug info. Coroutine body of the original function is split into several parts by cloning and removing unneeded code. All parts describe the original function and variables present in the original function. For a simple case, prior to Split, original function has these two blocks: ``` PostSpill: ; preds = %AllocaSpillBB call void @llvm.dbg.value(metadata i32 %x, i64 0, metadata !14, metadata !15), !dbg !13 store i32 %x, i32* %x.addr, align 4 ... and sw.epilog: ; preds = %sw.bb %x.addr.reload.addr = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4, !dbg !20 %4 = load i32, i32* %x.addr.reload.addr, align 4, !dbg !20 call void @llvm.dbg.value(metadata i32 %4, i64 0, metadata !14, metadata !15), !dbg !13 !14 = !DILocalVariable(name: "x", arg: 1, scope: !6, file: !7, line: 55, type: !11) ``` Note that in two blocks different expressions represent the same original user variable X. Before rL302576, for every cloned function there was exactly one cloned DILocalVariable(name: "x" as in: ``` define i8* @f(i32 %x) #0 !dbg !6 { ... 
!6 = distinct !DISubprogram(name: "f", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, flags: DIFlagPrototyped, ... !14 = !DILocalVariable(name: "x", arg: 1, scope: !6, file: !7, line: 55, type: !11) define internal fastcc void @f.resume(%f.Frame* %FramePtr) #0 !dbg !25 { ... !25 = distinct !DISubprogram(name: "f", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2) !28 = !DILocalVariable(name: "x", arg: 1, scope: !25, file: !7, line: 55, type: !11) ``` After rL302576, for every cloned function there were as many DILocalVariable(name: "x" as there were "call void @llvm.dbg.value" for that variable. This was causing asserts in VerifyDebugInfo and AssemblyPrinter. Example: ``` !27 = distinct !DISubprogram(name: "f", linkageName: "f.resume", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, !29 = distinct !DILocalVariable(name: "x", arg: 1, scope: !27, file: !7, line: 55, type: !11) !39 = distinct !DILocalVariable(name: "x", arg: 1, scope: !27, file: !7, line: 55, type: !11) !41 = distinct !DILocalVariable(name: "x", arg: 1, scope: !27, file: !7, line: 55, type: !11) ``` Second problem: Prior to rL302576, all clones were described by DISubprogram referring to original function. ``` define i8* @f(i32 %x) #0 !dbg !6 { ... !6 = distinct !DISubprogram(name: "f", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, flags: DIFlagPrototyped, define internal fastcc void @f.resume(%f.Frame* %FramePtr) #0 !dbg !25 { ... !25 = distinct !DISubprogram(name: "f", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, flags: DIFlagPrototyped, ``` After rL302576, DISubprogram for clones is of two minds, plain name refers to the original name, linkageName refers to plain name of the clone. 
``` !27 = distinct !DISubprogram(name: "f", linkageName: "f.resume", scope: !7, file: !7, line: 55, type: !8, isLocal: false, isDefinition: true, scopeLine: 55, ``` I think the assumption in AsmPrinter is that both name and linkageName should refer to the same entity. It asserts here when they are not: ``` || DeclLinkageName.empty()) || LinkageName == DeclLinkageName) && "decl has a linkage name and it is different"' failed. #9 0x00007f5010261b75 llvm::DwarfUnit::applySubprogramDefinitionAttributes(llvm::DISubprogram const*, llvm::DIE&) /home/gor/llvm/lib/CodeGen/AsmPrinter/DwarfUnit.cpp:1173:3 ``` After this fix, behavior (with respect to coroutines) reverts to exactly what it was before, making them debuggable again or, even more importantly, compilable with "-g". Reviewers: dblaikie, echristo, aprantl Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33614 llvm-svn: 304079	2017-05-27 19:41:09 +00:00
George Rimar	a25d329b33	Recommit "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" With fix of uninitialized variable. Original commit message: This change is intended for use by LLD in D33183. The problem we have in LLD when building .gdb_index is that we need to know which section an address range belongs to. Previously it was solved on the LLD side by providing fake section addresses with use of the llvm::LoadedObjectInfo interface. We assigned file offsets as addresses. Then, after obtaining range lists, for each range we had to find section IDs. That not only was slow, but also complicated the implementation and was the reason for incorrect behavior when sections share the same offsets, as D33176 shows. This patch makes DWARF parsers return the section index as well. That solves the problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 304078	2017-05-27 18:10:23 +00:00
Craig Topper	a568c72b7d	[TableGen] Prevent DagInit from leaking its Args and ArgNames when they exceed the size of the SmallVector. DagInits are allocated in a BumpPtrAllocator so they are never destructed. This means the destructor for the SmallVector never runs. To fix this we now allocate the vectors in the BumpPtrAllocator too using TrailingObjects. llvm-svn: 304077	2017-05-27 17:36:50 +00:00
Craig Topper	0836449337	[TableGen] Use the correct type for the first template for the ListInit TrailingObjects. llvm-svn: 304076	2017-05-27 17:36:47 +00:00
Tobias Grosser	e3684d0b84	[SCEV] Assume parameters coming from function calls contain IVs The optimistic delinearization implemented in LLVM detects array sizes by looking for non-linear products between parameters and induction variables. In OpenCL code, such products often look like: A[get_global_id(0) * N + get_global_id(1)] Hence, the IV is hidden in the get_global_id() call, and consequently delinearization would fail because no induction variable is available to help us identify N as the array size parameter. We now use a very simple heuristic to change this. We assume that each parameter that comes directly from a function call is a hidden induction variable. As a result, we can delinearize the access above to: A[get_global_id(0)][get_global_id(1)] llvm-svn: 304073	2017-05-27 15:17:49 +00:00
Sanjay Patel	33f4a97287	[DAGCombiner] use narrow load to avoid vector extract If we have (extract_subvector(load wide vector)) with no other users, that can just be (load narrow vector). This is intentionally conservative. Follow-ups may loosen the one-use constraint to account for the extract cost or just remove the one-use check. The memop chain updating is based on code that already exists multiple times in x86 lowering, so that should be pulled into a helper function as a follow-up. Background: this is a potential improvement noticed via regressions caused by making x86's peekThroughBitcasts() not loop on consecutive bitcasts (see comments in D33137). Differential Revision: https://reviews.llvm.org/D33578 llvm-svn: 304072	2017-05-27 14:07:03 +00:00
Craig Topper	b8ff353fc6	[TableGen] Remove all the static vectors named TheActualPool. These used to hold std::unique_ptrs that managed the allocation for the various *Init objects so that they would be deleted on exit. Everything is allocated in a BumpPtrAllocator now so there is no reason for these to still exist. llvm-svn: 304066	2017-05-27 06:14:12 +00:00
Gor Nishanov	9c6ac6138d	[coroutines] Define getPassName() for coroutine passes Reviewers: GorNishanov Reviewed By: GorNishanov Subscribers: EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D33622 llvm-svn: 304065	2017-05-27 05:54:30 +00:00
Vitaly Buka	a637489ef1	[PartialInlining] Replace delete with unique_ptr in computeCallsiteToProfCountMap Reviewers: davidxl Reviewed By: davidxl Subscribers: vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D33220 llvm-svn: 304064	2017-05-27 05:32:09 +00:00
Gor Nishanov	e5d2911856	ScalarEvolution unit test: fix typo that breaks check-all llvm-svn: 304063	2017-05-27 05:24:30 +00:00
Adam Nemet	147ede9a08	Rearrange Dom unittest to accommodate multiple tests I've taken the approach from the LoopInfo test: * Rather than running in the pass manager just build the analyses manually * Split out the common parts (makeLLVMModule, runWithDomTree) into helpers Differential Revision: https://reviews.llvm.org/D33617 llvm-svn: 304061	2017-05-27 04:05:52 +00:00
Adam Nemet	7fa6dee2e3	clang-format DomTree unittest llvm-svn: 304060	2017-05-27 04:05:50 +00:00
Matthias Braun	88c8c9847d	AArch64/PEI: Do not add reserved regs to liveins We do not track liveness for reserved registers. It is unnecessary to add them to block livein lists. llvm-svn: 304059	2017-05-27 03:38:02 +00:00
Keno Fischer	090f1959c1	[SCEVExpander] Try harder to avoid introducing inttoptr Summary: This fixes introduction of an incorrect inttoptr/ptrtoint pair in the included test case which makes use of non-integral pointers. I suspect there are more cases like this left, but this takes care of the one I was seeing at the moment. Reviewers: sanjoy Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D33129 llvm-svn: 304058	2017-05-27 03:22:55 +00:00
Matthias Braun	868bbd4022	ScheduleDAGInstrs: Fix fixupKills() Rewrite fixupKills() to use the LivePhysRegs class. Simplifies the code and fixes a bug where the CSR registers in return blocks were missed, leading to invalid kill flags. Also remove the unnecessary rule that we wouldn't set kill flags on tied operands. No tests as I have an upcoming commit improving MachineVerifier checks to catch these cases in multiple existing lit tests. llvm-svn: 304055	2017-05-27 02:50:50 +00:00
Erik Pilkington	cbc82b3ca9	[Demangler] copy changes made in libcxxabi's r303718 to ItaniumDemangle llvm-svn: 304053	2017-05-27 01:48:34 +00:00
Quentin Colombet	7a43eddf28	[AArch64][GlobalISel] Add the Localizer pass for the O0 pipeline This should fix most of the issues we have right now with constants being spilled all over the place. llvm-svn: 304052	2017-05-27 01:34:07 +00:00
Quentin Colombet	bece442bd8	[GlobalISel] Add a localizer pass for target to use This reverts commit r299287 plus clean-ups. The localizer pass is a helper pass that could be run at O0 in the GISel pipeline to work around the deficiency of the fast register allocator. It basically shortens the live-ranges of the constants so that the allocator does not spill all over the place. Long term fix would be to make the greedy allocator fast. llvm-svn: 304051	2017-05-27 01:34:00 +00:00
Wei Mi	5bbb5aafc1	[GVN] Recommit the patch "Add phi-translate support in scalarpre". The recommit is to fix a bug about ExtractValue and InsertValue ops. For those ops, some varargs inside GVN::Expression are not value numbers but raw index numbers. It is wrong to do phi-translate for raw index numbers, and the fix is to stop doing that. Right now scalarpre doesn't have phi-translate support, so it will miss some simple PRE opportunities. In the following testcase, for example, current scalarpre cannot recognize that the last "a * b" is fully redundant because a and b used by the last "a * b" expr are both defined by phis. long a[100], b[100], g1, g2, g3; __attribute__((pure)) long goo(); void foo(long a, long b, long c, long d) { g1 = a * b; if (__builtin_expect(g2 > 3, 0)) { a = c; b = d; g2 = a * b; } g3 = a * b; // fully redundant. } The patch adds phi-translate support in scalarpre. This is only a temporary solution before the newpre based on newgvn is available. Differential Revision: https://reviews.llvm.org/D32252 llvm-svn: 304050	2017-05-27 00:54:19 +00:00
Matthias Braun	24dc63a9b9	BranchRelaxation: computeLiveIns() after creating new block One case in BranchRelaxation did not compute liveins after creating a new block. This is caught by existing tests with an upcoming commit that will improve MachineVerifier checking of livein lists. llvm-svn: 304049	2017-05-27 00:53:48 +00:00

149988 Commits