llvm-project

Commit Graph

Author	SHA1	Message	Date
Konstantin Zhuravlyov	f13c9969fc	AMDGPU: Fix v_dot{4, 8}* instruction encoding Differential Revision: https://reviews.llvm.org/D46848 llvm-svn: 332387	2018-05-15 19:32:47 +00:00
Michael Zolotukhin	67cfbaac89	[MemorySSA] Don't sort IDF blocks. Summary: After r332167 we started to sort the IDF blocks inside IDF calculation, so there is no need to re-sort them on the user site. The test changes are due to a slightly different order we're using now (originally we used DFSInNumber and now the blocks are sorted by a pair (LevelFromRoot, DFSInNumber)). Reviewers: dberlin, mgrang Subscribers: Prazek, hiraditya, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D46899 llvm-svn: 332385	2018-05-15 18:40:29 +00:00
Tom Stellard	e182b28ae4	AMDGPU/GlobalISel: Implement select() for G_FCONSTANT Summary: Also clean up G_CONSTANT selection. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46170 llvm-svn: 332379	2018-05-15 17:57:09 +00:00
Konstantin Zhuravlyov	603a43fcd5	AMDGPU: Add disasm tests for deep learning instructions + fix v_fmac_f32 disasm Differential Revision: https://reviews.llvm.org/D46853 llvm-svn: 332377	2018-05-15 17:39:13 +00:00
Simon Pilgrim	be9a206883	[X86] Split WriteCvtF2F into F32->F64 and F64->F32 scheduler classes BtVer2 - Fixes schedules for (V)CVTPS2PD instructions A lot of the Intel models still have too many InstRW overrides for these new classes - this needs cleaning up but I wanted to get the classes in first llvm-svn: 332376	2018-05-15 17:36:49 +00:00
Sanjay Patel	3c35290c58	[InstCombine] fix binop-of-shuffles to check uses llvm-svn: 332375	2018-05-15 17:14:23 +00:00
Krzysztof Parzyszek	db39bf4088	[Hexagon] Remove unused function from subtarget llvm-svn: 332369	2018-05-15 16:32:24 +00:00
Krzysztof Parzyszek	8c389bd368	[Hexagon] Remove unused flag from subtarget and (non)corresponding test llvm-svn: 332365	2018-05-15 16:13:52 +00:00
Simon Dardis	f40eb03ce9	[mips] Mark select instructions correctly Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46702 llvm-svn: 332364	2018-05-15 16:05:04 +00:00
Sanjay Patel	8652c53d29	[DAG] propagate FMF for all FPMathOperators This is a simple hack based on what's proposed in D37686, but we can extend it if needed in follow-ups. It gets us most of the FMF functionality that we want without adding any state bits to the flags. It also intentionally leaves out non-FMF flags (nsw, etc) to minimize the patch. It should provide a superset of the functionality from D46563 - the extra tests show propagation and codegen diffs for fcmp, vecreduce, and FP libcalls. The PPC log2() test shows the limits of this most basic approach - we only applied 'afn' to the last node created for the call. AFAIK, there aren't any libcall optimizations based on the flags currently, so that shouldn't make any difference. Differential Revision: https://reviews.llvm.org/D46854 llvm-svn: 332358	2018-05-15 14:16:24 +00:00
Simon Pilgrim	891ebcdbaa	[X86] Split off F16C WriteCvtPH2PS/WriteCvtPS2PH scheduler classes Btver2 - VCVTPH2PSYrm needs to double pump the AGU Broadwell - missing VCVTPS2PH*mr stores extra latency Allows us to remove the WriteCvtF2FSt conversion store class llvm-svn: 332357	2018-05-15 14:12:32 +00:00
Pavel Labath	80827f10a1	Reapply "DWARFVerifier: Check "completeness" of .debug_names section" This is a resubmit of r331868 (D46583), which was reverted due to failures on the PS4 bot. These have been resolved with r332246/D46748. llvm-svn: 332349	2018-05-15 13:24:10 +00:00
whitequark	8f0ab258bd	[MergeFunctions] Fix merging of small weak functions When two interposable functions are merged, we cannot replace uses and have to emit calls to a common internal function. However, writeThunk() will not actually emit a thunk if the function is too small. This leaves us in a broken state where mergeTwoFunctions already rewired the functions, but writeThunk doesn't do anything. This patch changes the implementation so that: * writeThunk() does just that. * The direct replacement of calls is moved into mergeTwoFunctions() into the non-interposable case only. * isThunkProfitable() is extracted and will be called for the non-interposable case always, and in the interposable case only if uses are still left after replacement. This issue has been introduced in https://reviews.llvm.org/D34806, where the code for checking thunk profitability has been moved. Differential Revision: https://reviews.llvm.org/D46804 Reviewed By: whitequark llvm-svn: 332342	2018-05-15 11:31:07 +00:00
Simon Dardis	ce5d3d657a	[mips] Fix formatting of floating point conversion patterns llvm-svn: 332341	2018-05-15 11:21:07 +00:00
Simon Dardis	aa6bdba0ca	[mips] Add disassembly support for comparison instructions llvm-svn: 332340	2018-05-15 11:18:24 +00:00
Simon Dardis	b79ecec20d	[mips] Fix predicates of mfc1, mtc1 instructions Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46692 llvm-svn: 332339	2018-05-15 11:10:30 +00:00
Artur Gainullin	243a3d56d8	[X86] Improve unsigned saturation downconvert detection. Summary: New unsigned saturation downconvert patterns detection was implemented in X86 Codegen: (truncate (smin (smax (x, C1), C2)) to dest_type), where C1 >= 0 and C2 is unsigned max of destination type. (truncate (smax (smin (x, C2), C1)) to dest_type) where C1 >= 0, C2 is unsigned max of destination type and C1 <= C2. These two patterns are equivalent to: (truncate (umin (smax(x, C1), unsigned_max_of_dest_type)) to dest_type) Reviewers: RKSimon Subscribers: llvm-commits, a.elovikov Differential Revision: https://reviews.llvm.org/D45315 llvm-svn: 332336	2018-05-15 10:24:12 +00:00
JF Bastien	9f62b4c8a8	[NFC] pull a function into its own lambda As requested in D46858, pulling this function into its own lambda makes it easier to read that part of the code and reason as to what's going on because the scope it can be called from is extremely limited. We want to keep it as a function because it's called from the two subsequent lines. llvm-svn: 332325	2018-05-15 04:23:48 +00:00
JF Bastien	93bce5108b	[NFC] Update comments Don't prepend function or data name before each comment. Split into its own NFC patch as requested in D46858. llvm-svn: 332323	2018-05-15 04:06:28 +00:00
Shiva Chen	3969425081	[RISCV] Define FeatureRelax and shouldForceRelocation for RISCV linker relaxation 1. Define FeatureRelax to enable/disable linker relaxation. 2. Define shouldForceRelocation to preserve relocation types even if the fixup can be resolved when linker relaxation is enabled. This is necessary for correctness as offsets may change during relaxation. Differential Revision: https://reviews.llvm.org/D46674 llvm-svn: 332318	2018-05-15 01:28:50 +00:00
Max Kazantsev	9b90373c8b	[NFC] Add const to method signature llvm-svn: 332317	2018-05-15 01:21:56 +00:00
Vedant Kumar	595ba1d548	[Debugify] Add -debugify-each for testing each pass in a pipeline This adds a -debugify-each mode to opt which, when enabled, wraps each {Module,Function}Pass in a pipeline with logic to add, check, and strip synthetic debug info for testing purposes. This mode can be used to test complex pipelines for debug info bugs, or to collect statistics about the number of debug values & locations lost throughout various stages of a pipeline. Patch by Son Tuan Vu! Differential Revision: https://reviews.llvm.org/D46525 llvm-svn: 332312	2018-05-15 00:29:27 +00:00
Keno Fischer	8248d7c661	[CommandLine] Error message for incorrect PositionalEatArgs usage Summary: bugpoint has several options specified as `PositionalEatArgs` to pass options through to the underlying tool, e.g. `-tool-args`. The `-help` message suggests the usage is: `-tool-args=<string>`. However, this is misleading, because that's not how these arguments work. Rather than taking a value, the option consumes all positional arguments until the next recognized option (or all arguments if `--` is specified at some point). To make this slightly clearer, instead print the help as: ``` -tool-args <string>... - <tool arguments>... ``` Additionally, add an error if the user attempts to use a `PositionalEatArgs` argument with a value, instead of silently ignoring it. Example: ``` ./bin/bugpoint -tool-args=-mpcu=skylake-avx512 bugpoint: for the -tool-args option: This argument does not take a value. Instead, it consumes any positional arguments until the next recognized option. ``` Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D46787 llvm-svn: 332311	2018-05-14 23:26:06 +00:00
Sam Clegg	5f87ab336e	[WebAssembly] Move toString helpers to BinaryFormat Subscribers: dschuff, mgorny, jgravelle-google, aheejin, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D46847 llvm-svn: 332305	2018-05-14 22:42:07 +00:00
Brian Gesiak	ed5b3255f0	Revert "[Option] Fix PR37006 prefix choice in findNearest" Summary: This revision causes build failures in PS4 and ppc64le buildbots (for example, http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/29988). I'll revert for now and try to diagnose the issue. Test Plan: check-llvm check-clang llvm-svn: 332304	2018-05-14 22:36:47 +00:00
Keno Fischer	de577af8c0	[InstCombine] fix crash due to ignored addrspacecast Summary: Part of the InstCombine code for simplifying GEPs looks through addrspacecasts. However, this was done by updating a variable also used by the next transformation, for marking GEPs as inbounds. This led to replacing a GEP with a similar instruction in a different addrspace, which caused an assertion failure in RAUW. This caused julia issue https://github.com/JuliaLang/julia/issues/27055 Patch by Jeff Bezanson <jeff@juliacomputing.com> Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D46722 llvm-svn: 332302	2018-05-14 22:05:01 +00:00
Brian Gesiak	eda977f414	[Option] Fix PR37006 prefix choice in findNearest Summary: In https://bugs.llvm.org/show_bug.cgi?id=37006 Nico Weber points out a flaw in `OptTable::findNearest`: if an option "foo"'s prefixes are "--" and "-", then the nearest option for "--fob" will be "-foo". This is incorrect, however, since the function is expected to return "--foo". The bug is due to a naive loop that attempts to predetermine which prefix is best. Instead, compute the edit distance for each prefix/name pair. Test Plan: `check-llvm` Reviewers: thakis Reviewed By: thakis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46776 llvm-svn: 332299	2018-05-14 21:35:00 +00:00
Martin Storsjo	ace7ae935f	[ARM] Back up R4 and LR if calling the stack probe function Differential Revision: https://reviews.llvm.org/D46777 llvm-svn: 332298	2018-05-14 21:32:52 +00:00
Brock Wyma	3db2b108c3	[CodeView] Improve debugging of virtual base class member variables Initial support for passing the virtual base pointer offset to CodeViewDebug. https://reviews.llvm.org/D46271 llvm-svn: 332296	2018-05-14 21:21:22 +00:00
Krzysztof Parzyszek	44e180ba89	[Hexagon] Add a target feature to control using small data section llvm-svn: 332292	2018-05-14 21:01:56 +00:00
Krzysztof Parzyszek	f66f7612bf	[Hexagon] Add a target feature for generating new-value stores llvm-svn: 332290	2018-05-14 20:41:04 +00:00
Paul Robinson	5f53f07b66	[DWARF] Factor out a DWARFUnitHeader class. NFC Extract information related to a "unit header" from DWARFUnit into a new DWARFUnitHeader class, and add a DWARFUnit member for the header. This is one step in the direction of allowing type units in the .debug_info section for DWARF v5. Differential Revision: https://reviews.llvm.org/D46707 llvm-svn: 332289	2018-05-14 20:32:31 +00:00
Krzysztof Parzyszek	771f2422d0	[Hexagon] Add a target feature for memop generation llvm-svn: 332285	2018-05-14 20:09:07 +00:00
Simon Pilgrim	215ce4a1ca	[X86] Add NT load/store scheduler classes llvm-svn: 332274	2018-05-14 18:37:19 +00:00
Craig Topper	53ceb4805f	[X86] Remove and autoupgrade avx512.vbroadcast.ss/avx512.vbroadcast.sd intrinsics. llvm-svn: 332271	2018-05-14 18:21:22 +00:00
Simon Pilgrim	228d24a2d6	[X86][BtVer2] Fix MMX/YMM integer vector nt store schedules MMX was missing and YMM was tagged as a fp nt store llvm-svn: 332269	2018-05-14 18:07:28 +00:00
Geoff Berry	64a2ea41ea	[BranchFolding] Allow hoisting to block with a single conditional branch. Summary: The BranchFolding pass is currently missing opportunities to hoist common code if the hoisted-to block contains a single conditional branch that has register uses. This occurs somewhat frequently on AArch64 with CBZ/TBZ opcodes. This change also eliminates some code differences when debug info is present since the presence of e.g. DBG_VALUE instructions in the hoisted-to block can enable hoisting that wouldn't have occurred without them. Reviewers: MatzeB, rnk, kparzysz, twoh, aprantl, javed.absar Subscribers: kristof.beyls, JDevlieghere, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46324 llvm-svn: 332265	2018-05-14 17:31:18 +00:00
Krzysztof Parzyszek	329c3e9a5f	[Hexagon] Avoid predicate copies to integer registers from store-locked llvm-svn: 332260	2018-05-14 16:41:40 +00:00
Simon Dardis	bb818b4421	[mips] Fix the predicates of round, ceiling, floor and trunc. Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46691 llvm-svn: 332258	2018-05-14 16:26:50 +00:00
Zaara Syeda	421a5960d2	[NFC] [Power] Fix instruction format for xsrqpi xsrqpi is currently using Z23Form_1. The instruction format is xsrqpi R,VRT,VRB,RMC. Rather than bits 11-15 being used for FRA, it should have bits 11-14 reserved and bit 15 for R. This patch adds a new class Z23Form_4 to fix the instruction format. Differential Revision: https://reviews.llvm.org/D46761 llvm-svn: 332253	2018-05-14 15:45:15 +00:00
Evandro Menezes	14fa2e4fa5	[AArch64] Improve single vector lane stores When storing the 0th lane of a vector, use a simpler and usually more efficient scalar store instead. Differential revision: https://reviews.llvm.org/D46655 llvm-svn: 332251	2018-05-14 15:26:35 +00:00
Pavel Labath	2a6afe5f87	[CodeGen/AccelTable]: Handle -dwarf-linkage-names=Abstract correctly Summary: If we are not emitting a linkage name in the .debug_info sections, we should not add it into the index either. This makes sure our index is consistent with the actual debug info. I am also explicitly setting the --dwarf-linkage-names=All in the name-collsions test as that one would now fail on targets where this defaults to "Abstract" (in fact, it would have failed already if there wasn't a bug in the DWARF verifier, which I fix as well). Reviewers: probinson, aprantl, JDevlieghere Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46748 llvm-svn: 332246	2018-05-14 14:13:20 +00:00
Sanjay Patel	bf55e6dee1	[AggressiveInstCombine] avoid crashing on unsimplified code (PR37446) This bug: https://bugs.llvm.org/show_bug.cgi?id=37446 ...raises another question: why do we run aggressive-instcombine before regular instcombine? llvm-svn: 332243	2018-05-14 13:43:32 +00:00
Nicola Zaghen	d34e60ca85	Rename DEBUG macro to LLVM_DEBUG. The DEBUG() macro is very generic so it might clash with other projects. The renaming was done as follows: - git grep -l 'DEBUG' | xargs sed -i 's/\bDEBUG\s\?(/LLVM_DEBUG(/g' - git diff -U0 master | ../clang/tools/clang-format/clang-format-diff.py -i -p1 -style LLVM - Manual change to APInt - Manually change DOCS as regex doesn't match it. In the transition period the DEBUG() macro is still present and aliased to the LLVM_DEBUG() one. Differential Revision: https://reviews.llvm.org/D43624 llvm-svn: 332240	2018-05-14 12:53:11 +00:00
Simon Pilgrim	e91a631df7	Fix "not all control paths return a value" MSVC warning. NFCI. llvm-svn: 332238	2018-05-14 12:20:19 +00:00
Sander de Smalen	93380371bb	[AArch64][SVE] Extend parsing of Prefetch operation for SVE. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D46681 llvm-svn: 332234	2018-05-14 11:54:41 +00:00
Simon Dardis	fba0362096	[mips] Correct the predicates of indexed floating point stores and loads. Also, fix the register class for microMIPS. Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46689 llvm-svn: 332227	2018-05-14 10:53:15 +00:00
Nicola Zaghen	617d4a8199	Test commit access. Remove trailing whitespace. llvm-svn: 332220	2018-05-14 08:24:29 +00:00
Robert Widmann	bce36770b7	[LLVM-C] Add Bindings For Module Flags Summary: The first foray into merging debug info into the echo tests. - Add bindings to Module::getModuleFlagsMetadata() in the form of LLVMCopyModuleFlagsMetadata - Add the opaque type LLVMModuleFlagEntry to represent Module::ModuleFlagEntry - Add accessors for LLVMModuleFlagEntry's behavior, key, and metadata node. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: aprantl, JDevlieghere, llvm-commits, harlanhaskins Differential Revision: https://reviews.llvm.org/D46792 llvm-svn: 332219	2018-05-14 08:09:00 +00:00
Bill Wendling	2a302210d0	Correct compatibility with the GNU Assembler's handling of comparison ops GAS returns -1 for a comparison operator if the result is true and 0 if false. https://www.sourceware.org/binutils/docs-2.12/as.info/Infix-Ops.html#Infix%20Ops llvm-svn: 332215	2018-05-14 05:25:36 +00:00
Craig Topper	266b7ae55d	[X86] Cleanup a multiclass that doesn't need as many parameters after recent intrinsic removals. llvm-svn: 332207	2018-05-14 00:17:52 +00:00
Craig Topper	0e71c6d5ca	[X86] Remove and autoupgrade the cvtusi2sd intrinsic. Use uitofp+insertelement instead. llvm-svn: 332206	2018-05-14 00:06:49 +00:00
Craig Topper	97e74b05ef	[X86] Add patterns for combining movss+uint_to_fp into the intrinsic instructions under AVX512. This matches what we do for sint_to_fp. llvm-svn: 332205	2018-05-13 23:24:21 +00:00
Craig Topper	911025b1cd	[X86] Extend instcombine folds for pclmuldq intrinsics to the 256 and 512 bit version. llvm-svn: 332202	2018-05-13 21:56:32 +00:00
Craig Topper	85906cf041	[X86] Remove and autoupgrade masked vpermd/vpermps intrinsics. llvm-svn: 332198	2018-05-13 18:03:59 +00:00
Matt Arsenault	432aaea63f	AMDGPU: Rename OpenCL lowering pass to be R600 specific. This pass is a) broken. b) r600 specific. Fixing (a) is a bit more non-trivial, but fixing (b) is easy. Move this pass to being R600 only for now. This pass does pass all the unit tests, however clang no longer generates code that looks like the unit test input, so fixing the pass requires fixing the tests and the pass as one, and checking it works with clang still. Patch by Dave Airlie llvm-svn: 332196	2018-05-13 10:04:48 +00:00
Matt Arsenault	dfb88dfe30	AMDGPU: Make undef legal for v2i16/v2f16 This is apparently necessary to stop undef from being turned into a build_vector of 0s. llvm-svn: 332195	2018-05-13 10:04:38 +00:00
Puyan Lotfi	71540f5197	Fixing build bot error: adding const qualifiers to std::sort lambda. Errors were not reproducible on clang-6.0 on ubuntu 16.04. llvm-svn: 332192	2018-05-13 06:50:55 +00:00
Puyan Lotfi	380a6f55ff	[NFC] MIR-Canon: switching to a stable string sorting of instructions. llvm-svn: 332191	2018-05-13 06:07:20 +00:00
Craig Topper	38b713d4a7	[X86] Add some load folding patterns for cvtsi2ss/sd into intrinsic instructions. llvm-svn: 332189	2018-05-13 01:54:33 +00:00
Craig Topper	df3a9cedff	[X86] Remove and autoupgrade legacy cvtss2sd intrinsics. llvm-svn: 332187	2018-05-13 00:29:40 +00:00
Craig Topper	38ad7ddabc	[X86] Remove and autoupgrade cvtsi2ss/cvtsi2sd intrinsics to match what clang has used for a very long time. llvm-svn: 332186	2018-05-12 23:14:39 +00:00
Chandler Carruth	095d69507e	[x86] Remove a comment obviated by r330269. Should have deleted the comment in the same revision but missed it. Thanks to Dimitry Andric for catching this! llvm-svn: 332177	2018-05-12 21:28:53 +00:00
Dimitry Andric	74f2989202	Clear converters map after X86 Domain Reassignment to avoid crashes Summary: As reported in PR37264, in some cases the X86 Domain Reassignment `runOnMachineFunction()` is called twice. Because it only deletes the `.second` members of its `InstrConverterBaseMap`, and does not clean up the map itself, this can lead to double frees and crashes. Use `DeleteContainerSeconds()` instead, so the `Converters` map can safely be reinitialized and its members re-deleted for each X86 Domain Reassignment pass. Reviewers: guyblank, craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46425 llvm-svn: 332176	2018-05-12 19:59:54 +00:00
JF Bastien	6a68f38c0c	[NFC] Remove inaccurate comment Summary: r271558 moved getManagedStaticMutex's mutex from a function-local static to using call_once, but left a comment added in r211424. That comment is now erroneous, remove it. Reviewers: zturner, chandlerc Subscribers: aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D46784 llvm-svn: 332175	2018-05-12 19:39:34 +00:00
Simon Pilgrim	6e160c1813	[X86] Add WriteFCMOV scheduler class for x87 CMOVs llvm-svn: 332173	2018-05-12 18:07:07 +00:00
Simon Dardis	65cc0cb31f	[mips] Initialize the long branch pass for testing purposes llvm-svn: 332172	2018-05-12 16:57:26 +00:00
Craig Topper	a288f241cd	[X86] Remove some unused masked conversion intrinsics that can be replaced with an older intrinsic and a select. This is what clang already uses. llvm-svn: 332170	2018-05-12 02:34:28 +00:00
Michael Zolotukhin	a41660df7e	Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." Stage3/stage4 bootstrap miscompares should be fixed by a non-determinism fix in IDF (r332167). This reverts commit r330446. llvm-svn: 332168	2018-05-12 01:52:36 +00:00
Michael Zolotukhin	046da97806	[IDF] Enforce the returned blocks to be sorted. Summary: Currently the order of blocks returned by `IDF::calculate` can be non-deterministic. This was discovered in several attempts to enable SSAUpdaterBulk for JumpThreading (which led to miscompare in bootstrap between stage 3 and stage4). Originally, the blocks were put into a priority queue with a depth level as their key, and this patch adds a DFSIn number as a second key to specify a deterministic order across blocks from one level. The solution was suggested by Daniel Berlin. Reviewers: dberlin, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46646 llvm-svn: 332167	2018-05-12 01:44:32 +00:00
Stanislav Mekhanoshin	7012c246c1	[AMDGPU] Fix amdgpu-waves-per-eu accounting in scheduler We cannot query this attribute from a subtarget given a machine function. At this point attribute itself is already unavailable and can only be obtained through MFI. Differential Revision: https://reviews.llvm.org/D46781 llvm-svn: 332166	2018-05-12 01:41:56 +00:00
Sanjay Patel	fe645d295f	[DAG] add convenience function to propagate FMF; NFC There's only one use of this currently, but that could change with D46563. Either way, we shouldn't have to update code outside of the flags struct when those flag definitions change. llvm-svn: 332155	2018-05-11 23:13:36 +00:00
Tom Stellard	655fdd3f82	AMDGPU/GlobalISel: Implement select() for >32-bit G_STORE Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D46153 llvm-svn: 332154	2018-05-11 23:12:49 +00:00
Sergey Dmitriev	69c9cd277d	[CodeExtractor] Allow extracting blocks with exception handling This is a CodeExtractor improvement which adds support for extracting blocks which have exception handling constructs if that is legal to do. CodeExtractor performs validation checks to ensure that extraction is legal when it finds invoke instructions or EH pads (landingpad, catchswitch, or cleanuppad) in blocks to be extracted. I have also added an option to allow extraction of blocks with alloca instructions, but no validation is done for allocas. CodeExtractor caller has to validate it himself before allowing alloca instructions to be extracted. By default allocas are still not allowed in extraction blocks. Differential Revision: https://reviews.llvm.org/D45904 llvm-svn: 332151	2018-05-11 22:49:49 +00:00
Sanjay Patel	c4e4c5b076	[DAG] clean up flag propagation for binops; NFCI llvm-svn: 332150	2018-05-11 22:45:22 +00:00
Changpeng Fang	f094885a9e	AMDGPU/SI: Don't promote alloca to vector for AddrSpaceCast instruction. Summary: We have no logic to promote alloca to vector for an AddrSpaceCast instruction. Reviewer: arsenm Differential Revision: https://reviews.llvm.org/D45993 llvm-svn: 332147	2018-05-11 22:17:57 +00:00
Craig Topper	a17d627abb	[X86] Remove and autoupgrade a bunch of FMA intrinsics that are no longer used by clang. llvm-svn: 332146	2018-05-11 21:59:34 +00:00
Artem Belevich	c2cd5d5ce0	[Split GEP] handle trunc() in separate-const-offset-from-gep pass. Let separate-const-offset-from-gep pass handle trunc() when it calculates constant offset relative to base. The pass itself may insert trunc() instructions when it canonicalises array indices to pointer-size integers and needs to handle trunc() in order to evaluate the offset. Differential Revision: https://reviews.llvm.org/D46732 llvm-svn: 332142	2018-05-11 21:13:19 +00:00
Yaxun Liu	deba150c27	[AMDGPU] Fix compilation failure when IR contains comdat Remove a useless SwitchSection which also causes compilation failure when IR contains comdat. The SwitchSection is useless because the current section is already the correct text section for the function, so there is no need to switch. It causes compilation failure for comdat because functions with comdat have a specific text section, not the default .text section. Since HIP uses comdat, this bug caused failures for HIP. Differential Revision: https://reviews.llvm.org/D46770 llvm-svn: 332137	2018-05-11 20:40:14 +00:00
Sanjay Patel	0ddf09a36c	[DAG] reduce code duplication; NFCI llvm-svn: 332133	2018-05-11 20:08:23 +00:00
Daniel Neilson	f6651d4d94	[InstCombine] Handle atomic memset in the same way as regular memset Summary: This change adds handling of the atomic memset intrinsic to the code path that simplifies the regular memset. In practice this means that we will now also expand a small constant-length atomic memset into a single unordered atomic store. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46660 llvm-svn: 332132	2018-05-11 20:04:50 +00:00
Vedant Kumar	99d5c072f0	[DAGCombiner] Set the right SDLoc on extended SETCC uses (7/N) ExtendSetCCUses updates SETCC nodes which use a load (OriginalLoad) to reflect a simplification to the load (ExtLoad). Based on my reading, ExtendSetCCUses may create new nodes to extend a constant attached to a SETCC. It also creates fresh SETCC nodes which refer to any updated operands. ISTM that the location applied to the new constant and SETCC nodes should be the same as the location of the ExtLoad. This was suggested by Adrian in https://reviews.llvm.org/D45995. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46216 llvm-svn: 332119	2018-05-11 18:40:10 +00:00
Vedant Kumar	fd340a4047	[DAGCombiner] Set the right SDLoc on a newly-created sextload (6/N) This teaches tryToFoldExtOfLoad to set the right location on a newly-created extload. With that in place, the logic for performing a certain ([s|z]ext (load ...)) combine becomes identical for sexts and zexts, and we can get rid of one copy of the logic. The test case churn is due to dependencies on IROrders inherited from the wrong SDLoc. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46158 llvm-svn: 332118	2018-05-11 18:40:08 +00:00
Vedant Kumar	f0e5f7c45e	[DAGCombiner] Factor out duplicated logic for an extload combine, NFC (5/N) Part of the logic for combining (zext (load ...)) and (sext (load ...)) is duplicated. This creates problems because bugs in one version have to be fixed again in the other version. To address this, as a first step, I've extracted the duplicate logic into a helper. I'll fix the debug location bug in the helper and eliminate the copy of its logic in a followup. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46157 llvm-svn: 332117	2018-05-11 18:40:02 +00:00
David Bolvansky	cd93c4ef1a	[InstCombine] snprintf optimizations Reviewers: spatel, efriedma, majnemer, rja, bkramer Reviewed By: rja, bkramer Subscribers: mstorsjo, rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 332110	2018-05-11 17:50:49 +00:00
Simon Pilgrim	661ae7778d	[X86][BtVer2] Model ymm move as double pumped instructions We still need to handle mmx/xmm moves as 'decode-only' no-pipe instructions llvm-svn: 332109	2018-05-11 17:38:36 +00:00
Alex Bradbury	bca0c3cdb6	[RISCV] Support .option rvc and norvc assembler directives These directives allow the 'C' (compressed) extension to be enabled/disabled within a single file. Differential Revision: https://reviews.llvm.org/D45864 Patch by Kito Cheng llvm-svn: 332107	2018-05-11 17:30:28 +00:00
Martin Storsjo	0d7c37756b	[Analysis] Validate the return type of s(n)printf like libcalls If the sprintf function is static (as on mingw-w64, where many stdio functions are static inline wrappers), earlier optimization passes could optimize out the return value altogether, and make it void, which could break optimizations of this libcall that touch the return value. This fixes the issue discussed in PR37408 for the sprintf function. Differential Revision: https://reviews.llvm.org/D46752 llvm-svn: 332106	2018-05-11 16:53:56 +00:00
Simon Pilgrim	706403bab8	[X86][MMX] Tag MMX Move/Load/Store as WriteVec schedule classes Fixes an issue on SLM/Btver2 where instructions were being treated as scalar loads/stores llvm-svn: 332104	2018-05-11 16:38:59 +00:00
Geoff Berry	60460268c0	[AArch64] Fix performPostLD1Combine to check for constant lane index. Summary: performPostLD1Combine in AArch64ISelLowering looks for vector insert_vector_elt of a loaded value which it can optimize into a single LD1LANE instruction. The code checking for the pattern was not checking if the lane index was a constant which could cause two problems: - an assert when lowering the LD1LANE ISD node since it assumes a constant operand - an assert in isel if the lane index value depends on the post-incremented base register Both of these issues are avoided by simply checking that the lane index is a constant. Fixes bug 35822. Reviewers: t.p.northover, javed.absar Subscribers: rengolin, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46591 llvm-svn: 332103	2018-05-11 16:25:06 +00:00
Simon Dardis	ae2010813f	[mips] Rename Filler to MipsDelaySlotFiller and initialize the pass llvm-svn: 332102	2018-05-11 16:13:53 +00:00
Sanjoy Das	82105e2a7d	Use iteration instead of recursion in CFIInserter Summary: This recursive step can overflow the stack. Reviewers: djokov, petarj Subscribers: mcrosier, jlebar, bixia, llvm-commits Differential Revision: https://reviews.llvm.org/D46671 llvm-svn: 332101	2018-05-11 15:54:46 +00:00
Davide Italiano	6e1f7bf316	[Reassociate] Prevent infinite loops when processing PHIs. Phi nodes can reside in live blocks but one of their incoming arguments can come from a dead block. Dead blocks and reassociate don't play nice together. In fact, reassociate performs an RPO as a first step to avoid processing dead blocks. The reason why Reassociate might not fixpoint when examining dead blocks is that the following: %xor0 = xor i16 %xor1, undef %xor1 = xor i16 %xor0, undef is perfectly valid LLVM IR (if it appears in a dead block), so the worklist algorithm keeps pushing the two instructions for reexamination. Note that this is not Reassociate's fault, at least not entirely. It's llvm that has a weird definition of dominance. Fixes PR37390. llvm-svn: 332100	2018-05-11 15:45:36 +00:00
Nico Weber	0f2a48c1ae	Remove unused SyncExecutor and make it clearer that the whole file is only used if LLVM_ENABLE_THREADS llvm-svn: 332098	2018-05-11 15:25:38 +00:00
Simon Dardis	d4169ad7c1	[mips] Enable disassembly of fused (negative) multiply add/sub instructions Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46392 llvm-svn: 332097	2018-05-11 15:21:40 +00:00
Simon Pilgrim	032a01f74a	[X86][SLM] Vector stores only use the MEC port. Confirmed by both Agner and Intel's AOM - the IEC/FPC are not required for pure load/stores (even if it's a partial update). Can't fix WriteStore until all RMW instructions are cleaned up though. llvm-svn: 332096	2018-05-11 15:16:15 +00:00
Simon Pilgrim	22dd72b995	[X86] Split WriteF/WriteVec Move/Load/Store scheduler classes by vector width Fixes a SNB issue that was missing vlddqu/vmovntdqa ymm instructions llvm-svn: 332094	2018-05-11 14:30:54 +00:00
Daniel Neilson	8f30ec65b0	[InstCombine] Unify handling of atomic memtransfer with non-atomic memtransfer Summary: This change reworks the handling of atomic memcpy within the instcombine pass. Previously, a constant length atomic memcpy would be lowered into loads & stores as long as no more than 16 load/store pairs are created. This is quite different from the lowering done for a non-atomic memcpy; which only ever lowers into a single load/store pair of no more than 8 bytes. Larger constant-sized memcpy calls are expanded to load/stores in later passes, such as SelectionDAG lowering. In this change the behaviour for atomic memcpy is unified with non-atomic memcpy; atomic memcpy is now treated in the same way as non-atomic memcpy has always been. We leave it to later passes to lower longer-length atomic memcpy calls. Due to the structure of the pass's handling of memtransfer intrinsics, this change also gives us handling of atomic memmove that we did not previously have. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46658 llvm-svn: 332093	2018-05-11 14:30:02 +00:00
Simon Pilgrim	ead11e4d4b	[X86] Added scheduler helper classes to split move/load/store by size Nothing uses this yet but this will allow us to specialize MMX/XMM/YMM/ZMM vector moves. llvm-svn: 332090	2018-05-11 12:46:54 +00:00
Sven van Haastregt	5ad5c3c7ed	[APFloat] Set losesInfo on no-op convert losesInfo would be left unset when no conversion needs to be done. A caller such as InstCombine's fitsInFPType would then branch on an uninitialized value. Caught using valgrind on an out-of-tree target. Differential Revision: https://reviews.llvm.org/D46645 llvm-svn: 332087	2018-05-11 09:45:42 +00:00
Tom Stellard	dcc95e9385	AMDGPU/GlobalISel: Implement select() for 32-bit G_FPTOUI Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45883 llvm-svn: 332082	2018-05-11 05:44:16 +00:00
Craig Topper	9968af4a2a	[X86] Remove and autoupgrade the avx512.mask.store.ss intrinsic. llvm-svn: 332079	2018-05-11 04:33:18 +00:00
Brian Gesiak	c651113439	[Coroutines] PR34897: Fix incorrect elisions Summary: https://bugs.llvm.org/show_bug.cgi?id=34897 demonstrates an incorrect coroutine frame allocation elision in the coro-elide pass. The elision is performed on the basis that the SSA variables from all llvm.coro.begin are directly referenced in subsequent llvm.coro.destroy instructions. However, this ignores the fact that the function may exit through paths that do not run these destroy instructions. In the sample program from PR34897, for example, the llvm.coro.destroy instruction is only executed in exception handling code. When the coroutine function exits normally, llvm.coro.destroy is not called. Eliding the allocation in this case causes a subsequent reference to the coroutine handle from outside of the function to access freed memory. To fix the issue, when finding an llvm.coro.destroy for each llvm.coro.begin, only consider llvm.coro.destroy that are executed along non-exceptional paths. Test Plan: 1. Download the sample program from https://bugs.llvm.org/show_bug.cgi?id=34897, compile it with `clang++ -fcoroutines-ts -stdlib=libc++ -std=c++1z -O2`, and run it. It should print `"run1\ncheck1\nrun2\ncheck2"` and then exit successfully. 2. Compile https://godbolt.org/g/mCKfnr and confirm it is still optimized to a single instruction, 'return 1190'. 3. `check-llvm` Reviewers: rsmith, GorNishanov, eric_niebler Reviewed By: GorNishanov Subscribers: andrewrk, lewissbaker, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D43242 llvm-svn: 332077	2018-05-11 03:12:28 +00:00
Brian Gesiak	82de4e6b93	[Support] Add docs for 'openFileFor{Write,Read}' Summary: Add documentation for the LLVM Support functions `openFileForWrite` and `openFileForRead`. The `openFileForRead` parameter `RealPath`, in particular, I think warranted some explanation. In addition, make the behavior of the functions more consistent across platforms. Prior to this patch, Windows would set or not set the result file descriptor based on the nature of the error, whereas Unix would consistently set it to `-1` if the open failed. Make Windows consistently set it to `-1` as well. Test Plan: 1. `ninja check-llvm` 2. `ninja docs-llvm-html` Reviewers: zturner, rnk, danielmartin, scanon Reviewed By: danielmartin, scanon Subscribers: scanon, danielmartin, llvm-commits Differential Revision: https://reviews.llvm.org/D46499 llvm-svn: 332075	2018-05-11 01:47:27 +00:00
Kostya Serebryany	a2759327fd	[sanitizer-coverage] don't instrument a function if its entry block ends with 'unreachable' llvm-svn: 332072	2018-05-11 01:09:39 +00:00
Kamil Rytarowski	02c432a72b	Register NetBSD/i386 in AddressSanitizer.cpp Summary: Ship kNetBSD_ShadowOffset32 set to 1ULL << 30. This is prepared for the amd64 kernel runtime. Sponsored by <The NetBSD Foundation> Reviewers: vitalybuka, joerg, kcc Reviewed By: vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46724 llvm-svn: 332069	2018-05-11 00:58:01 +00:00
Wei Mi	0c2f6be662	[SampleFDO] Don't treat warm callsite with inline instance in the profile as cold We found current sampleFDO had a performance issue when triaging a regression. For a callsite with inline instance in the profile, even if hot callsite inliner cannot inline it, it may still execute enough times and should not be treated as cold in regular inliner later. However, currently if such callsite is not inlined by hot callsite inliner, and the BB where the callsite locates doesn't get samples from other instructions inside of it, the callsite will have no profile metadata annotated. In regular inliner cost analysis, if the callsite has no profile annotated and its caller has profile information, it will be treated as cold. The fix changes the isCallsiteHot check and chooses to compare CallsiteTotalSamples with hot cutoff value computed by ProfileSummaryInfo. Differential Revision: https://reviews.llvm.org/D45377 llvm-svn: 332058	2018-05-10 23:02:27 +00:00
Vedant Kumar	e0b5f86b30	[STLExtras] Add distance() for ranges, pred_size(), and succ_size() This commit adds a wrapper for std::distance() which works with ranges. As it would be a common case to write `distance(predecessors(BB))`, this also introduces `pred_size()` and `succ_size()` helpers to make that easier to write. Differential Revision: https://reviews.llvm.org/D46668 llvm-svn: 332057	2018-05-10 23:01:54 +00:00
Craig Topper	ea78a261de	[InstCombine] Replace an 'if' that should always be true with an assert. The bitwidth of the operation should always be wider than the result width of the truncate since we don't recurse through any width changing operations. llvm-svn: 332055	2018-05-10 22:45:28 +00:00
Sam Clegg	16c16827bc	[WebAssembly] Initial Disassembler. This implements a new table-gen emitter to create tables for a wasm disassembler, and a disassembler to use them. Comes with 2 tests that test a few instructions manually. Is also able to disassemble large .wasm files with objdump reasonably. Not working so well, to be addressed in followups: - objdump appears to be passing an incorrect starting point. - since the disassembler works an instruction at a time, and it is disassembling stack instructions, it has no idea of pseudo register assignments. These registers are required for the instruction printing code that follows. For now, all such registers appear in the output as $0. Patch by Wouter van Oortmerssen Differential Revision: https://reviews.llvm.org/D45848 llvm-svn: 332052	2018-05-10 22:16:44 +00:00
Craig Topper	1ee19ae126	[X86] Add new patterns for masked scalar load/store to match clang's codegen from r331958. Clang's codegen now uses 128-bit masked load/store intrinsics in IR. The backend will widen to 512-bits on AVX512F targets. So this patch adds patterns to detect codegen's widening and patterns for AVX512VL that don't get widened. We may be able to drop some of the old patterns, but I leave that for a future patch. llvm-svn: 332049	2018-05-10 21:49:16 +00:00
Martin Storsjo	86e6742c17	Revert "[InstCombine] snprintf optimizations" This reverts commit SVN r331889, which could trigger failed assertions for cases where the snprintf function is declared with a vaguely differing signature (e.g. being defined as static inline), see PR37408. llvm-svn: 332043	2018-05-10 21:23:36 +00:00
Tom Stellard	1e0edad4bb	AMDGPU/GlobalISel: Implement select() for G_BITCAST s32 <--> <2 x s16> Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45881 llvm-svn: 332042	2018-05-10 21:20:10 +00:00
Robert Widmann	4b0084bfcf	[LLVM-C] Consolidate llgo's DIBuilder Bindings Summary: Move and correct LLVMDIBuilderCreateTypedef. This is the last API in DIBuilderBindings.h, so it is being removed and the C API will now be re-exported from IRBindings.h. Reviewers: whitequark, harlanhaskins, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46725 llvm-svn: 332041	2018-05-10 21:10:06 +00:00
Tom Stellard	1dc90204bf	AMDGPU/GlobalISel: Enable TableGen'd instruction selector Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, mgorny, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45994 llvm-svn: 332039	2018-05-10 20:53:06 +00:00
Sanjay Patel	c7bb14301a	[InstCombine] add folds for minnum(-a, -b) --> -maxnum(a, b) This is similar to what we do for integer min/max with 'not' ops (rL321882). This should fix: https://bugs.llvm.org/show_bug.cgi?id=37404 https://bugs.llvm.org/show_bug.cgi?id=37405 llvm-svn: 332031	2018-05-10 20:03:13 +00:00
Wolfgang Pieb	f2b6915ed4	[DWARF] Fixing a bug in DWARF v5 string offsets tables where the length encoded the contribution length excluding the table header. Instead it must encode the contribution length minus the length field itself. Reviewer: JDevliegehere Differential Revision: https://reviews.llvm.org/D45922 llvm-svn: 332030	2018-05-10 20:02:34 +00:00
Omer Paparo Bivas	fbb83deef7	[InstCombine] Moving overflow computation logic from InstCombine to ValueTracking; NFC Differential Revision: https://reviews.llvm.org/D46704 Change-Id: Ifabcbe431a2169743b3cc310f2a34fd706f13f02 llvm-svn: 332026	2018-05-10 19:46:19 +00:00
Gabor Buella	a3b581906f	[X86] Initialize HasPTWRITE member of X86Subtarget This was missing from r331961. Caught by sanitizer bots. llvm-svn: 332024	2018-05-10 19:15:10 +00:00
Simon Pilgrim	a3686c9a28	[X86] Convert/Merge more instregex patterns to reduce InstrRW compile time. Use instrs lists or merge multiple instregex patterns. llvm-svn: 332022	2018-05-10 19:08:06 +00:00
Haicheng Wu	0aae2bc260	[CGP] Split large data structures to sink more GEPs Accessing the members of a large data structure needs a lot of GEPs which usually have large offsets due to the size of the underlying data structure. If the offsets are too large to fit into the r+i addressing mode, these GEPs cannot be sunk to their users' blocks and many extra registers are needed then to carry the values of these GEPs. This patch tries to split a large data struct starting from %base like the following. Before: BB0: %base = BB1: %gep0 = gep %base, off0 %gep1 = gep %base, off1 %gep2 = gep %base, off2 BB2: %load1 = load %gep0 %load2 = load %gep1 %load3 = load %gep2 After: BB0: %base = %new_base = gep %base, off0 BB1: %new_gep0 = %new_base %new_gep1 = gep %new_base, off1 - off0 %new_gep2 = gep %new_base, off2 - off0 BB2: %load1 = load i32, i32* %new_gep0 %load2 = load i32, i32* %new_gep1 %load3 = load i32, i32* %new_gep2 In the above example, the struct is split into two parts. The first part still starts from %base and the second part starts from %new_base. After the splitting, %new_gep1 and %new_gep2 have smaller offsets and then can be sunk to BB2 and folded into their users. The algorithm to split data structure is simple and very similar to the work of merging SExts. First, it collects GEPs that have large offsets when iterating the blocks. Second, it splits the underlying data structures and updates the collected GEPs to use smaller offsets. Differential Revision: https://reviews.llvm.org/D42759 llvm-svn: 332015	2018-05-10 18:27:36 +00:00
Robert Widmann	260b581498	[LLVM-C] Add Accessors for Common DIType and DILocation Properties Summary: - Adds getters for the line, column, and scope of a DILocation - Adds getters for the name, size in bits, offset in bits, alignment in bits, line, and flags of a DIType Reviewers: whitequark, harlanhaskins, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46627 llvm-svn: 332014	2018-05-10 18:23:55 +00:00
Robert Widmann	a428eba85e	[LLVM-C] Move DIBuilder Bindings For Temporary MDNodes Summary: Move LLVMTemporaryMDNode and LLVMMetadataReplaceAllUsesWith to the C bindings and add LLVMDeleteTemporaryMDNode for deleting non-RAUW'ed temporary nodes. Reviewers: whitequark, harlanhaskins, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46632 llvm-svn: 332010	2018-05-10 18:09:53 +00:00
Simon Pilgrim	b7f274ef16	[X86][Znver1] Remove unnecessary SchedWritePMULLD InstRW overrides. llvm-svn: 332006	2018-05-10 17:42:26 +00:00
Sam Clegg	b210c64b28	[WebAssembly] Create section start symbols automatically for all sections These symbols only get included in the output symbols table if they are used in a relocation. This behaviour matches more closely the ELF object writer. Differential Revision: https://reviews.llvm.org/D46561 llvm-svn: 332005	2018-05-10 17:38:35 +00:00
Chandler Carruth	baf045fb28	[PM/LoopUnswitch] Avoid pointlessly creating an exit block set. This code can just test whether blocks are in the loop, which we already have a dedicated set tracking in the loop itself. llvm-svn: 332004	2018-05-10 17:33:20 +00:00
Simon Pilgrim	37fbb7f173	[X86][SNB] Fix typo in PEXTRDmr instregex, was missing VPEXTRDmr. llvm-svn: 332002	2018-05-10 17:30:49 +00:00
Simon Pilgrim	38ac0e9c6b	[X86] Split WriteVecALU/WriteVecLogic/WriteShuffle/WriteVarShuffle/WritePSADBW/WritePHAdd scheduler classes Split off XMM classes from the default (MMX) classes. llvm-svn: 331999	2018-05-10 17:06:09 +00:00
Simon Atanasyan	05db279f52	[mips] Accept 32-bit offsets for ld/sd/lld commands This is a follow up to the rL330983. The patch teaches ld, sd, and lld commands to accept 32-bit memory offsets by replacing the `mem_simm16` operand with `mem_simmptr`. In fact, these commands should accept 64-bit offsets, but such large offsets require a different command expansion and will be supported by a separate patch. Differential Revision: https://reviews.llvm.org/D46629 llvm-svn: 331997	2018-05-10 16:01:36 +00:00
Simon Atanasyan	ac5f4e0546	[mips] Accept 32-bit offsets for lh and lhu commands This is a follow up to the rL330983. The patch teaches lh and lhu commands to accept 32-bit memory offsets by replacing the `mem_simm16` operand with `mem_simmptr`. Differential Revision: https://reviews.llvm.org/D46513 llvm-svn: 331996	2018-05-10 16:01:18 +00:00
Sanjay Patel	b4e7893ba8	[x86] fix fmaxnum/fminnum with nnan With nnan, there's no need for the masked merge / blend sequence (that probably costs much more than the min/max instruction). Somewhere between clang 5.0 and 6.0, we started producing these intrinsics for fmax()/fmin() in C source instead of libcalls or fcmp/select. The backend wasn't prepared for that, so we regressed perf in those cases. Note: it's possible that other targets have similar problems as seen here. Noticed while investigating PR37403 and related bugs: https://bugs.llvm.org/show_bug.cgi?id=37403 The IR FMF propagation cases still don't work. There's a proposal that might fix those cases in D46563. llvm-svn: 331992	2018-05-10 15:40:49 +00:00
Daniel Neilson	71fa1b904a	[DSE] Teach the pass about partial overwrite of atomic memory intrinsics Summary: This change teaches DSE that the atomic memory intrinsics can be overwritten partially in the same way as the non-atomic forms. Specifically, that the atomic memcpy & memset can be shortened at the end and that the atomic memset can be shortened at the beginning, if they are partially overwritten by later stores. Reviewers: mkazantsev, skatkov, apilipenko, efriedma, rsmith, spatel, filcab, sanjoy Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45584 llvm-svn: 331991	2018-05-10 15:12:49 +00:00
whitequark	68403564df	[PR37339] Fix assertion in FunctionComparator::cmpInlineAsm Fixes bug https://bugs.llvm.org/show_bug.cgi?id=37339. InlineAsm is only uniqued if the FunctionTypes are exactly the same, while cmpTypes() for example considers all pointer types in the default address space to be the same. For this reason the end of cmpInlineAsm() can be reached. This patch replaces the unreachable assertion with a check that the function types are not identical. Differential Revision: https://reviews.llvm.org/D46495 Reviewers: jfb llvm-svn: 331990	2018-05-10 15:05:47 +00:00
Nirav Dave	a5ad417589	[DAG] Avoid using deleted node in rebuildSetCC Summary: The combine in rebuildSetCC may be combined to another node leaving our references stale. Keep a handle on it to avoid stale references. Fixes PR36602. Reviewers: dbabokin, RKSimon, eli.friedman, davide Subscribers: hiraditya, uabelho, JesperAntonsson, qcolombet, llvm-commits Differential Revision: https://reviews.llvm.org/D46404 llvm-svn: 331985	2018-05-10 14:28:54 +00:00
James Henderson	11a9de74c9	Fix signed/unsigned comparison warning and print format The print format was causing at least 2 unit-test failures from r331971. The signed/unsigned comparison warnings only appeared to affect two lines but it was unclear whether it might just pop up on other lines, so I have been explicit in all the literals in the tests. There were other bot unit-test failures that I am still investigating. llvm-svn: 331978	2018-05-10 12:15:43 +00:00
David Bolvansky	10c218d4af	[CFLGraph] Fixed Select instruction handling Summary: Operand 0 is the condition, not the true value. Use op 1 and op 2 as the correct values. Reviewers: george.burgess.iv, nlopes, efriedma Reviewed By: george.burgess.iv Subscribers: craig.topper, rjmccall, lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D46343 llvm-svn: 331976	2018-05-10 11:47:36 +00:00
Benjamin Kramer	456f473ea8	[InstCombine] Only propagate known leading zeros from udiv input to output. Put in a conservatively correct estimate for now. Avoids miscompiling clang in FDO mode. This is really tricky to trigger in reality as basically all interesting cases will be folded away by computeKnownBits earlier, I was unable to find a reasonably small test case. llvm-svn: 331975	2018-05-10 11:45:18 +00:00
James Henderson	a3acf99e59	[DWARF] Rework debug line parsing to use llvm::Error and callbacks Reviewed by: dblaikie, JDevlieghere, espindola Differential Revision: https://reviews.llvm.org/D44560 Summary: The .debug_line parser previously reported errors by printing to stderr and returning false. This is not particularly helpful for clients of the library code, as it prevents them from handling the errors in a manner based on the calling context. This change switches to using llvm::Error and callbacks to indicate what problems were detected during parsing, and has updated clients to handle the errors in a location-specific manner. In general, this means that they continue to do the same thing to external users. Below, I have outlined what the known behaviour changes are, relating to this change. There are two levels of "errors" in the new error mechanism, to broadly distinguish between different fail states of the parser, since not every failure will prevent parsing of the unit, or of subsequent units. Malformed table errors that prevent reading the remainder of the table (reported by returning them) and other minor issues representing problems with parsing that do not prevent attempting to continue reading the table (reported by calling a specified callback function). The only example of this currently is when the last sequence of a unit is unterminated. However, I think it would be good to change the handling of unrecognised opcodes to report as minor issues as well, rather than just printing to the stream if --verbose is used (this would be a subsequent change however). I have substantially extended the DwarfGenerator to be able to handle custom-crafted .debug_line sections, allowing for comprehensive unit-testing of the parser code. For now, I am just adding unit tests to cover the basic error reporting, and positive cases, and do not currently intend to test every part of the parser, although the framework should be sufficient to do so at a later point.
Known behaviour changes: - The dump function in DWARFContext now does not attempt to read subsequent tables when searching for a specific offset, if the unit length field of a table before the specified offset is a reserved value. - getOrParseLineTable now returns a useful Error if an invalid offset is encountered, rather than simply a nullptr. - The parse functions no longer use `WithColor::warning` directly to report errors, allowing LLD to call its own warning function. - The existing parse error messages have been updated to not specifically include "warning" in their message, allowing consumers to determine what severity the problem is. - If the line table version field appears to have a value less than 2, an informative error is returned, instead of just false. - If the line table unit length field uses a reserved value, an informative error is returned, instead of just false. - Dumping of .debug_line.dwo sections is now implemented the same as regular .debug_line sections. - Verbose dumping of .debug_line[.dwo] sections now prints the prologue, if there is a prologue error, just like non-verbose dumping. As a helper for the generator code, I have re-added emitInt64 to the AsmPrinter code. This previously existed, but was removed way back in r100296, presumably because it was dead at the time. This change also requires a change to LLD, which will be committed separately. llvm-svn: 331971	2018-05-10 10:51:33 +00:00
Simon Dardis	ce6ada4151	[mips] Correct the predicates of cvt.fmt.fmt instructions Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46390 llvm-svn: 331969	2018-05-10 10:42:30 +00:00
Gabor Buella	a832b22bae	[X86] ptwrite intrinsic Reviewers: craig.topper, RKSimon Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D46539 llvm-svn: 331961	2018-05-10 07:26:05 +00:00
Serguei Katkov	7d02f059e7	[SCEV] Do not use induction in isKnownPredicate for simplification umax. During simplification of umax we trigger isKnownPredicate twice. As a first attempt it tries the induction. To do that it tries to get post increment of SCEV. Re-writing the SCEV may result in simplification of umax. If the SCEV contains a lot of umax operations this recursion becomes very slow. The added test demonstrates the slow behavior. To resolve this we use only simple ways to check whether the predicate is known. Reviewers: sanjoy, mkazantsev Reviewed By: sanjoy Subscribers: lebedev.ri, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D46046 llvm-svn: 331949	2018-05-10 01:40:43 +00:00
Craig Topper	553d451e95	[InstCombine] Reorder an if condition to put a cheap check in front of a computeKnownBits call. NFC llvm-svn: 331948	2018-05-10 00:53:25 +00:00
Craig Topper	333efc951a	[InstCombine] Use APInt::getBitsSetFrom to shorten a line and fix an 80 columns violation. NFC Fix a similar line in the same function. llvm-svn: 331947	2018-05-10 00:53:22 +00:00
Philip Reames	913a779df2	[InstCombine] fix a signedness warning which broke -Werror builds llvm-svn: 331944	2018-05-10 00:05:29 +00:00
Artem Belevich	2f348ea1c7	[NVPTX] Added a feature to use short pointers for const/local/shared AS. Const/local/shared address spaces are all < 4GB and we can always use 32-bit pointers to access them. This has substantial performance impact on kernels that use shared memory for intermediary results. The feature is disabled by default. Differential Revision: https://reviews.llvm.org/D46147 llvm-svn: 331941	2018-05-09 23:46:19 +00:00
Sanjay Patel	ac3951a735	[AggressiveInstCombine] convert a chain of 'and-shift' bits into masked compare This is a follow-up to D45986. As suggested there, we should match the "all-bits-set" pattern in addition to "any-bits-set". This was a little more complicated than I thought it would be initially because the "and 1" instruction can be anywhere in the chain. Hopefully, the code comments make that logic understandable, but if you see a way to simplify or improve that, it's most appreciated. This transforms patterns that emerge from bitfield tests as seen in PR37098: https://bugs.llvm.org/show_bug.cgi?id=37098 I think it would also help reduce the large test from: D46336 D46595 but we need something to reassociate that case to the forms we're expecting here first. Differential Revision: https://reviews.llvm.org/D46649 llvm-svn: 331937	2018-05-09 23:08:15 +00:00
Philip Reames	79e917d117	[InstCombine] Widen guards with conditions between The previous handling for guard widening in InstCombine was extremely restrictive. In particular, it didn't handle the common case where we had two guards separated by a single icmp. Handle this by scanning through a small fixed window of instructions to find the next guard if needed. Differential Revision: https://reviews.llvm.org/D46203 llvm-svn: 331935	2018-05-09 22:56:32 +00:00
Benjamin Kramer	0d2fc1a501	[InstCombine] Teach SimplifyDemandedBits that udiv doesn't demand low dividend bits that are zero in the divisor This is safe as long as the udiv is not exact. The pattern is not common in C++ code, but comes up all the time in code generated by XLA's GPU backend. Differential Revision: https://reviews.llvm.org/D46647 llvm-svn: 331933	2018-05-09 22:27:34 +00:00
Amaury Sechet	4f729f6a67	[ARM] Add support for SETCCCARRY instead of SETCCE Summary: As per title. SETCCE is deprecated and will eventually be removed. Reviewers: rogfer01, efriedma, rengolin, javed.absar Subscribers: kristof.beyls, chrib, llvm-commits Differential Revision: https://reviews.llvm.org/D46512 llvm-svn: 331929	2018-05-09 22:15:51 +00:00
Roman Tereshin	6d26638c90	[GlobalISel][Legalizer] Widening the second src op of shifts bug fix The second source operand of G_SHL, G_ASHR, and G_LSHR must preserve its value as a (small) unsigned integer, therefore it's incorrect to widen it in any way but by zero extending it. G_SHL was using G_ANYEXT and G_ASHR - G_SEXT (which is correct for their destination and first source operands, but not the "number of bits to shift" operand). Generally, shifts aren't as similar to regular binary operations as it might seem, for instance, they aren't commutative nor associative and the second source operand usually requires special treatment. Reviewers: bogner, javed.absar, aivchenk, rovka Reviewed By: bogner Subscribers: igorb, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D46413 llvm-svn: 331926	2018-05-09 21:43:30 +00:00
Farhana Aleen	e24f3ff8de	[AMDGPU] Support horizontal vectorization of min/max. Author: FarhanaAleen Reviewed By: rampitec Subscribers: AMDGPU Differential Revision: https://reviews.llvm.org/D46604 llvm-svn: 331920	2018-05-09 21:18:34 +00:00
Matt Arsenault	eac81b2448	AMDGPU: Ignore any_extend in mul24 combine If a multiply is truncated, SimplifyDemandedBits sometimes turns a zero_extend of the inputs into an any_extend, which makes the known bits computation unhelpful. Ignore these and compute known bits for the underlying value, since we insert the correct extend type after. llvm-svn: 331919	2018-05-09 21:11:35 +00:00
Krzysztof Parzyszek	cff73a2118	[Hexagon] Add patterns for vector shift-and-accumulate llvm-svn: 331918	2018-05-09 21:10:41 +00:00
Matt Arsenault	74fd7600d2	AMDGPU: Handle partial shift reduction for variable shifts If the variable shift amount has known bits, we can still reduce the shift. llvm-svn: 331917	2018-05-09 20:52:54 +00:00
Matt Arsenault	b143d9a5ea	AMDGPU: Partially shrink 64-bit shifts if reduced to 16-bit This is an extension of an existing combine to reduce wider shls if the result fits in the final result type. This introduces the same combine, but reduces the shift to a middle sized type to avoid the slow 64-bit shift. llvm-svn: 331916	2018-05-09 20:52:43 +00:00
Simon Pilgrim	ca7981ac98	[X86] Fix Broadwell's Shuffle256 schedule classes load latency values. Allows us to remove some unnecessary InstRW overrides. llvm-svn: 331913	2018-05-09 19:27:48 +00:00
Simon Pilgrim	d5d4cdb49d	[X86] Merge instregex patterns to reduce InstrRW compile time. llvm-svn: 331911	2018-05-09 19:04:15 +00:00
Matt Arsenault	762d498808	AMDGPU: Add combine for trunc of bitcast from build_vector If the truncate is only accessing the first element of the vector, we can use the original source value. This helps with some combine ordering issues after operations are lowered to integer operations between bitcasts of build_vector. In particular it stops unnecessarily materializing the unused top half of a vector in some cases. llvm-svn: 331909	2018-05-09 18:37:39 +00:00
Krzysztof Parzyszek	5093945e5c	[Hexagon] Check the end of the correct container (fix typo) llvm-svn: 331907	2018-05-09 18:33:59 +00:00
Matt Arsenault	378f86998c	AMDGPU: Stop special casing constant indexes of extract_vector_elt The same result folds out of the dynamic expansion logic if the index is constant. llvm-svn: 331906	2018-05-09 18:29:26 +00:00
Roman Tereshin	d5fa9fde58	Reapplying r331819 [GlobalISel][Legalizer] More concise and faster widenScalar, NFC The commit was a suspect for clang-cmake-aarch64-global-isel and clang-cmake-aarch64-quick bot failures, proved to be innocent. llvm-svn: 331898	2018-05-09 17:28:18 +00:00
Craig Topper	176ec8506f	[DAGCombiner] In visitBITCAST when trying to constant fold the bitcast, only call getBitcast if it's an fp->int or int->fp conversion even when before legalize ops. Previously if !LegalOperations we would blindly call getBitcast and hope that getNode would constant fold it. But if the conversion is between a vector and a scalar, getNode has no simplification. This means we would just get back the original N. We would then return that N which would make the caller of visitBITCAST think that we used CombineTo and did our own worklist management. This prevents target specific optimizations from being called for vector/scalar bitcasts until after legal operations. llvm-svn: 331896	2018-05-09 17:14:27 +00:00
David Bolvansky	9b5e6e8288	[InstCombine] snprintf optimizations Reviewers: spatel, efriedma, majnemer, rja, bkramer Reviewed By: rja, bkramer Subscribers: rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 331889	2018-05-09 16:09:31 +00:00
Amara Emerson	4e66142f14	[DAGCombine] Change store merge candidates check cut off to 1024. The previous value of 8192 resulted in severe compile time hits in some pathological cases. rdar://39781410 Differential Revision: https://reviews.llvm.org/D46581 llvm-svn: 331888	2018-05-09 15:53:06 +00:00
Krzysztof Parzyszek	01cf424047	[Hexagon] Fix sanitizer error about using -1u in variable of enum type llvm-svn: 331887	2018-05-09 15:44:40 +00:00
Krzysztof Parzyszek	ea4c1bb772	[LV] Change MaxVectorSize bound to 256 in assertion, NFC otherwise It's possible to have a vector of 256 bytes in HVX code on Hexagon (vector pair in 128-byte mode). llvm-svn: 331885	2018-05-09 15:18:12 +00:00
Pavel Labath	b1bcafd706	APFloat/x87: Fix string conversion for "unnormal" values (pr35860) Summary: Unnormal values are a feature of some very old x87 processors. We handle them correctly for the most part -- the only exception was an unnormal value whose significand happened to be zero. In this case the APFloat was still initialized as normal number (category = fcNormal), but a subsequent toString operation would assert because the math would produce nonsensical values for the zero significand. During review, it was decided that the correct way to fix this is to treat all unnormal values as NaNs (as that is what any >=386 processor will do). The issue was discovered because LLDB would crash when trying to print some "long double" values. Reviewers: skatkov, scanon, gottesmm Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41868 llvm-svn: 331884	2018-05-09 15:13:45 +00:00
Krzysztof Parzyszek	8e7625eed7	[Hexagon] Simplify MCCodeEmitter, move data to tables llvm-svn: 331883	2018-05-09 15:02:04 +00:00
Pavel Labath	d20289b31a	[Support/Path] Make handling of paths like "///" consistent Summary: Various path functions were not treating paths consisting of slashes alone consistently. For example, the iterator-based accessors decomposed the path "///" into two elements: "/" and ".". This is not too bad, but it is different from the behavior specified by posix: ``` A pathname that contains *at least one non-slash character* and that ends with one or more trailing slashes shall be resolved as if a single dot character ( '.' ) were appended to the pathname. ``` More importantly, this was different from how we treated the same path in the filename+parent_path functions, which decomposed this path into "." and "". This was completely wrong as it lost the information that this was an absolute path which referred to the root directory. This patch fixes this behavior by making sure all functions treat paths consisting of (back)slashes alone the same way as "/". I.e., the iterator-based functions will just report one component ("/"), and the filename+parent_path will decompose them into "/" and "". A slightly controversial topic here may be the treatment of "//". Posix says that paths beginning with "//" may have special meaning and indeed we have code which parses paths like "//net/foo/bar" specially. However, as we were already not being consistent in parsing the "//" string alone, and any special parsing for it would complicate the code further, I chose to treat it the same way as longer sequences of slashes (which are guaranteed to be the same as "/"). Another slight change of behavior is in the parsing of paths like "//net//". Previously the last component of this path was ".". However, as in our parsing the "//net" part in this path was the same as the "drive" part in "c:\" and the next slash was the "root directory", it made sense to treat "//net//" the same way as "//net/" (i.e., not to add the extra "." component at the end). Reviewers: zturner, rnk, dblaikie, Bigcheese Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45942 llvm-svn: 331876	2018-05-09 13:21:16 +00:00
Adhemerval Zanella	f384bc7166	[AArch64] Improve cost of vector division by constant With custom lowering for vector MULLH{S,U}, it is now profitable to vectorize a divide by constant loop for the custom types (v16i8, v8i16, and v4i32). The cost is based on TargetLowering::Build{S,U}DIV which uses a multiply by constant plus adjustment to express a divide by constant. Both {u,s}mull{2} are expressed as Instruction::Mul and shifts by Instruction::AShr. llvm-svn: 331873	2018-05-09 12:48:22 +00:00
Pavel Labath	e0207a60dd	Revert "DWARFVerifier: Check "completeness" of .debug_names section" The new verifier check has found an error in the debug-names-name-collisions.ll test on the PS4 bot: error: Name Index @ 0x0: Entry @ 0xdc: mismatched Name of DIE @ 0x23: index - _ZN3foo3fooE; debug_info - foo. Reverting while I investigate whether this is a bug in the verifier or the generator. This reverts commit r331868. llvm-svn: 331869	2018-05-09 12:26:19 +00:00
Pavel Labath	3280e0467f	DWARFVerifier: Check "completeness" of .debug_names section Summary: This patch implements a check which makes sure all entries required by the DWARF v5 specification are present in the Name Index. The algorithm tries to follow the wording of Section 6.1.1.1 of the spec as closely as possible. The main deviation from it is that instead of a whitelist-based approach in the spec "The name index must contain an entry for each debugging information entry that defines a named subprogram, label, variable, type, or namespace" I chose a blacklist-based one, where I consider everything to be "in" and then remove the entries that don't make sense. I did this because it has more potential for catching interesting cases and the above is a bit vague (it uses plain words like "variable" and "subprogram", but the rest of the section speaks about specific TAGs). This approach has raised some interesting questions, the main one being whether enumerator values should be indexed. The consensus seems to be that they should, although it does not follow from section 6.1.1.1. For the time being I made the verifier ignore these, as LLVM does not do this yet, and I wanted to get a clean run when verifying generated debug info. Another interesting case was the DW_TAG_imported_declaration. It was not immediately clear to me whether this should go in or not, but currently it is not indexed, and (unlike the enumerators) it does not seem to cause problems for LLDB, so I've also ignored it. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46583 llvm-svn: 331868	2018-05-09 12:06:17 +00:00
Benjamin Kramer	ccb0fbe9a0	Revert "[InstCombine] snprintf optimizations" This reverts commit r331849. It miscompiles snprintf(buf, sizeof(buf), "%s", "any constant string"); into memcpy(buf, "%s", sizeof("any constant string")); llvm-svn: 331866	2018-05-09 11:38:57 +00:00
Simon Pilgrim	ab34aa8294	[X86] Cleanup WriteFStore/WriteVecStore schedules MOVNTPD/MOVNTPS should be WriteFStore Standardized BDW/HSW/SKL/SKX WriteFStore/WriteVecStore - fixes some missed instregex patterns. (V)MASKMOVDQU was already using the default, its cost gets increased but is still nowhere near the real cost of that nasty instruction.... llvm-svn: 331864	2018-05-09 11:01:16 +00:00
Simon Dardis	eead208872	[mips] Move conditional moves out of isCodeGenOnly Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46389 llvm-svn: 331863	2018-05-09 10:33:21 +00:00
Martin Storsjo	284ab80f8d	[COFF] Improve correctness of def parsing for GNU features The operator == used for exporting a function with a different name in the DLL compared to the name in the import library (which is useful for adding linker level aliases for function in the import library) is a feature distinct and different from the operator = used for exporting a function with a different name (both in import library and DLL) than in the implementation producing the DLL. When creating an import library using dlltool, from a def file that contains forwards (Func = OtherDll.Func), this shouldn't affect the produced import library, which should still behave just as if it was a normal exported function. This clears a lot of confusion and subtle misunderstandings, and avoids a parameter that was used to avoid creating weak aliases when invoked from lld. (This parameter was added previously due to the existing conflation of the two features.) Differential Revision: https://reviews.llvm.org/D46245 llvm-svn: 331859	2018-05-09 09:21:53 +00:00
Bjorn Pettersson	9f953cdd7c	[MergedLoadStoreMotion] Fix a debug invariant bug in mergeStores Summary: MergedLoadStoreMotion::mergeStores is using some heuristics to limit the amount of stores that it tries to sink (see MagicCompileTimeControl in MergedLoadStoreMotion.cpp). The heuristic involves counting the number of instructions in one of the basic blocks that is part of the transformation. We now ignore dbg intrinsics when counting instruction for the MagicCompileTimeControl heuristic. This to make sure that the amount of stores that are sunk doesn't depend on the amount of debug information (if -g is used or not). Reviewers: Gerolf, davide, majnemer Reviewed By: davide Subscribers: dberlin, bjope, aprantl, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D46600 llvm-svn: 331852	2018-05-09 06:52:12 +00:00
David Bolvansky	44a37f04b2	[InstCombine] snprintf optimizations Reviewers: spatel, efriedma, majnemer, rja Reviewed By: rja Subscribers: rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 331849	2018-05-09 06:34:20 +00:00
Craig Topper	b9a473d186	[X86] Combine (vXi1 (bitcast (-1)))) and (vXi1 (bitcast (0))) to all ones or all zeros vXi1 vector. llvm-svn: 331847	2018-05-09 06:07:20 +00:00
Daniel Sanders	618437459c	Revert r331816 and r331820 - [globalisel] Add a combiner helpers for extending loads and use them in a pre-legalize combiner for AArch64 Reverting this to see if the clang-cmake-aarch64-global-isel and clang-cmake-aarch64-quick bots are failing because of this commit. We know it wasn't r331819. llvm-svn: 331846	2018-05-09 05:00:17 +00:00
Shiva Chen	801bf7ebbe	[DebugInfo] Examine all uses of isDebugValue() for debug instructions. Because we create a new kind of debug instruction, DBG_LABEL, we need to check all passes which use isDebugValue() to check MachineInstr is debug instruction or not. When expelling debug instructions, we should expel both DBG_VALUE and DBG_LABEL. So, I create a new function, isDebugInstr(), in MachineInstr to check whether the MachineInstr is debug instruction or not. This patch has no new test case. I have run regression test and there is no difference in regression test. Differential Revision: https://reviews.llvm.org/D45342 Patch by Hsiangkai Wang. llvm-svn: 331844	2018-05-09 02:42:00 +00:00
Shiva Chen	cd070cdc94	[DebugInfo] Convert intrinsic llvm.dbg.label to MachineInstr. In order to convert LLVM IR to MachineInstr, we need a new TargetOpcode, DBG_LABEL, to ‘lower’ intrinsic llvm.dbg.label. The patch creates this new TargetOpcode and convert intrinsic llvm.dbg.label to MachineInstr through SelectionDAG. In SelectionDAG, debug information is stored in SDDbgInfo. We create a new data member of SDDbgInfo for labels and use the new data member, SDDbgLabel, to create DBG_LABEL MachineInstr. The new DBG_LABEL MachineInstr uses label metadata from LLVM IR as its parameter. So, the backend could get metadata information of labels from DBG_LABEL MachineInstr. Differential Revision: https://reviews.llvm.org/D45341 Patch by Hsiangkai Wang. llvm-svn: 331842	2018-05-09 02:41:08 +00:00
Shiva Chen	2c864551df	[DebugInfo] Add DILabel metadata and intrinsic llvm.dbg.label. In order to set breakpoints on labels and list source code around labels, we need to collect debug information for labels, i.e., the label name, the function the label belongs to, the line number in the file, and the address where the label is located. To keep this information in LLVM IR and to allow the backend to generate debug information correctly, we create a new kind of metadata for labels, DILabel. The format of DILabel is !DILabel(scope: !1, name: "foo", file: !2, line: 3) We hope to keep as much debug information as possible even when the code is optimized. So, we create a new kind of intrinsic for label metadata to avoid the metadata being eliminated with the basic block. The intrinsic will keep existing as long as we keep it from being optimized out. The format of the intrinsic is llvm.dbg.label(metadata !1) It has only one argument, that is the DILabel metadata. The intrinsic will follow the label immediately. The backend can get the label metadata through the intrinsic's parameter. We also create a DIBuilder API for labels to be used by the frontend. The frontend can use createLabel() to allocate DILabel objects, and use insertLabel() to insert the llvm.dbg.label intrinsic in LLVM IR. Differential Revision: https://reviews.llvm.org/D45024 Patch by Hsiangkai Wang. llvm-svn: 331841	2018-05-09 02:40:45 +00:00
Roman Tereshin	27bba4495a	Revert r331819 [GlobalISel][Legalizer] More concise and faster widenScalar, NFC Reverting this to see if the clang-cmake-aarch64-global-isel and clang-cmake-aarch64-quick bots are failing because of this commit llvm-svn: 331839	2018-05-09 01:43:12 +00:00
Lang Hames	3ae85708c9	[RuntimeDyld][MachO] Properly handle thumb to thumb calls within a section. Previously thumb bits were only checked for external relocations (thumb to arm code and vice-versa). This patch adds detection for thumb callees in the same section as the (also thumb) caller. The MachO/Thumb test case is updated to cover this, and redundant checks (handled by the MachO/ARM test) are removed. llvm-svn: 331838	2018-05-09 01:38:13 +00:00
Heejin Ahn	bf7716952a	Support a funclet operand bundle in LowerInvoke Summary: The current LowerInvoke pass cannot handle invoke instructions with a funclet bundle operand. The order of operands for an invoke instruction is {call arguments, callee, funclet operand (if any), normal dest, unwind dest}. The current code assumes there is no funclet operand and incorrectly includes a funclet operand into call arguments. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46242 llvm-svn: 331832	2018-05-09 00:53:50 +00:00
Davide Italiano	48283ba3a1	[SimplifyCFG] Fix a crash when folding PHIs. We enter MergeBlockIntoPredecessor with a block looking like this: for.inc.us-lcssa: ; preds = %cond.end %k.1.lcssa.ph = phi i32 [ %conv15, %cond.end ] %t.3.lcssa.ph = phi i32 [ %k.1.lcssa.ph, %cond.end ] br label %for.inc, !dbg !66 [note the first arg of the PHI being a PHI]. FoldSingleEntryPHINodes gets rid of both PHIs (calling eraseFromParent). But right before we call the function, we push into IncomingValues the only argument of the PHIs, and shortly after we try to iterate over something which has been invalidated before :( The fix is to not try to remove PHIs which have an incoming value coming from the same BB we're looking at. Fixes PR37300 and rdar://problem/39910460 Differential Revision: https://reviews.llvm.org/D46568 llvm-svn: 331824	2018-05-08 23:28:15 +00:00
Peter Collingbourne	8e173e1189	MC: Remove dead code. NFCI. We should never emit an SHT_DYNSYM into an object file. llvm-svn: 331821	2018-05-08 22:59:05 +00:00
Daniel Sanders	ec17920da1	[globalisel] Correct r331816 to check the opcode before calling getOperand(). Fix a silly mistake in my pre-commit changes for r331816. It should check what opcode the insn is before extracting the operands. NFC at the moment since the caller already checked the opcode. llvm-svn: 331820	2018-05-08 22:58:35 +00:00
Roman Tereshin	25cbfe680e	[GlobalISel][Legalizer] More concise and faster widenScalar, NFC Refactoring LegalizerHelper::widenScalar member function reducing its size by approximately a factor of 2 and (hopefully) making it more straightforward and regular by introducing widenScalarSrc and widenScalarDst helper methods. The new widenScalar* methods mutate the instructions in place instead of recreating them from scratch and removing the originals. The compile time implications of this were measured on sqlite3 amalgamation, targeting AArch64 in -O0: LegalizerHelper::widenScalar: > 25% faster Legalizer::runOnMachineFunction: ~ 4.0 - 4.5% faster Also adding MachineOperand::setCImm and refactoring out MachineIRBuilder::recordInsertion methods to make the change possible. Reviewers: aditya_nandakumar, bogner, javed.absar, t.p.northover, ab, dsanders, arsenm Reviewed By: aditya_nandakumar Subscribers: wdng, rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D46414 llvm-svn: 331819	2018-05-08 22:53:09 +00:00
Daniel Sanders	d24dcdd1f7	[globalisel] Add a combiner helpers for extending loads and use them in a pre-legalize combiner for AArch64 Summary: Depends on D45541 Reviewers: ab, aditya_nandakumar, bogner, rtereshin, volkan, rovka, javed.absar, aemerson Reviewed By: aemerson Subscribers: aemerson, rengolin, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45543 llvm-svn: 331816	2018-05-08 22:26:39 +00:00
Jessica Paquette	ec37c640dd	Revert "[X86][CET] Shadow stack fix for setjmp/longjmp" This reverts commit 30962eca38ef02666ebcdded72a94f2cd0292d68. This commit has been causing test asan failures on a build bot. http://green.lab.llvm.org/green/job/clang-stage1-configure-RA/45108/ Original commit: https://reviews.llvm.org/D46181 llvm-svn: 331813	2018-05-08 22:00:57 +00:00
Martin Storsjo	966d34563f	[AsmPrinter] Allow emitting codeview for any windows target Before SVN r244158, codeview debug info was always emitted for msvc if debug info was enabled, but that commit added a module flag. Since it's still restricted by the flag, we can allow it for any target if the user requests it, not only msvc (and windows-itanium, added in SVN r287567). Add a test for emitting it for a mingw target. Differential Revision: https://reviews.llvm.org/D46303 llvm-svn: 331809	2018-05-08 20:56:04 +00:00
Max Moroz	0c5b602015	[Coverage] Take filenames into account when loading function records. Summary: Don't skip functions with the same name but from different files. That change makes it possible to generate code coverage reports from different binaries compiled from different sources even if there are functions with non-unique names. Without that change, code coverage for such functions is missing except for the first function processed. Reviewers: vsk, morehouse Reviewed By: vsk Subscribers: llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D46478 llvm-svn: 331801	2018-05-08 19:26:51 +00:00
Hideki Saito	d722d61402	[LV] Fix for PR37248, Broadcast codegen incorrectly assumed vector loop body is single basic block Summary: Broadcast code generation emitted instructions in the pre-header, while the instruction they depend on is in the vector loop body. This resulted in an IL verification error ---- value used before defined. Reviewers: rengolin, fhahn, hfinkel Reviewed By: rengolin, fhahn Subscribers: dcaballe, Ka-Ka, llvm-commits Differential Revision: https://reviews.llvm.org/D46302 llvm-svn: 331799	2018-05-08 18:57:34 +00:00
Tim Renouf	64afc2d7f0	[AMDGPU] Provide machine -> name mapping Summary: AMDGPU stores a numerical code for the particular GPU variant in EFlags in the ELF file. This commit provides a mapping from that number into the machine name for use by objdump-type tools. Change-Id: Id37fc0bebad443bd89c0080985ce298c4e7e9319 Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46587 llvm-svn: 331798	2018-05-08 18:53:04 +00:00
Lei Huang	e41e3d3237	[Power9]Legalize and emit code for truncate and convert QP to HW and Byte Legalize and emit code for truncate and convert float128 to (un)signed short and (un)signed char. Differential Revision: https://reviews.llvm.org/D46194 llvm-svn: 331797	2018-05-08 18:52:06 +00:00
Matt Arsenault	869cbedc81	AMDGPU: Fix broken dynamic vector indexing for packed types The intention of this was to multiply by 16, not shift by 16. llvm-svn: 331793	2018-05-08 18:43:25 +00:00
Matt Arsenault	1f61c0792c	DAG: Use correct shift width type llvm-svn: 331792	2018-05-08 18:43:05 +00:00
Lei Huang	6364288dba	[Power9]Legalize and emit code for truncate and convert Quad-Precision to Word Legalize and emit code for: * xscvqpswz : VSX Scalar truncate & Convert Quad-Precision to Signed Word * xscvqpuwz : VSX Scalar truncate & Convert Quad-Precision to Unsigned Word Differential Revision: https://reviews.llvm.org/D45635 llvm-svn: 331790	2018-05-08 18:34:00 +00:00
Changpeng Fang	d049da3740	AMDGPU: Use eraseFromParent to delete an instruction when it is no longer needed. Reviewer: Nicolai Differential Revision: https://reviews.llvm.org/D46438 llvm-svn: 331788	2018-05-08 18:32:35 +00:00
Lei Huang	c517e95bc6	[Power9]Legalize and emit code for truncate and convert QP to DW Legalize and emit code for: * xscvqpsdz : VSX Scalar truncate & Convert Quad-Precision to Signed Dword * xscvqpudz : VSX Scalar truncate & Convert Quad-Precision to Unsigned Dword Differential Revision: https://reviews.llvm.org/D45553 llvm-svn: 331787	2018-05-08 18:23:31 +00:00
Guozhi Wei	1aea95a9ea	[CodeGenPrepare] Move Extension Instructions Through Logical And Shift Instructions The CodeGenPrepare pass moves extension instructions close to load instructions in a different BB, so they can be combined later. But the extension instructions can't move through logical and shift instructions in the current implementation. This patch enables this enhancement, so we can eliminate more extension instructions. Differential Revision: https://reviews.llvm.org/D45537 llvm-svn: 331783	2018-05-08 17:58:32 +00:00
Lei Huang	c29229a644	[PowerPC] Unify handling for conversion of FP_TO_INT feeding a store Existing DAG combine only handles conversions for FP_TO_SINT: "{f32, f64} x { i32, i16 }" This patch simplifies the code to handle: "{ FP_TO_SINT, FP_TO_UINT } x { f64, f32 } x { i64, i32, i16, i8 }" Differential Revision: https://reviews.llvm.org/D46102 llvm-svn: 331778	2018-05-08 17:36:40 +00:00
Stanislav Mekhanoshin	432936161e	[AMDGPU] Added checks for dpp_ctrl value - Report error for invalid dpp_ctrl values. - Changed the way it is reported, now the error will be emitted into asm and will work with release build as well. - Added dpp_ctrl value verifier for codegen. - Added symbolic constants for dpp_ctrl. Differential Revision: https://reviews.llvm.org/D46565 llvm-svn: 331775	2018-05-08 16:53:02 +00:00
Simon Pilgrim	f5f28aa714	[X86] Tag PCONFIG instruction with WriteSystem scheduler class llvm-svn: 331773	2018-05-08 15:55:14 +00:00
Stefan Maksimovic	c7113cc9e4	[mips][msa] Pattern match the splat.d instruction Introduced a new pattern for matching splat.d explicitly. Both splat.d and splati.d can now be generated from the @llvm.mips.splat.d intrinsic depending on whether an immediate value has been passed. Differential Revision: https://reviews.llvm.org/D45683 llvm-svn: 331771	2018-05-08 15:12:29 +00:00
Simon Pilgrim	2864b46469	[X86] Split off WriteIMul64 from WriteIMul schedule class (PR36931) This fixes a couple of BtVer2 missing instructions that weren't being handled in the override. NOTE: There are still a lot of overrides that still need cleaning up! llvm-svn: 331770	2018-05-08 14:55:16 +00:00
Simon Pilgrim	2580554333	[X86] Split WriteIDiv into div/idiv 8/16/32/64 implementations (PR36930) I've created the necessary classes but there are still a lot of overrides that need cleaning up. NOTE: The Znver1 model was missing some div/idiv variants in the instregex patterns and wasn't setting the resource cycles at all in the overrides. llvm-svn: 331767	2018-05-08 13:51:45 +00:00
Simon Pilgrim	b0a3be04ec	[X86] Add vector masked load/store scheduler classes (PR32857) Split off from existing vector load/store classes to remove InstRW overrides. llvm-svn: 331760	2018-05-08 12:17:55 +00:00
Sander de Smalen	d8e76494fc	[AArch64][SVE] Asm: Support for LD1R load-and-replicate scalar instructions. Reviewers: fhahn, rengolin, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D46251 llvm-svn: 331758	2018-05-08 10:46:55 +00:00
Simon Pilgrim	210286ed8f	[X86] Add SchedWriteFTest/SchedWriteVecTest TEST scheduler classes Split off from SchedWriteVecLogic to remove InstRW overrides. llvm-svn: 331757	2018-05-08 10:28:03 +00:00
Simon Dardis	e0982cca98	[mips] Mark various memory instructions as being in microMIPS (NFC) Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46388 llvm-svn: 331756	2018-05-08 10:16:21 +00:00
Sander de Smalen	20eede7093	[AArch64] Disallow vector operand if FPR128 Q register is required. Patch https://reviews.llvm.org/D41445 changed the behaviour of 'isReg()' to also return 'true' if the parsed register operand is a vector register. Code in the AsmMatcher checks if a register is a subclass of the expected register class. However, even though both parsed registers map to the same physical register, the 'v' register is of kind 'NeonVector', where 'q' is of type Scalar, where isSubclass() does not distinguish between the two cases. The solution is to use an AsmOperand instead of the register directly, and use the PredicateMethod to distinguish the two operands. This fixes for example: ldr v0, [x0] // 'v0' is an invalid operand for this instruction ldr q0, [x0] // valid Reviewers: aemerson, Gerolf, SjoerdMeijer, javed.absar Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D46310 llvm-svn: 331755	2018-05-08 10:01:04 +00:00
Simon Dardis	7563624fcb	[mips] Correct clo/clz predicates Reviewers: smaksimovic, abeserminji, atanasyan Differential Revision: https://reviews.llvm.org/D46125 llvm-svn: 331754	2018-05-08 09:50:37 +00:00
Jeremy Morse	4f799c027e	[X86] Mark all byval parameters as aliased This is a fix for PR30290: by marking all byval stack slots as being aliased, the instruction scheduler is more conservative about rescheduling memory accesses to such stack slots as an LLVM Value* might alias it. This fixes errors such as in the patched test case, where reads and writes to a data structure are illegally mixed. This could be fixed better in the future with better analysis for the instruction scheduler to know what Values alias what stack slots. Differential Revision: https://reviews.llvm.org/D45022 llvm-svn: 331749	2018-05-08 09:18:01 +00:00
Alexander Ivchenko	c47f799289	[X86][CET] Shadow stack fix for setjmp/longjmp This patch adds a shadow stack fix when compiling setjmp/longjmp with the shadow stack enabled. This allows setjmp/longjmp to work correctly with CET. Patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D46181 llvm-svn: 331748	2018-05-08 09:04:07 +00:00
Hans Wennborg	5e6e6cc721	Object: Find terminator correctly when reading long filenames in GNU archives (PR37244) The code was previously relying on there being a null terminator somewhere in (or after) the string table, something made less likely by r330786. Differential Revision: https://reviews.llvm.org/D46527 llvm-svn: 331746	2018-05-08 08:22:58 +00:00
Gabor Buella	4a02bf945e	[x86] Introduce the enclv instruction Summary: and use the -msgx flag as a requirement for the SGX instructions. Reviewers: craig.topper, zvi Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D46436 llvm-svn: 331742	2018-05-08 07:11:05 +00:00
Bjorn Pettersson	51cebc98f3	[LCSSA] Do not remove used PHI nodes in formLCSSAForInstructions Summary: In formLCSSAForInstructions we speculatively add new PHI nodes that sometimes end up without having any uses. It has been discovered that sometimes an added PHI node can appear as being unused in one iteration of the Worklist, although it can end up being used by a PHI node added in a later iteration. We now check, a second time, that the PHI node still is unused before we remove it. This avoids an assert about "Trying to remove a phi with uses." for the added test case. Reviewers: davide, mzolotukhin, mattd, dberlin Reviewed By: mzolotukhin, dberlin Subscribers: dberlin, mzolotukhin, davide, bjope, uabelho, llvm-commits Differential Revision: https://reviews.llvm.org/D46422 llvm-svn: 331741	2018-05-08 06:59:47 +00:00
Gabor Buella	2b5e96004b	[x86] Introduce the pconfig instruction Reviewers: craig.topper, zvi Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D46430 llvm-svn: 331739	2018-05-08 06:47:36 +00:00
Fangrui Song	bd088560a8	[DebugInfo] Accept `S` in augmentation strings in CIE. glibc libc.a(sigaction.o) compiled from sysdeps/unix/sysv/linux/x86_64/sigaction.c uses "zRS". llvm-svn: 331738	2018-05-08 06:21:12 +00:00
Roman Tereshin	d2421f9445	[MachineVerifier][GlobalISel] Verifying generic extends and truncates Making sure we don't truncate / extend pointers, don't try to change vector topology or bitcast vectors to scalars or back, and most importantly, don't extend to a smaller type or truncate to a large one. Reviewers: qcolombet t.p.northover aditya_nandakumar Reviewed By: qcolombet Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D46490 llvm-svn: 331718	2018-05-08 02:48:15 +00:00
Roman Tereshin	5e51fac39a	[MIRParser][GlobalISel] Parsing vector pointer types (<M x pA>) MIParser wasn't able to parse LLTs like `<4 x p0>`, fixing that. Reviewers: qcolombet t.p.northover aditya_nandakumar Reviewed By: qcolombet Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D46490 llvm-svn: 331712	2018-05-08 02:02:50 +00:00
Teresa Johnson	59da890c96	[NewPM] Emit inliner NoDefinition missed optimization remark Summary: Makes this consistent with the old PM. Reviewers: eraman Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D46526 llvm-svn: 331709	2018-05-08 01:45:46 +00:00
Sam Clegg	25d8e68b87	[WebAssembly] MC: Use existing MCSymbol.Index field rather than inventing extra mapping MCSymbol has getIndex/setIndex which are implementation defined and on other platforms are used to store the symbol table index. It makes sense to use this rather than invent a new mapping. Differential Revision: https://reviews.llvm.org/D46555 llvm-svn: 331705	2018-05-08 00:08:21 +00:00
Sam Clegg	22c568ba73	[MC] ELFObjectWriter: Removing unneeded variable and cast Differential Revision: https://reviews.llvm.org/D46289 llvm-svn: 331704	2018-05-07 23:52:17 +00:00
Chandler Carruth	029cea90fa	Revert r330742: Let TableGen write output only if it changed, instead of doing so in cmake. This change causes us to re-run tablegen for every single target on every single build. This is much, much worse than the problem being fixed AFAICT. On my system, it makes a clean rebuild of `llc` with nothing changed go from .5s to over 8s. On systems with less parallelism, slower file systems, or high process startup overhead this will be even more extreme. The only way I see this could be a win is in clean builds where we churn the filesystem. But I think incremental rebuild is more important, and so if we want to re-instate this, it needs to be done in a way that doesn't trigger constant re-runs of tablegen. llvm-svn: 331702	2018-05-07 23:41:48 +00:00
Roman Tereshin	d29fc89222	[MachineVerifier][GlobalISel] Checking that generic instrs have LLTs on all vregs Every generic machine instruction must have generic virtual registers only, that is, have a low-level type attached to each operand. Previously MachineVerifier would catch a type missing on an operand only if the previous operand for the same type index exists and has a type attached to it, and it would report it as a type mismatch. This is inconsistent behaviour and a misleading error message. This commit makes sure MachineVerifier explicitly checks that the types are there for every operand and if not provides a straightforward error message. Reviewers: qcolombet t.p.northover bogner ab Reviewed By: qcolombet Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D46455 llvm-svn: 331694	2018-05-07 22:31:47 +00:00
Roman Tereshin	f487edae49	[MachineVerifier][GlobalISel] NFC, Improving MO printing and refactoring visitMachineInstrBefore This is an NFC pre-commit for the following "Checking that generic instrs have LLTs on all vregs" commit. This overloads MachineOperand::print to make it possible to print LLTs with standalone machine operands. This also overloads MachineVerifier::print(...MachineOperand...) with an optional LLT using the newly introduced MachineOperand::print variant; no actual calls added. This also refactors MachineVerifier::visitMachineInstrBefore in the parts dealing with all generic instructions (checking Selected property, LLTs, and phys regs). llvm-svn: 331693	2018-05-07 22:31:12 +00:00
Tom Stellard	37444285f1	AMDGPU/GlobalISel: Don't try to lower hull shaders Summary: The AMDGPU_HS calling convention is not supported yet. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46149 llvm-svn: 331691	2018-05-07 22:17:54 +00:00
Roman Lebedev	9bd6067db6	[DAGCombiner] Masked merge: enhance handling of 'andn' with immediates Summary: Split off from D46031. The previous patch, D46493, completely disabled unfolding in case of immediates. But we can do better: {F6120274} {F6120277} https://rise4fun.com/Alive/xJS Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D46494 llvm-svn: 331685	2018-05-07 21:52:22 +00:00
Roman Lebedev	cc42d08b1d	[DagCombiner] Not all 'andn''s work with immediates. Summary: Split off from D46031. In masked merge case, this degrades IPC by decreasing instruction count. {F6108777} The next patch should be able to recover and improve this. This also affects the transform @spatel have added in D27489 / rL289738, and the test coverage for X86 was missing. But after i have added it, and looked at the changes in MCA, i'm somewhat confused. {F6093591} {F6093592} {F6093593} I'd say this regression is an improvement, since `IPC` increased in that case? Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: andreadb, llvm-commits, spatel Differential Revision: https://reviews.llvm.org/D46493 llvm-svn: 331684	2018-05-07 21:52:11 +00:00
Dmitry Mikulin	738bac77c1	Remove explicit setting of the CFI jumptable section name, it does not appear to be needed: jump table sections are created with .cfi.jumptable suffix. With this change each jump table is placed in a separate section, which allows the linker to re-order them. Differential Revision: https://reviews.llvm.org/D46537 llvm-svn: 331680	2018-05-07 21:30:15 +00:00
Simon Pilgrim	1233e1234a	[X86] Split WriteFAdd/WriteFCmp/WriteFMul schedule classes Split to support single/double for scalar, XMM and YMM/ZMM instructions - removing InstrRW overrides for these instructions. Fixes Atom ADDSUBPD instruction and reclassifies VFPCLASS as WriteFCmp which is closer in behaviour. llvm-svn: 331672	2018-05-07 20:52:53 +00:00
Aaron Smith	47589e09dd	[SelectionDAG] Transfer DbgValues when casts are optimized in SelectionDAG::getNode Summary: getNode optimizes (ext (trunc x)) to x and the dbgvalue node on trunc is lost. The fix calls transferDbgValues to add the dbgvalue to x. Add DebugInfo/AArch64/dbg-value-i16.ll Patch by Sejong Oh! Reviewers: aprantl, javed.absar, llvm-commits, vsk Reviewed By: aprantl, vsk Subscribers: kristof.beyls, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D46348 llvm-svn: 331665	2018-05-07 20:15:50 +00:00
Sam Clegg	fb807d4dd4	[WebAssembly] Ensure all .debug_XXX sections have proper symbol names Updated wasm section symbol names to match the section name, and ensure all referenced sections will have a symbol (per DWARF spec v3, Figure 43) Patch by Yury Delendik! Differential Revision: https://reviews.llvm.org/D46543 llvm-svn: 331664	2018-05-07 19:40:50 +00:00
Simon Pilgrim	e480ed0b9f	[X86][AVX2] Tag VPMOVSX/VPMOVZX ymm instructions as WriteShuffle256 These are more like cross-lane shuffles than regular shuffles - we already do this for AVX512 equivalents. Differential Revision: https://reviews.llvm.org/D46229 llvm-svn: 331659	2018-05-07 18:25:19 +00:00
Krzysztof Parzyszek	786fc3d079	[Hexagon] Move clamping of extended operands directly to MC code emitter llvm-svn: 331653	2018-05-07 17:34:23 +00:00
Simon Pilgrim	763bf12085	[X86][Znver1] Remove WriteFMul/WriteFRcp InstRW overrides/aliases. Fixes x87 schedules to more closely match Agner - AMD doesn't tend to "special case" x87 instructions as much as Intel. llvm-svn: 331645	2018-05-07 16:34:26 +00:00
Simon Pilgrim	ac5d0a31ef	[X86] Split WriteFDiv schedule classes to support single/double scalar, XMM and YMM/ZMM instructions. This removes all InstrRW overrides for these instructions - some x87 overrides remain but most use default (and realistic) values. llvm-svn: 331643	2018-05-07 16:15:46 +00:00
Mark Searles	4a0f2c5047	[AMDGPU][Waitcnt] Remove the old waitcnt pass Remove the old waitcnt pass ( si-insert-waits ), which is no longer maintained and getting crufty Differential Revision: https://reviews.llvm.org/D46448 llvm-svn: 331641	2018-05-07 14:43:28 +00:00
Petar Jovanovic	cc4915701c	Add option -verify-cfiinstrs to run verifier in CFIInstrInserter Instead of enabling it for non NDEBUG builds, use -verify-cfiinstrs to run verifier in CFIInstrInserter. It defaults to false. Differential Revision: https://reviews.llvm.org/D46444 llvm-svn: 331635	2018-05-07 14:09:33 +00:00
Tim Renouf	18a1e9d03a	[AMDGPU] Don't force WQM for DS op Summary: Previously, all DS ops forced WQM in a pixel shader. That was a hack to allow for graphics frontends using ds_swizzle to implement explicit derivatives, on SI/CI at least where DPP is not available. But it forced WQM for _any_ DS op. With this commit, DS ops no longer force WQM. Both graphics frontends (Mesa and LLPC) need to change to issue an explicit llvm.amdgcn.wqm intrinsic call when calculating explicit derivatives. The required Mesa change is: "amd/common: use llvm.amdgcn.wqm for explicit derivatives". Subscribers: qcolombet, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46051 Change-Id: I9b745b626fa91bbd66456e6cf41ee07eeea42f81 llvm-svn: 331633	2018-05-07 13:21:26 +00:00
Simon Pilgrim	f3ae50fca2	[X86] Split WriteFRcp/WriteFRsqrt/WriteFSqrt schedule classes WriteFRcp/WriteFRsqrt are split to support scalar, XMM and YMM/ZMM instructions. WriteFSqrt is split into single/double/long-double sizes and scalar, XMM, YMM and ZMM instructions. This removes all InstrRW overrides for these instructions. NOTE: There were a couple of typos in the Znver1 model - notably a 1cy throughput for SQRT that is highly unlikely and doesn't tally with Agner. NOTE: I had to add Agner's numbers for several targets for WriteFSqrt80. llvm-svn: 331629	2018-05-07 11:50:44 +00:00
Petar Jovanovic	3ae0c0e291	Skip unreachable blocks for CFIInstrInserter verify Iterate only through reachable blocks. This finetunes r330706 and it resolves build issue reported by Craig Topper. llvm-svn: 331628	2018-05-07 11:47:48 +00:00
Jonas Paulsson	ebb1605bf3	[SystemZ] Bugfix for MVCLoop CC clobbering. MVCLoop clobbers CC (since it emits a compare/branch), but this was not modelled. Review: Ulrich Weigand llvm-svn: 331627	2018-05-07 10:48:43 +00:00
Amaury Sechet	f91b6a8cf7	[ARM] Select result 1 from ConvertBooleanCarryToCarryFlag's result automatically. NFC The old behavior returned the value 0, which is error prone. llvm-svn: 331614	2018-05-07 01:43:42 +00:00
Craig Topper	7413b322ea	[TargetLowering] Use StringRef::split instead of SplitString. NFC SplitString splits based on a list of delimiters, but we're only using one delimiter so we should use the simpler split. llvm-svn: 331613	2018-05-07 01:32:18 +00:00
Craig Topper	c882014f43	[X86] Fix copy/paste mistake in comment. NFC llvm-svn: 331611	2018-05-07 00:47:02 +00:00
Roman Lebedev	cb1af9134a	[NFC][DAGCombine] unfoldMaskedMerge(): rename two variables The current names can be confused with the A and B sides of the canonical masked merge pattern. llvm-svn: 331609	2018-05-06 20:02:22 +00:00
Craig Topper	cb2abc7977	[X86] Enable reciprocal estimates for v16f32 vectors by using VRCP14PS/VRSQRT14PS Summary: The legacy VRCPPS/VRSQRTPS instructions aren't available in 512-bit versions. The new increased precision versions are. So we can use those to implement v16f32 reciprocal estimates. For KNL CPUs we can probably use VRCP28PS/VRSQRT28PS and avoid the NR step altogether, but I leave that for a future patch. Reviewers: spatel Reviewed By: spatel Subscribers: RKSimon, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D46498 llvm-svn: 331606	2018-05-06 17:48:21 +00:00
Daniel Sanders	f84bc3793e	[globalisel] Update GlobalISel emitter to match new representation of extending loads Summary: Previously, an extending load was represented as (G_EXT (G_LOAD x)). This had a few drawbacks: * G_LOAD had to be legal for all sizes you could extend from, even if registers didn't naturally hold those sizes. * All sizes you could extend from had to be allocatable just in case the extend went missing (e.g. by optimization). * At minimum, G_EXT and G_TRUNC had to be legal for these sizes. As we improve optimization of extends and truncates, this legality requirement would spread without considerable care w.r.t when certain combines were permitted. * The SelectionDAG importer required some ugly and fragile pattern rewriting to translate patterns into this style. This patch changes the representation to: * (G_[SZ]EXTLOAD x) * (G_LOAD x) any-extends when MMO.getSize() * 8 < ResultTy.getSizeInBits() which resolves these issues by allowing targets to work entirely in their native register sizes, and by having a more direct translation from SelectionDAG patterns. Each extending load can be lowered by the legalizer into separate extends and loads, however a target that supports s1 will need the any-extending load to extend to at least s8 since LLVM does not represent memory accesses smaller than 8 bit. The legalizer can widenScalar G_LOAD into an any-extending load but sign/zero-extending loads need help from something else like a combiner pass. A follow-up patch will add combiner helpers for this. The new representation requires that the MMO correctly reflect the memory access so this has been corrected in a couple tests. I've also moved the extending loads to their own tests since they are (mostly) separate opcodes now. Additionally, the re-write appears to have invalidated two tests from select-with-no-legality-check.mir since the matcher table no longer contains loads that result in s1's and they aren't legal in AArch64 anymore. Depends on D45540 Reviewers: ab, aditya_nandakumar, bogner, rtereshin, volkan, rovka, javed.absar Reviewed By: rtereshin Subscribers: javed.absar, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D45541 llvm-svn: 331601	2018-05-05 20:53:24 +00:00
Heejin Ahn	c86da6bcfe	[MIRParser] Improve error checking for typed immediate operands Summary: This improves error checks for typed immediate operands introduced in D45948 (rL331586), and removes a code block copied by mistake. Reviewers: rtereshin Subscribers: dschuff, sbc100, jgravelle-google, llvm-commits Differential Revision: https://reviews.llvm.org/D46491 llvm-svn: 331600	2018-05-05 20:53:23 +00:00
Fangrui Song	862eebb6d6	Simplify LLVM_ATTRIBUTE_USED call sites. llvm-svn: 331599	2018-05-05 20:14:38 +00:00
Roman Lebedev	a3b0b59f54	[DAGCombiner] Masked merge: don't touch "not" xor's. Summary: Split off from D46031. It seems we don't want to transform the pattern if the `xor`'s are actually `not`'s. In vector case, this breaks `andnpd` / `vandnps` patterns. That being said, we may want to re-visit this `not` handling, maybe in D46073. Reviewers: spatel, craig.topper, javed.absar Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46492 llvm-svn: 331595	2018-05-05 15:45:40 +00:00
Teresa Johnson	81d9207317	[LTO] Handle Task=-1 passed to addSaveTemps Summary: This change is necessary for D46464, which will pass -1 as the Task ID for distributed backends, so that the save temps files don't end up with "4294967295" in their path. For distributed back ends, when -1 is passed, don't append any Task ID. An existing test (tools/clang/test/CodeGen/thinlto_backend.ll) will fail without this change after D46464. Reviewers: pcc Subscribers: mehdi_amini, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D46488 llvm-svn: 331591	2018-05-05 14:37:20 +00:00
Roman Lebedev	49ada82fa7	[NFC][DagCombiner] unfoldMaskedMerge(): improve readability. llvm-svn: 331588	2018-05-05 10:39:54 +00:00
Piotr Padlewski	e9832dfdf3	[CaptureTracking] Handle capturing of launder.invariant.group Summary: launder.invariant.group has the same rules of capturing as bitcast, gep, etc - the original value is not captured if the returned pointer is not captured. With this patch, we mark 40% more functions as noalias when compiling with -fstrict-vtable-pointers; 1078 vs 1778 (39.37%) Reviewers: sanjoy, davide, nlewycky, majnemer, mehdi_amini Subscribers: JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D32673 llvm-svn: 331587	2018-05-05 10:23:27 +00:00
Heejin Ahn	c2ad096845	[MIRParser] Allow register class names in the form of integer/scalar Summary: The current code cannot handle register class names like 'i32', which is a valid register class name in WebAssembly. This patch removes special handling for integer/scalar/pointer type parsing and treats them as normal identifiers. Reviewers: thegameg Subscribers: jfb, dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D45948 llvm-svn: 331586	2018-05-05 07:05:51 +00:00
George Burgess IV	f9d26af4ea	Range-ify for loop; NFC llvm-svn: 331582	2018-05-05 04:52:26 +00:00
Craig Topper	781aa181ab	Fix a bunch of places where operator-> was used directly on the return from dyn_cast. Inspired by r331508, I did a grep and found these. Mostly just change from dyn_cast to cast. Some cases also showed a dyn_cast result being converted to bool, so those I changed to isa. llvm-svn: 331577	2018-05-05 01:57:00 +00:00
Peter Collingbourne	e04ecc88de	LowerTypeTests: Fix non-determinism in code that handles icall branch funnels. This was exposed by enabling expensive checks, which causes llvm::sort to sort randomly. Differential Revision: https://reviews.llvm.org/D45901 llvm-svn: 331573	2018-05-05 00:51:55 +00:00
Teresa Johnson	b77ab0966e	[LTO] Allow pass remarks with hotness to be set when emitting to stderr Summary: Set setDiagnosticsHotnessRequested before the early exit check for a diagnostic output file, so that pass remarks with hotness work when emitting pass remarks to stderr (e.g. via -pass-remarks=.). Also fix the llvm-lto2 diagnostic handler so that it only calls exit(1) when the diagnostic is an error type. Otherwise the new test invocation of llvm-lto2 with -pass-remarks causes it to fail. The new code is consistent with the diagnostic handler elsewhere (e.g. on the LLVMContext). Reviewers: pcc, davide Subscribers: fhahn, mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D46387 llvm-svn: 331569	2018-05-04 23:59:34 +00:00
Michael Berg	2dcf12ffd4	Mapping SDNode flags to MachineInstr flags Summary: Providing the glue to map SDNode fast math sub flags to MachineInstr fast math sub flags. Reviewers: spatel, arsenm, wristow Reviewed By: spatel Subscribers: wdng Differential Revision: https://reviews.llvm.org/D46447 llvm-svn: 331567	2018-05-04 23:41:15 +00:00
Konstantin Zhuravlyov	91a74f53db	AMDGPU/NFC: Update D16PreservesUnusedBits description based on Tony Tye's comments llvm-svn: 331564	2018-05-04 22:53:55 +00:00
Philip Reames	5b39acd111	[LICM] Compute a must execute property for the prefix of the header as we go Computing this property within the existing walk ensures that the cost is linear with the size of the block. If we did this from within isGuaranteedToExecute, it would be quadratic without some very fancy caching. This allows us to reliably catch a hoistable instruction within a header which may throw at some point after our hoistable instruction. It doesn't do anything for non-header cases, but given how common single block loops are, this seems very worthwhile. llvm-svn: 331557	2018-05-04 21:35:00 +00:00
Konstantin Zhuravlyov	3fc4067ac4	AMDGPU/NFC: Fix formatting for 900, 902 ISA Version features llvm-svn: 331553	2018-05-04 20:21:31 +00:00
Konstantin Zhuravlyov	c2c2eb7d01	AMDGPU: Add D16 instructions preserve unused bits feature - Predicate D16 patterns on this new feature - Added this new feature to gfx900/2/4 Differential Revision: https://reviews.llvm.org/D46366 llvm-svn: 331551	2018-05-04 20:06:57 +00:00
Geoff Berry	8e4958e760	[MachineLICM] Debug intrinsics shouldn't affect hoist decisions Summary: When checking if an instruction stores to a given frame index, check that the instruction can write to memory before looking at the memory operands list to avoid e.g. DBG_VALUE instructions that reference a frame index preventing a load from that index from being hoisted. Reviewers: dblaikie, MatzeB, qcolombet, reames, javed.absar Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46284 llvm-svn: 331549	2018-05-04 19:25:09 +00:00
Shoaib Meenai	57fadab1cb	[ObjCARC] Account for catchswitch in bitcast insertion A catchswitch is both a pad and a terminator, meaning it must be the only non-phi instruction in its basic block. When we're inserting a bitcast in the incoming basic block for a phi, if that incoming block is a catchswitch, we should go up the dominator tree to find a valid insertion point rather than attempting to insert before the catchswitch (which would result in invalid IR). Differential Revision: https://reviews.llvm.org/D46412 llvm-svn: 331548	2018-05-04 19:03:11 +00:00
Michael Berg	7acc81b744	Fast Math Flag mapping into SDNode Summary: Adding support for Fast flags in the SDNode to leverage fast math sub flag usage. Reviewers: spatel, arsenm, jbhateja, hfinkel, escha, qcolombet, echristo, wristow, javed.absar Reviewed By: spatel Subscribers: llvm-commits, rampitec, nhaehnle, tstellar, FarhanaAleen, nemanjai, javed.absar, jbhateja, hfinkel, wdng Differential Revision: https://reviews.llvm.org/D45710 llvm-svn: 331547	2018-05-04 18:48:20 +00:00
Simon Pilgrim	0e51a125ea	[X86] Add WriteEMMS scheduler class Filled in the missing values from Btver2 SoG or Agner llvm-svn: 331546	2018-05-04 18:16:13 +00:00
Simon Pilgrim	d7ffbc5c7e	[X86] Finish splitting WriteVecShift and WriteVecIMul to remove InstRW overrides. llvm-svn: 331543	2018-05-04 17:47:46 +00:00
Craig Topper	ded8ee07e9	[LoopIdiomRecognize] Don't create an IRBuilder just to call getTrue/getFalse. We can call the methods in ConstantInt directly. We just need a context. llvm-svn: 331542	2018-05-04 17:39:08 +00:00
Adrian Prantl	3edc63a579	DwarfCompileUnit: Fix another assertion failure on malformed input that is not rejected by the Verifier. Thanks to Björn Pettersson for providing a reproducer! llvm-svn: 331535	2018-05-04 16:10:43 +00:00
Clement Courbet	b18c34bc29	[llvm-exegesis] Fix pfm counter names for BDW. Summary: They are not consistent with other microarchitectures. Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D46434 llvm-svn: 331532	2018-05-04 15:26:12 +00:00
Simon Pilgrim	67cc246dca	[X86] Cleanup SchedWriteFMA classes and use X86SchedWriteWidths directly. Rename scalar and XMM versions, this is to match/simplify an upcoming change to split MUL/DIV/SQRT scalar/xmm/ymm/zmm classes. llvm-svn: 331531	2018-05-04 15:20:18 +00:00
Krzysztof Parzyszek	5291562acd	[Hexagon] Remove leftover debugging code after r331527 llvm-svn: 331528	2018-05-04 15:07:30 +00:00
Krzysztof Parzyszek	effcc2fb79	[Hexagon] Handle non-immediate constants in HexagonSplitDouble llvm-svn: 331527	2018-05-04 15:04:48 +00:00
Simon Dardis	65b0492f0d	[mips] Correct the predicates of sign extension instructions And eliminate the duplication of those instructions for microMIPS32r6. Reviewers: smaksimovic, abeserminji, atanasyan Differential Revision: https://reviews.llvm.org/D46117 llvm-svn: 331526	2018-05-04 15:00:54 +00:00
Simon Pilgrim	bf4c8c0ff2	[X86] Add WriteVecMOVMSKY scheduler class llvm-svn: 331525	2018-05-04 14:54:33 +00:00
Adhemerval Zanella	a57ef17ab6	[AArch64] Custom Lower MULH{S,U} for v16i8, v8i16, and v4i32 This patch adds a custom lowering for ISD::MULH{S,U} used on divide by constant optimization (DAGCombiner::BuildSDIV and DAGCombiner::BuildUDIV). New patterns for smull and umull are added, so AArch64ISD::{S,U}MULL can be correctly lowered to smull2 and umull2. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D46009 llvm-svn: 331522	2018-05-04 14:33:55 +00:00
Krzysztof Parzyszek	af73d2bdd9	[Hexagon] Skip reserved physical registers when updating liveness llvm-svn: 331518	2018-05-04 13:59:05 +00:00
Simon Pilgrim	be51b20127	[X86] Add SchedWriteFRnd fp rounding scheduler classes Split off from SchedWriteFAdd for fp rounding/bit-manipulation instructions. Fixes an issue on btver2 which only had the ymm version using the JSTC pipe instead of JFPA. llvm-svn: 331515	2018-05-04 12:59:24 +00:00
Bjorn Pettersson	27a841fe83	[SelectionDAG] Refactor code by adding RegsForValue::getRegsAndSizes(). NFCI Summary: Added a helper method in RegsForValue to get a list with all the <RegNumber, RegSize> pairs that we want to iterate over in SelectionDAGBuilder::EmitFuncArgumentDbgValue and in SelectionDAGBuilder::visitIntrinsicCall. Reviewers: vsk Reviewed By: vsk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46360 llvm-svn: 331510	2018-05-04 08:50:48 +00:00
Jonas Paulsson	72fe760592	[RegUsageInfoCollector] Bugfix for handling of register aliases. Don't assume the alias of a defined reg is always already in the set. As the test case in https://bugs.llvm.org/show_bug.cgi?id=36587 discovered, it is wrong to assume that all the aliases of the defined register in the current function are already present in the UsedPhysRegsMask. This patch changes this so that any definition in the current function of a phys-reg always results in all its aliases being inserted into the set of defined registers. Review: Quentin Colombet https://reviews.llvm.org/D45157 llvm-svn: 331509	2018-05-04 07:50:05 +00:00
Max Kazantsev	786032c1b7	[IRCE] Fix misuse of dyn_cast which leads to UB llvm-svn: 331508	2018-05-04 07:34:35 +00:00
Michael Zolotukhin	131e74910c	[MachineCSE] Rewrite a loop checking if a block is in a set of blocks without using a set. NFC. Summary: Using a set is unnecessary here and in some cases (see e.g. PR37277) takes a significant amount of time just to insert values into it. In this particular case all we need is just to check if we find the block we are looking for or not. Reviewers: davide Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D46411 llvm-svn: 331502	2018-05-04 01:40:05 +00:00
Craig Topper	9510f70636	[LoopIdiomRecognize] Replace more unchecked dyn_casts with cast. Two of these are immediately dereferenced on the next line. The other two are passed immediately to the IRBuilder constructor which can't handle a nullptr. llvm-svn: 331500	2018-05-04 01:04:28 +00:00
Craig Topper	cafae62ec9	[LoopIdiomRecognize] Use a regular array instead of a SmallVector and explicit ArrayRef. llvm-svn: 331499	2018-05-04 01:04:26 +00:00
Craig Topper	8304231508	[LoopIdiomRecognize] Turn two unchecked dyn_casts into regular casts. These are casts on users of a PHINode to Instruction. I think since PHINode is an Instruction any users would also be Instructions. At least a cast will give us an assertion if it's wrong. llvm-svn: 331498	2018-05-04 01:04:24 +00:00
Tom Stellard	b03c98d1a3	AMDGPU: Make getSubRegFromChannel a static member of AMDGPURegisterInfo Summary: This makes it possible to have R600RegisterInfo and SIRegisterInfo not inherit from AMDGPURegisterInfo. Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D46280 llvm-svn: 331490	2018-05-03 22:38:06 +00:00
Simon Pilgrim	542b20d656	[X86] Add WriteDPPD/WriteDPPS dot product scheduler classes llvm-svn: 331489	2018-05-03 22:31:19 +00:00
Simon Pilgrim	0aed731516	[X86][Znver1] Use SchedAlias to tag microcoded scheduler classes Avoids extra entries in the class tables. Found a typo that missed the MMX_PHSUBSW instruction. llvm-svn: 331488	2018-05-03 22:12:23 +00:00
Justin Bogner	33e0ae0d8f	Fix include of config.h that was incorrectly changed in r331184 The RWMutex implementation depends on config.h macros (specifically HAVE_PTHREAD_H and HAVE_PTHREAD_RWLOCK_INIT), so we need to be including it and not just llvm-config.h here or we fall back to a much slower implementation. llvm-svn: 331487	2018-05-03 21:59:13 +00:00
Sanjay Patel	e7b6654711	[InstCombine] refine select-of-constants to bitwise ops Add logic for the special case when a cmp+select can clearly be reduced to just a bitwise logic instruction, and remove an over-reaching chunk of general purpose bit magic. The primary goal is to remove cases where we are not improving the IR instruction count when doing these select transforms, and in all cases here that is true. In the motivating 3-way compare tests, there are further improvements because we can combine/propagate select values (not sure if that belongs in instcombine, but it's there for now). DAGCombiner has folds to turn some of these selects into bit magic, so there should be no difference in the end result in those cases. Not all constant combinations are handled there yet, however, so it is possible that some targets will see more cmov/csel codegen with this change in IR canonicalization. Ideally, we'll go further to not turn selects into multiple logic/math ops in instcombine, and we'll canonicalize to selects. But we should make sure that this step does not result in regressions first (and if it does, we should fix those in the backend). The general direction for this change was discussed here: http://lists.llvm.org/pipermail/llvm-dev/2016-September/105373.html http://lists.llvm.org/pipermail/llvm-dev/2017-July/114885.html Alive proofs for the new bit magic: https://rise4fun.com/Alive/XG7 Differential Revision: https://reviews.llvm.org/D46086 llvm-svn: 331486	2018-05-03 21:58:44 +00:00
Tom Stellard	abc9871d60	GlobalISel: Use a callback to compute constrained reg class for unallocatable registers Summary: constrainOperandRegClass() currently fails if it tries to constrain the register class of an operand that is defined with an unallocatable register class. This patch resolves this by adding a target callback to compute register constraints in this case. This is required by AMDGPU because many of its instructions have source operands defined with the unallocatable register class VS_32, which is a union of two allocatable register classes, VGPR_32 and SReg_32. Reviewers: dsanders, aditya_nandakumar Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D45991 llvm-svn: 331485	2018-05-03 21:44:16 +00:00
Teresa Johnson	85cc298c1a	[ThinLTO] Add support for optimization remarks to thinBackend Summary: Support was added to the regular LTO backend, but not thinBackend. This patch adds that support. Reviewers: pcc, davide Subscribers: mehdi_amini, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D46376 llvm-svn: 331481	2018-05-03 20:24:12 +00:00
Simon Pilgrim	0720c8d90e	[X86][AVX512] VPLZCNT instructions match SchedWriteVecIMul scheduling class not SchedWriteVecALU. llvm-svn: 331473	2018-05-03 18:22:49 +00:00
Simon Pilgrim	f2d2cedab4	[X86] Split WriteVecShift/WriteVarVecShift into MMX, XMM and YMM/ZMM scheduler classes This took a bit of extra work as on Intel targets the old (V)PSLLDrr/(V)PSLLDrm style instructions act differently - I ended up creating WriteVecShiftImm classes for XMM/YMM/ZMM vector shift by immediate and retaining WriteVecShift as the default (used only by MMX) plus WriteVecShiftX/WriteVecShiftY. X86SchedWriteWidths hides most of this thank goodness. llvm-svn: 331472	2018-05-03 17:56:43 +00:00
Bjorn Pettersson	5479ad2945	[DebugInfo] Correction for an assert in DIExpression::createFragmentExpression Summary: When we create a fragment expression, and there already is an old fragment expression, we assert that the new fragment is within the range for the old fragment. If for example the old fragment expression says that we describe bit 10-16 of a variable (Offset=10, Size=6), and we now want to create a new fragment expression only describing bit 3-6 of the original value, then the resulting fragment expression should have Offset=13, Size=3. The assert is supposed to catch if the resulting fragment expression is outside the range for the old fragment. However, it used to verify that the Offset+Size of the new fragment was smaller than or equal to Offset+Size for the old fragment. What we really want to check is that Offset+Size of the new fragment is smaller than the Size of the old fragment. Reviewers: aprantl, vsk Reviewed By: aprantl Subscribers: davide, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D46391 llvm-svn: 331465	2018-05-03 17:04:21 +00:00
Bjorn Pettersson	304877e5ec	Reapply "[SelectionDAG] Selection of DBG_VALUE using a PHI node result (pt 2)" Summary: This reverts SVN r331441 (reapplies r331337), together with a fix to handle an already existing fragment expression in the dbg.value that must be fragmented due to a split PHI node. This should solve the problem seen in PR37321, which was the reason for the revert of r331337. The situation in PR37321 is that we have a PHI node like this %u.sroa = phi i80 [ %u.sroa.x, %if.x ], [ %u.sroa.y, %if.y ], [ %u.sroa.z, %if.z ] and a dbg.value like this call void @llvm.dbg.value(metadata i80 %u.sroa, metadata !13, metadata !DIExpression(DW_OP_LLVM_fragment, 0, 80)) The phi node is split into three 32-bit PHI nodes %30:gr32 = PHI %11:gr32, %bb.4, %14:gr32, %bb.5, %27:gr32, %bb.8 %31:gr32 = PHI %12:gr32, %bb.4, %15:gr32, %bb.5, %28:gr32, %bb.8 %32:gr32 = PHI %13:gr32, %bb.4, %16:gr32, %bb.5, %29:gr32, %bb.8 but since the original value only is 80 bits we need to adjust the size of the last fragment expression, and with this patch we get DBG_VALUE debug-use %30:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 0, 32) DBG_VALUE debug-use %31:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 32, 32) DBG_VALUE debug-use %32:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 64, 16) Reviewers: vsk, aprantl, mstorsjo Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46384 llvm-svn: 331464	2018-05-03 17:04:16 +00:00
Simon Pilgrim	f7dd6069a5	[X86] Split WriteVecALU/WritePHAdd into XMM and YMM/ZMM scheduler classes llvm-svn: 331453	2018-05-03 13:27:10 +00:00
Tim Northover	28e0a6f7dd	ARM: don't try to over-align large vectors as arguments. By default LLVM thinks very large vectors get aligned to their size when passed across functions. Unfortunately no-one told the ARM backend so it doesn't trigger stack realignment and so accesses can cause the usual misalignment issues (e.g. a data abort). This changes the ABI alignment to the stack alignment, which in practice (and as a bonus) also coincides with the alignment "natural" vectors get. llvm-svn: 331451	2018-05-03 12:54:25 +00:00
Piotr Padlewski	c77ab8ef2f	perform DSE through launder.invariant.group Summary: Alias Analysis knows that llvm.launder.invariant.group returns a pointer that mustalias its argument, but this information wasn't used, therefore we didn't DSE through launder.invariant.group Reviewers: chandlerc, dberlin, bogner, hfinkel, efriedma Reviewed By: dberlin Subscribers: amharc, llvm-commits, nlewycky, rsmith Differential Revision: https://reviews.llvm.org/D31581 llvm-svn: 331449	2018-05-03 11:03:53 +00:00
Piotr Padlewski	5dde809404	Rename invariant.group.barrier to launder.invariant.group Summary: This is one of the initial commits of the "RFC: Devirtualization v2" proposal: https://docs.google.com/document/d/16GVtCpzK8sIHNc2qZz6RN8amICNBtvjWUod2SujZVEo/edit?usp=sharing Reviewers: rsmith, amharc, kuhar, sanjoy Subscribers: arsenm, nhaehnle, javed.absar, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45111 llvm-svn: 331448	2018-05-03 11:03:01 +00:00
Simon Pilgrim	39196a1dd3	[X86][AVX512] VPAVG instructions should be tagged as SchedWriteVecALU llvm-svn: 331446	2018-05-03 10:53:17 +00:00
Simon Pilgrim	93c878c76b	[X86] Split WriteVecIMul/WriteVecPMULLD/WriteMPSAD/WritePSADBW into XMM and YMM/ZMM scheduler classes Also retagged VDBPSADBW instructions as SchedWritePSADBW instead of SchedWriteVecIMul which matches the behaviour on SkylakeServer (the only thing that supports it...) llvm-svn: 331445	2018-05-03 10:31:20 +00:00
Simon Pilgrim	342ac8cd7e	[X86] Update MMX instructions to be tagged with X86SchedWriteWidths types llvm-svn: 331443	2018-05-03 09:11:32 +00:00
Martin Storsjo	67fdea490d	Revert "[SelectionDAG] Selection of DBG_VALUE using a PHI node result (pt 2)" This reverts SVN r331337, see PR37321 for details on the regression it introduced. llvm-svn: 331441	2018-05-03 07:09:33 +00:00
Clement Courbet	6794660828	[TableGen][NFC] Make ResourceCycles definitions more explicit. https://reviews.llvm.org/D46356 llvm-svn: 331439	2018-05-03 06:08:47 +00:00
Craig Topper	856fd68690	[LoopIdiomRecognize] When looking for 'x & (x -1)' for popcnt, make sure the left hand side of the 'and' matches the left hand side of the 'subtract' llvm-svn: 331437	2018-05-03 05:48:49 +00:00
Craig Topper	8ef2abdbc4	[LoopIdiomRecognize] Remove unnecessary cast from BinaryOperator to Instruction. NFC BinaryOperator is a sub class of Instruction. We don't need an explicit cast back to Instruction. llvm-svn: 331432	2018-05-03 05:00:18 +00:00
Max Kazantsev	58fce7e54b	Re-enable "[SCEV] Make computeExitLimit more simple and more powerful" This patch was temporarily reverted because it has exposed bug 37229 on PowerPC platform. The bug is unrelated to the patch and was just a general bug in the optimization done for PowerPC platform only. The bug was fixed by the patch rL331410. This patch returns the disabled commit since the bug was fixed. llvm-svn: 331427	2018-05-03 02:37:55 +00:00
Petr Hosek	87f1343a73	[Support] Support building LLVM for Fuchsia These are necessary changes to support building LLVM for Fuchsia. While these are not sufficient to run on Fuchsia, they are still useful when cross-compiling LLVM libraries and runtimes for Fuchsia. Differential Revision: https://reviews.llvm.org/D46345 llvm-svn: 331423	2018-05-03 01:38:49 +00:00
Shoaib Meenai	a07295f977	[ObjCARC] Convert an if to an early continue. NFC This reduces nesting and makes the logic slightly easier to follow. Differential Revision: https://reviews.llvm.org/D46371 llvm-svn: 331422	2018-05-03 01:20:36 +00:00
Nemanja Ivanovic	61ffbf21cd	Commit r331416 breaks the big-endian PPC bot. On the big endian build, we actually encounter constants wider than 64-bits. Add the guard to prevent tripping the assert. llvm-svn: 331420	2018-05-03 01:04:13 +00:00
Chandler Carruth	e74c354d12	[gcov] Switch to an explicit if clunky array to satisfy some compilers on various build bots that are unhappy with using makeArrayRef with an initializer list. llvm-svn: 331418	2018-05-03 00:11:03 +00:00
Michael Berg	7d1b25d053	MachineInst support mapping SDNode fast math flags for support in Back End code generation Summary: Machine Instruction flags for fast math support and MIR print support Reviewers: spatel, arsenm Reviewed By: arsenm Subscribers: wdng Differential Revision: https://reviews.llvm.org/D45781 llvm-svn: 331417	2018-05-03 00:07:56 +00:00
Nemanja Ivanovic	01e2e79abf	[PowerPC] Implement isMaskAndCmp0FoldingBeneficial Sinking the and closer to a compare against zero is beneficial on PPC as it allows us to emit record-form instructions. In the future, we may expand this to a larger set of operations that feed compares against zero since PPC has lots of record-form instructions. Differential revision: https://reviews.llvm.org/D46060 llvm-svn: 331416	2018-05-02 23:55:23 +00:00
Sam Clegg	4d57fbd02a	[WebAssembly] MC: Create and use first class section symbols Differential Revision: https://reviews.llvm.org/D46335 llvm-svn: 331413	2018-05-02 23:11:38 +00:00
Sam Clegg	87cc4dbde7	[MC] Factor MCObjectStreamer::addFragmentAtoms out of MachO streamer. This code previously existed only in MCMachOStreamer but is useful for WebAssembly too. See: D46335 Differential Revision: https://reviews.llvm.org/D46297 llvm-svn: 331412	2018-05-02 23:01:10 +00:00
Nemanja Ivanovic	2139e99e47	[PowerPC] No CTR loop if the candidate exiting block is in a different loop The CTR loops pass will insert the decrementing branch instruction in an exiting block for the loop being transformed. However if that block is part of another loop as well (whether a nested loop or with irreducible CFG), it is not valid to use that exiting block. In fact, if the loop has irreducible CFG, we don't bother analyzing it and we just bail on the transformation. In practice, this doesn't lead to a noticeable reduction in the number of loops transformed by this pass. Fixes https://bugs.llvm.org/show_bug.cgi?id=37229 Differential Revision: https://reviews.llvm.org/D46162 llvm-svn: 331410	2018-05-02 22:56:04 +00:00
Chandler Carruth	71c3a3fac5	[GCOV] Emit the writeout function as nested loops of global data. Summary: Prior to this change, LLVM would in some cases emit massive writeout functions with many 10s of 1000s of function calls in straight-line code. This is a very wasteful way to represent what are fundamentally loops and creates a number of scalability issues. Among other things, register allocating these calls is extremely expensive. While D46127 makes this less severe, we'll still run into scaling issues with this eventually. If not in the compile time, just from the code size. Now the pass builds up global data structures modeling the inputs to these functions, and simply loops over the data structures calling the relevant functions with those values. This ensures that the code size is a fixed and only data size grows with larger amounts of coverage data. A trivial change to IRBuilder is included to make it easier to build the constants that make up the global data. Reviewers: wmi, echristo Subscribers: sanjoy, mcrosier, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D46357 llvm-svn: 331407	2018-05-02 22:24:39 +00:00
Simon Pilgrim	350c22c587	[X86][SNB] Fix scheduling of MMX integer multiply instructions. The entries were being bound to the wrong class. llvm-svn: 331388	2018-05-02 19:26:14 +00:00
Simon Pilgrim	6732f6ea51	[X86] Split WriteShuffle/WriteVarShuffle + WriteBlend/WriteVarBlend into XMM and YMM/ZMM scheduler classes llvm-svn: 331386	2018-05-02 18:48:23 +00:00
Martin Storsjo	d0b5034b8a	[COFF, ARM64] Hook up a few remaining relocations Differential Revision: https://reviews.llvm.org/D46355 llvm-svn: 331384	2018-05-02 18:24:37 +00:00
Farhana Aleen	07e612340f	[AMDGPU] A trivial fix for a buildbot failure caused by "commit 224a839fcbbead221f872cd32a1dd0c308d37299". Author: FarhanaAleen llvm-svn: 331383	2018-05-02 18:16:39 +00:00
Daniel Sanders	8d0d1aa229	[reassociate] Fix excessive revisits when processing long chains of reassociatable instructions. Summary: Some of our internal testing detected a major compile time regression which I've tracked down to: r278938 - Revert "Reassociate: Reprocess RedoInsts after each inst". It appears that processing long chains of reassociatable instructions causes non-linear (potentially exponential) growth in the number of times an instruction is revisited. For example, the included test revisits instructions 220 times in a 20-instruction test. It appears that r278938 reversed the order instructions were visited and that this is preventing scheduled revisits from being cancelled as a result of visiting the instructions naturally during normal processing. However, simply reversing the order also harmed the generated code. Upon closer inspection, it was discovered that revisits occurred in the opposite order to the first pass (Thanks to escha for spotting that). This patch makes the revisit order consistent with the first pass which allows more revisits to be cancelled. This does appear to have a small impact on the generated code in few cases but it significantly reduces compile-time. After this patch, our internal test that was most affected by the regression dropped from ~2 million revisits to ~4k resulting in Reassociate having 0.46% of the runtime it had before (99.54% improvement). Here's the summaries reported by lnt for the LLVM test-suite with --benchmarking-only: \| metric \| geomean before patch \| geomean after patch \| delta \| \| ----- \| ----- \| ----- \| ----- \| \| compile time \| 0.1956 \| 0.1261 \| -35.54% \| \| execution time \| 0.3240 \| 0.3237 \| - \| \| code size \| 7365.4459 \| 7365.6079 \| - \| The results have a few wins and losses on compile-time, mostly in the +/- 2.5% range. 
There was one outlier though: \| Performance Regressions - compile_time \| Δ \| Previous \| Current \| \| MultiSource/Benchmarks/ASC_Sequoia/CrystalMk/CrystalMk \| 9.82% \| 2.0473 \| 2.2483 \| Reviewers: javed.absar, dberlin Reviewed By: dberlin Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45734 llvm-svn: 331381	2018-05-02 17:59:16 +00:00
Simon Pilgrim	819f218f07	[X86] Cleanup WriteFShuffle/WriteFVarShuffle (+256 variants) scheduler classes with more common default values llvm-svn: 331380	2018-05-02 17:58:50 +00:00
Krzysztof Parzyszek	5e3aba8251	Add assertion to padding size calculation, NFC The size of an object cannot be less than the emitted size of all the contained elements. This would cause an overflow in padding size calculation. Add an assert to catch this. Patch by Suyog Sarda. llvm-svn: 331376	2018-05-02 17:20:22 +00:00
Farhana Aleen	150cb6d91a	Revert "[AMDGPU] performAddCombine should run after DAG is legalized." This reverts commit 6b97d2995566b4dddd6bf0d75579ff44501d4494. llvm-svn: 331371	2018-05-02 16:48:52 +00:00
Simon Pilgrim	a3a4df3708	[X86] Convert most remaining XOP uses of X86SchedWritePair scheduler classes to X86SchedWriteWidths. llvm-svn: 331369	2018-05-02 16:25:41 +00:00
Farhana Aleen	2f4100f56e	[AMDGPU] performAddCombine should run after DAG is legalized. Summary: performAddCombine should run after DAG is legalized; Otherwise generic optimization in the DAGCombiner can optimize an addcarry+trunc into an addcarry instruction with illegal types. Author: FarhanaAleen Reviewed By: rampitec Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D46337 llvm-svn: 331368	2018-05-02 16:24:10 +00:00
Simon Pilgrim	a53d330890	Fix line-endings. NFCI. llvm-svn: 331367	2018-05-02 16:16:24 +00:00
Clement Courbet	d2ff5fb536	Re-land rL331357 "[X86] Fix scheduling info for VMPSADBWYrmi." Without the rebase mess. https://reviews.llvm.org/D46356 llvm-svn: 331362	2018-05-02 14:35:48 +00:00
Simon Pilgrim	86d9f23ded	[X86] Cleanup WriteFMul scheduler classes with more common default values Intel models were targeting x87 instead of packed sse. llvm-svn: 331360	2018-05-02 14:25:32 +00:00
Simon Pilgrim	f53ee8e640	Fix '32-bit shift implicitly converted to 64 bits' warning by using APInt::setBit instead. llvm-svn: 331359	2018-05-02 14:22:30 +00:00
Clement Courbet	0f1da8f365	Revert rL331355 "[X86] Fix scheduling info for VMPSADBWYrmi." It contains unrelated changes. llvm-svn: 331357	2018-05-02 13:54:38 +00:00
Clement Courbet	a1a3095d88	[X86] Fix scheduling info for (V?)SQRTPDm on silvermont. https://reviews.llvm.org/D46356 llvm-svn: 331356	2018-05-02 13:46:14 +00:00
Clement Courbet	eeb2123a83	[X86] Fix scheduling info for VMPSADBWYrmi. https://reviews.llvm.org/D46356 llvm-svn: 331355	2018-05-02 13:40:48 +00:00
Clement Courbet	da54914cde	[MIPS] Fix DIV/DIVU scheduling classes. https://reviews.llvm.org/D46356. llvm-svn: 331354	2018-05-02 13:37:28 +00:00
Simon Pilgrim	a1f1a3bf94	[X86] Convert most remaining AVX512 uses of X86SchedWritePair scheduler classes to X86SchedWriteWidths. We've dealt with the majority already. llvm-svn: 331353	2018-05-02 13:32:56 +00:00
Sander de Smalen	659a48cd38	[AArch64][SVE] Asm: Support for LDR/STR fill and spill instructions. Reviewers: fhahn, rengolin, samparker, SjoerdMeijer, javed.absar Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D46270 llvm-svn: 331352	2018-05-02 13:32:39 +00:00
Simon Tatham	6a02604ee4	[TableGen] Don't quote variable name when printing !foreach. An input !foreach expression such as !foreach(a, lst, !add(a, 1)) would be re-emitted by llvm-tblgen -print-records with the first argument in quotes, giving !foreach("a", lst, !add(a, 1)), which isn't valid TableGen input syntax. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46352 llvm-svn: 331351	2018-05-02 13:17:26 +00:00
Sander de Smalen	57da042e32	[AArch64][SVE] Asm: Support for scatter ST1 store instructions. Reviewers: fhahn, rengolin, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D46248 llvm-svn: 331349	2018-05-02 13:00:30 +00:00
Simon Dardis	694fde215e	Revert "[mips] Correct the predicates of sign extension instructions" I accidentally committed this patch after asking for a review, but it has not been reviewed yet. This reverts r331346. llvm-svn: 331348	2018-05-02 12:35:29 +00:00
Simon Pilgrim	e8671ef434	[X86] Convert most remaining uses of X86SchedWritePair scheduler classes to X86SchedWriteWidths. We've dealt with the majority already. llvm-svn: 331347	2018-05-02 12:27:54 +00:00
Simon Dardis	7a36495bf7	[mips] Correct the predicates of sign extension instructions And eliminate the duplication of those instructions for microMIPS32r6. llvm-svn: 331346	2018-05-02 12:25:33 +00:00

113570 Commits