llvm-project

Commit Graph

Author	SHA1	Message	Date
Sjoerd Meijer	d16037d9bb	[ARM] Support for v4f16 and v8f16 vectors This is the groundwork for adding the Armv8.2-A FP16 vector intrinsics, which uses v4f16 and v8f16 vector operands and return values. All the moving parts are tested with two intrinsics, a 1-operand v8f16 and a 2-operand v4f16 intrinsic. In a follow-up patch the rest of the intrinsics and tests will be added. Differential Revision: https://reviews.llvm.org/D44538 llvm-svn: 327839	2018-03-19 13:35:25 +00:00
Xin Tong	116c309181	Stylish change. NFC llvm-svn: 327838	2018-03-19 13:35:23 +00:00
Andrea Di Biagio	91ab2ee954	[llvm-mca] Add pipeline stall events. This patch introduces a new class named HWStallEvent (see HWEventListener.h), and updates the event listener interface. A HWStallEvent represents a pipeline stall caused by the lack of hardware resources. Similarly to HWInstructionEvent, the event type is an unsigned, and the exact meaning depends on the subtarget. At the moment, HWStallEvent supports a few generic dispatch events. The main goal of this patch is to move the logic that counts dispatch stalls from the DispatchUnit to the BackendStatistics view. Previously, DispatchUnit was responsible for counting and classifying dispatch stall events. With this patch, we delegate the task of counting and classifying stall events to the listeners (i.e. in our case, it is view "BackendStatistics"). So, the DispatchUnit doesn't have to do extra (unnecessary) bookkeeping. This patch also helps further simplify the Backend interface. Now class BackendStatistics no longer has to query the Backend interface to obtain the number of dispatch stalls. As a consequence, we can get rid of all the 'getNumXXX()' methods from class Backend. The long term goal is to remove all the remaining dependencies between the Backend and the BackendStatistics interface. Differential Revision: https://reviews.llvm.org/D44621 llvm-svn: 327837	2018-03-19 13:23:07 +00:00
Hans Wennborg	f646bcc2de	build_llvm_package.bat: Drop LLDB from the package. I don't think anyone ever got this to work, what with getting exactly the right Python dependency and so on. Removing it simplifies the script, removes a number of hairy dependencies, and cuts ~30 MB off the installer size. llvm-svn: 327835	2018-03-19 13:05:37 +00:00
Jonas Paulsson	a6216ec4cc	[SystemZ] Bugfix of CC liveness in emitMemMemWrapper (CLC). If DoneMBB becomes empty it must have CC added to its live-in list, since it will fall-through into EndMBB. This happens when the CLC loop does the complete range. Review: Ulrich Weigand llvm-svn: 327834	2018-03-19 13:05:22 +00:00
Hans Wennborg	13e8a85820	HexagonISelLowering.cpp: fix 'enum in bool context' warning llvm-svn: 327832	2018-03-19 12:55:58 +00:00
Alex Bradbury	0171a9f4ec	[RISCV] Peephole optimisation for load/store of global values or constant addresses (load (add base, off), 0) -> (load base, off) (store val, (add base, off)) -> (store val, base, off) This is similar to an equivalent peephole optimisation in PPCISelDAGToDAG. llvm-svn: 327831	2018-03-19 11:54:28 +00:00
Alexander Potapenko	fa0217276a	[MSan] fix the types of RegSaveAreaPtrPtr and OverflowArgAreaPtrPtr Despite their names, RegSaveAreaPtrPtr and OverflowArgAreaPtrPtr used to be i8* instead of i8**. This is important, because these pointers are dereferenced twice (first in CreateLoad(), then in getShadowOriginPtr()), but for some reason MSan allowed this - most certainly because it was possible to optimize getShadowOriginPtr() away at compile time. Differential revision: https://reviews.llvm.org/D44520 llvm-svn: 327830	2018-03-19 10:08:04 +00:00
Alexander Potapenko	014ff63f24	[MSan] Don't create zero offsets in getShadowPtrForArgument(). NFC For MSan instrumentation with MS.ParamTLS and MS.ParamOriginTLS being TLS variables, the CreateAdd() with ArgOffset==0 is a no-op, because the compiler is able to fold the addition of 0. But for KMSAN, which receives ParamTLS and ParamOriginTLS from a call to the runtime library, this introduces a stray instruction which complicates reading/testing the IR. Differential revision: https://reviews.llvm.org/D44514 llvm-svn: 327829	2018-03-19 10:03:47 +00:00
Alexander Potapenko	e0bafb4359	[MSan] Introduce insertWarningFn(). NFC This is a step towards the upcoming KMSAN implementation patch. KMSAN is going to use a different warning function, __msan_warning_32(uptr origin), so we'd better create the warning calls in one place. Differential Revision: https://reviews.llvm.org/D44513 llvm-svn: 327828	2018-03-19 09:59:44 +00:00
Mikhail Maltsev	f07278ec31	[ARM] Fix warnings about missing parentheses in ARMAsmParser llvm-svn: 327827	2018-03-19 09:48:58 +00:00
Serguei Katkov	7d0664b41f	[SCEV] Factor out isKnownViaInduction. NFC. This just extracts the isKnownViaInduction from isKnownPredicate. Reviewers: sanjoy, mkazantsev, reames Reviewed By: mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44554 llvm-svn: 327824	2018-03-19 08:32:09 +00:00
Serguei Katkov	529f42331e	[SCEV] Re-land: Fix isKnownPredicate This is a re-land of https://reviews.llvm.org/rL327362 with a fix and regression test. The crash occurred because, for the found MDL loop, LHS or RHS may contain an invariant unknown SCEV which does not dominate the MDL. Please see the regression test for an example. Reviewers: sanjoy, mkazantsev, reames Reviewed By: mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44553 llvm-svn: 327822	2018-03-19 06:35:30 +00:00
Craig Topper	e18fbab988	[X86] Merge XADD8rr regular expression with XADD16rr/XADD32rr/XADD64rr in a couple scheduler models. llvm-svn: 327821	2018-03-19 04:21:42 +00:00
Craig Topper	d10ceffa5f	[X86] Add ADD16i16/ADD32i32/ADD64i32 and similar to the scheduler models to match ADD8i8. Also move ADC8i8 and SBB8i8 in the Sandy Bridge model to the same class as ADC8ri and SBB8ri. That seems more accurate since the 8i8 form is just the register forced to AL instead of coming from modrm. llvm-svn: 327820	2018-03-19 04:21:40 +00:00
Craig Topper	e9c99d32b3	[X86] Remove two unused InstrItinClass llvm-svn: 327819	2018-03-19 02:07:32 +00:00
Craig Topper	793733a6c8	[X86] Use IIC_CMOV64_RR/RM on 64-bit cmov instructions. llvm-svn: 327817	2018-03-19 00:56:12 +00:00
Craig Topper	9b60dcb29b	[X86] Merge 32 and 64-bit RORX/SHLX/SARX/SHRX into single regular expressions in scheduler models. llvm-svn: 327816	2018-03-19 00:56:11 +00:00
Craig Topper	13a1650d8a	[X86] Merge 8-bit instructions into instregex with 16/32/64 instructions in the scheduler models as much as possible. NFCI This reduces the total number of generated scheduler classes from 5404 to 5316. llvm-svn: 327815	2018-03-19 00:56:09 +00:00
Dylan McKay	a35ee70641	[AVR] Lower i128 divisions to runtime library calls This patch adds i128 division support by instructing LLVM to lower 128-bit divisions to the __udivmodti4 and __divmodti4 rtlib functions. This also adds tests for 64-bit division and 128-bit division. Patch by Peter Nimmervoll. llvm-svn: 327814	2018-03-19 00:55:50 +00:00
Craig Topper	f545cfee52	[Mips] Remove duplicate lines from MipsScheduleP5600.td and enable FullInstRWOverlapCheck. This fixes the errors found by the new check added in r327808. llvm-svn: 327813	2018-03-18 22:16:54 +00:00
Craig Topper	75aeb62eb4	[AArch64] Fix a few InstRWs in the A53 scheduler model and enable FullInstRWOverlapCheck. This fixes the errors found by the new check added in r327808. llvm-svn: 327812	2018-03-18 22:16:53 +00:00
Craig Topper	f7fdfb3fb6	[SelectionDAG] Don't default the SelectionDAG* parameter to SDValue::dump to nullptr. Use two different signatures instead. This matches what we do in SDNode. This should allow SDValue::dump to be used in the debugger without getting an error if you don't pass an argument. llvm-svn: 327811	2018-03-18 21:28:11 +00:00
Craig Topper	e1d6a4df1c	[TableGen] When trying to reuse a scheduler class for instructions from an InstRW, make sure we haven't already seen another InstRW containing this instruction on this CPU. This is similar to the check later when we remap some of the instructions from one class to a new one. But if we reuse the class we don't get to do that check. So many CPUs have violations of this check that I had to add a flag to the SchedMachineModel to allow it to be disabled. Hopefully we can get those cleaned up quickly and remove this flag. A lot of the violations are due to overlapping regular expressions, but that's not the only kind of issue it found. llvm-svn: 327808	2018-03-18 19:56:15 +00:00
Simon Pilgrim	203876f104	[X86][Btver2] Fix crc32 schedule costs The default is currently FAdd for some reason llvm-svn: 327807	2018-03-18 19:54:42 +00:00
Simon Pilgrim	13cd3b0961	[X86][Btver2] Add crc32 resource tests llvm-svn: 327805	2018-03-18 18:55:34 +00:00
Simon Pilgrim	c3db8c7cda	[X86][Btver2] FADD/FHADD ymm instructions are double pumped on the JFPA functional pipe llvm-svn: 327804	2018-03-18 18:45:57 +00:00
Simon Pilgrim	036cc82622	[X86][Btver2] Float bitwise ymm instructions are double pumped on the JFPX (JFPA/JFPM) functional pipes llvm-svn: 327803	2018-03-18 17:10:12 +00:00
Simon Pilgrim	87d2f7463f	[X86][Btver2] F16C instructions are performed on the JSTC functional pipe llvm-svn: 327801	2018-03-18 15:59:51 +00:00
Anastasis Grammenos	3a589103a4	[LICM] Salvage DI from dying Instructions LICM deletes trivially dead instructions which it won't attempt to sink. Attempt to salvage debug values which reference these instructions. llvm-svn: 327800	2018-03-18 15:59:19 +00:00
Roman Lebedev	e6da3063a5	[InstCombine] peek through unsigned FP casts for zero-equality compares (PR36682) Summary: This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H See also D44416 Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44424 llvm-svn: 327799	2018-03-18 15:53:02 +00:00
Andrea Di Biagio	e64f3b108d	[llvm-mca] Allow the definition of multiple register files. This is a refactoring in preparation for two other changes that will allow scheduling models to define multiple register files. This is the first step towards fixing PR36662. class RegisterFile (in Dispatch.h) now can emulate multiple register files. Internally, it tracks the number of available physical registers in each register file (described by class RegisterFileInfo). Each register file is associated with a list of MCRegisterClass indices. Knowing the register class indices allows mapping physical registers to register files. The long term goal is to allow processor models to optionally specify how many register files are implemented via tablegen. Differential Revision: https://reviews.llvm.org/D44488 llvm-svn: 327798	2018-03-18 15:33:27 +00:00
Sanjay Patel	63b1028953	[InstCombine] add nnan requirement for sqrt(x) * sqrt(y) -> sqrt(x*y) This is similar to D43765. llvm-svn: 327797	2018-03-18 14:32:54 +00:00
Sanjay Patel	95ec4a4dfe	[InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X As shown in the code comment, we don't need all of 'fast', but we do need reassoc + nsz + nnan. Differential Revision: https://reviews.llvm.org/D43765 llvm-svn: 327796	2018-03-18 14:12:25 +00:00
Simon Pilgrim	541992203d	[X86][Btver2] Strip default latency/resource values. NFCI. llvm-svn: 327795	2018-03-18 13:16:11 +00:00
Simon Pilgrim	40f6d6ad0b	[X86][Btver2] SSE4A EXTRQ/INSERTQ instructions are performed on the JVALU0/JVALU1 functional pipes llvm-svn: 327794	2018-03-18 13:05:09 +00:00
Simon Pilgrim	e16790b133	[X86][Btver2] Modelled float bitwise instructions as being performed on the float cluster (FPA/FPM) not the integer. llvm-svn: 327793	2018-03-18 12:37:35 +00:00
Jonas Devlieghere	163326d10c	[dsymutil] Fix add_llvm_tool_symlink Update the arguments to add_llvm_tool_symlink to symlink llvm-dsymutil to dsymutil. llvm-svn: 327792	2018-03-18 12:27:05 +00:00
Simon Pilgrim	e409f84e7e	[X86][Btver2] Correctly distinguish between scheduling pipe and functional unit for JWriteResFpuPair defs Jaguar's FPU has 2 scheduler pipes (JFPU0/JFPU1) which forward to multiple functional sub-units each. We need to model that a micro-op will consume both the scheduler pipe and a functional unit. This patch just handles the ops defined through JWriteResFpuPair, I'll go through the custom cases later. llvm-svn: 327791	2018-03-18 12:09:17 +00:00
Jonas Devlieghere	a6ef1abc09	[dsymutil] Rename llvm-dsymutil -> dsymutil Now that almost all functionality of Apple's dsymutil has been upstreamed, the open source variant can be used as a drop in replacement. Hence we feel it's no longer necessary to have the llvm prefix. Differential revision: https://reviews.llvm.org/D44527 llvm-svn: 327790	2018-03-18 11:38:41 +00:00
Simon Pilgrim	f86d48b3ae	[X86][Btver2] Merge equivalent VBLENDVY + VPERMILY schedule groups Thanks to Craig Topper for noticing this. llvm-svn: 327789	2018-03-18 10:22:35 +00:00
Simon Pilgrim	0ba4a0f3a6	[X86][Btver2] Add llvm-mca tests to show pipe resource usage of most vector instructions Hopefully these tests can be easily reused should any other subtarget get in depth llvm-mca coverage (we can either copy the tests or move them into a common dir and run it with multiple prefixes). llvm-svn: 327788	2018-03-18 09:32:38 +00:00
Craig Topper	2d451e73f9	[X86] Fix a bunch of overlapping regular expressions in the scheduler models. llvm-svn: 327787	2018-03-18 08:38:06 +00:00
Craig Topper	86b02cf076	[X86] Fix a couple typos in the Zen scheduler model. llvm-svn: 327786	2018-03-18 08:38:04 +00:00
Craig Topper	93dd77d2dc	[TableGen] Remove unnecessary uses of make_range. llvm-svn: 327785	2018-03-18 08:38:03 +00:00
Craig Topper	7f31e735c9	[TableGen] Move some variables into for loop declaration. NFC They aren't needed after the loop. llvm-svn: 327784	2018-03-18 08:38:02 +00:00
Craig Topper	89dcda3e90	[X86] Remove MMX_MASKMOVQ64 and VMASKMOVDQU from scheduler models. The information was so wildly inaccurate and incomplete it's better to just remove it. MMX_MASKMOVQ64 showed up twice in several scheduler models. In Haswell and Broadwell they were on adjacent lines. On Skylake the copies had different information. MMX_MASKMOVQ and MASKMOVDQU were completely missing. MMX_MASKMOVQ64 was listed on Haswell/Broadwell as 1 cycle on port 1 despite it being a store instruction. Filed PR36780 to track fixing this right. llvm-svn: 327783	2018-03-18 03:24:42 +00:00
Martin Storsjo	36d6419cc5	[AArch64] Skip an unnecessary getCopyToReg in DYNAMIC_STACKALLOC Differential Revision: https://reviews.llvm.org/D44586 llvm-svn: 327779	2018-03-17 20:08:48 +00:00
Nirav Dave	5f0ab71b62	Revert "[DAG, X86] Revert r327197 "Revert r327170, r327171, r327172"" as it times out building test-suite on PPC. llvm-svn: 327778	2018-03-17 19:24:54 +00:00
Nirav Dave	982d3a56ea	[DAG, X86] Revert r327197 "Revert r327170, r327171, r327172" Reland ISel cycle checking improvements after simplifying and reducing node id invariant traversal. llvm-svn: 327777	2018-03-17 17:42:10 +00:00
Sylvestre Ledru	543f15b028	Fix some user facing typos llvm-svn: 327776	2018-03-17 17:30:08 +00:00
Matt Arsenault	abdc4f2dc7	AMDGPU/GlobalISel: Cleanup constant legality llvm-svn: 327774	2018-03-17 15:17:48 +00:00
Matt Arsenault	685d1e8157	AMDGPU/GlobalISel: Basic G_GEP legality llvm-svn: 327773	2018-03-17 15:17:45 +00:00
Matt Arsenault	85803366d6	AMDGPU/GlobalISel: Basic legality for load/store llvm-svn: 327772	2018-03-17 15:17:41 +00:00
Chandler Carruth	7e71129be4	[bindings/go] Add a missing `,` in the test code to fix a go compile failure. llvm-svn: 327771	2018-03-17 15:12:52 +00:00
Oren Ben Simhon	fdd72fd522	[X86] Added support for nocf_check attribute for indirect Branch Tracking X86 supports Indirect Branch Tracking (IBT) as part of Control-Flow Enforcement Technology (CET). IBT instruments ENDBR instructions used to specify valid targets of indirect call / jmp. The `nocf_check` attribute has two roles in the context of X86 IBT technology: 1. Appertains to a function - do not add ENDBR instruction at the beginning of the function. 2. Appertains to a function pointer - do not track the target function of this pointer by adding nocf_check prefix to the indirect-call instruction. This patch implements `nocf_check` context for Indirect Branch Tracking. It also auto generates `nocf_check` prefixes before indirect branches to jump tables that are guarded by range checks. Differential Revision: https://reviews.llvm.org/D41879 llvm-svn: 327767	2018-03-17 13:29:46 +00:00
Jonas Paulsson	dbcf1bf503	[SystemZ] Add 'REQUIRES: asserts' to test case using debug output. llvm-svn: 327766	2018-03-17 09:15:13 +00:00
Jonas Paulsson	138960770c	[SystemZ] computeKnownBitsForTargetNode() / ComputeNumSignBitsForTargetNode() Improve/implement these methods to improve DAG combining. This mainly concerns intrinsics. Some constant operands to SystemZISD nodes have been marked Opaque to avoid transforming back and forth between generic and target nodes infinitely. Review: Ulrich Weigand llvm-svn: 327765	2018-03-17 08:32:12 +00:00
Jonas Paulsson	e9f7fa83d5	[SelectionDAG] Handle big endian target BITCAST in computeKnownBits() The BITCAST handling in computeKnownBits() previously only worked for little endian. This patch reverses the iteration over elements for a big endian target which allows this to work in this case also. SystemZ test case. Review: Eli Friedman https://reviews.llvm.org/D44249 llvm-svn: 327764	2018-03-17 08:04:00 +00:00
Chandler Carruth	196a9fab82	[GlobalsAA] Fix a pretty terrible bug that has been in GlobalsAA for a long time. The key thing is that we need to create value handles for every function that we create a `FunctionInfo` object around. Without this, when that function is deleted we can end up creating a new function that collides with its address and look up a stale AA result. With that AA result we can in turn miscompile code in ways that break. This is seriously one of the most absurd miscompiles I've seen. It only reproduced for us recently and only when building a very large server with both ThinLTO and PGO. A HUGE shout out to Wei Mi who tracked all of this down and came up with this patch. I'm just landing it because I happened to still be at a computer. He or I can work on crafting a test case to hit this (now that we know what to target) but it'll take a while, and we've been chasing this for a long time and need it fixed Right Now. llvm-svn: 327761	2018-03-16 23:51:33 +00:00
Jessica Paquette	b3e7dc9144	[MachineOutliner] Make KILLs invisible At the point the outliner runs, KILLs don't impact anything, but they're still considered unique instructions. This commit makes them invisible like DebugValues so that they can still be outlined without impacting outlining decisions. llvm-svn: 327760	2018-03-16 22:53:34 +00:00
Andrea Di Biagio	09771ad2ca	[llvm-mca] Remove method getSchedModel() from the Backend. llvm-svn: 327756	2018-03-16 22:21:52 +00:00
Andrea Di Biagio	f6766b0e45	[llvm-mca] Remove unused methods from Backend. NFC llvm-svn: 327749	2018-03-16 22:02:47 +00:00
David L Kreitzer	febf70a9be	Quiet unused variable warnings. NFC. Differential revision: https://reviews.llvm.org/D44583 llvm-svn: 327745	2018-03-16 21:21:23 +00:00
Craig Topper	25007c4f32	[X86] Pass SelectionDAG into X86ISelAddressMode::dump and on to SDNode::dump. This prevents a crash in SelectionDAGDumper with -debug when trying to print mem operands if one of the registers in the addressing mode comes from a load. llvm-svn: 327744	2018-03-16 21:10:07 +00:00
Sanjay Patel	5a5c33d8b5	[InstSimplify] add NaN constant diversity; NFC llvm-svn: 327743	2018-03-16 20:55:55 +00:00
Krzysztof Parzyszek	f81a8d03c1	[Hexagon] Avoid bank conflicts in post-RA scheduler Avoid scheduling two loads in such a way that they would end up in the same packet. If there is a load in a packet, try to schedule a non-load next. Patch by Brendon Cahoon. llvm-svn: 327742	2018-03-16 20:55:49 +00:00
Krzysztof Parzyszek	889cbcacbc	[Hexagon] Add lit testcases for atomic intrinsics Patch by Ben Craig. llvm-svn: 327737	2018-03-16 20:21:43 +00:00
Reid Kleckner	f8b51c5f90	[IR] Avoid the need to prefix MS C++ symbols with '\01' Now the Windows mangling modes ('w' and 'x') do not do any mangling for symbols starting with '?'. This means that clang can stop adding the hideous '\01' leading escape. This means LLVM debug logs are less likely to contain ASCII escape characters and it will be easier to copy and paste MS symbol names from IR. Finally. For non-Windows platforms, names starting with '?' still get IR mangling, so once clang stops escaping MS C++ names, we will get extra '_' prefixing on MachO. That's fine, since it is currently impossible to construct a triple that uses the MS C++ ABI in clang and emits macho object files. Differential Revision: https://reviews.llvm.org/D7775 llvm-svn: 327734	2018-03-16 20:13:32 +00:00
Reid Kleckner	2aeb930a9f	Revert r327721 "This patch fixes the invalid usage of OptSize in Machine Combiner." It causes asserts when compiling Chromium on Win32 with optimizations. We compile many things with -Os. llvm-svn: 327733	2018-03-16 20:11:55 +00:00
Craig Topper	f0815e01d8	[X86] Merge ADDSUB/SUBADD detection into single methods that can detect either and indicate what they found. Previously, we called the same functions twice with a bool flag determining whether we should look for ADDSUB or SUBADD. It would be more efficient to run the code once and detect either pattern with a flag to tell which type it found. Differential Revision: https://reviews.llvm.org/D44540 llvm-svn: 327730	2018-03-16 18:25:59 +00:00
Craig Topper	71d69b2ea5	[CorrelatedValuePropagation] Use SelectInst::getCondition/getTrueValue/getFalseValue instead of getOperand for readability. NFC llvm-svn: 327728	2018-03-16 18:18:47 +00:00
Farhana Aleen	c6c9dc8773	[AMDGPU] Supported ds_write_b128 generation. Summary: This is a follow-on patch of https://reviews.llvm.org/D44210 Author: FarhanaAleen Reviewed By: msearles Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D44319 llvm-svn: 327726	2018-03-16 18:12:00 +00:00
Craig Topper	e6913ec340	[X86] Post process the DAG after isel to remove vector moves that were added to zero upper bits. We previously avoided inserting these moves during isel in a few cases which is implemented using a whitelist of opcodes. But it's too difficult to generate a perfect list of opcodes to whitelist. Especially with AVX512F without AVX512VL using 512 bit vectors to implement some 128/256 bit operations. Since isel is done bottom up, we'd have to check the VT and opcode and subtarget in order to determine whether an EXTRACT_SUBREG would be generated for some operations. So instead of doing that, this patch adds a post processing step that detects when the moves are unnecessary after isel. At that point any EXTRACT_SUBREGs would have already been created and appear in the DAG. So then we just need to ensure the input to the move isn't one. Differential Revision: https://reviews.llvm.org/D44289 llvm-svn: 327724	2018-03-16 17:13:42 +00:00
Dmitry Preobrazhensky	4c8f4234b6	[AMDGPU][MC][GFX8][GFX9][DISASSEMBLER] Added "_e32" suffix to 32-bit VINTRP opcodes See bug 36751: https://bugs.llvm.org/show_bug.cgi?id=36751 Differential Revision: https://reviews.llvm.org/D44529 Reviewers: artem.tamazov, arsenm llvm-svn: 327723	2018-03-16 16:38:04 +00:00
Philip Reames	8a106272e8	[LICM/mustexec] Extend first iteration must execute logic to fcmps This builds on the work from https://reviews.llvm.org/D44287. It turned out supporting fcmp was much easier than I realized, so let's do that now. As an aside, our -O3 handling of floating point IVs leaves a lot to be desired. We do convert the float IV to an integer IV, but do so late enough that many other optimizations are missed (e.g. we don't vectorize). Differential Revision: https://reviews.llvm.org/D44542 llvm-svn: 327722	2018-03-16 16:33:49 +00:00
Andrew V. Tischenko	a0cd09d4a2	This patch fixes the invalid usage of OptSize in Machine Combiner. Differential Revision: https://reviews.llvm.org/D43813 llvm-svn: 327721	2018-03-16 16:06:24 +00:00
Dmitry Preobrazhensky	9c1a6e7e24	[AMDGPU][MC] Corrected default values for unused SDWA operands See bug 36355: https://bugs.llvm.org/show_bug.cgi?id=36355 Differential Revision: https://reviews.llvm.org/D44481 Reviewers: artem.tamazov, arsenm llvm-svn: 327720	2018-03-16 15:40:27 +00:00
Sanjay Patel	2b94927f0d	[InstCombine] add nnan requirement to potential fabs folds tests; NFC As noted in D44550, we can't guarantee preserving the sign-bit of NaN if we convert these to fabs(). llvm-svn: 327718	2018-03-16 15:27:39 +00:00
Jonas Paulsson	a9f05a9d50	[SystemZ] Make AnyRegBitRegClass unallocatable. AnyReg is just for the assembler and it is better to have it as not allocatable in order to simplify (make more intuitive) the RegPressureSets. Review: Ulrich Weigand llvm-svn: 327715	2018-03-16 15:21:26 +00:00
Aditya Nandakumar	573102e344	[GISel]: Remove unused header include in MachineIRBuilder.h llvm-svn: 327714	2018-03-16 15:14:18 +00:00
Brian M. Rzycki	f65ddc5fa2	[JumpThreading] Track unreachable BBs to avoid processing JumpThreading iterates over F until the IR quiesces. Transforming unreachable BBs increases compile time and it is also possible to never stabilize causing JumpThreading to hang. An older attempt at fixing this problem was D3991 where removeUnreachableBlocks(F) was called before JumpThreading began. This has a few drawbacks: * expensive - the routine attempts to fix up the IR to identify additional BBs that can be removed along with unreachable BBs. * aggressive - does not identify and preserve the shape of the IR. At a minimum it does not preserve loop hierarchies. * invasive - altering reachable blocks it may disrupt IR shapes that could have otherwise been JumpThreaded. This patch avoids removeUnreachableBlocks(F) and instead tracks unreachable BBs in a SmallPtrSet using DominatorTree to validate the initial state of all BBs. We then rely on subsequent passes to identify and remove these unreachable blocks from F. Reviewers: dberlin, sebpop, kuhar, dinesh.d Reviewed by: sebpop, kuhar Subscribers: hiraditya, uabelho, llvm-commits Differential Revision: https://reviews.llvm.org/D44177 llvm-svn: 327713	2018-03-16 15:13:47 +00:00
Krzysztof Parzyszek	9915291ab8	[Hexagon] Fix zero-extending non-HVX bool vectors llvm-svn: 327712	2018-03-16 15:03:37 +00:00
Mikhail Maltsev	ed1c8bfec2	[ARM] Convert more invalid NEON immediate loads Summary: Currently the LLVM MC assembler is able to convert e.g. vmov.i32 d0, #0xabababab (which is technically invalid) into a valid instruction vmov.i8 d0, #0xab this patch adds support for vmov.i64 and for cases with the resulting load types other than i8, e.g.: vmov.i32 d0, #0xab00ab00 -> vmov.i16 d0, #0xab00 Reviewers: olista01, rengolin Reviewed By: rengolin Subscribers: rengolin, javed.absar, kristof.beyls, rogfer01, llvm-commits Differential Revision: https://reviews.llvm.org/D44467 llvm-svn: 327709	2018-03-16 14:10:56 +00:00
Simon Pilgrim	23578e7d3c	[X86][Btver2] Add correct mul/imul schedule costs Integer multiply is performed on the JMul function unit and i64 requires double pumping llvm-svn: 327707	2018-03-16 14:01:01 +00:00
Simon Pilgrim	8d28ae6aec	[X86][Btver2] Add correct lzcnt/tzcnt/popcnt schedule costs Don't use WriteIMul defaults llvm-svn: 327706	2018-03-16 13:43:55 +00:00
Mikhail Maltsev	8dcf6fa308	[ARM] Fix a check in vmov/vmvn immediate parsing Summary: Currently the check is incorrect and the following invalid instruction is accepted and incorrectly assembled: vmov.i32 d2, #0x00a500a6 This patch fixes the issue. Reviewers: olista01, rengolin Reviewed By: rengolin Subscribers: SjoerdMeijer, javed.absar, rogfer01, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D44460 llvm-svn: 327704	2018-03-16 12:46:49 +00:00
Matthew Simpson	eacfefd056	[AArch64] Implement getArithmeticReductionCost This patch provides an implementation of getArithmeticReductionCost for AArch64. We can specialize the cost of add reductions since they are computed using the 'addv' instruction. Differential Revision: https://reviews.llvm.org/D44490 llvm-svn: 327702	2018-03-16 11:34:15 +00:00
Pavel Labath	3461b1e097	HashTableTest: squelch some "comparison of integers of different signs" warnings llvm-svn: 327701	2018-03-16 10:30:26 +00:00
Pavel Labath	906b777a6a	DWARFVerifier: Enhance validation of .debug_names hash tables Summary: This patch adds more checks to the .debug_names validator. Specifically, they check for: - buckets claiming to be non-empty but pointing to mismatched hashes (most consumers would interpret this as an empty bucket, but it is questionable whether the generator meant that) - hashes that are not reachable from any bucket - names with incorrect hashes Together, these checks ensure that any name in the index can be reached through the hash table using the regular lookup algorithm. We also warn if we encounter a name index without a hash table. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44433 llvm-svn: 327699	2018-03-16 10:02:16 +00:00
Matthew Simpson	883e96c9d4	[TTI, AArch64] Allow the cost model analysis to test vector reduce intrinsics This patch considers the experimental vector reduce intrinsics in the default implementation of getIntrinsicInstrCost. The cost of these intrinsics is computed with getArithmeticReductionCost and getMinMaxReductionCost. This patch also adds a test case for AArch64 that indicates the costs we currently compute for vector reduce intrinsics. These costs are inaccurate and will be updated in a follow-on patch. Differential Revision: https://reviews.llvm.org/D44489 llvm-svn: 327698	2018-03-16 10:00:30 +00:00
Sjoerd Meijer	d391a1a985	[ARM] FP16 codegen support for VSEL This implements lowering of SELECT_CC for f16s, which enables codegen of VSEL with f16 types. Differential Revision: https://reviews.llvm.org/D44518 llvm-svn: 327695	2018-03-16 08:06:25 +00:00
Max Kazantsev	2e7fec7c90	[NFC] Void variables used for asserts only llvm-svn: 327693	2018-03-16 05:02:24 +00:00
Sanjay Patel	728269c10d	[InstCombine] add more tests for fcmp+select -> fabs; NFC This should correspond to the patterns in D44091 and might make handling these in the DAG unnecessary. llvm-svn: 327689	2018-03-16 01:06:33 +00:00
Simon Pilgrim	14e5a1b05b	[X86][Btver2] Add support for multiple pipelines stages for x86 scalar schedules. NFCI. This allows us to use JWriteResIntPair for complex schedule classes (like WriteIDiv) as well as single pipe instructions. llvm-svn: 327686	2018-03-15 23:46:12 +00:00
Simon Pilgrim	9c4157bb70	[X86][Btver2] Tweak pipes test to remove register dependencies It gives us a better view of pipe usage in the timeline which is what the test is trying to show. llvm-svn: 327685	2018-03-15 23:15:11 +00:00
Craig Topper	1b8cf49704	[SelectionDAG][ARM][X86] Teach PromoteIntRes_SETCC to do a better job picking the result type for the setcc. Previously if getSetccResultType returned an illegal type we just fell back to using the default promoted type. This appears to have been to handle the case where for vectors getSetccResultType returns the input type, but the input type itself isn't legal and will need to be promoted. Without the legality check we would never reach a legal type. But just picking the promoted type to be the setcc type can create strange setccs where the result type is 128 bits and the operand type is 256 bits. If for example the result type was promoted to v8i16 from v8i1, but the input type was promoted from v8i23 to v8i32. We currently handle this with custom lowering code in X86. This legality check also caused us to reject the getSetccResultType when the input type needed to be widened or split. Even though that result wouldn't have caused legalization to get stuck. This patch tries to fix this by detecting that the getSetccResultType needs to be promoted. If its input type also needs to be promoted we'll try to ask for a new setcc result type based on its eventual promoted value. Otherwise we fall back to the default type to promote to. For any other illegal values we might get back from the initial call to getSetccResultType we just keep and allow it to be re-legalized later via splitting or widening or scalarizing. llvm-svn: 327683	2018-03-15 23:04:11 +00:00
Simon Pilgrim	3894809997	[X86][Btver2] Fix ymm div/sqrt to use fmul unit YMM FDiv/FSqrt are dispatched on pipe JFPU1 but should be performed on the JFPM unit - that is where most of the cycles are spent. This matches the pipes for WriteFSqrt/WriteFDiv definitions. llvm-svn: 327682	2018-03-15 23:00:47 +00:00
Sean Silva	673f4b57f6	Use standard `print(dbgs())` pattern to implement DebugLoc::dump The open-coded implementation had a bug. It didn't print filenames. llvm-svn: 327681	2018-03-15 22:51:55 +00:00
Sanjay Patel	2d568ec0e4	[InstCombine] add tests for fcmp+select -> fabs; NFC llvm-svn: 327680	2018-03-15 22:48:23 +00:00
Zachary Turner	a92e940a8c	Fix PDB injected sources test. This test was originally disabled because it was failing on a bot. It turns out I had run dos2unix on the file, and that removed a necessary byte from the file. I'm just recommitting the proper file and updating the test to test a little bit more now. llvm-svn: 327679	2018-03-15 22:47:04 +00:00
Simon Pilgrim	49a56faee2	[X86][Btver2] Add test to show timeline of fpu instructions on different pipes/units Try to demonstrate the scheduling from fpu0/fpu1 pipes to the valu0/vimul/fpa or valu1/stc/fpm functional units llvm-svn: 327676	2018-03-15 22:34:24 +00:00
Zachary Turner	edbcbe0b62	[PDB] Fix a bug where we were serializing hash tables incorrectly. There was some code that tried to calculate the number of 4-byte words required to hold N bits, but it was instead computing the number of bytes required to hold N bits. This was leading to extraneous data being output into the hash table, which would cause certain operations in DIA (the Microsoft PDB reader) to fail. llvm-svn: 327675	2018-03-15 22:31:00 +00:00
Derek Schuff	10b313581f	[WebAssembly] Add DebugLoc information to WebAssembly block and loop. Patch by Yury Delendik Differential Revision: https://reviews.llvm.org/D44448 llvm-svn: 327673	2018-03-15 22:06:51 +00:00
Artem Belevich	7b14e7f041	[NVPTX] TblGen-ized lowering of WMMA intrinsics. NFC. Differential Revision: https://reviews.llvm.org/D43151 llvm-svn: 327672	2018-03-15 21:40:56 +00:00
Florian Hahn	fc97b6173f	[LoopUnroll] Peel off iterations if it makes conditions true/false. If the loop body contains conditions of the form IndVar < #constant, we can remove the checks by peeling off #constant iterations. This improves codegen for PR34364. Reviewers: mkuper, mkazantsev, efriedma Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D43876 llvm-svn: 327671	2018-03-15 21:34:43 +00:00
Reid Kleckner	5a791ee408	Re-land r327620 "[CodeView] Initial support for emitting S_BLOCK32 symbols for lexical scopes" This is safe to land now that we don't copy FunctionInfo when rehashing the DenseMap. llvm-svn: 327670	2018-03-15 21:24:04 +00:00
Reid Kleckner	e9dc30d2c2	[codeview] Fix sense of the assertion about hashtable insertion llvm-svn: 327669	2018-03-15 21:18:42 +00:00
Zachary Turner	03028f327b	Fix structure alignment issue. llvm-svn: 327666	2018-03-15 21:12:51 +00:00
Reid Kleckner	55baeefd54	[codeview] Delete FunctionInfo copy ctor and move out of DenseMap We were unnecessarily copying a bunch of these FunctionInfo objects around when rehashing the DenseMap. Furthermore, r327620 introduced pointers referring to objects owned by FunctionInfo, and the default copy ctor did the wrong thing in this case, leading to use-after-free when the DenseMap gets rehashed. I will rebase r327620 on this next and recommit it. llvm-svn: 327665	2018-03-15 21:12:21 +00:00
Philip Reames	a21d5f1e18	[LICM] Ignore exits provably not taken on first iteration when computing must execute It is common to have conditional exits within a loop which are known not to be taken on some iterations, but not necessarily all. This patch extends our reasoning around guaranteed to execute (used when establishing whether it's safe to dereference a location from the preheader) to handle the case where an exit is known not to be taken on the first iteration and the instruction of interest is known to be taken on the first iteration. This case comes up in two major ways: * If we have a range check which we've been unable to eliminate, we frequently know that it doesn't fail on the first iteration. * Pass ordering. We may have a check which will be eliminated through some sequence of other passes, but depending on the exact pass sequence we might never actually do so or we might miss other optimizations from passes run before the check is finally eliminated. The initial version (here) is implemented via InstSimplify. At the moment, it catches a few cases, but misses a lot too. I added test cases for missing cases in InstSimplify which I'll follow up on separately. Longer term, we should probably wire SCEV through to here to get much smarter loop aware simplification of the first iteration predicate. Differential Revision: https://reviews.llvm.org/D44287 llvm-svn: 327664	2018-03-15 21:04:28 +00:00
Evandro Menezes	d4254ac1b9	[AArch64] Adjust the cost model for Exynos M3 Fix typo. llvm-svn: 327663	2018-03-15 20:37:32 +00:00
Evandro Menezes	5303f897d4	[AArch64] Adjust the cost model for Exynos M3 Add special case for rotate right. llvm-svn: 327662	2018-03-15 20:31:25 +00:00
Evandro Menezes	1515e859c6	[AArch64] Adjust the cost model for Exynos M3 Increase the number of cheap as move cases of register reset. llvm-svn: 327661	2018-03-15 20:31:13 +00:00
Craig Topper	c3983c34cd	[X86] Make sure we use FSUB instruction as the reference for operand order in isAddSubOrSubAdd when recognizing subadd The FADD part of the addsub/subadd pattern can have its operands commuted, but when checking for fsubadd we were using the fadd as reference and commuting the fsub node. llvm-svn: 327660	2018-03-15 20:30:54 +00:00
Craig Topper	46502fa2ef	[X86] Add test case showing bad fmsubadd creation due to bad commuting. The code that creates fmsubadd from shuffle vector has some code to allow commuting the operands of the fadd node. This code was originally created when we only recognized fmaddsub. When fmsubadd support was added this code was not updated and is now commuting the fsub operands instead. llvm-svn: 327659	2018-03-15 20:30:51 +00:00
David Blaikie	a46b8a7677	Remove empty file I should've deleted this in r320768 but accidentally just deleted its contents instead. llvm-svn: 327658	2018-03-15 20:29:14 +00:00
Reid Kleckner	7ac8e50e7e	Revert r327620 "[CodeView] Initial support for emitting S_BLOCK32 symbols for lexical scopes" It is causing crashes when compiling Chrome in debug mode. I'll try to debug it in a second. llvm-svn: 327657	2018-03-15 20:12:16 +00:00
Diego Caballero	cae4994a58	[LV] Test commit. Removing white space. This is just to check that I have commit access privilege. llvm-svn: 327656	2018-03-15 19:34:27 +00:00
Philip Reames	422024a1b7	[EarlyCSE] Don't hide earlier invariant.scopes If we've already established an invariant scope with an earlier generation, we don't want to hide it in the scoped hash table with one with a later generation. I noticed this when working on the invariant-load handling, but it also applies to the invariant.start case as well. Without this change, my previous patch for invariant-load regresses some cases, so I'm pushing this without waiting for review. This is why you don't make last minute tweaks to patches to catch "obvious cases" after it's already been reviewed. Bad Philip! llvm-svn: 327655	2018-03-15 18:12:27 +00:00
Zachary Turner	3dbc362fe2	Move some function declarations higher so they can be found. llvm-svn: 327653	2018-03-15 17:56:44 +00:00
Zachary Turner	cce9b0d37e	Add missing #includes. llvm-svn: 327652	2018-03-15 17:51:10 +00:00
Guozhi Wei	9c916584ba	[PPC] Avoid non-simple MVT in STBRX optimization PR35402 triggered this case. It bswaps and stores a 48-bit value, and the current STBRX optimization transforms it into STBRX. Unfortunately 48-bit is not a simple MVT, there is no PPC instruction to support it, and it can't be automatically expanded by llvm, so it caused a crash. This patch detects the non-simple MVT and returns early. Differential Revision: https://reviews.llvm.org/D44500 llvm-svn: 327651	2018-03-15 17:49:12 +00:00
Simon Pilgrim	48b758e8ad	[X86][Btver2] Attach AES/CLMUL instructions to a scheduler pipe llvm-svn: 327650	2018-03-15 17:45:10 +00:00
Craig Topper	5a0251fe67	[X86] Simplify the type legality checking for (FM)ADDSUB/SUBADD matching. NFCI Rather than enumerating all specific types, for the DAG combine we can just use TLI::isTypeLegal and an SSE3 check. For the BUILD_VECTOR version we already know the type is legal so we just need to check SSE3. llvm-svn: 327649	2018-03-15 17:38:59 +00:00
Craig Topper	627e001fad	[X86] Fix 80 column violations. llvm-svn: 327648	2018-03-15 17:38:55 +00:00
Zachary Turner	ebf03f6c46	Refactor the PDB HashTable class. It previously only worked when the key and value types were both 4 byte integers. We now have a use case for a non trivial value type, so we need to extend it to support arbitrary value types, which means templatizing it. llvm-svn: 327647	2018-03-15 17:38:26 +00:00
Philip Reames	ca587fe0b4	[EarlyCSE] Reuse invariant scopes for invariant load This is a follow up to https://reviews.llvm.org/D43716 which rewrites the invariant load handling using the new infrastructure. It's slightly more powerful, but only in somewhat minor ways for the moment. It's not clear that DSE of stores to invariant locations is actually interesting since why would your IR have such a construct to start with? Note: The submitted version is slightly different than the reviewed one. I realized the scope could start for an invariant load which was proven redundant and removed. Added a test case to illustrate that as well. Differential Revision: https://reviews.llvm.org/D44497 llvm-svn: 327646	2018-03-15 17:29:32 +00:00
Roman Lebedev	6aca33534b	[InstSimplify] peek through unsigned FP casts for sign-bit compares (PR36682) This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H See also D44421, D44424 Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44425 llvm-svn: 327642	2018-03-15 16:17:46 +00:00
Roman Lebedev	0c43d72e90	[InstSimplify][NFC] simplifyICmpWithConstant(): refactor GetCompareTy() calls Preparation for D44425. llvm-svn: 327641	2018-03-15 16:17:40 +00:00
Andrea Di Biagio	af904b9919	[llvm-mca] Simplify code. NFC. Now both method DispatchUnit::checkRAT() and DispatchUnit::canDispatch take as input an Instruction reference instead of an instruction descriptor. This was requested by Simon in D44488 to simplify the diff. llvm-svn: 327640	2018-03-15 16:13:12 +00:00
Matthew Simpson	c1c4ad6e64	[ConstantFolding, InstSimplify] Handle more vector GEPs This patch addresses some additional cases where the compiler crashes upon encountering vector GEPs. This should fix PR36116. Differential Revision: https://reviews.llvm.org/D44219 Reference: https://bugs.llvm.org/show_bug.cgi?id=36116 llvm-svn: 327638	2018-03-15 16:00:29 +00:00
Zaara Syeda	1110c4d336	[PowerPC] Optimize TLS initial-exec sequence to use X-Form loads/stores This patch adds new load/store instructions for integer scalar types which can be used for X-Form when fed by add with an @tls relocation. Differential Revision: https://reviews.llvm.org/D43315 llvm-svn: 327635	2018-03-15 15:34:41 +00:00
Simon Pilgrim	d30df5769e	[X86][Btver2] Remove JAny resource, and map system/microcoded instructions to JALU pipes Simplifies throughput to the issue width (1/2) instead of permitting any pipe (1/6) llvm-svn: 327632	2018-03-15 15:12:12 +00:00
Sanjay Patel	43f71eade0	[InstSimplify] add tests with NaN operand for fp binops; NFC llvm-svn: 327631	2018-03-15 14:48:39 +00:00
Simon Pilgrim	fb7aa57bf1	[X86][SSE] Introduce Float/Vector WriteMove, WriteLoad and WriteStore scheduler classes As discussed on D44428 and PR36726, this patch splits off WriteFMove/WriteVecMove, WriteFLoad/WriteVecLoad and WriteFStore/WriteVecStore scheduler classes to permit vectors to be handled separately from gpr/scalar types. I've minimised the diff here by only moving various basic SSE/AVX vector instructions across - we can fix the rest when called for. This does fix the MOVDQA vs MOVAPS/MOVAPD discrepancies mentioned on D44428. Differential Revision: https://reviews.llvm.org/D44471 llvm-svn: 327630	2018-03-15 14:45:30 +00:00
Simon Pilgrim	69a4132f63	[X86] Regenerate schedule tests with zero latency comments llvm-svn: 327628	2018-03-15 14:30:59 +00:00
Sanjay Patel	a4f42f2cfd	[PatternMatch, InstSimplify] allow undef elements when matching any vector FP zero This matcher implementation appears to be slightly more efficient than the generic constant check that it is replacing because every use was for matching FP patterns, but the previous code would check int and pointer type nulls too. llvm-svn: 327627	2018-03-15 14:29:27 +00:00
Sanjay Patel	8f063d0c70	[InstSimplify] remove 'nsz' requirement for frem 0, X From the LangRef definition for frem: "The value produced is the floating-point remainder of the two operands. This is the same output as a libm ‘fmod‘ function, but without any possibility of setting errno. The remainder has the same sign as the dividend. This instruction is assumed to execute in the default floating-point environment." llvm-svn: 327626	2018-03-15 14:04:31 +00:00
Sjoerd Meijer	dfc7eb490a	[AArch64] Codegen tests for the Armv8.2-A FP16 intrinsics This is a follow up of the AArch64 FP16 intrinsics work; the codegen tests had not been added yet. Differential Revision: https://reviews.llvm.org/D44510 llvm-svn: 327624	2018-03-15 13:42:28 +00:00
Ulrich Weigand	f4ceef8d3f	[Debug] Retain both copies of debug intrinsics in HoistThenElseCodeToIf When hoisting common code from the "then" and "else" branches of a condition to before the "if", the HoistThenElseCodeToIf routine will attempt to merge the debug location associated with the two original copies of the hoisted instruction. This is a problem in the special case where the hoisted instruction is a debug info intrinsic, since for those the debug location is considered part of the intrinsic and attempting to modify it may result in invalid IR. This is the underlying cause of PR36410. This patch fixes the problem by handling debug info intrinsics specially: instead of hoisting one copy and merging the two locations, the code now simply hoists both copies, each with its original location intact. Note that this is still only done in the case where both original copies are otherwise (i.e. apart from location metadata) identical. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D44312 llvm-svn: 327622	2018-03-15 12:28:48 +00:00
Brock Wyma	3cc5710cec	[CodeView] Initial support for emitting S_BLOCK32 symbols for lexical scopes This patch sorts local variables by lexical scope and emits them inside an appropriate S_BLOCK32 CodeView symbol. Differential Revision: https://reviews.llvm.org/D42926 llvm-svn: 327620	2018-03-15 11:52:17 +00:00
Fedor Sergeev	194a407bda	[New PM][IRCE] port of Inductive Range Check Elimination pass to the new pass manager There are two nontrivial details here: * Loop structure update interface is quite different with new pass manager, so the code to add new loops was factored out * BranchProbabilityInfo is not a loop analysis, so it can not be just getResult'ed from within the loop pass. It can't even be queried through getCachedResult as LoopCanonicalization sequence (e.g. LoopSimplify) might invalidate BPI results. Complete solution for BPI will likely take some time to discuss and figure out, so for now this was partially solved by making BPI optional in IRCE (skipping a couple of profitability checks if it is absent). Most of the IRCE tests got their corresponding new-pass-manager variant enabled. Only two of them depend on BPI, both marked with TODO, to be turned on when BPI starts being available for loop passes. Reviewers: chandlerc, mkazantsev, sanjoy, asbirlea Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D43795 llvm-svn: 327619	2018-03-15 11:01:19 +00:00
Andrei Elovikov	f9b8035f3c	[LoopUnroll] Ignore ephemeral values when checking full unroll profitability. Summary: Before this patch call graph is like this in the LoopUnrollPass: tryToUnrollLoop ApproximateLoopSize collectEphemeralValues /* Use collected ephemeral values */ computeUnrollCount analyzeLoopUnrollCost /* Bail out from the analysis if loop contains CallInst */ This patch moves collection of the ephemeral values to the tryToUnrollLoop function and passes the collected values into both ApproximateLoopSize (as before) and additionally starts using them in analyzeLoopUnrollCost: tryToUnrollLoop collectEphemeralValues ApproximateLoopSize(EphValues) /* Use EphValues */ computeUnrollCount(EphValues) analyzeLoopUnrollCost(EphValues) /* Ignore ephemeral values - they don't contribute to the final cost */ /* Bail out from the analysis if loop contains CallInst */ Reviewers: mzolotukhin, evstupac, sanjoy Reviewed By: evstupac Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43931 llvm-svn: 327617	2018-03-15 09:59:15 +00:00
Max Kazantsev	4f9c7c5086	[SCEV][NFC] Remove TBB, FBB parameters from exit limit computations Methods `computeExitLimitFromCondCached` and `computeExitLimitFromCondImpl` take true and false branches as parameters and only use them for asserts and for identifying whether true/false branch belongs to the loop (which can be done once earlier). This fact complicates generalization of exit limit computation logic on guards because the guards don't have blocks to which they go in case of failure explicitly. The motivation of this patch is that currently this part of SCEV knows nothing about guards and only works with explicit branches. As a result, it fails to prove that a loop for (i = 0; i < 100; i++) guard(i < 10); exits after the 10th iteration, while in the equivalent example for (i = 0; i < 100; i++) if (i >= 10) break; SCEV easily proves this fact. We are going to change it in the near future, and this is why we need to make these methods operate on a more abstract level. This patch refactors this code to get rid of these parameters as meaningless and prepare ground for teaching these methods to work with guards as well as they work with explicit branching instructions. Differential Revision: https://reviews.llvm.org/D44419 llvm-svn: 327615	2018-03-15 09:38:00 +00:00
Craig Topper	ff6e82c9d0	[X86] Add test cases for 512-bit addsub from build_vector. There is no 512 bit addsub instruction, but we partially match it to handle fmaddsub matching. We explicitly bail out for 512 bit vectors after failing the fmaddsub match, but we had no test coverage for that bail out. We might want to consider splitting and using 256 bit instructions instead of the long sequence seen here. llvm-svn: 327605	2018-03-15 06:49:01 +00:00
Craig Topper	26a3a80c87	[X86] Add support for matching FMSUBADD from build_vector. llvm-svn: 327604	2018-03-15 06:14:55 +00:00
Craig Topper	a5e712f402	[X86] Remove old TODO. We have coverage for this now. Coverage was added in r320950. llvm-svn: 327603	2018-03-15 06:14:53 +00:00
Craig Topper	b9526e9fdb	[X86] Use MVT in a couple places where we know the type is legal. llvm-svn: 327602	2018-03-15 06:14:51 +00:00
Aaron Smith	40198f5905	[DebugInfo] Add a new method IPDBSession::findLineNumbersBySectOffset Summary: Some PDB symbols do not have a valid VA or RVA but have Addr by Section and Offset. For example, a variable in thread-local storage has the following properties: get_addressOffset: 0 get_addressSection: 5 get_lexicalParentId: 2 get_name: g_tls get_symIndexId: 12 get_typeId: 4 get_dataKind: 6 get_symTag: 7 get_locationType: 2 This change provides a new method to locate line numbers by Section and Offset from those symbols. Reviewers: zturner, rnk, llvm-commits Subscribers: asmith, JDevlieghere Differential Revision: https://reviews.llvm.org/D44407 llvm-svn: 327601	2018-03-15 06:04:51 +00:00
Lei Huang	1f8da3ae19	[PowerPC][NFC] formatting-only fix llvm-svn: 327599	2018-03-15 03:06:44 +00:00
George Burgess IV	cedfa6da81	Remove unused variable; NFC llvm-svn: 327597	2018-03-15 02:58:36 +00:00
Lang Hames	5721ee48a2	[ORC] Re-apply r327566 with a fix for test-global-ctors.ll. Also clang-formats the patch, which I should have done the first time around. llvm-svn: 327594	2018-03-15 00:30:14 +00:00
Matt Davis	9407bb5f54	[CleanUp] Remove NumInstructions field from LoopVectorizer's RegisterUsage struct. Summary: This variable is largely going unused, aside from reporting the number of instructions in DEBUG builds. The only use of NumInstructions is in debug output to represent the LoopSize. That value can be misleading as it also includes metadata instructions (e.g., DBG_VALUE) which have no real impact. If we do choose to keep this around, we probably should guard it by a DEBUG macro, as it's not used in production builds. Reviewers: majnemer, congh, rengolin Reviewed By: rengolin Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D44495 llvm-svn: 327589	2018-03-14 23:30:31 +00:00
Simon Pilgrim	48fbf0c69a	[X86][Btver2] Add support for multiple pipelines stages for fpu schedules. NFCI. This allows us to use JWriteResFpuPair for complex schedule classes as well as single pipe instructions. llvm-svn: 327588	2018-03-14 23:12:09 +00:00
Sanjay Patel	e4e3f79b83	[InstSimplify] add tests for frem and vectors with undef; NFC These should all be folded. The vector tests need to have m_AnyZero updated to ignore undef elements, but we need to be careful not to return the existing value in that case and unintentionally propagate undef. llvm-svn: 327585	2018-03-14 22:45:58 +00:00
Mark Searles	c3c02bde73	[AMDGPU] Waitcnt pass: Modify the waitcnt pass to propagate info in the case of a single basic block loop. mergeInputScoreBrackets() does this for us; update it so that it processes the single bb's score bracket when processing the single bb's preds. It is, after all, a pred of itself, so its score bracket is needed. Differential Revision: https://reviews.llvm.org/D44434 llvm-svn: 327583	2018-03-14 22:04:32 +00:00
Simon Pilgrim	dfeebdbed7	[X86][Btver2] Add ResourceCycles and NumMicroOps overrides to scalar instructions. NFCI. Currently still use default values - this is setup for a future patch. llvm-svn: 327582	2018-03-14 21:55:54 +00:00
Reid Kleckner	3a7a2e4a0a	[FastISel] Sink local value materializations to first use Summary: Local values are constants, global addresses, and stack addresses that can't be folded into the instruction that uses them. For example, when storing the address of a global variable into memory, we need to materialize that address into a register. FastISel doesn't want to materialize any given local value more than once, so it generates all local value materialization code at EmitStartPt, which always dominates the current insertion point. This allows it to maintain a map of local value registers, and it knows that the local value area will always dominate the current insertion point. The downside is that local value instructions are always emitted without a source location. This is done to prevent jumpy line tables, but it means that the local value area will be considered part of the previous statement. Consider this C code: call1(); // line 1 ++global; // line 2 ++global; // line 3 call2(&global, &local); // line 4 Today we end up with assembly and line tables like this: .loc 1 1 callq call1 leaq global(%rip), %rdi leaq local(%rsp), %rsi .loc 1 2 addq $1, global(%rip) .loc 1 3 addq $1, global(%rip) .loc 1 4 callq call2 The LEA instructions in the local value area have no source location and are treated as being on line 1. Stepping through the code in a debugger and correlating it with the assembly won't make much sense, because these materializations are only required for line 4. This is actually problematic for the VS debugger "set next statement" feature, which effectively assumes that there are no registers live across statement boundaries. By sinking the local value code into the statement and fixing up the source location, we can make that feature work. This was filed as https://bugs.llvm.org/show_bug.cgi?id=35975 and https://crbug.com/793819. 
This change is obviously not enough to make this feature work reliably in all cases, but I felt that it was worth doing anyway because it usually generates smaller, more comprehensible -O0 code. I measured a 0.12% regression in code generation time with LLC on the sqlite3 amalgamation, so I think this is worth doing. There are some special cases worth calling out in the commit message: 1. local values materialized for phis 2. local values used by no-op casts 3. dead local value code Local values can be materialized for phis, and this does not show up as a vreg use in MachineRegisterInfo. In this case, if there are no other uses, this patch sinks the value to the first terminator, EH label, or the end of the BB if nothing else exists. Local values may also be used by no-op casts, which adds the register to the RegFixups table. Without reversing the RegFixups map direction, we don't have enough information to sink these instructions. Lastly, if the local value register has no other uses, we can delete it. This comes up when fastisel tries two instruction selection approaches and the first materializes the value but fails and the second succeeds without using the local value. Reviewers: aprantl, dblaikie, qcolombet, MatzeB, vsk, echristo Subscribers: dotdash, chandlerc, hans, sdardis, amccarth, javed.absar, zturner, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D43093 llvm-svn: 327581	2018-03-14 21:54:21 +00:00
Francis Visoiu Mistrih	e85b06d65f	[CodeGen] Use MIR syntax for MachineMemOperand printing Get rid of the "; mem:" suffix and use the one we use in MIR: ":: (load 2)". rdar://38163529 Differential Revision: https://reviews.llvm.org/D42377 llvm-svn: 327580	2018-03-14 21:52:13 +00:00
Philip Reames	0adbb19409	[EarlyCSE] Exploit open ended invariant.start scopes If we have an invariant.start with no corresponding invariant.end, then the memory location becomes invariant indefinitely after the invariant.start. As a result, anything dominated by the start is guaranteed to see the value the memory location had when the invariant.start executed. This patch adds an AvailableInvariants table which tracks the generation a particular memory location became invariant and then uses that information to allow value forwarding that would otherwise be disallowed by potentially aliasing stores. (Reminder: In EarlyCSE everything clobbers everything by default.) This should be compatible with the MemorySSA variant, but design is generational. We can and should add first class support for invariant.start within MemorySSA at a later time. I took a quick look at doing so, but probably need some input from a MemorySSA expert. Differential Revision: https://reviews.llvm.org/D43716 llvm-svn: 327577	2018-03-14 21:35:06 +00:00
Reid Kleckner	c7fd1540b3	Revert "[ORC] Switch from shared_ptr to unique_ptr for addModule methods." This reverts commit r327566, it breaks test/ExecutionEngine/OrcMCJIT/test-global-ctors.ll. The test doesn't crash with a stack trace, unfortunately. It merely returns 1 as the exit code. ASan didn't produce a report, and I reproduced this on my Linux machine and Windows box. llvm-svn: 327576	2018-03-14 21:32:34 +00:00
Sanjay Patel	11f7f9908b	[InstSimplify] fix folds for (0.0 - X) + X --> 0 (PR27151) As shown in: https://bugs.llvm.org/show_bug.cgi?id=27151 ...the existing fold could miscompile when X is NaN. The fold was also dependent on 'ninf' but that's not necessary. From IEEE-754 (with default rounding which we can assume for these opcodes): "When the sum of two operands with opposite signs (or the difference of two operands with like signs) is exactly zero, the sign of that sum (or difference) shall be +0...However, x + x = x − (−x) retains the same sign as x even when x is zero." llvm-svn: 327575	2018-03-14 21:23:27 +00:00
Simon Pilgrim	adf72e8549	[X86] Add haswell testing for PR35635 as well. To improve complete model testing for schedulers for instructions with multiple results. llvm-svn: 327572	2018-03-14 21:03:09 +00:00
Francis Visoiu Mistrih	164560bd74	[AArch64] Emit CSR loads in the same order as stores Optionally allow the order of restoring the callee-saved registers in the epilogue to be reversed. The flag -reverse-csr-restore-seq generates the following code: ``` stp x26, x25, [sp, #-64]! stp x24, x23, [sp, #16] stp x22, x21, [sp, #32] stp x20, x19, [sp, #48] ; [..] ldp x24, x23, [sp, #16] ldp x22, x21, [sp, #32] ldp x20, x19, [sp, #48] ldp x26, x25, [sp], #64 ret ``` Note how the CSRs are restored in the same order as they are saved. One exception to this rule is the last `ldp`, which allows us to merge the stack adjustment and the ldp into a post-index ldp. This is done by first generating: ldp x26, x27, [sp] add sp, sp, #64 which gets merged by the arm64 load store optimizer into ldp x26, x25, [sp], #64 The flag is disabled by default. llvm-svn: 327569	2018-03-14 20:34:03 +00:00
Lang Hames	7bea03c2bb	[ORC] Switch from shared_ptr to unique_ptr for addModule methods. Layer implementations typically mutate module state, and this is better reflected by having layers own the Module they are operating on. llvm-svn: 327566	2018-03-14 20:29:45 +00:00
Alexander Richardson	115b0673b6	[UpdateTestChecks] Handle IR variables with a '-' in the name Summary: I noticed that clang will emit variables such as %indirect-arg-temp when running update_cc1_test_checks.py and therefore update_cc1_test_checks.py wasn't adding FileCheck captures for those variables. Reviewers: MaskRay Reviewed By: MaskRay Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44459 llvm-svn: 327564	2018-03-14 20:28:53 +00:00
Reid Kleckner	5a5ac65768	[MC] Always emit relocations for same-section function references Summary: We already emit relocations in this case when the "incremental linker compatible" flag is set, but it turns out these relocations are also required for /guard:cf. Now that we have two use cases for this behavior, let's make it unconditional to try to keep things simple. We never hit this problem in Clang because it always sets the "incremental linker compatible" flag when targeting MSVC. However, LLD LTO doesn't set this flag, so we'd get CFG failures at runtime when using ThinLTO and /guard:cf. We probably don't want LLD LTO to set the "incremental linker compatible" assembler flag, since this has nothing to do with incremental linking, and we don't need to timestamp LTO temporary objects. Fixes PR36624. Reviewers: inglorion, espindola, majnemer Subscribers: mehdi_amini, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D44485 llvm-svn: 327557	2018-03-14 19:24:32 +00:00
Sanjay Patel	fd82fd000c	[InstSimplify] add tests to show missing/broken fadd folds (PR27151, PR26958); NFC llvm-svn: 327554	2018-03-14 18:52:40 +00:00
Sanjay Patel	e011d7964d	[InstSimplify] regenerate checks; NFC llvm-svn: 327553	2018-03-14 18:49:57 +00:00
Reid Kleckner	e66458a841	[LLVM-C] [bindings/go] Add C and Golang bindings for COMDAT Patch by Ben Clayton Differential Revision: https://reviews.llvm.org/D44086 llvm-svn: 327551	2018-03-14 18:33:53 +00:00
Roman Lebedev	60d24445dd	[InstSimplify] [NFC] cast-unsigned-icmp-cmp-0.ll - don't run instcombine As discussed in post-commit review of D44421, there is simply no reason to run instcombine on this testcase. llvm-svn: 327541	2018-03-14 17:59:12 +00:00
Craig Topper	9c098ed819	[X86] Add back fast-isel code for handling i8 shifts. I removed this in r316797 because the coverage report showed no coverage and I thought it should have been handled by the auto generated table. I now see that there is code that bypasses the table if the shift amount is out of bounds. This adds back the code. We'll codegen out of bounds i8 shifts to effectively (amount & 0x1f). The 0x1f is a strange quirk of x86 that shift amounts are always masked to 5 bits (except 64-bits). So if the masked value is still out of bounds the result will be 0. Fixes PR36731. llvm-svn: 327540	2018-03-14 17:57:19 +00:00
Fangrui Song	56fb2b2f20	Fix LLVM IR check lines in utils/update_cc_test_checks.py Reviewers: arichardson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44400 llvm-svn: 327538	2018-03-14 17:47:07 +00:00
Roman Lebedev	978aae7614	[InstSimplify] [NFC] Add tests for peeking through unsigned FP casts for sign compares (PR36682) Summary: This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H Looking at the IR pattern in question, as per [[ https://github.com/rutgers-apl/alive-nj \| alive-nj ]], for all the type combinations i checked (input: `i16`, `i32`, `i64`; intermediate: `half`/`i16`, `float`/`i32`, `double`/`i64`) for the following `icmp` comparisons the `uitofp`+`bitcast`+`icmp` can be evaluated to a boolean: * `slt 0` * `sgt -1` I did not check vectors, but i'm guessing it's the same there. {F5889242} Thus all these cases are in the testcase (along with the vector variant with additional `undef` element in the middle). There are no negative patterns here (unless alive-nj lied/is broken), all of these should be optimized. Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44421 llvm-svn: 327535	2018-03-14 17:31:08 +00:00
Roman Lebedev	6ab60358ca	[InstCombine] [NFC] Add tests for peeking through unsigned FP casts for zero-equality compares (PR36682) Summary: This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H Looking at the IR pattern in question, as per [[ https://github.com/rutgers-apl/alive-nj \| alive-nj ]], for all the type combinations i checked (input: `i16`, `i32`, `i64`; intermediate: `half`/`i16`, `float`/`i32`, `double`/`i64`) for the following `icmp` comparisons the `uitofp`+`bitcast` can be dropped: * `eq 0` * `ne 0` I did not check vectors, but i'm guessing it's the same there. {F5889189} Thus all these cases are in the testcase (along with the vector variant with additional `undef` element in the middle). There are no negative patterns here (unless alive-nj lied/is broken), all of these should be optimized. Generated with {F5889196} Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44416 llvm-svn: 327534	2018-03-14 17:31:03 +00:00
Francis Visoiu Mistrih	084e7d8770	[AArch64] Keep track of MIFlags in the LoadStoreOptimizer Merging: * $x26, $x25 = frame-setup LDPXi $sp, 0 * $sp = frame-destroy ADDXri $sp, 64, 0 into an LDPXpost should preserve the flags from both instructions as following: * frame-setup frame-destroy LDPXpost Differential Revision: https://reviews.llvm.org/D44446 llvm-svn: 327533	2018-03-14 17:10:58 +00:00
Craig Topper	b36cb20ef9	[X86] Teach X86TargetLowering::targetShrinkDemandedConstant to set non-demanded bits if it helps create an and mask that can be matched as a zero extend. I had to modify the bswap recognition to allow unshrunk masks to make this work. Fixes PR36689. Differential Revision: https://reviews.llvm.org/D44442 llvm-svn: 327530	2018-03-14 16:55:15 +00:00
Nicholas Wilson	48d6dbe3cb	[WebAssembly] Add DenseMap traits and operator== for Wasm type structs Differential Revision: https://reviews.llvm.org/D44303 llvm-svn: 327526	2018-03-14 15:58:03 +00:00
Simon Pilgrim	d1c3c995c0	[X86][AVX] Use WriteFShuffleLd for broadcast reg-mem instructions They shouldn't be treated as pure loads. Found while investigating D44428 llvm-svn: 327524	2018-03-14 15:47:08 +00:00
Nicholas Wilson	027b9357a8	[WebAssembly] Identify COMDATs by index rather than string. NFC This will enable an optimisation in LLD. Differential Revision: https://reviews.llvm.org/D44343 llvm-svn: 327522	2018-03-14 15:44:45 +00:00
Arnold Schwaighofer	bf1638daa8	SjLjEHPrepare: Don't reg-to-mem swifterror values swifterror llvm values model the swifterror register as memory at the LLVM IR level. ISel will perform ad hoc mem-to-reg on them. swifterror values are constrained in how they can be used. Spilling them to memory is not allowed. SjLjEHPrepare tried to lower swifterror values to memory which is unnecessary since the back-end will spill and reload the register as necessary (as long as clobbering calls are marked as such which is the case here) and further leads to invalid IR because swifterror values can't be stored to memory. rdar://38164004 llvm-svn: 327521	2018-03-14 15:44:07 +00:00
Alexander Ivchenko	86ef9ab28f	[GlobalIsel][X86] Support for G_SDIV instruction Reviewed By: igorb Differential Revision: https://reviews.llvm.org/D44430 llvm-svn: 327520	2018-03-14 15:41:11 +00:00
Sanjay Patel	5773ac3ee8	[CodeGen] allow printing of zero latency in sched comments I don't know how to expose this in a test. There are ARM / AArch64 sched classes that include zero latency instructions, but I'm not seeing sched info printed for those targets. X86 will almost certainly have these soon (see PR36671), but no model has 'let Latency = 0' currently. llvm-svn: 327518	2018-03-14 15:28:48 +00:00
Andrea Di Biagio	36e34a99c7	[llvm-mca] Remove unused variable from InstrBuilder.cpp. NFC This was causing a buildbot failure. llvm-svn: 327517	2018-03-14 15:19:47 +00:00
Andrea Di Biagio	4732d43cae	[llvm-mca] Move the logic that updates the register files from InstrBuilder to DispatchUnit. NFCI Before this patch, the register file was always updated at instruction creation time. That means, new read-after-write dependencies, and new temporary registers were allocated at instruction creation time. This patch refactors the code in InstrBuilder, and moves all the logic that updates the register file into the dispatch unit. We only want to update the register file when instructions are effectively dispatched (not before). This refactoring also helps remove a bad dependency between the InstrBuilder and the DispatchUnit. No functional change intended. llvm-svn: 327514	2018-03-14 14:57:23 +00:00
Petar Jovanovic	3408caf686	[mips] Add support for CRC ASE This includes Instructions: crc32b, crc32h, crc32w, crc32d, crc32cb, crc32ch, crc32cw, crc32cd Assembler directives: .set crc, .set nocrc, .module crc, .module nocrc Attribute: crc .MIPS.abiflags: CRC (0x8000) Patch by Vladimir Stefanovic. Differential Revision: https://reviews.llvm.org/D44176 llvm-svn: 327511	2018-03-14 14:13:31 +00:00
Simon Pilgrim	d594942928	[X86][Btver2] Fix YMM shuffle, permute and permutevar scheduler costs Account for ymm double pumping and add proper pshufb/permutevar support llvm-svn: 327510	2018-03-14 14:05:19 +00:00
Teresa Johnson	2f5c331138	[LTO/gold] Fix workaround for old plugin-api.h in --wrap support The workaround for older plugin-api.h in r327506 unfortunately used another union member that is also fairly new and not available in the plugin-api.h on some of the bots, leading to: http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/9121/steps/build-stage2-LLVMgold.so/logs/stdio Change to use a different member that we will definitely have (as it is used elsewhere in gold-plugin.cpp already). llvm-svn: 327509	2018-03-14 14:00:57 +00:00
Teresa Johnson	8883af6892	[LTO/gold] Support --wrap Summary: (Restores r327459 with handling for old plugin-api.h) Utilize new gold plugin api interface for obtaining --wrap option arguments, and LTO API handling (added for --wrap support in lld LTO), to mark symbols so that LTO does not optimize them inappropriately. Note the test cases will be in a new gold test subdirectory that is dependent on the next release of gold which will contain the new interfaces. Reviewers: pcc, tmsriram Subscribers: mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D44235 llvm-svn: 327506	2018-03-14 13:26:18 +00:00
Simon Pilgrim	de995e6e37	[X86][SSE] Use WriteFShuffleLd for MOVDDUP/MOVSHDUP/MOVSLDUP reg-mem instructions They shouldn't be treated as pure loads. Found while investigating D44428 llvm-svn: 327505	2018-03-14 13:22:56 +00:00
Martin Storsjo	bde677289a	[AArch64] Don't produce R_AARCH64_TLSLE_LDST32_TPREL_LO12_NC Support for this relocation is missing in both LLD and GNU binutils at the moment. This reverts the ELF parts of SVN r327316. llvm-svn: 327503	2018-03-14 13:09:10 +00:00
Simon Pilgrim	f0ccaae5bc	Fix 'not all control paths return a value' MSVC warning. NFCI. llvm-svn: 327502	2018-03-14 12:04:51 +00:00
Pavel Labath	8ed6582bb0	Fix msvc compiler error in r327498 msvc reports an "illegal indirection" error here. Attempt to appease it with a different initialization syntax. llvm-svn: 327500	2018-03-14 11:31:17 +00:00
Alexander Ivchenko	0bd4d8c901	[GlobalISel][X86] Support G_LSHR/G_ASHR/G_SHL Support G_LSHR/G_ASHR/G_SHL. We have 3 variants of shift instructions: shift gpr, shift imm, shift 1. Currently GlobalIsel TableGen generates patterns for shift imm and shift 1, but with shiftCount i8. In G_LSHR/G_ASHR/G_SHL, as in LLVM-IR, both arguments have the same type, so for now only shift i8 can use auto generated TableGen patterns. The support of G_SHL/G_ASHR enables tryCombineSExt from LegalizationArtifactCombiner.h to hit, which results in different legalization for the following tests: LLVM :: CodeGen/X86/GlobalISel/ext-x86-64.ll LLVM :: CodeGen/X86/GlobalISel/gep.ll LLVM :: CodeGen/X86/GlobalISel/legalize-ext-x86-64.mir -; X64-NEXT: movsbl %dil, %eax +; X64-NEXT: movl $24, %ecx +; X64-NEXT: # kill: def $cl killed $ecx +; X64-NEXT: shll %cl, %edi +; X64-NEXT: movl $24, %ecx +; X64-NEXT: # kill: def $cl killed $ecx +; X64-NEXT: sarl %cl, %edi +; X64-NEXT: movl %edi, %eax ..which is not optimal and should be addressed later. Rework of the patch by igorb Reviewed By: igorb Differential Revision: https://reviews.llvm.org/D44395 llvm-svn: 327499	2018-03-14 11:23:57 +00:00
Pavel Labath	0dd81bab92	Explicitly initialize dwarf::FormParams in DIEInteger::SizeOf This could end up uninitialized if someone called the function with a null AsmPrinter. Right now this only happens in DIEHash unit tests, presumably because it was hard to create an AsmPrinter in the context of unit tests. This only worked before r327486 because those tests did not use any dwarf forms whose size actually depended on the dwarf version (otherwise, they would have crashed due to null dereference). I fix the uninitialized error, by explicitly initializing FormParams to an invalid value, which will cause getFixedFormByteSize to return None if called with a form with version-dependent size. A more principled solution might be to fix the DIEHash tests to always pass in a valid AsmPrinter. llvm-svn: 327498	2018-03-14 11:14:43 +00:00
Nicolai Haehnle	a511dddced	TableGen: Explicitly forbid some nestings of class, multiclass, and foreach These previously all failed one way or another, but now we produce a more helpful error message. Change-Id: I8ffd2e87c8e35a5134c3be289e0a1fecaa2bb8ca Differential revision: https://reviews.llvm.org/D44115 llvm-svn: 327497	2018-03-14 11:01:01 +00:00
Nicolai Haehnle	aa9ca691cd	TableGen: Add !ne, !le, !lt, !ge, and !gt comparisons Change-Id: I8e2ece677268972d578a787467f7ef52a1f33a71 Differential revision: https://reviews.llvm.org/D44114 llvm-svn: 327496	2018-03-14 11:00:57 +00:00
Nicolai Haehnle	b61c26e614	TableGen: Allow dag operators to be resolved late Change-Id: I51bb80fd5c48c8ac441ab11e43d43c1b91b4b590 Differential revision: https://reviews.llvm.org/D44113 llvm-svn: 327495	2018-03-14 11:00:48 +00:00
Nicolai Haehnle	77841b159a	TableGen: Type-check BinOps Additionally, allow more than two operands to !con, !add, !and, !or in the same way as is already allowed for !listconcat and !strconcat. Change-Id: I9659411f554201b90cd8ed7c7e004d381a66fa93 Differential revision: https://reviews.llvm.org/D44112 llvm-svn: 327494	2018-03-14 11:00:43 +00:00
Nicolai Haehnle	ef60a26817	TableGen: Allow ? in lists This makes using !dag more convenient in some cases. Change-Id: I0a8c35e15ccd1ecec778fd1c8d64eee38d74517c Differential revision: https://reviews.llvm.org/D44111 llvm-svn: 327493	2018-03-14 11:00:33 +00:00
Nicolai Haehnle	6c11865638	TableGen: Add !dag function for construction This allows constructing DAG nodes with programmatically determined names, and can simplify constructing DAG nodes in other cases as well. Also, add documentation and some very simple tests for the already existing !con. Change-Id: Ida61cd82e99752548d7109ce8da34d29da56a5f7 Differential revision: https://reviews.llvm.org/D44110 llvm-svn: 327492	2018-03-14 11:00:26 +00:00
Alexander Timofeev	c683966cdd	[AMDGPU] Fix for DAGCombiner infinite loop in OCLtst Differential revision: https://reviews.llvm.org/D44417 llvm-svn: 327488	2018-03-14 09:48:51 +00:00
Pavel Labath	322711f529	DWARF: Unify form size handling code Summary: This patch replaces the two switches which are deducing the size of various forms with a single implementation. I have put the new implementation into BinaryFormat, to avoid introducing dependencies between the two independent libraries (DebugInfo and CodeGen) that need this functionality. Reviewers: aprantl, JDevlieghere, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44418 llvm-svn: 327486	2018-03-14 09:39:54 +00:00
Jonas Devlieghere	bc683cced8	[dsymutil] Print architecture in warning Make the architecture part of the warning in the DebugMapParser. This makes things consistent with the Apple's internal version of dsymutil. llvm-svn: 327485	2018-03-14 09:34:54 +00:00
Pavel Labath	d54c34722c	Export LLVM_DYLIB_COMPONENTS in LLVMConfig.cmake Summary: This is needed so that external projects (e.g. a standalone build of lldb) can link to the LLVM shared library via the USE_SHARED argument of llvm_config. Without this, llvm_config would add LLVM to the link list, but then also add the constituent static libraries, resulting in multiply defined symbols. Reviewers: beanz, mgorny Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44391 llvm-svn: 327484	2018-03-14 09:28:38 +00:00
Alexander Ivchenko	327de80529	[GlobalIsel][X86] Support for G_ZEXT instruction Reviewed By: igorb Differential Revision: https://reviews.llvm.org/D44378 llvm-svn: 327482	2018-03-14 09:11:23 +00:00
Robert Widmann	4bb481b2f2	[LLVM-C] Redo unnamed_address attribute bindings Summary: The old bindings should have used an enum instead of a boolean. This deprecates LLVMHasUnnamedAddr and LLVMSetUnnamedAddr , replacing them with LLVMGetUnnamedAddress and LLVMSetUnnamedAddress respectively that do. Though it is unlikely LLVM will gain more supported global value linker hints, the new API can scale to accommodate this. Reviewers: deadalnix, whitequark Reviewed By: whitequark Subscribers: llvm-commits, harlanhaskins Differential Revision: https://reviews.llvm.org/D43448 llvm-svn: 327479	2018-03-14 06:45:51 +00:00
Lang Hames	2d603a1860	[RuntimeDyld] Silence a compiler error. This should fix the error at http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/19008 llvm-svn: 327478	2018-03-14 06:39:49 +00:00
Lang Hames	b2facd6479	[ORC] Fix a data race in the lookup function. The Error locals need to be protected by a mutex. (This could be fixed by having the promises / futures contain Expected and Error values, but MSVC's future implementation does not support this yet). Hopefully this will fix some of the errors seen on the builders due to r327474. llvm-svn: 327477	2018-03-14 06:25:08 +00:00
Lang Hames	313f590aee	[ExecutionEngine] Add a getSymbolTable method to RuntimeDyld. This can be used to extract the symbol table from a RuntimeDyld instance prior to disposing of it. This patch also updates RTDyldObjectLinkingLayer to use the new method, rather than requesting symbols one at a time via getSymbol. llvm-svn: 327476	2018-03-14 06:25:07 +00:00
Lang Hames	d4a768e78f	[ORC] Silence a compiler error. This should fix the builder error at http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/19006 llvm-svn: 327475	2018-03-14 05:23:56 +00:00
Lang Hames	817f1f64d9	[ORC] Add a 'lookup' convenience function for finding symbols in a list of VSOs. The lookup function takes a list of VSOs, a set of symbol names (or just one symbol name) and a materialization function object. It returns an Expected<SymbolMap> (if given a set of names) or an Expected<JITEvaluatedSymbol> (if given just one name). The lookup method constructs an AsynchronousSymbolQuery for the given names, applies that query to each VSO in the list in turn, and then blocks waiting for the query to complete. If threading is enabled then the materialization function object can be used to execute the materialization on different threads. If threading is disabled the MaterializeOnCurrentThread utility must be used. llvm-svn: 327474	2018-03-14 04:18:04 +00:00
Eugene Zemtsov	ccf30835e1	Disable test debuglineinfo-path on powerpc llvm-svn: 327472	2018-03-14 02:10:07 +00:00
Eugene Zemtsov	c6d8fcb168	grep for global functions only llvm-svn: 327469	2018-03-14 01:07:18 +00:00
Matt Arsenault	41e5ac4fa4	TargetMachine: Add address space to getPointerSize llvm-svn: 327467	2018-03-14 00:36:23 +00:00
Eugene Zemtsov	e2d3ce2339	Add extra output/check to debug clang-ppc64be-linux test failure llvm-svn: 327466	2018-03-14 00:15:16 +00:00
Teresa Johnson	30b9f24fb0	Revert "[LTO/gold] Support --wrap" This reverts commit r327459. The new gold plugin interface is not available with older gold installations, leading to compile failures: http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/9109/steps/build-stage2-LLVMgold.so/logs/stdio llvm-svn: 327465	2018-03-13 23:55:34 +00:00
Craig Topper	9ca7e67c4c	[X86] Re-generate test to get proper capitalization of its CHECK lines. NFC llvm-svn: 327462	2018-03-13 23:31:48 +00:00
Aditya Nandakumar	b808e3ad8b	[GISel]: Fix incorrect type used in Pattern Match for ICst getConstantVRegVal() returns int64_t but we use uint64_t. llvm-svn: 327461	2018-03-13 23:21:13 +00:00
Teresa Johnson	e8f3b0782a	[LTO/gold] Support --wrap Summary: Utilize new gold plugin api interface for obtaining --wrap option arguments, and LTO API handling (added for --wrap support in lld LTO), to mark symbols so that LTO does not optimize them inappropriately. Note the test cases will be in a new gold test subdirectory that is dependent on the next release of gold which will contain the new interfaces. Reviewers: pcc, tmsriram Subscribers: mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D44235 llvm-svn: 327459	2018-03-13 23:06:19 +00:00
Eugene Zemtsov	b9d2908cd3	Disable optimizations in debuglineinfo-path test llvm-svn: 327458	2018-03-13 22:48:39 +00:00
Craig Topper	ec4881ad53	[X86] Simplify the LowerAVXCONCAT_VECTORS code a little by creating a single path for insert_subvector handling. We now only create recursive concats if we have more than two non-zero values. This keeps our subvector broadcast DAG combine functioning. llvm-svn: 327457	2018-03-13 22:36:07 +00:00
Craig Topper	cc060e921b	[X86] Rewrite LowerAVXCONCAT_VECTORS similar to how we handle vXi1 concats. This is better able to detect undef and zero pieces in the concat, or cases when only one subvector is non-zero. This allows us to avoid silly things like double inserts into progressively larger undefs. This still builds 512 bit concats of 128 bits by building up through 256 bits first. But I don't know if that's best. We probably want to merge this with the vXi1 concat code since they are very similar. llvm-svn: 327454	2018-03-13 22:05:25 +00:00
Zachary Turner	1ccae9d943	Disable PDB injected sources test temporarily. llvm-svn: 327451	2018-03-13 21:18:00 +00:00
Hiroshi Yamauchi	e6a3dc7699	Simplify more cases of logical ops of masked icmps. Summary: For example, ((X & 255) != 0) && ((X & 15) == 8) -> ((X & 15) == 8). ((X & 7) != 0) && ((X & 15) == 8) -> false. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43835 llvm-svn: 327450	2018-03-13 21:13:18 +00:00
Eugene Zemtsov	f69d07e788	Remove explicit triple and data layout from the test llvm-svn: 327449	2018-03-13 21:10:15 +00:00
Craig Topper	4aeec51986	[DAGCombiner] Allow visitEXTRACT_SUBVECTOR to combine with BUILD_VECTORS between LegalizeVectorOps and LegalizeDAG. BUILD_VECTORs aren't themselves legalized until LegalizeDAG so we should still be able to create an "illegal" one before that. This helps combine with BUILD_VECTORS that are introduced during LegalizeVectorOps due to unrolling. llvm-svn: 327446	2018-03-13 20:36:28 +00:00
Zachary Turner	5cbf567921	Update modulemap to exclude new DIA headers. llvm-svn: 327444	2018-03-13 20:16:37 +00:00
Eugene Zemtsov	d6a1ca761a	Fix debuglineinfo-path.ll This fix is based on an assumption that some build bots are missing 'echo -n' llvm-svn: 327443	2018-03-13 20:06:33 +00:00
Francis Visoiu Mistrih	3abf05739f	[MIR] Allow frame-setup and frame-destroy on the same instruction Nothing prevents us from having both frame-setup and frame-destroy on the same instruction. When merging: * frame-setup OPCODE1 * frame-destroy OPCODE2 into * frame-setup frame-destroy OPCODE3 we want to be able to print and parse both flags. llvm-svn: 327442	2018-03-13 19:53:16 +00:00
Eugene Zemtsov	27c1afbb0b	Temporarily disable debuglineinfo-path.ll to fix build llvm-svn: 327441	2018-03-13 19:48:31 +00:00
Anna Thomas	5ac72f94f3	Test Commit NFC. Updated comment llvm-svn: 327436	2018-03-13 19:38:45 +00:00
Sanjay Patel	bb45cc126d	[x86] add test for WriteZero sched class instructions; NFC Nops should have zero latency because there is no result. Idioms like 'xorps xmm0, xmm0' may have zero latency because they are handled without using an execution unit. llvm-svn: 327435	2018-03-13 19:20:01 +00:00
Haicheng Wu	aee0af3e23	[SLP] clean some formats llvm-svn: 327433	2018-03-13 18:44:19 +00:00
Brian M. Rzycki	252165b27a	[LazyValueInfo] PR33357 prevent infinite recursion on BinaryOperator Summary: It is possible for LVI to encounter instructions that are not in valid SSA form and reference themselves. One example is the following: %tmp4 = and i1 %tmp4, undef Before this patch LVI would recurse until running out of stack memory and crashed. This patch marks these self-referential instructions as Overdefined and aborts analysis on the instruction. Fixes https://bugs.llvm.org/show_bug.cgi?id=33357 Reviewers: craig.topper, anna, efriedma, dberlin, sebpop, kuhar Reviewed by: dberlin Subscribers: uabelho, spatel, a.elovikov, fhahn, eli.friedman, mzolotukhin, spop, evandro, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D34135 llvm-svn: 327432	2018-03-13 18:14:10 +00:00
Zachary Turner	a389c84df5	Implement pure virtual method to fix build. llvm-svn: 327431	2018-03-13 17:58:28 +00:00
Eugene Zemtsov	82d60d6b29	Handle mixed-OS paths in DWARF reader Make sure that DWARF line information generated by Windows can be properly read by Posix OS and vice versa. Differential Revision: https://reviews.llvm.org/D44290 llvm-svn: 327430	2018-03-13 17:54:29 +00:00
Sanjay Patel	c6cbbc899b	[MC] fix documentation comments; NFC llvm-svn: 327429	2018-03-13 17:50:27 +00:00
Zachary Turner	679aeadda1	[PDB] Support dumping injected sources via the DIA reader. Injected sources are basically a way to add actual source file content to your PDB. Presumably you could use this for shipping your source code with your debug information, but in practice I can only find this being used for embedding natvis files inside of PDBs. In order to effectively test LLVM's natvis file injection, we need a way to dump the injected sources of a PDB in a way that is authoritative (i.e. based on Microsoft's understanding of the PDB format, and not LLVM's). To this end, I've added support for dumping injected sources via DIA. I made a PDB file that used the /natvis option to generate a test case. Differential Revision: https://reviews.llvm.org/D44405 llvm-svn: 327428	2018-03-13 17:46:06 +00:00
Simon Dardis	e5f72dd5e1	Revert "[mips] Guard traps for microMIPS correctly" This appears to have broken the expensive checks bot in a strange fashion. Reverting until I can investigate. This reverts r327409. llvm-svn: 327427	2018-03-13 17:31:11 +00:00
Andrea Di Biagio	b52297508e	[llvm-mca] Remove the logic that computes the reciprocal throughput, and make the SummaryView independent from the Backend. NFCI Since r327420, the tool can query the MCSchedModel interface to obtain the reciprocal throughput information. As a consequence, method `ResourceManager::getRThroughput`, and method `Backend::getRThroughput` are no longer needed. This patch simplifies the code by removing the custom RThroughput computation. This patch also refactors class SummaryView by removing the dependency with the Backend object. No functional change intended. llvm-svn: 327425	2018-03-13 17:24:32 +00:00
Simon Pilgrim	9855b39380	[DAGCombine] visitREM - Don't assume that one divrem isn't driving another Under some circumstances the divrems won't have been combined together before getting to this code. So replace the assertion with an if() guard to not expand to X-((X/C)*C) to give the other combine a chance to happen. Reduced from OSS-Fuzz #6883 https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6883 llvm-svn: 327424	2018-03-13 17:17:15 +00:00
Azharuddin Mohammed	8afcd938ed	Build system changes for RISCV Summary: Build system changes for RISCV. Makes it possible to build just the RISCV target alone. Reviewers: asb, apazos, mgrang, beanz Reviewed By: asb Subscribers: mgorny, kito-cheng, shiva0217, llvm-commits Differential Revision: https://reviews.llvm.org/D44153 llvm-svn: 327423	2018-03-13 17:04:33 +00:00
Brian Homerding	d5c558ff21	[lit] - Allow 1 test to report multiple micro-test results to provide support for microbenchmarks. Summary: These changes allow a Result object to have nested Result objects in order to support microbenchmarks. Currently lit is restricted to reporting one result object for one test; this change provides support for tests that want to report individual timings for individual kernels. This revision is the result of the discussions in https://reviews.llvm.org/D32272#794759, https://reviews.llvm.org/D37421#f8003b27 and https://reviews.llvm.org/D38496. It is a separation of the changes proposed in https://reviews.llvm.org/D40077. This change will enable adding LCALS (Livermore Compiler Analysis Loop Suite) collection of loop kernels to the llvm test suite using the google benchmark library (https://reviews.llvm.org/D43319) with tracking of individual kernel timings. Previously microbenchmarks had been handled by using macros to section groups of microbenchmarks together and build many executables while still getting a grouped timing (MultiSource/TSVC). Recently the google benchmark library was added to the test suite and utilized with a litsupport plugin. However the limitation of 1 test 1 result limited its use to passing a runtime option to run only 1 microbenchmark with several hand written tests (MicroBenchmarks/XRay). This runs the same executable many times with different hand-written tests. I will update the litsupport plugin to utilize the new functionality (https://reviews.llvm.org/D43316). These changes allow lit to report micro test results if desired in order to get many precise timing results from 1 run of 1 test executable. Reviewers: MatzeB, hfinkel, rengolin, delcypher Differential Revision: https://reviews.llvm.org/D43314 llvm-svn: 327422	2018-03-13 16:37:59 +00:00
Daniel Neilson	5182113f07	[SelectionDAGBuilder] Replace deprecated calls to MemoryIntrinsic::getAlignment() (NFCI) Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the SelectionDAGBuilder to cease using the old getAlignment() API of MemoryIntrinsic in favour of getting source & dest specific alignments through the new API. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. ( rL323886, rL323891, rL324148, rL324273, rL324278, rL324384, rL324395, rL324402, rL324626, rL324642, rL324653, rL324654, rL324773, rL324774, rL324781, rL324784, rL324955, rL324960, rL325816, rL327398 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 327421	2018-03-13 16:31:19 +00:00
Andrea Di Biagio	7faea7cb53	[MC] Move the reciprocal throughput computation from TargetSchedModel to MCSchedModel. The goal is to make the reciprocal throughput computation accessible through the MCSchedModel interface. This is particularly important for llvm-mca because it can only query the MCSchedModel interface. No functional change intended. Differential Revision: https://reviews.llvm.org/D44392 llvm-svn: 327420	2018-03-13 16:28:55 +00:00
Craig Topper	7e711a6822	[X86] Remove SplitBinaryOpsAndApply and use SplitOpsAndApply by adding curly braces around the ops. Summary: Unless you were intentionally avoiding this syntax? I saw you mentioned makeArrayRef in your commit that added SplitOpsAndApply. Reviewers: RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44403 llvm-svn: 327418	2018-03-13 16:23:27 +00:00
Andrea Di Biagio	c95a130105	[llvm-mca] Simplify code that computes the latency of an instruction in InstrBuilder. NFCI This was possible because of r327406, which added function`computeInstrLatency` to MCSchedModel. llvm-svn: 327415	2018-03-13 15:59:59 +00:00
Brock Wyma	f52e192293	Revert r327397 [CodeView] Omit forward references for unnamed structs and ... This reverts commit r327397 to investigate a buildbot failure. llvm-svn: 327414	2018-03-13 15:56:20 +00:00

161757 Commits