llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	95ec4a4dfe	[InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X As shown in the code comment, we don't need all of 'fast', but we do need reassoc + nsz + nnan. Differential Revision: https://reviews.llvm.org/D43765 llvm-svn: 327796	2018-03-18 14:12:25 +00:00
Simon Pilgrim	541992203d	[X86][Btver2] Strip default latency/resource values. NFCI. llvm-svn: 327795	2018-03-18 13:16:11 +00:00
Simon Pilgrim	40f6d6ad0b	[X86][Btver2] SSE4A EXTRQ/INSERTQ instructions are performed on the JVALU0/JVALU1 functional pipes llvm-svn: 327794	2018-03-18 13:05:09 +00:00
Simon Pilgrim	e16790b133	[X86][Btver2] Modelled float bitwise instructions as being performed on the float cluster (FPA/FPM) not the integer. llvm-svn: 327793	2018-03-18 12:37:35 +00:00
Jonas Devlieghere	163326d10c	[dsymutil] Fix add_llvm_tool_symlink Update the arguments to add_llvm_tool_symlink to symlink llvm-dsymutil to dsymutil. llvm-svn: 327792	2018-03-18 12:27:05 +00:00
Simon Pilgrim	e409f84e7e	[X86][Btver2] Correctly distinguish between scheduling pipe and functional unit for JWriteResFpuPair defs Jaguar's FPU has 2 scheduler pipes (JFPU0/JFPU1) which forward to multiple functional sub-units each. We need to model that an micro-op will both consume the scheduler pipe and a functional unit. This patch just handles the ops defined through JWriteResFpuPair, I'll go through the custom cases later. llvm-svn: 327791	2018-03-18 12:09:17 +00:00
Jonas Devlieghere	a6ef1abc09	[dsymutil] Rename llvm-dsymutil -> dsymutil Now that almost all functionality of Apple's dsymutil has been upstreamed, the open source variant can be used as a drop in replacement. Hence we feel it's no longer necessary to have the llvm prefix. Differential revision: https://reviews.llvm.org/D44527 llvm-svn: 327790	2018-03-18 11:38:41 +00:00
Simon Pilgrim	f86d48b3ae	[X86][Btver2] Merge equivalent VBLENDVY + VPERMILY schedule groups Thanks to Craig Topper for noticing this. llvm-svn: 327789	2018-03-18 10:22:35 +00:00
Simon Pilgrim	0ba4a0f3a6	[X86][Btver2] Add llvm-mca tests to show pipe resource usage of most vector instructions Hopefully these tests can be easily reused should any other subtarget get in depth llvm-mca coverage (we can either copy the tests or move them into a common dir and run it with multiple prefixes). llvm-svn: 327788	2018-03-18 09:32:38 +00:00
Craig Topper	2d451e73f9	[X86] Fix a bunch of overlapping regular expressions in the scheduler models. llvm-svn: 327787	2018-03-18 08:38:06 +00:00
Craig Topper	86b02cf076	[X86] Fix a couple typos in the Zen scheduler model. llvm-svn: 327786	2018-03-18 08:38:04 +00:00
Craig Topper	93dd77d2dc	[TableGen] Remove unnecessary uses of make_range. llvm-svn: 327785	2018-03-18 08:38:03 +00:00
Craig Topper	7f31e735c9	[TableGen] Move some variables into for loop declaration. NFC They aren't needed after the loop. llvm-svn: 327784	2018-03-18 08:38:02 +00:00
Craig Topper	89dcda3e90	[X86] Remove MMX_MASKMOVQ64 and VMASKMOVDQU from scheduler models. The information was so wildly inaccurate and incomplete its better to just remove it. MMX_MASKMOVQ64 showed up twice in several scheduler models. In Haswell and Broadwell they were on adjacent lines. On Skylake the copies had different information. MMX_MASKMOVQ and MASKMOVDQU were completely missing. MMX_MASKMOVQ64 was listed on Haswell/Broadwell as 1 cycle on port 1 despite it being a store instruction. Filed PR36780 to track fixing this right. llvm-svn: 327783	2018-03-18 03:24:42 +00:00
Martin Storsjo	36d6419cc5	[AArch64] Skip an unnecessary getCopyToReg in DYNAMIC_STACKALLOC Differential Revision: https://reviews.llvm.org/D44586 llvm-svn: 327779	2018-03-17 20:08:48 +00:00
Nirav Dave	5f0ab71b62	Revert "[DAG, X86] Revert r327197 "Revert r327170, r327171, r327172"" as it times out building test-suite on PPC. llvm-svn: 327778	2018-03-17 19:24:54 +00:00
Nirav Dave	982d3a56ea	[DAG, X86] Revert r327197 "Revert r327170, r327171, r327172" Reland ISel cycle checking improvements after simplifying and reducing node id invariant traversal. llvm-svn: 327777	2018-03-17 17:42:10 +00:00
Sylvestre Ledru	543f15b028	Fix some user facing typos llvm-svn: 327776	2018-03-17 17:30:08 +00:00
Matt Arsenault	abdc4f2dc7	AMDGPU/GlobalISel: Cleanup constant legality llvm-svn: 327774	2018-03-17 15:17:48 +00:00
Matt Arsenault	685d1e8157	AMDGPU/GlobalISel: Basic G_GEP legality llvm-svn: 327773	2018-03-17 15:17:45 +00:00
Matt Arsenault	85803366d6	AMDGPU/GlobalISel: Basic legality for load/store llvm-svn: 327772	2018-03-17 15:17:41 +00:00
Chandler Carruth	7e71129be4	[bindings/go] Add a missing `,` in the test code to fix a go compile failure. llvm-svn: 327771	2018-03-17 15:12:52 +00:00
Oren Ben Simhon	fdd72fd522	[X86] Added support for nocf_check attribute for indirect Branch Tracking X86 Supports Indirect Branch Tracking (IBT) as part of Control-Flow Enforcement Technology (CET). IBT instruments ENDBR instructions used to specify valid targets of indirect call / jmp. The `nocf_check` attribute has two roles in the context of X86 IBT technology: 1. Appertains to a function - do not add ENDBR instruction at the beginning of the function. 2. Appertains to a function pointer - do not track the target function of this pointer by adding nocf_check prefix to the indirect-call instruction. This patch implements `nocf_check` context for Indirect Branch Tracking. It also auto generates `nocf_check` prefixes before indirect branchs to jump tables that are guarded by range checks. Differential Revision: https://reviews.llvm.org/D41879 llvm-svn: 327767	2018-03-17 13:29:46 +00:00
Jonas Paulsson	dbcf1bf503	[SystemZ] Add 'REQUIRES: asserts' to test case using debug output. llvm-svn: 327766	2018-03-17 09:15:13 +00:00
Jonas Paulsson	138960770c	[SystemZ] computeKnownBitsForTargetNode() / ComputeNumSignBitsForTargetNode() Improve/implement these methods to improve DAG combining. This mainly concerns intrinsics. Some constant operands to SystemZISD nodes have been marked Opaque to avoid transforming back and forth between generic and target nodes infinitely. Review: Ulrich Weigand llvm-svn: 327765	2018-03-17 08:32:12 +00:00
Jonas Paulsson	e9f7fa83d5	[SelectionDAG] Handle big endian target BITCAST in computeKnownBits() The BITCAST handling in computeKnownBits() previously only worked for little endian. This patch reverses the iteration over elements for a big endian target which allows this to work in this case also. SystemZ test case. Review: Eli Friedman https://reviews.llvm.org/D44249 llvm-svn: 327764	2018-03-17 08:04:00 +00:00
Chandler Carruth	196a9fab82	[GlobalsAA] Fix a pretty terrible bug that has been in GlobalsAA for a long time. The key thing is that we need to create value handles for every function that we create a `FunctionInfo` object around. Without this, when that function is deleted we can end up creating a new function that collides with its address and look up a stale AA result. With that AA result we can in turn miscompile code in ways that break. This is seriously one of the most absurd miscompiles I've seen. It only reproduced for us recently and only when building a very large server with both ThinLTO and PGO. A HUGE shout out to Wei Mi who tracked all of this down and came up with this patch. I'm just landing it because I happened to still by at a computer. He or I can work on crafting a test case to hit this (now that we know what to target) but it'll take a while, and we've been chasing this for a long time and need it fix Right Now. llvm-svn: 327761	2018-03-16 23:51:33 +00:00
Jessica Paquette	b3e7dc9144	[MachineOutliner] Make KILLs invisible At the point the outliner runs, KILLs don't impact anything, but they're still considered unique instructions. This commit makes them invisible like DebugValues so that they can still be outlined without impacting outlining decisions. llvm-svn: 327760	2018-03-16 22:53:34 +00:00
Andrea Di Biagio	09771ad2ca	[llvm-mca] Remove method getSchedModel() from the Backend. llvm-svn: 327756	2018-03-16 22:21:52 +00:00
Andrea Di Biagio	f6766b0e45	[llvm-mca] Remove unused methods from Backend. NFC llvm-svn: 327749	2018-03-16 22:02:47 +00:00
David L Kreitzer	febf70a9be	Quiet unused variable warnings. NFC. Differential revision: https://reviews.llvm.org/D44583 llvm-svn: 327745	2018-03-16 21:21:23 +00:00
Craig Topper	25007c4f32	[X86] Pass SelectionDAG into X86ISelAddressMode::dump and on to SDNode::dump. This prevents a crash in SelectionDAGDumper with -debug when trying to print mem operands if one of the registers in the addressing mode comes from a load. llvm-svn: 327744	2018-03-16 21:10:07 +00:00
Sanjay Patel	5a5c33d8b5	[InstSimplify] add NaN constant diversity; NFC llvm-svn: 327743	2018-03-16 20:55:55 +00:00
Krzysztof Parzyszek	f81a8d03c1	[Hexagon] Avoid bank conflicts in post-RA scheduler Avoid scheduling two loads in such a way that they would end up in the same packet. If there is a load in a packet, try to schedule a non-load next. Patch by Brendon Cahoon. llvm-svn: 327742	2018-03-16 20:55:49 +00:00
Krzysztof Parzyszek	889cbcacbc	[Hexagon] Add lit testcases for atomic intrinsics Patch by Ben Craig. llvm-svn: 327737	2018-03-16 20:21:43 +00:00
Reid Kleckner	f8b51c5f90	[IR] Avoid the need to prefix MS C++ symbols with '\01' Now the Windows mangling modes ('w' and 'x') do not do any mangling for symbols starting with '?'. This means that clang can stop adding the hideous '\01' leading escape. This means LLVM debug logs are less likely to contain ASCII escape characters and it will be easier to copy and paste MS symbol names from IR. Finally. For non-Windows platforms, names starting with '?' still get IR mangling, so once clang stops escaping MS C++ names, we will get extra '_' prefixing on MachO. That's fine, since it is currently impossible to construct a triple that uses the MS C++ ABI in clang and emits macho object files. Differential Revision: https://reviews.llvm.org/D7775 llvm-svn: 327734	2018-03-16 20:13:32 +00:00
Reid Kleckner	2aeb930a9f	Revert r327721 "This patch fixes the invalid usage of OptSize in Machine Combiner." It causes asserts when compiling Chromium on Win32 with optimizations. We compile many things with -Os. llvm-svn: 327733	2018-03-16 20:11:55 +00:00
Craig Topper	f0815e01d8	[X86] Merge ADDSUB/SUBADD detection into single methods that can detect either and indicate what they found. Previously, we called the same functions twice with a bool flag determining whether we should look for ADDSUB or SUBADD. It would be more efficient to run the code once and detect either pattern with a flag to tell which type it found. Differential Revision: https://reviews.llvm.org/D44540 llvm-svn: 327730	2018-03-16 18:25:59 +00:00
Craig Topper	71d69b2ea5	[CorrelatedValuePropagation] Use SelectInst::getCondition/getTrueValue/getFalseValue instead of getOperand for readability. NFC llvm-svn: 327728	2018-03-16 18:18:47 +00:00
Farhana Aleen	c6c9dc8773	[AMDGPU] Supported ds_write_b128 generation. Summary: This is a follow-on patch of https://reviews.llvm.org/D44210 Author: FarhanaAleen Reviewed By: msearles Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D44319 llvm-svn: 327726	2018-03-16 18:12:00 +00:00
Craig Topper	e6913ec340	[X86] Post process the DAG after isel to remove vector moves that were added to zero upper bits. We previously avoided inserting these moves during isel in a few cases which is implemented using a whitelist of opcodes. But it's too difficult to generate a perfect list of opcodes to whitelist. Especially with AVX512F without AVX512VL using 512 bit vectors to implement some 128/256 bit operations. Since isel is done bottoms up, we'd have to check the VT and opcode and subtarget in order to determine whether an EXTRACT_SUBREG would be generated for some operations. So instead of doing that, this patch adds a post processing step that detects when the moves are unnecesssary after isel. At that point any EXTRACT_SUBREGs would have already been created and appear in the DAG. So then we just need to ensure the input to the move isn't one. Differential Revision: https://reviews.llvm.org/D44289 llvm-svn: 327724	2018-03-16 17:13:42 +00:00
Dmitry Preobrazhensky	4c8f4234b6	[AMDGPU][MC][GFX8][GFX9][DISASSEMBLER] Added "_e32" suffix to 32-bit VINTRP opcodes See bug 36751: https://bugs.llvm.org/show_bug.cgi?id=36751 Differential Revision: https://reviews.llvm.org/D44529 Reviewers: artem.tamazov, arsenm llvm-svn: 327723	2018-03-16 16:38:04 +00:00
Philip Reames	8a106272e8	[LICM/mustexec] Extend first iteration must execute logic to fcmps This builds on the work from https://reviews.llvm.org/D44287. It turned out supporting fcmp was much easier than I realized, so let's do that now. As an aside, our -O3 handling of a floating point IVs leaves a lot to be desired. We do convert the float IV to an integer IV, but do so late enough that many other optimizations are missed (e.g. we don't vectorize). Differential Revision: https://reviews.llvm.org/D44542 llvm-svn: 327722	2018-03-16 16:33:49 +00:00
Andrew V. Tischenko	a0cd09d4a2	This patch fixes the invalid usage of OptSize in Machine Combiner. Differential Revision: https://reviews.llvm.org/D43813 llvm-svn: 327721	2018-03-16 16:06:24 +00:00
Dmitry Preobrazhensky	9c1a6e7e24	[AMDGPU][MC] Corrected default values for unused SDWA operands See bug 36355: https://bugs.llvm.org/show_bug.cgi?id=36355 Differential Revision: https://reviews.llvm.org/D44481 Reviewers: artem.tamazov, arsenm llvm-svn: 327720	2018-03-16 15:40:27 +00:00
Sanjay Patel	2b94927f0d	[InstCombine] add nnan requirement to potential fabs folds tests; NFC As noted in D44550, we can't guarantee preserving the sign-bit of NaN if we convert these to fabs(). llvm-svn: 327718	2018-03-16 15:27:39 +00:00
Jonas Paulsson	a9f05a9d50	[SystemZ] Make AnyRegBitRegClass unallocatable. AnyReg is just for the assembler and it is better to have it as not allocatable in order to simplify (make more intuitive) the RegPressureSets. Review: Ulrich Weigand llvm-svn: 327715	2018-03-16 15:21:26 +00:00
Aditya Nandakumar	573102e344	[GISel]: Remove unused header include in MachineIRBuilder.h llvm-svn: 327714	2018-03-16 15:14:18 +00:00
Brian M. Rzycki	f65ddc5fa2	[JumpThreading] Track unreachable BBs to avoid processing JumpThreading iterates over F until the IR quiesces. Transforming unreachable BBs increases compile time and it is also possible to never stabilize causing JumpThreading to hang. An older attempt at fixing this problem was D3991 where removeUnreachableBlocks(F) was called before JumpThreading began. This has a few drawbacks: * expensive - the routine attempts to fix up the IR to identify additional BBs that can be removed along with unreachable BBs. * aggressive - does not identify and preserve the shape of the IR. At a minimum it does not preserve loop hierarchies. * invasive - altering reachable blocks it may disrupt IR shapes that could have otherwise been JumpThreaded. This patch avoids removeUnreachableBlocks(F) and instead tracks unreachable BBs in a SmallPtrSet using DominatorTree to validate the initial state of all BBs. We then rely on subsequent passes to identify and remove these unreachable blocks from F. Reviewers: dberlin, sebpop, kuhar, dinesh.d Reviewed by: sebpop, kuhar Subscribers: hiraditya, uabelho, llvm-commits Differential Revision: https://reviews.llvm.org/D44177 llvm-svn: 327713	2018-03-16 15:13:47 +00:00
Krzysztof Parzyszek	9915291ab8	[Hexagon] Fix zero-extending non-HVX bool vectors llvm-svn: 327712	2018-03-16 15:03:37 +00:00

1 2 3 4 5 ...

161524 Commits