llvm-project

Commit Graph

Author	SHA1	Message	Date
Sergey Dmitriev	afd612ece9	[NFC] Avoid passing blocks vector to the OutlineRegionInfo constructor by value. Reviewers: vsk, fhahn, davidxl Reviewed By: vsk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57957 llvm-svn: 353582	2019-02-08 23:52:15 +00:00
Francis Visoiu Mistrih	8bc57953b7	Re-apply r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" With a fix after r353563 that adds some more opcodes. llvm-svn: 353579	2019-02-08 23:34:11 +00:00
Francis Visoiu Mistrih	decba8aa06	Revert r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" This reverts commit r353553. This breaks CodeGen/AArch64/GlobalISel/legalize-ext-csedebug-output.mir: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/57963/console llvm-svn: 353575	2019-02-08 22:49:43 +00:00
Craig Topper	fcb63c4c6c	[X86] Add FPCW as an implicit use on floating point load instructions. These instructions can generate a stack overflow exception so technically they read the stack overflow exception mask bit. llvm-svn: 353564	2019-02-08 20:50:09 +00:00
Craig Topper	784929d045	Implementation of asm-goto support in LLVM This patch accompanies the RFC posted here: http://lists.llvm.org/pipermail/llvm-dev/2018-October/127239.html This patch adds a new CallBr IR instruction to support asm-goto inline assembly like gcc as used by the linux kernel. This instruction is both a call instruction and a terminator instruction with multiple successors. Only inline assembly usage is supported today. This also adds a new INLINEASM_BR opcode to SelectionDAG and MachineIR to represent an INLINEASM block that is also considered a terminator instruction. There will likely be more bug fixes and optimizations to follow this, but we felt it had reached a point where we would like to switch to an incremental development model. Patch by Craig Topper, Alexander Ivchenko, Mikhail Dvoretckii Differential Revision: https://reviews.llvm.org/D53765 llvm-svn: 353563	2019-02-08 20:48:56 +00:00
Vedant Kumar	0e5dd512aa	[CodeExtractor] Restore outputs after creating exit stubs When CodeExtractor saves the result of InvokeInst at the first insertion point of the 'normal destination' basic block, this block can be omitted in the outlined region, so store is placed outside of the function. The suggested solution is to process saving outputs after creating exit stubs for new function, and stores will be placed in that blocks before return in this case. Patch by Sergei Kachkov! Fixes llvm.org/PR40455. Differential Revision: https://reviews.llvm.org/D57919 llvm-svn: 353562	2019-02-08 20:48:04 +00:00
Matt Arsenault	564f0f832c	AMDGPU: Eliminate GPU specific SubtargetFeatures Inline compatability is determined from the individual feature bits. These are just sets of the separate features, but will always be treated as incompatible unless they are specifically ignored. Defining the ISA version number here in tablegen would be nice, but it turns out this wasn't actually used. llvm-svn: 353558	2019-02-08 19:59:32 +00:00
Nemanja Ivanovic	92a8c36735	[DAGCombine] Optimize pow(X, 0.75) to sqrt(X) * sqrt(sqrt(X)) The sqrt case is faster and we already do this for the case where the exponent is 0.25. This adds the 0.75 case which is also not sensitive to signed zeros. Patch by Whitney Tsang (Whitney) Differential revision: https://reviews.llvm.org/D57434 llvm-svn: 353557	2019-02-08 19:50:58 +00:00
Aditya Nandakumar	01e818a97d	[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder https://reviews.llvm.org/D57932 Add some logging + tests to make sure CSEInfo prints debug output. reviewed by: arsenm llvm-svn: 353553	2019-02-08 19:41:13 +00:00
Matt Arsenault	d7047276ec	AMDGPU: Remove GCN features and predicates These are no longer necessary since the R600 tablegen files are split out now. llvm-svn: 353548	2019-02-08 19:18:01 +00:00
Reid Kleckner	987d331fab	[InstrProf] Implement static profdata registration Summary: The motivating use case is eliminating duplicate profile data registered for the same inline function in two object files. Before this change, users would observe multiple symbol definition errors with VC link, but links with LLD would succeed. Users (Mozilla) have reported that PGO works well with clang-cl and LLD, but when using LLD without this static registration, we would get into a "relocation against a discarded section" situation. I'm not sure what happens in that situation, but I suspect that duplicate, unused profile information was retained. If so, this change will reduce the size of such binaries with LLD. Now, Windows uses static registration and is in line with all the other platforms. Reviewers: davidxl, wmi, inglorion, void, calixte Subscribers: mgorny, krytarowski, eraman, fedor.sergeev, hiraditya, #sanitizers, dmajor, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D57929 llvm-svn: 353547	2019-02-08 19:03:50 +00:00
Simon Pilgrim	eb6a47a462	[TargetLowering] Use ISD::FSHR in expandFixedPointMul Replace OR(SHL,SRL) pattern with ISD::FSHR (legalization expands this later if necessary) - this helps with the scale == 0 'undefined' drop-through case that was discussed on D55720. llvm-svn: 353546	2019-02-08 18:57:38 +00:00
Simon Pilgrim	478bb90779	[TargetLowering] Add SimplifyDemandedBits funnel shift support llvm-svn: 353539	2019-02-08 17:19:01 +00:00
Teresa Johnson	3ce8112dad	ArgumentPromotion should copy all metadata to new Function Summary: ArgumentPromotion had code to specifically move the dbg metadata over to the new function, but other metadata such as the function_entry_count !prof metadata was not. Replace code that moved dbg metadata with a call to copyMetadata. The old metadata is automatically removed when the old Function is removed. Reviewers: davidxl Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57846 llvm-svn: 353537	2019-02-08 17:08:27 +00:00
Craig Topper	41a1792b15	[X86] Remove isReMaterializable from X87 floating point constant loads and constant pool loads. Summary: These instructions update FPSW so they aren't generically safe to rematerialize into any location if FPSW is live for a comparison result. They also use FPCW for exception masking control. Though the only exception they can generate is stack overflow and we manage the stack ourselves so that's not really going to occur. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57934 llvm-svn: 353536	2019-02-08 17:07:54 +00:00
Sanjay Patel	e9cc26a56a	[x86] fix formatting; NFC (test commit #2 migrating to git) llvm-svn: 353533	2019-02-08 16:48:40 +00:00
Carl Ritson	494b8ac95a	[AMDGPU] Fix CS scratch setup on pre-GCN3 ASICs Summary: Prior to GCN3 s_load_dword offsets are in dwords rather than bytes. Thus the scratch buffer descriptor offset must be adjusted for pre-GCN3 ASICs. Reviewers: nhaehnle, tpr Reviewed By: nhaehnle Subscribers: sheredom, arsenm, kzhuravl, jvesely, wdng, yaxunl, dstuttard, t-tye, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D56496 llvm-svn: 353530	2019-02-08 15:41:11 +00:00
Nirav Dave	97011ccce0	Revert r353416 "[DAG] Cleanup unused nodes on failed store-to-load forward combine." This cleanup causes out-of-tree crashes. llvm-svn: 353527	2019-02-08 15:21:13 +00:00
Matt Arsenault	b0a227049f	AMDGPU/GlobalISel: Fix shift legalization for non-power-of-2 clampScalar doesn't do anything for non-power-of-2 in range. There should probably be a combination rule to reduce the number of matching rules. llvm-svn: 353526	2019-02-08 15:06:24 +00:00
Dmitry Preobrazhensky	942c273d64	[AMDGPU][MC] Added support of lds_direct operand See bug 39293: https://bugs.llvm.org/show_bug.cgi?id=39293 Reviewers: artem.tamazov, rampitec Differential Revision: https://reviews.llvm.org/D57889 llvm-svn: 353524	2019-02-08 14:57:37 +00:00
Matt Arsenault	0f2debb1c2	AMDGPU/GlobalISel: Fix non-power-of-2 implicit_def llvm-svn: 353522	2019-02-08 14:46:27 +00:00
Petar Avramovic	c98b26d326	[MIPS GlobalISel] Select any extending load and truncating store Make behavior of G_LOAD in widenScalar same as for G_ZEXTLOAD and G_SEXTLOAD. That is perform widenScalarDst to size given by the target and avoid additional checks in common code. Targets can reorder or add additional rules in LegalizeRuleSet for the opcode to achieve desired behavior. Select extending load that does not have specified type of extension into zero extending load. Select truncating store that stores number of bytes indicated by size in MachineMemoperand. Differential Revision: https://reviews.llvm.org/D57454 llvm-svn: 353520	2019-02-08 14:27:23 +00:00
Matt Arsenault	dc88a2ce35	AMDGPU/GlobalISel: Don't use a copy in addrspacecast lowering llvm-svn: 353516	2019-02-08 14:16:11 +00:00
Dmitry Preobrazhensky	62a0318dff	[AMDGPU][MC][CODEOBJECT] Added predefined symbols to access GPU minor and stepping numbers Added the following Code Object v3 symbols: .amdgcn.gfx_generation_minor .amdgcn.gfx_generation_stepping Reviewers: artem.tamazov, kzhuravl Differential Revision: https://reviews.llvm.org/D57826 llvm-svn: 353515	2019-02-08 13:51:31 +00:00
Valery Pykhtin	7fe97f8c7c	[AMDGPU] Fix DPP combiner Differential revision: https://reviews.llvm.org/D55444 dpp move with uses and old reg initializer should be in the same BB. bound_ctrl:0 is only considered when bank_mask and row_mask are fully enabled (0xF). Otherwise the old register value is checked for identity. Added add, subrev, and, or instructions to the old folding function. Kill flag is cleared for the src0 (DPP register) as it may be copied into more than one user. The pass is still disabled by default. llvm-svn: 353513	2019-02-08 11:59:48 +00:00
Carlos Alberto Enciso	08dc50f2fb	[DWARF] LLVM ERROR: Broken function found, while removing Debug Intrinsics. Check that when SimplifyCFG is flattening a 'br', all their debug intrinsic instructions are removed, including any dbg.label referencing a label associated with the basic blocks being removed. Differential Revision: https://reviews.llvm.org/D57444 llvm-svn: 353511	2019-02-08 10:57:26 +00:00
Hans Wennborg	f5db715862	Revert r353424 "[llvm-ar][libObject] Fix relative paths when nesting thin archives." This broke the Chromium build on Windows, see https://crbug.com/930058 > Summary: > When adding one thin archive to another, we currently chop off the relative path to the flattened members. For instance, when adding `foo/child.a` (which contains `x.txt`) to `parent.a`, whe > lattening it we should add it as `foo/x.txt` (which exists) instead of `x.txt` (which does not exist). > > As a note, this also undoes the `IsNew` parameter of handling relative paths in r288280. The unit test there still passes. > > This was reported as part of testing the kernel build with llvm-ar: https://patchwork.kernel.org/patch/10767545/ (see the second point). > > Reviewers: mstorsjo, pcc, ruiu, davide, david2050 > > Subscribers: hiraditya, llvm-commits > > Tags: #llvm > > Differential Revision: https://reviews.llvm.org/D57842 This reverts commit `bf990ab5aa`. llvm-svn: 353507	2019-02-08 10:16:45 +00:00
Petar Avramovic	56dc218dc1	[MIPS GlobalISel] Select mul Legalize and select G_MUL for s32 and smaller types for MIPS32. Differential Revision: https://reviews.llvm.org/D57816 llvm-svn: 353506	2019-02-08 10:11:33 +00:00
Max Kazantsev	6b63d3a277	[LoopSimplifyCFG] Use DTU.applyUpdates instead of insert/deleteEdge `insert/deleteEdge` methods in DTU can make updates incorrectly in some cases (see https://bugs.llvm.org/show_bug.cgi?id=40528), and it is recommended to use `applyUpdates` methods instead when it is needed to make a mass update in CFG. Differential Revision: https://reviews.llvm.org/D57316 Reviewed By: kuhar llvm-svn: 353502	2019-02-08 08:12:41 +00:00
Sam Parker	5b09834bc3	[ARM] Add OptMinSize to ARMSubtarget In many places in the backend, we like to know whether we're optimising for code size and this is performed by checking the current machine function attributes. A subtarget is created on a per-function basis, so it's possible to know when we're compiling for code size on construction so record this in the new object. Differential Revision: https://reviews.llvm.org/D57812 llvm-svn: 353501	2019-02-08 07:57:42 +00:00
Sergey Dmitriev	807960e6ef	[CodeExtractor] Update function's assumption cache after extracting blocks from it Summary: Assumption cache's self-updating mechanism does not correctly handle the case when blocks are extracted from the function by the CodeExtractor. As a result function's assumption cache may have stale references to the llvm.assume calls that were moved to the outlined function. This patch fixes this problem by removing extracted llvm.assume calls from the function’s assumption cache. Reviewers: hfinkel, vsk, fhahn, davidxl, sanjoy Reviewed By: hfinkel, vsk Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57215 llvm-svn: 353500	2019-02-08 06:55:18 +00:00
Heejin Ahn	df6770f0c9	[WebAssembly] Fix parseImmediate's memory alignment requirement This fixes the current failure in the x86-64 ubsan bot caused by r353496. llvm-svn: 353499	2019-02-08 04:06:56 +00:00
Matt Arsenault	a8b4339c2f	AMDGPU/GlobalISel: Legalize addrspacecast Use a placeholder constant for now on targets that need the load from the queue ptr. llvm-svn: 353497	2019-02-08 02:40:47 +00:00
Wouter van Oortmerssen	0d9f3f7f95	[WebAssembly] Fixed Disassembler ignoring endian swap on big endian. Summary: This fixes: https://bugs.llvm.org/show_bug.cgi?id=40620 Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57933 llvm-svn: 353496	2019-02-08 01:43:23 +00:00
Craig Topper	738180cc7f	Fix the lowering issue of intrinsics llvm.localaddress on X86 Patch by Yuanke Luo Reviewers: craig.topper, annita.zhang, smaslov, rnk, wxiao3 Reviewed By: rnk Subscribers: efriedma, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57501 llvm-svn: 353492	2019-02-08 01:14:12 +00:00
Craig Topper	c782f18835	[X86] Add FPCW as a register and start using it as an implicit use on floating point instructions. Summary: FPCW contains the rounding mode control which we manipulate to implement fp to integer conversion by changing the roudning mode, storing the value to the stack, and then changing the rounding mode back. Because we didn't model FPCW and its dependency chain, other instructions could be scheduled into the middle of the sequence. This patch introduces the register and adds it as an implciit def of FLDCW and implicit use of the FP binary arithmetic instructions and store instructions. There are more instructions that need to be updated, but this is a good start. I believe this fixes at least the reduced test case from PR40529. Reviewers: RKSimon, spatel, rnk, efriedma, andrew.w.kaylor Subscribers: dim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57735 llvm-svn: 353489	2019-02-08 00:44:39 +00:00
Eli Friedman	29c0609301	[AArch64] Fix condition for "high-vector" DUP optimizations. AArch64 NEON has a bunch of instructions with a "2" suffix that extract the top half of the source vectors, instead of the bottom half. We have some DAGCombines to try to take advantage of that. However, they assumed that any EXTRACT_VECTOR was extracting the high half of the vector in question. This issue has apparently existed since the AArch64 backend was merged. Fixes https://bugs.llvm.org/show_bug.cgi?id=40632 . Differential Revision: https://reviews.llvm.org/D57862 llvm-svn: 353486	2019-02-08 00:23:35 +00:00
Petar Jovanovic	3cfcd75453	[mips][micromips] Fix how values in .gcc_except_table are calculated When a landing pad is calculated in a program that is compiled for micromips with -fPIC flag, it will point to an even address. Such an error will cause a segmentation fault, as the instructions in micromips are aligned on odd addresses. This patch sets the last bit of the offset where a landing pad is, to 1, which will effectively be an odd address and point to the instruction exactly. r344591 fixed this issue for -static compilation. Patch by Aleksandar Beserminji. Differential Revision: https://reviews.llvm.org/D57677 llvm-svn: 353480	2019-02-07 22:57:33 +00:00
Sanjay Patel	81f859d169	[x86] fix formatting; NFC llvm-svn: 353477	2019-02-07 22:36:55 +00:00
Dan Gohman	29874cea31	[WebAssembly] Fix imported function symbol names that differ from their import names in the .o format Add a flag to allow symbols to have a wasm import name which differs from the linker symbol name, allowing the linker to link code using the import_module attribute. This is the MC/Object portion of the patch. Differential Revision: https://reviews.llvm.org/D57632 llvm-svn: 353474	2019-02-07 22:03:32 +00:00
Quentin Colombet	96f54de8ff	[InstCombine] Optimize `atomicrmw <op>, 0` into `load atomic` when possible This commit teaches InstCombine how to replace an atomicrmw operation into a simple load atomic. For a given `atomicrmw <op>`, this is possible when: 1. The ordering of that operation is compatible with a load (i.e., anything that doesn't have a release semantic). 2. <op> does not modify the value being stored Differential Revision: https://reviews.llvm.org/D57854 llvm-svn: 353471	2019-02-07 21:27:23 +00:00
Florian Hahn	f557a94aa3	[LV] Remove unnecessary assignment to UserIC. llvm-svn: 353469	2019-02-07 21:23:37 +00:00
Sanjay Patel	781d883862	[InstCombine] Fix crashing from (icmp (bitcast ([su]itofp X)), Y) This fixes a class of bugs introduced by D44367, which transforms various cases of icmp (bitcast ([su]itofp X)), Y to icmp X, Y. If the bitcast is between vector types with a different number of elements, the current code will produce bad IR along the lines of: icmp <N x i32> ..., <M x i32> <...>. This patch suppresses the transform if the bitcast changes the number of vector elements. Patch by: @AndrewScheidecker (Andrew Scheidecker) Differential Revision: https://reviews.llvm.org/D57871 llvm-svn: 353467	2019-02-07 21:12:01 +00:00
Adrian Prantl	e794db8817	Move SMTSolver dump() methods out-of-line. This broke modularized non-local-submodule-visibility builds because the function bodies pulled in extra dependencies. llvm-svn: 353465	2019-02-07 21:03:18 +00:00
Nikita Popov	9d7e86a978	[CodeGen] Handle vector UADDO, SADDO, USUBO, SSUBO This is part of https://bugs.llvm.org/show_bug.cgi?id=40442. Vector legalization is implemented for the add/sub overflow opcodes. UMULO/SMULO are also handled as far as legalization is concerned, but they don't support vector expansion yet (so no tests for them). The vector result widening implementation is suboptimal, because it could result in a legalization loop. Differential Revision: https://reviews.llvm.org/D57639 llvm-svn: 353464	2019-02-07 21:02:22 +00:00
Sanjay Patel	e7f46c3db3	[InstCombine] refactor folds for (icmp (bitcast X), Y); NFCI llvm-svn: 353462	2019-02-07 20:54:09 +00:00
Florian Hahn	ba5acbc4fe	[LV] Prevent interleaving if computeMaxVF returned None. As discussed in D57382, interleaving should be avoided if computeMaxVF returns None, same as we currently do for vectorization. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6477 Reviewers: Ayal, dcaballe, hsaito, mkuper, rengolin Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D57837 llvm-svn: 353461	2019-02-07 20:49:10 +00:00
Matt Arsenault	e98cab11d7	GlobalISel: Try to fix bot failures Don't rely on order of evaluation of function arguments. llvm-svn: 353460	2019-02-07 20:44:08 +00:00
Simon Pilgrim	fe3ac70b18	[DAGCombiner] (add (umax X, C), -C) --> (usubsat X, C) (PR40111) Move the (add (umax X, C), -C) --> (usubsat X, C) X86 combine into generic DAGCombiner First of a number of saturated arithmetic folds that can be moved out of X86-specific code for PR40111. Differential Revision: https://reviews.llvm.org/D57754 llvm-svn: 353457	2019-02-07 20:14:43 +00:00
Matt Arsenault	fbec8fe93b	GlobalISel: Implement narrowScalar for shift main type This is pretty much directly ported from SelectionDAG. Doesn't include the shift by non-constant but known bits version, since there isn't a globalisel version of computeKnownBits yet. This shows a disadvantage of targets not specifically which type should be used for the shift amount. If type 0 is legalized before type 1, the operations on the shift amount type use the wider type (which are also less likely to legalize). This can be avoided by targets specifying legalization actions on type 1 earlier than for type 0. llvm-svn: 353455	2019-02-07 19:37:44 +00:00

1 2 3 4 5 ...

120378 Commits