llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Lively	cc612c2908	[WebAssembly] Fix FastISel address calculation bug Fixes PR47040, in which an assertion was improperly triggered during FastISel's address computation. The issue was that an `Address` set to be relative to the FrameIndex with offset zero was incorrectly considered to have an unset base. When the left hand side of an add set the Address to be 0 off the FrameIndex, the right side would not detect that the Address base had already been set and could try to set the Address to be relative to a register instead, triggering an assertion. This patch fixes the issue by explicitly tracking whether an `Address` has been set rather than interpreting an offset of zero to mean the `Address` has not been set. Differential Revision: https://reviews.llvm.org/D85581	2020-08-08 15:23:11 -07:00
Brad Smith	430db35bf2	fix typo	2020-08-08 17:58:13 -04:00
Brad Smith	4eb4ebf76a	Hook up OpenBSD 64-bit PowerPC support	2020-08-08 17:51:19 -04:00
Vincent Zhao	654e8aadfd	[MLIR] Consider AffineIfOp when getting the index set of an Op wrapped in nested loops This diff attempts to resolve the TODO in `getOpIndexSet` (formerly known as `getInstIndexSet`), which states "Add support to handle IfInsts surronding `op`". Major changes in this diff: 1. Overload `getIndexSet`. The overloaded version considers both `AffineForOp` and `AffineIfOp`. 2. The `getInstIndexSet` is updated accordingly: its name is changed to `getOpIndexSet` and its implementation is based on a new API `getIVs` instead of `getLoopIVs`. 3. Add `addAffineIfOpDomain` to `FlatAffineConstraints`, which extracts new constraints from the integer set of `AffineIfOp` and merges it to the current constraint system. 4. Update how a `Value` is determined as dim or symbol for `ValuePositionMap` in `buildDimAndSymbolPositionMaps`. Differential Revision: https://reviews.llvm.org/D84698	2020-08-09 03:16:03 +05:30
Craig Topper	d3153b5ca2	[X86] Remove a DCI.isBeforeLegalize() call from combineVSelectWithAllOnesOrZeros. This was blocking isTypeLegal call so that we could do a particular transform on illegal types before type legalization. But then we create a target specific node using that type. We shouldn't do that if the type isn't legal. So I think we should just always make sure the type is legal. I suspect that in order to get the condition VT to not be a vector of i1 we already completed type legalization anyway so this probably doesn't matter much in practice.	2020-08-08 14:19:13 -07:00
Roman Lebedev	d4c3f20285	[Reduce] Rewrite function body delta pass again It is not enough to replace all uses of the function's users with undef: we only drop instruction users, so other users may stick around. Let's try a different approach - first drop bodies for all the functions we will drop, which should take care of the blockaddress issue the previous rewrite was dealing with; then, after dropping all such bodies, replace remaining uses with undef (thus all the uses are either outside of functions, or are in kept functions) and then finally drop functions. This seems to work, and passes the existing test coverage, but it is possible that a new issue will be discovered later :) A new (previously crashing) test added.	2020-08-08 23:48:44 +03:00
Dávid Bolvanský	48887c4e81	[libcxx-fuzzing] Fixed bug found by -Wstring-concatenation	2020-08-08 22:44:14 +02:00
Craig Topper	966a58e329	[X86] Support matching VPTERNLOG when the root node is X86ISD::ANDNP.	2020-08-08 13:11:47 -07:00
Craig Topper	a599e1320c	[X86] Add VPTERNLOG test cases where the root node will be X86ISD::ANDNP. NFC We currently fail to match this.	2020-08-08 12:53:28 -07:00
Dávid Bolvanský	c814eca3e4	[AArch64RegisterInfo] Suppress new warning	2020-08-08 21:47:01 +02:00
Craig Topper	815a9b256b	[X86] Remove isSafeToClobberEFLAGS helper and just inline it into the call sites. This is just a thin wrapper around computeRegisterLiveness which we can just call directly. The only real difference is that isSafeToClobberEFLAGS returns a bool and computeRegisterLiveness returns an enum. So we need to check for the specific enum value that isSafeToClobberEFLAGS was hiding. I've also adjusted which sites pass an explicit value for Neighborhood since the default for computeRegisterLiveness is 10.	2020-08-08 12:31:58 -07:00
Muhammad Omair Javaid	c888694a8e	[LLDB] Fix timeout value on expect_gdbremote_sequence D83904 seems to have changed timeout value on expect_gdbremote_sequence which was 120 previously. This seems to be causing intermittent failures on lldb-aarch64-ubuntu buildbot. This patch fixes the timeout value to see the impact on test suite. Example: http://lab.llvm.org:8011/builders/lldb-aarch64-ubuntu/builds/7401/steps/test/logs/stdio Differential Revision: https://reviews.llvm.org/D85514	2020-08-08 23:57:08 +05:00
Craig Topper	8d3ae64b04	Recommit "[X86] Increase the number of instructions searched for isSafeToClobberEFLAGS in a couple places" I messed up the bug numbers in the commit message before. Previously this function searched 4 instructions forwards or backwards to determine if it was ok to clobber eflags. This is called in 3 places: rematerialization, turning 2 operand leas into adds or splitting 3 ops leas into an lea and add on some CPU targets. This patch increases the search limit to 10 instructions for rematerialization and 2 operand lea to add. I've left the old threshold for 3 ops lea splitting as that increases code size. Fixes PR47024 and PR46315.	2020-08-08 11:53:14 -07:00
Craig Topper	761f568420	Revert "[X86] Increase the number of instructions searched for isSafeToClobberEFLAGS in a couple places" This reverts commit `44b260cb0a`. I messed up the bug number in the commit message so I'm reverting to fix it.	2020-08-08 11:53:14 -07:00
Dávid Bolvanský	4cc914280f	[FileCheckTest] Suppress new warning	2020-08-08 20:45:24 +02:00
Simon Pilgrim	cc15380f10	[X86][SSE] combineTargetShuffle - use scaleShuffleMask helper to widen shuffle mask. NFCI. Use scaleShuffleMask helper for the shuffle(hadd,hadd) canonicalization.	2020-08-08 19:36:18 +01:00
Craig Topper	44b260cb0a	[X86] Increase the number of instructions searched for isSafeToClobberEFLAGS in a couple places Previously this function searched 4 instructions forwards or backwards to determine if it was ok to clobber eflags. This is called in 3 places: rematerialization, turning 2 operand leas into adds or splitting 3 ops leas into an lea and add on some CPU targets. This patch increases the search limit to 10 instructions for rematerialization and 2 operand lea to add. I've left the old threshold for 3 ops lea splitting as that increases code size. Fixes PR47024 and PR43014	2020-08-08 11:29:41 -07:00
Simon Pilgrim	f13e92d4b2	[InstCombine] Use CreateVectorSplat(ElementCount) variant directly This was introduced at rGe20223672100, and the CreateVectorSplat(unsigned NumElements) variant calls it internally	2020-08-08 19:26:02 +01:00
Simon Pilgrim	090f9d5a55	Fix MSVC "not all control paths return a value" warning. NFC.	2020-08-08 19:12:11 +01:00
Brad Smith	cd5ab56bc4	Change the default target CPU for OpenBSD/i386 to i586	2020-08-08 13:49:45 -04:00
Dávid Bolvanský	6cd23558d3	[Clang] Fixed buildbot failure; bot defaults to older C++ standard	2020-08-08 19:37:50 +02:00
Dávid Bolvanský	0fef780aa7	[Clang] Avoid whitespace in fixit note	2020-08-08 19:34:07 +02:00
Dávid Bolvanský	dc096a66cb	[Diagnostics] Diagnose missing comma in string array initialization Motivation (from PR37674): const char *ss[] = { "foo", "bar", "baz", "qux" // <-- Missing comma! "abc", "xyz" }; This kind of bug was recently also found in LLVM codebase (see PR47030). Solves PR47038, PR37674 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D85545	2020-08-08 19:24:30 +02:00
Roman Lebedev	e492f0e03b	[SimplifyCFG] Fix invoke->call fold w/ multiple invokes in presence of lifetime intrinsics SimplifyCFG has two main folds for resumes - one when resume is directly using the landingpad, and the other one where resume is using a PHI node. While for the first case, we were already correctly ignoring all the PHI nodes, and both the debug info intrinsics and lifetime intrinsics, in the PHI-based-one, we weren't ignoring PHI's in the resume block, and weren't ignoring lifetime intrinsics. That is clearly a bug. On RawSpeed library, this results in +9.34% (+81) more invoke->call folds, -0.19% (-39) landing pads, -0.24% (-81) invoke instructions but +51 call instructions and -132 basic blocks. Though, the run-time performance impact appears to be within the noise.	2020-08-08 20:00:28 +03:00
Roman Lebedev	1f452ac1d7	[NFC][SimplifyCFG] Rewrite isCleanupBlockEmpty() to be iterator_range-based	2020-08-08 20:00:28 +03:00
Roman Lebedev	c2ebb32465	[NFC][SimplifyCFG] Add a test showing invoke->call simplification failure	2020-08-08 20:00:28 +03:00
Roman Lebedev	a587bf3eb0	[NFC][SimplifyCFG] Count the number of invokes turned into calls due to empty cleanup blocks	2020-08-08 20:00:27 +03:00
Fangrui Song	99cd56906a	[ELF] --wrap: set isUsedInRegularObj of __wrap_ if it is defined or shared Fixes PR47017 (a regression when fixing PR46169): if __wrap_ is shared, it is not exported.	2020-08-08 09:24:31 -07:00
Sanjay Patel	f22ac1d15b	[DAGCombiner] reassociate reciprocal sqrt expression to eliminate FP division, part 2 Follow-up to D82716 / rGea71ba11ab11 We do not have the fabs removal fold in IR yet for the case where the sqrt operand is repeated, so that's another potential improvement.	2020-08-08 10:38:06 -04:00
Sanjay Patel	ba4c214181	[x86] add tests for another reciprocal sqrt pattern; NFC	2020-08-08 10:38:06 -04:00
Benjamin Kramer	38537307e5	lib/CodeGen doesn't depend on lib/Passes.	2020-08-08 13:40:24 +02:00
Rainer Orth	0b90a08f77	[test][DebugInfo] Adapt two tests for Sun assembler syntax on Sparc Two DebugInfo tests currently `FAIL` on Sparc: LLVM :: DebugInfo/Generic/2010-06-29-InlinedFnLocalVar.ll LLVM :: DebugInfo/Generic/array.ll both in a similar way. E.g. : 'RUN: at line 1'; /var/llvm/local-sparcv9-A/bin/llc -O2 /vol/llvm/src/llvm-project/local/llvm/test/DebugInfo/Generic/2010-06-29-InlinedFnLocalVar.ll -o - \| /var/llvm/local-sparcv9-A/bin/FileCheck /vol/llvm/src/llvm-project/local/llvm/test/DebugInfo/Generic/2010-06-29-InlinedFnLocalVar.ll /vol/llvm/src/llvm-project/local/llvm/test/DebugInfo/Generic/2010-06-29-InlinedFnLocalVar.ll:4:10: error: CHECK: expected string not found in input ; CHECK: debug_info, ^ On `amd64-pc-solaris2.11`, the corresponding line is .section .debug_info,"",@progbits while on `sparcv9-sun-solaris2.11` we have only .section .debug_info This happens because Sparc currently emits `.section` directives using the style of the Solaris/SPARC assembler (controlled by `SunStyleELFSectionSwitchSyntax`). This patch takes the easy way out and allows both forms while tightening the check to only match the `.section` directive. Tested on `sparcv9-sun-solaris2.11`, `amd64-pc-solaris2.11`, `x86_64-pc-linux-gnu`, and `x86_64-apple-darwin20.0.0`. Differential Revision: https://reviews.llvm.org/D85414	2020-08-08 09:13:47 +02:00
Siva Chandra Reddy	f6d74b29d6	[libc][NFC] Disable a loader test as ld.gold fails to link. Will be reenabled after investigating and fixing the problem.	2020-08-07 23:45:18 -07:00
Siva Chandra Reddy	db936e0e91	[libc][NFC] Add library of floating point test matchers. This eliminates UnitTest's dependency on FPUtil and hence prevents non-math tests from depending indirectly on FPUtil. The patch essentially moves some of the existing pieces into a library of its own. Along the way, renamed add_math_unittest to add_fp_unittest. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D85486	2020-08-07 23:34:15 -07:00
Feng Liu	5c9c4ade9d	Add the inline interface to the shape dialect This patch also fixes a minor issue that shape.rank should allow returning !shape.size. The dialect doc has such an example for shape.rank. Differential Revision: https://reviews.llvm.org/D85556	2020-08-07 23:29:43 -07:00
Juneyoung Lee	b6d9add71b	[InstCombine] Optimize select(freeze(icmp eq/ne x, y), x, y) This patch adds an optimization that folds select(freeze(icmp eq/ne x, y), x, y) to x or y. This was needed to resolve slowdown after D84940 is applied. I tried to bake this logic into foldSelectInstWithICmp, but it wasn't clear. This patch conservatively writes the pattern in a separate function, foldSelectWithFrozenICmp. The output does not need freeze; https://alive2.llvm.org/ce/z/X49hNE (from @nikic) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D85533	2020-08-08 15:22:29 +09:00
Siva Chandra Reddy	5d59385ba6	[libc] Setup TLS in x86_64 loader. The new code added is still very x86_64 specific. AArch64 support will be added very soon and refactoring of the loader code will be done as part of the patches adding it. Reviewed By: asteinhauser Differential Revision: https://reviews.llvm.org/D82700	2020-08-07 23:19:03 -07:00
Juneyoung Lee	595d3b5ecc	[InstCombine] Add tests for select(freeze(icmp x, y), x, y); NFC	2020-08-08 15:09:08 +09:00
Craig Topper	514b00c439	[X86] Limit the scope of the min/max canonicalization in combineSelect Previously the transform was doing these two canonicalizations (x > y) ? x : y -> (x >= y) ? x : y (x < y) ? x : y -> (x <= y) ? x : y But those don't seem to be useful generally. And they actively pessimize the cases in PR47049. This patch limits it to (x > 0) ? x : 0 -> (x >= 0) ? x : 0 (x < -1) ? x : -1 -> (x <= -1) ? x : -1 These are the cases mentioned in the comments as the motivation for the canonicalization. These allow the CMOV to use the S flag from the compare thus improving opportunities to use a TEST or the flags from an arithmetic instruction.	2020-08-07 22:51:49 -07:00
Mehdi Amini	872bdc0be7	Remove unused static helper getMemRefTypeFromTensorType() (NFC)	2020-08-08 05:37:42 +00:00
Mehdi Amini	eebd0a57fc	Remove unused class member (NFC) Fix include/mlir/Reducer/ReductionNode.h:79:18: warning: private field 'parent' is not used [-Wunused-private-field]	2020-08-08 05:36:41 +00:00
Mehdi Amini	58acda1c16	Revert "[mlir] Add a utility class, ThreadLocalCache, for storing non static thread local objects." This reverts commit `9f24640b7e`. We hit some dead-locks on thread exit in some configurations: TLS exit handler is taking a lock. Temporarily reverting this change as we're debugging what is going on.	2020-08-08 05:31:25 +00:00
Fangrui Song	d30d461938	[ELF] Support .cfi_signal_frame glibc/sysdeps/unix/sysv/linux/x86_64/sigaction.c libc.a(sigaction.o) has a CIE with the augmentation string "zRS". Support 'S' to allow --icf={safe,all}.	2020-08-07 22:08:44 -07:00
Vincent Zhao	754e09f9ce	[MLIR] Add tiling validity check to loop tiling pass This revision aims to provide a new API, `checkTilingLegality`, to verify that the loop tiling result still satisfies the dependence constraints of the original loop nest. Previously, there was no check for the validity of tiling. For instance: ``` func @diagonal_dependence() { %A = alloc() : memref<64x64xf32> affine.for %i = 0 to 64 { affine.for %j = 0 to 64 { %0 = affine.load %A[%j, %i] : memref<64x64xf32> %1 = affine.load %A[%i, %j - 1] : memref<64x64xf32> %2 = addf %0, %1 : f32 affine.store %2, %A[%i, %j] : memref<64x64xf32> } } return } ``` You can find more information about this example in Section 3.11 of [1]. In general, there are three types of dependences here: two flow dependences, one in direction `(i, j) = (0, 1)` (notation that depicts a vector in the 2D iteration space), one in `(i, j) = (1, -1)`; and one anti dependence in the direction `(-1, 1)`. Since two of them are along the diagonal in opposite directions, the default tiling method in `affine`, which tiles the iteration space into rectangles, will violate the legality condition proposed by Irigoin and Triolet [2]. [2] implies two tiles cannot depend on each other, while in the `affine` tiling case, two rectangles along the same diagonal are indeed dependent, which simply violates the rule. This diff attempts to put together a validator that checks whether the rule from [2] is violated or not when applying the default tiling method in `affine`. The canonical way to perform such validation is by examining the effect from adding the constraint from Irigoin and Triolet to the existing dependence constraints. Since we already have the prior knowledge that `affine` tiles in a hyper-rectangular way, and the resulting tiles will be scheduled in the same order as their respective loop indices, we can simplify the solution to just checking whether all dependence components are non-negative along the tiling dimensions.
We put this algorithm into a new API called `checkTilingLegality` under `LoopTiling.cpp`. This function iterates every `load`/`store` pair, and if there is any dependence between them, we get the dependence component and check whether it has any negative component. This function returns `failure` if the legality condition is violated. [1]. Bondhugula, Uday. Effective Automatic parallelization and locality optimization using the Polyhedral model. https://dl.acm.org/doi/book/10.5555/1559029 [2]. Irigoin, F. and Triolet, R. Supernode Partitioning. https://dl.acm.org/doi/10.1145/73560.73588 Differential Revision: https://reviews.llvm.org/D84882	2020-08-08 09:29:47 +05:30
Richard Smith	fb943696cb	PR47025, PR47043: Diagnose unexpanded parameter packs in concept declarations and requires-expressions.	2020-08-07 18:19:39 -07:00
Keno Fischer	c58674df14	[X86] Don't produce bad x86andp nodes for i1 vectors In D85499, I attempted to fix this same issue by canonicalizing andnp for i1 vectors, but since there was some opposition to such a change, this commit just fixes the bug by using two different forms depending on which kind of vector type is in use. We can then always decide to switch the canonical forms later. Description of the original bug: We have a DAG combine that tries to fold (vselect cond, 0000..., X) -> (andnp cond, x). However, it does so by attempting to create an i64 vector with the number of elements obtained by truncating division by 64 from the bitwidth. This is bad for mask vectors like v8i1, since that division is just zero. Besides, we don't want i64 vectors anyway. For i1 vectors, switch the pattern to (andnp (not cond), x), which is the canonical form for `kandn` on mask registers. Fixes https://github.com/JuliaLang/julia/issues/36955. Differential Revision: https://reviews.llvm.org/D85553	2020-08-07 20:05:47 -04:00
LLVM GN Syncbot	ca4bcfbf2c	[gn build] Port `f5b5ccf2a6`	2020-08-07 23:43:14 +00:00
Yuanfang Chen	f5b5ccf2a6	Reland "Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager"" This relands commit `320eab2d55`. The test failed because it was looking for x86-linux target unconditionally. Now it gets the default target.	2020-08-07 16:40:49 -07:00
peter klausler	4ac617f490	[flang] Handle DATA initialization of EQUIVALENCE'd objects Objects that are storage associated by EQUIVALENCE and initialized with DATA are initialized by creating a compiler temporary data object in the same scope, assigning it an offset, type, and size that covers the transitive closure of the associated initialized original symbols, and combining their initializers into one common initializer for the temporary. Some problems with offset assignment of EQUIVALENCE'd objects in COMMON were exposed and corrected, and some more error cases are checked. Remove obsolete function. Small bugfix (nested implied dos). Add a test. Fix struct/class warning. Differential Revision: https://reviews.llvm.org/D85560	2020-08-07 16:39:23 -07:00
Matt Arsenault	3c0597a9e4	AMDGPU: Avoid explicitly listing all the memory nodes	2020-08-07 19:22:46 -04:00
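
The simplified legality rule described in commit 754e09f9ce (tiling is legal when every dependence component along the tiled dimensions is non-negative) can be sketched in a few lines of Python. This is only an illustration of the rule as stated in the commit message, not the MLIR implementation: the function name `tiling_is_legal`, its parameters, and the `forward_deps` example are hypothetical, and the diagonal direction vectors are copied from the commit message.

```python
from typing import Iterable, Sequence


def tiling_is_legal(dependences: Iterable[Sequence[int]],
                    tiled_dims: Sequence[int]) -> bool:
    """Hypothetical helper: True if no dependence direction vector has a
    negative component along any of the tiled loop dimensions."""
    return all(dep[d] >= 0 for dep in dependences for d in tiled_dims)


# Direction vectors in (i, j) order, as listed in the commit message for the
# diagonal_dependence example: two flow dependences and one anti dependence.
diagonal_deps = [(0, 1), (1, -1), (-1, 1)]
legal_diagonal = tiling_is_legal(diagonal_deps, tiled_dims=(0, 1))

# Assumed example: a loop nest whose dependences all point forward.
forward_deps = [(0, 1), (1, 0), (1, 1)]
legal_forward = tiling_is_legal(forward_deps, tiled_dims=(0, 1))

print(legal_diagonal, legal_forward)
```

With these inputs the diagonal example is rejected because `(1, -1)` and `(-1, 1)` each carry a negative component, while the forward-only nest is accepted.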
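
The missing-comma bug targeted by commit dc096a66cb is easy to reproduce outside C and C++, because Python also concatenates adjacent string literals. The sketch below mirrors the array from the commit message; it only shows why the bug is silent and says nothing about how the Clang diagnostic itself is implemented. The variable names are illustrative.

```python
# The list the author presumably meant to write: six separate strings.
intended = ["foo", "bar", "baz", "qux", "abc", "xyz"]

# The same list with the comma after "qux" left out. Adjacent string
# literals are joined, so "qux" and "abc" silently become one element.
buggy = ["foo", "bar", "baz", "qux"  # <-- missing comma
         "abc", "xyz"]

print(len(intended), len(buggy))
print(buggy[3])
```

No error is raised; the only symptom is a list that is one element short and contains a fused string, which is why a compiler warning for this pattern is useful.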

362924 Commits (All Branches)
