llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	3cbfe988bc	[X86] Merge X86TargetInfo::setFeatureEnabled and X86TargetInfo::setFeatureEnabledImpl. NFC setFeatureEnabled is a virtual function. setFeatureEnabledImpl was its implementation. This split was to avoid virtual calls when we need to call setFeatureEnabled in initFeatureMap. With C++11 we can use 'final' on setFeatureEnabled to enable the compiler to perform de-virtualization for the initFeatureMap calls.	2020-07-06 23:54:56 -07:00
Carl Ritson	560292fa99	[AMDGPU] Update isFMAFasterThanFMulAndFAdd assumptions MAD/MAC is no longer always available. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D83207	2020-07-07 15:40:44 +09:00
Saiyedul Islam	38d6640ba5	[libomptarget] Implement atomic inc and fence functions for AMDGCN using clang builtins This function uses __builtin_amdgcn_atomic_inc32(): uint32_t atomicInc(uint32_t *address, uint32_t max); These functions use __builtin_amdgcn_fence(): __kmpc_impl_threadfence() __kmpc_impl_threadfence_block() __kmpc_impl_threadfence_system() They will take place of current mechanism of directly calling IR functions. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D83132	2020-07-07 06:36:25 +00:00
Saiyedul Islam	0882c9d4fc	[AMDGPU] Change Clang AMDGCN atomic inc/dec builtins to take unsigned values __builtin_amdgcn_atomic_inc32(uint *Ptr, uint Val, unsigned MemoryOrdering, const char *SyncScope) __builtin_amdgcn_atomic_inc64(uint64_t *Ptr, uint64_t Val, unsigned MemoryOrdering, const char *SyncScope) __builtin_amdgcn_atomic_dec32(uint *Ptr, uint Val, unsigned MemoryOrdering, const char *SyncScope) __builtin_amdgcn_atomic_dec64(uint64_t *Ptr, uint64_t Val, unsigned MemoryOrdering, const char *SyncScope) As the AMDGCN IR intrinsic for atomic inc/dec does unsigned comparison, these clang builtins should also take unsigned types instead of signed int types. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D83121	2020-07-07 06:36:25 +00:00
Craig Topper	16f3d698f2	[X86] Move the feature dependency handling in X86TargetInfo::setFeatureEnabledImpl to a table based lookup in X86TargetParser.cpp Previously we had to specify the forward and backwards feature dependencies separately which was error prone. And as dependencies have gotten more complex it was hard to be sure the transitive dependencies were handled correctly. The way it was written was also not super readable. This patch replaces everything with a table that lists what features a feature is dependent on directly. Then we can recursively walk through the table to find the transitive dependencies. This is largely based on how we handle subtarget features in the MC layer from the tablegen descriptions. Differential Revision: https://reviews.llvm.org/D83273	2020-07-06 23:14:02 -07:00
Max Kazantsev	094e99d264	[Test] Add one more missing optimization opportunity test	2020-07-07 13:04:15 +07:00
Craig Topper	7fb3a849c1	[X86] Remove duplicate SSE4A feature bit from X86TargetParser.def. NFC We had both SSE4A and SSE4_A. So remove one of them.	2020-07-06 22:11:51 -07:00
Martin Waitz	72df59d590	[mlir] resolve types from attributes in assemblyFormat An operation can specify that an operation or result type matches the type of another operation, result, or attribute via the `AllTypesMatch` or `TypesMatchWith` constraints. Use these constraints to also automatically resolve types in the automatically generated assembly parser. This way, only the attribute needs to be listed in `assemblyFormat`, e.g. for constant operations. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D78434	2020-07-07 04:40:01 +00:00
Sameer Arora	3b5db7fc69	[llvm-install-name-tool] Merge install-name options This diff merges all options for llvm-install-name-tool under a single function processLoadCommands. Also adds another test case for -add_rpath option. Test plan: make check-all Reviewed by: jhenderson, alexshap, smeenai, Ktwu Differential Revision: https://reviews.llvm.org/D82812	2020-07-06 20:32:32 -07:00
Nemanja Ivanovic	1b1539712e	[PowerPC] Do not RAUW combined nodes in VECTOR_SHUFFLE legalization When legalizing shuffles, we make an attempt to combine it into a PPC specific canonical form that avoids a need for a swap. If the combine is successful, we RAUW the node and the custom legalization replaces the now dead node instead of the one it should replace. Remove that erroneous call to RAUW.	2020-07-06 22:09:28 -05:00
LLVM GN Syncbot	fc67b25426	[gn build] Port `939d8309db`	2020-07-07 02:20:39 +00:00
Valentin Clement	65482e8a70	[openmp] Move isAllowedClauseForDirective to tablegen + add clause version to OMP.td Summary: Generate the isAllowedClauseForDirective function from tablegen. This patch introduces the VersionedClause class in the tablegen file so that a clause can be wrapped in it to specify a range of validity on a directive. VersionedClause has default minVersion and maxVersion values, so it can be used without them or with minVersion only. Reviewers: jdoerfert, jdenny Reviewed By: jdenny Subscribers: yaxunl, hiraditya, guansong, jfb, sstefan1, aaron.ballman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82982	2020-07-06 22:20:06 -04:00
Xiang1 Zhang	939d8309db	[X86-64] Support Intel AMX Intrinsic INTEL ADVANCED MATRIX EXTENSIONS (AMX). AMX is a new programming paradigm, it has a set of 2-dimensional registers (TILES) representing sub-arrays from a larger 2-dimensional memory image and operate on TILES. These intrinsics use direct TMM register number as its params. Spec can be found in Chapter 3 here https://software.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D83111	2020-07-07 10:13:40 +08:00
Mauricio Sifontes	28a45d54a7	Create the framework and testing environment for MLIR Reduce - a tool with the objective to reduce large test cases into smaller ones while preserving their interesting behavior. Implement the framework to parse the command line arguments, parse the input MLIR test case into a module and call reduction passes on the MLIR module. Implement the Tester class which allows the different reduction passes to test the interesting behavior of the generated reduced variants of the test case and keep track of the most reduced generated variant.	2020-07-07 01:59:11 +00:00
Biplob Mishra	0c6b6e28e7	[PowerPC] Implement Vector Splat Immediate Builtins in Clang Implements builtins for the following prototypes: vector signed int vec_splati (const signed int); vector float vec_splati (const float); vector double vec_splatid (const float); vector signed int vec_splati_ins (vector signed int, const unsigned int, const signed int); vector unsigned int vec_splati_ins (vector unsigned int, const unsigned int, const unsigned int); vector float vec_splati_ins (vector float, const unsigned int, const float); Differential Revision: https://reviews.llvm.org/D82520	2020-07-06 20:29:33 -05:00
Amy Kwan	c13e3e2c2e	[PowerPC][Power10] Exploit the xxsplti32dx instruction when lowering VECTOR_SHUFFLE. This patch aims to exploit the xxsplti32dx XT, IX, IMM32 instruction when lowering VECTOR_SHUFFLEs. We implement lowerToXXSPLTI32DX when lowering vector shuffles to check if: - Element size is 4 bytes - The RHS is a constant vector (and constant splat of 4-bytes) - The shuffle mask is a suitable mask for the XXSPLTI32DX instruction where it is one of the 32 masks: <0, 4-7, 2, 4-7> <4-7, 1, 4-7, 3> Differential Revision: https://reviews.llvm.org/D83245	2020-07-06 20:28:38 -05:00
Paula Toth	ab25ed26c6	[libc] Add documentation for clang-tidy checks. Reviewers: sivachandra Reviewed By: sivachandra Subscribers: tschuett, ecnelises, libc-commits Tags: #libc-project Differential Revision: https://reviews.llvm.org/D82846	2020-07-06 18:15:35 -07:00
David Blaikie	7a99aab869	[ModuloSchedule] Devirtualize PeelingModuloScheduleExpander::expand as it's not needed The use case is out of tree code deriving from this class - but without a need to use the base class polymorphically, so skip the virtualization and virtual dtor. Post-commit review from `50ac7ce94f`	2020-07-06 18:05:32 -07:00
Jordan Rupprecht	10c82eecbc	Revert "[LV] Enable the LoopVectorizer to create pointer inductions" This reverts commit `a8fe12065e`. It causes a crash when building gzip. Will post the detailed reduced test case to D81267.	2020-07-06 17:50:38 -07:00
LLVM GN Syncbot	7a3258912c	[gn build] Port `05f2b5ccfc`	2020-07-07 00:37:49 +00:00
LLVM GN Syncbot	bfa8bda046	[gn build] Port	2020-07-07 00:37:49 +00:00
Nico Weber	003ea14220	fix typos to cycle bots	2020-07-06 20:37:11 -04:00
Wolfgang Pieb	129387497e	Correct 3 spelling errors in headers and doc strings.	2020-07-06 17:27:51 -07:00
Amara Emerson	3c7e8d6d0e	Fix sdk version test to use 99.99.99 as a max dummy version instead of 10.99.99. Was failing on macOS 11 hosts which is > 10.99.99	2020-07-06 16:53:12 -07:00
Sanjay Patel	ea71ba11ab	[DAGCombiner] reassociate reciprocal sqrt expression to eliminate FP division X / (fabs(A) * sqrt(Z)) --> X / sqrt(A*A*Z) --> X * rsqrt(A*A*Z) In the motivating case from PR46406: https://bugs.llvm.org/show_bug.cgi?id=46406 ...this is restoring the sequence that was originally in the source code. We extracted a term from within the sqrt because we do not know in instcombine whether a target will expand a sqrt call. Note: we could say that the transform in IR should be restricted, but that would not solve the problem if the source was originally in the pattern shown here. This is a gray area for fast-math-flag requirements. I think we should at least check fast-math-flags on the fdiv and fmul because I view this transform as 2 pieces: reassociate the fmul operands and form reciprocal from the fdiv (as with the existing transform). We could argue that the sqrt also needs FMF, but that was not required before, so we should change that in a follow-up patch if that seems better. We don't currently have a way to check that the target will produce a sqrt or recip estimate without actually creating nodes (the APIs are SDValue getSqrtEstimate() and SDValue getRecipEstimate()), so we clean up speculatively created nodes if we are not able to create an estimate. The x86 test with doubles verifies that we are not changing a test with no estimate sequence. Differential Revision: https://reviews.llvm.org/D82716	2020-07-06 19:12:21 -04:00
Eric Christopher	4029f8ede4	Temporarily Revert "[llvm-install-name-tool] Merge install-name options" as it breaks the objcopy build. This reverts commit `c143900a08`.	2020-07-06 15:40:14 -07:00
Yuanfang Chen	1e495e10e6	[NFC] change getLimitedCodeGenPipelineReason to static function	2020-07-06 15:39:27 -07:00
Roman Lebedev	fc4f5d6584	[NFCI][llvm-reduce] ReduceOperandBundles: actually put Module forward-declaration back into llvm namespace	2020-07-07 01:32:26 +03:00
MinJae Hwang	8421364282	Modifications to the algorithm sort benchmark Summary: Modifies the algorithm sort bench: - shows sorting time per element, instead of sorting time per array. This would make comparison between different sizes of arrays easier. - adds std::pair benchmark cases. - uses a large number of arrays to benchmark, instead of repeatedly sorting the same array. * sorting the same array again and again would not show actual sorting performance over randomized data sets. Reviewers: EricWF, #libc, mvels Reviewed By: EricWF, #libc, mvels Subscribers: mgrang, libcxx-commits Tags: #libc Differential Revision: https://reviews.llvm.org/D81770	2020-07-06 18:30:02 -04:00
Peyton, Jonathan L	95a28df5c4	[OpenMP] Add GOMP 5.0 loop entry points This patch adds missing GOMP_5.0 loop entry points which incorporate the new non-monotonic default into the entry point name. Since monotonic schedules are a subset of nonmonotonic, it is acceptable to use monotonic as the implementation. This patch simply has the nonmonotonic (and possibly non-monotonic) versions of the loop entry points as wrappers around the monotonic ones. Differential Revision: https://reviews.llvm.org/D73922	2020-07-06 17:22:26 -05:00
Roman Lebedev	05f2b5ccfc	[llvm-reduce] Reducing call operand bundles Summary: This would have been marginally useful to me during/for rG7ea46aee3670981827c04df89b2c3a1cbdc7561b. With ongoing migration to representing assumes via operand bundles on the assume, this will be gradually more useful. Reviewers: nickdesaulniers, diegotf, dblaikie, george.burgess.iv, jdoerfert, Tyker Reviewed By: nickdesaulniers Subscribers: hiraditya, mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83177	2020-07-07 01:16:37 +03:00
Roman Lebedev	69dca6efc6	[NFCI][IR] Introduce CallBase::Create() wrapper Summary: It is reasonably common to want to clone some call with different bundles. Let's actually provide an interface to do that. Reviewers: chandlerc, jdoerfert, dblaikie, nickdesaulniers Reviewed By: nickdesaulniers Subscribers: llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D83248	2020-07-07 01:16:36 +03:00
Sameer Arora	c143900a08	[llvm-install-name-tool] Merge install-name options This diff merges all options for llvm-install-name-tool under a single function processLoadCommands. Also adds another test case for -add_rpath option. Test plan: make check-all Reviewed by: jhenderson, alexshap, smeenai, Ktwu Differential Revision: https://reviews.llvm.org/D82812	2020-07-06 15:15:20 -07:00
Roman Lebedev	db05f2e34a	[Scalarizer] Centralize instruction DCE As reported in https://reviews.llvm.org/D83101#2133062 the new visitInsertElementInst()/visitExtractElementInst() functionality is causing miscompiles (previously-crashing test added). This is due to how the Scalarizer infrastructure deals with DCE: it was not updated for, nor was it ready for, such scalar value forwarding. It always assumed that the moment we "scalarized" something, it can go away, and did so with prejudice. But that is no longer safe/okay to do. Instead, let's prevent it from ever shooting itself in the foot, and let's just accumulate the instructions-to-be-deleted in a vector, and collectively clean them up (those that are actually dead) at the end. No existing tests report any new garbage leftovers, but maybe that is a test coverage issue.	2020-07-07 01:12:51 +03:00
Craig Topper	c359c5d534	[X86] Centralize the 'sse4' hack to a single place in X86TargetInfo::setFeatureEnabledImpl. NFCI Instead of detecting the string in 2 places, just swap the string to 'sse4.1' or 'sse4.2' at the top of the function. Prep work for a patch to switch the rest of this function to a table based system. And I don't want to include 'sse4a' in the table.	2020-07-06 15:00:32 -07:00
Joachim Protze	6d9626d2da	[OpenMP][Tests] Fix/Mark compatibility for GCC Reviewed by: Hahnfeld, saiislam Differential Revision: https://reviews.llvm.org/D82267	2020-07-06 23:56:09 +02:00
Zixu Wang	f47b885131	[clang] Enable errors for undefined TARGET_OS_ macros in Darwin driver Add clang option `-Wundef-prefix=TARGET_OS_` and `-Werror=undef-prefix` to Darwin driver. Differential Revision: https://reviews.llvm.org/D83250	2020-07-06 14:52:12 -07:00
Eric Christopher	7c63804383	Fix [-Werror,-Wsign-compare] in dominator unit test.	2020-07-06 14:50:13 -07:00
Bruno Ricci	02946de380	[Support][NFC] Fix Wdocumentation warning in ADT/Bitfields.h \tparam is used for template parameters instead of \param.	2020-07-06 22:41:40 +01:00
Stanislav Mekhanoshin	f7a7efbf88	[AMDGPU] Tweak getTypeLegalizationCost() Even though wide vectors are legal they still cost more as we will have to eventually split them. Not all operations can be uniformly done on vector types. Conservatively add the cost of splitting at least to 8 dwords, which is our widest possible load. We are more or less lying to cost mode with this change but this can prevent vectorizer from creation of wide vectors which results in RA problems for us. Differential Revision: https://reviews.llvm.org/D83078	2020-07-06 14:07:48 -07:00
Bruno Ricci	f63e3ea558	[clang] Rework how and when APValues are dumped Currently APValues are dumped as a single string. This becomes quickly completely unreadable since APValue is a tree-like structure. Even a simple example is not pretty: struct S { int arr[4]; float f; }; constexpr S s = { .arr = {1,2}, .f = 3.1415f }; // Struct fields: Array: Int: 1, Int: 2, 2 x Int: 0, Float: 3.141500e+00 With this patch this becomes: -Struct \|-field: Array size=4 \| \|-elements: Int 1, Int 2 \| `-filler: 2 x Int 0 `-field: Float 3.141500e+00 Additionally APValues are currently only dumped as part of visiting a ConstantExpr. This patch also dump the value of the initializer of constexpr variable declarations: constexpr int foo(int a, int b) { return a + b - 42; } constexpr int a = 1, b = 2; constexpr int c = foo(a, b) > 0 ? foo(a, b) : foo(b, a); // VarDecl 0x62100008aec8 <col:3, col:57> col:17 c 'const int' constexpr cinit // \|-value: Int -39 // `-ConditionalOperator 0x62100008b4d0 <col:21, col:57> 'int' // <snip> Do the above by moving the dump functions to TextNodeDumper which already has the machinery to display trees. The cases APValue::LValue, APValue::MemberPointer and APValue::AddrLabelDiff are left as they were before (unimplemented). We try to display multiple elements on the same line if they are considered to be "simple". This is to avoid wasting large amounts of vertical space in an example like: constexpr int arr[8] = {0,1,2,3,4,5,6,7}; // VarDecl 0x62100008bb78 <col:3, col:42> col:17 arr 'int const[8]' constexpr cinit // \|-value: Array size=8 // \| \|-elements: Int 0, Int 1, Int 2, Int 3 // \| `-elements: Int 4, Int 5, Int 6, Int 7 Differential Revision: https://reviews.llvm.org/D83183 Reviewed By: aaron.ballman	2020-07-06 22:03:08 +01:00
Matt Arsenault	f25d020c2e	AMDGPU/GlobalISel: Add types to special inputs When passing special ABI inputs, we have no existing context for the type to use.	2020-07-06 17:00:55 -04:00
Arlo Siemsen	1d8cb09923	Add option LLVM_NM to allow specifying the location of the llvm-nm tool The new option works like the existing LLVM_TABLEGEN, and LLVM_CONFIG_PATH options. Instead of building llvm-nm, the build uses the executable defined by LLVM_NM. This is useful for cross-compilation scenarios where the host cannot run the cross-compiled tool, and recursing into another cmake build is not an option (due to required DEFINE's, for example). Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D83022	2020-07-06 13:27:56 -07:00
Adrian Prantl	60c07fd016	Use CMAKE_OSX_SYSROOT instead of the environment variable SYSROOT to detect energy support in debugserver. The way that Swift build-script is invoked the former may be overridden manually. <rdar://problem/63840635>	2020-07-06 13:17:31 -07:00
Tim Keith	1b18391818	[flang] Add missing include for std::min This was causing the build to fail on macos. Differential Revision: https://reviews.llvm.org/D83237	2020-07-06 13:03:02 -07:00
Nicolai Hähnle	f987ba3cf9	DomTree: add private create{Child,Node} helpers Summary: Aside from unifying the code a bit, this change smooths the transition to use of future "opaque generic block references" in the type-erased dominator tree base class. Change-Id: If924b092cc8561c4b6a7450fe79bc96df0e12472 Reviewers: arsenm, RKSimon, mehdi_amini, courbet Subscribers: wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83086	2020-07-06 21:58:11 +02:00
Nicolai Hähnle	dfcc68c528	DomTree: Remove getRoots() accessor Summary: Avoid exposing details about how roots are stored. This enables subsequent type-erasure changes. v5: - cleanup a unit test by using EXPECT_EQ instead of EXPECT_TRUE Change-Id: I532b774cc71f2224e543bc7d79131d97f63f093d Reviewers: arsenm, RKSimon, mehdi_amini, courbet Subscribers: jvesely, wdng, hiraditya, kuhar, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83085	2020-07-06 21:58:11 +02:00
Nicolai Hähnle	723a44c9b5	DomTree: Remove the releaseMemory() method Summary: It is fully redundant with reset(). Change-Id: I25850b9f08eace757cf03cbb8780e970aca7f51a Reviewers: arsenm, RKSimon, mehdi_amini, courbet Subscribers: wdng, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D83084	2020-07-06 21:58:11 +02:00
Nicolai Hähnle	76c5cb05a3	DomTree: Remove getChildren() accessor Summary: Avoid exposing details about how children are stored. This will enable subsequent type-erasure changes. New methods are introduced to cover common access patterns. Change-Id: Idb5f4b1b9c84e4cc71ddb39bb52a388682f5674f Reviewers: arsenm, RKSimon, mehdi_amini, courbet Subscribers: qcolombet, sdardis, wdng, hiraditya, jrtc27, zzheng, atanasyan, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83083	2020-07-06 21:58:11 +02:00
Wouter van Oortmerssen	16d83c395a	[WebAssembly] Added 64-bit memory.grow/size/copy/fill This covers both the existing memory functions as well as the new bulk memory proposal. Added new test files since changes were also required in the inputs. Also removes unused init/drop intrinsics rather than trying to make them work for 64-bit. Differential Revision: https://reviews.llvm.org/D82821	2020-07-06 12:49:50 -07:00
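
Commit 16f3d698f2 above describes replacing hand-written forward and backward feature dependencies with a single table of direct dependencies that is walked recursively. The following is a minimal sketch of that idea, not the code from X86TargetParser.cpp: the `FeatureInfo` struct, the `FeatureTable` contents, and the two helper functions are hypothetical names chosen for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// Hypothetical table entry: a feature name plus the indices of the
// features it depends on directly. Not the real X86TargetParser.cpp layout.
struct FeatureInfo {
  std::string Name;
  std::vector<std::size_t> DirectDeps;
};

// Illustrative data only; each row lists direct dependencies, nothing more.
static const std::vector<FeatureInfo> FeatureTable = {
    {"sse", {}},     // 0
    {"sse2", {0}},   // 1
    {"sse3", {1}},   // 2
    {"ssse3", {2}},  // 3
    {"sse4.1", {3}}, // 4
    {"sse4.2", {4}}, // 5
    {"avx", {5}},    // 6
    {"avx2", {6}},   // 7
};

// Enabling a feature: gather everything it needs, directly or transitively.
void collectImplied(std::size_t Feature, std::set<std::size_t> &Out) {
  assert(Feature < FeatureTable.size() && "unknown feature index");
  for (std::size_t Dep : FeatureTable[Feature].DirectDeps)
    if (Out.insert(Dep).second) // recurse only on first visit
      collectImplied(Dep, Out);
}

// Disabling a feature: gather everything that needs it, by scanning the
// same table in the opposite direction.
void collectDependents(std::size_t Feature, std::set<std::size_t> &Out) {
  assert(Feature < FeatureTable.size() && "unknown feature index");
  for (std::size_t I = 0; I < FeatureTable.size(); ++I)
    for (std::size_t Dep : FeatureTable[I].DirectDeps)
      if (Dep == Feature && Out.insert(I).second)
        collectDependents(I, Out);
}
```

Because both walks read the same table, a dependency only has to be written once, which is the error-proneness the commit message says it wants to remove.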

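The reassociation in commit ea71ba11ab above, X / (fabs(A) * sqrt(Z)) --> X * rsqrt(A*A*Z), can be checked numerically with a small sketch. The function names are hypothetical, and `1.0 / std::sqrt(...)` is only a stand-in for a target's reciprocal square root estimate. The two forms are equal in real arithmetic for Z >= 0 and A != 0, but they may round differently in floating point, which is consistent with the commit requiring fast-math flags on the fdiv and fmul.

```cpp
#include <cmath>

// The form with an explicit FP division, as it reaches the DAG combiner.
double dividedForm(double X, double A, double Z) {
  return X / (std::fabs(A) * std::sqrt(Z));
}

// The reassociated form: fabs(A) is folded back under the square root as
// A*A, and the division becomes a multiply by a reciprocal square root.
// 1.0 / std::sqrt is an assumption standing in for a hardware estimate.
double rsqrtForm(double X, double A, double Z) {
  return X * (1.0 / std::sqrt(A * A * Z));
}
```

For inputs such as X = 3, A = -2, Z = 4 the intermediate values are exactly representable, so both functions return the same result; for general inputs only approximate agreement should be expected.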
359675 Commits, All Branches