llvm-project

Commit Graph

Author	SHA1	Message	Date
Jon Chesterfield	d71062fbda	Revert "[OpenMP][AMDGCN] Initial math headers support" This reverts commit `968899ad9c`.	2021-07-21 17:35:40 +01:00
Jon Chesterfield	a733bbbd17	[libomptarget][amdgpu][nfc] Refactor #includes Create a hsa_api.h header that includes the ROCr headers in use Drop some unused headers and _cplusplus macros Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106455	2021-07-21 17:28:07 +01:00
Thomas Lively	1a57ee1276	[WebAssembly] Codegen for v128.load{32,64}_zero Replace the experimental clang builtins and LLVM intrinsics for these instructions with normal instruction selection patterns. The wasm_simd128.h intrinsics header was already using portable code for the corresponding intrinsics, so now it produces the correct instructions. Differential Revision: https://reviews.llvm.org/D106400	2021-07-21 09:02:12 -07:00
Quinn Pham	e23ff55931	[PowerPC] Removing a REQUIRES line from llvm test The test has been moved to the correct directory so this `REQUIRES` line is not needed.	2021-07-21 10:52:23 -05:00
Eric Astor	69551486fd	[ms] [llvm-ml] Restrict implicit RIP-relative addressing to named-variable references ML64.EXE applies implicit RIP-relative addressing only to memory references that include a named-variable reference. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D105372	2021-07-21 11:49:58 -04:00
Arthur Eubanks	8bc298d041	[NewPM][Inliner] Check if deleted function is in current SCC In weird cases, the inliner will inline internal recursive functions, sometimes causing them to have no more uses, in which case the inliner will mark the function to be deleted. The function is actually deleted after the call to updateCGAndAnalysisManagerForCGSCCPass(). In updateCGAndAnalysisManagerForCGSCCPass(), UR.UpdatedC may be set to the SCC containing the function to be deleted. Then the inliner calls CG.removeDeadFunction() which can cause that SCC to be deleted, even though it's still stored in UR.UpdatedC. We could potentially check in the wrappers/pass managers if UR.UpdatedC is in UR.InvalidatedSCCs before doing anything with it, but it's safer to do this as close to possible to the call to CG.removeDeadFunction() to avoid issues with allocating a new SCC in the same address as the deleted one. It's hard to find a small test case since we need to have recursive internal functions be reachable from non-internal functions, yet they need to become non-recursive and not referenced by other functions when inlined. Similar to https://reviews.llvm.org/D106306. Fixes PR50788. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D106405	2021-07-21 08:47:45 -07:00
Jon Roelofs	4de74a7c4d	[MachineVerifier] Make INSERT_SUBREG diagnostic respect operand 2 subregs This came out of post-commit review: https://reviews.llvm.org/D105953#inline-1012919 Thanks uabelho!	2021-07-21 08:47:17 -07:00
Eric Astor	5fba605896	[ms] [llvm-ml] Support built-in text macros Add support for all built-in text macros supported by ML64: @Date, @Time, @FileName, @FileCur, and @CurSeg. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D104965	2021-07-21 11:44:09 -04:00
Eric Astor	4cbb912d75	[ms] [llvm-ml] Add support for numeric built-in symbols Support @Version and @Line as built-in symbols. For now, resolves @Version to 1427 (the same as for the VS 2019 release of ML.EXE). Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D104964	2021-07-21 11:43:07 -04:00
Geoffrey Martin-Noble	13e5aa8973	[Bazel] Remove deprecated td_relative_includes This has been deprecated for a while and there are no in-tree usages. I'm not aware of any out-of-tree usages either.	2021-07-21 08:38:15 -07:00
Pushpinder Singh	968899ad9c	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-07-21 16:15:39 +01:00
Peter Steinfeld	ece9aa29ff	[flang] Implement the runtime portion of the UNPACK intrinsic I'd previously merged this into the fir-dev branch. This change is to do the same thing to the main branch of llvm-project. Differential Revision: https://reviews.llvm.org/D106294	2021-07-21 08:03:49 -07:00
Uday Bondhugula	104fad99c9	[MLIR] Add folder for zero trip count affine.for AffineForOp's folding hook is expected to fold away trivially empty affine.for. This allows simplification to happen as part of the canonicalizer and from wherever the folding hook is used. While more complex analysis based zero trip count detection is available from other passes in analysis and transforms, simple and inexpensive folding had been missing. Also, update/improve affine.for op documentation clarifying semantics of the result values for zero trip count loops. Differential Revision: https://reviews.llvm.org/D106123	2021-07-21 20:28:35 +05:30
Marek Kurdej	1daf0e2256	[libc++] Add `__libcpp_copysign` conditionally constexpr overloads. This is a spin-off from D79555 review, that with this patch will be able to use `__libcpp_copysign` instead of adhoc `__copysign_constexpr` helper. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D106364	2021-07-21 16:57:43 +02:00
Uday Bondhugula	7932d21f5d	[MLIR] Introduce a new rewrite driver to simplify supplied list of ops Introduce a new rewrite driver (MultiOpPatternRewriteDriver) to rewrite a supplied list of ops and other ops. Provide a knob to restrict rewrites strictly to those ops or also to affected ops (but still not to completely related ops). This rewrite driver is commonly needed to run any simplification and cleanup at the end of a transforms pass or transforms utility in a way that only simplifies relevant IR. This makes it easy to write test cases while not performing unrelated whole IR simplification that may invalidate other state at the caller. The introduced utility provides more freedom to developers of transforms and transform utilities to perform focussed and local simplification. In several cases, it provides greater efficiency as well as more simplification when compared to repeatedly calling `applyOpPatternsAndFold`; in other cases, it avoids the need to undesirably call `applyPatternsAndFoldGreedily` to do unrelated simplification in a FuncOp. Update a few transformations that were earlier using applyOpPatternsAndFold (SimplifyAffineStructures, affineDataCopyGenerate, a linalg transform). TODO: - OpPatternRewriteDriver can be removed as it's a special case of MultiOpPatternRewriteDriver, i.e., both can be merged. Differential Revision: https://reviews.llvm.org/D106232	2021-07-21 20:25:16 +05:30
Quinn Pham	c3e17ceaaa	[PowerPC] Move backend test to fix non PPC bots Moving `llvm/test/CodeGen/builtins-ppc-xlcompat-fp.ll` to `llvm/test/CodeGen/PowerPC/builtins-ppc-xlcompat-fp.ll`	2021-07-21 09:36:29 -05:00
David Spickett	2404834c20	[PowerPC] Require power-pc target for new builtin test The llvm test added in `e002d251dd` was missing a REQUIRES. Failed to run on our AArch64 only bot: https://lab.llvm.org/buildbot/#/builders/171/builds/1262	2021-07-21 14:19:26 +00:00
Kerry McLaughlin	be753b207f	Revert "[LV] Use lookThroughAnd with logical reductions" Reverting patch due to buildbot failures. This reverts commit `e22a599672`.	2021-07-21 15:16:00 +01:00
Simon Pilgrim	ca9b60f9de	[LoopVectorize] Regenerate sve-vector-reverse.ll test checks	2021-07-21 15:14:04 +01:00
Kazu Hirata	ba2dd12d4f	[InstCombine] Remove CreateOverflowTuple (NFC) The last use was removed on Jun 3, 2020 in commit `2a6c871596`.	2021-07-21 07:07:53 -07:00
Quinn Pham	e002d251dd	[PowerPC] Floating Point Builtins for XL Compat. This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds builtins related to floating point operations Reviewed By: #powerpc, nemanjai, amyk, NeHuang Differential Revision: https://reviews.llvm.org/D103986	2021-07-21 08:33:39 -05:00
Jakub Kuderski	3c3165cfa0	[ADT] Add initializer_list constructor to SmallDenseMap Make it easier to initialize small maps inline. Note that DenseMap already has an initializer_list constructor. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106363	2021-07-21 09:32:16 -04:00
Simon Pilgrim	f55de3576d	[InstCombine] Regenerate gep-custom-dl.ll test checks	2021-07-21 14:29:34 +01:00
Hedin Garca	efa2115266	[libc] Include nextafter's functions to Windows's entrypoints Incorporated the varied functions for nextafter and refactored NextAfterTest.h to correctly define bitWidthOfType for both Linux and Windows; by letting FloatProperties take care of the directives' logic based on the platform being used. This allows to successfully run nextafter's tests. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D106395	2021-07-21 13:28:01 +00:00
Sebastian Neubauer	b642d01fa8	[AMDGPU] Improve killed check for vgpr optimization The killed flag is not always set. E.g. when a variable is used in a loop, it is never marked as killed, although it is unused in following basic blocks. Also, we try to deprecate kill flags and not use them. Check if the register is live in the endif block. If not, consider it killed in the then and else blocks. The vgpr-liverange tests have two new tests with loops (pre-committed, so the diff is visible). I also needed to change the subtarget to gfx10.1, otherwise calls are not working. Differential Revision: https://reviews.llvm.org/D106291	2021-07-21 15:24:59 +02:00
Sebastian Neubauer	aba1f157ca	[AMDGPU] Precommit vgpr-liverange tests	2021-07-21 15:24:59 +02:00
Hedin Garca	f49f2e2d1f	[libc] Append math functions to Windows's entrypoints Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D106391	2021-07-21 13:21:55 +00:00
Hedin Garca	137740eced	[libc] Exclude few unused bits from x86 state for Windows Windows fenv_t does not include the MXCSR register and the unused bits at the end of the x87 status. So we exclude them in our struct definitions to make it easy to read/write the state. getEnv and setEnv were also excluded to avoid using MXCSR, but a forthcoming patch will handle these functions. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D106386	2021-07-21 13:11:10 +00:00
Deep Majumder	80068ca623	[analyzer] Fix for faulty namespace test in SmartPtrModelling This patch: - Fixes how the std-namespace test is written in SmartPtrModelling (now accounts for functions with no Decl available) - Adds the smart pointer checker flag check where it was missing Differential Revision: https://reviews.llvm.org/D106296	2021-07-21 18:23:35 +05:30
Kirill Bobyrev	907efdf95d	[clangd] Cleanup FuzzyFindRequest serialization and dex benchmark * Due to the LLVM's JSON library changes (?), FuzzyFindRequest serialization is no longer valid since arrays are serialized as llvm::json::Array already. Hence, current implementation creates a nested array. * YAML format is no longer the default, mention this for the benchmark. * FIXME is no longer relevant. I ran benchmarks that showed no improvement with priority_queue years ago. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D106432	2021-07-21 14:51:16 +02:00
Guillaume Chatelet	d6da02d952	[llvm] Add enum iteration to Sequence This patch allows iterating typed enum via the ADT/Sequence utility. It also changes the original design to better separate concerns: - `StrongInt` only deals with safe `intmax_t` operations, - `SafeIntIterator` presents the iterator and reverse iterator interface but only deals with safe `StrongInt` internally. - `iota_range` only deals with `SafeIntIterator` internally. This design ensures that operations are always valid. In particular, "Out of bounds" assertions fire when: - the `value_type` is not representable as an `intmax_t` - iterator operations make internal computation underflow/overflow - the internal representation cannot be converted back to `value_type` Differential Revision: https://reviews.llvm.org/D106279	2021-07-21 12:48:53 +00:00
Simon Pilgrim	59db3a5df9	[InstCombine] Add multiuse test for D106352	2021-07-21 13:48:15 +01:00
David Spickett	bb4f7b9166	[compiler-rt][hwasan] Update register-dump-read.c test Since `d564cfb53c` moved __hwasan_tag_mismatch4 this test has been reporting a frame 0 of __hwasan_tag_mismatch_v2. This failure can be seen on our bots: https://lab.llvm.org/buildbot/#/builders/185/builds/170 Before the change: #0 0xaaaaba100e40 in main <...>/register-dump-read.c:21:10 After the change: #0 0xaaaab8494bec in __hwasan_tag_mismatch_v2 <...>/hwasan/hwasan_tag_mismatch_aarch64.S:147 #1 0xaaaab84b4df8 in main <..>/register-dump-read.c:14:10 Update the test to check for a main frame as either frame 0 or frame 1.	2021-07-21 12:43:07 +00:00
Roman Lebedev	48e9602c40	[NFC][VectorCombine] Load widening: add a few more negative tests	2021-07-21 15:21:37 +03:00
Simon Pilgrim	7c53a7d390	IFSStub.cpp - consistently use default case to silence 'not all control paths return' MSVC warnings. NFCI.	2021-07-21 11:59:34 +01:00
David Green	72dc5cab4f	[LV] Make use of PatternMatchers in getReductionPatternCost. NFC Pulled out of D106166, this modifies getReductionPatternCost to use PatternMatchers, hopefully simplifying the code a little.	2021-07-21 11:34:30 +01:00
Jay Foad	3ed29f960c	[AMDGPU] NFC refactoring in isel for buffer access intrinsics Rename getBufferOffsetForMMO to updateBufferMMO and pass in the MMO to be updated, in preparation for the bug fix in D106284. Call updateBufferMMO consistently for all buffer intrinsics, even the ones that use setBufferOffsets to decompose a combined offset expression. Add a getIdxEn helper function. Differential Revision: https://reviews.llvm.org/D106354	2021-07-21 11:12:49 +01:00
Gabor Marton	732a8a9dfb	[Analyzer][solver][NFC] Add explanatory comments to trivial eq classes Differential Revision: https://reviews.llvm.org/D106370	2021-07-21 11:59:56 +02:00
Simon Tatham	21401a7262	[clang] Introduce SourceLocation::[U]IntTy typedefs. This is part of a patch series working towards the ability to make SourceLocation into a 64-bit type to handle larger translation units. NFC: this patch introduces typedefs for the integer type used by SourceLocation and makes all the boring changes to use the typedefs everywhere, but for the moment, they are unconditionally defined to uint32_t. Patch originally by Mikhail Maltsev. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D105492	2021-07-21 10:45:46 +01:00
Sam McCall	91670f5f20	[clangd] Remove big PreambleData constructor. NFC	2021-07-21 11:31:52 +02:00
Rosie Sumpter	44c9adb414	[LoopFlatten][LoopInfo] Use Loop to identify latch compare instruction Make getLatchCmpInst non-static and use it in LoopFlatten as a more robust way of identifying the compare. Differential Revision: https://reviews.llvm.org/D106256	2021-07-21 10:14:18 +01:00
Sven van Haastregt	724f0e2abb	[OpenCL] Add cl_khr_extended_bit_ops Add the builtins defined by Section 40 "Extended Bit Operations" in the OpenCL Extension Specification. Differential Revision: https://reviews.llvm.org/D106267	2021-07-21 10:01:19 +01:00
Kerry McLaughlin	e22a599672	[LV] Use lookThroughAnd with logical reductions If a reduction Phi has a single user which `AND`s the Phi with a type mask, `lookThroughAnd` will return the user of the Phi and the narrower type represented by the mask. Currently this is only used for arithmetic reductions, whereas loops containing logical reductions will create a reduction intrinsic using the widened type, for example: for.body: %phi = phi i32 [ %and, %for.body ], [ 255, %entry ] %mask = and i32 %phi, 255 %gep = getelementptr inbounds i8, i8* %ptr, i32 %iv %load = load i8, i8* %gep %ext = zext i8 %load to i32 %and = and i32 %mask, %ext ... ^ this will generate an and reduction intrinsic such as the following: call i32 @llvm.vector.reduce.and.v8i32(<8 x i32>...) The same example for an add instruction would create an intrinsic of type i8: call i8 @llvm.vector.reduce.add.v8i8(<8 x i8>...) This patch changes AddReductionVar to call lookThroughAnd for other integer reductions, allowing loops similar to the example above with reductions such as and, or & xor to vectorize. Reviewed By: david-arm, dmgreen Differential Revision: https://reviews.llvm.org/D105632	2021-07-21 09:56:00 +01:00
Jan Kratochvil	278df28557	[nfc] [lldb] Rename GetRnglist() to GetRnglistTable() My D99653 implemented a getter GetRnglist() for m_rnglist_table. That was confusing as the getter returns DWARFDebugRnglistTable which contains DWARFDebugRnglist as its elements.	2021-07-21 10:45:37 +02:00
Cullen Rhodes	008c755d76	[AArch64][SME] Support .arch and .arch_extension assembler directives Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D105566	2021-07-21 08:40:27 +00:00
Tim Northover	19d2e42be2	ARM: don't return by popping PC if we have to adjust the stack afterwards. In mandatory tail calling conventions we might have to deallocate stack space used by our arguments before return. This happens after popping CSRs, so the pop cannot be turned into the return itself in this case. The else branch here was already a nop, so removing it as a tidy-up.	2021-07-21 09:35:14 +01:00
Tim Northover	291e0daa6e	AArch64: support 8 & 16-bit atomic operations in GlobalISel We have SelectionDAG patterns for 8 & 16-bit atomic operations, but they assume the value types will have been legalized to 32-bits. So this adds the ability to widen them to both AArch64 & generic GISel infrastructure.	2021-07-21 09:35:14 +01:00
Cullen Rhodes	2d80bbd939	[AArch64][SME] Add mova instructions This patch adds the mova instruction to insert/extract an SVE vector register to/from a ZA tile vector. The preferred MOV aliases are also implemented. Depends on D105572. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-06 Reviewed By: david-arm, CarolineConcatto Differential Revision: https://reviews.llvm.org/D105574	2021-07-21 08:20:01 +00:00
Cullen Rhodes	6c32cfe85c	[AArch64][SME] Add ldr and str instructions The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-06 Reviewed By: kmclaughlin Differential Revision: https://reviews.llvm.org/D105573	2021-07-21 08:17:13 +00:00
Siva Chandra Reddy	a31f6d2ccf	[libc][Obvious] Fix few typos in FPUtil/TestHelpers.cpp	2021-07-21 08:07:35 +00:00
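
Note on `e22a599672` (later reverted by `be753b207f`): the message argues that a logical reduction whose phi is masked with 255 on every iteration can be carried out in the narrower type. The following is a minimal Python sketch of that equivalence only, not LLVM code. The function names `and_reduce_wide` and `and_reduce_narrow` are hypothetical, and Python integers stand in for the `i32` and `i8` values of the IR example.

```python
from functools import reduce

MASK_I8 = 0xFF  # the "type mask" (255) from the commit message's IR example


def and_reduce_wide(values, init=0xFF):
    """Mirror the scalar loop: the phi is wide and is masked each iteration."""
    phi = init
    for v in values:
        masked = phi & MASK_I8  # %mask = and i32 %phi, 255
        ext = v & MASK_I8       # %ext  = zext i8 %load to i32
        phi = masked & ext      # %and  = and i32 %mask, %ext
    return phi


def and_reduce_narrow(values, init=0xFF):
    """The same reduction performed entirely on 8-bit values."""
    return reduce(lambda acc, v: (acc & v) & MASK_I8, values, init & MASK_I8)


if __name__ == "__main__":
    data = [0xF0, 0x3C, 0xFC]
    print(hex(and_reduce_wide(data)), hex(and_reduce_narrow(data)))
```

Both functions agree for any sequence of byte values, which is the property the commit message relies on when it proposes an `i8` reduction intrinsic in place of the widened `i32` one.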

394331 Commits