llvm-project

Commit Graph

Author	SHA1	Message	Date
Pushpinder Singh	12dcbf913c	[AMDGPU][OpenMP] Use complex definitions from complex_cmath.h Following nvptx approach, this patch uses complex function definitions from complex_cmath.h. With this patch, ovo passes 23/34 complex mathematical test cases. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D109344	2021-09-09 10:55:17 +05:30
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Tom Stellard	9ee64c3746	scudo: Only add no-omit-frame-pointer flags when the compiler supports them Reviewed By: cryptoad Differential Revision: https://reviews.llvm.org/D109196	2021-09-08 21:10:40 -07:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Leonard Chan	9da62d3ed9	[polly] Fix "no member named 'getIndexExpressionsFromGEP'" As of 741fabc222f226d34d806056b804244b012853b, polly builders are failing from this error. The signiature is slightly different and accepts a ScalarEvolution reference instead. This should fix the polly builders.	2021-09-08 20:04:56 -07:00
Peter Collingbourne	883e93cb28	gn build: Add support for building lldb-server on Android. The cross-compiled lldb-server targets are added to the lldb deps if Android cross compilation is enabled. Differential Revision: https://reviews.llvm.org/D109464	2021-09-08 19:33:51 -07:00
Peter Collingbourne	9449f441fc	gn build: Add support for building LLDB on Linux. On Linux, LLDB depends on lldb-server at runtime (on Mac, the dependency on a debug server presumably comes via the system debugserver), so I added it to deps. Differential Revision: https://reviews.llvm.org/D109463	2021-09-08 19:33:51 -07:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Leonard Chan	d96e0c5388	Revert "[runtimes] Set more paths when building runtimes standalone" This reverts commit `407e07aa67`. Reverting since this seems to break OpenMP builds and our clang builders. See thread on https://reviews.llvm.org/D107895.	2021-09-08 18:31:10 -07:00
Chris Lattner	9e46dd965a	[APInt.h] Reduce the APInt header file interface a bit. NFC This moves one mid-size function out of line, inlines the trivial tcAnd/tcOr/tcXor/tcComplement methods into their only caller, and moves the magic/umagic functions into SelectionDAG since they are implementation details of its algorithm. This also removes the unit tests for magic, but these are already tested in the divide lowering logic for various targets. This also upgrades some C style comments to C++. Differential Revision: https://reviews.llvm.org/D109476	2021-09-08 18:17:07 -07:00
Jessica Paquette	22a64d4a14	[MachineOutliner][AArch64] Ensure LR is live-in when inserting reg-save calls Similar to other code which handles creating the function frame. If LR isn't live-in to the block that we're inserting the call into, we'll get a MachineVerifier error.	2021-09-08 17:44:27 -07:00
Amara Emerson	eae44c8a86	[GlobalISel] Implement merging of stores of truncates. This is a port of a combine which matches a pattern where a wide type scalar value is stored by several narrow stores. It folds it into a single store or a BSWAP and a store if the targets supports it. Assuming little endian target: i8 p = ... i32 val = ... p[0] = (val >> 0) & 0xFF; p[1] = (val >> 8) & 0xFF; p[2] = (val >> 16) & 0xFF; p[3] = (val >> 24) & 0xFF; => ((i32)p) = val; On CTMark AArch64 -Os this results in a good amount of savings: Program before after diff SPASS 412792 412788 -0.0% kc 432528 432512 -0.0% lencod 430112 430096 -0.0% consumer-typeset 419156 419128 -0.0% bullet 475840 475752 -0.0% tramp3d-v4 367760 367628 -0.0% clamscan 383388 383204 -0.0% pairlocalalign 249764 249476 -0.1% 7zip-benchmark 570100 568860 -0.2% sqlite3 287628 286920 -0.2% Geomean difference -0.1% Differential Revision: https://reviews.llvm.org/D109419	2021-09-08 17:06:33 -07:00
Philip Reames	e741fabc22	[SCEV] Move getIndexExpressionsFromGEP to delinearize [NFC]	2021-09-08 16:56:49 -07:00
Mehdi Amini	4eaaf05394	Add sanity check in MLIR ODS to catch case where two results have the same name This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109456	2021-09-08 23:38:50 +00:00
Chris Lattner	717ed1c310	[APInt.h] don't privatize "needsCleanup"; it is used by Clang APValue	2021-09-08 16:33:06 -07:00
David Blaikie	d18083c6dc	Error: Improve unit test by using gtest equality rather than explicit string compare calls This ensures error messages from gtest includes the raw text of both sides of the comparison - otherwise all gtest can report is the text of the expression source, without any information about the values or how they differ.	2021-09-08 16:21:11 -07:00
David Blaikie	f03689ace5	FileError: Provide a way to retrieve the underlying error string without the file name For use with APIs that want to report the file name in a different syntactic form, have other knowledge of the filename, etc.	2021-09-08 16:16:54 -07:00
David Blaikie	0c502507f4	FileError: Support zero-length file names It's a common error in an API - to try to open an empty file, so it seems like a reasonable FileError to produce "hey, you tried to open an empty file" and to handle it the same way as any other file error.	2021-09-08 16:16:54 -07:00
Chris Lattner	a024d35b38	[APInt.h] Clean up the APInt interface. NFC. This moves all the private implementation details to the bottom of the header, and pushes all the "make an APInt" stuff up to the top. This is in prep for making other changes to spiff up APInt a bit.	2021-09-08 16:08:57 -07:00
Usman Nadeem	0a9d740c23	[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored - Make flto an alias of flto=full. - Make foffload-lto an alias of foffload-lto=full. - Make flto_EQ_jobserver, flto_EQ_auto aliases of flto=full, since they are being treated as full lto right now. - Clean up the code for parseLTOMode and setLTOMode. - Replace uses of OPT_flto with OPT_flto_EQ since they alias now. Differential Revision: https://reviews.llvm.org/D108881 Change-Id: I5d867db83a680434fba5c8d85c9a83135d3b81ee	2021-09-08 15:53:49 -07:00
Leonard Chan	78f7ad5a83	[clang][Fuchsia] Remove COMPILER_RT_CAN_EXECUTE_TESTS I forgot that we run `check-runtimes-x86_64-unknown-linux-gnu`, which will run all compiler-rt tests also even though we are currently not in a state where we can run them all yet. Remove this for now to fix our CI builders.	2021-09-08 15:52:02 -07:00
Usman Nadeem	54612a037a	Revert "[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored" This reverts commit `d2d2e5ea48`.	2021-09-08 15:49:35 -07:00
Usman Nadeem	d2d2e5ea48	[clang][Driver] Update/cleanup LTO logic to ensure that the last lto argument is honored - Make flto an alias of flto=full. - Make foffload-lto an alias of foffload-lto=full. - Make flto_EQ_jobserver, flto_EQ_auto aliases of flto=full, since they are being treated as full lto right now. - Clean up the code for parseLTOMode and setLTOMode. - Replace uses of OPT_flto with OPT_flto_EQ since they alias now. Change-Id: Iea5338c20cb800b43529b20745e92600e2cfd2b1	2021-09-08 15:40:32 -07:00
Philip Reames	4b5e260b1d	[SCEV] Simplify findExistingSCEVInCache interface [NFC] We were returning a tuple when all but one caller only cared about one piece of the return value. That one caller can inline the complexity, and we can simplify all other uses.	2021-09-08 15:26:07 -07:00
Andrew Litteken	144cd22bae	[CodeExtractor] Creating exit stubs based off original order branch instructions. Previously the CodeExtractor created exit stubs, and the subsequent return value of the outlined function based on the order of out-of-region blocks after splitting any phi nodes, and collecting the blocks to be outlined. This could cause differences in order if there was a difference of exit block phi nodes between the two regions. This patch moves the collection of the output target blocks to be before this occurs, so that the assignment of target block to output value will be the same, regardless of the contents of the output block. Reviewers: paquette, roelofs Differential Revision: https://reviews.llvm.org/D108657	2021-09-08 15:15:15 -07:00
David Green	7ff67d5bf8	[AArch64] Rewrite floatdp_1source.ll test. NFC Rewrite this test to not rely on volatile stores in a large function, just use separate functions like any other test would.	2021-09-08 23:00:34 +01:00
Arthur Eubanks	fe15347a1e	Port the cost model printer to New PM Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D109284	2021-09-08 14:47:05 -07:00
Craig Topper	a574f0e0c3	[RISCV] Disable use of i128 shift libcalls on RV32. Since i128 isn't a legal C type on RV32, I don't believe libgcc implements these functions for RV32. compiler-rt does implement them because i128 support is enabled in order to handle long double. This is consistent with 32-bit X86 and ARM. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D109383	2021-09-08 14:26:07 -07:00
Eli Friedman	0375734439	[NFC] Add extra test for D106331	2021-09-08 14:18:47 -07:00
Jon Chesterfield	e62f4f172e	[openmp] `41c73671d0`, this time with staged patch applied	2021-09-08 22:07:47 +01:00
Michael Kruse	088577a38e	[Delinerization] Require by offset to be zero. Users of delinearization assume that the the offset into the array element is zero. In most cases it will indeed be zero, but if it is not, the delinearization has to fail since it violates that assumption without the API even allowing to signal to the caller that the by offset is non-zero. This bug caused Polly to miscompile blender (526.blender_r from SPEC CPU 2017) in -polly-process-unprofitable mode. The SCEV expression incorrectly delinearized has been reduced in the test case byte_offset.ll. The dropped offset into the array element of size 4 (a float) is ((sext i32 %mul7.i4534 to i64) + {(sext i32 %i1 to i64),+,((sext i32 (1 + ((1 + %shl.i.i) * (1 + %shl.i.i)) + %shl.i.i) to i64) * (sext i32 %i1 to i64))}<%for.body703>). This significant component was just dropped, and the wrong pointer was computed when regenerating code from the remaining delinearized subscripts. This occurred during blender's subsurface scattering implementation. As a result, blender's rendering diverged from the reference image. Patch D108885 would also fix the API. Reviewed By: bmahjour Differential Revision: https://reviews.llvm.org/D109133	2021-09-08 16:02:37 -05:00
Martin Storsjö	c4e8a2136c	[runtimes] Allow overriding where CMake installs RUNTIME type libraries (DLLs) Differential Revision: https://reviews.llvm.org/D107892	2021-09-09 00:01:38 +03:00
Martin Storsjö	407e07aa67	[runtimes] Set more paths when building runtimes standalone These paths are needed when building with per-target runtime directories. (It's possible to fix this by manually setting these when invoking cmake, but one isn't supposed to need to do that.) Also set LLVM_TOOLS_BINARY_DIR while touching this area (as it's also unset in this case) even if it isn't specifically needed by the per-target runtime configuration. Differential Revision: https://reviews.llvm.org/D107895	2021-09-09 00:01:38 +03:00
Greg Clayton	14850a0628	Log to the right stream in DwarfTransformer::handleDie(). Since we might end up using multiple threads when logging information in the DWARFTransformer, the handleDie() method must use the supplied stream named "OS" when logging warnings and errors. When we use multiple threads, we log to a thread specific stream buffer and then use a mutex to ensure our output doesn't overlap when we emit warnings and errors after a thread is done. Differential Revision: https://reviews.llvm.org/D109401	2021-09-08 14:00:19 -07:00
Jon Chesterfield	41c73671d0	[openmp] Re-enable test from D109057, now with windows path aware regex	2021-09-08 21:57:38 +01:00
Jonas Devlieghere	d1d4f36556	[lldb] Make sure there's a value for the key before dereferencing. Make sure there's a value for the shared_cache_base_address key exists in the dictionary before trying to dereference the value. rdar://76894476	2021-09-08 13:46:09 -07:00
Florian Hahn	f4726e7238	[LAA] Remove unused OrigPtr from replaceSymbolicStrideSCEV (NFC). The OrigPtr argument is not used in tree.	2021-09-08 22:35:36 +02:00
Chris Lattner	40a89da65c	[Canonicalize] Don't call isBeforeInBlock in OperationFolder::tryToFold. This patch (`e4635e6328`) fixed a bug where a newly generated/reused constant wouldn't dominate a folded operation. It did so by calling isBeforeInBlock to move the constant around on demand. This introduced a significant compile time regression, because "isBeforeInBlock" is O(n) in the size of a block the first time it is called, and the cache is invalidated any time canonicalize changes something big in the block. This fixes LLVM PR51738 and this CIRCT issue: https://github.com/llvm/circt/issues/1700 This does affect the order of constants left in the top of a block, I staged in the testsuite changes in rG42431b8207a5. Differential Revision: https://reviews.llvm.org/D109454	2021-09-08 13:33:22 -07:00
Michael Kruse	8ae6933881	[Polly] Compile fix after Delinearization move. by commit `585c594d74`	2021-09-08 15:30:19 -05:00
Nikita Popov	6dfdc6bfd2	[SROA] Support opaque pointers Make the following changes in order to support opaque pointers in SROA: * Generate i8 GEPs for opaque pointers. * Explicitly enforce that promotable allocas only have stores of the alloca type -- previously this was implicitly enforced. * Replace a check for pointer element type with load/store type. Differential Revision: https://reviews.llvm.org/D109259	2021-09-08 22:25:44 +02:00
Steven Wan	806ff3c4a4	[AIX] Check for typedef properly when getting preferred type align The current check for typedef is naive and doesn't deal with any convoluted cases. This patch makes use of the new 'AlignRequirement' enum field from 'TypeInfo' to determine whether or not this is an 'aligned' attribute on a typedef. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D109387	2021-09-08 16:21:52 -04:00
Arthur Eubanks	b493124ae2	[MemorySSA] Support invariant.group metadata The implementation is mostly copied from MemDepAnalysis. We want to look at all loads and stores to the same pointer operand. Bitcasts and zero GEPs of a pointer are considered the same pointer value. We choose the most dominating instruction. Since updating MemorySSA with invariant.group is non-trivial, for now handling of invariant.group is not cached in any way, so it's part of the walker. The number of loads/stores with invariant.group is small for now anyway. We can revisit if this actually noticeably affects compile times. To avoid invariant.group affecting optimized uses, we need to have optimizeUsesInBlock() not use invariant.group in any way. Co-authored-by: Piotr Padlewski <prazek@google.com> Reviewed By: asbirlea, nikic, Prazek Differential Revision: https://reviews.llvm.org/D109134	2021-09-08 13:06:12 -07:00
Louis Dionne	ff7a332e6f	[libc++] Revert OpenBSD-related changes to the documentation This commit partially reverts `0954e2b2d0` and `3fa4cff974`, which make changes to the libc++ documentation implifying that OpenBSD is supported. Neither of these changes have been reviewed AFAICT, so I'm reverting as a matter of enforcing: 1. That changes get reviewed before being committed 2. That we have a discussion and a support plan for supporting OpenBSD officially in libc++ Please note that I would be thrilled to support OpenBSD officially in libc++, however doing so requires more than adding a note in the docs. In particular, please make sure you read the note in [1] about setting up CI testing for OpenBSD. [1]: https://libcxx.llvm.org/#platform-and-compiler-support Differential Revision: https://reviews.llvm.org/D109373	2021-09-08 15:55:03 -04:00
Philip Reames	585c594d74	Move delinearization logic out of SCEV [NFC] None of this logic has anything to do with SCEV's internals, it just uses the existing public APIs. As a result, we can move the code from ScalarEvolution.cpp/hpp to Delinearization.cpp/hpp with only minor changes. This was discussed in advance on today's loop opt call. It turned out to be easy as hoped.	2021-09-08 12:28:35 -07:00
Nikita Popov	3e54de4df2	[ConstantHoisting] Support opaque pointers Directly use i8 for GEP, rather than fetching element type of i8*.	2021-09-08 21:23:10 +02:00
Louis Dionne	b4e88d4db1	[libc++][NFC] Rename _EnableIf to __enable_if_t for consistency In other places in the code, we use lowercase spelling for things that are not available in prior standards. Differential Revision: https://reviews.llvm.org/D109435	2021-09-08 15:20:58 -04:00
Akira Hatanaka	dea6f71af0	[ObjC][ARC] Use the addresses of the ARC runtime functions instead of integer 0/1 for the operand of bundle "clang.arc.attachedcall" https://reviews.llvm.org/D102996 changes the operand of bundle "clang.arc.attachedcall". This patch makes changes to llvm that are needed to handle the new IR. This should make it easier to understand what the IR is doing and also simplify some of the passes as they no longer have to translate the integer values to the runtime functions. Differential Revision: https://reviews.llvm.org/D103000	2021-09-08 11:58:03 -07:00
Akira Hatanaka	59cc39ae14	[ObjC][ARC] Use the addresses of the ARC runtime functions instead of integer 0/1 for the operand of bundle "clang.arc.attachedcall" This should make it easier to understand what the IR is doing and also simplify some of the passes as they no longer have to translate the integer values to the runtime functions. Differential Revision: https://reviews.llvm.org/D102996	2021-09-08 11:56:22 -07:00
Matt Morehouse	ff77c4eac7	[libFuzzer] Add missing argument to CrashResistantMerge. Fixes a build error caused by a bad merge conflict resolution for https://reviews.llvm.org/D105084.	2021-09-08 11:48:33 -07:00

1 2 3 4 5 ...

398631 Commits All Branches Search

398631 Commits

All Branches