llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	cbf6737cc4	[RISCV] Use RVVBitsPerBlock instead of hardcoding multiples of 64. NFC	2022-06-17 14:10:39 -07:00
Benjamin Kramer	d5c29b23e1	[mlir][sparse] Inline the definition of LLVM_ATTRIBUTE_WEAK This library is supposed not to have a dependency on LLVM, and linking LLVMSupport into it breaks its shared library setup.	2022-06-17 22:41:10 +02:00
Rafael Auler	9d5e6ccd9b	[BOLT] Fix for missing entry offset Temporary fix for missing entry offset when creating address translation tables (BAT) after D127935 landed. Will later work on assigning a more reasonable offset different than zero. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D128092	2022-06-17 13:14:42 -07:00
Benjamin Kramer	feb8313fbf	[bazel] Add missing dependency after 455679413 This is supposed to be header-only. Don't know how to express that in bazel.	2022-06-17 22:09:13 +02:00
Benjamin Kramer	3420cd7caf	[mlir][sparse] Add testing for bf16 and fallback for software bf16 This adds weak versions of the truncation libcalls in case the runtime environment doesn't have them. Differential Revision: https://reviews.llvm.org/D128091	2022-06-17 21:54:01 +02:00
Aart Bik	86d5d34c72	[mlir][sparse] renable f16 tests Sparse library ABI issues are fixed. https://github.com/llvm/llvm-project/issues/55992 Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D128086	2022-06-17 12:48:24 -07:00
Philip Reames	fb8ecca06f	[RISCV] Remove redundant code checking for exact VTYPE match [nfc] Should be fully covered by the generic demanded field based logic just below, and this ensures better coverage of that logic.	2022-06-17 12:20:20 -07:00
Philip Reames	4d245f1bc2	[RISCV] Move store policy and mask reg ops into demanded handling in InsertVSETVLI Doing so let's the post-mutation pass leverage the demanded info to rewrite vsetvlis before a store/mask-op to eliminate later vsetvlis. Sorry for the lack of store test change; all of my attempts to write something reasonable have been handled through existing logic.	2022-06-17 12:09:50 -07:00
Florian Hahn	e9cced2739	Recommit "[LAA] Initial support for runtime checks with pointer selects." This reverts commit `7aa8a67882`. This version includes fixes to address issues uncovered after the commit landed and discussed at D11448. Those include: * Limit select-traversal to selects inside the loop. * Freeze pointers resulting from looking through selects to avoid branch-on-poison.	2022-06-17 21:06:26 +02:00
Sanjay Patel	f126643862	[AArch64] add tests for masked subtract; NFC	2022-06-17 14:56:32 -04:00
Nikolas Klauser	db1978b674	[libc++] Mark standard-mandated includes as such Reviewed By: ldionne, Mordante, #libc, saugustine Spies: saugustine, MaskRay, arichardson, mstorsjo, jloser, libcxx-commits, arphaman Differential Revision: https://reviews.llvm.org/D127953	2022-06-17 20:43:33 +02:00
Chris Bieneman	d5745d0015	[docs] Adding table of object file formats The added section and table here list the object file formats LLVM MC supports and which targets support each format. Differential Revision: https://reviews.llvm.org/D127645	2022-06-17 13:37:52 -05:00
Chris Bieneman	f9e49644f4	Revert "wip" This reverts commit `0dd243fa8a`. I accidentally pushed this! Oops!	2022-06-17 13:36:53 -05:00
Frederik Gossen	e694b22344	[mlir][nvgpu] Fix Bazel BUILD file Differential Revision: https://reviews.llvm.org/D128078	2022-06-17 14:35:30 -04:00
Chris Bieneman	5b77a45c7f	[docs] Adding DirectX target usage doc This document is a work in progress to begin fleshing out documentation for the DirectX backend and related changes in the LLVM project. This is not intended to be exhaustive or complete, it is intended as a starting point so taht future changes have a place for documentation to land. Differential Revision: https://reviews.llvm.org/D127640	2022-06-17 13:34:25 -05:00
Chris Bieneman	0dd243fa8a	wip	2022-06-17 13:34:21 -05:00
Arthur Eubanks	05704e785a	[docs] Fix typo	2022-06-17 11:29:07 -07:00
isuckatcs	e77ac66b8c	[Static Analyzer] Structured binding to data members Introducing structured binding to data members. Differential Revision: https://reviews.llvm.org/D127643	2022-06-17 19:50:10 +02:00
Stanislav Gatev	ba53906cef	[clang][dataflow] Add support for comma binary operator Add support for comma binary operator. Differential Revision: https://reviews.llvm.org/D128013 Reviewed-by: ymandel, xazax.hun	2022-06-17 17:48:21 +00:00
Philip Reames	b595cddea7	[riscv] Extract isMaskRegOp helper [nfc]	2022-06-17 10:40:54 -07:00
Nico Weber	527395bd10	[gn build] (manually) port `2040b6df0a` (DebugInfoSymbolizerTests)	2022-06-17 13:37:11 -04:00
Daniel Thornburgh	2040b6df0a	[Symbolize] Parser for log symbolizer markup. This adds a parser for the log symbolizer markup format discussed in https://discourse.llvm.org/t/rfc-log-symbolizer/61282. The parser operates in a line-by-line fashion with minimal memory requirements. This doesn't yet include support for multi-line tags or specific parsing for ANSI X3.64 SGR control sequences, but it can be extended to do so. The latter can also be relatively easily handled by examining the resulting text elements. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D124686	2022-06-17 10:26:24 -07:00
Kiran Chandramohan	914b9eec04	[Flang] Fixes for XArrayCoorOp The upstreamed code was not incrementing the sliceOffset in multiples of 3. This issue is fixed by using Offsets and incrementing by 3 during every iteration. In the conversion pattern, we were comparing the definingOp of an operand with an FIR::UndefOp. Use LLVM::UndefOp for conversion. Reviewed By: clementval, Leporacanthicus Differential Revision: https://reviews.llvm.org/D128017	2022-06-17 17:14:55 +00:00
LLVM GN Syncbot	fd5a26e610	[gn build] Port `971e9c80e9`	2022-06-17 17:10:15 +00:00
Nikolas Klauser	971e9c80e9	[libc++] Implement std::boyer_moore{, _horspool}_searcher This mostly copys the `<experimental/functional>` stuff and updates the code to current libc++ style. Reviewed By: ldionne, #libc Spies: nlopes, adamdebreceni, arichardson, libcxx-commits, mgorny Differential Revision: https://reviews.llvm.org/D121074	2022-06-17 19:09:40 +02:00
Philip Reames	e1f1407beb	[RISCV] Delete dead elideCopy code in InsertVSETVLI [nfc] This code should be dead. A simple whole register copy of an IMPLICIT_DEF, is simply an IMPLICIT_DEF of it's own. (This would not be true for freeze, but is for copy.) If we find a case which gets here with vector operand copy of an IMPLICIT_DEF, we most likely have an earlier missed optimization anyways. (The most recent case of this was `e6c7a3a`, found by Craig during review of this patch.) There might be others, and if so, we'll revisit them individually as regressions are reported. Differential Revision: https://reviews.llvm.org/D127996	2022-06-17 09:58:11 -07:00
Joe Nash	75378d432f	[AMDGPU] NFC. Change comment format on gfx11 interp and ldsdir intrinsics	2022-06-17 12:28:26 -04:00
isuckatcs	92bf652d40	[Static Analyzer] Small array binding policy If a lazyCompoundVal to a struct is bound to the store, there is a policy which decides whether a copy gets created instead. This patch introduces a similar policy for arrays, which is required to model structured binding to arrays without false negatives. Differential Revision: https://reviews.llvm.org/D128064	2022-06-17 18:56:13 +02:00
Michael Jones	ad709a752d	[libc][obvious] fix sign warning in file_writer In the sign writer, a size_t was being compared to an int. This patch casts the size_t to an int so that the comparison doesn't cause a sign comparison warning. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D127984	2022-06-17 09:48:04 -07:00
Nikolas Klauser	fb3477a4da	[libc++] Unwrap reverse_iterator<reverse_iterator<Iter>> in __unwrap_iter Simplify the implementation of `std::copy` and `std::move` by using `__unwrap_iter` and `__rewrap_iter` to unwrap and rewrap `reverse_iterator<reverse_iterator<Iter>>` instead of specializing `__copy_impl` and `__move_impl`. Reviewed By: ldionne, #libc Spies: wenlei, libcxx-commits Differential Revision: https://reviews.llvm.org/D127049	2022-06-17 18:47:18 +02:00
Martin Sebor	5fb67e32f8	[InstCombine] Fold memcmp of constant arrays and variable size The memcmp simplifier is limited to folding to constants calls with constant arrays and constant sizes. This change adds the ability to simplify memcmp(A, B, N) calls with constant A and B and variable N to the pseudocode equivalent of N <= Pos ? 0 : (A < B ? -1 : B < A ? +1 : 0) where Pos is the offset of the first mismatch between A and B. Differential Revision: https://reviews.llvm.org/D127766	2022-06-17 10:35:35 -06:00
isuckatcs	fc6b2281bf	[Static Analyzer][CFG] Introducing the source array in the CFG of DecompositionDecl For DecompositionDecl, the array, which is being decomposed was not present in the CFG, which lead to the liveness analysis falsely detecting it as a dead symbol. Differential Revision: https://reviews.llvm.org/D127993	2022-06-17 18:34:34 +02:00
bixia1	48f4407c1a	[mlir][linalg] Extend opdsl to support operations on complex types. Linalg opdsl now supports negf/add/sub/mul on complex types. Add a test. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D128010	2022-06-17 09:34:26 -07:00
Adrian Prantl	f000de8760	[LLDB][ExpressionParser] Fix indices inside format-strings passed to LLDB_LOG llvm::formatv expects the parameter indexes to start with 0. Unfortunately it doesn't detect out-of-bounds accesses in the format string at compile-time, of which we had several inside ClangExpressionDeclMap. This patch fixes these out-of-bounds format accesses. Example output Before ClangExpressionDeclMap::FindExternalVisibleDecls for '$__lldb_class' in a 'TranslationUnit' CEDM::FEVD Searching the root namespace CEDM::FEVD Adding type for $__lldb_class: 1 After ClangExpressionDeclMap::FindExternalVisibleDecls for '$__lldb_class' in a 'TranslationUnit' CEDM::FEVD Searching the root namespace CEDM::FEVD Adding type for $__lldb_class: class (lambda) Patch by Michael Buch! Differential Revision: https://reviews.llvm.org/D128063	2022-06-17 09:26:01 -07:00
Nico Weber	c2bb2e5973	[gn build] (manually) port `7cca33b40f` Not really needed for anything as far as I can tell (?), more for completeness.	2022-06-17 11:51:10 -04:00
Christopher Bate	d089d68a2c	[mlir][nvgpu] fix missing build dependency for NVGPUTransforms Fixes build failure caused by `51b925df94`	2022-06-17 09:42:49 -06:00
Aart Bik	aef20f59a5	[mlir][sparse] move from by-value to by-reference for data types This fixes all sorts of ABI issues due to passing by-value (using by-reference with memref's exclusively). Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D128018	2022-06-17 08:39:25 -07:00
Christopher Bate	51b925df94	[mlir][nvgpu] shared memory access optimization pass This change adds a transformation and pass to the NvGPU dialect that attempts to optimize reads/writes from a memref representing GPU shared memory in order to avoid bank conflicts. Given a value representing a shared memory memref, it traverses all reads/writes within the parent op and, subject to suitable conditions, rewrites all last dimension index values such that element locations in the final (col) dimension are given by `newColIdx = col % vecSize + perm[row](col/vecSize,row)` where `perm` is a permutation function indexed by `row` and `vecSize` is the vector access size in elements (currently assumes 128bit vectorized accesses, but this can be made a parameter). This specific transformation can help optimize typical distributed & vectorized accesses common to loading matrix multiplication operands to/from shared memory. Differential Revision: https://reviews.llvm.org/D127457	2022-06-17 09:31:05 -06:00
Guillaume Chatelet	90f96ec7a5	[NFC][Alignment] Remove assumeAligned from MachineFrameInfo ctor	2022-06-17 15:21:17 +00:00
Quinn Pham	deb7655209	[PowerPC] Fix PPCVSXSwapRemoval pass to include MTVSCR and MFVSCR as not swappable. This patch adds the instructions `MTVSCR` and `MFVSCR` as not swappable to the PPCVSXSwapRemoval pass because they are not lane-insensitive. This will prevent the compiler from optimizing out required swaps when using `lxvd2x` and `stxvd2x`. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D128062	2022-06-17 10:14:24 -05:00
Philip Reames	755c84c62c	[RISCV] Avoid changing etype for splat of 0 or -1 A splat of the values 0 and -1 as sign extended 12 bit immediates are always the same bit pattern regardless of the etype used to perform the operation. As a result, we can sometimes avoid introducing a vsetvli just for the purposes of a splat. Looking at the diffs, we don't get a huge amount of immediate value out of this. We mostly push the vsetvli one instruction down, usually in front of a vmerge. We also don't get the corresponding fixed length vector cases because VL typically is changed despite the actual bits written being the same. Both of these are areas I plan to explore in future patches. Interestingly, this makes a great example of why we need the forward and backward implementation to be consistent. Before we merged the demanded field handling, if we implement only the forward direction, we lost the ability to mutate a prior vsetvli and eliminate a later one entirely. This resulted in practical regressions instead of improvements. It's always nice when practice matches theory. :) Differential Revision: https://reviews.llvm.org/D128006	2022-06-17 08:10:14 -07:00
Ben Langmuir	4a3a9a5fa0	[clang][deps] Sort submodules when calculating dependencies Dependency scanning does not care about the order of submodules for correctness, so sort the submodules so that we get the same command-lines to build the module across different TUs. The order of inferred submodules can vary depending on the order of #includes in the including TU. Differential Revision: https://reviews.llvm.org/D128008	2022-06-17 07:55:27 -07:00
Sanjay Patel	bfde861935	[InstCombine] convert mask and shift of power-of-2 to cmp+select When the mask is a power-of-2 constant and op0 is a shifted-power-of-2 constant, test if the shift amount equals the offset bit index: (ShiftC << X) & C --> X == (log2(C) - log2(ShiftC)) ? C : 0 (ShiftC >> X) & C --> X == (log2(ShiftC) - log2(C)) ? C : 0 This is an alternate to D127610 with a more general pattern. We match only shift+and instead of the trailing xor, so we see a few more tests diffs. I think we discussed this initially in D126617. Here are proofs for shifts in both directions: https://alive2.llvm.org/ce/z/CFrLs4 The test diffs look equal or better for IR, and this makes the patterns more uniform in IR. The backend can partially invert this in both cases if that is profitable. It is not trivially reversible, however, so if we find perf regressions that are not easy to undo, then we may want to revert this. Differential Revision: https://reviews.llvm.org/D127801	2022-06-17 10:51:57 -04:00
Kadir Cetinkaya	1a02c963e3	Revert "Revert "[clang] Dont print implicit forrange initializer"" This reverts commit `7aac15d5df`. Only updates the tests, as these statements are still part of the CFG and its just the pretty printer policy that changes. Hopefully this shouldn't affect any analysis.	2022-06-17 16:51:16 +02:00
Guillaume Chatelet	7b73f53790	[libc] Rely on __builtin_memcpy_inline for memcpy implementation This patch removes usage of `-mllvm -combiner-global-alias-analysis` and relies on compiler builtin to implement `memcpy`. Note that `-mllvm -combiner-global-alias-analysis` is actually only useful for functions where buffers can alias (namely `memcpy` and `memmove`). The other memory functions where not benefiting from the flag anyways. The upside is that the memory functions can now be compiled from source with thinlto (thinlto would not be able to carry on the flag when doing inlining). The downside is that for compilers other than clang (i.e. not providing `__builtin_memcpy_inline`) the codegen may be worse. Differential Revision: https://reviews.llvm.org/D128051	2022-06-17 14:22:26 +00:00
Eric Schweitz	6c89c53144	[flang] Fix bug in character casting. Add missing sext/trunc in code gen. This patch is part of the upstreaming effort from fir-dev branch. It also ensures all descriptors created inline complies with LBOUND requirement that the lower bound is `1` when the related dimension extent is zero. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier, PeteSteinfeld Differential Revision: https://reviews.llvm.org/D128047 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-06-17 16:10:12 +02:00
Joe Nash	2a68364745	[AMDGPU] gfx11 waitcnt support for VINTERP and LDSDIR instructions Reviewed By: rampitec, #amdgpu Differential Revision: https://reviews.llvm.org/D127781	2022-06-17 09:30:37 -04:00
Malhar Jajoo	6bb40552f2	[LoopVectorize] Add support for invariant stores of ordered reductions Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D126772	2022-06-17 14:56:21 +01:00
Nikolas Klauser	929d5de22c	[libc++] Simplify __config a bit Simplify logic in `__config` by assuming that we are using Clang in C++03 mode. Also, use standardized feature-test macros instead of compiler-specific checks (like `__has_feature`) in a couple of places. Reviewed By: ldionne, #libc Spies: libcxx-commits Differential Revision: https://reviews.llvm.org/D127606	2022-06-17 15:54:31 +02:00
Joe Nash	20d20156f4	[AMDGPU] gfx11 VINTERP intrinsics and ISel support Depends on D127664 Reviewed By: rampitec, #amdgpu Differential Revision: https://reviews.llvm.org/D127756	2022-06-17 09:16:59 -04:00

... 3 4 5 6 7 ...

427384 Commits All Branches Search

427384 Commits

All Branches