llvm-project

Commit Graph

Author	SHA1	Message	Date
Bixia Zheng	64e171c2d0	Avoid unnecessary output buffer allocation and initialization. The sparse tensor code generator allocates memory for the output tensor. As such, we only need to allocate a MemRefDescriptor to receive the output tensor and do not need to allocate and initialize the storage for the tensor. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D115292	2021-12-09 08:29:02 -08:00
Shraiysh Vaishay	d4865393b5	[NFC][mlir][OpenMP] Added documentation for omp.atomic ops This patch adds the documentation for the operations `omp.atomic.read`, `omp.atomic.write` and `omp.atomic.update`. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D115445	2021-12-09 21:46:38 +05:30
Krzysztof Drewniak	e1da62910e	[MLIR][GPU] Define gpu.printf op and its lowerings - Define a gpu.printf op, which can be lowered to any GPU printf() support (which is present in CUDA, HIP, and OpenCL). This op only supports constant format strings and scalar arguments. - Define the lowering of gpu.printf to a call to printf() (which is what is required for AMD GPUs when using OpenCL) as well as to the hostcall interface present in the AMD Open Compute device library, which is the interface present when kernels are running under HIP. - Add a "runtime" enum that allows specifying which of the possible runtimes a ROCDL kernel will be executed under or that the runtime is unknown. This enum controls how gpu.printf is lowered. This change does not enable lowering for Nvidia GPUs, but such a lowering should be possible in principle. And: [MLIR][AMDGPU] Always set amdgpu-implicitarg-num-bytes=56 on kernels This is something that Clang always sets on both OpenCL and HIP kernels, and failing to include it causes mysterious crashes with printf() support. In addition, revert the max-flat-work-group-size to (1, 256) to avoid triggering bugs in the AMDGPU backend. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110448	2021-12-09 15:54:31 +00:00
Eugene Zhulenev	49ce40e9ab	[mlir] AsyncParallelFor: align block size to be a multiple of inner loops iterations Depends On D115263 By aligning block size to inner loop iterations in parallel_compute_fn, LLVM can later unroll and vectorize some of the inner loops with small trip counts. Up to 2x speedup in multiple benchmarks. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115436	2021-12-09 06:50:50 -08:00
Eugene Zhulenev	9f151b784b	[mlir] AsyncParallelFor: sink constants into the parallel compute function With complex recursive structure of async dispatch function LLVM can't always propagate constants to the parallel_compute_fn and it often prevents optimizations like loop unrolling and vectorization. We help LLVM by pushing known constants into the parallel_compute_fn explicitly. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115263	2021-12-09 06:48:23 -08:00
Matthias Springer	cc45a13422	[mlir][linalg][bufferize] LinalgOps can bufferize inplace with input args LinalgOp results usually bufferize inplace with output args. With this change, they may bufferize inplace with input args if the value of the output arg is not used in the computation. Differential Revision: https://reviews.llvm.org/D115022	2021-12-09 21:54:54 +09:00
Groverkss	6f9afad6d3	[MLIR] Move Presburger Math from FlatAffineConstraints to Presburger/IntegerPolyhedron This patch factors out math functionality that is a subset of Presburger arithmetic and moves it from FlatAffineConstraints to Presburger/IntegerPolyhedron. This patch only moves some parts of the functionality planned to be moved, with subsequent patches moving more functionality. There are three main reasons for this: 1. This split makes the Presburger Library easier and more flexible to use across MLIR, by not depending on IR. 2. This split allows the Presburger library to be developed independently from Affine Analysis, with Affine Analysis using this library. 3. With more functionality being upstreamed to the Presburger Library, the mlir/Analysis directory will be cluttered with Presburger library components since they depend on math functionality from FlatAffineConstraints. Moving this functionality to the Presburger directory allows keeping the new functionality in the Presburger directory. This patch is part of an ongoing effort to make the Presburger Library easier to use. The motivation for this effort is the feedback received at the LLVM conference from Mehdi and others. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D114674	2021-12-09 16:42:06 +05:30
Michel Weber	45ea542dd8	[MLIR] Introduce coalesce for PresburgerSet This patch provides functionality for simplifying `PresburgerSet`s by checking if any `FlatAffineConstraints` in the set is contained in another, and removing such redundant FACs. This is part of a series of patches to provide functionality for [integer set coalescing](http://impact.gforge.inria.fr/impact2015/papers/impact2015-verdoolaege.pdf) in MLIR. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D110617	2021-12-09 15:46:31 +05:30
Shraiysh Vaishay	d82c1f4e4b	[MLIR][OpenMP] Added omp.atomic.update This patch supports the atomic construct (update) following section 2.17.7 of OpenMP 5.0 standard. Also added tests and verifier for the same. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D112982	2021-12-09 15:21:24 +05:30
Nicolas Vasilache	d69f5e197c	[mlir][memref] Fix subview offset verification. Offset-specific verification seems to have been lost in one of the recent refactorings. Also add proper tests that would have caught this omission. This addresses the immediate issues discussed in: https://llvm.discourse.group/t/memref-subview-affine-map-and-symbols/4851 Differential Revision: https://reviews.llvm.org/D115427	2021-12-09 07:44:51 +00:00
MaheshRavishankar	6d7c9c3d0e	[mlir][Linalg] Bufferize the region of LinalgOps as well. The region of `linalg.generic` might contain `tensor` operations. For example, current lowering of `gather` uses a `tensor.extract` in the body of the `LinalgOp`. Bufferize the ops within a `LinalgOp` region as well to catch such cases. Differential Revision: https://reviews.llvm.org/D115322	2021-12-08 22:36:01 -08:00
Rob Suderman	23149d522b	[mlir] Added ctlz and cttz to math dialect and LLVM dialect Count leading/trailing zeros are an existing LLVM intrinsic. Added LLVM support for the intrinsics with lowerings from the math dialect to LLVM dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115206	2021-12-08 14:32:15 -08:00
Butygin	d8fce785de	[mlir][spirv] math.erf OpenCL lowering Differential Revision: https://reviews.llvm.org/D115335	2021-12-08 21:59:46 +03:00
Thomas Raoux	579c1ff67d	[mlir][nvvm] Add async copy ops to nvvm dialect Differential Revision: https://reviews.llvm.org/D115314	2021-12-08 09:42:20 -08:00
Matthias Springer	847710f7b7	[mlir][linalg][bufferize] Add dialect filter to BufferizationOptions This adds a new option `dialectFilter` to BufferizationOptions. Only ops from dialects that are allow-listed in the filter are bufferized. Other ops are left unbufferized. Note: This option requires `allowUnknownOps = true`. To make use of `dialectFilter`, BufferizationOptions or BufferizationState must be passed to various helper functions. The purpose of this change is to provide a better infrastructure for partial bufferization, which will be fully activated in a subsequent change. Differential Revision: https://reviews.llvm.org/D114691	2021-12-08 23:51:18 +09:00
Mehdi Amini	be0a7e9f27	Adjust "end namespace" comment in MLIR to match new agreed coding style See D115115 and this mailing list discussion: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154199.html Differential Revision: https://reviews.llvm.org/D115309	2021-12-08 06:05:26 +00:00
Mehdi Amini	3bed2a7212	Build MLIR with -Werror=mismatched-tags (NFC) This is a defensive action to catch at build time on Linux failures that may happen only on Windows otherwise. Differential Revision: https://reviews.llvm.org/D115316	2021-12-08 05:59:06 +00:00
Mehdi Amini	ee0908703d	Change the printing/parsing behavior for Attributes used in declarative assembly format The new form of printing attribute in the declarative assembly is eliding the `#dialect.mnemonic` prefix to only keep the `<....>` part. Differential Revision: https://reviews.llvm.org/D113873	2021-12-08 02:02:37 +00:00
Aart Bik	e1b9d80532	[mlir][sparse] add a few more sparse output tests (for generated IR) also fixes two typos in IR doc Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115288	2021-12-07 15:31:29 -08:00
Aart Bik	bb8632c1ef	[mlir][sparse] fix broken build rebase and commit crossed the getFunc change Reviewed By: Chia-hungDuan Differential Revision: https://reviews.llvm.org/D115270	2021-12-07 11:14:21 -08:00
Aart Bik	4f2ec7f983	[mlir][sparse] finalize sparse output in the presence of reductions This revision implements sparse outputs (from scratch) in all cases where the loops can be reordered with all but one parallel loop outermost. If the inner parallel loop appears inside one or more reduction loops, then an access pattern expansion is required (a.k.a. workspaces in TACO speak). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115091	2021-12-07 10:54:29 -08:00
Rob Suderman	e9fae0f19e	[mlir][tosa] Disable tosa.depthwise_conv2d canonicalizer for quantized case Quantized case needs to include zero-point corrections before the tosa.mul. Disabled for the quantized use-case. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D115264	2021-12-07 10:16:12 -08:00
Lei Zhang	7709b23bef	[mlir][scf] NFC: create dedicated files for affine utils These functions are generic utility functions that operate on affine ops within SCF regions. Moving them to their own files for a better code structure, instead of mixing with loop specialization logic. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115245	2021-12-07 10:55:32 -05:00
Nicolas Vasilache	61ba9f9110	[mlir][Linalg] NFC - Extend the TilingInterface to allow better composition with out-of-tree dialects. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D115233	2021-12-07 13:06:27 +00:00
Matthias Springer	8a232632c5	[mlir][linalg][bufferize] Add FuncOp bufferization pass This pass bufferizes FuncOp bodies, but not FuncOp boundaries. Differential Revision: https://reviews.llvm.org/D114671	2021-12-07 21:44:26 +09:00
Matthias Springer	4ccbf1d2fb	[mlir][linalg][bufferize] Fix forward declaration	2021-12-07 20:13:24 +09:00
Matthias Springer	958ae8b2d4	[mlir][linalg][bufferize] Bufferize Operation* instead of FuncOp This change mainly changes the API. There is no mention of FuncOps in ComprehensiveBufferize anymore. Also, bufferize methods of the op interface are called for ops without tensor operands/results if they have a region. Differential Revision: https://reviews.llvm.org/D115212	2021-12-07 19:53:44 +09:00
Prashant Kumar	3415b1ca63	[MLIR] Simplify division extraction unit testing. The new `getLocalReprs` function also outputs `dividends` and `denominators` and hence the CheckDivisionRepresentation fn is modified to take the newer getLocalReprs function into account. Signed-off-by: Prashant Kumar <pk5561@gmail.com> Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D115146	2021-12-07 11:53:04 +05:30
Shraiysh Vaishay	31cf42bd9a	[mlir][OpenMP] Added omp.atomic.read lowering This patch adds lowering from omp.atomic.read to LLVM IR along with the memory ordering clause. Tests for the same are also added. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115134	2021-12-07 11:17:30 +05:30
wren romano	d8731bfc93	[mlir][sparse] Requiring emitCInterface parameter to be explicit Depends On D115004 Cleans up code legibility by requiring the `emitCInterface` parameter to be explicit at all call-sites, and defining boolean aliases for that parameter. Reviewed By: aartbik, rriddle Differential Revision: https://reviews.llvm.org/D115005	2021-12-06 20:50:08 -08:00
not-jenni	5911a29aa9	[mlir][tosa] Add tosa.depthwise_conv2d as tosa.mul canonicalization For a 1x1 weight and stride of 1, the input/weight can be reshaped and multiplied elementwise then reshaped back Reviewed By: rsuderman, KoolJBlack Differential Revision: https://reviews.llvm.org/D115207	2021-12-06 17:28:52 -08:00
Matthias Springer	7ce427e3bc	[mlir][linalg][bufferize][NFC] Clean up BufferizationState Make fields private and clean up the interface. In particular, BufferizableOpInterface::bufferize no longer has access to `aliasInfo`. This was potentially dangerous because some of the ops registered in BufferizationAliasInfo may have been deleted. Differential Revision: https://reviews.llvm.org/D114931	2021-12-07 10:05:39 +09:00
Rob Suderman	05e33d846f	[mlir][tosa] Resubmit add tosa.conv2d as tosa.fully_connected canonicalization Fixed the tosa.conv2d to tosa.fully_connected canonicalization for incorrect output channels. Included updates to tests to include checks for the result shapes during canonicalization. This allows conv2d to transform to the simpler fully_connected operation. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115170	2021-12-06 15:33:07 -08:00
wren romano	f527fdf51e	[mlir][sparse] Code cleanup for SparseTensorConversion Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D115004	2021-12-06 14:13:35 -08:00
Eugene Zhulenev	68a7c001ad	[mlir] Improve async parallel for tests + fix typos Do load and store to verify that we process each element of the iteration space once. Reviewed By: cota Differential Revision: https://reviews.llvm.org/D115152	2021-12-06 13:27:54 -08:00
Rob Suderman	c5fef77bc3	[mlir] Add CtPop to MathOps with lowering to LLVM math.ctpop maps to the llvm.ctpop intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D114998	2021-12-06 11:54:20 -08:00
Alex Zinenko	d64b3e47ba	[mlir] Avoid needlessly converting LLVM named structs with compatible elements Conversion of LLVM named structs leads to them being renamed since we cannot modify the body of the struct type once it is set. Previously, this applied to all named struct types, even if their element types were not affected by the conversion. Make this behavior only applicable when element types are changed. This requires making the LLVM dialect type-compatibility check recursively look at the element types (arguably, it should have been doing that since the moment the LLVM dialect type system stopped being closed). In addition, have a more lax check for outer types only to avoid repeated check when necessary (e.g., parser, verifiers that are going to also look at the inner type). Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D115037	2021-12-06 13:42:11 +01:00
Matthias Springer	e761c49a14	[mlir][linalg][bufferize][NFC] Utilize isWritable for FuncOps This is a cleanup of ModuleBufferization. Instead of storing information about writable function arguments in BufferizationAliasInfo, we can use isWritable and make the decision there, based on dialect-specific bufferization state. Differential Revision: https://reviews.llvm.org/D114930	2021-12-06 18:36:54 +09:00
Matthias Springer	e9fb4dc9e9	[mlir][linalg][bufferize] Remove buffer equivalence from bufferize Remove all function calls related to buffer equivalence from bufferize implementations. Add a new PostAnalysisStep for scf.for that ensures that yielded values are equivalent to the corresponding BBArgs. (This was previously checked in `bufferize`.) This will be relaxed in a subsequent commit. Note: This commit changes two test cases. These were broken by design and should not have passed. With the new scf.for PostAnalysisStep, this bug was fixed. Differential Revision: https://reviews.llvm.org/D114927	2021-12-06 17:48:31 +09:00
MaheshRavishankar	3ec6b1bfac	[mlir] Add default implementations for methods in `TilingInterface`. Adding the default implementation of `getLoopIteratorTypes` and `getLoopBounds` allows ExternalModels to override these methods. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115101	2021-12-06 08:35:55 +00:00
Matthias Springer	cb4d0bf997	[mlir][linalg][bufferize][NFC] Collect equivalent FuncOp BBArgs in PostAnalysisStep Collect equivalent BBArgs right after the equivalence analysis of the FuncOp and before bufferizing. This is in preparation of decoupling bufferization from aliasInfo. Also gather equivalence info for CallOps, which was missing in the previous commit. Differential Revision: https://reviews.llvm.org/D114847	2021-12-06 17:31:39 +09:00
Michal Terepeta	caf89c0db6	[mlir][Vector] Support 0-D vectors in `ConstantMaskOp` To support creating both a mask with just a single `true` and `false` values, I had to relax the restriction in the verifier that the rank is always equal to the length of the attribute array, in other words, we now allow: - `vector.constant_mask [0] : vector<i1>` which gets lowered to `arith.constant dense<false> : vector<i1>` - `vector.constant_mask [1] : vector<i1>` which gets lowered to `arith.constant dense<true> : vector<i1>` (the attribute list for the 0-D case must be a singleton containing either `0` or `1`) Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115023	2021-12-06 08:03:04 +00:00
gysit	69bcff46bf	[mlir][linalg] Pad independent of application order (NFC). This revision makes the padding pattern independent of the application order. It addresses the concern that we cannot rely on the execution order of the greedy rewriter (https://reviews.llvm.org/D114689). Instead, the pattern is updated to apply repeatedly till all operations are padded. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114851	2021-12-06 07:26:15 +00:00
Mehdi Amini	afb0582325	Fix TOSA verifier to emit verbose errors Also add a test for invalid ops, which was missing.	2021-12-05 19:16:54 +00:00
Butygin	91072b74f8	[mlir] Add InlinerInterface to bufferization dialect Differential Revision: https://reviews.llvm.org/D115080	2021-12-04 23:45:56 +03:00
Hugo Pompougnac	5d49511b30	Apply the permutation map on each affine nest When using -test-loop-permutation="permutation-map=...", applies the permutation map on each affine nest in the function (and not only the first one). If the size of the permutation map and the size of a nest are not consistent, do nothing on this particular nest (instead of making MLIR crash). Differential Revision: https://reviews.llvm.org/D112947	2021-12-04 17:48:34 +05:30
Chia-hung Duan	b8c6b15283	[mlir] Support collecting logs from notifyMatchFailure(). Let users register their own handler to process the match failure information. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D110896	2021-12-04 04:35:24 +00:00
Mehdi Amini	4022152b35	Use LLVM_ATTRIBUTE_UNUSED to silence warning for static function used in assert only (NFC)	2021-12-04 04:23:21 +00:00
Matthias Springer	5fa0b3561a	[mlir][linalg][bufferize] Implement equivalence analysis Instead of checking buffer equivalence during bufferization, gather buffer equivalence information right after the analysis. This is in preparation of decoupling bufferization from BufferizationAliasInfo. This change also fixes equivalence analysis for scf.if op results, which was not fully implemented. scf.if op results are equivalent to their corresponding yield values if both yield values are equivalent. Differential Revision: https://reviews.llvm.org/D114774	2021-12-04 11:52:04 +09:00
Uday Bondhugula	2108ed0671	[MLIR] Fix affine.for unroll for multi-result upper bound maps Fix affine.for unroll for multi-result upper bound maps: these can't be unrolled/unroll-and-jammed in cases where the trip count isn't known to be a multiple of the unroll factor. Fix and clean up repeated/unnecessary checks/comments at helper callees. Also, fix clang-tidy variable naming warnings and redundant includes. Differential Revision: https://reviews.llvm.org/D114662	2021-12-04 07:20:26 +05:30


9471 Commits