llvm-project

Commit Graph

Author	SHA1	Message	Date
Stella Laurenzo	34696e6542	[NFC] Generalize a couple of passes so they can operate on any FunctionLike op. * Generalizes passes linalg-detensorize, linalg-fold-unit-extent-dims, convert-elementwise-to-linalg. * I feel that more work could be done in the future (i.e. make FunctionLike into a proper OpInterface and extend actions in dialect conversion to be trait based), and this patch would be a good record of why that is useful. * Note for downstreams: * Since these passes are now generic, they do not automatically nest with pass managers set up for that. * If running them over nested functions, you must nest explicitly. Upstream has adopted this style but *-opt still has some uses of implicit pipelines via args. See tests for argument changes needed. Differential Revision: https://reviews.llvm.org/D115645	2021-12-13 12:01:53 -08:00
gysit	8fc0525a15	[mlir][linalg] Stage application of pad tensor op vectoriztaion. Adapt the LinalgStrategyVectorizationPattern pass to apply the vectorization patterns in two stages. The change ensures the generic pad tensor op vectorization pattern does not run too early. Additionally, the revision adds the transfer op canonicalization patterns to the set of applied patterns, since they are needed to enable efficient vectorization for rank-reduced convolutions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115627	2021-12-13 19:49:35 +00:00
Alexander Belyaev	206365bf8f	[mlir] Update comments that mention `linalg.collapse/expand` shape.	2021-12-13 20:35:34 +01:00
Lei Zhang	f56b1d813f	[mlir][spirv] Use ScopedPrinter in deserialization debugging This gives us better debugging print as it supports indent levels and other nice features. Reviewed By: Hardcode84 Differential Revision: https://reviews.llvm.org/D115583	2021-12-13 10:51:56 -05:00
Lei Zhang	5e55a20119	[mlir][spirv] Serialize selection with separate header block The previous "optimization" that tries to reuse existing block for selection header block can be problematic for deserialization because it effectively pulls in previous ops in the selection op's enclosing block into the selection op's header. When deserializing, those ops will be placed in the selection op's region. If any of the previous ops has usage after the section op, it will break. That is, the following IR cannot round trip: ```mlir ^bb: %def = ... spv.mlir.selection { ... } %use = spv.SomeOp %def ``` This commit removes the "optimization" to always create new blocks for the selection header. Along the way, also made error reporting better in deserialization by turning asserts into proper errors and add check of uses outside of sinked structured control flow region blocks. Reviewed By: Hardcode84 Differential Revision: https://reviews.llvm.org/D115582	2021-12-13 10:42:26 -05:00
gysit	6c85a49e22	[mlir][memref] Use current source type in getCanonicalSubViewResultType. Use the current instead of the new source type to compute the rank-reduction map in getCanonicalSubViewResultType. Otherwise, the computation of the rank-reduction map fails when folding a cast into a subview since the strides of the new source type cannot be related to the strides of the current result type. Depends On D115428 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115446	2021-12-13 14:50:41 +00:00
Markus Böck	664cc9312c	[mlir] Implement `DataLayoutTypeInterface` for `LLVMStructType` Using this implementation of the interface it is possible to query the size, ABI alignment as well as the preferred alignment of a struct. It should yield the same results as LLVMs `llvm::DataLayout` on an equivalent `llvm::StructType`, including for packed structs. Additionally it is also possible to increase the ABI and preferred alignment using a data layout entry with the type `llvm.struct<()>, which serves the same functionality as the `a:` component in LLVMs data layout string. Differential Revision: https://reviews.llvm.org/D115600	2021-12-13 15:09:16 +01:00
gysit	db7a2e9176	[mlir][linalg] Only compose PadTensorOps if no ExtractSliceOp is rank-reducing. Do not compose pad tensor operations if the extract slice of the outer pad tensor operation is rank reducing. The inner extract slice op cannot be rank-reducing since it source type must match the desired type of the padding. Depends On D115359 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115428	2021-12-13 13:01:30 +00:00
gysit	6859f8ed1e	[mlir][linalg] Adapt the PadTensorOpVectorizationWithInsertSlicePattern matching. Tighten the matcher of the PadTensorOpVectorizationWithInsertSlicePattern pattern. Only match if the PadOp result is used by the InsertSliceOp source. Fail if the result is used by the InsertSliceOp dest. Depends On D115336 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115359	2021-12-13 12:55:07 +00:00
gysit	f895e95138	[mlir][linalg] Make padding work for rank-reducing slice ops. Adapt the computation of a static bounding box to take rank-reducing slice operations into account by filtering out reduced size one dimensions. The revision is needed to make padding work for decomposed convolution operations. The decomposition introduces rank reducing extract slice operations that previously let padding fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115336	2021-12-13 12:34:20 +00:00
Jacques Pienaar	b743ff161b	[mlir] Relax restriction on name location parsing We currently restrict parsing of location to not allow nameloc being nested inside nameloc. This restriction may be historical as there doesn't seem to be a reason for it anymore (locations like this can be constructed in C++ and they print fine). Relax this restriction in the parser to allow this nesting. Differential Revision: https://reviews.llvm.org/D115581	2021-12-12 08:06:59 -08:00
Jacques Pienaar	efb7727a96	[mlir] Flag near misses in file splitting Flags some potential cases where splitting isn't happening and so could result in confusing results. Also update some test files where there were near misses in splitting that seemed unintentional. Differential Revision: https://reviews.llvm.org/D109636	2021-12-12 08:03:30 -08:00
Nicolas Vasilache	408553dd96	[mlir][Vector] Support 0-D vectors in `CreateMaskOp` The 0-D case gets lowered in almost the same way that the 1-D case does in VectorCreateMaskOpConversion. I also had to slightly update the verifier for the op to always require exactly 1 operand in the 0-D case. Depends On D115220 Reviewed by: ftynse Differential revision: https://reviews.llvm.org/D115221	2021-12-12 13:32:29 +00:00
Arjun P	d7ec4d0be3	[MLIR] PresburgerSet subtraction: fix bug where the set `b` was not restored properly on return When subtracting `b \ c`, when there are divisions in `c`, these division constraints get added to `b`. `b` must be restored to its original state when returning, but these added divisions constraints were not removed in one of the return paths. This patch fixes this and deduplicates the restoration logic by encapuslating it in a lambda `restoreState`. The patch also includes a regression test for the bug fix. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D115577	2021-12-12 17:14:04 +05:30
Lei Zhang	731676b10d	[mlir][spirv] Fix nested control flow serialization If we have a `spv.mlir.selection` op nested in a `spv.mlir.loop` op, when serializing the loop's block, we might need to jump from the selection op's merge block, which might be different than the immediate MLIR IR predecessor block. But we still need to get the block argument from the MLIR IR predecessor block. Also, if the `spv.mlir.selection` is in the `spv.mlir.loop`'s header block, we need to make sure `OpLoopMerge` is emitted in the current block before start processing the nested selection op. Otherwise we'll see the LoopMerge in the wrong SPIR-V basic block. Reviewed By: Hardcode84 Differential Revision: https://reviews.llvm.org/D115560	2021-12-11 14:47:19 -05:00
Jacques Pienaar	1ab3efac41	[mlir][python] Add fused location	2021-12-11 10:16:13 -08:00
Groverkss	c6a8bec4c5	[MLIR][FlatAffineConstraints] Add support for extracting divisions with tighter bounds This patch adds support for extracting divisions when the set contains bounds which are tighter than the division bounds. For example: ``` 3q - i + 2 >= 0 <-- Lower bound for 'q' -3q + i - 1 >= 0 <-- Tighter upper bound for 'q' ``` Here, the actual upper bound for division for `q` would be `-3q + i >= 0`, but since this actual upper bound is implied by a tighter upper bound, which awe can still extract the divison. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D115096	2021-12-11 16:23:54 +05:30
Lei Zhang	3ed47bcc96	[mlir][spirv] Propagate LogicalResult in (de)serialization `(void)` was added when LogicalResult was marked as non discard. This commit cleans them up to properly propagate failures. Reviewed By: scotttodd Differential Revision: https://reviews.llvm.org/D115541	2021-12-10 19:20:49 -05:00
Lei Zhang	222d7fc7f8	[mlir][spirv] Avoid duplicated Block decoration during serialization It's legal per the Vulkan / SPIR-V spec; still it's better to avoid such duplication to have cleaner blob and reduce the binary size. Reviewed By: scotttodd Differential Revision: https://reviews.llvm.org/D115532	2021-12-10 19:20:49 -05:00
Lei Zhang	b289266cb2	[mlir][spirv] Add serialization control to emit symbol name In SPIR-V, symbol names are encoded as `OpName` instructions. They are not semantic impacting and can be omitted, which can reduce the binary size. Reviewed By: scotttodd Differential Revision: https://reviews.llvm.org/D115531	2021-12-10 19:20:49 -05:00
Nicolas Vasilache	f2e945a393	Revert "[mlir][tensor] Fix insert_slice + tensor cast overflow" This reverts commit `5601821dae`. The prefix + canonical complete behavior is actually obsolete and should not be reintroduced. Reverting.	2021-12-10 22:53:52 +00:00
Arjun P	d6f9bb0321	[MLIR] FlatAffineConstraints::isIntegerEmpty: fix bug in computation of duals The method that was previously used for computing dual variables was incorrect. This was used in the integer emptiness check algorithm, where this bug could lead to much longer running times. (Due to the way it is used, this never results in an incorrect emptiness check result.) This patch fixes the dual computation and adds some additional asserts that catch this bug, along with regression test cases that trigger the asserts when the incorrect dual computation is used. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D113803	2021-12-11 03:48:40 +05:30
Arjun P	98db55f108	[MLIR] IntegerPolyhedron: introduce getNumIdKind to replace calls to assertAtMostNumIdKind Introduce a function `getNumIdKind` that returns the number of ids of the specified kind. Remove the function `assertAtMostNumIdKind` and instead just directly assert the inequality with a call to `getNumIdKind`.	2021-12-11 03:42:46 +05:30
Thomas Raoux	f56933b263	[mlir][vector] NFC move vector unroll/distribute patterns to their own file Differential Revision: https://reviews.llvm.org/D115548	2021-12-10 14:00:13 -08:00
Uday Bondhugula	bc657b2eef	[MLIR][NFC] Move out affine scalar replacement utility to affine utils NFC. Move out and expose affine scalar replacement utility through affine utils. Renaming misleading forwardStoreToLoad -> affineScalarReplace. Update a stale doc comment. Differential Revision: https://reviews.llvm.org/D115495	2021-12-11 03:26:42 +05:30
Nicolas Vasilache	5601821dae	[mlir][tensor] Fix insert_slice + tensor cast overflow InsertSliceOp may have subprefix semantics where missing trailing dimensions are automatically inferred directly from the operand shape. This revision fixes an overflow that occurs in such cases when the impl is based on the op rank. Differential Revision: https://reviews.llvm.org/D115549	2021-12-10 21:41:26 +00:00
River Riddle	233e9476d8	[mlir:PDL] Allow non-bound pdl.attribute/pdl.type operations that create constants This allows for passing in these attributes/types to constraints/rewrites as arguments. Differential Revision: https://reviews.llvm.org/D114817	2021-12-10 19:38:43 +00:00
River Riddle	06c3b9c7be	[mlir:PDL] Fix bugs in PDLPatternModule merging * Constraints/Rewrites registered before a pattern was added were dropped * Constraints/Rewrites may be registered multiple times (if different pattern sets depend on them) * ModuleOp no longer has a terminator, so we shouldn't be removing the terminator from it Differential Revision: https://reviews.llvm.org/D114816	2021-12-10 19:38:43 +00:00
Mogball	0845635eda	[mlir][ir] Custom ops' parse/print fall back to dialect hooks Custom ops that have no parser or printer should fall back to the dialect's parser and/or printer hooks. This avoids the need to define parsers and printers that simply dispatch to the dialect hook. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D115481	2021-12-10 19:34:25 +00:00
Alexander Belyaev	b618880e7b	[mlir] Move `linalg.tensor_expand/collapse_shape` to TensorDialect. RFC: https://llvm.discourse.group/t/rfc-reshape-ops-restructuring/3310 linalg.fill gets a canonicalizer, because `FoldFillWithTensorReshape` cannot be moved to tensorops (it uses linalg::FillOp inside). Before it was listed as a canonicalization pattern for the reshape operations, now it became a canonicalization for FillOp. Differential Revision: https://reviews.llvm.org/D115502	2021-12-10 12:11:48 +01:00
Mehdi Amini	79a0330a52	Fix crash from use of a temporary after its scope exit Introduced in D110448 and broke some bots (reported by ASAN). Differential Revision: https://reviews.llvm.org/D110448	2021-12-10 05:04:23 +00:00
Rob Suderman	46c96fca0e	[mlir][tosa] Fix quantized type for tosa.conv2d canonicalization Wrong type was used for the result type in the tosa.conv_2d canonicalization. The type should match the result element type should match the result type not the input element type. Differential Revision: https://reviews.llvm.org/D115463	2021-12-09 12:39:23 -08:00
MaheshRavishankar	9cfd8d7c6c	[mlir][Vector] Avoid infinite loop in InnerOuterDimReductionConversion. This patterns tries to convert an inner (outer) dim reduction to an outer (inner) dim reduction. Doing this on a 1D or 0D vector results in an infinite loop since the converted op is same as the original operation. Just returning failure when source rank <= 1 fixes the issue. Differential Revision: https://reviews.llvm.org/D115426	2021-12-09 09:30:05 -08:00
Krzysztof Drewniak	e1da62910e	[MLIR][GPU] Define gpu.printf op and its lowerings - Define a gpu.printf op, which can be lowered to any GPU printf() support (which is present in CUDA, HIP, and OpenCL). This op only supports constant format strings and scalar arguments - Define the lowering of gpu.pirntf to a call to printf() (which is what is required for AMD GPUs when using OpenCL) as well as to the hostcall interface present in the AMD Open Compute device library, which is the interface present when kernels are running under HIP. - Add a "runtime" enum that allows specifying which of the possible runtimes a ROCDL kernel will be executed under or that the runtime is unknown. This enum controls how gpu.printf is lowered This change does not enable lowering for Nvidia GPUs, but such a lowering should be possible in principle. And: [MLIR][AMDGPU] Always set amdgpu-implicitarg-num-bytes=56 on kernels This is something that Clang always sets on both OpenCL and HIP kernels, and failing to include it causes mysterious crashes with printf() support. In addition, revert the max-flat-work-group-size to (1, 256) to avoid triggering bugs in the AMDGPU backend. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110448	2021-12-09 15:54:31 +00:00
Eugene Zhulenev	49ce40e9ab	[mlir] AsyncParallelFor: align block size to be a multiple of inner loops iterations Depends On D115263 By aligning block size to inner loop iterations parallel_compute_fn LLVM can later unroll and vectorize some of the inner loops with small number of trip counts. Up to 2x speedup in multiple benchmarks. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115436	2021-12-09 06:50:50 -08:00
Eugene Zhulenev	9f151b784b	[mlir] AsyncParallelFor: sink constants into the parallel compute function With complex recursive structure of async dispatch function LLVM can't always propagate constants to the parallel_compute_fn and it often prevents optimizations like loop unrolling and vectorization. We help LLVM by pushing known constants into the parallel_compute_fn explicitly. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115263	2021-12-09 06:48:23 -08:00
Matthias Springer	cc45a13422	[mlir][linalg][bufferize] LinalgOps can bufferize inplace with input args LinalgOp results usually bufferize inplace with output args. With this change, they may buffer inplace with input args if the value of the output arg is not used in the computation. Differential Revision: https://reviews.llvm.org/D115022	2021-12-09 21:54:54 +09:00
Groverkss	6f9afad6d3	[MLIR] Move Presburger Math from FlatAffineConstraints to Presburger/IntegerPolyhedron This patch factors out math functionality that is a subset of Presburger arithmetic and moves it from FlatAffineConstraints to Presburger/IntegerPolyhedron. This patch only moves some parts of the functionality planned to be moved, with subsequent patches moving more functionality. There are three main reasons for this: 1. This split makes the Presburger Library easier and more flexible to use across MLIR, by not depending on IR. 2. This split allows the Presburger library to be developed independently from Affine Analysis, with Affine Analysis using this library. 3. With more functionality being upstreamed to the Presburger Library, the mlir/Analysis directory will be cluttered with Presburger library components since they depend on math functionality from FlatAffineConstraints. Moving this functionality to the Presburger directory allows keeping the new functionality in the Presburger directory. This patch is part of an ongoing effort to make the Presburger Library easier to use. The motivation for this effort is the feedback received at the LLVM conference from Mehdi and others. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D114674	2021-12-09 16:42:06 +05:30
Michel Weber	45ea542dd8	[MLIR] Introduce coalesce for PresburgerSet This patch provides functionality for simplifying `PresburgerSet`s by checking if any `FlatAffineConstraints` in the set is contained in another, and removing such redundant FACs. This is part of a series of patches to provide functionality for [integer set coalescing](http://impact.gforge.inria.fr/impact2015/papers/impact2015-verdoolaege.pdf) in MLIR. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D110617	2021-12-09 15:46:31 +05:30
Shraiysh Vaishay	d82c1f4e4b	[MLIR][OpenMP] Added omp.atomic.update This patch supports the atomic construct (update) following section 2.17.7 of OpenMP 5.0 standard. Also added tests and verifier for the same. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D112982	2021-12-09 15:21:24 +05:30
Nicolas Vasilache	d69f5e197c	[mlir][memref] Fix subview offset verification. Offset-specific verification seems to have been lost in one of the recent refactorings. Also add proper tests that would have caught this omission. This addresses the immediate issues discussed in: https://llvm.discourse.group/t/memref-subview-affine-map-and-symbols/4851 Differential Revision: https://reviews.llvm.org/D115427	2021-12-09 07:44:51 +00:00
MaheshRavishankar	6d7c9c3d0e	[mlir][Linalg] Bufferize the region of LinalgOps as well. The region of `linalg.generic` might contain `tensor` operations. For example, current lowering of `gather` uses a `tensor.extract` in the body of the `LinalgOp`. Bufferize the ops within a `LinalgOp` region as well to catch such cases. Differential Revision: https://reviews.llvm.org/D115322	2021-12-08 22:36:01 -08:00
Rob Suderman	23149d522b	[mlir] Added ctlz and cttz to math dialect and LLVM dialect Count leading/trailing zeros are an existing LLVM intrinsic. Added LLVM support for the intrinsics with lowerings from the math dialect to LLVM dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115206	2021-12-08 14:32:15 -08:00
Butygin	d8fce785de	[mlir][spirv] math.erf OpenCL lowering Differential Revision: https://reviews.llvm.org/D115335	2021-12-08 21:59:46 +03:00
Matthias Springer	847710f7b7	[mlir][linalg][bufferize] Add dialect filter to BufferizationOptions This adds a new option `dialectFilter` to BufferizationOptions. Only ops from dialects that are allow-listed in the filter are bufferized. Other ops are left unbufferized. Note: This option requires `allowUnknownOps = true`. To make use of `dialectFilter`, BufferizationOptions or BufferizationState must be passed to various helper functions. The purpose of this change is to provide a better infrastructure for partial bufferization, which will be fully activated in a subsequent change. Differential Revision: https://reviews.llvm.org/D114691	2021-12-08 23:51:18 +09:00
Mehdi Amini	be0a7e9f27	Adjust "end namespace" comment in MLIR to match new agree'd coding style See D115115 and this mailing list discussion: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154199.html Differential Revision: https://reviews.llvm.org/D115309	2021-12-08 06:05:26 +00:00
Mehdi Amini	ee0908703d	Change the printing/parsing behavior for Attributes used in declarative assembly format The new form of printing attribute in the declarative assembly is eliding the `#dialect.mnemonic` prefix to only keep the `<....>` part. Differential Revision: https://reviews.llvm.org/D113873	2021-12-08 02:02:37 +00:00
Aart Bik	bb8632c1ef	[mlir][sparse] fix broken build rebase and commit crossed the getFunc change Reviewed By: Chia-hungDuan Differential Revision: https://reviews.llvm.org/D115270	2021-12-07 11:14:21 -08:00
Aart Bik	4f2ec7f983	[mlir][sparse] finalize sparse output in the presence of reductions This revision implements sparse outputs (from scratch) in all cases where the loops can be reordered with all but one parallel loops outer. If the inner parallel loop appears inside one or more reductions loops, then an access pattern expansion is required (aka. workspaces in TACO speak). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115091	2021-12-07 10:54:29 -08:00
Rob Suderman	e9fae0f19e	[mlir][tosa] Disable tosa.depthwise_conv2d canonicalizer for quantized case Quantized case needs to include zero-point corrections before the tosa.mul. Disabled for the quantized use-case. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D115264	2021-12-07 10:16:12 -08:00
Lei Zhang	7709b23bef	[mlir][scf] NFC: create dedicated files for affine utils These functions are generic utility functions that operates on affine ops within SCF regions. Moving them to their own files for a better code structure, instead of mixing with loop specialization logic. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115245	2021-12-07 10:55:32 -05:00
Nicolas Vasilache	61ba9f9110	[mlir][Linalg] NFC - Extend the TilingInterface to allow better composition with out-of-tree dialects. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D115233	2021-12-07 13:06:27 +00:00
Matthias Springer	958ae8b2d4	[mlir][linalg][bufferize] Bufferize Operation* instead of FuncOp This change mainly changes the API. There is no mentioning of FuncOps in ComprehensiveBufferize anymore. Also, bufferize methods of the op interface are called for ops without tensor operands/results if they have a region. Differential Revision: https://reviews.llvm.org/D115212	2021-12-07 19:53:44 +09:00
Shraiysh Vaishay	31cf42bd9a	[mlir][OpenMP] Added omp.atomic.read lowering This patch adds lowering from omp.atomic.read to LLVM IR along with the memory ordering clause. Tests for the same are also added. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115134	2021-12-07 11:17:30 +05:30
wren romano	d8731bfc93	[mlir][sparse] Requiring emitCInterface parameter to be explicit Depends On D115004 Cleans up code legibility by requiring the `emitCInterface` parameter to be explicit at all call-sites, and defining boolean aliases for that parameter. Reviewed By: aartbik, rriddle Differential Revision: https://reviews.llvm.org/D115005	2021-12-06 20:50:08 -08:00
not-jenni	5911a29aa9	[mlir][tosa] Add tosa.depthwise_conv2d as tosa.mul canonicalization For a 1x1 weight and stride of 1, the input/weight can be reshaped and multiplied elementwise then reshaped back Reviewed By: rsuderman, KoolJBlack Differential Revision: https://reviews.llvm.org/D115207	2021-12-06 17:28:52 -08:00
Matthias Springer	7ce427e3bc	[mlir][linalg][bufferize][NFC] Clean up BufferizationState Make fields private and clean up the interface. In particular, BufferizableOpInterface::bufferize no longer has access to `aliasInfo`. This was potentially dangerous because some of the ops registered in BufferizationAliasInfo may have been deleted. Differential Revision: https://reviews.llvm.org/D114931	2021-12-07 10:05:39 +09:00
Rob Suderman	05e33d846f	[mlir][tosa] Resubmit add tosa.conv2d as tosa.fully_connected canonicalization Fixed the tosa.conv2d to tosa.fully_connected canonicalization for incorrect output channels. Included uptes to tests to include checks for the result shapes during canonicalization. This allows conv2d to transform to the simpler fully_connected operation. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115170	2021-12-06 15:33:07 -08:00
wren romano	f527fdf51e	[mlir][sparse] Code cleanup for SparseTensorConversion Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D115004	2021-12-06 14:13:35 -08:00
Eugene Zhulenev	68a7c001ad	[mlir] Improve async parallel for tests + fix typos Do load and store to verify that we process each element of the iteration space once. Reviewed By: cota Differential Revision: https://reviews.llvm.org/D115152	2021-12-06 13:27:54 -08:00
Rob Suderman	c5fef77bc3	[mlir] Add CtPop to MathOps with lowering to LLVM math.ctpop maths to the llvm.ctpop intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D114998	2021-12-06 11:54:20 -08:00
Alex Zinenko	d64b3e47ba	[mlir] Avoid needlessly converting LLVM named structs with compatible elements Conversion of LLVM named structs leads to them being renamed since we cannot modify the body of the struct type once it is set. Previously, this applied to all named struct types, even if their element types were not affected by the conversion. Make this behvaior only applicable when element types are changed. This requires making the LLVM dialect type-compatibility check recursively look at the element types (arguably, it should have been doing than since the moment the LLVM dialect type system stopped being closed). In addition, have a more lax check for outer types only to avoid repeated check when necessary (e.g., parser, verifiers that are going to also look at the inner type). Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D115037	2021-12-06 13:42:11 +01:00
Matthias Springer	e761c49a14	[mlir][linalg][bufferize][NFC] Utilize isWritable for FuncOps This is a cleanup of ModuleBufferization. Instead of storing information about writable function arguments in BufferizationAliasInfo, we can use isWritable and make the decision there, based on dialect-specifc bufferization state. Differential Revision: https://reviews.llvm.org/D114930	2021-12-06 18:36:54 +09:00
Matthias Springer	e9fb4dc9e9	[mlir][linalg][bufferize] Remove buffer equivalence from bufferize Remove all function calls related to buffer equivalence from bufferize implementations. Add a new PostAnalysisStep for scf.for that ensures that yielded values are equivalent to the corresponding BBArgs. (This was previously checked in `bufferize`.) This will be relaxed in a subsequent commit. Note: This commit changes two test cases. These were broken by design and should not have passed. With the new scf.for PostAnalysisStep, this bug was fixed. Differential Revision: https://reviews.llvm.org/D114927	2021-12-06 17:48:31 +09:00
Matthias Springer	cb4d0bf997	[mlir][linalg][bufferize][NFC] Collect equivalent FuncOp BBArgs in PostAnalysisStep Collect equivalent BBArgs right after the equivalence analysis of the FuncOp and before bufferizing. This is in preparation of decoupling bufferization from aliasInfo. Also gather equivalence info for CallOps, which was missing in the previous commit. Differential Revision: https://reviews.llvm.org/D114847	2021-12-06 17:31:39 +09:00
Michal Terepeta	caf89c0db6	[mlir][Vector] Support 0-D vectors in `ConstantMaskOp` To support creating both a mask with just a single `true` and `false` values, I had to relax the restriction in the verifier that the rank is always equal to the length of the attribute array, in other words, we now allow: - `vector.constant_mask [0] : vector<i1>` which gets lowered to `arith.constant dense<false> : vector<i1>` - `vector.constant_mask [1] : vector<i1>` which gets lowered to `arith.constant dense<true> : vector<i1>` (the attribute list for the 0-D case must be a singleton containing either `0` or `1`) Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115023	2021-12-06 08:03:04 +00:00
gysit	69bcff46bf	[mlir][linalg] Pad independent of application order (NFC). This revision makes the padding pattern independent of the application order. It addresses the concern that we cannot rely on the execution order of the greedy rewriter (https://reviews.llvm.org/D114689). Instead, the pattern is updated to apply repeatedly till all operations are padded. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114851	2021-12-06 07:26:15 +00:00
Mehdi Amini	afb0582325	Fix TOSA verifier to emit verbose errors Also as a test for invalid ops which was missing.	2021-12-05 19:16:54 +00:00
Butygin	91072b74f8	[mlir] Add InlinerInterface to bufferization dialect Differential Revision: https://reviews.llvm.org/D115080	2021-12-04 23:45:56 +03:00
Chia-hung Duan	b8c6b15283	[mlir] Support collecting logs from notifyMatchFailure(). Let the user registers their own handler to processing the matching failure information. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D110896	2021-12-04 04:35:24 +00:00
Mehdi Amini	4022152b35	Use LLVM_ATTRIBUTE_UNUSED to silent warning for static function used in assert only (NFC)	2021-12-04 04:23:21 +00:00
Matthias Springer	5fa0b3561a	[mlir][linalg][bufferize] Implement equivalence analysis Instead of checking buffer equivalence during bufferization, gather buffer equivalence information right after the analysis. This is in preparation of decoupling bufferization from BufferizationAliasInfo. This change also fixes equivalence analysis for scf.if op results, which was not fully implemented. scf.if op results are equivalent to their corresponding yield values if both yield values are equivalent. Differential Revision: https://reviews.llvm.org/D114774	2021-12-04 11:52:04 +09:00
Uday Bondhugula	2108ed0671	[MLIR] Fix affine.for unroll for multi-result upper bound maps Fix affine.for unroll for multi-result upper bound maps: these can't be unrolled/unroll-and-jammed in cases where the trip count isn't known to be a multiple of the unroll factor. Fix and clean up repeated/unnecessary checks/comments at helper callees. Also, fix clang-tidy variable naming warnings and redundant includes. Differential Revision: https://reviews.llvm.org/D114662	2021-12-04 07:20:26 +05:30
Matthias Springer	9e42f2aa0b	[mlir][linalg][bufferize][NFC] Add inPlaceAnalysis overload Differential Revision: https://reviews.llvm.org/D114773	2021-12-04 10:41:57 +09:00
River Riddle	7169996159	[mlir] Allow shape dimensions larger than 2^32 Internally we use int64_t to hold shapes, but for some reason the parser was limiting shapes to unsigned. This change updates the parser to properly handle int64_t shape dimensions. Differential Revision: https://reviews.llvm.org/D115086	2021-12-04 01:29:50 +00:00
Uday Bondhugula	ecf458507e	[MLIR] Improve error message on missing getArgument() override on pass Improve error message while registering a pass with a missing getArgument() override. Differential Revision: https://reviews.llvm.org/D114744	2021-12-04 06:54:52 +05:30
Matthias Springer	6db200736c	[mlir][linalg][bufferize][NFC] Use same OpBuilder throughout bufferization Also set insertion point right before calling `bufferize`. No need to put an InsertionGuard anymore. Differential Revision: https://reviews.llvm.org/D114928	2021-12-04 09:57:26 +09:00
natashaknk	e2d8b60742	Revert "[mlir][tosa] Add tosa.conv2d as fully_connected canonicalization" This reverts commit `13bdb7ab4a`. The commit introduced/uncovered an unintended bug in models containing Conv2D. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115079	2021-12-03 14:35:48 -08:00
Matthias Springer	e359a1e548	[mlir][linalg][bufferize][NFC] Map only tensors in BufferizationState BufferizationState had map/lookup overloads for non-tensor values. This was necessary for IREE. There is now a better way to do this, so these overloads can be removed. Differential Revision: https://reviews.llvm.org/D114929	2021-12-03 23:07:09 +09:00
Matthias Springer	ed8c63115e	[mlir][linalg][bufferize][NFC] Provide default implementation of getAliasingOpOperand This simplifies op interface implementations. Differential Revision: https://reviews.llvm.org/D115025	2021-12-03 22:36:22 +09:00
Adrian Kuegel	04d083b19e	[mlir][NFC] Use const reference for loop variables.	2021-12-03 13:07:54 +01:00
Alex Zinenko	9dd1f8dfdd	[mlir] support recursive type conversion of named LLVM structs A previous commit added support for converting elemental types contained in LLVM dialect types in case they were not compatible with the LLVM dialect. It was missing support for named structs as they could be recursive, which was not supported by the conversion infra. Now that it is, add support for converting such named structs. Depends On D113579 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D113580	2021-12-03 12:41:40 +01:00
Matthias Springer	5e1c038f7d	[mlir][linalg][bufferize][NFC] Move FuncOp boundary bufferization to ModuleBufferization Differential Revision: https://reviews.llvm.org/D114670	2021-12-03 20:29:39 +09:00
Matthias Springer	ad1ba42f68	[mlir][linalg][bufferize] Allow unbufferizable ops in input Allow ops that are not bufferizable in the input IR. (Deactivated by default.) bufferization::ToMemrefOp and bufferization::ToTensorOp are generated at the bufferization boundaries. Differential Revision: https://reviews.llvm.org/D114669	2021-12-03 20:20:46 +09:00
Matthias Springer	867cd948ac	[mlir][linalg][bufferize][NFC] Move BufferizationOptions to op interface Also store a reference to BufferizationOptions in BufferizationState. This is in preparation of adding support for partial bufferization. Differential Revision: https://reviews.llvm.org/D114661	2021-12-03 19:51:34 +09:00
Michal Terepeta	1423e8bf5d	[mlir][Vector] Support 0-D vectors in `BitCastOp` The implementation only allows to bit-cast between two 0-D vectors. We could probably support casting from/to vectors like `vector<1xf32>`, but I wasn't convinced that this would be important and it would require breaking the invariant that `BitCastOp` works only on vectors with equal rank. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114854	2021-12-03 08:55:59 +00:00
Matthias Springer	d30fcadf07	[mlir][linalg][bufferize] Op interface implementation for Bufferization dialect ops This change provides `BufferizableOpInterface` implementations for ops from the Bufferization dialects. These ops are needed at the bufferization boundaries for partial bufferization. Differential Revision: https://reviews.llvm.org/D114618	2021-12-03 16:25:44 +09:00
Ulysse Beaugnon	e45705ad50	[MLIR] Use a shared uniquer for affine maps and integer sets. Affine maps and integer sets previously relied on a single lock for creating unique instances. In a multi-threaded setting, this lock becomes a contention point. This commit updates AffineMap and IntegerSet to use StorageUniquer instead. StorageUniquer internally uses sharded locks and thread-local caches to reduce contention. It is already used for affine expressions, types and attributes. On my local machine, this gives me a 5X speedup for an application that manipulates a lot of affine maps and integer sets. This commit also removes the integer set uniquer threshold. The threshold was used to avoid adding integer sets with a lot of constraints to the hash_map containing unique instances, but the constraints and the integer set were still allocated in the same allocator and never freed, thus not saving any space expect for the hash-map entry. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D114942	2021-12-02 23:49:32 +01:00
Groverkss	d257f7c1bf	[MLIR][FlatAffineConstraints] Remove duplicate divisions while merging local ids This patch implements detecting duplicate local identifiers by extracting their division representation while merging local identifiers. For example, given the FACs A, B: ``` A: (x, y)[s0] : (exists d0 = [x / 4], d1 = [y / 4]: d0 <= s0, d1 <= s0, x + y >= 2) B: (x, y)[s0] : (exists d0 = [x / 4], d1 = [y / 4]: d0 <= s0, d1 <= s0, x + y >= 5) ``` The intersection of A and B without this patch would lead to the following FAC: ``` (x, y)[s0] : (exists d0 = [x / 4], d1 = [y / 4], d2 = [x / 4], d3 = [x / 4]: d0 <= s0, d1 <= s0, d2 <= s0, d3 <= s0, x + y >= 2, x + y >= 5) ``` after this patch, merging of local ids will detect that `d0 = d2` and `d1 = d3`, and the intersection of these two FACs will be (after removing duplicate constraints): ``` (x, y)[s0] : (exists d0 = [x / 4], d1 = [y / 4] : d0 <= s0, d1 <= s0, x + y >= 2, x + y >= 5) ``` This reduces the number of constraints by 2 (constraints) + 4 (2 constraints for each extra division) for this case. This is used to reduce the output size representation of operations like PresburgerSet::subtract, PresburgerSet::intersect which require merging local variables. Reviewed By: arjunp, bondhugula Differential Revision: https://reviews.llvm.org/D112867	2021-12-03 03:44:47 +05:30
Groverkss	cff427ee20	Revert changes that should have been sent as a patch Revert changes that were meant to be sent as a single commit with summary for the differential review, but were accidently sent directly. This reverts commit `3bc5353fc6`.	2021-12-03 03:42:37 +05:30
Groverkss	c15724ab34	Address bondhugula's comments	2021-12-03 03:23:22 +05:30
Groverkss	d82a676227	Addressed comments	2021-12-03 03:23:22 +05:30
Groverkss	76ad74a4a9	Address more comments.	2021-12-03 03:23:21 +05:30
Groverkss	a8b79d116a	Addressed more comments	2021-12-03 03:23:20 +05:30
Groverkss	1e0d7fd769	Fix asserts as suggested by Arjun	2021-12-03 03:23:20 +05:30
Groverkss	19352630c0	Fix clang-format errors	2021-12-03 03:23:19 +05:30
Groverkss	b8ea299628	Update docs	2021-12-03 03:23:19 +05:30
Groverkss	8a0967481f	Address arjun's comments	2021-12-03 03:23:18 +05:30
Groverkss	c9cea1909f	Move division representation to a common function	2021-12-03 03:23:18 +05:30
Groverkss	985789ce0b	Update mergeLocalIds docs	2021-12-03 03:23:17 +05:30

1 2 3 4 5 ...

7103 Commits