llvm-project

Commit Graph

Author	SHA1	Message	Date
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle the operations, that were marked as illegal inside dynamic legality callback. Instead of reporting error, if such operation was not converted to legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handle dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Vladislav Vinogradov	9a2255dfa0	[mlir][NFC] Add explicit "::mlir" namespace to tblgen generated code Reviewed By: lattner, ftynse Differential Revision: https://reviews.llvm.org/D109223	2021-09-20 10:37:50 +03:00
xndcn	9de88fc0ea	[mlir][emitc] Fix indent in CondBranchOp and block label 1. Add missing indent in CondBranchOp 2. Remove indent in block label Differential Revision: https://reviews.llvm.org/D109805	2021-09-19 20:03:42 +08:00
Arjun P	33afea5488	[MLIR] Simplex: rename num{Variables,Constraints} to getNum{Variables,Constraints} As per the LLVM Coding Standards, function names should be verb phrases.	2021-09-18 22:39:35 +05:30
Arjun P	2b44a7325c	[MLIR] Simplex: support adding new variables dynamically Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D109962	2021-09-18 21:32:17 +05:30
Jacques Pienaar	0a1e569d37	[mlir-c] Add getting fused loc For creating a fused loc using array of locations and metadata. Differential Revision: https://reviews.llvm.org/D110022	2021-09-18 06:57:51 -07:00
Uday Bondhugula	57eda9becc	[MLIR][GPU] Add constant propagator for gpu.launch op Add a constant propagator for gpu.launch op in cases where the grid/thread IDs can be trivially determined to take a single constant value of zero. Differential Revision: https://reviews.llvm.org/D109994	2021-09-18 12:02:46 +05:30
Geoffrey Martin-Noble	2cda4f8ed7	[mlir] Fix syntax example for tensor.from_elements Parens are not used here	2021-09-17 17:23:11 -07:00
Aart Bik	46e77b5d10	[mlir][sparse] add a sparse quantized_matmul example to integration test Note that this revision adds a very tiny bit of constant folding in the sparse compiler lattice construction. Although I am generally trying to avoid such canonicalizations (and rely on other passes to fix this instead), the benefits of avoiding a very expensive disjunction lattice construction justify having this special code (at least for now). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109939	2021-09-17 13:04:44 -07:00
Aart Bik	d4e16171e8	[mlir][sparse] add dce test for all sparse tensor ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109992	2021-09-17 13:03:42 -07:00
Krzysztof Drewniak	121aab84d1	[MLIR][Affine] Simplify nested modulo operations when able It is the case that, for all positive a and b such that b divides a (e mod (a * b)) mod b = e mod b. For example, ((d0 mod 35) mod 5) can be simplified to (d0 mod 5), but ((d0 mod 35) mod 4) cannot be simplified further (x = 36 is a counterexample). This change enables more complex simplifications. For example, ((d0 * 72 + d1) mod 144) mod 9 can now simplify to (d0 * 72 + d1) mod 9 and thus to d1 mod 9. Expressions with chained modulus operators are reasonably common in tensor applications, and this change _should_ improve code generation for such expressions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109930	2021-09-17 19:06:00 +00:00
thomasraoux	08f0cb7719	[mlir] Prevent crash in DropUnitDim pattern due to tensor with encoding Differential Revision: https://reviews.llvm.org/D109984	2021-09-17 12:03:16 -07:00
thomasraoux	36aac53b36	[mlir][linalg] Extend drop unit dim pattern to all cases of reduction Even with all parallel loops reading the output value is still allowed so we don't have to handle reduction loops differently. Differential Revision: https://reviews.llvm.org/D109851	2021-09-17 10:09:57 -07:00
thomasraoux	416679615d	[mlir] Linalg hoisting should ignore uses outside the loop Differential Revision: https://reviews.llvm.org/D109859	2021-09-17 10:06:57 -07:00
thomasraoux	a123e3c48b	[mlir] Fix potential crash in hoistRedundantVectorTransfers Differential Revision: https://reviews.llvm.org/D107856	2021-09-17 10:05:20 -07:00
Tobias Gysi	90b7817e03	[mlir][linalg] Add helper to update IndexOps after tiling (NFC). Add the addTileLoopIvsToIndexOpResults method to shift the IndexOp results after tiling. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109761	2021-09-17 15:17:33 +00:00
Arjun P	58719f6153	[MLIR] PresbugerSet: slightly expand documentation	2021-09-17 18:04:46 +05:30
Arjun P	44db07f11f	[MLIR] AffineStructures: support removing a range of constraints at once Reviewed By: Groverkss, grosser Differential Revision: https://reviews.llvm.org/D109892	2021-09-17 16:27:48 +05:30
Arjun P	6607bd9fd8	[MLIR] AffineStructures::removeIdRange: support specifying a range within an IdKind Reviewed By: Groverkss, grosser Differential Revision: https://reviews.llvm.org/D109896	2021-09-17 16:25:26 +05:30
Arjun P	f263ea1571	[MLIR] Matrix: support resizing horizontally Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D109897	2021-09-17 16:22:31 +05:30
MaheshRavishankar	04a66f8d2b	Fixing vector add pattern that incorrectly returns success. The pattern is returning success even if it does no work leading to pattern application running up to the max iteration count and failing. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D109791	2021-09-16 14:48:09 -07:00
Aart Bik	233b42a8bb	[mlir][sparse] remove unused TENSOR environment Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109919	2021-09-16 14:32:09 -07:00
Rob Suderman	8662a2f208	[mlir][tosa] Relax ranked constraint on quantization builder TosaOp defintion had an artificial constraint that the input/output types needed to be ranked to invoke the quantization builder. This is correct as an unranked tensor could still be quantized. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D109863	2021-09-16 11:43:47 -07:00
Aart Bik	860cbeb159	[mlir][sparse] add more asserts to sparse support lib We are having issues running the integration test of the sparse compiler on AArch64 (crashing in the lib). This revision adds more assertions. Reviewed By: jsetoain Differential Revision: https://reviews.llvm.org/D109861	2021-09-16 10:13:29 -07:00
Nicolas Vasilache	ee2e414dde	[mlir][Linalg] Cleanup doc and improve logging and readability in ComprehensiveBufferize.cpp - NFC	2021-09-16 16:41:47 +00:00
Tobias Gysi	8f2db36b01	[mlir][OpDSL] Update op definitions to make shapes more concise (NFC). Express the input shape definitions of convolution and pooling operations in terms of the output shapes, filter shapes, strides, and dilations. Reviewed By: shabalin, rsuderman, stellaraccident Differential Revision: https://reviews.llvm.org/D109815	2021-09-16 06:02:00 +00:00
Aart Bik	b1d44e5902	[mlir][sparse] add affine subscripts to sparse compilation pass This enables the sparsification of more kernels, such as convolutions where there is a x(i+j) subscript. It also enables more tensor invariants such as x(1) or other affine subscripts such as x(i+1). Currently, we reject sparsity altogether for such tensors. Despite this restriction, however, we can already handle a lot more kernels with compound subscripts for dense access (viz. convolution with dense input and sparse filter). Some unit tests and an integration test demonstrate new capability. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109783	2021-09-15 20:28:04 -07:00
Mogball	cb8c30d35d	[DRR] Explicit Return Types in Rewrites Adds a new rewrite directive returnType that can be added at the end of an op's argument list to explicitly specify return types. ``` (OpX $v0, $v1, (returnType "$_builder.getI32Type()")) ``` Pass in a bound value to copy its return type, or pass a native code call to dynamically create new types. ``` (OpX $v0, $v1, (returnType $v0, (NativeCodeCall<"..."> $v1))) ``` Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D109472	2021-09-15 14:25:29 -07:00
Rob Suderman	1ac2d195ec	[mlir][linalg] Add canonicalizers for depthwise conv There are two main versions of depthwise conv depending whether the multiplier is 1 or not. In cases where m == 1 we should use the version without the multiplier channel as it can perform greater optimization. Add lowering for the quantized/float versions to have a multiplier of one. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D108959	2021-09-15 14:09:15 -07:00
Simon Camphausen	1b79efdc72	[mlir] Fix printing of EmitC attrs/types with escape characters Attributes and types were not escaped when printing. Reviewed By: jpienaar, marbre Differential Revision: https://reviews.llvm.org/D109143	2021-09-15 18:15:38 +00:00
Nicolas Vasilache	96ec0ff2b7	[mlir][Linalg] Revisit insertion points in comprehensive bufferization. This revision fixes a corner case that could appear due to incorrect insertion point behavior in comprehensive bufferization. Differential Revision: https://reviews.llvm.org/D109830	2021-09-15 18:11:38 +00:00
Mehdi Amini	13237c3b1e	Add llvm_unreachable after fully covered switch (NFC) This fixes a compiler warning for some version of GCC.	2021-09-15 17:53:05 +00:00
Uday Bondhugula	f68939d3d9	[MLIR] Tighten type constraint on memref.global op def Tighten the def of memref.global op to use the right kind of TypeAttr (of MemRefType). Differential Revision: https://reviews.llvm.org/D109822	2021-09-15 22:41:03 +05:30
Nicolas Vasilache	6fe77b1051	[mlir][Linalg] Fail comprehensive bufferization if a memref is returned. Summary: Reviewers: Subscribers: Differential revision: https://reviews.llvm.org/D109824	2021-09-15 15:11:17 +00:00
Nicolas Vasilache	660f281b5e	[mlir][Linalg] Make codegen strategy late transformations opt-in Summary: Making the late transformations opt-in results in less surprising behavior when composing multiple calls to the codegen strategy. Reviewers: Subscribers: Differential revision: https://reviews.llvm.org/D109820	2021-09-15 11:02:14 +00:00
Nicolas Vasilache	e3889b3059	[mlir][Linalg] Replace DenseSet by UnionFind in ComprehensiveBufferize - NFC AliasInfo can now use union-find for a much more efficient implementation. This brings no functional changes but large performance gains on more complex examples. Differential Revision: https://reviews.llvm.org/D109819	2021-09-15 10:35:54 +00:00
Matthias Springer	934e2f695e	[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results E.g.: ``` %2 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32> %3 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32> // ... (%3 is not written to) linalg.copy(%3, %2) : memref<256x256xf32>, memref<256x256xf32> vector.transfer_write %11, %2[%c0, %c0] {in_bounds = [true, true]} : vector<256x256xf32>, memref<256x256xf32> ``` Avoid copies of %3 if %3 came directly from an InitTensorOp. Differential Revision: https://reviews.llvm.org/D109742	2021-09-15 17:28:04 +09:00
Alex Zinenko	b10940edfc	[mlir] Update docs on conversion and translation to LLVM Create a new document that explain both stages of the process in a single place, merge and deduplicate the content from the two previous documents. Also extend the documentation to account for the recent changes in pass structure due to standard dialect splitting and translation being more flexible. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D109605	2021-09-15 09:50:21 +02:00
Tobias Gysi	a543abc5ea	[mlir][linalg] Update OpDSL doc (NFC). Update the doc due to recent path changes an point to a helper script.	2021-09-15 07:38:15 +00:00
Mehdi Amini	a32300a68f	Make the --mlir-disable-threading command line option overrides the C++ API usage This seems in-line with the intent and how we build tools around it. Update the description for the flag accordingly. Also use an injected thread pool in MLIROptMain, now we will create threads up-front and reuse them across split buffers. Differential Revision: https://reviews.llvm.org/D109802	2021-09-15 03:20:48 +00:00
cwz920716	500d4c45ba	[MLIR] Use memref.copy ops in BufferResultsToOutParams pass. Both copy/alloc ops are using memref dialect after this change. Reviewed By: silvas, mehdi_amini Differential Revision: https://reviews.llvm.org/D109480	2021-09-15 02:59:30 +00:00
Matthias Springer	9adc0114bf	[mlir][linalg] PadTensorOp vectorization: Avoid redundant FillOps Do not generate FillOps when these would be entirely overwritten. Differential Revision: https://reviews.llvm.org/D109741	2021-09-15 09:28:37 +09:00
Mehdi Amini	1a406cd5f2	Remove unused llvm/Support/Parallel.h from MLIR (NFC) This header aren't needed anymore: MLIR is using a thread pool injected in the context instead of a global one.	2021-09-14 23:30:42 +00:00
Sean Silva	8dca953dd3	[mlir] Apply py::module_local() to a few more classes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109776	2021-09-14 21:56:14 +00:00
Tobias Gysi	6091873651	[mli][linalg] Reuse getValueOrCreateConstantIndexOp method (NFC). Use getValueOrCreateConstantIndexOp introduced by https://reviews.llvm.org/D109601 in multiple places in LinalgOps.cpp. Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D109756	2021-09-14 15:32:29 +00:00
Tobias Gysi	44a889778c	[mlir][linalg] Fold ExtractSliceOps during tiling. Add the makeComposedExtractSliceOp method that creates an ExtractSliceOp and folds chains of ExtractSliceOps by computing the sum of their offsets and by multiplying their strides. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109601	2021-09-14 11:43:52 +00:00
Uday Bondhugula	a91cfd1990	[MLIR] Improve op parse error message for AtLeastNOperands trait Improve parse error message for "at least N operands" op trait. Differential Revision: https://reviews.llvm.org/D109747	2021-09-14 15:01:51 +05:30
Matthias Springer	62883459cd	[mlir][linalg] makeTiledShape: No affine.min if tile size == 1 This improves codegen (more static type information) with `scalarize-dynamic-dims`. Differential Revision: https://reviews.llvm.org/D109415	2021-09-14 10:48:20 +09:00
Matthias Springer	fb1def9c66	[mlir][linalg] New tiling option: Scalarize dynamic dims This tiling option scalarizes all dynamic dimensions, i.e., it tiles all dynamic dimensions by 1. This option is useful for linalg ops with partly dynamic tensor dimensions. E.g., such ops can appear in the partial iteration after loop peeling. After scalarizing dynamic dims, those ops can be vectorized. Differential Revision: https://reviews.llvm.org/D109268	2021-09-14 10:40:50 +09:00
Matthias Springer	8faf35c0a5	[mlir][linalg] Add scf.for loop peeling to codegen strategy Only scf.for loops are supported at the moment. linalg.tiled_loop support will be added in a subsequent commit. Only static tensor sizes are supported. Loops for dynamic tensor sizes can be peeled, but the generated code is not optimal due to a missing canonicalization pattern. Differential Revision: https://reviews.llvm.org/D109043	2021-09-14 10:35:01 +09:00
Matthias Springer	a4a654d301	[mlir][linalg] TiledLoopOp peeling: Do not peel partial iterations Extend the unit test with an option for skipping partial iterations during loop peeling. Differential Revision: https://reviews.llvm.org/D109640	2021-09-14 10:01:46 +09:00
Jian Cai	ce6d512015	[mlir][doc] fix typos. Also wrap some function/class names in backticks. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109723	2021-09-13 14:48:58 -07:00
Benoit Jacob	340314c4dc	Reorder mmt4d shapes: * Revert https://reviews.llvm.org/D107307 so that both LHS and RHS have the same layout with K0 as the innermost dimension. * Continuing from https://reviews.llvm.org/D107003, move also 'K' to the outer side, so that now the inter-tile dimensions as all outer, and the intra-tile dimensions are all inner. Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D109692	2021-09-13 12:09:22 -07:00
Nicolas Vasilache	181d18ef53	[mlir][Linalg] Insert static buffers as high as possible during ComprehensiveBufferization. This revision allows hoisting static alloc/dealloc pairs as high as possible during ComprehensiveBufferization. This also aligns such allocated buffers to 128B by default. This change exhibited some issues wrt insertion points and a missing copy that are also fixed in this revision; tests are updated accordingly. Differential Revision: https://reviews.llvm.org/D109684	2021-09-13 15:59:03 +00:00
Simon Camphausen	ec92f788f3	[mlir][emitc] Print signed integers properly Previously negative integers were printed as large unsigned values. Reviewed By: marbre Differential Revision: https://reviews.llvm.org/D109690	2021-09-13 15:29:30 +00:00
Jonas Paulsson	5f781ddffc	[MLIR] Mark test case XFAIL on SystemZ for now. mlir-cpu-runner/math_polynomial_approx.mlir This test case is currently failing on SystemZ, but it does not appear to necessarily be a target specific problem. See discussion at https://bugs.llvm.org/show_bug.cgi?id=51204.	2021-09-13 16:48:31 +02:00
Matthias Springer	7c9b6a3355	[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOps Do not copy InitTensorOps or casts thereof. Differential Revision: https://reviews.llvm.org/D109656	2021-09-13 22:31:54 +09:00
Nicolas Vasilache	b01d223faf	[mlir][Linalg] Use reify for padded op shape derivation. Previously, we would insert a DimOp and rely on later canonicalizations. Unfortunately, reifyShape kind of rewrites are not canonicalizations anymore. This introduces undesirable pass dependencies. Instead, immediately reify the result shape and avoid the DimOp altogether. This is akin to a local folding, which avoids introducing more reliance on `-resolve-shaped-type-result-dims` (similar to compositions of `affine.apply` by construction to avoid chains of size > 1). It does not completely get rid of the reliance on the pass as the process is merely local: calling the pass may still be necessary for global effects. Indeed, one of the tests still requires the pass. Differential Revision: https://reviews.llvm.org/D109571	2021-09-13 11:54:59 +00:00
Valentin Clement	57bf856011	[mlir] Add missing namespace to createInlinerPass One of the createInlinerPass does not have the mlir:: namespace Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109580	2021-09-13 11:58:27 +02:00
Mathieu Fehr	802bf02a73	[mlir] Allows to query traits from types and attributes Types and attributes now have a `hasTrait` function that allow users to check if a type defines a trait. Also, AbstractType and AbstractAttribute has now a `hasTraitFn` field to carry the implementation of the `hasTrait` function of the concrete type or attribute. This patch also adds the remaining functions to access type and attribute traits in TableGen. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D105202	2021-09-13 06:26:45 +00:00
Mehdi Amini	7fb2394a4f	Add sanity check in MLIR ODS to catch case where an arguments/results/regions/successors names overlap This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109474	2021-09-13 06:21:25 +00:00
Kiran Chandramohan	187d9f8cd9	[OpenMP][MLIR] Add a conversion pattern for the master op The conversion pattern is particularly useful for conversion of block arguments in the master op. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109610	2021-09-12 10:13:40 +00:00
Rob Suderman	b0532286fe	[mlir][tosa] Add shape inference for tosa.while Tosa.while shape inference requires repeatedly running shape inference across the body of the loop until the types become static as we do not know the number of iterations required by the loop body. Once the least specific arguments are known they are propagated to both regions. To determine the final end type, the least restrictive types are determined from all yields. Differential Revision: https://reviews.llvm.org/D108801	2021-09-10 13:11:53 -07:00
Alex Zinenko	61bc6aa5a7	[mlir] spelling and style changes in ReconcileUnrealizedCasts.cpp. NFC.	2021-09-10 14:09:29 +02:00
Stephan Herhut	5e6c170b3f	[mlir][linalg] Fix bufferize pattern to allow unknown operations in body of generic The original version of the bufferization pattern for linalg.generic would manually clone operations within the region to the bufferized clone of the operation. This triggers legality requirements on those operations in the conversion infra. Instead, this now uses the rewriter to inline the region instead, avoiding those legality requirements. Differential Revision: https://reviews.llvm.org/D109581	2021-09-10 13:37:42 +02:00
Matthias Springer	0f3544d185	[mlir][scf] Loop peeling: Use scf.for for partial iteration Generate an scf.for instead of an scf.if for the partial iteration. This is for consistency reasons: The peeling of linalg.tiled_loop also uses another loop for the partial iteration. Note: Canonicalizations patterns may rewrite partial iterations to scf.if afterwards. Differential Revision: https://reviews.llvm.org/D109568	2021-09-10 19:07:09 +09:00
Tobias Gysi	16488dc300	[mlir][linalg] Pass all operands to tile to the tile loop region builder (NFC). Extend the signature of the tile loop nest region builder to take all operand values to use and not just the scf::For iterArgs. This change allows us to pass in all block arguments of TiledLoop and use them directly instead of replacing them after the loop generation. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D109569	2021-09-10 08:35:11 +00:00
Nicolas Vasilache	5f1a1af4bf	[mlir][Linalg] Properly order extract_slice traversal in comprehensive bufferization This revision fixes the traversal order of extract_slice during the inplace analysis. It was previously thought that such ops could be analyzed at the very end. This is unfortunately not true as the AliasInfo for dependents of these ops need to be updated. This change allows the aliases introduced by the bufferization of extract_slice to be properly propagated. Differential Revision: https://reviews.llvm.org/D109519	2021-09-10 07:10:06 +00:00
Marius Brehler	6593cd3fe9	[mlir] Replace `include_directories` Switches to adding target specific, private includes instead of adding global includes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109494	2021-09-10 07:06:27 +00:00
natashaknk	d4d50e4710	[mlir][tosa] Add lowering for tosa.clz using scf::whileOp Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D109540	2021-09-09 15:57:35 -07:00
Aart Bik	066d786ce0	[mlir][sparse] add folding to sparse_tensor.convert folds conversion between identical types (with tests) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109545	2021-09-09 15:45:19 -07:00
thomasraoux	2a69790bad	[mlir][sparse] Mark convert op as noSideEffect Differential Revision: https://reviews.llvm.org/D109543	2021-09-09 14:39:09 -07:00
Alexander Slepko	89837a0e1b	Adding min(f/s/u) and max(f/s/u) cases for vector reduction This PR adds missing AtomicRMWKind::min/max cases which we would like to use for min/max reduction loop vectorizations. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104881	2021-09-09 12:00:43 -07:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Aart Bik	c34f3780a7	[mlir][sparse] fix broken test new flag requirements crossed the checkin of this new test Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109524	2021-09-09 09:40:00 -07:00
Aart Bik	e2d3db42e5	[mlir][sparse] add casts to operations to lattice and exp builders Further enhance the set of operations that can be handled by the sparse compiler Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109413	2021-09-09 08:49:50 -07:00
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass must have been also run the last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Uday Bondhugula	524eafa5b2	[MLIR] Avoid double space print on llvm global op Fix extra space print for llvm global op when the 'unamed_addr' attribute was empty. This led to two spaces being printed in the custom form between non-whitespace chars. A round trip would add an extra space to a typical spaced form. NFC. Differential Revision: https://reviews.llvm.org/D109502	2021-09-09 19:52:38 +05:30
Alex Zinenko	1ce752b741	[mlir] support reductions in SCF to OpenMP conversion OpenMP reductions need a neutral element, so we match some known reduction kinds (integer add/mul/or/and/xor, float add/mul, integer and float min/max) to define the neutral element and the atomic version when possible to express using atomicrmw (everything except float mul). The SCF-to-OpenMP pass becomes a module pass because it now needs to introduce new symbols for reduction declarations in the module. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D107549	2021-09-09 13:04:27 +02:00
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Mehdi Amini	4eaaf05394	Add sanity check in MLIR ODS to catch case where two results have the same name This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109456	2021-09-08 23:38:50 +00:00
Chris Lattner	40a89da65c	[Canonicalize] Don't call isBeforeInBlock in OperationFolder::tryToFold. This patch (`e4635e6328`) fixed a bug where a newly generated/reused constant wouldn't dominate a folded operation. It did so by calling isBeforeInBlock to move the constant around on demand. This introduced a significant compile time regression, because "isBeforeInBlock" is O(n) in the size of a block the first time it is called, and the cache is invalidated any time canonicalize changes something big in the block. This fixes LLVM PR51738 and this CIRCT issue: https://github.com/llvm/circt/issues/1700 This does affect the order of constants left in the top of a block, I staged in the testsuite changes in rG42431b8207a5. Differential Revision: https://reviews.llvm.org/D109454	2021-09-08 13:33:22 -07:00
Chris Lattner	42431b8207	[tests] Make testsuite more resilient to "order of constant" changes. NFC.	2021-09-08 10:10:10 -07:00
Mehdi Amini	6f1f30a957	Add sanity check in MLIR ODS to catch case where two operands have the same name This is making a tablegen crash into a more friendly error. Differential Revision: https://reviews.llvm.org/D109449	2021-09-08 16:58:57 +00:00
Kunwar Shaanjeet Singh Grover	dea76ccaf4	[MLIR] FlatAffineConstraints: Refactored computation of explicit representation for identifiers This patch refactors the existing implementation of computing an explicit representation of an identifier as a floordiv in terms of other identifiers and exposes this computation as a public function. The computation of this representation is required to support local identifiers in PresburgerSet subtract, complement and isEqual. Reviewed By: bondhugula, arjunp Differential Revision: https://reviews.llvm.org/D106662	2021-09-08 20:24:46 +05:30
Arnab Dutta	1524b01541	[MLIR] Add loop coalesce utility for affine.for Add loop coalesce utility for affine.for. This expects loops to have been normalized a-priori. This works for both constant as well non constant upper bounds having single/multiple result upper bound affine map. With contributions from Arnab Dutta and Uday Bondhugula. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D108126	2021-09-08 18:02:23 +05:30
Aart Bik	d02e12fadf	[mlir][sparse] fix typos Perhaps one of these days I will actually learn how to spell opaque.... Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109391	2021-09-07 14:20:05 -07:00
Mehdi Amini	ee903a207b	Improve error message when creating an op that isn't registered in the context This prints a more helpful error for folks who aren't intrinsically familiar with the system. Differential Revision: https://reviews.llvm.org/D109378	2021-09-07 20:42:30 +00:00
Geoffrey Martin-Noble	6da594596b	[MLIR][docs] Clarify language in pass restrictions Right now all but the last bullet are relying on applied "must not" that isn't there and the last bullet is a "must". Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109389	2021-09-07 13:40:55 -07:00
Alex Zinenko	b841ae55e5	[mlir] Fix SplatOp lowering to the LLVM dialect The lowering has been incorrectly using the operands of the original op instead of rewritten operands provided to matchAndRewrite call. This may lead to spurious materializations and generally invalid IR. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D109355	2021-09-07 19:14:28 +02:00
Alex Zinenko	821262eef2	[mlir] Fix GPU LaunchFunc conversion to the LLVM dialect The conversion has been incorrectly using the operands of the original operation instead of the converted operands provided to the matchAndRewrite call. This may lead to spurious materializations and generally invalid IR if the producer of the original operands is deleted in the process of conversion. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D109356	2021-09-07 16:50:11 +02:00
Matthias Springer	c57c4f888c	[mlir][linalg] linalg.tiled_loop peeling Differential Revision: https://reviews.llvm.org/D108270	2021-09-07 09:50:08 +09:00
Alexander Belyaev	58c188507f	[mlir][linalg] Fix `FoldInitTensorWithDimOp` if dim(init_tensor) is static. It looks like it was a typo. Instead of `maybeConstantIndex`, `initTensorOp.getStaticSize(maybeConstantIndex)` should be used to access the dim size of the tensor. There is a test for that in `canonicalize.mlir`, but it was working correctly because `ReplaceStaticShapeDims` was canonicalizing DimOp before `FoldInitTensorWithDimOp`. So, to make the patterns more "orthogonal", this case is disabled. Differential Revision: https://reviews.llvm.org/D109247	2021-09-06 10:47:26 +02:00
Marius Brehler	779368bd9f	[mlir][docs] Complement list of supported scf ops	2021-09-06 05:51:36 +00:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Loren Maggiore	361458b1ce	[mlir] create gpu memset op Create a gpu memset op and corresponding CUDA and ROCm wrappers. Reviewed By: herhut, lorenrose1013 Differential Revision: https://reviews.llvm.org/D107548	2021-09-04 08:13:04 +02:00
William S. Moses	21d43daf8f	[MLIR] Primitive linkage lowering of FuncOp FuncOp always lowers to an LLVM external linkage presently. This makes it impossible to define functions in mlir which are local to the current module. Until MLIR FuncOps have a more formal linkage specification, this commit allows funcop's to have an optionally specified llvm.linkage attribute, whose value will be used as the linkage of the llvm funcop when lowered. Differential Revision: https://reviews.llvm.org/D108524 Support LLVM linkage	2021-09-03 20:41:39 -04:00
Mehdi Amini	78accf9f35	Make LLVM Linkage a first class attribute instead of using an integer attribute This makes the IR more readable, in particular when this will be used on the builtin func outside of the LLVM dialect. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D109209	2021-09-03 21:21:46 +00:00
Aart Bik	eee1f1c8fb	[mlir][sparse] add convenience method for sparse tensor setup This simplifies setting up sparse tensors through C-style data structures. Useful for runtimes that want to interact with MLIR-generated code without knowning about all bufferization details (viz. memrefs). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109251	2021-09-03 13:35:59 -07:00
Alexander Belyaev	5ee5bbd0ff	[mlir][linalg] Extend tiled_loop to SCF conversion to generate scf.parallel. Differential Revision: https://reviews.llvm.org/D109230	2021-09-03 18:05:54 +02:00
Aart Bik	b6d1a31c1b	[mlir][sparse] refine heuristic for iteration graph topsort The sparse index order must always be satisfied, but this may give a choice in topsorts for several cases. We broke ties in favor of any dense index order, since this gives good locality. However, breaking ties in favor of pushing unrelated indices into sparse iteration spaces gives better asymptotic complexity. This revision improves the heuristic. Note that in the long run, we are really interested in using ML for ML to find the best loop ordering as a replacement for such heuristics. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109100	2021-09-03 08:37:15 -07:00
Marius Brehler	36895cd8d8	[mlir] Update EmitC documentation	2021-09-03 15:23:55 +00:00
Jean Perier	49af2a6275	[mlir][flang] Do not prevent integer types from being parsed as MLIR keywords DialectAsmParser::parseKeyword is rejecting `'i' digit+` while it is a valid identifier according to mlir/docs/LangRef.md. Integer types actually used to be TOK_KEYWORD a while back before the change: `6af866c58d`. This patch Modifies `isCurrentTokenAKeyword` to return true for tokens that match integer types too. The motivation for this change is the parsing of `!fir.type<{` `component-name: component-type,`+ `}>` type in FIR that represent Fortran derived types. The component-names are parsed as keywords, and can very well be i32 or any ixxx (which are valid Fortran derived type component names). The Quant dialect type parser had to be modified since it relied on `iw` not being parsed as keywords. Differential Revision: https://reviews.llvm.org/D108913	2021-09-03 08:20:49 +02:00
Matthias Springer	4fa6c2734c	[mlir][scf] Allow runtime type of iter_args to change The limitation on iter_args introduced with D108806 is too restricting. Changes of the runtime type should be allowed. Extends the dim op canonicalization with a simple analysis to determine when it is safe to canonicalize. Differential Revision: https://reviews.llvm.org/D109125	2021-09-03 10:03:05 +09:00
Stella Laurenzo	cb7b03819a	[mlir][python] Simplify python extension loading. * Now that packaging has stabilized, removes old mechanisms for loading extensions, preferring direct importing. * Removes _cext_loader.py, _dlloader.py as unnecessary. * Fixes the path where the CAPI dll is written on Windows. This enables that path of least resistance loading behavior to work with no further drama (see: https://bugs.python.org/issue36085). * With this patch, `ninja check-mlir` on Windows with Python bindings works for me, modulo some failures that are actually due to a couple of pre-existing Windows bugs. I think this is the first time the Windows Python bindings have worked upstream. * Downstream changes needed: * If downstreams are using the now removed `load_extension`, `reexport_cext`, etc, then those should be replaced with normal import statements as done in this patch. Reviewed By: jdd, aartbik Differential Revision: https://reviews.llvm.org/D108489	2021-09-03 00:43:28 +00:00
Alex Zinenko	f9be7a7afd	[mlir] speed up construction of LLVM IR constants when possible The translation to LLVM IR used to construct sequential constants by recurring down to individual elements, creating constant values for them, and wrapping them into aggregate constants in post-order. This is highly inefficient for large constants with known data such as DenseElementsAttr. Use LLVM's ConstantData for the innermost dimension instead. LLVM does seem to support data constants for nested sequential constants so the outer dimensions are still handled recursively. Nevertheless, this speeds up the translation of large constants with equal dimensions by up to 30x. Users are advised to rewrite large constants to use flat types before translating to LLVM IR if more efficiency in translation is necessary. This is not done automatically as the translation is not aware of the expectations of the overall compilation flow about type changes and indexing, in particular for global constants with external linkage. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109152	2021-09-02 23:07:30 +02:00
Marius Brehler	f6063fedb4	[mlir] Add missing dep on MLIRTranslation	2021-09-02 16:54:46 +00:00
Kiran Chandramohan	711aa35759	[MLIR][OpenMP] Add support for declaring critical construct names Add an operation omp.critical.declare to declare names/symbols of critical sections. Named omp.critical operations should use symbols declared by omp.critical.declare. Having a declare operation ensures that the names of critical sections are global and unique. In the lowering flow to LLVM IR, the OpenMP IRBuilder creates unique names for critical sections. Reviewed By: ftynse, jeanPerier Differential Revision: https://reviews.llvm.org/D108713	2021-09-02 14:31:19 +00:00
Marius Brehler	2f0750dd2e	[mlir] Add Cpp emitter This upstreams the Cpp emitter, initially presented with [1], from [2] to MLIR core. Together with the previously upstreamed EmitC dialect [3], the target allows to translate MLIR to C/C++. [1] https://reviews.llvm.org/D76571 [2] https://github.com/iml130/mlir-emitc [3] https://reviews.llvm.org/D103969 Co-authored-by: Jacques Pienaar <jpienaar@google.com> Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Co-authored-by: Oliver Scherf <oliver.scherf@iml.fraunhofer.de> Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D104632	2021-09-02 13:51:05 +00:00
Alex Zinenko	8647e4c3a0	[mlir] support translating OpenMP loops with reductions Use the recently introduced OpenMPIRBuilder facility to transate OpenMP workshare loops with reductions to LLVM IR calling OpenMP runtime. Most of the heavy lifting is done at the OpenMPIRBuilder. When other OpenMP dialect constructs grow support for reductions, the translation can be updated to operate on, e.g., an operation interface for all reduction containers instead of workshare loops specifically. Designing such a generic translation for the single operation that currently supports reductions is premature since we don't know how the reduction modeling itself will be generalized. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D107343	2021-09-02 15:38:20 +02:00
Alexander Belyaev	f68de11c10	[mlir][linalg] Expose function to create op on buffers during bufferization. Differential Revision: https://reviews.llvm.org/D109140	2021-09-02 11:09:05 +02:00
Aart Bik	2754604e54	[mlir][sparse] sparse runtime support library improvements (1) renamed SparseTensor to SparseTensorCOO, the other one remains SparseTensorStorage to focus on contrast (2) documents difference between public API exclusively for compiler-generated code and methods that could be used by other runtimes (TBD) that want to interact with MLIR Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109039	2021-09-01 16:51:14 -07:00
Jacques Pienaar	f7bf8a8658	[mlir][capi] Add NameLoc Add method to get NameLoc. Treat null child location as unknown to avoid needing to create UnknownLoc in C API where child loc is not needed. Differential Revision: https://reviews.llvm.org/D108678	2021-09-01 16:16:35 -07:00
Weiwei Li	a79d7c2c85	[mlir][SPIRV] Add Image Operands for Image Instructions This patch is to add Image Operands in SPIR-V Dialect and also let ImageDrefGather to use Image Operands. Image Operands are used in many image instructions. "Image Operands encodes what oprands follow, as per Image Operands". And ususally, they are optional to image instructions. The format of image operands looks like: %0 = spv.ImageXXXX %1, ... %3 : f32 ["Bias\|Lod"](%4, %5 : f32, f32) -> ... This patch doesn’t implement all operands (see Section 3.14 in SPIR-V Spec) but provides a skeleton of it. There is TODO in verifyImageOperands function. Co-authored: Alan Liu <alanliu.yf@gmail.com> Reviewed by: antiagainst Differential Revision: https://reviews.llvm.org/D108501	2021-09-02 04:14:17 +08:00
Mehdi Amini	43a894365e	Remove deprecated registration APIs (NFC) In D104421, we changed the API for pass registration. Before you would write: void registerPass("my-pass", "My Pass Description.", [] { return createMyPass(); }); while now you’d only write: void registerPass([] { return createMyPass(); }); If you’re using TableGen to define your pass registration, you shouldn’t have anything to do. If you’re using directly the C++ API here are some changes. Your project may also be broken even if you use TableGen and you call the generated registration API in case your pass implementation didn’t inherit from the MyPassBase class generated by TableGen. If you don't use TableGen, the "my-pass" and "My Pass Description." fields must be provided by overriding methods on the pass itself: llvm::StringRef getArgument() const final { return "my-pass"; } llvm::StringRef getDescription() const final { return "My Pass Description."; } Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104429	2021-09-01 18:53:30 +00:00
natashaknk	f596acc74d	[mlir][tosa] Small refactor to the functionality of Depthwise_Conv2D to add the bias at the end of the convolution Follow-up to the Conv2d and fully_connected lowering adjustments Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108949	2021-09-01 10:01:00 -07:00
Tyler Augustine	7105512a34	Support alias.scope and noalias metadata lowering on intrinsics. Builds on https://reviews.llvm.org/D107870 to support annotating intrinsics with alias.scope and noalias metadata. Reviewed By: arpith-jacob, ftynse Differential Revision: https://reviews.llvm.org/D109025	2021-09-01 16:54:20 +00:00
wren romano	b04b757a8e	[mlir][sparse] Rename the public SparseTensorStorage::asCOO to toCOO Trying to reduce confusion by having the name of the public method match that of the private method for handling the recursion. Also adding some comments to SparseTensorStorage::fromCOO to help clarify what the recursive calls are doing in the dense case. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108954	2021-08-31 15:44:34 -07:00
Mehdi Amini	c7515a49b1	Fix MLIR python binding test after changes in ASM printer	2021-08-31 18:50:22 +00:00
MaheshRavishankar	b686fdbf92	[mlir][Linalg] Drop output tensor from `linalg.pad_tensor` op. The output tensor was added for tiling purposes. With use of `TilingInterface` for tiling pad operations, there is no need for an explicit operand for the shape of result of `linalg.pad_tensor` op. The interface allows the tiling pattern to query the value that can be used for the "init" needed for tiling dynamically. Differential Revision: https://reviews.llvm.org/D108613	2021-08-31 11:12:24 -07:00
Mehdi Amini	387f95541b	Add a new interface allowing to set a default dialect to be used for printing/parsing regions Currently the builtin dialect is the default namespace used for parsing and printing. As such module and func don't need to be prefixed. In the case of some dialects that defines new regions for their own purpose (like SpirV modules for example), it can be beneficial to change the default dialect in order to improve readability. Differential Revision: https://reviews.llvm.org/D107236	2021-08-31 17:52:40 +00:00
Mehdi Amini	c41b16c26b	Change ASM Op printer to print the operation name in the framework instead of leaving it up to each individual operation This aligns the printer with the parser contract: the operation isn't part of the user-controllable part of the syntax. Differential Revision: https://reviews.llvm.org/D108804	2021-08-31 17:52:40 +00:00
Mehdi Amini	fd87963eee	Change dialect `printOperation()` hook to `getOperationPrinter()` This makes the hook return a printer if available, instead of using LogicalResult to indicate if a printer was available (and invoked). This allows the caller to detect that the dialect has a printer for a given operation without actually invoking the printer. It'll be leveraged in a future revision to move printing the op name itself under control of the ASMPrinter. Differential Revision: https://reviews.llvm.org/D108803	2021-08-31 17:52:39 +00:00
Tres Popp	44485fcd97	[mlir] Prevent assertion failure in DropUnitDims Don't assert fail on strided memrefs when dropping unit dims. Instead just leave them unchanged. Differential Revision: https://reviews.llvm.org/D108205	2021-08-31 12:15:13 +02:00
marina kolpakova a.k.a. geexie	0080d2aa55	[mlir][gpu] folds memref.dim of gpu.alloc implements canonicalization which folds memref.dim(gpu.alloc(%size), %idx) -> %size Differential Revision: https://reviews.llvm.org/D108892	2021-08-31 12:33:10 +03:00
Stella Laurenzo	f05ff4f757	[mlir][python] Apply py::module_local() to all classes. * This allows multiple MLIR-API embedding downstreams to co-exist in the same process. * I believe this is the last thing needed to enable isolated embedding. Differential Revision: https://reviews.llvm.org/D108605	2021-08-30 22:18:43 -07:00
MaheshRavishankar	2dfb66833f	Fix unused variable in release build. Differential Revision: https://reviews.llvm.org/D108963	2021-08-30 19:34:52 -07:00
MaheshRavishankar	ba72cfe734	[mlir] Add an interface to allow operations to specify how they can be tiled. An interface to allow for tiling of operations is introduced. The tiling of the linalg.pad_tensor operation is modified to use this interface. Differential Revision: https://reviews.llvm.org/D108611	2021-08-30 16:31:18 -07:00
Chris Lattner	faf1c22408	[Builder] Eliminate the StringRef/StringAttr forms of getSymbolRefAttr. The StringAttr version doesn't need a context, so we can just use the existing `SymbolRefAttr::get` form. The StringRef version isn't preferred so we want to encourage people to use StringAttr. There is an additional form of getSymbolRefAttr that takes a (SymbolTrait implementing) operation. This should also be moved, but I'll do that as a separate patch. Differential Revision: https://reviews.llvm.org/D108922	2021-08-30 16:05:36 -07:00
natashaknk	203d38b234	[mlir][tosa] Small refactor to the functionality of Conv2D and Fully_connected to add the bias at the end of the convolution Made to adjust for a modification to the tiling algorithm Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108746	2021-08-30 13:18:43 -07:00
Stella Laurenzo	8e6c55c92c	[mlir][python] Extend C/Python API to be usable for CFG construction. * It is pretty clear that no one has tried this yet since it was both incomplete and broken. * Fixes a symbol hiding issues keeping even the generic builder from constructing an operation with successors. * Adds ODS support for successors. * Adds CAPI `mlirBlockGetParentRegion`, `mlirRegionEqual` + tests (and missing test for `mlirBlockGetParentOperation`). * Adds Python property: `Block.region`. * Adds Python methods: `Block.create_before` and `Block.create_after`. * Adds Python property: `InsertionPoint.block`. * Adds new blocks.py test to verify a plausible CFG construction case. Differential Revision: https://reviews.llvm.org/D108898	2021-08-30 08:28:00 -07:00
Alex Zinenko	9db95a67d1	Fix interface trait declaration in SymbolInterfaces.td `41d4aa7de6` introduced incorrect code in extraTraitClassDeclaration: `this` refers to the trait class and not the operation class so `->getContext()` is not valid. Use `$_op` instead.	2021-08-30 11:15:05 +02:00
Chris Lattner	41d4aa7de6	[SymbolRefAttr] Revise SymbolRefAttr to hold a StringAttr. SymbolRefAttr is fundamentally a base string plus a sequence of nested references. Instead of storing the string data as a copies StringRef, store it as an already-uniqued StringAttr. This makes a lot of things simpler and more efficient because: 1) references to the symbol are already stored as StringAttr's: there is no need to copy the string data into MLIRContext multiple times. 2) This allows pointer comparisons instead of string comparisons (or redundant uniquing) within SymbolTable.cpp. 3) This allows SymbolTable to hold a DenseMap instead of a StringMap (which again copies the string data and slows lookup). This is a moderately invasive patch, so I kept a lot of compatibility APIs around. It would be nice to explore changing getName() to return a StringAttr for example (right now you have to use getNameAttr()), and eliminate things like the StringRef version of getSymbol. Differential Revision: https://reviews.llvm.org/D108899	2021-08-29 21:54:47 -07:00
Matthias Springer	d18ffd61d4	[mlir][SCF] Canonicalize dim(x) where x is an iter_arg * Add `DimOfIterArgFolder`. * Move existing cross-dialect canonicalization patterns to `LoopCanonicalization.cpp`. * Rename `SCFAffineOpCanonicalization` pass to `SCFForLoopCanonicalization`. * Expand documentaton of scf.for: The type of loop-carried variables may not change with iterations. (Not even the dynamic type.) Differential Revision: https://reviews.llvm.org/D108806	2021-08-30 01:39:56 +00:00
Matthias Springer	eedc997b7d	[mlir][Analysis] Add batched version of FlatAffineConstraints::addId * Add batched version of all `addId` variants, so that multiple IDs can be added at a time. * Rename `addId` and variants to `insertId` and `appendId`. Most external users call `appendId`. Splitting `addId` into two functions also makes it possible to provide batched version for both. (Otherwise, the overloads are ambigious when calling `addId`.) Differential Revision: https://reviews.llvm.org/D108532	2021-08-30 00:56:44 +00:00
Lei Zhang	a5621e26db	[mlir][spirv] Use type dyn_cast when scanning spv.GlobalVariable This avoids crashes when there are spv.GlobalVariable without pointer type.	2021-08-29 12:01:19 -04:00
Aart Bik	b9f87e24f2	[mlir] add missing include, fix broken build Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108873	2021-08-28 09:36:38 -07:00
Markus Böck	0235e3c7a6	[mlir][NFC] Fully qualify default value of Attributes `getStorageType()` in files generated by mlir-tblgen	2021-08-28 15:37:56 +02:00
Uday Bondhugula	4edc9e2acf	[MLIR][GPU] Drop mgpuMemHostRegisterMemRef's dependence on LLVM Support Drop mgpuMemHostRegisterMemRef's dependence on LLVM Support. This method is the only one in CUDA runtime wrappers library that creates a dependence on libLLVMSupport due to its use of SmallVector and ArrayRef. The code can be as easily/compactly written without those ADT. The dependence on LLVMSupport adds a significant amount of additional complexity for external things that want to link this library in (both statically or as a shared object) since libLLVMSupport includes numerous other objects that are sensitive to C++ compiler version and ABI. Differential Revision: https://reviews.llvm.org/D108684	2021-08-28 11:37:55 +05:30
Mehdi Amini	022538f276	Remove `const` from `const T &&` in debugString() helper to make it a universal reference (NFC) It broke lvalue arguments otherwise:	2021-08-28 01:09:00 +00:00
Mehdi Amini	4387975170	Use a universal reference (&& instead of const &) for `debugString()` helper (NFC) Some classes like mlir::Operation have a non-const print() method.	2021-08-28 00:41:41 +00:00
Mehdi Amini	c0b70def21	Specify argument to be `const` for `debugString()` helper (NFC) This allows using this helper with rvalues.	2021-08-28 00:10:53 +00:00
Aart Bik	0a7b8cc5dd	[mlir][sparse] fully implement sparse tensor to sparse tensor conversions with rigorous integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108721	2021-08-27 15:08:18 -07:00
Butygin	1e35a7690d	[mlir][spirv] Initial support for 64 bit index type and builtins Differential Revision: https://reviews.llvm.org/D108516	2021-08-27 01:38:53 +03:00
Rob Suderman	90478251c7	[mlir][tosa] Tosa reverse to linalg supporting dynamic shapes Needed to switch to extract to support tosa.reverse using dynamic shapes. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108744	2021-08-26 13:23:59 -07:00
Rob Suderman	0600bb4d18	[mlir][tosa] Elementwise operation dynamic shape support Added dynamic shape support for elementwise operations. This assumes equal sizes (broadcasting 1-length dynamic is problematic). Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108730	2021-08-26 11:18:58 -07:00
Aart Bik	6b26857dbf	[mlir][sparse] add asCOO() functionality to sparse tensor object This prepares general sparse to sparse conversions. The code that needs to be generated using this new feature is now simply: (1) coo = sparse_tensor_1->asCOO(); // source format1 (2) sparse_tensor_2 = newSparseTensor(coo); // destination format2 By using COO as an intermediate, we can do all conversions without having to implement the full O(N^2) conversion matrix. Note that we can always improve particular conversions individually if a faster solution is required. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108681	2021-08-25 21:50:39 -07:00
Tobias Gysi	8e9808ca3a	[mlir][linalg] Tune hasTensorSemantics/hasBufferSemantics methods. Optimize performance by iterating all operands at once. Reviewed By: benvanik Differential Revision: https://reviews.llvm.org/D108716	2021-08-25 19:28:37 +00:00
Tobias Gysi	2b35b372fd	[mlir][linalg] Tune getTiedIndexingMap method (NFC). Optimize the performance by using the range directly. Reviewed By: benvanik Differential Revision: https://reviews.llvm.org/D108715	2021-08-25 18:44:01 +00:00
Aart Bik	d5f7f356ce	[mlir][sparse] add sparse-dense cases to storage integration test Reviewed By: grosul1 Differential Revision: https://reviews.llvm.org/D108685	2021-08-25 11:33:20 -07:00
River Riddle	c8d9e1ce43	[mlir][AttrTypeGen] Add support for specifying a "accessor" type of a parameter This allows for using a different type when accessing a parameter than the one used for storage. This allows for returning parameters by reference, enables using more optimized/convient reference results, and more. Differential Revision: https://reviews.llvm.org/D108593	2021-08-25 09:27:36 +00:00
River Riddle	9658b061dd	[mlir] Update DialectAsmParser::parseString to use std::string instead of StringRef This allows for parsing strings that have escape sequences, which require constructing a string (as they can't be represented by looking at the Token contents directly). Differential Revision: https://reviews.llvm.org/D108589	2021-08-25 09:27:35 +00:00
River Riddle	aea3026ea7	[mlir] Move the Operation use iteration utilities to ResultRange This allows for iterating and interacting with the uses of a specific subset of results as opposed to just the full range. Differential Revision: https://reviews.llvm.org/D108586	2021-08-25 09:27:35 +00:00
Tres Popp	868bd9938d	[mlir] Add assertion in NamedAttrList to prevent adding null attributes Differential Revision: https://reviews.llvm.org/D108570	2021-08-25 11:06:53 +02:00
Rob Suderman	5541a05d6a	[mlir][tosa] Quantized tosa.avg_pool2d lowering to linalg Includes the quantized version of average pool lowering to linalg dialect. This includes a lit test for the transform. It is not 100% correct as the multiplier / shift should be done in i64 however this is negligable rounding difference. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108676	2021-08-24 18:54:23 -07:00
Rob Suderman	4ef1770abd	[mlir][tosa] Table did not apply offset before extract on i8 input Lowering to table was incorrect as it did not apply a 128 offset before extracting the value from the table. Fixed and correct tensor length on input table. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108436	2021-08-24 18:52:33 -07:00
Matthias Springer	a9cff97f94	[mlir][SCF] Generalize AffineMinSCFCanonicalization to min/max ops * Add support for affine.max ops to SCF loop peeling pattern. * Add support for affine.max ops to `AffineMinSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalizationPattern` to `AffineOpSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalization` pass to `SCFAffineOpCanonicalization`. Differential Revision: https://reviews.llvm.org/D108009	2021-08-25 10:40:34 +09:00
wren romano	90e0c657b7	[mlir][sparse] Correcting the use of emplace_back The emplace commands are variadic and should take all the constructor arguments directly, since they implicitly call the constructor themselves in order to avoid the cost of constructing and then moving/copying temporaries. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108670	2021-08-24 18:32:13 -07:00
Rob Suderman	a7bf93807b	[mlir][tosa] Fix conv/depthwise conv padding for quantized values When padding quantized operations, the padding needs to equal the zero point of the input value. Corrected the pass to change the padding value if quantized. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108440	2021-08-24 18:13:22 -07:00
Chenggang Zhao	2b2c13e672	[mlir][docs] A friendlier improvement for the Toy tutorial chapter 4. Add notes for discarding private-visible functions in the Toy tutorial chapter 4. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108026	2021-08-25 00:44:51 +00:00
Matthias Springer	2de2dbef2a	[mlir][linalg] Replace AffineMinSCFCanonicalizationPattern with SCF reimplementation Use the new canonicalization pattern in the SCF dialect. Differential Revision: https://reviews.llvm.org/D107732	2021-08-25 08:52:56 +09:00
Aart Bik	c5735fada4	[mlir][sparse] enable a few vectorized runs in integration tests Recent changes outside sparse compiler exposed the requirement of running a new pass (lower-affine) but this only became apparent with private testing. By adding some vectorized runs to integration test, we will detect the need for such changes earlier and also widen codegen coverage of course. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D108667	2021-08-24 16:08:01 -07:00
Matthias Springer	98aa694d0d	[mlir][scf] Add general affine.min canonicalization pattern This canonicalization simplifies affine.min operations inside "for loop"-like operations (e.g., scf.for and scf.parallel) based on two invariants: * iv >= lb * iv < lb + step * ((ub - lb - 1) floorDiv step) + 1 This commit adds a new pass `canonicalize-scf-affine-min` (instead of being a canonicalization pattern) to avoid dependencies between the Affine dialect and the SCF dialect. Differential Revision: https://reviews.llvm.org/D107731	2021-08-25 07:32:30 +09:00
Logan Chien	88125e8af1	[mlir] Fix attachInterface typo This commit fixes the documentation typo regarding `attachInterface`. Differential Revision: https://reviews.llvm.org/D108666	2021-08-24 15:17:52 -07:00
Tyler Augustine	d25e91d7f6	Support alias.scope and noalias metadata Introduces new Ops to represent 1. alias.scope metadata in LLVM, and 2. domains for these scopes. These correspond to the metadata described in https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata. Lists of scopes are modeled the same way as access groups - as an ArrayAttr on the Op (added in https://reviews.llvm.org/D97944). Lowering 'noalias' attributes on function parameters is already supported. However, lowering `noalias` metadata on individual Ops is not, which is added in this change. LLVM uses the same keyword for these, but this change introduces a separate attribute name 'noalias_scopes' to represent this distinct concept. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107870	2021-08-24 20:42:59 +02:00
Aart Bik	fda176892e	[mlir][sparse] use new permutation utility to avoid codedup Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D108636	2021-08-24 08:48:17 -07:00
Aart Bik	a643bd3189	[mlir] add permutation utility I found myself typing this code several times at different places by now, so time to make this a general utility instead. Given a permutation, it returns the permuted position of the input, for example (i,j,k) -> (k,i,j) yields position 1 for input 0. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D108347	2021-08-24 08:07:40 -07:00
Matthias Springer	ebf35370ff	[mlir][tensor] Insert explicit tensor.cast ops for insert_slice src If additional static type information can be deduced from a insert_slice's size operands, insert an explicit cast of the op's source operand. This enables other canonicalization patterns that are matching for tensor_cast ops such as `ForOpTensorCastFolder` in SCF. Differential Revision: https://reviews.llvm.org/D108617	2021-08-24 19:45:04 +09:00
Matthias Springer	0c36082963	[mlir][SCF] Use symbols in loop peeling rewrite Use symbols in the affine map instead of dims. Dims should not be divided. Differential Revision: https://reviews.llvm.org/D108431	2021-08-24 19:39:19 +09:00
MaheshRavishankar	b546f4347b	[mlir]Linalg] Allow controlling fusion of linalg.generic -> linalg.tensor_expand_shape. Differential Revision: https://reviews.llvm.org/D108565	2021-08-23 16:28:10 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
River Riddle	4e103a12d9	[mlir] Add support for VariadicOfVariadic operands This revision adds native ODS support for VariadicOfVariadic operand groups. An example of this is the SwitchOp, which has a variadic number of nested operand ranges for each of the case statements, where the number of case statements is variadic. Builtin ODS support allows for generating proper accessors for the nested operand ranges, builder support, and declarative format support. VariadicOfVariadic operands are supported by providing a segment attribute to use to store the operand groups, mapping similarly to the AttrSizedOperand trait (but with a user defined attribute name). `build` methods for VariadicOfVariadic operand expect inputs of the form `ArrayRef<ValueRange>`. Accessors for the variadic ranges return a new `OperandRangeRange` type, which represents a contiguous range of `OperandRange`. In the declarative assembly format, VariadicOfVariadic operands and types are by default formatted as a comma delimited list of value lists: `(<value>, <value>), (), (<value>)`. Differential Revision: https://reviews.llvm.org/D107774	2021-08-23 20:32:31 +00:00
MaheshRavishankar	4aeeb91a92	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-23 13:06:34 -07:00
River Riddle	da12d88b1c	[mlir][NFC] Add inlineRegion overloads that take a block iterator insert position This allows for inlining into an empty block or to the beginning of a block. NFC as the existing implementations now foward to this overload. Differential Revision: https://reviews.llvm.org/D108572	2021-08-23 19:49:53 +00:00
River Riddle	e4635e6328	[mlir][FoldUtils] Ensure the created constant dominates the replaced op This revision fixes a bug where an operation would get replaced with a pre-existing constant that didn't dominate it. This can occur when a pattern inserts operations to be folded at the beginning of the constants insertion block. This revision fixes the bug by moving the existing constant before the replaced operation in such cases. This is fine because if a constant didn't already exist, a new one would have been inserted before this operation anyways. Differential Revision: https://reviews.llvm.org/D108498	2021-08-23 18:48:24 +00:00
Krzysztof Drewniak	469172f3f4	[MLIR][Docs] Fix broken link to tuple type rationale Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108135	2021-08-23 18:35:36 +00:00
Matthias Springer	bc194a5bb5	[mlir][SCF] Do not peel loops inside partial iterations Do not apply loop peeling to loops that are contained in the partial iteration of an already peeled loop. This is to avoid code explosion when dealing with large loop nests. Can be controlled with a new pass option `skip-partial`. Differential Revision: https://reviews.llvm.org/D108542	2021-08-23 21:35:46 +09:00
Stella Laurenzo	a8de667af0	[mlir] Add op for NCHW conv2d. * This is the native data layout for PyTorch and npcomp was using the prior version before cleanup. Differential Revision: https://reviews.llvm.org/D108527	2021-08-22 17:27:33 -07:00
Stella Laurenzo	64e74e9d7c	[mlir][linalg] Add script to update the LinalgNamedStructuredOps.yaml. nfc Also adds banners to the files with update instructions. Differential Revision: https://reviews.llvm.org/D108529	2021-08-22 16:54:51 -07:00
Stella Laurenzo	e78b745cf2	[mlir][python] Makes C++ extension code relocatable by way of a macro. * Resolves a TODO by making this configurable by downstreams. * This seems to be the last thing allowing full use of the Python bindings as a library within another project (i.e. be embedding them). Differential Revision: https://reviews.llvm.org/D108523	2021-08-22 13:46:14 -07:00
William S. Moses	973cb2c326	[MLIR][OMP] Ensure nested scf.parallel execute all iterations Presently, the lowering of nested scf.parallel loops to OpenMP creates one omp.parallel region, with two (nested) OpenMP worksharing loops on the inside. When lowered to LLVM and executed, this results in incorrect results. The reason for this is as follows: An OpenMP parallel region results in the code being run with whatever number of threads available to OpenMP. Within a parallel region a worksharing loop divides up the total number of requested iterations by the available number of threads, and distributes accordingly. For a single ws loop in a parallel region, this works as intended. Now consider nested ws loops as follows: omp.parallel { A: omp.ws %i = 0...10 { B: omp.ws %j = 0...10 { code(%i, %j) } } } Suppose we ran this on two threads. The first workshare loop would decide to execute iterations 0, 1, 2, 3, 4 on thread 0, and iterations 5, 6, 7, 8, 9 on thread 1. The second workshare loop would decide the same for its iteration. This means thread 0 would execute i \in [0, 5) and j \in [0, 5). Thread 1 would execute i \in [5, 10) and j \in [5, 10). This means that iterations i in [5, 10), j in [0, 5) and i in [0, 5), j in [5, 10) never get executed, which is clearly wrong. This permits two options for a remedy: 1) Change the semantics of the omp.wsloop to be distinct from that of the OpenMP runtime call or equivalently #pragma omp for. This could then allow some lowering transformation to remedy the aforementioned issue. I don't think this is desirable for an abstraction standpoint. 2) When lowering an scf.parallel always surround the wsloop with a new parallel region (thereby causing the innermost wsloop to use the number of threads available only to it). This PR implements the latter change. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108426	2021-08-20 19:06:28 -04:00
Rob Suderman	871c812483	[mlir][linalg] Finish refactor of TC ops to YAML Multiple operations were still defined as TC ops that had equivalent versions as YAML operations. Reducing to a single compilation path guarantees that frontends can lower to their equivalent operations without missing the optimized fastpath. Some operations are maintained purely for testing purposes (mainly conv{1,2,3}D as they are included as sole tests in the vectorizaiton transforms. Differential Revision: https://reviews.llvm.org/D108169	2021-08-20 12:35:04 -07:00
Aart Bik	758ccf8506	[mlir][sparse] add test for DimOp folding Folding in the MLIR uses the order of the type directly but folding in the underlying implementation must take the dim ordering into account. These tests clarify that behavior and verify it is done right. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108474	2021-08-20 11:24:09 -07:00
Aart Bik	24ea94ad0c	[mlir][sparse][python] migrate more code from boilerplate into proper numpy land The boilerplate was setting up some arrays for testing. To fully illustrate python - MLIR potential, however, this data should also come from numpy land. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108336	2021-08-20 09:18:17 -07:00
Jacques Pienaar	a232a48dca	[mlir][ods] Skip adding TOC in doc gen when present Enables adding a TOC in the description to be able to interleave documentation before and after the TOC.	2021-08-20 07:01:54 -07:00
Denys Shabalin	1631d9a7ea	[mlir][linalg] Fix __repr__ implementation in const from opdsl Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D108369	2021-08-20 12:39:57 +02:00
Vladislav Vinogradov	9775c0c9f0	[mlir] Fix ControlFlowInterfaces implementation for Async dialect * Add `RegionBranchTerminatorOpInterface` to `YieldOp`. * Implement `getSuccessorEntryOperands` in `ExecuteOp`. * Fix `getSuccessorRegions` implementation in `ExecuteOp`. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D108373	2021-08-20 12:14:45 +03:00
Vladislav Vinogradov	d1883bc322	[mlir][NFC] Use explicit ::mlir namespace in mlir-tblgen generated code Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108376	2021-08-20 11:52:25 +03:00
Rob Suderman	3205ee7e81	[mlir][tosa] Support UInt8 inputs and outputs for tosa.rescale Tosa rescale can contain uint8 types. Added support for these types using an unrealized conversion cast. Optimistically it would be better to use bitcast however it does not support unsigned integers. Differential Revision: https://reviews.llvm.org/D108427	2021-08-19 18:58:44 -07:00
Morten Borup Petersen	6c1436a9b0	[MLIR][SCF] Parenthesize multiple return types in scf.execute_region asm op Previously, ExecuteRegionOps with multiple return values would fail a round-trip test due to missing parenthesis around the types. Differential Revision: https://reviews.llvm.org/D108402	2021-08-19 21:31:51 +01:00
MaheshRavishankar	16ffb283c5	Revert "[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes." This reverts commit `95ddc8341a`. Differential Revision: https://reviews.llvm.org/D108396	2021-08-19 11:53:41 -07:00
MaheshRavishankar	95ddc8341a	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-19 11:14:35 -07:00
Matthias Springer	76a1861816	[mlir][SparseTensor] Split scf.for loop into masked/unmasked parts Apply the "for loop peeling" pattern from SCF dialect transforms. This pattern splits scf.for loops into full and partial iterations. In the full iteration, all masked loads/stores are canonicalized to unmasked loads/stores. Differential Revision: https://reviews.llvm.org/D107733	2021-08-19 21:53:11 +09:00
Matthias Springer	8e8b70aa84	[mlir][scf] Simplify affine.min ops after loop peeling Simplify affine.min ops, enabling various other canonicalizations inside the peeled loop body. affine.min ops such as: ``` map = affine_map<(d0)[s0, s1] -> (s0, -d0 + s1)> %r = affine.min #affine.min #map(%iv)[%step, %ub] ``` are rewritten them into (in the case the peeled loop): ``` %r = %step ``` To determine how an affine.min op should be rewritten and to prove its correctness, FlatAffineConstraints is utilized. Differential Revision: https://reviews.llvm.org/D107222	2021-08-19 17:24:53 +09:00
John Demme	96fbd5cd5e	[MLIR] [Python] Add `owner` to `mlir.ir.Block` Provides a way for python users to access the owning Operation from a Block.	2021-08-19 00:02:09 -07:00
Tobias Gysi	234c4d2362	[mlir][linalg] Set result types in all builders. Add code to set the result types in all yaml op builders. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D108273	2021-08-19 06:19:12 +00:00
Matthias Springer	08dbed8a57	[mlir][linalg] Canonicalize dim ops of tiled_loop block args E.g.: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %x, %c0 : tensor<...> } ``` is rewritten to: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %y, %c0 : tensor<...> } ``` Differential Revision: https://reviews.llvm.org/D108272	2021-08-19 11:24:33 +09:00
Matthias Springer	9329438244	[mlir][linalg] Remove ConstraintsSet class The same functionality can be implemented with FlatAffineValueConstraints. Differential Revision: https://reviews.llvm.org/D108179	2021-08-19 10:57:35 +09:00
Matthias Springer	c777e51468	[mlir][Analysis][NFC] FlatAffineConstraints: Use BoundType enum in functions Differential Revision: https://reviews.llvm.org/D108185	2021-08-19 10:33:42 +09:00
Aart Bik	d37d72eaf8	[mlir][sparse] use shared util for DimOp generation This shares more code with existing utilities. Also, to be consistent, we moved dimension permutation on the DimOp to the tensor lowering phase. This way, both pre-existing DimOps on sparse tensors (not likely but possible) as well as compiler generated DimOps are handled consistently. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108309	2021-08-18 17:12:32 -07:00
Diego Caballero	b7cac864b2	[mlir] Fix typo in SuperVectorizer NFC. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108334	2021-08-18 22:55:12 +00:00
Chia-hung Duan	41e5dbe0fa	Enables inferring return types for Shape op if possible Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D102565	2021-08-18 21:36:55 +00:00
Robert Suderman	76c9712196	[mlir][tosa] Fix clamp to restrict only within valid bitwidth range Its possible for the clamp to have invalid min/max values on its range. To fix this we validate the range of the min/max and clamp to a valid range. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108256	2021-08-18 12:14:01 -07:00
William S. Moses	8c2ff7b69e	[MLIR] Correct linkage of lowered globalop LLVM considers global variables marked as externals to be defined within the module if it is initialized (including to an undef). Other external globals are considered as being defined externally and imported into the current translation unit. Lowering of MLIR Global Ops does not properly propagate undefined initializers, resulting in a global which is expected to be defined within the current TU, not being defined. Differential Revision: https://reviews.llvm.org/D108252	2021-08-18 11:09:43 -04:00
Butygin	ddc3d51d58	[mlir][spirv] Add (InBounds)PtrAccessChain ops Differential Revision: https://reviews.llvm.org/D108070	2021-08-18 17:59:21 +03:00
Jacques Pienaar	b41bfb819d	[mlir][ods] Fix packing in OperandOrAttribute Wrong combiner was used which led to information loss.	2021-08-17 20:55:48 -07:00
Lei Zhang	4c15ad2321	[mlir][linalg] Don't drop existing attributes when creating ops Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D108219	2021-08-17 15:44:56 -04:00
MaheshRavishankar	836649e040	Allow setting attributes in build method generated by YAML-gen. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D108182	2021-08-17 09:09:52 -07:00
Tobias Gysi	583a754248	[mlir][linalg] Remove duplicate methods (NFC). Remove duplicate methods used to check iterator types. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108102	2021-08-17 09:06:17 +00:00
John Demme	1689dade42	[MLIR] [Python] Allow 'operation.parent' to return 'None' This is more Pythonic and better matches the C++ and C APIs. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D108183	2021-08-16 22:38:07 -07:00
John Demme	5821047aac	[MLIR] [Python] Fix out-of-tree Windows python bindings MSVC needs to know where to put the archive (.lib) as well as the runtime (.dll). If left to the default location, multiple rules to generate the same file will be produced, creating a Ninja error. Differential Revision: https://reviews.llvm.org/D108181	2021-08-16 19:18:54 -07:00
Matthias Springer	c19c51e357	[mlir][Analysis][NFC] Clean up FlatAffineValueConstraints * Rename ids to values in FlatAffineValueConstraints. * Overall cleanup of comments in FlatAffineConstraints and FlatAffineValueConstraints. Differential Revision: https://reviews.llvm.org/D107947	2021-08-17 10:38:57 +09:00
Matthias Springer	4c4ab673f1	[mlir][Analysis][NFC] Split FlatAffineConstraints class * Extract "value" functionality of `FlatAffineConstraints` into a new derived `FlatAffineValueConstraints` class. Current users of `FlatAffineConstraints` can use `FlatAffineValueConstraints` without additional code changes, thus NFC. * `FlatAffineConstraints` no longer associates dimensions with SSA Values. All functionality that requires this, is moved to `FlatAffineValueConstraints`. * `FlatAffineConstraints` no longer makes assumptions about where Values associated with dimensions are coming from. Differential Revision: https://reviews.llvm.org/D107725	2021-08-17 10:09:17 +09:00
Geoffrey Martin-Noble	e2c97d4484	[MLIR] Add a bitcast method to DenseElementsAttr This method bitcasts a DenseElementsAttr elementwise to one of the same shape with a different element type. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D107612	2021-08-16 17:13:35 -07:00
Rob Suderman	f328f72e60	[mlir][tosa] Fixed depthwise conv parallel/reduction indices order Reduction axis should come after all parallel axis to work with vectorization. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108005	2021-08-16 14:06:22 -07:00
Robert Suderman	65532ea6dd	[mlir][linalg] Clear unused linalg tc operations These operations are not lowered to from any source dialect and are only used for redundant tests. Removing these named ops, along with their associated tests, will make migration to YAML operations much more convenient. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D107993	2021-08-16 11:55:45 -07:00
Aart Bik	19a906f372	[mlir][sparse][python] make imports more selective Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108055	2021-08-16 11:53:29 -07:00
tashuang.zk	2d45e332ba	[MLIR][DISC] Revise ParallelLoopTilingPass with inbound_check mode Expand ParallelLoopTilingPass with an inbound_check mode. In default mode, the upper bound of the inner loop is from the min op; in inbound_check mode, the upper bound of the inner loop is the step of the outer loop and an additional inbound check will be emitted inside of the inner loop. This was 'FIXME' in the original codes and a typical usage is for GPU backends, thus the outer loop and inner loop can be mapped to blocks/threads in seperate. Differential Revision: https://reviews.llvm.org/D105455	2021-08-16 14:02:53 +02:00
Tres Popp	2848f6966e	[mlir] Set top-down traversal for LinalgElementwiseOpFusion The primary pattern for this pass clones many operations from producers to consumers. Doing this top down prevents duplicated work when a producer has multiple consumers, if it also is consuming another linalg.generic. As an example, a chain of ~2600 generics that are fused into ~70 generics was resulting in 16255 pattern invocations. This took 14 seconds on one machine but takes only 0.3 seconds with top-down traversal. Differential Revision: https://reviews.llvm.org/D107818	2021-08-16 09:26:49 +02:00
Stephen Neuendorffer	7776b19eed	[MLIR] Move TestDialect to ::test namespace While the changes are extensive, they basically fall into a few categories: 1) Moving the TestDialect itself. 2) Updating C++ code in tablegen to explicitly use ::mlir, since it will be put in a headers that shouldn't expect a 'using'. 3) Updating some generic MLIR Interface definitions to do the same thing. 4) Updating the Tablegen generator in a few places to be explicit about namespaces 5) Doing the same thing for llvm references, since we no longer pick up the definitions from mlir/Support/LLVM.h Differential Revision: https://reviews.llvm.org/D88251	2021-08-14 13:24:41 -07:00
harsh-nod	e33f301ec2	[mlir] Add support for moving reductions to outer most dimensions in vector.multi_reduction The approach for handling reductions in the outer most dimension follows that for inner most dimensions, outlined below First, transpose to move reduction dims, if needed Convert reduction from n-d to 2-d canonical form Then, for outer reductions, we emit the appropriate op (add/mul/min/max/or/and/xor) and combine the results. Differential Revision: https://reviews.llvm.org/D107675	2021-08-13 12:59:50 -07:00
Lorenzo Chelini	e537a3adde	[MLIR][Linalg] Fix typo	2021-08-13 18:00:14 +02:00
Adrian Kuegel	3c6f115ffc	[mlir] Remove unused header include. Also adjust BUILD.bazel and remove an unused dependency. Differential Revision: https://reviews.llvm.org/D108027	2021-08-13 14:23:14 +02:00
Michael Kruse	b1de32d6dd	[OMPIRBuilder] Clarify CanonicalLoopInfo. NFC. Add in-source documentation on how CanonicalLoopInfo is intended to be used. In particular, clarify what parts of a CanonicalLoopInfo is considered part of the loop, that those parts must be side-effect free, and that InsertPoints to instructions outside those parts can be expected to be preserved after method calls implementing loop-associated directives. CanonicalLoopInfo are now invalidated after it does not describe canonical loop anymore and asserts when trying to use it afterwards. In addition, rename `createXYZWorkshareLoop` to `applyXYZWorkshareLoop` and remove the update location to avoid that the impression that they insert something from scratch at that location where in reality its InsertPoint is ignored. createStaticWorkshareLoop does not return a CanonicalLoopInfo anymore. First, it was not a canonical loop in the clarified sense (containing side-effects in form of calls to the OpenMP runtime). Second, it is ambiguous which of the two possible canonical loops it should actually return. It will not be needed before a feature expected to be introduced in OpenMP 6.0 Also see discussion in D105706. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D107540	2021-08-12 21:02:19 -05:00
natashaknk	ba0997ca09	[mlir][tosa] Fix depthwise_conv2D strides/dilation and name Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D107997	2021-08-12 15:43:41 -07:00
Chia-hung Duan	62df4df41c	[mlir-tblgen] Minor Refactor for StaticVerifierFunctionEmitter. Move StaticVerifierFunctionEmitter to CodeGenHelper.h so that it can be used for both ODS and DRR. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D106636	2021-08-12 20:53:05 +00:00
Aart Bik	56d607006d	[mlir][sparse][python] add an "exhaustive" sparse test using python Using the python API to easily set up sparse kernels, this test exhaustively builds, compilers, and runs SpMM for all annotations on a sparse tensor, making sure every version generates the correct result. This test also illustrates using the python API to set up a sparse kernel and sparse compilation. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D107943	2021-08-12 11:13:04 -07:00
Florian Hahn	f999312872	Recommit "[Matrix] Overload stride arg in matrix.columnwise.load/store." This reverts the revert `28c04794df`. The failing MLIR test that caused the revert should be fixed in this version. Also includes a PPC test fix previously in `1f87c7c478`.	2021-08-12 18:31:57 +01:00
Tyler Augustine	3a2ff982d7	Support post-processing Ops in unrolled loop iterations This can be useful when one needs to know which unrolled iteration an Op belongs to, for example, conveying noalias information among memory-affecting ops in parallel-access loops. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107789	2021-08-11 23:11:10 +00:00
Benjamin Kramer	35d6e75aba	[mlir] Drop LLVM dialect from TestPolynomialApproximation No longer needed after `c1ebefdf77`	2021-08-12 00:58:52 +02:00
Mehdi Amini	93e084e7e8	Add missing cmake dep to fix MLIR build with BUILD_SHARED_LIBS=ON (NFC)	2021-08-11 22:51:57 +00:00
Aart Bik	a5ae34afaa	[mlir][linalg] fixed typo Differential Revision: https://reviews.llvm.org/D107915	2021-08-11 11:59:15 -07:00
Rob Suderman	7de439b2be	[mlir][tosa] Migrate tosa to more efficient linalg.conv Existing linalg.conv2d is not well optimized for performance. Changed to a version that is more aligned for optimziation. Include the corresponding transposes to use this optimized version. This also splits the conv and depthwise conv into separate implementations to avoid overly complex lowerings. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D107504	2021-08-11 11:05:12 -07:00
Benjamin Kramer	c1ebefdf77	[mlir] Make polynomial approximation emit std instead of LLVM ops This is a bit cleaner and removes issues with 2d vectors. It also has a big impact on constant folding, hence the test changes. Differential Revision: https://reviews.llvm.org/D107896	2021-08-11 16:37:21 +02:00
Alex Zinenko	a0d8a08e3e	[mlir] Add std.bitcast -> llvm.bitcast conversion The conversion is a straightforward one-to-one mapping with optional unrolling for nD vectors, similarly to other cast operations. Depends On D107889 Reviewed By: cota, akuegel Differential Revision: https://reviews.llvm.org/D107891	2021-08-11 16:30:21 +02:00
Alex Zinenko	79b0576dd4	[mlir] Tighten LLVM_AnyNonAggregate ODS type constraint The constraint was checking that the type is not an LLVM structure or array type, but was not checking that it is an LLVM-compatible type, making it accept incorrect types. As a result, some LLVM dialect ops could process values that are not compatible with the LLVM dialect leading to further issues with conversions and translations that assume all values are LLVM-compatible. Make LLVM_AnyNonAggregate only accept LLVM-compatible types. Reviewed By: cota, akuegel Differential Revision: https://reviews.llvm.org/D107889	2021-08-11 16:30:19 +02:00
Alexander Belyaev	1e733a8c04	Revert "Bufferization for tiled loop." This reverts commit `edaffebcb2`.	2021-08-11 10:04:12 +02:00
Alexander Belyaev	967578f0b8	Revert "[mlir] Change the pattern for TiledLoopOp bufferization." This reverts commit `2f946eaa9d`.	2021-08-11 10:01:36 +02:00
Matthias Springer	4b56e2ee1d	[mlir][Analysis][NFC] Remove code duplication around getFlattenedAffineExprs Remove code duplication in `addLowerOrUpperBound` and `composeMatchingMap`. Differential Revision: https://reviews.llvm.org/D107814	2021-08-11 16:02:10 +09:00
Matthias Springer	9e6e08149c	[mlir][Analysis][NFC] Reimplement FlatAffineConstraints::composeMap Reimplement this function in terms of `composeMatchingMap`. Also fix a bug in `composeMatchingMap` where local dims of `this` could be missing in `localCst`. Differential Revision: https://reviews.llvm.org/D107813	2021-08-11 15:49:50 +09:00
Matthias Springer	98e30a9b47	[mlir][Analysis][NFC] Reimplement FlatAffineConstraints::addLowerOrUpperBound Reimplement this function in terms of the function variant without Value semantics. Differential Revision: https://reviews.llvm.org/D107729	2021-08-11 15:26:36 +09:00
Matthias Springer	97e41c004c	[mlir][Analysis] Add FlatAffineConstraints::addLowerOrUpperBound This function overload is similar to the existing `FlatAffineConstraints::addLowerOrUpperBound`. It constrains a dimension based on an affine map. However, in contrast to the other overloading, it does not attempt to align dimensions/symbols of the affine map with the dimensions/symbols of the constraint set. Instead, dimensions/symbols are expected to already be aligned. Differential Revision: https://reviews.llvm.org/D107727	2021-08-11 15:13:48 +09:00
Matthias Springer	9832e1a079	[mlir][Analysis] Add alignAffineMapWithValues This function aligns an affine map (and operands) with given dims and syms SSA values. This is useful in conjunction with `FlatAffineConstraints::addLowerOrUpperBound`, which requires the `boundMap` to be aligned with the constraint set's dims and syms. Differential Revision: https://reviews.llvm.org/D107728	2021-08-11 14:59:03 +09:00
Rob Suderman	2b2ebb6f98	[mlir][tosa] Add folders for trivial tosa operation cases Some folding cases are trivial to fold away, specifically no-op cases where an operation's input and output are the same. Canonicalizing these away removes unneeded operations. The current version includes tensor cast operations to resolve shape discreprencies that occur when an operation's result type differs from the input type. These are resolved during a tosa shape propagation pass. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D107321	2021-08-10 14:43:00 -07:00
Rob Suderman	86858c62ba	[mlir][tosa] Add dilation to tosa.transpose_conv2d lowering Dilation only requires increasing the padding on the left/right side of the input, and including dilation in the convolution. This implementation still lacks support for strided convolutions. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D107680	2021-08-10 14:36:11 -07:00
natashaknk	a1f46569a1	[mlir][tosa] Add quantized and unquantized versions for tosa.depthwise_conv2d lowering Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D107855	2021-08-10 14:29:26 -07:00
Jacques Pienaar	768a517581	[mlir][drr] Improve error message for unexpected attribute (NFC) When using an attribute where a value is expected previously this would fail complaining about unbound symbol. Instead make error clear and mention common failure reason.	2021-08-10 13:03:53 -07:00

... 3 4 5 6 7 ...

8789 Commits