llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	3773d04a13	[mlir][memref] Switch ViewOp to the declarative assembly format	2022-02-17 21:34:15 +01:00
Benjamin Kramer	1af15de6b7	[mlir] Switch {collapse,expand}_shape ops to the declarative assembly format Same functionality, a lot less code.	2022-02-17 20:00:05 +01:00
Aart Bik	515c617003	[mlir][linalg][sparse] add linalg optimization passes "upstream" It is time to compose Linalg related optimizations with SparseTensor related optimizations. This is a careful first start by adding some general Linalg optimizations "upstream" of the sparse compiler in the full sparse compiler pipeline. Some minor changes were needed to make those optimizations aware of sparsity. Note that after this, we will add a sparse specific fusion rule, just to demonstrate the power of the new composition. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D119971	2022-02-17 08:55:50 -08:00
Lei Zhang	c9b36807be	[mlir][spirv] Add a pass to unify aliased resource variables In SPIR-V, resources are represented as global variables that are bound to certain descriptor. SPIR-V requires those global variables to be declared as aliased if multiple ones are bound to the same slot. Such aliased decorations can cause issues for transcompilers like SPIRV-Cross when converting to source shading languages like MSL. So this commit adds a pass to perform analysis of aliased resources and see if we can unify them into one. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119872	2022-02-17 09:08:58 -05:00
Benjamin Kramer	d955ca4937	[BufferDeallocation] Don't assume successor operands are unique This would create a double free when a memref is passed twice to the same op. This wasn't a problem at the time the pass was written but is common since the introduction of scf.while. There's a latent non-determinism that's triggered by the test, but this change is messy enough as-is so I'll leave that for later. Differential Revision: https://reviews.llvm.org/D120044	2022-02-17 14:16:32 +01:00
Alex Zinenko	d4a53f3bfa	[mlir] call target materialization more in dialect conversion During dialect conversion, target materialization is triggered to create cast-like operations when a type mismatch occurs between the value that replaces a rewritten operation and the type that another operations expects as operands processed by the type conversion. First, a dummy cast is inserted to make sure the pattern application can proceed. The decision to trigger the user-provided materialization hook is taken later based on the result of the dummy cast having uses. However, it only has uses if other patterns constructed new operations using the casted value as operand. If existing (legal) operations use the replaced value, they may have not been updated to use the casted value yet. The conversion infra would then delete the dummy cast first, and then would replace the uses with now-invalid (null in the bast case) value. When deciding whether to trigger cast materialization, check for liveness the uses not only of the casted value, but also of all the values that it replaces. This was discovered in the finalizing bufferize pass that cleans up mutually-cancelling casts without touching other operations. It is not impossible that there are other scenarios where the dialect converison infra could produce invalid operand uses because of dummy casts erased too eagerly. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D119937	2022-02-17 10:13:23 +01:00
Ivan Butygin	d271fc04d5	[mlir][gpu] Split ops sinking from gpu-kernel-outlining pass into separate pass Previously `gpu-kernel-outlining` pass was also doing index computation sinking into gpu.launch before actual outlining. Split ops sinking from `gpu-kernel-outlining` pass into separate pass, so users can use theirs own sinking pass before outlining. To achieve old behavior users will need to call both passes: `-gpu-launch-sink-index-computations -gpu-kernel-outlining`. Differential Revision: https://reviews.llvm.org/D119932	2022-02-17 10:34:20 +03:00
Eugene Zhulenev	abe2dee5eb	[mlir] NFC Async: always use 'b' for the current builder Currently some of the nested IR building inconsistently uses `nb` and `b`, it's very easy to call wrong builder outside of the current scope, so for simplicity all builders are always called `b`, and in nested IR building regions they just shadow the "parent" builder. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D120003	2022-02-16 21:20:53 -08:00
Aart Bik	34381a76c1	[mlir][sparse] avoid some codeup in sparsification transformation A very small refactoring, but a big impact on tests that expect an exact order. This revision fixes the tests, but also makes them less brittle for similar minor changes in the future! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D119992	2022-02-16 17:39:04 -08:00
Eugene Zhulenev	b171583ae7	[mlir] Async: create async.group inside the scf.if branch Reviewed By: cota Differential Revision: https://reviews.llvm.org/D119959	2022-02-16 14:47:04 -08:00
Lei Zhang	e027c00821	[mlir][tensor] Add a pattern to split tensor.pad ops This commit adds a pattern to wrap a tensor.pad op with an scf.if op to separate the cases where we don't need padding (all pad sizes are actually zeros) and where we indeed need padding. This pattern is meant to handle padding inside tiled loops. Under such cases the padding sizes typically depend on the loop induction variables. Splitting them would allow treating perfect tiles and edge tiles separately. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D117018	2022-02-16 13:43:57 -05:00
Lei Zhang	0edb412773	[mlir][linalg] Add control to pad-slice swap pattern The pad-slice swap pattern generates `scf.if` and `tensor.generate` to guard against zero-sized slices if it cannot prove the slice is always non-zero. This is safe but quite conservative. It can be unnecessary for cases where we know by problem definition such cases does not exist, even if with dynamic shaped ops or unknown tile/slice sizes, e.g., convolution padding size = 1 with kernel dim size = 3. So this commit introduces a control to the pattern to specify whether to generate the if constructs to handle such cases better, given that once the if constructs is materialized, it's very hard to analyze and simplify. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D117017	2022-02-16 11:19:35 -05:00
Benjamin Kramer	27cd2a6284	[mlir][MemRef] Lower memref.copy with an offset to memcpy memcpy can handle them as long as they're contiguous. Differential Revision: https://reviews.llvm.org/D119938	2022-02-16 17:18:31 +01:00
Shao-Ce SUN	2aed07e96c	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 13:10:09 +08:00
Shao-Ce SUN	9cc49c1951	Revert "[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter`" This reverts commit `fe25c06cc5`.	2022-02-16 11:57:49 +08:00
Shao-Ce SUN	fe25c06cc5	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` For ten years, it seems that `MCRegisterInfo` is not used by any target. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 11:47:17 +08:00
Mahesh Ravishankar	2c58cde003	[mlir][Linalg] Add pattern for folding reshape by collapsing. Fusion of `linalg.generic` with `tensor.expand_shape/tensor.collapse_shape` currently handles fusion with reshape by expanding the dimensionality of the `linalg.generic` operation. This helps fuse elementwise operations better since they are fused at the highest dimensionality while keeping all indexing maps involved projected permutations. The intent of these is to push the reshape to the boundaries of functions. The presence of named ops (or other ops across which the reshape cannot be propagated) stops the propagation to the edges of the function. At this stage, the converse patterns that fold the reshapes with generic ops by collapsing the dimensions of the generic op can push the reshape towards edges. In particular it helps the case where reshapes exist in between named ops and generic ops. `linalg.named_op` -> `tensor.expand_shape` -> `linalg.generic` Pushing the reshape down will help fusion of `linalg.named_op` -> `linalg.generic` using tile + fuse transformations. This pattern is intended to replace the following patterns 1) FoldReshapeByLinearization : These patterns create indexing maps that are not projected permutations that affect future transformations. They are only useful for folding unit-dimensions. 2) PushReshapeByExpansion : This pattern has the same functionality but has some restrictions a) It tries to avoid creating new reshapes that limits its applicability. The pattern added here can achieve the same functionality through use of the `controlFn` that allows clients of the pattern freedom to make this decision. b) It does not work for ops with indexing semantics. These patterns will be deprecated in a future patch. Differential Revision: https://reviews.llvm.org/D119365	2022-02-16 03:15:20 +00:00
Sergei Grechanik	988a3ba0d8	[mlir] Expose printer flags in AsmState This change exposes printer flags in AsmState and AsmStateImpl. All functions receiving AsmState as a parameter now use the flags from the AsmState instead of taking an additional OpPrintingFlags parameter. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119870	2022-02-15 17:27:45 -08:00
Thomas Raoux	0736bbd7e2	[mlir][scf] Add callback to annotate ops during pipelining This allow user to register a callback that can annotate operations during software pipelining. This allows user potential annotate op to know what part of the pipeline they correspond to. Differential Revision: https://reviews.llvm.org/D119866	2022-02-15 12:48:01 -08:00
Jacques Pienaar	b077ee9240	[mlir][ods] Allow type attribute/operand for 0 result ops prefixed Without results, there is no getType injected and so generating one in prefixed form doesn't result in any failures during C++ compilation. Differential Revision: https://reviews.llvm.org/D119871	2022-02-15 12:20:07 -08:00
Mogball	761bc83af4	[mlir][ods] Default-valued parameters in attribute or type defs Optional parameters with `defaultValue` set will be populated with that value if they aren't encountered during parsing. Moreover, parameters equal to their default values are elided when printing. Depends on D118210 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118544	2022-02-15 19:02:11 +00:00
Jacques Pienaar	75044e9b4f	[mlir] Flipping vector dialect to both prefixed form. Following https://discourse.llvm.org/t/psa-ods-generated-accessors-will-change-to-have-a-get-prefix-update-you-apis/4476 Mostly mechanical, avoiding function name conflicts. Differential Revision: https://reviews.llvm.org/D119607	2022-02-15 09:48:51 -08:00
Javier Setoain	71705f531f	[mlir][Arith] Disallow casting between scalable and fixed-length vectors Casting between scalable vectors and fixed-length vectors doesn't make sense. If one of the operands is scalable, the other has to be scalable to be able to guarantee they have the same shape at runtime. Differential Revision: https://reviews.llvm.org/D119568	2022-02-15 17:34:42 +00:00
Krzysztof Drewniak	1aa71944cf	[MLIR][GPU] Add missing include to SerilazeToHsaco Differential Revision: https://reviews.llvm.org/D119852	2022-02-15 17:11:33 +00:00
Krzysztof Drewniak	cc15141794	[MLIR] Link SerializeToHsaco dependencies to correct MLIR library Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D119774	2022-02-15 16:31:10 +00:00
Adrian Kuegel	b122cbebec	[mlir][Math] Fix NaN handling in Exp approximation Differential Revision: https://reviews.llvm.org/D119832	2022-02-15 15:17:56 +01:00
Ivan Butygin	a2e2fbba17	[mlir][gpu] sinkOperationsIntoLaunchOp: Add user hook for isSinkingBeneficiary Differential Revision: https://reviews.llvm.org/D119632	2022-02-15 16:50:49 +03:00
Adrian Kuegel	14843d0c3d	[mlir][OpenMP] NFC: Remove unused variable	2022-02-15 14:01:12 +01:00
Shraiysh Vaishay	166713f987	[mlir][OpenMP] Change omp.atomic.update to have generic updates This patch changes the syntax of omp.atomic.update to allow the other dialects to modify the variable with appropriate operations in the region. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D119522	2022-02-15 17:58:13 +05:30
Matthias Springer	73e880fbf1	[mlir][bufferize] Add vector-bufferize pass and remove obsolete patterns from Linalg Bufferize Differential Revision: https://reviews.llvm.org/D119444	2022-02-15 21:25:14 +09:00
Adrian Kuegel	87de451bc5	[mlir][Math] Fix NaN handling in ExpM1 approximation. Differential Revision: https://reviews.llvm.org/D119822	2022-02-15 12:10:12 +01:00
Matthias Springer	e6f691615e	[mlir][bufferize] Support tensor.expand_shape and tensor.collapse_shape Differential Revision: https://reviews.llvm.org/D112512	2022-02-15 19:53:49 +09:00
Akshay Baviskar	f1efac7f08	Add verifier for gpu.alloc op Add verifier for gpu.alloc op to verify if the dimension operand counts and symbol operand counts are same as their memref counterparts. Differential Revision: https://reviews.llvm.org/D117427	2022-02-15 15:57:58 +05:30
Matthias Springer	695c341b84	[mlir][bufferize] Generalize filtering mechanism in BufferizationOptions Support ALLOW filters and DENY filters. This is needed for compatibility with existing code that specifies more complex op filters. Differential Revision: https://reviews.llvm.org/D119820	2022-02-15 19:17:33 +09:00
Ivan Butygin	32389d0c2e	[mlir][spirv] Add OpenCL fma op and lowering Also, it seems Khronos has changed html spec format so small adjustment to script was needed. Base op parsing is also probably broken. Differential Revision: https://reviews.llvm.org/D119678	2022-02-15 11:28:20 +03:00
Shraiysh Vaishay	b85cfe208f	[OpenMP][IRBuilder] Change the default constructor for OpenMPIRBuilder::LocationDescription This patch changes the argument from template-IRBuilder to IRBuilderBase thus allowing us to write less code while getting the location from a builder. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D119717	2022-02-15 00:40:34 +05:30
Stella Laurenzo	429b0cf1de	[mlir][python] Directly implement sequence protocol on Sliceable. * While annoying, this is the only way to get C++ exception handling out of the happy path for normal iteration. * Implements sq_length and sq_item for the sequence protocol (used for iteration, including list() construction). * Implements mp_subscript for general use (i.e. foo[1] and foo[1:1]). * For constructing a `list(op.results)`, this reduces the time from ~4-5us to ~1.5us on my machine (give or take measurement overhead) and eliminates C++ exceptions, which is a worthy goal in itself. * Compared to a baseline of similar construction of a three-integer list, which takes 450ns (might just be measuring function call overhead). * See issue discussed on the pybind side: https://github.com/pybind/pybind11/issues/2842 Differential Revision: https://reviews.llvm.org/D119691	2022-02-14 09:45:17 -08:00
Aart Bik	5517208d4e	[mlir][sparse] minor cleanup of include placement Rationale: empty line between main include for this file moved include that actually defines code into right section Note that this revision started as breaking up ops/attrs even more (for bug https://github.com/llvm/llvm-project/issues/52748), but due the the connection in Dialect.initalize(), this cannot be split further). All heavy lifting refactoring was already done by River in previous cleanup. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D119617	2022-02-14 09:16:45 -08:00
Marius Brehler	88b9d1a49a	[mlir][emitc] Add a pointer type Adds a pointer type to EmitC. The emission of pointers is so far only possible by using the `emitc.opaque` type Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D119337	2022-02-14 16:42:21 +00:00
gysit	348bfc8e50	[mlir][linalg] Add attributes to region builder (NFC). Adapt the region builder signature to hand in the attributes of the created ops. The revision is a preparation step the support named ops that need access to the operation attributes during op creation. Depends On D119692 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D119693	2022-02-14 13:14:14 +00:00
Ivan Butygin	cd0d095c07	[mlir][tensor] Check ops generated by InsertSliceOpCastFolder are valid Fixes https://github.com/llvm/llvm-project/issues/53099 Differential Revision: https://reviews.llvm.org/D119663	2022-02-13 21:37:31 +03:00
Benjamin Kramer	c45c53bbae	[Shape] Simplify getShapeVec a bit. NFCI.	2022-02-13 16:58:16 +01:00
Benjamin Kramer	935a5f67d1	[AffineMap] Move result exprs into trailing storage. NFCI.	2022-02-12 15:24:00 +01:00
Benjamin Kramer	a9dcbcfe9f	Use AffineMap::getSliceMap where applicable. NFCI.	2022-02-12 14:22:05 +01:00
Matthias Springer	9106d35b91	[mlir][bufferize] Use rewriter instead of replacing all uses directly This is important for compatibility with DialectConversion.	2022-02-12 02:35:36 +09:00
Sameer Sahasrabuddhe	d8f99bb6e0	[AMDGPU] replace hostcall module flag with function attribute The module flag to indicate use of hostcall is insufficient to catch all cases where hostcall might be in use by a kernel. This is now replaced by a function attribute that gets propagated to top-level kernel functions via their respective call-graph. If the attribute "amdgpu-no-hostcall-ptr" is absent on a kernel, the default behaviour is to emit kernel metadata indicating that the kernel uses the hostcall buffer pointer passed as an implicit argument. The attribute may be placed explicitly by the user, or inferred by the AMDGPU attributor by examining the call-graph. The attribute is inferred only if the function is not being sanitized, and the implictarg_ptr does not result in a load of any byte in the hostcall pointer argument. Reviewed By: jdoerfert, arsenm, kpyzhov Differential Revision: https://reviews.llvm.org/D119216	2022-02-11 22:51:56 +05:30
Adrian Kuegel	2219f9f57c	[mlir][MemRef] Fix MemRefCopyOpLowering to use correct number of bytes When lowering to memrefCopy call, the size for i1 type was calculated as 0. Instead of using getTypeSizeInBits() and dividing by 8, we should just use getTypeSize(). Differential Revision: https://reviews.llvm.org/D119540	2022-02-11 13:59:08 +01:00
Adrian Kuegel	5b02a48085	[mlir][MemRef] Fix MemRefCastOpLowering for 32 bit index type. The lowering creates llvm.insertvalue with the rank value, so it needs to use index type instead of 64 bit integer type. Otherwise, we get an error: llvm.insertvalue' op Type mismatch: cannot insert 'i64' into '!llvm.struct<(i32, ptr<i8>)>' Differential Revision: https://reviews.llvm.org/D119534	2022-02-11 12:37:15 +01:00
Arjun P	855cd847f7	[MLIR][Presburger] normalizeDivisionByGCD: fix bug when constant term is negative Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D119531	2022-02-11 17:02:52 +05:30
Markus Böck	1bf7921374	[mlir][LLVM] Add support for adding a garbage collector to a LLVM function This patch simply adds an optional garbage collector attribute to LLVMFuncOp which maps 1:1 to the "gc" property of functions in LLVM. Differential Revision: https://reviews.llvm.org/D119492	2022-02-11 10:23:51 +01:00
Mehdi Amini	b055e6d313	Add a new interface method `getAsmBlockName()` on OpAsmOpInterface to control block names This allows operations to control the block ids used by the printer in nested regions. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D115849	2022-02-11 08:46:08 +00:00
Uday Bondhugula	f2ff8a8e83	[MLIR] Add result status for normalizeAffineFor Add result status for normalizeAffineFor utility. Differential Revision: https://reviews.llvm.org/D119413	2022-02-11 11:29:52 +05:30
Thomas Raoux	5ab04bc068	[mlir][gpu] Add device side async copy operations Add new operations to the gpu dialect to represent device side asynchronous copies. This also add the lowering of those operations to nvvm dialect. Those ops are meant to be low level and map directly to llvm dialects like nvvm or rocdl. We can further add higher level of abstraction by building on top of those operations. This has been discuss here: https://discourse.llvm.org/t/modeling-gpu-async-copy-ampere-feature/4924 Differential Revision: https://reviews.llvm.org/D119191	2022-02-10 17:25:59 -08:00
River Riddle	ceb5dc55c2	[PDLL] Attempt to fix the gcc5 build by adding this-> to auto lambda	2022-02-10 16:59:03 -08:00
River Riddle	faf42264e5	[PDLL] Add support for user defined constraint and rewrite functions These functions allow for defining pattern fragments usable within the `match` and `rewrite` sections of a pattern. The main structure of Constraints and Rewrites functions are the same, and are similar to functions in other languages; they contain a signature (i.e. name, argument list, result list) and a body: ```pdll // Constraint that takes a value as an input, and produces a value: Constraint Cst(arg: Value) -> Value { ... } // Constraint that returns multiple values: Constraint Cst() -> (result1: Value, result2: ValueRange); ``` When returning multiple results, each result can be optionally be named (the result of a Constraint/Rewrite in the case of multiple results is a tuple). These body of a Constraint/Rewrite functions can be specified in several ways: * Externally In this case we are importing an external function (registered by the user outside of PDLL): ```pdll Constraint Foo(op: Op); Rewrite Bar(); ``` * In PDLL (using PDLL constructs) In this case, the body is defined using PDLL constructs: ```pdll Rewrite BuildFooOp() { // The result type of the Rewrite is inferred from the return. return op<my_dialect.foo>; } // Constraints/Rewrites can also implement a lambda/expression // body for simple one line bodies. Rewrite BuildFooOp() => op<my_dialect.foo>; ``` * In PDLL (using a native/C++ code block) In this case the body is specified using a C++(or potentially other language at some point) code block. When building PDLL in AOT mode this will generate a native constraint/rewrite and register it with the PDL bytecode. ```pdll Rewrite BuildFooOp() -> Op<my_dialect.foo> [{ return rewriter.create<my_dialect::FooOp>(...); }]; ``` Differential Revision: https://reviews.llvm.org/D115836	2022-02-10 12:48:59 -08:00
River Riddle	3d8b906012	[PDLL] Add support for single line lambda-like patterns This allows for defining simple patterns in a single line. The lambda body of a Pattern expects a single operation rewrite statement: ``` Pattern => replace op<my_dialect.foo>(operands: ValueRange) with operands; ``` Differential Revision: https://reviews.llvm.org/D115835	2022-02-10 12:48:58 -08:00
Krzysztof Drewniak	1ce314ce6b	[MLIR][GPU][lld] Use LLD bundled in ROCm, removing workaround Having clarified that executing the SerializeToHsaco pass can depend on a ROCm installation, switch from calling lld as a library to using the copy of lld guaranteed to be included in a ROCm install. This removes the workaround introduced in D119277 Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119463	2022-02-10 19:37:30 +00:00
Krzysztof Drewniak	c37b3e4108	[MLIR][GPU] Add now-required include to SerializeToHsaco Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119455	2022-02-10 18:36:38 +00:00
Nirvedh	ad9b5a4b8e	[mlir][vector] Add pattern to drop lead unit dim for Contraction Op If the result operand has a unit leading dim it is removed from all operands. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119206	2022-02-10 09:51:07 -08:00
Marius Brehler	44c1582265	[mlir] Add missing dep to new cf dialect	2022-02-10 14:15:20 +00:00
Lei Zhang	06a0385142	[mlir][linalg] Fold tensor.pad(linalg.fill) with the same value Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D119160	2022-02-10 08:39:35 -05:00
Matthias Springer	9b5a3d14b2	[mlir][vector] Add helper that builds a scalar reduction according to CombiningKind Differential Revision: https://reviews.llvm.org/D119433	2022-02-10 22:35:43 +09:00
Groverkss	4807587cf2	[MLIR][Presburger] Factor out space information to PresburgerSpace This patch factors out space information from IntegerPolyhedron, PresburgerSet and PWMAFunction to PresburgerSpace and its extension with local variables, PresburgerLocalSpace. Generally any new data structure additions in Presburger library will require space information. This patch removes the need to duplicate the space information. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D119280	2022-02-10 18:24:40 +05:30
Matthias Springer	fe0bf7d469	[mlir][vector][NFC] Use CombiningKindAttr instead of StringAttr This makes the op consistent with other ops in vector dialect. Differential Revision: https://reviews.llvm.org/D119343	2022-02-10 19:13:29 +09:00
Tres Popp	34ff99a0b7	Revert "[MLIR] Fix fold-memref-subview-ops for affine.load/store" This reverts commit `ac6cb41303`. This code has a stack-use-after-scope error that can be seen with asan.	2022-02-10 10:46:59 +01:00
Uday Bondhugula	ac6cb41303	[MLIR] Fix fold-memref-subview-ops for affine.load/store Fix fold-memref-subview-ops for affine.load/store. We need to expand out the affine apply on its operands. Differential Revision: https://reviews.llvm.org/D119402	2022-02-10 13:55:38 +05:30
Uday Bondhugula	8d12bf4ac1	[MLIR][NFC] Move expandAffineMap/Expr out to Affine utils Move expandAffineMap and expandAffineApplyExpr out to AffineUtils. This is a useful method. The child revision uses it. NFC. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119401	2022-02-10 09:56:26 +05:30
Uday Bondhugula	c2246eb893	[MLIR][NFC] Remove unused argument in affine scalrep helper util NFC. Remove unused argument in affine scalrep helper utility. Differential Revision: https://reviews.llvm.org/D119397	2022-02-10 08:25:55 +05:30
Rainer Orth	9159675535	[MLIR][Presburger] Disambiguate call to floor While testing LLVM 14.0.0 rc1 on Solaris, compilation of `FAIL`ed with /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/Analysis/Presburger/Utils.cpp: In lambda function: /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/Analysis/Presburger/Utils.cpp:48:58: error: call of overloaded ‘floor(int64_t)’ is ambiguous 48 \| [gcd](int64_t &n) { return floor(n / gcd); }); \| ^ ... /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:201:21: note: candidate: ‘long double std::floor(long double)’ 201 \| inline long double floor(long double __X) { return __floorl(__X); } \| ^~~~~ /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:165:15: note: candidate: ‘float std::floor(float)’ 165 \| inline float floor(float __X) { return __floorf(__X); } \| ^~~~~ /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:78:15: note: candidate: ‘double std::floor(double)’ 78 \| extern double floor __P((double)); \| ^~~~~ The same issue had already occured in the past, cf. D108750 <https://reviews.llvm.org/D108750>, and the solution is the same: cast the `floor` arg to `double`. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D119324	2022-02-09 22:01:55 +01:00
Rainer Orth	d2215e79ac	[mlir][sparse] Rename index_t to index_type again While testing LLVM 14.0.0 rc1 on Solaris, I ran into a compile failure: from /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/ExecutionEngine/SparseTensorUtils.cpp:22: /usr/include/sys/types.h:103:16: error: conflicting declaration ‘typedef short int index_t’ 103 \| typedef short index_t; \| ^~~~~~~ In file included from /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/ExecutionEngine/SparseTensorUtils.cpp:17: /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/include/mlir/ExecutionEngine/SparseTensorUtils.h:26:7: note: previous declaration as ‘using index_t = uint64_t’ 26 \| using index_t = uint64_t; \| ^~~~~~~ The same issue had already occured in the past and fixed in D72619 <https://reviews.llvm.org/D72619>. More detailed explanation can also be found there. Tested on `amd64-pc-solaris2.11` and `sparcv9-solaris2.11`. Differential Revision: https://reviews.llvm.org/D119323	2022-02-09 21:59:52 +01:00
Matthias Springer	69f7647158	[mlir][GPU] Add ShuffleOp builder for constant offset/width Differential Revision: https://reviews.llvm.org/D119345	2022-02-10 02:55:44 +09:00
Alexander Belyaev	c962038914	[mlir][nfc] Expose linalg tiling helpers. Differential Revision: https://reviews.llvm.org/D119330	2022-02-09 15:26:06 +01:00
Matthias Springer	585a8a321c	[mlir][bufferize] OpOperands can have multiple aliasing OpResults This makes getAliasingOpResult symmetric to getAliasingOpOperand. The previous implementation was confusing for users and implemented in such a way only because there are currently no bufferizable ops that have multiple aliasing OpResults. Differential Revision: https://reviews.llvm.org/D119259	2022-02-09 20:58:45 +09:00
Matthias Springer	22a1973dbe	[mlir][linalg][bufferize] Print results of FuncOp read/write analysis Print more information with test-analysis-only. Differential Revision: https://reviews.llvm.org/D119118	2022-02-09 20:52:38 +09:00
Matthias Springer	f30ec8f627	[mlir][linalg][bufferize][NFC] Allow passing custom BufferizationOptions to pass Differential Revision: https://reviews.llvm.org/D118891	2022-02-09 19:15:31 +09:00
Matthias Springer	cdb7675c26	[mlir][bufferize][NFC] Make PostAnalysisSteps a function They used to be classes with a virtual `run` function. This was inconvenient because post analysis steps are stored in BufferizationOptions. Because of this design choice, BufferizationOptions were not copyable. Differential Revision: https://reviews.llvm.org/D119258	2022-02-09 18:56:06 +09:00
Alexandre Ganea	1e661e583d	[MLIR] Temporary workaround for calling the LLD ELF driver as-a-lib This fixes the situation described in https://github.com/llvm/llvm-project/issues/53475 with a repro exposed by https://github.com/ROCmSoftwarePlatform/D108850-lld-bug-reproduction This is purposely just a workaround to unblock users. This could be transplanted to the release/14.x branch if need be. A proper fix will later be provided in https://reviews.llvm.org/D119049. Differential Revision: https://reviews.llvm.org/D119277	2022-02-08 19:12:15 -05:00
Jacques Pienaar	bbddd19ec7	[mlir][math] Expand coverage of atan2 expansion Reuse the higher precision F32 approximation for the F16 one (by expanding and truncating). This is partly RFC as I'm not sure what the expectations are here (e.g., these are only for F32 and should not be expanded, that reusing higher-precision ones for lower precision is undesirable due to increased compute cost and only approximations per exact type is preferred, or this is appropriate [at least as fallback] but we need to see how to make it more generic across all the patterns here). Differential Revision: https://reviews.llvm.org/D118968	2022-02-08 15:00:39 -08:00
Mogball	07486395d2	[mlir][ods] Optional Attribute or Type Parameters Implements optional attribute or type parameters, including support for such parameters in the assembly format `struct` directive. Also implements optional groups. Depends on D117971 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118208	2022-02-08 20:09:44 +00:00
harsh	4a876b13fb	Add case to handle 0-D vectors in FlattenContiguousRowMajorTransferWritePattern and FlattenContiguousRowMajorTransferReadPattern. For 0-D as well as 1-D vectors, both these patterns should return a failure as there is no need to collapse the shape of the source. Currently, only 1-D vectors were handled. This patch handles the 0-D case as well. Reviewed By: Benoit, ThomasRaoux Differential Revision: https://reviews.llvm.org/D119202	2022-02-08 20:00:12 +00:00
Krzysztof Drewniak	24a1869d00	[MLIR][GPU] Update GPUToROCDL to account for ControlFlow dialect The conversion to the new ControlFlow dialect didn't change the GPUToROCDL pass - this commit fixes this issue. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119188	2022-02-08 16:34:34 +00:00
Arjun P	1096fcff7d	[MLIR][Presburger] Support computing volumes via hyperrectangular overapproximation Add support for computing an overapproximation of the number of integer points in a polyhedron. The returned result is actually the number of integer points one gets by computing the "rational shadow" obtained by projecting out the local IDs, finding the minimal axis-parallel hyperrectangular approximation of the shadow, and returning the number of integer points in that. This does not currently support symbols. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D119228	2022-02-08 21:06:49 +05:30
Arjun P	738c738b44	[MLIR][Presburger] Simplex::computeIntegerBounds: support unbounded directions by returning Optionals	2022-02-08 20:57:18 +05:30
Tres Popp	64b918852c	Remove restriction on static dimensions in Shape method mlir::shape::ToExtentTensorOp::areCastCompatible didn't allow the input to have a static dimension, but that is allowed.	2022-02-08 11:20:01 +01:00
Cullen Rhodes	99d95025e1	[mlir][Affine][Vector] NFC: fix examples in comments s/-affine-vectorize/-affine-super-vectorize/g Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D118892	2022-02-08 10:03:32 +00:00
River Riddle	2418cd92c0	[mlir] Update uses of `parser`/`printer` ODS op field to `hasCustomAssemblyFormat` The parser/printer fields are deprecated and in the process of being removed.	2022-02-07 19:03:58 -08:00
River Riddle	60cac0c081	[mlir][NFC] Remove deprecated/old build/fold/parser utilities from OpDefinition These have generally been replaced by better ODS functionality, and do not need to be explicitly provided anymore. Differential Revision: https://reviews.llvm.org/D119065	2022-02-07 19:03:58 -08:00
River Riddle	3c69bc4d6e	[mlir][NFC] Remove a few op builders that simply swap parameter order Differential Revision: https://reviews.llvm.org/D119093	2022-02-07 19:03:57 -08:00
Sergei Grechanik	bb39ad43ce	[mlir][spirv] Fix verification of nested array constants Fix the verification function of spirv::ConstantOp to allow nesting array attributes. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D118939	2022-02-07 13:48:53 -08:00
Arjun P	d5a2944219	[MLIR][Presburger] Add support for piece-wise multi-affine functions Add the class MultiAffineFunction which represents functions whose domain is an IntegerPolyhedron and which produce an output given by a tuple of affine expressions in the IntegerPolyhedron's ids. Also add support for piece-wise MultiAffineFunctions, which are defined on a union of IntegerPolyhedrons, and may have different output affine expressions on each IntegerPolyhedron. Thus the function is affine on each individual IntegerPolyhedron piece in the domain. This is part of a series of patches leading up to parametric integer programming. Depends on D118778. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D118779	2022-02-08 00:43:59 +05:30
Matthias Springer	9aa74347d5	[mlir][SCF] Further simplify affine maps during `for-loop-canonicalization` * Implement `FlatAffineConstraints::getConstantBound(EQ)`. * Inject a simpler constraint for loops that have at most 1 iteration. * Taking into account constant EQ bounds of FlatAffineConstraint dims/symbols during canonicalization of the resulting affine map in `canonicalizeMinMaxOp`. Differential Revision: https://reviews.llvm.org/D119153	2022-02-08 02:40:08 +09:00
Benjamin Kramer	6635c12ada	[mlir] Use SmallBitVector instead of SmallDenseSet for AffineMap::compressSymbols This is both more efficient and more ergonomic to use, as inverting a bit vector is trivial while inverting a set is annoying. Sadly this leaks into a bunch of APIs downstream, so adapt them as well. This would be NFC, but there is an ordering dependency in MemRefOps's computeMemRefRankReductionMask. This is now deterministic, previously it was dependent on SmallDenseSet's unspecified iteration order. Differential Revision: https://reviews.llvm.org/D119076	2022-02-07 00:21:44 +01:00
River Riddle	330838eb90	[mlir] Fix GpuToLLVM conversion pass after ControlFlow operations were split from Standard	2022-02-06 15:10:03 -08:00
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00
Eugene Zhulenev	edca177cbe	[mlir] Add canonicalizer to remove redundant shape.cstr_broadcastable ops Depends On D119025 Reviewed By: frgossen Differential Revision: https://reviews.llvm.org/D119043	2022-02-06 14:46:42 -08:00
Mehdi Amini	0d8850ae2c	Remove dead forward declaration (NFC)	2022-02-06 19:48:46 +00:00
Arjun P	8a98c3e07f	[MLIR][Presburger] MaybeLocalRepr: add explicit bool() for convenience This also slightly simplifies some code. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D118790	2022-02-05 19:33:01 +05:30
Groverkss	2845ed29d4	[MLIR][Presburger][NFC] Use getters for IntegerPolyhedron members This patch makes IntegerPolyhedron and derived classes use of getters to access IntegerPolyhedron space information (`numIds, numDims, numSymbols`) instead of directly accessing them. This patch makes it easier to change the underlying implementation of the way identifiers are stored, making it easier to extend/modify existing implementation. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D118888	2022-02-05 18:08:15 +05:30
Groverkss	070bc9c1fb	[MLIR][Presburger][NFC] Fix clang-tidy warnings This patch changes variable naming to lowerCamelCase to remove clang-tidy warning in Presburger/Utils.cpp.	2022-02-05 11:59:21 +05:30
Eugene Zhulenev	981f0a14f1	[mlir] Add canonicalizer to merge shape.assuming_all ops Depends On D119021 Reviewed By: frgossen Differential Revision: https://reviews.llvm.org/D119025	2022-02-04 15:27:37 -08:00

1 2 3 4 5 ...

7711 Commits