llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	9106d35b91	[mlir][bufferize] Use rewriter instead of replacing all uses directly This is important for compatibility with DialectConversion.	2022-02-12 02:35:36 +09:00
Sameer Sahasrabuddhe	d8f99bb6e0	[AMDGPU] replace hostcall module flag with function attribute The module flag to indicate use of hostcall is insufficient to catch all cases where hostcall might be in use by a kernel. This is now replaced by a function attribute that gets propagated to top-level kernel functions via their respective call-graph. If the attribute "amdgpu-no-hostcall-ptr" is absent on a kernel, the default behaviour is to emit kernel metadata indicating that the kernel uses the hostcall buffer pointer passed as an implicit argument. The attribute may be placed explicitly by the user, or inferred by the AMDGPU attributor by examining the call-graph. The attribute is inferred only if the function is not being sanitized, and the implictarg_ptr does not result in a load of any byte in the hostcall pointer argument. Reviewed By: jdoerfert, arsenm, kpyzhov Differential Revision: https://reviews.llvm.org/D119216	2022-02-11 22:51:56 +05:30
Adrian Kuegel	2219f9f57c	[mlir][MemRef] Fix MemRefCopyOpLowering to use correct number of bytes When lowering to memrefCopy call, the size for i1 type was calculated as 0. Instead of using getTypeSizeInBits() and dividing by 8, we should just use getTypeSize(). Differential Revision: https://reviews.llvm.org/D119540	2022-02-11 13:59:08 +01:00
Adrian Kuegel	5b02a48085	[mlir][MemRef] Fix MemRefCastOpLowering for 32 bit index type. The lowering creates llvm.insertvalue with the rank value, so it needs to use index type instead of 64 bit integer type. Otherwise, we get an error: llvm.insertvalue' op Type mismatch: cannot insert 'i64' into '!llvm.struct<(i32, ptr<i8>)>' Differential Revision: https://reviews.llvm.org/D119534	2022-02-11 12:37:15 +01:00
Arjun P	855cd847f7	[MLIR][Presburger] normalizeDivisionByGCD: fix bug when constant term is negative Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D119531	2022-02-11 17:02:52 +05:30
Markus Böck	1bf7921374	[mlir][LLVM] Add support for adding a garbage collector to a LLVM function This patch simply adds an optional garbage collector attribute to LLVMFuncOp which maps 1:1 to the "gc" property of functions in LLVM. Differential Revision: https://reviews.llvm.org/D119492	2022-02-11 10:23:51 +01:00
Mehdi Amini	b055e6d313	Add a new interface method `getAsmBlockName()` on OpAsmOpInterface to control block names This allows operations to control the block ids used by the printer in nested regions. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D115849	2022-02-11 08:46:08 +00:00
Uday Bondhugula	f2ff8a8e83	[MLIR] Add result status for normalizeAffineFor Add result status for normalizeAffineFor utility. Differential Revision: https://reviews.llvm.org/D119413	2022-02-11 11:29:52 +05:30
Thomas Raoux	5ab04bc068	[mlir][gpu] Add device side async copy operations Add new operations to the gpu dialect to represent device side asynchronous copies. This also adds the lowering of those operations to the nvvm dialect. Those ops are meant to be low level and map directly to llvm dialects like nvvm or rocdl. We can further add higher levels of abstraction by building on top of those operations. This has been discussed here: https://discourse.llvm.org/t/modeling-gpu-async-copy-ampere-feature/4924 Differential Revision: https://reviews.llvm.org/D119191	2022-02-10 17:25:59 -08:00
River Riddle	ceb5dc55c2	[PDLL] Attempt to fix the gcc5 build by adding this-> to auto lambda	2022-02-10 16:59:03 -08:00
River Riddle	faf42264e5	[PDLL] Add support for user defined constraint and rewrite functions These functions allow for defining pattern fragments usable within the `match` and `rewrite` sections of a pattern. The main structure of Constraint and Rewrite functions is the same, and is similar to functions in other languages; they contain a signature (i.e. name, argument list, result list) and a body: ```pdll // Constraint that takes a value as an input, and produces a value: Constraint Cst(arg: Value) -> Value { ... } // Constraint that returns multiple values: Constraint Cst() -> (result1: Value, result2: ValueRange); ``` When returning multiple results, each result can optionally be named (the result of a Constraint/Rewrite in the case of multiple results is a tuple). The body of a Constraint/Rewrite function can be specified in several ways: * Externally In this case we are importing an external function (registered by the user outside of PDLL): ```pdll Constraint Foo(op: Op); Rewrite Bar(); ``` * In PDLL (using PDLL constructs) In this case, the body is defined using PDLL constructs: ```pdll Rewrite BuildFooOp() { // The result type of the Rewrite is inferred from the return. return op<my_dialect.foo>; } // Constraints/Rewrites can also implement a lambda/expression // body for simple one line bodies. Rewrite BuildFooOp() => op<my_dialect.foo>; ``` * In PDLL (using a native/C++ code block) In this case the body is specified using a C++ (or potentially other language at some point) code block. When building PDLL in AOT mode this will generate a native constraint/rewrite and register it with the PDL bytecode. ```pdll Rewrite BuildFooOp() -> Op<my_dialect.foo> [{ return rewriter.create<my_dialect::FooOp>(...); }]; ``` Differential Revision: https://reviews.llvm.org/D115836	2022-02-10 12:48:59 -08:00
River Riddle	3d8b906012	[PDLL] Add support for single line lambda-like patterns This allows for defining simple patterns in a single line. The lambda body of a Pattern expects a single operation rewrite statement: ``` Pattern => replace op<my_dialect.foo>(operands: ValueRange) with operands; ``` Differential Revision: https://reviews.llvm.org/D115835	2022-02-10 12:48:58 -08:00
Krzysztof Drewniak	1ce314ce6b	[MLIR][GPU][lld] Use LLD bundled in ROCm, removing workaround Having clarified that executing the SerializeToHsaco pass can depend on a ROCm installation, switch from calling lld as a library to using the copy of lld guaranteed to be included in a ROCm install. This removes the workaround introduced in D119277 Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119463	2022-02-10 19:37:30 +00:00
Krzysztof Drewniak	c37b3e4108	[MLIR][GPU] Add now-required include to SerializeToHsaco Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119455	2022-02-10 18:36:38 +00:00
Nirvedh	ad9b5a4b8e	[mlir][vector] Add pattern to drop lead unit dim for Contraction Op If the result operand has a unit leading dim it is removed from all operands. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119206	2022-02-10 09:51:07 -08:00
Marius Brehler	44c1582265	[mlir] Add missing dep to new cf dialect	2022-02-10 14:15:20 +00:00
Lei Zhang	06a0385142	[mlir][linalg] Fold tensor.pad(linalg.fill) with the same value Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D119160	2022-02-10 08:39:35 -05:00
Matthias Springer	9b5a3d14b2	[mlir][vector] Add helper that builds a scalar reduction according to CombiningKind Differential Revision: https://reviews.llvm.org/D119433	2022-02-10 22:35:43 +09:00
Groverkss	4807587cf2	[MLIR][Presburger] Factor out space information to PresburgerSpace This patch factors out space information from IntegerPolyhedron, PresburgerSet and PWMAFunction to PresburgerSpace and its extension with local variables, PresburgerLocalSpace. Generally any new data structure additions in Presburger library will require space information. This patch removes the need to duplicate the space information. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D119280	2022-02-10 18:24:40 +05:30
Matthias Springer	fe0bf7d469	[mlir][vector][NFC] Use CombiningKindAttr instead of StringAttr This makes the op consistent with other ops in vector dialect. Differential Revision: https://reviews.llvm.org/D119343	2022-02-10 19:13:29 +09:00
Tres Popp	34ff99a0b7	Revert "[MLIR] Fix fold-memref-subview-ops for affine.load/store" This reverts commit `ac6cb41303`. This code has a stack-use-after-scope error that can be seen with asan.	2022-02-10 10:46:59 +01:00
Uday Bondhugula	ac6cb41303	[MLIR] Fix fold-memref-subview-ops for affine.load/store Fix fold-memref-subview-ops for affine.load/store. We need to expand out the affine apply on its operands. Differential Revision: https://reviews.llvm.org/D119402	2022-02-10 13:55:38 +05:30
Uday Bondhugula	8d12bf4ac1	[MLIR][NFC] Move expandAffineMap/Expr out to Affine utils Move expandAffineMap and expandAffineApplyExpr out to AffineUtils. This is a useful method. The child revision uses it. NFC. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119401	2022-02-10 09:56:26 +05:30
Uday Bondhugula	c2246eb893	[MLIR][NFC] Remove unused argument in affine scalrep helper util NFC. Remove unused argument in affine scalrep helper utility. Differential Revision: https://reviews.llvm.org/D119397	2022-02-10 08:25:55 +05:30
Rainer Orth	9159675535	[MLIR][Presburger] Disambiguate call to floor While testing LLVM 14.0.0 rc1 on Solaris, compilation of `FAIL`ed with /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/Analysis/Presburger/Utils.cpp: In lambda function: /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/Analysis/Presburger/Utils.cpp:48:58: error: call of overloaded ‘floor(int64_t)’ is ambiguous 48 \| [gcd](int64_t &n) { return floor(n / gcd); }); \| ^ ... /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:201:21: note: candidate: ‘long double std::floor(long double)’ 201 \| inline long double floor(long double __X) { return __floorl(__X); } \| ^~~~~ /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:165:15: note: candidate: ‘float std::floor(float)’ 165 \| inline float floor(float __X) { return __floorf(__X); } \| ^~~~~ /usr/gcc/10/lib/gcc/sparcv9-sun-solaris2.11/10.3.0/include-fixed/iso/math_iso.h:78:15: note: candidate: ‘double std::floor(double)’ 78 \| extern double floor __P((double)); \| ^~~~~ The same issue had already occurred in the past, cf. D108750 <https://reviews.llvm.org/D108750>, and the solution is the same: cast the `floor` arg to `double`. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D119324	2022-02-09 22:01:55 +01:00
Rainer Orth	d2215e79ac	[mlir][sparse] Rename index_t to index_type again While testing LLVM 14.0.0 rc1 on Solaris, I ran into a compile failure: from /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/ExecutionEngine/SparseTensorUtils.cpp:22: /usr/include/sys/types.h:103:16: error: conflicting declaration ‘typedef short int index_t’ 103 \| typedef short index_t; \| ^~~~~~~ In file included from /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/lib/ExecutionEngine/SparseTensorUtils.cpp:17: /var/llvm/llvm-14.0.0-rc1/rc1/llvm-project/mlir/include/mlir/ExecutionEngine/SparseTensorUtils.h:26:7: note: previous declaration as ‘using index_t = uint64_t’ 26 \| using index_t = uint64_t; \| ^~~~~~~ The same issue had already occurred in the past and was fixed in D72619 <https://reviews.llvm.org/D72619>. A more detailed explanation can also be found there. Tested on `amd64-pc-solaris2.11` and `sparcv9-solaris2.11`. Differential Revision: https://reviews.llvm.org/D119323	2022-02-09 21:59:52 +01:00
Matthias Springer	69f7647158	[mlir][GPU] Add ShuffleOp builder for constant offset/width Differential Revision: https://reviews.llvm.org/D119345	2022-02-10 02:55:44 +09:00
Alexander Belyaev	c962038914	[mlir][nfc] Expose linalg tiling helpers. Differential Revision: https://reviews.llvm.org/D119330	2022-02-09 15:26:06 +01:00
Matthias Springer	585a8a321c	[mlir][bufferize] OpOperands can have multiple aliasing OpResults This makes getAliasingOpResult symmetric to getAliasingOpOperand. The previous implementation was confusing for users and implemented in such a way only because there are currently no bufferizable ops that have multiple aliasing OpResults. Differential Revision: https://reviews.llvm.org/D119259	2022-02-09 20:58:45 +09:00
Matthias Springer	22a1973dbe	[mlir][linalg][bufferize] Print results of FuncOp read/write analysis Print more information with test-analysis-only. Differential Revision: https://reviews.llvm.org/D119118	2022-02-09 20:52:38 +09:00
Matthias Springer	f30ec8f627	[mlir][linalg][bufferize][NFC] Allow passing custom BufferizationOptions to pass Differential Revision: https://reviews.llvm.org/D118891	2022-02-09 19:15:31 +09:00
Matthias Springer	cdb7675c26	[mlir][bufferize][NFC] Make PostAnalysisSteps a function They used to be classes with a virtual `run` function. This was inconvenient because post analysis steps are stored in BufferizationOptions. Because of this design choice, BufferizationOptions were not copyable. Differential Revision: https://reviews.llvm.org/D119258	2022-02-09 18:56:06 +09:00
Alexandre Ganea	1e661e583d	[MLIR] Temporary workaround for calling the LLD ELF driver as-a-lib This fixes the situation described in https://github.com/llvm/llvm-project/issues/53475 with a repro exposed by https://github.com/ROCmSoftwarePlatform/D108850-lld-bug-reproduction This is purposely just a workaround to unblock users. This could be transplanted to the release/14.x branch if need be. A proper fix will later be provided in https://reviews.llvm.org/D119049. Differential Revision: https://reviews.llvm.org/D119277	2022-02-08 19:12:15 -05:00
Jacques Pienaar	bbddd19ec7	[mlir][math] Expand coverage of atan2 expansion Reuse the higher precision F32 approximation for the F16 one (by expanding and truncating). This is partly RFC as I'm not sure what the expectations are here (e.g., these are only for F32 and should not be expanded, that reusing higher-precision ones for lower precision is undesirable due to increased compute cost and only approximations per exact type is preferred, or this is appropriate [at least as fallback] but we need to see how to make it more generic across all the patterns here). Differential Revision: https://reviews.llvm.org/D118968	2022-02-08 15:00:39 -08:00
Mogball	07486395d2	[mlir][ods] Optional Attribute or Type Parameters Implements optional attribute or type parameters, including support for such parameters in the assembly format `struct` directive. Also implements optional groups. Depends on D117971 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118208	2022-02-08 20:09:44 +00:00
harsh	4a876b13fb	Add case to handle 0-D vectors in FlattenContiguousRowMajorTransferWritePattern and FlattenContiguousRowMajorTransferReadPattern. For 0-D as well as 1-D vectors, both these patterns should return a failure as there is no need to collapse the shape of the source. Currently, only 1-D vectors were handled. This patch handles the 0-D case as well. Reviewed By: Benoit, ThomasRaoux Differential Revision: https://reviews.llvm.org/D119202	2022-02-08 20:00:12 +00:00
Krzysztof Drewniak	24a1869d00	[MLIR][GPU] Update GPUToROCDL to account for ControlFlow dialect The conversion to the new ControlFlow dialect didn't change the GPUToROCDL pass - this commit fixes this issue. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119188	2022-02-08 16:34:34 +00:00
Arjun P	1096fcff7d	[MLIR][Presburger] Support computing volumes via hyperrectangular overapproximation Add support for computing an overapproximation of the number of integer points in a polyhedron. The returned result is actually the number of integer points one gets by computing the "rational shadow" obtained by projecting out the local IDs, finding the minimal axis-parallel hyperrectangular approximation of the shadow, and returning the number of integer points in that. This does not currently support symbols. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D119228	2022-02-08 21:06:49 +05:30
Arjun P	738c738b44	[MLIR][Presburger] Simplex::computeIntegerBounds: support unbounded directions by returning Optionals	2022-02-08 20:57:18 +05:30
Tres Popp	64b918852c	Remove restriction on static dimensions in Shape method mlir::shape::ToExtentTensorOp::areCastCompatible didn't allow the input to have a static dimension, but that is allowed.	2022-02-08 11:20:01 +01:00
Cullen Rhodes	99d95025e1	[mlir][Affine][Vector] NFC: fix examples in comments s/-affine-vectorize/-affine-super-vectorize/g Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D118892	2022-02-08 10:03:32 +00:00
River Riddle	2418cd92c0	[mlir] Update uses of `parser`/`printer` ODS op field to `hasCustomAssemblyFormat` The parser/printer fields are deprecated and in the process of being removed.	2022-02-07 19:03:58 -08:00
River Riddle	60cac0c081	[mlir][NFC] Remove deprecated/old build/fold/parser utilities from OpDefinition These have generally been replaced by better ODS functionality, and do not need to be explicitly provided anymore. Differential Revision: https://reviews.llvm.org/D119065	2022-02-07 19:03:58 -08:00
River Riddle	3c69bc4d6e	[mlir][NFC] Remove a few op builders that simply swap parameter order Differential Revision: https://reviews.llvm.org/D119093	2022-02-07 19:03:57 -08:00
Sergei Grechanik	bb39ad43ce	[mlir][spirv] Fix verification of nested array constants Fix the verification function of spirv::ConstantOp to allow nesting array attributes. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D118939	2022-02-07 13:48:53 -08:00
Arjun P	d5a2944219	[MLIR][Presburger] Add support for piece-wise multi-affine functions Add the class MultiAffineFunction which represents functions whose domain is an IntegerPolyhedron and which produce an output given by a tuple of affine expressions in the IntegerPolyhedron's ids. Also add support for piece-wise MultiAffineFunctions, which are defined on a union of IntegerPolyhedrons, and may have different output affine expressions on each IntegerPolyhedron. Thus the function is affine on each individual IntegerPolyhedron piece in the domain. This is part of a series of patches leading up to parametric integer programming. Depends on D118778. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D118779	2022-02-08 00:43:59 +05:30
Matthias Springer	9aa74347d5	[mlir][SCF] Further simplify affine maps during `for-loop-canonicalization` * Implement `FlatAffineConstraints::getConstantBound(EQ)`. * Inject a simpler constraint for loops that have at most 1 iteration. * Taking into account constant EQ bounds of FlatAffineConstraint dims/symbols during canonicalization of the resulting affine map in `canonicalizeMinMaxOp`. Differential Revision: https://reviews.llvm.org/D119153	2022-02-08 02:40:08 +09:00
Benjamin Kramer	6635c12ada	[mlir] Use SmallBitVector instead of SmallDenseSet for AffineMap::compressSymbols This is both more efficient and more ergonomic to use, as inverting a bit vector is trivial while inverting a set is annoying. Sadly this leaks into a bunch of APIs downstream, so adapt them as well. This would be NFC, but there is an ordering dependency in MemRefOps's computeMemRefRankReductionMask. This is now deterministic, previously it was dependent on SmallDenseSet's unspecified iteration order. Differential Revision: https://reviews.llvm.org/D119076	2022-02-07 00:21:44 +01:00
River Riddle	330838eb90	[mlir] Fix GpuToLLVM conversion pass after ControlFlow operations were split from Standard	2022-02-06 15:10:03 -08:00
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00

7617 Commits