llvm-project

Commit Graph

Author	SHA1	Message	Date
lorenzo chelini	2a3c07f897	[MLIR][Math] Re-order conversions alphabetically (NFC) Minor follow-up after: D127286 (https://reviews.llvm.org/D127286/new/) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D127382	2022-06-09 09:12:13 +02:00
Mogball	971e13d69e	[mlir][ods] Mark StructAttr as deprecated	2022-06-09 03:23:31 +00:00
bixia1	ff96d434d0	[mlir][sparse] Fix a problem introduced by the PR for reading complex number. The problem is in function isValid. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D127349	2022-06-08 15:01:50 -07:00
bixia1	5b1c5fc53a	[mlir][sparse] Add complex number reading from files. Support complex numbers for Matrix Market Exchange Formats. Add a test case. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D127138	2022-06-08 13:33:35 -07:00
Arjun P	4bf9cbc408	[MLIR][Presburger] subtract: improve redundant constraint detection When constraints in the two operands make each other redundant, prefer constraints of the second because this affects the number of sets in the output at each level; reducing these can help prevent exponential blowup. This is accomplished by adding extra overloads to Simplex::detectRedundant that only scan a subrange of the constraints for redundancy. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D127237	2022-06-08 14:44:31 -04:00
wren romano	0371ddf9ad	[mlir] Refactoring the tablegen Tensor types Reduces repetition in tablegen files for defining various tensor types. In particular the goal is to reduce the repetition when defining new tensor types (e.g., D126994). Reviewed By: aartbik, rriddle Differential Revision: https://reviews.llvm.org/D127039	2022-06-08 11:33:48 -07:00
bixia1	6c6eddb617	[mlir] Lower complex.power and complex.rsqrt to standard dialect. Add conversion tests and correctness tests. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D127255	2022-06-08 10:53:53 -07:00
dime10	4f55ed5a1e	Add Python bindings for the OpaqueType Implement the C-API and Python bindings for the builtin opaque type, which was previously missing. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D127303	2022-06-08 19:51:00 +02:00
Mogball	ee70039ae2	[mlir] Fix handling of some region branch terminator successors When `RegionBranchOpInterface::getSuccessorRegions` is called for anything other than the parent op, it expects the operands of the terminator of the source region to be passed, not the operands of the parent op. This was not always respected. This fixes a bug in integer range inference and ForwardDataFlowSolver and changes `scf.while` to allow narrowing of successors using constant inputs. Fixes #55873 Reviewed By: mehdi_amini, krzysz00 Differential Revision: https://reviews.llvm.org/D127261	2022-06-08 17:17:03 +00:00
bixia1	ea8ed5cbcf	[mlir][sparse] Add F16 and BF16. This is the first PR to add `F16` and `BF16` support to the sparse codegen. There are still problems in supporting these two data types, such as `BF16` is not quite working yet. Add tests cases. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D127010	2022-06-08 09:51:05 -07:00
Lei Zhang	2dfefe0283	[mlir][spirv] NFC: fix typo in UnifyAliasedResourcePass pass Reviewed By: ThomasRaoux, hanchung Differential Revision: https://reviews.llvm.org/D127265	2022-06-08 08:18:12 -07:00
lorenzo chelini	a0fc94ab61	[MLIR][Math] Add round operation Introduce RoundOp in the math dialect. The operation rounds the operand to the nearest integer value in floating-point format. RoundOp lowers to LLVM intrinsics 'llvm.intr.round' or as a function call to libm (round or roundf). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D127286	2022-06-08 13:07:39 +02:00
Matthias Springer	032be23309	[mlir][bufferize] Improve buffer writability analysis Find writability conflicts (writes to buffers that are not allowed to be written to) by checking SSA use-def chains. This is better than the current writability analysis, which is too conservative and finds false positives. Differential Revision: https://reviews.llvm.org/D127256	2022-06-08 10:11:52 +02:00
Benjamin Kramer	6eb0f8e285	[mlir][MemRef] Fix a crash when expanding a scalar shape In this case the reassociation is empty, yielding no strides for the result type. Differential Revision: https://reviews.llvm.org/D127232	2022-06-08 09:37:40 +02:00
lorenzo chelini	d48479791f	[MLIR][SCF] Improve doc (NFC)	2022-06-08 08:46:36 +02:00
Nathan Lanza	f46ce03734	[MLIR] Add an install target for mlir-libraries This is required for the distribution system for installing the mlir-libraries component. This is copied from clang's equivalent feature. Differential Revision: https://reviews.llvm.org/D126837	2022-06-07 22:57:07 -04:00
Aart Bik	7482cd6869	[mlir][sparse] updated our sparse dialect doc with some recent changes The `init` and `tensor` ops are renamed (and one moved to another dialect). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D127169	2022-06-07 14:27:57 -07:00
Christopher Bate	53fe155b3f	Revert "[mlir][vector] Allow unroll of contraction in arbitrary order" Reverts commit `1469ebf838` (original commit) Reverts commit `a392a39f75` (build fix for above commit) The commit broke tests in out-of-tree projects, indicating that some logical error was made in the previous change but not covered by current tests.	2022-06-07 14:54:01 -06:00
Groverkss	445e2b2aa0	[MLIR][Presburger] Fix subtract processing extra inequalities This patch fixes a bug in PresburgeRelation::subtract that made it process the inequality at index 0, multiple times. This was caused by allocating memory instead of reserving memory in llvm::SmallVector. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D127228	2022-06-07 22:51:03 +05:30
Kiran Chandramohan	dd32bf9a77	[Flang,MLIR,OpenMP] Fix a few tests that were not converting to LLVM A few OpenMP tests were retaining the FIR operands even after running the LLVM conversion pass. To fix these tests the legality checkes for OpenMP conversion are made stricter to include operands and results. The Flush, Single and Sections operations are added to conversions or legality checks. The RegionLessOpConversion is appropriately renamed to clarify that it works only for operations with Variable operands. The operands of the flush operation are changed to match those of Variable Operands. Fix for an OpenMP issue mentioned in https://github.com/llvm/llvm-project/issues/55210. Reviewed By: shraiysh, peixin, awarzynski Differential Revision: https://reviews.llvm.org/D127092	2022-06-07 09:55:53 +00:00
Alex Zinenko	3326eddcd1	[mlir] fix documentation format in SCF Four leading spaces are interpreted as a code block in markdown. Unless used consistently in ODS op description, they cannot be stripped away by the tablegen backend, which results in malformed markdown being generated.	2022-06-07 11:51:24 +02:00
Alexander Batashev	8324561e33	[mlir][spirv] Correctly deduce PhysicalStorageBuffer64 addressing model According to the SPIR-V specification[1], PhysicalStorageBuffer storage class can only be used iff addressing model is PhysicalStorageBuffer64. [1]: https://www.khronos.org/registry/SPIR-V/specs/unified1/SPIRV.html#_addressing_model Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D127067	2022-06-07 12:14:38 +03:00
lorenzo chelini	9b3712e0bf	[MLIR][LLVMIR] Add round intrinsic Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126879	2022-06-07 10:27:55 +02:00
lewuathe	62a34f6a6f	[mlir][complex] Add complex.conj op Add complex.conj op to calculate the complex conjugate which is widely used for the mathematical operation on the complex space. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D127181	2022-06-07 09:38:35 +02:00
lorenzo chelini	2cbf0b3dc6	[MLIR][SCF] Fix top-level comment (NFC)	2022-06-07 08:52:11 +02:00
River Riddle	a3a4f0335f	[vscode-mlir] Bump to version 0.9 Since version 0.8 we've added: * Switched PDLL and TableGen to use incremental doc updates * Added support to PDLL for inlay hints	2022-06-06 20:20:19 -07:00
River Riddle	5919eab55c	[mlir:PDLL] Add support for inlay hints These allow for displaying additional inline information, such as the types of variables, names operands/results, constraint/rewrite arguments, etc. This requires a bump in the vscode extension to a newer version, as inlay hints are a new LSP feature. Differential Revision: https://reviews.llvm.org/D126033	2022-06-06 20:20:19 -07:00
River Riddle	6187178e83	[mlir:LSP] Switch document sync mode to Incremental This is much more efficient over the full mode, as it only requires sending smalls chunks of files. It also works around a weird command ordering issue (full document updates are being sent after other commands like code completion) in newer versions of vscode. Differential Revision: https://reviews.llvm.org/D126032	2022-06-06 20:20:19 -07:00
River Riddle	1b501cbcbb	[mlir] Add documentation for TableGen LSP features and setup This commit beefs up the documentation for MLIR language servers by adding proper documentations/examples/etc for the provided TableGen language server capabilities. Given that this documentation is also used for the vscode extension, this commit also updates the user facing vscode extension documentation. Note that the images referenced in the new documentation are hosted on the website, and will be commited to mlir-www shortly after this commit lands.	2022-06-06 18:29:31 -07:00
Georgios Pinitas	3bcaf2eb93	[mlir][tosa] Moves constant folding operations out of the Canonicalizer Transpose operations on constant data were getting folded during the canonicalization process. This has compile time cost proportional to the constant size. Moving this to a separate pass to enable optionality and flexibility of how such scenarios can be handled. Reviewed By: rsuderman, jpienaar, stellaraccident Differential Revision: https://reviews.llvm.org/D124685	2022-06-06 22:10:22 +00:00
Christopher Bate	a392a39f75	[mlir][vector] fix typo in vector unroll transform	2022-06-06 16:09:13 -06:00
Christopher Bate	1469ebf838	[mlir][vector] Allow unroll of contraction in arbitrary order Adds supprot for vector unroll transformations to unroll in different orders. For example, the `vector.contract` can be unrolled into a smaller set of contractions. There is a choice of how to unroll the decomposition based on the traversal order of (dim0, dim1, dim2). The choice of traversal order can now be specified by a callback which given by the caller of the transform. For now, only the `vector.contract`, `vector.transfer_read/transfer_write` operations support the callback. Differential Revision: https://reviews.llvm.org/D127004	2022-06-06 14:31:04 -06:00
River Riddle	731dfca8a0	[mlir] Add documentation for PDLL LSP features and setup This commit beefs up the documentation for MLIR language servers by adding proper documentations/examples/etc for the provided PDLL language server capabilities. Given that this documentation is also used for the vscode extension, this commit also updates the user facing vscode extension documentation. Not that the images referenced in the new documentation are hosted on the website, and will be commited to mlir-www shortly after this commit lands. Differential Revision: https://reviews.llvm.org/D125650	2022-06-06 13:13:54 -07:00
Christopher Bate	cca662b849	[mlir][linalg] add conv_2d_nhwc_fhwc named op This operation should be supported as a named op because when the operands are viewed as having canonical layouts with decreasing strides, then the "reduction" dimensions of the filter (h, w, and c) are contiguous relative to each output channel. When lowered to a matrix multiplication, this layout is the simplest to deal with, and thus future transforms/vectorizations of `conv2d` may find using this named op convenient. Differential Revision: https://reviews.llvm.org/D126995	2022-06-06 13:18:08 -06:00
Christopher Bate	99069ab212	[mlir][linalg] fix crash when promoting rank-reducing memref.subviews This change adds support for promoting `linalg` operation operands that are produced by rank-reducing `memref.subview` ops. Differential Revision: https://reviews.llvm.org/D127086	2022-06-06 12:06:36 -06:00
jacquesguan	ad44495ad3	[mlir][NFC] Replace some llvm::find with llvm::is_contained. This patch replaces some llvm::find with llvm::is_contained, it should be more clear. Differential Revision: https://reviews.llvm.org/D127077	2022-06-06 03:01:14 +00:00
Stella Laurenzo	768a251587	[mlir] Tunnel LLVM_USE_LINKER through to the standalone example build. When building in debug mode, the link time of the standalone sample is excessive, taking upwards of a minute if using BFD. This at least allows lld to be used if the main invocation was configured that way. On my machine, this gets a standalone test that requires a relink to run in ~13s for Debug mode. This is still a lot, but better than it was. I think we may want to do something about this test: it adds a lot of latency to a normal compile/test cycle and requires a bunch of arg fiddling to exclude. I think we may end up wanting a `check-mlir-heavy` target that can be used just prior to submit, and then make `check-mlir` just run unit/lite tests. More just thoughts for the future (none of that is done here). Reviewed By: bondhugula, mehdi_amini Differential Revision: https://reviews.llvm.org/D126585	2022-06-05 12:31:41 -07:00
Fangrui Song	d86a206f06	Remove unneeded cl::ZeroOrMore for cl::opt/cl::list options	2022-06-05 00:31:44 -07:00
Christian Sigg	400fef081a	Recommit: "[MLIR][NVVM] Replace fdiv on fp16 with promoted (fp32) multiplication with reciprocal plus one (conditional) Newton iteration." This change rolls `bcfc0a9051` forward (i.e., reverting `369ce54bb3`) with fixed CMakeLists.txt.	2022-06-05 09:11:43 +02:00
Jacques Pienaar	29794ab0fa	[mlir] Use context provided rather than getContext Avoids "pass state was never initialized" assertion failure.	2022-06-04 12:18:51 -07:00
Mehdi Amini	369ce54bb3	Revert "[MLIR][GPU] Replace fdiv on fp16 with promoted (fp32) multiplication with reciprocal plus one (conditional) Newton iteration." This reverts commit `bcfc0a9051`. The build is broken with shared library enabled.	2022-06-04 08:35:45 +00:00
Christian Sigg	bcfc0a9051	[MLIR][GPU] Replace fdiv on fp16 with promoted (fp32) multiplication with reciprocal plus one (conditional) Newton iteration. This is correct for all values, i.e. the same as promoting the division to fp32 in the NVPTX backend. But it is faster (~10% in average, sometimes more) because: - it performs less Newton iterations - it avoids the slow path for e.g. denormals - it allows reuse of the reciprocal for multiple divisions by the same divisor Test program: ``` #include <stdio.h> #include "cuda_fp16.h" // This is a variant of CUDA's own __hdiv which is fast than hdiv_promote below // and doesn't suffer from the perf cliff of div.rn.fp32 with 'special' values. __device__ half hdiv_newton(half a, half b) { float fa = __half2float(a); float fb = __half2float(b); float rcp; asm("{rcp.approx.ftz.f32 %0, %1;\n}" : "=f"(rcp) : "f"(fb)); float result = fa * rcp; auto exponent = reinterpret_cast<const unsigned&>(result) & 0x7f800000; if (exponent != 0 && exponent != 0x7f800000) { float err = __fmaf_rn(-fb, result, fa); result = __fmaf_rn(rcp, err, result); } return __float2half(result); } // Surprisingly, this is faster than CUDA's own __hdiv. __device__ half hdiv_promote(half a, half b) { return __float2half(__half2float(a) / __half2float(b)); } // This is an approximation that is accurate up to 1 ulp. __device__ half hdiv_approx(half a, half b) { float fa = __half2float(a); float fb = __half2float(b); float result; asm("{div.approx.ftz.f32 %0, %1, %2;\n}" : "=f"(result) : "f"(fa), "f"(fb)); return __float2half(result); } __global__ void CheckCorrectness() { int i = threadIdx.x + blockIdx.x * blockDim.x; half x = reinterpret_cast<const half&>(i); for (int j = 0; j < 65536; ++j) { half y = reinterpret_cast<const half&>(j); half d1 = hdiv_newton(x, y); half d2 = hdiv_promote(x, y); auto s1 = reinterpret_cast<const short&>(d1); auto s2 = reinterpret_cast<const short&>(d2); if (s1 != s2) { printf("%f (%u) / %f (%u), got %f (%hu), expected: %f (%hu)\n", __half2float(x), i, __half2float(y), j, __half2float(d1), s1, __half2float(d2), s2); //__trap(); } } } __device__ half dst; __global__ void ProfileBuiltin(half x) { #pragma unroll 1 for (int i = 0; i < 10000000; ++i) { x = x / x; } dst = x; } __global__ void ProfilePromote(half x) { #pragma unroll 1 for (int i = 0; i < 10000000; ++i) { x = hdiv_promote(x, x); } dst = x; } __global__ void ProfileNewton(half x) { #pragma unroll 1 for (int i = 0; i < 10000000; ++i) { x = hdiv_newton(x, x); } dst = x; } __global__ void ProfileApprox(half x) { #pragma unroll 1 for (int i = 0; i < 10000000; ++i) { x = hdiv_approx(x, x); } dst = x; } int main() { CheckCorrectness<<<256, 256>>>(); half one = __float2half(1.0f); ProfileBuiltin<<<1, 1>>>(one); // 1.001s ProfilePromote<<<1, 1>>>(one); // 0.560s ProfileNewton<<<1, 1>>>(one); // 0.508s ProfileApprox<<<1, 1>>>(one); // 0.304s auto status = cudaDeviceSynchronize(); printf("%s\n", cudaGetErrorString(status)); } ``` Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D126158	2022-06-04 08:03:29 +02:00
wren romano	3cf03f1c56	[mlir][sparse] Adding IsSparseTensorPred and updating ops to use it Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126994	2022-06-03 17:15:31 -07:00
Christopher Bate	9f819f4c62	[mlir][linalg] fix crash in vectorization of elementwise operations The current vectorization logic implicitly expects "elementwise" linalg ops to have projected permutations for indexing maps, but the precondition logic misses this check. This can result in a crash when executing the generic vectorization transform on an op with a non-projected permutation input indexing map. This change fixes the logic and adds a test (which crashes without this fix). Differential Revision: https://reviews.llvm.org/D127000	2022-06-03 16:38:13 -06:00
Diego Caballero	9a79b1b04c	[mlir] Add peeling xform to Codegen Strategy This patch adds the knobs to use peeling in the codegen strategy infrastructure. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D126842	2022-06-03 21:31:43 +00:00
Krzysztof Drewniak	95aff23e29	Re-land "[mlir] Add integer range inference analysis"" This reverts commit `4e5ce2056e`. This relands commit `1350c9887d`. Reinstates the range analysis with the build issue fixed. Differential Revision: https://reviews.llvm.org/D126926	2022-06-03 17:13:48 +00:00
lewuathe	d4141c93a8	[mlir][complex] Check the correctness of tanh in complex dialect Correctness check for tanh operation in complex dialect. Ref: https://reviews.llvm.org/D126858 Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D126946	2022-06-03 14:04:48 +02:00
Adrian Kuegel	39f28397e2	[mlir] Fix ClangTidy warning (NFC). virtual is redundant since the function is already declared 'override'.	2022-06-03 12:46:14 +02:00
Shraiysh Vaishay	f5d29c15bf	[mlir][OpenMP] Add memory_order clause tests This patch adds tests for memory_order clause for atomic update and capture operations. This patch also adds a check for making sure that the operations inside and omp.atomic.capture region do not specify the memory_order clause. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D126195	2022-06-03 13:41:22 +05:30
Nicolas Vasilache	72de7588cc	[mlir][SCF] Add bufferization hook for scf.foreach_thread and terminator. `scf.foreach_thread` results alias with the underlying `scf.foreach_thread.parallel_insert_slice` destination operands and they bufferize to equivalent buffers in the absence of other conflicts. `scf.foreach_thread.parallel_insert_slice` conflict detection is similar to `tensor.insert_slice` conflict detection. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D126769	2022-06-03 07:14:05 +00:00
Alexander Batashev	b34fb277df	[mlir][cf] Implement missing SwitchOp::build function Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D126594	2022-06-03 09:08:04 +03:00
Thomas Raoux	271a48e029	[mlir][VectorToGPU] Fix bug generating incorrect ldmatrix ops ldmatrix transpose can only be used with types that are 16bits wide. Differential Revision: https://reviews.llvm.org/D126846	2022-06-03 04:30:22 +00:00
Thomas Raoux	205c08b54d	[mlir][scf] Add option to loop pipelining to not peel the epilogue Add an option to predicate the epilogue within the kernel instead of peeling the epilogue. This is a useful option to prevent generating large amount of code for deep pipeline. This currently require a user lamdba to implement operation predication. Differential Revision: https://reviews.llvm.org/D126753	2022-06-03 04:20:20 +00:00
River Riddle	ee1cf1f645	[mlir][NFC] Simplify the various `parseSourceFile<T>` overloads These effectively all share the same implementation, i.e. forward to the non-templated overload and then construct the container op.	2022-06-02 19:18:55 -07:00
Aart Bik	f8b692dd31	[mlir][python][f16] add ctype python binding support for f16 Similar to complex128/complex64, float16 has no direct support in the ctypes implementation. This fixes the issue by using a custom F16 type to change the view in and out of MLIR code Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D126928	2022-06-02 17:21:24 -07:00
River Riddle	bb81b3b274	[vscode-mlir] Bump to version 0.8 Since version 0.7 we've added: * Initial language support for TableGen * Tweaked syntax highlighting for PDLL * Added a new command to view intermediate PDLL output	2022-06-02 16:35:09 -07:00
River Riddle	bf352e0b2e	[mlir:PDLL] Add better support for providing Constraint/Pattern/Rewrite documentation This commit enables providing long-form documentation more seamlessly to the LSP by revamping decl documentation. For ODS imported constructs, we now also import descriptions and attach them to decls when possible. For PDLL constructs, the LSP will now try to provide documentation by parsing the comments directly above the decls location within the source file. This commit also adds a new parser flag `enableDocumentation` that gates the import and attachment of ODS documentation, which is unnecessary in the normal build process (i.e. it should only be used/consumed by tools). Differential Revision: https://reviews.llvm.org/D124881	2022-06-02 16:31:07 -07:00
Arjun P	8bc2cff95a	[MLIR][Presburger] Simplex: remove redundant member vars nRow, nCol Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126790	2022-06-03 00:30:48 +01:00
Chia-hung Duan	633ad1d864	[mlir:MultiOpDriver] Quick fix the assertion position The assertion should come after null check	2022-06-02 23:25:35 +00:00
Mehdi Amini	4e5ce2056e	Revert "[mlir] Add integer range inference analysis" This reverts commit `1350c9887d`. Shared library build is broken with undefined references.	2022-06-02 21:24:06 +00:00
Krzysztof Drewniak	1350c9887d	[mlir] Add integer range inference analysis This commit defines a dataflow analysis for integer ranges, which uses a newly-added InferIntRangeInterface to compute the lower and upper bounds on the results of an operation from the bounds on the arguments. The range inference is a flow-insensitive dataflow analysis that can be used to simplify code, such as by statically identifying bounds checks that cannot fail in order to eliminate them. The InferIntRangeInterface has one method, inferResultRanges(), which takes a vector of inferred ranges for each argument to an op implementing the interface and a callback allowing the implementation to define the ranges for each result. These ranges are stored as ConstantIntRanges, which hold the lower and upper bounds for a value. Bounds are tracked separately for the signed and unsigned interpretations of a value, which ensures that the impact of arithmetic overflows is correctly tracked during the analysis. The commit also adds a -test-int-range-inference pass to test the analysis until it is integrated into SCCP or otherwise exposed. Finally, this commit fixes some bugs relating to the handling of region iteration arguments and terminators in the data flow analysis framework. Depends on D124020 Depends on D124021 Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D124023	2022-06-02 20:24:11 +00:00
Aart Bik	bf7dbc2a30	[mlir][sparse][bufferization] fix doc on new init operation The example was still using the -now- removed sparse_tensor.init_tensor. Also, I made the input operands of the matrix multiplication sparse too (since it looks a bit strange to multiply two dense matrices into a sparse). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D126897	2022-06-02 12:04:36 -07:00
Chia-hung Duan	2aeffc6d8d	[mlir:MultiOpDriver] Don't add ops which are not in the allowed list In strict mode, only the new inserted operation is allowed to add to the worklist. Before this change, it would add the users of a replaced op and it didn't check if the users are allowed to be pushed into the worklist Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D126899	2022-06-02 18:27:37 +00:00
Ashay Rane	5fee1799f4	[mlir] translate memref.reshape with static shapes but dynamic dims Prior to this patch, the lowering of memref.reshape operations to the LLVM dialect failed if the shape argument had a static shape with dynamic dimensions. This patch adds the necessary support so that when the shape argument has dynamic values, the lowering probes the dimension at runtime to set the size in the `MemRefDescriptor` type. This patch also computes the stride for dynamic dimensions by deriving it from the sizes of the inner dimensions. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126604	2022-06-02 10:00:58 -07:00
Alex Zinenko	ce2e198bc2	[mlir] add decompose and generalize to structured transform ops These ops complement the tiling/padding transformations by transforming higher-level named structured operations such as depthwise convolutions into lower-level and/or generic equivalents that are better handled by some downstream transformations. Differential Revision: https://reviews.llvm.org/D126698	2022-06-02 15:25:18 +02:00
Nicolas Vasilache	311967701a	[mlir][SCF] Add scf.foreach_thread.parallel_insert_slice canonicalization. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126761	2022-06-02 11:53:25 +00:00
lewuathe	9f0869a61d	[mlir][complex] Lower complex.sin/cos to libm Lower sin/cos operation in complex dialect to libm as a baseline. This follows up to https://reviews.llvm.org/D125550. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D126755	2022-06-02 10:39:00 +02:00
lewuathe	4b13b061ae	[mlir][complex] Sanity check for tan operation in complex dialect Add a sanity check for newly added tan operation in complex dialect. It follows-up to https://reviews.llvm.org/D126685. Differential Revision: https://reviews.llvm.org/D126858	2022-06-02 10:33:40 +02:00
Nikita Popov	41d5033eb1	[IR] Enable opaque pointers by default This enabled opaque pointers by default in LLVM. The effect of this is twofold: * If IR that contains neither explicit ptr nor %T* types is passed to tools, we will now use opaque pointer mode, unless -opaque-pointers=0 has been explicitly passed. * Users of LLVM as a library will now default to opaque pointers. It is possible to opt-out by calling setOpaquePointers(false) on LLVMContext. A cmake option to toggle this default will not be provided. Frontends or other tools that want to (temporarily) keep using typed pointers should disable opaque pointers via LLVMContext. Differential Revision: https://reviews.llvm.org/D126689	2022-06-02 09:40:56 +02:00
jacquesguan	19e285477e	[mlir][Arithmetic] Add constant folder for RemF. This patch adds the constant folder for RemF. Differential Revision: https://reviews.llvm.org/D126045	2022-06-02 06:24:37 +00:00
jacquesguan	ce820375ef	[mlir] Support convert token type from LLVM IR. This patch supports the token type for converting from LLVM IR. Differential Revision: https://reviews.llvm.org/D126756	2022-06-02 03:32:51 +00:00
Matthias Springer	6232a8f3d6	[mlir][sparse][NFC] Switch InitOp to bufferization::AllocTensorOp Now that we have an AllocTensorOp (previously InitTensorOp) in the bufferization dialect, the InitOp in the sparse dialect is no longer needed. Differential Revision: https://reviews.llvm.org/D126180	2022-06-02 00:03:52 +02:00
wren romano	b364c76683	[mlir][sparse] Using non-empty function name suffix for OverheadType::kIndex The trick of using an empty token in the `FOREVERY_O` x-macro relies on preprocessor behavior which is only standard since C99 6.10.3/4 and C++11 N3290 16.3/4 (whereas it was undefined behavior up through C++03 16.3/10). Since the `ExecutionEngine/SparseTensorUtils.cpp` file is required to be compile-able under C++98 compatibility mode (unlike the C++11 used elsewhere in MLIR), we shouldn't rely on that behavior. Also, using a non-empty suffix helps improve uniformity of the API, since all other primary/overhead suffixes are also non-empty. I'm using the suffix `0` since that's the value used by the `SparseTensorEncoding` attribute for indicating the index overhead-type. Depends On D126720 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126724	2022-06-01 14:18:42 -07:00
Rob Suderman	f3bdb56d61	[mlir][math] Add math.ctlz expansion to control flow + arith operations Ctlz is an intrinsic in LLVM but does not have equivalent operations in SPIR-V. Including a decomposition gives an alternative path for these platforms. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D126261	2022-06-01 11:45:04 -07:00
Stella Laurenzo	3bb7999339	[mlir] Add global_load and global_store ops to ml_program. * Adds simple, non-atomic, non-volatile, non-synchronized direct load/store ops. Differential Revision: https://reviews.llvm.org/D126230	2022-06-01 11:32:15 -07:00
Alexander Belyaev	f711785e61	[mlir] Add conversion and tests for complex.[sqrt\|atan2] to Arith. Differential Revision: https://reviews.llvm.org/D126799	2022-06-01 20:21:51 +02:00
bixia1	548f0841cd	[mlir][sparse] Enable the test for operator expm1. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126732	2022-06-01 11:18:17 -07:00
Aart Bik	d668218946	[mlir][python][ctypes] fix ctype python binding complication for complex There is no direct ctypes for MLIR's complex (and thus np.complex128 and np.complex64) yet, causing the mlir python binding methods for memrefs to crash. This revision fixes this by passing complex arrays as tuples of floats, correcting at the boundaries for the proper view. NOTE: some of these changes (4 -> 2) were forced by the new "linting" Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D126422	2022-06-01 10:15:24 -07:00
Arjun P	8f99cdd27c	[MLIR][Presburger] Simplex: remove redundant zeroing out of row This fillRow(..., 0) is redundant because when the size of the tableau is consistent, the resize always creates a new row, which is zero-initialized. Also added asserts throughout to ensure the dimensions of the tableau remain consistent. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D126709	2022-06-01 16:59:37 +01:00
Arjun P	ec145ba2a3	[MLIR][Presburger] Matrix: inline trivial accessors This resolves a comment from https://reviews.llvm.org/D126708 that was previously missed.	2022-06-01 16:56:46 +01:00
Arjun P	d5e31cf38a	[MLIR][Presburger] Move Matrix accessors inline This gives a 1.5x speedup on the Presburger unittests. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D126708	2022-06-01 16:51:42 +01:00
PeixinQiao	fe2cc16035	[NFC][MLIR] Fix -Wtype-limits warning Fix the warning: comparison of unsigned expression in ‘>= 0’ is always true. Reviewed By: kiranchandramohan, shraiysh Differential Revision: https://reviews.llvm.org/D126784	2022-06-01 23:42:07 +08:00
Nicolas Vasilache	59b273a166	[mlir][SCF] Add parallel abstraction on tensors. This revision adds `scf.foreach_thread` and other supporting abstractions that allow connecting parallel abstractions and tensors. Discussion is available [here](https://discourse.llvm.org/t/rfc-parallel-abstraction-for-tensors-and-buffers/62607). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126555	2022-06-01 09:16:01 +00:00
lewuathe	ffb8eecdd6	[mlir][complex] Lowering complex.tanh to standard Lowering complex.tanh to standard dialects including math, arith. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D126521	2022-06-01 11:13:54 +02:00
Nicolas Vasilache	beab8e871e	Revert "[mlir][SCF] Add parallel abstraction on tensors." This reverts commit `9b7193f852`. This is an older branch that was committed by mistake and does not include addressed review comments, an updated version will come next.	2022-06-01 09:04:20 +00:00
Nicolas Vasilache	9b7193f852	[mlir][SCF] Add parallel abstraction on tensors. This revision adds `scf.foreach_thread` and other supporting abstractions that allow connecting parallel abstractions and tensors. Discussion is available [here](https://discourse.llvm.org/t/rfc-parallel-abstraction-for-tensors-and-buffers/62607).	2022-06-01 09:02:16 +00:00
Benjamin Kramer	7d431e9ec5	[mlir][complex] Remove unused variables. NFC.	2022-06-01 09:33:02 +02:00
lewuathe	6d75c89783	[mlir][complex] Add tan op for complex dialect Add tangent operation for complex dialect. This is the follow-up change of https://reviews.llvm.org/D126521 Differential Revision: https://reviews.llvm.org/D126685	2022-06-01 09:20:42 +02:00
wren romano	98e142cd4f	[mlir][sparse] Using x-macros in the function-suffix functions By defining the `{primary,overhead}TypeFunctionSuffix` functions via the same x-macros used to generate the runtime library's functions themselves, this helps avoid bugs from typos or things getting out of sync. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D126720	2022-05-31 17:36:43 -07:00
wren romano	c63d4fac4f	[mlir][sparse] Improving the FATAL macro The previous macro definition using `{...}` would fail to compile when the callsite uses a semicolon followed by an else-statement (i.e., `if (...) FATAL(...); else ...;`). Replacing the simple braces with `do{...}while(0)` (n.b., semicolon not included in the macro definition) enables callsites to use the semicolon plus else-statement syntax without problems. The new definition now requires the semicolon at all callsites, but since it was already being called that way nothing changes. For more explanation, see <https://gcc.gnu.org/onlinedocs/cpp/Swallowing-the-Semicolon.html> Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126514	2022-05-31 14:31:38 -07:00
wren romano	a4c53f8cd6	[mlir][sparse] Factoring out SparseTensorFile class for readSparseTensorShape The primary goal of this change is to define readSparseTensorShape. Whereas the SparseTensorFile class is merely introduced as a way to reduce code duplication along the way. Depends On D126106 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126233	2022-05-31 13:24:28 -07:00
Nathaniel McVicar	8fb1bef60f	[windows] Remove unused pybind exception params Resolve MSVC warning C4104 for unreferenced variable Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D126683	2022-05-31 12:36:57 -07:00
Arjun P	18a06d4f3a	[MLIR][Presburger] Simplex::computeOptimum: slightly simplify code (NFC)	2022-05-31 19:10:15 +01:00
lorenzo chelini	850dbff708	[MLIR][Math] Improve docs (NFC) Remove boilerplate examples and add a text at the dialect level to describe what kind of operands the operations accept (i.e., scalar, tensor or vector). Left a shorter sentence describing the input operands for each operation as this redundancy is convenient when browsing the documentation using the website. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126648	2022-05-31 18:16:59 +02:00
Mehdi Amini	5d93d2a9eb	Apply clang-tidy fixes for llvm-else-after-return in OpPythonBindingGen.cpp (NFC)	2022-05-31 11:54:19 +00:00
Mehdi Amini	d8c46eb612	Apply clang-tidy fixes for readability-identifier-naming in SparseTensorUtils.cpp (NFC)	2022-05-31 11:54:19 +00:00
jacquesguan	42c17073fc	[mlir] Support import llvm intrinsics. This patch supports to convert the llvm intrinsic to the corresponding op. It still leaves some intrinsics to be handled specially. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126639	2022-05-31 11:08:23 +00:00
River Riddle	1c2edb026e	[mlir:PDLL] Rework the C++ generation of native Constraint/Rewrite arguments and results The current translation uses the old "ugly"/"raw" form which used PDLValue for the arguments and results. This commit updates the C++ generation to use the recently added sugar that allows for directly using the desired types for the arguments and result of PDL functions. In addition, this commit also properly imports the C++ class for ODS operations, constraints, and interfaces. This allows for a much more convienent C++ API than previously granted with the raw/low-level types. Differential Revision: https://reviews.llvm.org/D124817	2022-05-30 17:35:34 -07:00
River Riddle	0429472efe	[mlir:PDLL] Fix signature help for operation operands We were currently only completing on the first operand because the completion check was outside of the parse loop. Differential Revision: https://reviews.llvm.org/D124784	2022-05-30 17:35:34 -07:00
River Riddle	01652d889c	[mlir:PDLL-LSP] Add a custom LSP command for viewing the output of PDLL This commit adds a new PDLL specific LSP command, pdll.viewOutput, that allows for viewing the intermediate outputs of a given PDLL file. The available intermediate forms currently mirror those in mlir-pdll, namely: AST, MLIR, CPP. This is extremely useful for a developer of PDLL, as it simplifies various testing, and is also quite useful for users as they can easily view what is actually being generated for their PDLL files. This new command is added to the vscode client, and is available in the right client context menu of PDLL files, or via the vscode command palette. Differential Revision: https://reviews.llvm.org/D124783	2022-05-30 17:35:34 -07:00

1 2 3 4 5 ...

11545 Commits