llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	f21896f2c6	[DenseElementAttr] Simplify the public API for creating these. Instead of requiring the client to compute the "isSplat" bit, compute it internally. This makes the logic more consistent and defines away a lot of "elements.size()==1" in the clients. This addresses Issue #55185 Differential Revision: https://reviews.llvm.org/D125447	2022-05-12 16:18:23 +01:00
Thomas Raoux	d02f10d96d	[mlir][vector] Add lowering pattern for vector.warp_execute_on_lane_0 op Add lowering of the vector.warp_execute_on_lane_0 into scf.if plus memory transfer for the operands and yield values. This also add an integration test running on GPU warp. The same tests can be later re-used with different comment lines to tests distribution transformations. This is mostly from @springerm contribution. Differential Revision: https://reviews.llvm.org/D125430	2022-05-12 13:27:43 +00:00
Benjamin Kramer	303638248a	[mlir][linalg] Add lowering of named ops on complex numbers This lets linalg.dot and friends lower to a complex muladd using ops from the complex dialect. Differential Revision: https://reviews.llvm.org/D125461	2022-05-12 13:37:34 +02:00
Benjamin Kramer	27dad99622	[mlir][LLVM] Make the nested type restriction on complex constants less aggressive Complex nested in other types is perfectly fine, just nested structs aren't supported. Instead of checking whether there's nesting just check whether the struct we're dealing with is a complex number. Differential Revision: https://reviews.llvm.org/D125381	2022-05-12 11:47:01 +02:00
Daniil Dudkin	70c463efc8	[mlir][NFC] Fix `GpuKernelOutliningPass` copy constructor warnings 1. Call copy constructor of the base class 2. Assign value of the option directly Reviewed By: dcaballe, rriddle Differential Revision: https://reviews.llvm.org/D125101	2022-05-12 11:41:18 +03:00
Nikita Popov	f02716a806	[MLIR] Fix build without native arch D125214 split off a MLIRExecutionEngineUtils library that is used by MLIRGPUTransforms. However, currently the entire ExecutionEngine directory is skipped if the LLVM_NATIVE_ARCH target is not available. Move the check for LLVM_NATIVE_ARCH, such that MLIRExecutionEngineUtils always gets built, and only the JIT-related libraries are omitted without native arch. Differential Revision: https://reviews.llvm.org/D125357	2022-05-12 09:50:51 +02:00
Matthias Springer	82ea0d8b82	[mlir][bufferize] Support alloc hoisting across function boundaries This change integrates the BufferResultsToOutParamsPass into One-Shot Module Bufferization. This improves memory management (deallocation) when buffers are returned from a function. Note: This currently only works with statically-sized tensors. The generated code is not very efficient yet and there are opportunities for improvment (fewer copies). By default, this new functionality is deactivated. Differential Revision: https://reviews.llvm.org/D125376	2022-05-12 09:44:07 +02:00
Matthias Springer	2fe40c34ea	[mlir][bufferize] Fix op filter Bufferization has an optional filter to exclude certain ops from analysis+bufferization. There were a few remaining places in the codebase where the filter was not checked. Differential Revision: https://reviews.llvm.org/D125356	2022-05-12 09:33:07 +02:00
Matthias Springer	011f1b1c1f	[mlir][bufferize] Add helpers for templatized DENY filters We already have templatized ALLOW filters but the DENY filters were missing. Differential Revision: https://reviews.llvm.org/D125358	2022-05-12 09:18:21 +02:00
River Riddle	1155c1fe65	[mlir:Parser] Emit a better diagnostic when a custom operation is unknown When a custom operation is unknown and does not have a dialect prefix, we currently emit an error using the name of the operation with the default dialect prefix. This leads to a confusing error message, especially when operations get moved between dialects. For example, `func` was recently moved out of `builtin` and to the `func` dialect. The current error message we get is: ``` func @foo() ^ custom op 'builtin.func' is unknown ``` This could lead users to believe that there is supposed to be a `builtin.func`, because there used to be. This commit adds a better error message that does not assume that the operation is supposed to be in the default dialect: ``` func @foo() ^ custom op 'func' is unknown (tried 'builtin.func' as well) ``` Differential Revision: https://reviews.llvm.org/D125351	2022-05-11 22:54:44 -07:00
Mahesh Ravishankar	8be7e6f56a	[mlir][Linalg] Combine canonicalizers that deal with removing dead/redundant args. `linalg.generic` ops have canonicalizers that either remove arguments not used in the payload, or redundant arguments. Combine these and enhance the canonicalization to also remove results that have no use. This is effectively dead code elimination for Linalg ops. Differential Revision: https://reviews.llvm.org/D123632	2022-05-12 05:22:30 +00:00
Mogball	0ffef0c23b	[mlir][ods] (NFC) don't use std::function for map_range	2022-05-12 05:15:03 +00:00
Mogball	19906262c9	[mlir] (NFC) Use assembly format for test.graph_region	2022-05-12 04:19:25 +00:00
bzcheeseman	bc22b5c9a2	[MLIR][Operation] Simplify Operation casting, NFC We can simplify the code needed to implement dyn_cast/cast/isa support for MLIR operations with documented interfaces via the CastInfo structures. This will also provide an example of how to use CastInfo. Depends on D123901 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124963	2022-05-12 00:17:01 -04:00
grosul1	a4b227c28a	[mlir] Fix loop unrolling: properly replace the arguments of the epilogue loop. Using "replaceUsesOfWith" is incorrect because the same initializer value may appear multiple times. For example, if the epilogue is needed when this loop is unrolled ``` %x:2 = scf.for ... iter_args(%arg1 = %c1, %arg2 = %c1) { ... } ``` then both epilogue's arguments will be incorrectly renamed to use the same result index (note #1 in both cases): ``` %x_unrolled:2 = scf.for ... iter_args(%arg1 = %c1, %arg2 = %c1) { ... } %x_epilogue:2 = scf.for ... iter_args(%arg1 = %x_unrolled#1, %arg2 = %x_unrolled#1) { ... } ```	2022-05-12 01:54:39 +00:00
Chris Lattner	86445e8c63	[AsmParser] Adopt emitWrongTokenError more, improving QoI This is a full audit of emitError calls, I took the opportunity to remove extranous parens and fix a couple cases where we'd generate multiple diagnostics for the same error. Differential Revision: https://reviews.llvm.org/D125355	2022-05-11 20:41:12 +01:00
River Riddle	5a9a438a54	[TableGen] Refactor TableGenParseFile to no longer use a callback Now that TableGen no longer relies on global Record state, we can allow for the client to own the RecordKeeper and SourceMgr. Given that TableGen internally still relies on the global llvm::SrcMgr, this method unfortunately still isn't thread-safe. Differential Revision: https://reviews.llvm.org/D125277	2022-05-11 11:55:33 -07:00
Matthias Springer	248e113e9f	[mlir][bufferize][NFC] Move helper functions to BufferizationOptions Move helper functions for creating allocs/deallocs/memcpys to BufferizationOptions. Differential Revision: https://reviews.llvm.org/D125375	2022-05-11 16:23:22 +02:00
Chris Lattner	34b6f206cb	[AsmParser] Improve error recovery again. Change the parsing logic to use StringRef instead of lower level char* logic. Also, if emitting a diagnostic on the first token in the file, we make sure to use that position instead of the very start of the file. Differential Revision: https://reviews.llvm.org/D125353	2022-05-11 08:25:36 +01:00
Chia-hung Duan	96e642652b	[mlir] Print some message for op-printing verification Before dump, Insetad of switching to generic form silently after verification failure. Print some debug logs to help identify why an op may be printed in a different way. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125136	2022-05-10 22:48:47 +00:00
Thomas Raoux	15bcc36eed	[mlir][gpu] Move async copy ops to NVGPU and add caching hints Move async copy operations to NVGPU as they only exist on NV target and are designed to match ptx semantic. This allows us to also add more fine grain caching hint attribute to the op. Add hint to bypass L1 and hook it up to NVVM op. Differential Revision: https://reviews.llvm.org/D125244	2022-05-10 22:30:24 +00:00
Nicolas Vasilache	1f23211cb1	[mlir][SCF] Retire `cloneWithNewYields` helper function. This is now subsumed by `replaceLoopWithNewYields`. Differential Revision: https://reviews.llvm.org/D125309	2022-05-10 18:44:11 +00:00
Mahesh Ravishankar	567fd523bf	[mlir][SCF] Add utility method to add new yield values to a loop. The current implementation of `cloneWithNewYields` has a few issues - It clones the loop body of the original loop to create a new loop. This is very expensive. - It performs `erase` operations which are incompatible when this method is called from within a pattern rewrite. All erases need to go through `PatternRewriter`. To address these a new utility method `replaceLoopWithNewYields` is added which - moves the operations from the original loop into the new loop. - replaces all uses of the original loop with the corresponding results of the new loop - use a call back to allow caller to generate the new yield values. - the original loop is modified to just yield the basic block arguments corresponding to the iter_args of the loop. This represents a no-op loop. The loop itself is dead (since all its uses are replaced), but is not removed. The caller is expected to erase the op. Consequently, this method can be called from within a `matchAndRewrite` method of a `PatternRewriter`. The `cloneWithNewYields` could be replaces with `replaceLoopWithNewYields`, but that seems to trigger a failure during walks, potentially due to the operations being moved. That is left as a TODO. Differential Revision: https://reviews.llvm.org/D125147	2022-05-10 18:44:11 +00:00
Krzysztof Drewniak	814b605095	[mlir][AMDGPU] Add AMDGPU conversion patterns to ConvertGPUToROCDL This ensures that attributes such as the index bitwidth propagate correctly to the AMDGPUToROCDL patterns. Differential Revision: https://reviews.llvm.org/D125320	2022-05-10 16:49:11 +00:00
Ashay Rane	53ff0daa7e	[mlir] Fail early if AnalysisState::getBuffer() returns failure This patch updates calls to AnalysisState::getBuffer() so that we return early with a failure if the call does not succeed. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125251	2022-05-10 08:08:38 -07:00
Krzysztof Drewniak	f1f05a91ca	[MLIR][AMDGPU] Add AMDGPU dialect, wrappers around raw buffer intrinsics By analogy with the NVGPU dialect, introduce an AMDGPU dialect for AMD-specific intrinsic wrappers. The dialect initially includes wrappers around the raw buffer intrinsics. On AMD GPUs, a memref can be converted to a "buffer descriptor" that allows more precise control of memory access, such as by allowing for out of bounds loads/stores to be replaced by 0/ignored without adding additional conditional logic, which is important for performance. The repository currently contains a limited conversion from transfer_read/transfer_write to Mubuf intrinsics, which are an older, deprecated intrinsic for the same functionality. The new amdgpu.raw_buffer_* ops allow these operations to be used explicitly and for including metadata such as whether the target chipset is an RDNA chip or not (which impacts the interpretation of some bits in the buffer descriptor), while still maintaining an MLIR-like interface. (This change also exposes the floating-point atomic add intrinsic.) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D122765	2022-05-10 14:59:58 +00:00
Chris Lattner	ad3b358180	[MLIR Parser] Improve QoI for "expected token" errors A typical problem with missing a token is that the missing token is at the end of a line. The problem with this is that the error message gets reported on the start of the following line (which is where the next / invalid token is) which can be confusing. Handle this by noticing this case and backing up to the end of the previous line. Differential Revision: https://reviews.llvm.org/D125295	2022-05-10 15:44:17 +01:00
Adrian Kuegel	64c8574209	[mlir] Remove unused using declaration (NFC)	2022-05-10 12:58:01 +02:00
Nikita Popov	03ab30686d	[MLIR] Split off MLIRExecutionEngineUtils to fix libMLIR.so build (PR54242) Building libMLIR.so currently fails with: > /usr/bin/ld: /tmp/ccNzulEA.ltrans39.ltrans.o: in function `(anonymous namespace)::SerializeToHsacoPass::optimizeLlvm(llvm::Module&, llvm::TargetMachine&)': > /builddir/build/BUILD/llvm-project-15.0.0.src/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp:328: undefined reference to `mlir::makeOptimizingTransformer(unsigned int, unsigned int, llvm::TargetMachine*)' This is because MLIRGPUTransforms depends on MLIRExecutionEngine in `61bb2e4ea8/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp (L328)`, but MLIRExecutionEngine is marked as excluded from libMLIR.so. However, this code doesn't require the full execution engine: It only performs middle-end optimization, and does not need any of the JIT/codegen infrastructure. As such, split off a separate library MLIRExecutionEngineUtils, which only contains that part and is not excluded from libMLIR.so. Fixes https://github.com/llvm/llvm-project/issues/54242. Differential Revision: https://reviews.llvm.org/D125214	2022-05-10 10:17:52 +02:00
Stella Stamenova	784a5bccfd	[mlir] Fix python bindings build on Windows in Debug Currently, building mlir with the python bindings enabled on Windows in Debug is broken because pybind11, python and cmake don't like to play together. This change normalizes how the three interact, so that the builds can now run and succeed. The main issue is that python and cmake both make assumptions about which libraries are needed in a Windows build based on the flavor. - cmake assumes that a debug (or a debug-like) flavor of the build will always require pythonX_d.lib and provides no option/hint to tell it to use a different library. cmake does find both the debug and release versions, but then uses the debug library. - python (specifically pyconfig.h and by extension python.h) hardcodes the dependency on pythonX_d.lib or pythonX.lib depending on whether `_DEBUG` is defined. This is NOT transparent - it does not show up anywhere in the build logs until the link step fails with `pythonX_d.lib is missing` (or `pythonX.lib is missing`) - pybind11 tries to "fix" this by implementing a workaround - unless Py_DEBUG is defined, `_DEBUG` is explicitly undefined right before including python headers. This also requires some windows headers to be included differently, so while clever, this is a non-trivial workaround. mlir itself includes the pybind11 headers (which contain the workaround) AS WELL AS python.h, essentially always requiring both pythonX.lib and pythonX_d.lib for linking. cmake explicitly only adds one or the other, so the build fails. This change does a couple of things: - In the cmake files, explicitly add the release version of the python library on Windows builds regardless of flavor. Since Py_DEBUG is not defined, pybind11 will always require release and it will be satisfied - To satisfy python as well, this change removes any explicit inclusions of Python.h on Windows instead relying on the fact that pybind11 headers will bring in what is needed There are a few additional things that we could do but I rejected as unnecessary at this time: - define Py_DEBUG based on the CMAKE_BUILD_TYPE - this will mostly work, we'd have to think through multiconfig generators like VS, but it's possible. There doesn't seem to be a need to link against debug python at the moment, so I chose not to overcomplicate the build and always default to release - similar to above, but define Py_DEBUG based on the CMAKE_BUILD_TYPE as well as the presence of the debug python library (`Python3_LIBRARY_DEBUG`). Similar to above, this seems unnecessary right now. I think it's slightly better than above because most people don't actually have the debug version of python installed, so this would prevent breaks in that case. - similar to the two above, but add a cmake variable to control the logic - implement the pybind11 workaround directly in mlir (specifically in Interop.h) so that Python.h can still be included directly. This seems prone to error and a pain to maintain in lock step with pybind11 - reorganize how the pybind11 headers are included and place at least one of them in Interop.h directly, so that the header has all of its dependencies included as was the original intention. I decided against this because it really doesn't need pybind11 logic and it's always included after pybind11 is, so we don't necessarily need the python includes Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D125284	2022-05-09 19:46:47 -07:00
Mathieu Fehr	67d0bc27c0	[mlir][doc] Move documentation of extensible dialects Merge the documentation of the definition of extensible dialects with the definition of dialects. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125200	2022-05-09 16:01:12 -07:00
River Riddle	867cd5007d	[mlir-LSP] Ensure existing documents are process synchronously This prevents races where we accidentally launched multiple servers.	2022-05-09 15:23:23 -07:00
Thomas Raoux	09fc685ce6	[mlir][nvvm] Add attribute to nvvm.cpAsyncOp to control l1 bypass Add attribute to be able to generate the intrinsic version of async copy generating a copy with l1 bypass. This correspond to cp.async.cg.shared.global in ptx. Differential Revision: https://reviews.llvm.org/D125241	2022-05-09 19:34:48 +00:00
Stella Stamenova	057863a9bc	[mlir] Fix build & test of mlir python bindings on Windows There are a couple of issues with the python bindings on Windows: - `create_symlink` requires special permissions on Windows - using `copy_if_different` instead allows the build to complete and then be usable - the path to the `python_executable` is likely to contain spaces if python is installed in Program Files. llvm's python substitution adds extra quotes in order to account for this case, but mlir's own python substitution does not - the location of the shared libraries is different on windows - if the type is not specified for numpy arrays, they appear to be treated as strings I've implemented the smallest possible changes for each of these in the patch, but I would actually prefer a slightly more comprehensive fix for the python_executable and the shared libraries. For the python substitution, I think it makes sense to leverage the existing %python instead of adding %PYTHON and instead add a new variable for the case when preloading is needed. This would also make it clearer which tests are which and should be skipped on platforms where the preloading won't work. For the shared libraries, I think it would make sense to pass the correct path and extension (possibly even the names) to the python script since these are known by lit and don't have to be hardcoded in the test at all. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D125122	2022-05-09 11:10:20 -07:00
Jakub Tucholski	167bbfcb9d	[mlir] Refactoring dialect and test code to use parseCommaSeparatedList Issue #55173 Reviewed By: lattner, rriddle Differential Revision: https://reviews.llvm.org/D124791	2022-05-09 12:36:54 -04:00
Jerry Wu	ad7c49bef7	[mlir][linalg] Fix padding size calculation for Conv2d ops. This patch fixed the padding size calculation for Conv2d ops when the stride > 1. It contains the changes below: - Use addBound to add constraint for AffineApplyOp in getUpperBoundForIndex. So the result value can be mapped and retrieved later. - Fixed the bound from AffineMinOp by adding as a closed bound. Originally the bound was added as an open upper bound, which results in the incorrect bounds when we multiply the values. For example: ``` %0 = affine.min affine_map<()[s0] -> (4, -s0 + 11)>()[iv0] %1 = affine.apply affine_map<()[s0] -> (s0 * 2)>()[%0] If we add the affine.min as an open bound, addBound will internally transform it into the close bound "%0 <= 3". The following sliceBounds will derive the bound of %1 as "%1 <= 6" and return the open bound "%1 < 7", while the correct bound should be "%1 <= 8". ``` - In addition to addBound, I also changed sliceBounds to support returning closed upper bound, since for the size computation, we usually care about the closed bounds. - Change the getUpperBoundForIndex to favor constant bounds when required. The sliceBounds will return a tighter but non-constant bounds, which can't be used for padding. The constantRequired option requires getUpperBoundForIndex to get the constant bounds when possible. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124821	2022-05-09 08:45:37 -07:00
Ashay Rane	e287d647c6	[mlir] Add translation from tensor.reshape to memref.reshape This patch augments the `tensor-bufferize` pass by adding a conversion rule to translate ReshapeOp from the `tensor` dialect to the `memref` dialect, in addition to adding a unit test to validate the translation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125031	2022-05-09 17:45:07 +02:00
Benjamin Kramer	a48adc5658	[mlir][math] Promote (b)f16 to f32 when lowering to libm calls libm doesn't have overloads for the small types, so promote them to a bigger type and use the f32 function. Differential Revision: https://reviews.llvm.org/D125093	2022-05-09 11:59:55 +02:00
Christopher Bate	9879807393	[mlir][NvGpu] Fix nvgpu.mma.sync lowering to NVVM for f32, tf32 types Adds missing logic in the lowering from NvGPU to NVVM to support fp32 (in an accumulator operand) and tf32 (in multiplicand operand) types. Fixes logic in one of the helper functions for converting the result of a mma.sync operation with multiple 8x256bit output tiles, which is the case for f32 outputs. Differential Revision: https://reviews.llvm.org/D124533	2022-05-08 21:49:42 -06:00
Sam McCall	e571e1a6c3	Reland "[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR." This reverts commit `a1bb952e83`. I'd somehow missed updating llvm-yaml-parser-fuzzer, now fixed.	2022-05-07 13:49:54 +02:00
Aaron Ballman	a1bb952e83	Revert "[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR." This reverts commit `1c5e85b3da`. It broke a lot of bots with a link error: https://lab.llvm.org/buildbot/#/builders/171/builds/14222 https://lab.llvm.org/buildbot/#/builders/188/builds/13748 https://lab.llvm.org/buildbot/#/builders/109/builds/38127	2022-05-07 07:29:57 -04:00
Sam McCall	1c5e85b3da	[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR. All llvm-project fuzzers use this library to parse command-line arguments. Many of them don't deal with LLVM IR or modules in any way. Bundling those functions in one library forces build dependencies that don't need to be there. Among other things, this means check-clang-pseudo no longer depends on most of LLVM. Differential Revision: https://reviews.llvm.org/D125081	2022-05-07 12:11:51 +02:00
Mehdi Amini	25cd6fba98	Fix MLIR integration test after `a8308020` (`func.` prefix is required bythe parser now)	2022-05-07 09:09:24 +00:00
River Riddle	a8308020ac	[mlir] Remove special case parsing/printing of `func` operations This was leftover from when the standard dialect was destroyed, and when FuncOp moved to the func dialect. Now that these transitions have settled a bit we can drop these. Most updates were handled using a simple regex: replace `^( *)func` with `$1func.func` Differential Revision: https://reviews.llvm.org/D124146	2022-05-06 13:36:15 -07:00
Mehdi Amini	6a9c1029f8	Fix build with shared libs: add missing CMake dep to MLIR sparse pipeline	2022-05-06 20:20:03 +00:00
Mehdi Amini	b37d158f71	Apply clang-tidy fixes for bugprone-copy-constructor-init in TestPassManager.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	298d2fa1c5	Apply clang-tidy fixes for readability-identifier-naming in SparseTensorUtils.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	90c2af57af	Apply clang-tidy fixes for llvm-include-order in Merger.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	072e0aabbc	Enable the use of ThreadPoolTaskGroup in MLIR threading helper to enable nested parallelism The LLVM ThreadPool recently got the addition of the concept of ThreadPoolTaskGroup: this is a way to "partition" the threadpool into a group of tasks and enable nested parallelism through this grouping at every level of nesting. We make use of this feature in MLIR threading abstraction to fix a long lasting TODO and enable nested parallelism. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124902	2022-05-06 19:40:22 +00:00
Mehdi Amini	c5ea8d509c	Apply clang-tidy fixes for llvm-else-after-return in Merger.cpp (NFC)	2022-05-06 19:38:03 +00:00

1 2 3 4 5 ...

11224 Commits