llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Raoux	a57ccad5a6	[VectorToGPU] Fix horizontal stride calculation for N-D memref Fix a bug in how we calculate the stride of mma load/store ops for N-D memrefs Differential Revision: https://reviews.llvm.org/D118378	2022-01-27 13:35:56 -08:00
natashaknk	024a1fab5c	[tosa][mlir] Add dynamic shape support for remaining ops Added support for concat, tile, pad, argmax and table ops Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D118397	2022-01-27 11:25:38 -08:00
Benjamin Kramer	608cc6b163	[mlir][complex] Lower complex.constant to LLVM This fixes a regression from `480cd4cb85` Differential Revision: https://reviews.llvm.org/D118347	2022-01-27 13:48:23 +01:00
River Riddle	9f85c198db	[mlir] Finish replacing OwningRewritePatternList with RewritePatternSet OwningRewritePatternList has been deprecated for ~10 months now, we can remove the leftover using directives at this point. Differential Revision: https://reviews.llvm.org/D118287	2022-01-26 23:11:02 -08:00
River Riddle	632a4f8829	[mlir] Move std.generic_atomic_rmw to the memref dialect This is part of splitting up the standard dialect. The move makes sense anyways, given that the memref dialect already holds memref.atomic_rmw which is the non-region sibling operation of std.generic_atomic_rmw (the relationship is even more clear given they have nearly the same description % how they represent the inner computation). Differential Revision: https://reviews.llvm.org/D118209	2022-01-26 11:52:01 -08:00
Chuanqi Xu	dbbe010908	[MLIR] [AsyncToLLVM] Use llvm.coro.align intrinsic Use llvm.coro.align to align coroutine frame properly. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D117978	2022-01-25 19:28:25 +08:00
River Riddle	881ff4e4eb	[mlir] Remove unnecessary dependency on Tensor from MemRef	2022-01-24 22:59:01 -08:00
River Riddle	633f5badbf	[mlir] Add more missing dependencies after D118062 These used to be covered transitively, but now need to be explicit.	2022-01-24 22:12:03 -08:00
River Riddle	e697b97148	[mlir] Add more missing dependencies after D118062 These used to be covered transitively, but now need to be explicit.	2022-01-24 21:36:31 -08:00
River Riddle	b827b6340b	[mlir] Add missing dependencies after D118062 These used to be covered transitively, but now need to be explicit.	2022-01-24 21:31:37 -08:00
River Riddle	a70aa7bb0d	[mlir:Transforms] Move out the remaining non-dialect independent transforms and utilities This has been a major TODO for a very long time, and is necessary for establishing a proper dialect-free dependency layering for the Transforms library. Code was moved to effectively two main locations: * Affine/ There was quite a bit of affine dialect related code in Transforms/ due to historical reasons (of a time way into MLIR's past). The following headers were moved to: Transforms/LoopFusionUtils.h -> Dialect/Affine/LoopFusionUtils.h Transforms/LoopUtils.h -> Dialect/Affine/LoopUtils.h Transforms/Utils.h -> Dialect/Affine/Utils.h The following transforms were also moved: AffineLoopFusion, AffinePipelineDataTransfer, LoopCoalescing * SCF/ Only one SCF pass was in Transforms/ (likely accidentally placed here): ParallelLoopCollapsing The SCF specific utilities in LoopUtils have been moved to SCF/Utils.h * Misc: mlir::moveLoopInvariantCode was also moved to LoopLikeInterface.h given that it is a simple utility defined in terms of LoopLikeOpInterface. Differential Revision: https://reviews.llvm.org/D117848	2022-01-24 19:25:53 -08:00
harsh	e01e4c9115	Fix bugs in GPUToNVVM lowering The current lowering from GPU to NVVM does not correctly handle the following cases when lowering the gpu shuffle op. 1. When the active width is set to 32 (all lanes), then the current approach computes (1 << 32) -1 which results in poison values in the LLVM IR. We fix this by defining the active mask as (-1) >> (32 - width). 2. In the case of shuffle up, the computation of the third operand c has to be different from the other 3 modes due to the op definition in the ISA reference. (https://docs.nvidia.com/cuda/parallel-thread-execution/index.html) Specifically, the predicate value is computed as j >= maxLane for up and j <= maxLane for all other modes. We fix this by computing maskAndClamp as 32 - width for this mode. TEST: We modify the existing test and add more checks for the up mode. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D118086	2022-01-25 03:24:14 +00:00
Rob Suderman	3e746c6d9e	[mlir] Add support for ExpM1 to GLSL/OpenCL SPIRV Backends Adding a similar decomposition for exponential minus one to the SPIRV backends along with the necessary tests. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D118081	2022-01-24 15:38:34 -08:00
Alexander Belyaev	fd0c6f5391	[mlir] Move linalg::PadTensorOp to tensor::PadOp. RFC: https://llvm.discourse.group/t/rfc-move-linalg-padtensorop-to-tensor-padop/5785 Differential Revision: https://reviews.llvm.org/D117892	2022-01-21 20:02:39 +01:00
Lei Zhang	4710750854	[mlir][spirv] Support size-1 vector inserts during conversion Differential Revision: https://reviews.llvm.org/D115517	2022-01-21 13:56:26 -05:00
River Riddle	e084679f96	[mlir] Make locations required when adding/creating block arguments BlockArguments gained the ability to have locations attached a while ago, but they have always been optional. This goes against the core tenet of MLIR where location information is a requirement, so this commit updates the API to require locations. Fixes #53279 Differential Revision: https://reviews.llvm.org/D117633	2022-01-19 17:35:35 -08:00
natashaknk	b9b10c0e61	[tosa][mlir] Lowering for dynamic shapes in the reduce_x ops in tosa-to-linalg Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117691	2022-01-19 11:15:14 -08:00
Thomas Raoux	d9edc1a585	[mlir][spirv] Add math.fma lowering to spirv Differential Revision: https://reviews.llvm.org/D117704	2022-01-19 10:57:05 -08:00
River Riddle	7ceffae18c	[mlir] Convert OpTrait::FunctionLike to FunctionOpInterface This commit refactors the FunctionLike trait into an interface (FunctionOpInterface). FunctionLike as it is today is already a pseudo-interface, with many users checking the presence of the trait and then manually calling into functionality implemented in the function_like_impl namespace. By transitioning to an interface, these accesses are much cleaner (ideally with no direct calls to the impl namespace outside of the implementation of the derived function operations, e.g. for parsing/printing utilities). I've tried to maintain as much compatibility with the current state as possible, while also trying to clean up as much of the cruft as possible. The general migration plan for current users of FunctionLike is as follows: * function_like_impl -> function_interface_impl Realistically most user calls should remove references to functions within this namespace outside of a very narrow set (e.g. parsing/printing utilities). Calls to the attribute name accessors should be migrated to the `FunctionOpInterface::` equivalent, most everything else should be updated to be driven through an instance of the interface. * OpTrait::FunctionLike -> FunctionOpInterface `hasTrait` checks will need to be moved to isa, along with the other various Trait vs Interface API differences. * populateFunctionLikeTypeConversionPattern -> populateFunctionOpInterfaceTypeConversionPattern Fixes #52917 Differential Revision: https://reviews.llvm.org/D117272	2022-01-18 20:56:53 -08:00
River Riddle	4157455425	[mlir][Pass] Deprecate FunctionPass in favor of OperationPass<FuncOp> The only benefit of FunctionPass is that it filters out function declarations. This isn't enough to justify carrying it around, as we can simply filter out declarations when necessary within the pass. We can also explore better scheduling primitives to filter out declarations at the pipeline level in the future. The definition of FunctionPass is left intact for now to allow time for downstream users to migrate. Differential Revision: https://reviews.llvm.org/D117182	2022-01-18 19:52:44 -08:00
River Riddle	755dc07d69	[mlir:Analysis] Move the LoopAnalysis library to Dialect/Affine/Analysis The current state of the top level Analysis/ directory is that it contains two libraries; a generic Analysis library (free from dialect dependencies), and a LoopAnalysis library that contains various analysis utilities that originated from Affine loop transformations. This commit moves the LoopAnalysis to the more appropriate home of `Dialect/Affine/Analysis/`, given the use and intention of the majority of the code within it. After the move, if there are generic utilities that would fit better in the top-level Analysis/ directory, we can move them. Differential Revision: https://reviews.llvm.org/D117351	2022-01-18 10:28:22 -08:00
Mogball	aae5125550	[mlir] Replace StrEnumAttr -> EnumAttr in core dialects Removes uses of `StrEnumAttr` in core dialects Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D117514	2022-01-18 17:15:00 +00:00
Benjamin Kramer	964dc368e7	[AsyncToLLVM] aligned_alloc requires the size to be a multiple of alignment, so round up Fixes a crash with debug malloc.	2022-01-17 21:48:00 +01:00
Benjamin Kramer	5acd6e0522	[AsyncToLLVM] Align frames to 64 bytes Coroutine lowering always takes the natural alignment when spilling to the frame (issue #53148) so using AVX2 or AVX512 in a coroutine doesn't work. Always overalign to 64 bytes to avoid this issue until we have a better solution. Differential Revision: https://reviews.llvm.org/D117501	2022-01-17 18:51:42 +01:00
Stephan Herhut	aa3cabe3cb	[mlir][memref] Fix memref.copy of scalar memref Also fix a memory leak in the test while at it. Differential Revision: https://reviews.llvm.org/D117314	2022-01-14 16:13:12 +01:00
Stephan Herhut	ab95ba704d	[mlir][memref] Implement fast lowering of memref.copy In the absence of maps, we can lower memref.copy to a memcpy. Differential Revision: https://reviews.llvm.org/D116099	2022-01-14 14:22:15 +01:00
Mehdi Amini	cac7aabbd8	Apply clang-tidy fixes for readability-identifier-naming to MLIR (NFC)	2022-01-14 02:26:28 +00:00
Rob Suderman	173fce4205	[mlir][tosa] Update default tosa-to-linalg passes Adding the optional decompositions have been verified to improve memory usage on common models. Added the decomposition to the default tosa to linalg passes. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D117175	2022-01-13 10:17:44 -08:00
Benoit Jacob	499703e9c0	Enable ReassociatingReshapeOpConversion with "non-identity" layouts. This removes an early-return in this function, which seems unnecessary and is preventing some memref.collapse_shape from converting to LLVM (see included lit test). It seems unnecessary because the return message says "only empty layout map is supported" but there actually is code in this function to deal with non-empty layout maps. Maybe it refers to an earlier state of implementation and is just out of date? Though, there is another concern about this early return: the condition that it actually checks, `{src,dst}MemrefType.getLayout().isIdentity()`, is not quite the same as what the return message says, "only empty layout map is supported". Stepping through this `getLayout().isIdentity()` code in GDB, I found that it evaluates to `.getAffineMap().isIdentity()` which does (AffineMap.cpp:271): ``` if (getNumDims() != getNumResults()) return false; ``` This seems that it would always return false for memrefs of rank greater than 1? Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114808	2022-01-13 17:46:20 +00:00
natashaknk	310e9636ca	[tosa][mlir] Support dynamic batch dimension for ops where the batch dim is explicit Dynamic batch for rescale, gather, max_pool, avg_pool, conv2D and depthwise_conv2D. Split helper functions into a separate header file. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117031	2022-01-12 14:16:50 -08:00
River Riddle	676bfb2a22	[mlir] Refactor ShapedType into an interface ShapedType was created in a time before interfaces, and is one of the earliest type base classes in the ecosystem. This commit refactors ShapedType into an interface, which is what it would have been if interfaces had existed at that time. The API of ShapedType and its derived classes are essentially untouched by this refactor, with the exception being the API surrounding kDynamicIndex (which requires a sole home). For now, the API of ShapedType and its name have been kept as consistent to the current state of the world as possible (to help with potential migration churn, among other reasons). Moving forward though, we should look into potentially restructuring its API and possibly its name as well (it should really have "Interface" at the end like other interfaces at the very least). One other potentially interesting note is that I've attached the ShapedType::Trait to TensorType/BaseMemRefType to act as mixins for the ShapedType API. This is kind of weird, but allows for sharing the same API (i.e. preventing API loss from the transition from base class -> Interface). This inheritance doesn't affect any of the derived classes, it is just for API mixin. Differential Revision: https://reviews.llvm.org/D116962	2022-01-12 14:12:09 -08:00
Rob Suderman	aa1c533a4e	[mlir][tosa] Expand tosa.apply_scale lowering for vectors Apply scale may encounter scalar, tensor, or vector operations. Expand the lowering so that it can lower arbitrary container types. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D117080	2022-01-12 14:07:52 -08:00
River Riddle	d4d016869d	[mlir] Remove populateFuncOpTypeConversionPattern This method simply forwards to populateFunctionLikeTypeConversionPattern, which is more general. This also helps to remove special treatment of FuncOp from DialectConversion. Differential Revision: https://reviews.llvm.org/D116624	2022-01-12 14:05:35 -08:00
Aaron DeBattista	dfd070820c	[mlir][tosa] Allow optional TOSA decompositions to be populated separately Moved all TOSA decomposition patterns so that they can be optionally populated and used by external rewrites. This avoids decomposing TOSA operations when backends may benefit from the non-decomposed version. Reviewed By: rsuderman, mehdi_amini Differential Revision: https://reviews.llvm.org/D116526	2022-01-11 10:26:30 -08:00
Adrian Kuegel	50da013406	[mlir] Use const reference for loop iteration variable.	2022-01-07 12:00:53 +01:00
Alex Zinenko	cafaa35036	[mlir] Make it possible to directly supply constant values to LLVM GEPOp In LLVM IR, the GEP indices that correspond to structures are required to be i32 constants. MLIR models constants as just values defined by special operations, and there is no verification that it is the case for structure indices in GEP. Furthermore, some common transformations such as control flow simplification may lead to the operands becoming non-constant. Make it possible to directly supply constant values to LLVM GEPOp to guarantee they remain constant until the translation to LLVM IR. This is not yet a requirement and the verifier is not modified, this will be introduced separately. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D116757	2022-01-07 09:56:01 +01:00
Mehdi Amini	564bcf9d02	Align adaptor's generator accessors for attribute on the Op class Each attribute has two accessors: one suffixed with `Attr` which returns the attribute itself and one without the suffix which unwraps the attribute. For example for a StringAttr attribute with a field named `kind`, we'll generate: StringAttr getKindAttr(); StringRef getKind(); Differential Revision: https://reviews.llvm.org/D116466	2022-01-05 05:42:15 +00:00
Benjamin Kramer	41760a6b40	[mlir] Make Value's constructor constexpr. NFCI. This allows clang to flag unused Values in more cases, so remove them.	2022-01-04 21:04:13 +01:00
Uday Bondhugula	80b3f08eee	[MLIR][PDL] NFC. Fix unused variable warning in PDLToPDLInterp.cpp Differential Revision: https://reviews.llvm.org/D116571	2022-01-04 08:25:02 +05:30
Stanislav Funiak	138803e017	[MLIR][PDL] Make predicate order deterministic. The tree merging of pattern predicates places the predicates in an unordered set. When the predicates are sorted, they are taken in the set order, not the insertion order. This results in nondeterministic behavior. One solution to this problem would be to use `SetVector`. However, the value `SetVector` does not provide a `find` function for fast O(1) lookups and stores the predicates twice -- once in the set and once in the vector, which is undesirable, because we store patternToAnswer in each predicate. A simpler solution is to store the tie breaking ID (which follows the insertion order), and use this ID to break any ties when comparing predicates. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116081	2022-01-04 08:03:44 +05:30
Stanislav Funiak	2692eae574	[MLIR][PDL] Refactor the positions for multi-root patterns. When the original version of multi-root patterns was reviewed, several improvements were made to the pdl_interp operations during the review process. Specifically, the "get users of a value at the specified operand index" was split up into "get users" and "compare the users' operands with that value". The iterative execution was also cleaned up to `pdl_interp.foreach`. However, the positions in the pdl-to-pdl_interp lowering were not similarly refactored. This introduced several problems, including hard-to-detect bugs in the lowering and duplicate evaluation of `pdl_interp.get_users`. This diff cleans up the positions. The "upward" `OperationPosition` was split out into `UsersPosition` and `ForEachPosition`, and the operand comparison was replaced with a simple predicate. In the process, I fixed three bugs: 1. When multiple roots had the same connector (i.e., a node that they shared with a subtree at the previously visited root), we would generate a single foreach loop rather than one foreach loop for each such root. The reason for this is that such connectors shared the position. The solution for this is to add root index as an id to the newly introduced `ForEachPosition`. 2. Previously, we would use `pdl_interp.get_operands` indiscriminately, whether or not the operand was variadic. We now correctly detect variadic operands and insert `pdl_interp.get_operand` when needed. 3. In certain corner cases, we would trigger the "connector has not been traversed yet" assertion. This was caused by not inserting the values during the upward traversal correctly. This has now been fixed. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116080	2022-01-04 08:03:44 +05:30
Mehdi Amini	78389de4d3	Add back missing return to non-void function It was accidentally removed in `e4e463e747`.	2022-01-03 06:18:25 +00:00
Mehdi Amini	891a0d7ccd	Remove dead return after return (NFC)	2022-01-03 06:03:34 +00:00
Mehdi Amini	e4e463e747	Remove useless nesting block and dead return statement in TosaToLinalg.cpp (NFC) Flagged by Coverity.	2022-01-03 06:02:21 +00:00
Mehdi Amini	4f415216ca	Apply clang-tidy fixes for performance-unnecessary-value-param to MLIR (NFC)	2022-01-02 22:37:13 +00:00
Mehdi Amini	337c937ddb	Apply clang-tidy fixes for performance-move-const-arg to MLIR (NFC)	2022-01-02 22:36:56 +00:00
Mehdi Amini	e4853be2f1	Apply clang-tidy fixes for performance-for-range-copy to MLIR (NFC)	2022-01-02 22:19:56 +00:00
Mehdi Amini	6786d7e4f5	Apply clang-tidy fixes for readability-simplify-boolean-expr to MLIR (NFC) Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D116253	2022-01-02 01:59:31 +00:00
Mehdi Amini	f0fff1dfde	Remove unused applyPad function from TosaToLinalg.cpp (NFC)	2022-01-02 01:53:18 +00:00
Mehdi Amini	1fc096af1e	Apply clang-tidy fixes for performance-unnecessary-value-param to MLIR (NFC) Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116250	2022-01-02 01:45:18 +00:00
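
The active-mask fix described in commit e01e4c9115 above can be illustrated with a small standalone sketch. This is not the code from the GPUToNVVM lowering; the helper name `activeLaneMask` and the use of plain `uint32_t` arithmetic are assumptions made for illustration only. It shows why `(1 << width) - 1` breaks at `width == 32` and how the `(-1) >> (32 - width)` form from the commit message avoids the problem.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the commit): active-lane mask for a shuffle
// over `width` lanes, where 1 <= width <= 32.
// The naive form (1u << width) - 1 shifts by the full bit width when
// width == 32, which is undefined in C++ and yields poison in LLVM IR.
inline uint32_t activeLaneMask(uint32_t width) {
  assert(width >= 1 && width <= 32 && "width must be in [1, 32]");
  // All ones, logically shifted right; the shift amount stays in [0, 31].
  return 0xFFFFFFFFu >> (32u - width);
}
```

With this form, a width of 32 selects all lanes and smaller widths select the low `width` lanes, without ever shifting by 32.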

1384 Commits