llvm-project

Commit Graph

Author	SHA1	Message	Date
George Mitenkov	fc148a4c88	[MLIR][SPIRVToLLVM] Added conversion for SPIR-V comparison ops Implemented `FComparePattern` and `IComparePattern` classes that provide conversion of SPIR-V comparison ops (such as `spv.FOrdGreaterThanEqual` and others) to LLVM dialect. Also added tests in `comparison-ops-to-llvm.mlir`. Differential Revision: https://reviews.llvm.org/D81487	2020-06-11 18:46:17 -04:00
Alexander Belyaev	e9ac792748	[mlir] Fix some of the warnings in MLIR code. Summary: * extra ';' in the following files: mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp mlir/lib/Dialect/Shape/IR/Shape.cpp * base class ‘mlir::ConvertVectorToSCFBase<ConvertVectorToSCFPass>’ should be explicitly initialized in the copy constructor [-Wextra] in mlir/lib/Conversion/VectorToSCF/VectorToSCF.cpp * warning: ‘bool Expression::operator==(const Expression&) const’ defined but not used [-Wunused-function] in mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp Differential Revision: https://reviews.llvm.org/D81673	2020-06-11 22:18:32 +02:00
jerryyin	2abad3433f	[mlir][rocdl] Adding vector to ROCDL dialect lowering * Created the vector to ROCDL lowering pass * The lowering pass lowers vector transferOps to rocdl mubufOps * Added unit test and functional test	2020-06-11 14:28:13 +00:00
George Mitenkov	d93d8fcdec	[MLIR][SPIRVToLLVM] Implemented conversion for arithmetic ops and 3 bitwise ops. Following the previous revision `D81100`, this commit implements a templated class that would provide conversion patterns for “straightforward” SPIR-V ops into LLVM dialect. Templating allows to abstract away from concrete implementation for each specific op. Those are mainly binary operations. Currently supported and tested ops are: - Arithmetic ops: `IAdd`, `ISub`, `IMul`, `FAdd`, `FSub`, `FMul`, `FDiv`, `FNegate`, `SDiv`, `SRem` and `UDiv` - Bitwise ops: `BitwiseAnd`, `BitwiseOr`, `BitwiseXor` The implementation relies on `SPIRVToLLVMConversion` class that makes use of `OpConversionPattern`. Differential Revision: https://reviews.llvm.org/D81305	2020-06-10 19:10:31 -04:00
Frederik Gossen	904f91db5f	[MLIR][Standard] Make the `dim` operation index an operand. Allow for dynamic indices in the `dim` operation. Rather than an attribute, the index is now an operand of type `index`. This allows to apply the operation to dynamically ranked tensors. The correct lowering of dynamic indices remains to be implemented. Differential Revision: https://reviews.llvm.org/D81551	2020-06-10 13:54:47 +00:00
Stephan Herhut	2c8afe1298	[mlir][gpu] Add support for f16 when lowering to nvvm intrinsics Summary: The NVVM target only provides implementations for tanh etc. on f32 and f64 operands. To also support f16, we now insert operations to extend to f32 and truncate back to f16 around the intrinsic call. Differential Revision: https://reviews.llvm.org/D81473	2020-06-09 19:33:45 +02:00
George Mitenkov	fda5192d4f	[MLIR][SPIRVToLLVM] Add skeleton for SPIR-V to LLVM dialect conversion These commits set up the skeleton for SPIR-V to LLVM dialect conversion. I created SPIR-V to LLVM pass, registered it in Passes.td, InitAllPasses.h. Added a pattern for `spv.BitwiseAndOp` and tests for it. Integer, float and vector types are converted through LLVMTypeConverter. Differential Revision: https://reviews.llvm.org/D81100	2020-06-08 18:22:42 -04:00
Alexander Belyaev	80be54c08f	[mlir] Lower Shape binary ops (AddOp, MulOp) to Standard. Differential Revision: https://reviews.llvm.org/D81344	2020-06-08 17:48:01 +02:00
Frederik Gossen	970bb4a291	[MLIR] Add `to/from_extent_tensor` lowering to the standard dialect The operations `to_extent_tensor` and `from_extent_tensor` become no-ops when lowered to the standard dialect. This is possible with a lowering from `shape.shape` to `tensor<?xindex>`. Differential Revision: https://reviews.llvm.org/D81162	2020-06-08 09:38:18 +00:00
Frederik Gossen	867bc41e85	[MLIR] Add type conversion for `shape.shape` Convert `shape.shape` to `tensor<?xindex>` when lowering the `shape` to the `std` dialect. Differential Revision: https://reviews.llvm.org/D81161	2020-06-08 09:34:03 +00:00
Frederik Gossen	24edbdf99b	[MLIR] Clean up `shape` to `std` lowering Apply post-commit suggestions to the new lowering. Differential Revision: https://reviews.llvm.org/D81160	2020-06-08 08:59:53 +00:00
Jacques Pienaar	92cb0ce8f8	[mlir] Change to re-enable cuda-runner tests mlir-cuda-runner tests were failing post https://reviews.llvm.org/D80676, small change to get those passing again. More cleanup may be needed post.	2020-06-06 09:31:51 -07:00
Wen-Heng (Jack) Chung	2fd6403a6d	[mlir][gpu] Introduce mlir-rocm-runner. Summary: `mlir-rocm-runner` is introduced in this commit to execute GPU modules on ROCm platform. A small wrapper to encapsulate ROCm's HIP runtime API is also inside the commit. Due to behavior of ROCm, raw pointers inside memrefs passed to `gpu.launch` must be modified on the host side to properly capture the pointer values addressable on the GPU. LLVM MC is used to assemble AMD GCN ISA coming out from `ConvertGPUKernelToBlobPass` to binary form, and LLD is used to produce a shared ELF object which could be loaded by ROCm HIP runtime. gfx900 is the default target be used right now, although it could be altered via an option in `mlir-rocm-runner`. Future revisions may consider using ROCm Agent Enumerator to detect the right target on the system. Notice AMDGPU Code Object V2 is used in this revision. Future enhancements may upgrade to AMDGPU Code Object V3. Bitcode libraries in ROCm-Device-Libs, which implements math routines exposed in `rocdl` dialect are not yet linked, and is left as a TODO in the logic. Reviewers: herhut Subscribers: mgorny, tpr, dexonsmith, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #mlir, #llvm Differential Revision: https://reviews.llvm.org/D80676	2020-06-05 09:46:39 -05:00
Nicolas Vasilache	247e185dd5	[mlir][Vector] Move temporary alloc to top of the function alloca when lowering vector_transfers Recently introduced allocation hoisting is quite conservative on the cases when it triggers. This revision makes it such that the allocations for vector transfer lowerings are hoisted to the top of the function. This should be revisited in the context of parallelism and is a temporary workaround. Differential Revision: https://reviews.llvm.org/D81253	2020-06-05 08:45:52 -04:00
Diego Caballero	5c990d6994	[mlir] Add support for bf16 to StandardToLLVM conversion Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D81127	2020-06-04 14:36:36 -07:00
Thomas Raoux	661235e126	[mlir][gpu] Add subgroup Id/Size/Num to GPU dialect Add SubgroupId, SubgroupSize and NumSubgroups to GPU dialect ops and add the lowering of those ops to SPIRV. Differential Revision: https://reviews.llvm.org/D81042	2020-06-04 10:52:40 -07:00
Hanhan Wang	0b025d2733	[mlir][StandardToSPIRV] Handle i1 case for lowering std.zexti to SPIR-V. Differential Revision: https://reviews.llvm.org/D80965	2020-06-03 15:01:18 -07:00
Frederik Gossen	3713314bfa	[MLIR] Shape to standard dialect lowering Add a new pass to lower operations from the `shape` to the `std` dialect. The conversion applies only to the `size_to_index` and `index_to_size` operations and affected types. Other patterns will be added as needed. Differential Revision: https://reviews.llvm.org/D81091	2020-06-03 16:17:03 +00:00
Alex Zinenko	5c5dafc534	[mlir] support materialization for 1-1 type conversions Dialect conversion infrastructure supports 1->N type conversions by requiring individual conversions to provide facilities to generate operations retrofitting N values into 1 of the original type when N > 1. This functionality can also be used to materialize explicit "cast"-like operations, but it did not support 1->1 type conversions until now. Modify TypeConverter to support materialization of cast operations for 1-1 conversions. This also makes materialization specification more extensible following the same pattern as type conversions. Instead of overloading a virtual function, users or subclasses of TypeConversion can now register type-specific materialization callbacks that will be called in order for the given type. Differential Revision: https://reviews.llvm.org/D79729	2020-06-02 13:48:33 +02:00
Alex Zinenko	eb8edd8526	[mlir] SCFToGPUPass: fix macros referring to LOOPS to use SCF instead One header guard was overlooked when renaming LoopOps to SCF, rename it. Also drop two unused macros, one of which referred to LoopOp (not "Ops", hence the overlook).	2020-06-02 13:03:13 +02:00
MaheshRavishankar	2bcd1927dd	[mlir][SCFToGPU] Remove conversions from scf.for to gpu.launch. Keeping in the affine.for to gpu.launch conversions, which should probably be the affine.parallel to gpu.launch conversion as well. Differential Revision: https://reviews.llvm.org/D80747	2020-06-01 23:06:20 -07:00
Nicolas Vasilache	5f9e0466f2	[mlir][Vector] Fix vector.transfer alignment calculation https://reviews.llvm.org/D79246 introduces alignment propagation for vector transfer operations. Unfortunately, the alignment calculation is incorrect and can result in crashes. This revision fixes the calculation by using the natural alignment of the memref elemental type, instead of the resulting vector type. If more alignment is desired, it can be done in 2 ways: 1. use a proper vector.type_cast to transform a memref<axbxcxdxf32> into a memref<axbxvector<cxdxf32>> giving a natural alignment of vector<cxdxf32> 2. add an alignment attribute to vector transfer operations and propagate it. With this change the alignment in the relevant tests goes down from 128 to 4. Lastly, a few minor cleanups are performed and the custom `isMinorIdentityMap` is deprecated. Differential Revision: https://reviews.llvm.org/D80734	2020-05-28 17:58:51 -04:00
Stephen Neuendorffer	8b3155829a	[MLIR] Fix build when NVPTX is not enabled In this case, neither target is selected, but there is still a dependence on the MC library (through the TargetOptions.h include)	2020-05-28 14:07:55 -07:00
Alex Zinenko	72ede60b75	[mlir][GPU] Link relevant LLVM components in GPUCommon instead of test D80142 restructured MLIR-to-GPU-binary conversion to support multiple targets. It also modified cmake files to link relevant LLVM components in test/lib, which broke shared-library builds, and likely made the conversions unusable outside mlir-opt (or other tools that link in test library targets). Link these components to GPUCommon instead. Differential Revision: https://reviews.llvm.org/D80739	2020-05-28 20:01:54 +02:00
Jacques Pienaar	fefe4366c3	[mlir] Use ValueRange instead of ArrayRef<Value> This allows constructing operand adaptor from existing op (useful for commonalizing verification as I want to do in a follow up). I also add ability to use member initializers for the generated adaptor constructors for convenience. Differential Revision: https://reviews.llvm.org/D80667	2020-05-28 09:05:24 -07:00
Wen-Heng (Jack) Chung	061fb8eb2d	[mlir][gpu][mlir-cuda-runner] Refactor ConvertKernelFuncToCubin to be generic. Make ConvertKernelFuncToCubin pass to be generic: - Rename to ConvertKernelFuncToBlob. - Allow specifying triple, target chip, target features. - Initializing LLVM backend is supplied by a callback function. - Lowering process from MLIR module to LLVM module is via another callback. - Change mlir-cuda-runner to adopt the revised pass. - Add new tests for lowering to ROCm HSA code object (HSACO). - Tests for CUDA and ROCm are kept in separate directories. Differential Revision: https://reviews.llvm.org/D80142	2020-05-28 09:08:28 -05:00
aartbik	c295a65da4	[mlir] [VectorOps] Add 'vector.flat_transpose' operation Summary: Provides a representation of the linearized LLVM instrinsic. With tests and lowering implementation to LLVM IR dialect. Prepares better lowering for 2-D vector.transpose. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, dcaballe Reviewed By: ftynse, dcaballe Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80419	2020-05-27 11:09:48 -07:00
MaheshRavishankar	4d6f44f5f0	[mlir][spirv] Lower allocation/deallocations of workgroup memory. This allocation of a workgroup memory is lowered to a spv.globalVariable. Only static size allocation with element type being int or float is handled. The lowering does account for the element type that are not supported in the lowered spv.module based on the extensions/capabilities and adjusts the number of elements to get the same byte length. Differential Revision: https://reviews.llvm.org/D80411	2020-05-27 09:53:16 -07:00
Alex Zinenko	cadb7ccf2c	[mlir] SCF: provide function_ref builders for IfOp Now that OpBuilder is available in `build` functions, it becomes possible to populate the "then" and "else" regions directly when building the "if" operation. This is desirable in more structured forms of builders, especially in when conditionals are mixed with loops. Provide new `build` APIs taking callbacks for body constructors, similarly to scf::ForOp, and replace more clunky edsc::BlockBuilder uses with these. The original APIs remain available and go through the new implementation. Differential Revision: https://reviews.llvm.org/D80527	2020-05-27 16:12:58 +02:00
Benjamin Kramer	a9b5edc5e2	Make mlir::Value's bool conversion operator explicit This still allows `if (value)` while requiring an explicit cast when not in a boolean context. This means things like `std::set<Value>` will no longer compile. Differential Revision: https://reviews.llvm.org/D80497	2020-05-25 18:22:00 +02:00
Wen-Heng (Jack) Chung	2cbbc266ec	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-21 08:53:47 -05:00
Mehdi Amini	5c3ebd7725	Revert "[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass." This reverts commit `cdb6f05e2d`. The build is broken with: You have called ADD_LIBRARY for library obj.MLIRGPUtoCUDATransforms without any source files. This typically indicates a problem with your CMakeLists.txt file	2020-05-21 03:44:35 +00:00
Nicolas Vasilache	3393cc4ceb	[mlir] NFC - Appease GCC 5 again..	2020-05-20 17:58:18 -04:00
Wen-Heng (Jack) Chung	cdb6f05e2d	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-20 16:11:48 -05:00
Nicolas Vasilache	ebf14d9b6d	[mlir] NFC - Appease GCC 5 again..	2020-05-20 16:47:45 -04:00
MaheshRavishankar	0e88eb5c51	[mlir][spirv] Adapt subview legalization to the updated op semantics. The subview semantics changes recently to allow for more natural representation of constant offsets and strides. The legalization of subview op for lowering to SPIR-V needs to account for this. Also change the linearization to use the strides from the affine map of a memref. Differential Revision: https://reviews.llvm.org/D80270	2020-05-20 12:00:21 -07:00
Nicolas Vasilache	7c3c5b11b1	[mlir][Vector] Add option to fully unroll for VectorTransfer to SCF lowering Summary: Previously, the only support partial lowering from vector transfers to SCF was going through loops. This requires a dedicated allocation and extra memory roundtrips because LLVM aggregates cannot be indexed dynamically (for more details see the [deep-dive](https://mlir.llvm.org/docs/Dialects/Vector/#deeperdive)). This revision allows specifying full unrolling which removes this additional roundtrip. This should be used carefully though because full unrolling will spill, negating the benefits of removing the interim alloc in the first place. Proper heuristics are left for a later time. Differential Revision: https://reviews.llvm.org/D80100	2020-05-20 11:02:13 -04:00
Alex Zinenko	a7d88a9038	[mlir] SCFToStandard: support any ops in and around the control flow ops Originally, the SCFToStandard conversion only declared Ops from the Standard dialect as legal after conversion. This is undesirable as it would fail the conversion if the SCF ops contained ops from any other dialect. Furthermore, this would be problematic for progressive lowering of `scf.parallel` to `scf.for` after `ensureRegionTerminator` is made aware of the pattern rewriting infrastructure because it creates temporary `scf.yield` operations declared illegal. Change the legalization target to declare any op other than `scf.for`, `scf.if` and `scf.parallel` legal. Differential Revision: https://reviews.llvm.org/D80137	2020-05-20 16:12:05 +02:00
Alex Zinenko	57cbeaa8b5	[mlir] Erase or clear blocks through ConversionPatternRewriter when applicable Multiple places in the code base were erasing Blocks or operations in them using in-place modifications (`Block::erase` or `Block::clear`) unknown to ConversionPatternRewriter. These operations could not be undone if the pattern failed and could lead to inconsistent in-memory state of the IR with dangling pointers. Use `ConversionPatternRewriter::eraseOp` and `::eraseBlock` instead. Differential Revision: https://reviews.llvm.org/D80136	2020-05-20 16:12:05 +02:00
Nicolas Vasilache	da95a0d8cc	[mlir] NFC - Appease gcc 5 This should fix the error ``` VectorToSCF.cpp:238:62: error: specialization of 'template<class ConcreteOp> mlir::LogicalResult {anonymous}::NDTransferOpHelper<ConcreteOp>::doReplace()' in different namespace ```	2020-05-19 22:47:51 -04:00
Benjamin Kramer	350dadaa8a	Give helpers internal linkage. NFC.	2020-05-19 22:16:37 +02:00
Hanhan Wang	520a570268	[mlir][StandardToSPIRV] Fix signedness issue in bitwidth emulation. Summary: Previously, after applying the mask, a negative number would convert to a positive number because the sign flag was forgotten. This patch adds two more shift operations to do the sign extension. This assumes that we're using two's complement. This patch applies sign extension unconditionally when loading a unspported integer width, and it relies the pattern to do the casting because the signedness semantic is carried by operator itself. Differential Revision: https://reviews.llvm.org/D79753	2020-05-19 11:00:01 -07:00
Alex Zinenko	d1560f3956	[mlir] scf::ForOp: provide builders with callbacks for loop body Thanks to a recent change that made `::build` functions take an instance of `OpBuilder`, it is now possible to build operations within a region attached to the operation about to be created. Exercise this on `scf::ForOp` by taking a callback that populates the loop body while the loop is being created. Additionally, provide helper functions to build perfect nests of `ForOp`s, with support for iteration arguments. These functions provide the same functionality as EDSC LoopNestBuilder with simpler implementation, without relying on edsc::ScopedContext, and using `OpBuilder` in an unambiguous way. Compatibility functions for EDSC are provided, but may be removed in the future. Differential Revision: https://reviews.llvm.org/D79688	2020-05-19 16:26:29 +02:00
George	3c4ef74555	Fixed a typo in the comment for allocateBuffer() Differential Revision: https://reviews.llvm.org/D80087	2020-05-18 14:41:57 -04:00
Nicolas Vasilache	1870e787af	[mlir][Vector] Add an optional "masked" boolean array attribute to vector transfer operations Summary: Vector transfer ops semantic is extended to allow specifying a per-dimension `masked` attribute. When the attribute is false on a particular dimension, lowering to LLVM emits unmasked load and store operations. Differential Revision: https://reviews.llvm.org/D80098	2020-05-18 11:52:08 -04:00
Nicolas Vasilache	36cdc17f8c	[mlir][Vector] Make minor identity permutation map optional in transfer op printing and parsing Summary: This revision makes the use of vector transfer operatons more idiomatic by allowing to omit and inferring the permutation_map. Differential Revision: https://reviews.llvm.org/D80092	2020-05-18 11:41:27 -04:00
Stephen Neuendorffer	37ce8d6ade	[MLIR] Fix linkage for libMLIR.so Generally: 1) don't use target_link_libraries() and add_mlir_library() on the same target, use LINK_LIBS PUBLIC instead. 2) don't use LINK_LIBS to specify LLVM libraries. Use LINK_COMPONENTS instead 3) no need to link against LLVMSupport. We pull it in by default. Differential Revision: https://reviews.llvm.org/D80076	2020-05-17 13:46:52 -07:00
Stephen Neuendorffer	1cff8e8de7	[MLIR] LinalgToStandard: use LINK_LIBS rather than target_link_libraries. Also, missing MLIRTransforms as a dependency. This breaks BUILD_SHARED_LIBS=on Differential Revision: https://reviews.llvm.org/D80035	2020-05-15 14:25:28 -07:00
aartbik	b1c688dbae	[mlir] [VectorOps] Implement vector.create_mask lowering to LLVM IR Summary: First, compact implementation of lowering to LLVM IR. A bit more challenging than the constant mask due to the dynamic indices, of course. I like to hear if there are more efficient ways of doing this in LLVM, but this for now at least gives us a functional reference implementation. Reviewers: nicolasvasilache, ftynse, bkramer, reidtatge, andydavis1, mehdi_amini Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79954	2020-05-15 11:02:30 -07:00
Alex Zinenko	4ead2cf76c	[mlir] Rename conversions involving ex-Loop dialect to mention SCF The following Conversions are affected: LoopToStandard -> SCFToStandard, LoopsToGPU -> SCFToGPU, VectorToLoops -> VectorToSCF. Full file paths are affected. Additionally, drop the 'Convert' prefix from filenames living under lib/Conversion where applicable. API names and CLI options for pass testing are also renamed when applicable. In particular, LoopsToGPU contains several passes that apply to different kinds of loops (`for` or `parallel`), for which the original names are preserved. Differential Revision: https://reviews.llvm.org/D79940	2020-05-15 10:45:11 +02:00
Alex Zinenko	7fc5f28068	[mlir] LinalgToStandard: add build dependency on MLIRPass This is supposed to resolve the build problem with shared libraries.	2020-05-15 10:20:04 +02:00
MaheshRavishankar	0b3e478b10	[mlir][GPUToSPIRV] Use default ABI only when none of the arguments have abi attributes. To ensure there is no conflict, use the default ABI only when none of the arguments have the spv.interface_var_abi attribute. This also implies that if one of the arguments has a spv.interface_var_abi attribute, all of them should have it as well. Differential Revision: https://reviews.llvm.org/D77232	2020-05-14 21:48:51 -07:00
Nicolas Vasilache	f1b972041a	[mlir][Linalg] Start a LinalgToStandard pass and move conversion to library calls. This revision starts decoupling the include the kitchen sink behavior of Linalg to LLVM lowering by inserting a -convert-linalg-to-std pass. The lowering of linalg ops to function calls was previously lowering to memref descriptors by having both linalg -> std and std -> LLVM patterns in the same rewrite. When separating this step, a new issue occurred: the layout is automatically type-erased by this process. This revision therefore introduces memref casts to perform these type erasures explicitly. To connect everything end-to-end, the LLVM lowering of MemRefCastOp is relaxed because it is artificially more restricted than the op semantics. The op semantics already guarantee that source and target MemRefTypes are cast-compatible. An invalid lowering test now becomes valid and is removed. Differential Revision: https://reviews.llvm.org/D79468	2020-05-15 00:24:03 -04:00
Diego Caballero	bc5565f9ea	[mlir][Affine] Introduce affine.vector_load and affine.vector_store This patch adds `affine.vector_load` and `affine.vector_store` ops to the Affine dialect and lowers them to `vector.transfer_read` and `vector.transfer_write`, respectively, in the Vector dialect. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D79658	2020-05-14 13:17:58 -07:00
Alex Zinenko	60f443bb3b	[mlir] Change dialect namespace loop->scf All ops of the SCF dialect now use the `scf.` prefix instead of `loop.`. This is a part of dialect renaming. Differential Revision: https://reviews.llvm.org/D79844	2020-05-13 19:20:21 +02:00
MaheshRavishankar	49e6c19100	[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. Differential Revision: https://reviews.llvm.org/D79505	2020-05-12 23:15:25 -07:00
Nicolas Vasilache	63c0e72b2f	[mlir] Revisit std.subview handling of static information. The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. Lowering to LLVM is updated, simplified and now supports all cases. A mixed static-dynamic mode test that wouldn't previously lower is added. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Differential Revision: https://reviews.llvm.org/D79662	2020-05-12 20:04:44 -04:00
Alex Zinenko	473bdaf2e8	[mlir] Move Conversion/StandardToStandard to Dialect/StandardOps/Transforms/FuncConversions Conversion/ folders were originally intended to store patterns for DialectA->DialectB conversions that depend on both dialects and do not conceptually belong to either of the dialects. As such, DialectA->DialectA conversion does not make sense under Conversion/ and should rather live with the dialect it operates on. Differential Revision: https://reviews.llvm.org/D79569	2020-05-13 00:33:25 +02:00
Hanhan Wang	756d6959d7	[mlir][StandardToSPIRV] Add support for lowering index_cast to SPIR-V. Differential Revision: https://reviews.llvm.org/D79644	2020-05-11 15:41:25 -07:00
Reid Tatge	334a4159ec	[mlir][Vector] NFC - Rename vector.strided_slice into vector.extract_strided_slice Differential Revision: https://reviews.llvm.org/D79734	2020-05-11 14:21:10 -07:00
Sean Silva	98eead8186	[mlir][Value] Add v.getDefiningOp<OpTy>() Summary: This makes a common pattern of `dyn_cast_or_null<OpTy>(v.getDefiningOp())` more concise. Differential Revision: https://reviews.llvm.org/D79681	2020-05-11 12:55:27 -07:00
Nicolas Vasilache	6ed61a26c2	[mlir] Simplify and better document std.view semantics This [discussion](https://llvm.discourse.group/t/viewop-isnt-expressive-enough/991/2) raised some concerns with ViewOp. In particular, the handling of offsets is incorrect and does not match the op description. Note that with an elemental type change, offsets cannot be part of the type in general because sizeof(srcType) != sizeof(dstType). Howerver, offset is a poorly chosen term for this purpose and is renamed to byte_shift. Additionally, for all intended purposes, trying to support non-identity layouts for this op does not bring expressive power but rather increases code complexity. This revision simplifies the existing semantics and implementation. This simplification effort is voluntarily restrictive and acts as a stepping stone towards supporting richer semantics: treat the non-common cases as YAGNI for now and reevaluate based on concrete use cases once a round of simplification occurred. Differential revision: https://reviews.llvm.org/D79541	2020-05-11 12:29:23 -04:00
Alex Zinenko	c25b20c0f6	[mlir] NFC: Rename LoopOps dialect to SCF (Structured Control Flow) This dialect contains various structured control flow operaitons, not only loops, reflect this in the name. Drop the Ops suffix for consistency with other dialects. Note that this only moves the files and changes the C++ namespace from 'loop' to 'scf'. The visible IR prefix remains the same and will be updated separately. The conversions will also be updated separately. Differential Revision: https://reviews.llvm.org/D79578	2020-05-11 15:04:27 +02:00
Hanhan Wang	3f07cab312	[mlir][StandardToLLVM] Add support for lowering FPToSIOp to LLVM. Summary: Depends On D79374 Differential Revision: https://reviews.llvm.org/D79455	2020-05-11 01:29:24 -07:00
Hanhan Wang	ac691c4fe7	[mlir][StandardToSPIRV] Add support for lowering FPToSIOp to SPIR-V. Summary: Depends On D79373 Differential Revision: https://reviews.llvm.org/D79374	2020-05-11 01:27:54 -07:00
Frederik Gossen	5d5f61fc89	[MLIR] Add complex addition and substraction to the standard dialect Complex addition and substraction are the first two binary operations on complex numbers. Remaining operations will follow the same pattern. Differential Revision: https://reviews.llvm.org/D79479	2020-05-08 09:54:18 +00:00
Alex Zinenko	a99f62c40a	[mlir] VectorToLLVM: propagate up from replaceTransferOp In the Vector to LLVM conversion, the `replaceTransferOp` function calls into a type converter that may fail and suppresses the status. Change the function to return the failure status instead, Since it is called from a pattern, the failure can be readily propagated to the rest of infrastructure.	2020-05-07 11:53:48 +02:00
Wen-Heng (Jack) Chung	a23f190213	[mlir][vector] set alignment when lowering transfer_read and transfer_write. When emitting masked load / store, set alignment from data layout. Differential Revision: https://reviews.llvm.org/D79246	2020-05-07 11:44:25 +02:00
Stephen Neuendorffer	e78ef9385c	[MLIR] GPUToCUDA conversion: MC is only needed if NVPTX is enabled. This patch conditionally links with MC	2020-05-05 08:55:17 -07:00
Alexander Belyaev	b79751e83d	[MLIR] Add conversion from AtomicRMWOp -> GenericAtomicRMWOp. Adding this pattern reduces code duplication. There is no need to have a custom implementation for lowering to llvm.cmpxchg. Differential Revision: https://reviews.llvm.org/D78753	2020-05-05 10:32:13 +02:00
Stephen Neuendorffer	5469f434bb	[MLIR] Reapply: Adjust libMLIR building to more closely follow libClang This reverts commit `ab1ca6e60f`.	2020-05-04 20:47:57 -07:00
Stephen Neuendorffer	146192ade4	[MLIR] Normalize usage of intrinsics_gen Portions of MLIR which depend on LLVMIR generally need to depend on intrinsics_gen, to ensure that tablegen'd header files from LLVM are built first. Without this, we get errors, typically about llvm/IR/Attributes.inc not being found. Note that previously the Linalg Dialect depended on intrinsics_gen, but it doesn't need to, since it doesn't use LLVMIR. Differential Revision: https://reviews.llvm.org/D79389	2020-05-04 20:47:57 -07:00
River Riddle	1e4faf23ff	[mlir][IR] Add a Region::getOps method that returns a range of immediately nested operations This allows for walking the operations nested directly within a region, without traversing nested regions. Differential Revision: https://reviews.llvm.org/D79056	2020-05-04 17:46:25 -07:00
Hanhan Wang	5d10613b6e	[mlir][StandardToSPIRV] Emulate bitwidths not supported for store op. Summary: As D78974, this patch implements the emulation for store op. The emulation is done with atomic operations. E.g., if the storing value is i8, rewrite the StoreOp to: 1) load a 32-bit integer 2) clear 8 bits in the loading value 3) store 32-bit value back 4) load a 32-bit integer 5) modify 8 bits in the loading value 6) store 32-bit value back The step 1 to step 3 are done by AtomicAnd as one atomic step, and the step 4 to step 6 are done by AtomicOr as another atomic step. Differential Revision: https://reviews.llvm.org/D79272	2020-05-04 15:18:44 -07:00
Stephen Neuendorffer	ab1ca6e60f	Revert "[MLIR] Adjust libMLIR building to more closely follow libClang" This reverts commit `4f0f436749`. This seems to show some compile dependence problems, and also breaks flang.	2020-05-04 12:40:12 -07:00
Valentin Churavy	4f0f436749	[MLIR] Adjust libMLIR building to more closely follow libClang - Exports MLIR targets to be used out-of-tree. - mimicks `add_clang_library` and `add_flang_library`. - Fixes libMLIR.so After https://reviews.llvm.org/D77515 libMLIR.so was no longer containing any object files. We originally had a cludge there that made it work with the static initalizers and when switchting away from that to the way the clang shlib does it, I noticed that MLIR doesn't create a `obj.{name}` target, and doesn't export it's targets to `lib/cmake/mlir`. This is due to MLIR using `add_llvm_library` under the hood, which adds the target to `llvmexports`. Differential Revision: https://reviews.llvm.org/D78773 [MLIR] Fix libMLIR.so and LLVM_LINK_LLVM_DYLIB Primarily, this patch moves all mlir references to LLVM libraries into either LLVM_LINK_COMPONENTS or LINK_COMPONENTS. This enables magic in the llvm cmake files to automatically replace reference to LLVM components with references to libLLVM.so when necessary. Among other things, this completes fixing libMLIR.so, which has been broken for some configurations since D77515. Unlike previously, the pattern is now that mlir libraries should almost always use add_mlir_library. Previously, some libraries still used add_llvm_library. However, this confuses the export of targets for use out of tree because libraries specified with add_llvm_library are exported by LLVM. Instead users which don't need/can't be linked into libMLIR.so can specify EXCLUDE_FROM_LIBMLIR A common error mode is linking with LLVM libraries outside of LINK_COMPONENTS. This almost always results in symbol confusion or multiply defined options in LLVM when the same object file is included as a static library and as part of libLLVM.so. To catch these errors more directly, there's now mlir_check_all_link_libraries. To simplify usage of add_mlir_library, we assume that all mlir libraries depend on LLVMSupport, so it's not necessary to separately specify it. tested with: BUILD_SHARED_LIBS=on, BUILD_SHARED_LIBS=off + LLVM_BUILD_LLVM_DYLIB, BUILD_SHARED_LIBS=off + LLVM_BUILD_LLVM_DYLIB + LLVM_LINK_LLVM_DYLIB. By: Stephen Neuendorffer <stephen.neuendorffer@xilinx.com> Differential Revision: https://reviews.llvm.org/D79067 [MLIR] Move from using target_link_libraries to LINK_LIBS This allows us to correctly generate dependencies for derived targets, such as targets which are created for object libraries. By: Stephen Neuendorffer <stephen.neuendorffer@xilinx.com> Differential Revision: https://reviews.llvm.org/D79243 Three commits have been squashed to avoid intermediate build breakage.	2020-05-04 11:40:46 -07:00
Frederik Gossen	031265ad8a	[MLIR] Add complex numbers to standard dialect Add `CreateComplexOp`, `ReOp`, and `ImOp` to the standard dialect. This is the first step to support complex numbers. Differential Revision: https://reviews.llvm.org/D79159	2020-05-04 14:04:28 +00:00
Wen-Heng (Jack) Chung	bc23c1d85e	[mlir][rocdl] add rocdl.barier op. - Add rocdl.barrier op. - Lower gpu.barier to rocdl.barrier in -convert-gpu-to-rocdl. Differential Revision: https://reviews.llvm.org/D79126	2020-05-04 10:35:01 +02:00
River Riddle	2265009fbe	[mlir][GPUOpsLowering] Add missing include for FormatVariadic	2020-05-01 15:58:20 -07:00
Benjamin Kramer	f9223d47e4	Remove unused variable. NFC.	2020-05-01 14:29:58 +02:00
MaheshRavishankar	43b89ecdb9	[mlir] Add sine operation to Standard dialect. Also add lowering of sine operation to SPIR-V dialect. Differential Revision: https://reviews.llvm.org/D79102	2020-04-30 22:15:42 -07:00
Hanhan Wang	be0ad5b034	[mlir][StandardToSPIRV] Add support for lowering integer casting. Summary: Maps ZeroExtendIOp and TruncateIOp to spirv::UConvertOp and spirv::SConvertOp. Depends On D78974 Differential Revision: https://reviews.llvm.org/D79143	2020-04-30 19:29:31 -07:00
Hanhan Wang	6601b65aed	[mlir][StandardToSPIRV] Emulate bitwidths not supported for load op. Summary: The current implementation in SPIRVTypeConverter just unconditionally turns everything into 32-bit if it doesn't meet the requirements of extensions or capabilities. In this case, we can load a 32-bit value and then do bit extraction to get the value. Differential Revision: https://reviews.llvm.org/D78974	2020-04-30 19:27:45 -07:00
Wen-Heng (Jack) Chung	9ad5e57316	[mlir][nvvm][rocdl] refactor NVVM and ROCDL dialect. NFC. - Extract common logic between -convert-gpu-to-nvvm and -convert-gpu-to-rocdl. - Cope with the fact that alloca operates on different addrspaces between NVVM and ROCDL. - Modernize unit tests for ROCDL dialect. Differential Revision: https://reviews.llvm.org/D79021	2020-05-01 00:13:26 +02:00
Nicolas Vasilache	0d61dcf606	[mlir][EDSC] Make use of InsertGuard Summary: This revision cleans up a layer of complexity in ScopedContext and uses InsertGuard instead of previously manual bookkeeping. The method `getBuilder` is renamed to `getBuilderRef` and spurious copies of OpBuilder are tracked. This results in some canonicalizations not happening anymore in the Linalg matmul to vector test. This test is retired because relying on DRRs for this has been shaky at best. The solution will be better support to write fused passes in C++ with more idiomatic pattern composition and application. Differential Revision: https://reviews.llvm.org/D79208	2020-04-30 18:04:31 -04:00
aartbik	6937251f01	[mlir] [VectorOps] Included i1 support for vector.print Summary: Added boolean support to vector.print. Useful for upcoming "mask" tests. Reviewers: ftynse, nicolasvasilache, andydavis1 Reviewed By: andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79198	2020-04-30 14:56:26 -07:00
Nicolas Vasilache	7a80139059	[mlir][Vector] Provide progressive lowering of masked n-D vector transfers This revision allows masked vector transfers with m-D buffers and n-D vectors to progressively lower to m-D buffer and 1-D vector transfers. For a vector.transfer_read, assuming a `memref<(leading_dims) x (major_dims) x (minor_dims) x type>` and a `vector<(minor_dims) x type>` are involved in the transfer, this generates pseudo-IR resembling: ``` if (any_of(%ivs_major + %offsets, <, major_dims)) { %v = vector_transfer_read( {%offsets_leading, %ivs_major + %offsets_major, %offsets_minor}, %ivs_minor): memref<(leading_dims) x (major_dims) x (minor_dims) x type>, vector<(minor_dims) x type>; } else { %v = splat(vector<(minor_dims) x type>, %fill) } ``` Differential Revision: https://reviews.llvm.org/D79062	2020-04-29 21:28:27 -04:00
MaheshRavishankar	1c12a95d9c	[mlir][StandardToSPIRV] Handle conversion of cmpi operation with i1 type operands. The instructions used to convert std.cmpi cannot have i1 types according to SPIR-V specification. A different set of operations are specified in the SPIR-V spec for comparing boolean types. Enhance the StandardToSPIRV lowering to target these instructions when operands to std.cmpi operation are of i1 type. Differential Revision: https://reviews.llvm.org/D79049	2020-04-29 10:09:03 -07:00
Wen-Heng (Jack) Chung	f2b505a459	[mlir][std] allow subview take memrefs from non-zero addrspaces. On certain targets std.subview should be able to take memrefs from non-zero addrspaces. Improve lowering logic to llvm dialect and amend the tests. Differential Revision: https://reviews.llvm.org/D79024	2020-04-29 17:19:27 +02:00
Wen-Heng (Jack) Chung	be16075bfc	[mlir][vector] let transfer_read and transfer_write take non-zero addrspace. Enhance lowering logic and tests so vector.transfer_read and vector.transfer_write take memrefs on non-zero addrspaces. Differential Revision: https://reviews.llvm.org/D79023	2020-04-29 17:11:48 +02:00
Nicolas Vasilache	0c02106058	[mlir][EDSC] Retire OperationHandle OperationHandle mostly existed to mirror the behavior of ValueHandle. This has become unnecessary and can be retired. Differential Revision: https://reviews.llvm.org/D78692	2020-04-29 00:32:44 -04:00
Alex Zinenko	bb1d976feb	[mlir][flang] use OpBuilder& instead of Builder* in <Op>::build methods As we start defining more complex Ops, we increasingly see the need for Ops-with-regions to be able to construct Ops within their regions in their ::build methods. However, these methods only have access to Builder, and not OpBuilder. Creating a local instance of OpBuilder inside ::build and using it fails to trigger the operation creation hooks in derived builders (e.g., ConversionPatternRewriter). In this case, we risk breaking the logic of the derived builder. At the same time, OpBuilder::create, which is by far the largest user of ::build already passes "this" as the first argument, so an OpBuilder instance is already available. Update all ::build methods in all Ops in MLIR and Flang to take "OpBuilder &" instead of "Builder *". Note the change from pointer and to reference to comply with the common style in MLIR, this also ensures all other users must change their ::build methods. Differential Revision: https://reviews.llvm.org/D78713	2020-04-28 10:42:08 +02:00
Nicolas Vasilache	b2c79c50ed	[mlir][VectorOps] Extend VectorTransfer lowering to n-D memref with minor identity map Summary: This revision extends the lowering of vector transfers to work with n-D memref and 1-D vector where the permutation map is an identity on the most minor dimensions (1 for now). Differential Revision: https://reviews.llvm.org/D78925	2020-04-27 11:20:55 -04:00
Alexander Belyaev	f31db760b3	[MLIR] Replace splitBlock() with createBlock in GenericAtomicRMWOp lowering. `addArgument()` is not undoable and should not be used in ConversionPattern, therefore replacing `splitBlock()` with `createBlock()`, that creates a block with specified args. Differential Revision: https://reviews.llvm.org/D78731	2020-04-25 17:42:45 +02:00
Nicolas Vasilache	367229e100	[mlir][EDSC] Retire ValueHandle The EDSC discussion [thread](https://llvm.discourse.group/t/evolving-builder-apis-based-on-lessons-learned-from-edsc/879) points out that ValueHandle has become an unnecessary level of abstraction since MLIR switch from `Value *` to `Value` everywhere. This revision removes this level of indirection.	2020-04-23 11:01:16 -04:00
Alexander Belyaev	21caba599e	[MLIR] Lower GenericAtomicRMWOp to llvm.cmpxchg. Summary: Lowering is pretty much a copy of AtomicRMWOp -> llvm.cmpxchg pattern. Differential Revision: https://reviews.llvm.org/D78647	2020-04-23 09:29:34 +02:00
Denis Khalikov	1009177d49	[mlir][vulkan-runner] Add support for integer types. Summary: Add support for memrefs with element type as integer type and simple test. Differential Revision: https://reviews.llvm.org/D78560	2020-04-22 19:42:39 +03:00
Frederik Gossen	0372db05bb	[MLIR] Use nested symbol to identify kernel in `LaunchFuncOp`. Summary: Use a nested symbol to identify the kernel to be invoked by a `LaunchFuncOp` in the GPU dialect. This replaces the two attributes that were used to identify the kernel module and the kernel within seperately. Differential Revision: https://reviews.llvm.org/D78551	2020-04-22 07:44:29 +00:00
Stephan Herhut	c22876b550	[MLIR] Add extra locking during cubin generation. We also need to lock the LLVMDialect mutex when initializing LLVM targets or destroying llvm modules concurrently. Added another scoped lock to that effect. Differential Revision: https://reviews.llvm.org/D78580	2020-04-22 08:57:45 +02:00
Denis Khalikov	a48f0a3c7e	[mlir][vulkan-runner] Simplify vulkan launch call op. Summary: Workgroup size is written into the kernel. So to properly modelling vulkan launch, we have to skip local workgroup size for vulkan launch call op. Differential Revision: https://reviews.llvm.org/D78307	2020-04-18 16:49:47 +03:00
aartbik	186709c6e0	[mlir] [VectorOps] Progressive lowering of vector.broadcast Summary: Rather than having a full, recursive, lowering of vector.broadcast to LLVM IR, it is much more elegant to have a progressive lowering of each vector.broadcast into a lower dimensional vector.broadcast, until only elementary vector operations remain. This results in more elegant, step-wise code, that is easier to understand. Also makes some optimizations in the generated code. Reviewers: nicolasvasilache, mehdi_amini, andydavis1, grosul1 Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78071	2020-04-16 21:02:27 -07:00
Stephen Neuendorffer	314f00a034	[MLIR][cmake] Remove redundant add_dependencies() Libraries declared as target_link_libraries() do not also need to be declared as dependencies using add_dependencies(). Differential Revision: https://reviews.llvm.org/D78320	2020-04-16 14:41:54 -07:00
Stephen Neuendorffer	f061295732	[MLIR] Complete refactoring of Affine dialect into sub-libraries. There were some unused CMakeFiles for Affine/IR and Affine/EDSC. This change builds separate MLIRAffineOps and MLIRAffineEDSC libraries using those CMakeFiles. This combination replaces the old MLIRAffine library. Differential Revision: https://reviews.llvm.org/D78317	2020-04-16 13:41:17 -07:00
Stephan Herhut	69040d5b0b	[MLIR] Allow for multiple gpu modules during translation. This change makes the ModuleTranslation threadsafe by locking on the LLVMContext. Furthermore, we now clone the llvm module into a new context when compiling to PTX similar to what the OrcJit does. Differential Revision: https://reviews.llvm.org/D78207	2020-04-16 14:18:31 +02:00
Jeremy Bruestle	9f3ab92ec8	[MLIR] Improve support for 0-dimensional Affine Maps. Summary: Modified AffineMap::get to remove support for the overload which allowed an ArrayRef of AffineExpr but no context (and gathered the context from a presumed first entry, resulting in bugs when there were 0 results). Instead, we support only a ArrayRef and a context, and a version which takes a single AffineExpr. Additionally, removed some now needless case logic which previously special cased which call to AffineMap::get to use. Reviewers: flaub, bondhugula, rriddle!, nicolasvasilache, ftynse, ulysseB, mravishankar, antiagainst, aartbik Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78226	2020-04-15 14:15:02 -07:00
River Riddle	ebf190fcda	[llvm][ADT] Move TypeSwitch class from MLIR to LLVM This class implements a switch-like dispatch statement for a value of 'T' using dyn_cast functionality. Each `Case<T>` takes a callable to be invoked if the root value isa<T>, the callable is invoked with the result of dyn_cast<T>() as a parameter. Differential Revision: https://reviews.llvm.org/D78070	2020-04-14 15:14:41 -07:00
Tres Popp	58516718fc	[MLIR] Constant fold multiplies in deriveStaticUpperBound. Summary: This operation occurs during collapseParallelLoops, so we constant fold them also to allow more situations of determining a loop invariant upper bound when lowering to the GPU dialect from the Loop dialect. Differential Revision: https://reviews.llvm.org/D77723	2020-04-14 13:04:16 +02:00
Christopher Tetreault	eab73dfed9	[SVE] Change return type of getNumElements to unsigned Reviewers: efriedma, sdesmalen, craig.topper, dexonsmith Reviewed By: efriedma, sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77763	2020-04-13 16:24:18 -07:00
River Riddle	d3588d0814	[mlir][NFC] Replace mlir/Support/Functional.h with llvm equivalents. Summary: Functional.h contains many different methods that have a direct, and more efficient, equivalent in LLVM. This revision replaces all usages with the LLVM equivalent, and removes the header. This is part of larger cleanup, pr45513, merging MLIR support facilities into LLVM. Differential Revision: https://reviews.llvm.org/D78053	2020-04-13 14:22:12 -07:00
Lei Zhang	a9cb529a84	[mlir][spirv] NFC: use Optional to replace SPV_Optional Differential Revision: https://reviews.llvm.org/D78046	2020-04-13 15:44:06 -04:00
Chris Lattner	74e6a5b2a3	Eliminate all uses of Identifier::is() in the source tree, this doesn't remove the definition of it (yet). NFC. Reviewers: mravishankar, antiagainst, herhut, rriddle! Subscribers: jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78042	2020-04-13 11:49:31 -07:00
Christopher Tetreault	92dde8a657	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: rriddle, efriedma, sdesmalen Reviewed By: sdesmalen Subscribers: frgossen, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77258	2020-04-10 13:46:18 -07:00
Uday Bondhugula	a5b9316b24	[MLIR][NFC] applyPatternsGreedily -> applyPatternsAndFoldGreedily Rename mlir::applyPatternsGreedily -> applyPatternsAndFoldGreedily. The new name is a more accurate description of the method - it performs both, application of the specified patterns and folding of all ops in the op's region irrespective of whether any patterns have been supplied. Differential Revision: https://reviews.llvm.org/D77478	2020-04-10 12:55:21 +05:30
Nicolas Vasilache	8345b86d9a	[mlir][Vector] Add lowering of 1-D vector transfer_read/write to masked load/store Summary: This revision adds support to lower 1-D vector transfers to LLVM. A mask of the vector length is created that compares the base offset + linear index to the dim of the vector. In each position where this does not overflow (i.e. offset + vector index < dim), the mask is set to 1. A notable fact is that the lowering uses llvm.dialect_cast to allow writing code in the simplest form by targeting the simplest mix of vector and LLVM dialects and letting other conversions kick in. Differential Revision: https://reviews.llvm.org/D77703	2020-04-09 16:17:05 -04:00
River Riddle	bd1ccfe6df	[mlir] Add a new RewritePattern::hasBoundedRewriteRecursion hook. Summary: Some pattern rewriters, like dialect conversion, prohibit the unbounded recursion(or reapplication) of patterns on generated IR. Most patterns are not written with recursive application in mind, so will generally explode the stack if uncaught. This revision adds a hook to RewritePattern, `hasBoundedRewriteRecursion`, to signal that the pattern can safely be applied to the generated IR of a previous application of the same pattern. This allows for establishing a contract between the pattern and rewriter that the pattern knows and can handle the potential recursive application. Differential Revision: https://reviews.llvm.org/D77782	2020-04-09 12:42:28 -07:00
Uday Bondhugula	d314b7d5ca	[MLIR] ShapedType accessor minor fixes + add isDynamicDim accessor Minor fixes and cleanup for ShapedType accessors, use ShapedType::kDynamicSize, add ShapedType::isDynamicDim. Differential Revision: https://reviews.llvm.org/D77710	2020-04-09 08:47:50 +05:30
River Riddle	400ad6f95d	[mlir] Eliminate the remaining usages of cl::opt instead of PassOption. Summary: Pass options are a better choice for various reasons and avoid the need for static constructors. Differential Revision: https://reviews.llvm.org/D77707	2020-04-08 13:05:08 -07:00
Uday Bondhugula	3156b5422e	[MLIR] Fix more gcc-5 build issues from D77528 820c420d4e1c630b5ead285917c6ecdd2f5092ad did not really fix all build issues by D77528. This gets rid of two unnecessary 'using' declarations. Differential Revision: https://reviews.llvm.org/D77726	2020-04-08 19:21:50 +05:30
Uday Bondhugula	a59008a3a5	[MLIR] Fix gcc-5 build failure cause by D77528 Fix gcc-5 build failure cause by D77528 Differential Revision: https://reviews.llvm.org/D77719	2020-04-08 17:27:12 +05:30
Uday Bondhugula	01d97a3549	[MLIR] Add support to use aligned_alloc to lower AllocOp from std to llvm Support to recognize and deal with aligned_alloc was recently added to LLVM's TLI/MemoryBuiltins and its various optimization passes. This revision adds support for generation of aligned_alloc's when lowering AllocOp from std to LLVM. Setting 'use-aligned_alloc=1' will lead to aligned_alloc being used for all heap allocations. An alignment and size that works with the constraints of aligned_alloc is chosen. Using aligned_alloc is preferable to "using malloc and adjusting the allocated pointer to align for indexing" because the pointer access arithmetic done for the latter only makes it harder for LLVM passes to deal with for analysis, optimization, attribute deduction, and rewrites. Differential Revision: https://reviews.llvm.org/D77528	2020-04-08 15:10:19 +05:30
River Riddle	1834ad4a69	[mlir][Pass] Update the PassGen to generate base classes instead of utilities Summary: This is much cleaner, and fits the same structure as many other tablegen backends. This was not done originally as the CRTP in the pass classes made it overly verbose/complex. Differential Revision: https://reviews.llvm.org/D77367	2020-04-07 14:08:52 -07:00
River Riddle	80aca1eaf7	[mlir][Pass] Remove the use of CRTP from the Pass classes This revision removes all of the CRTP from the pass hierarchy in preparation for using the tablegen backend instead. This creates a much cleaner interface in the C++ code, and naturally fits with the rest of the infrastructure. A new utility class, PassWrapper, is added to replicate the existing behavior for passes not suitable for using the tablegen backend. Differential Revision: https://reviews.llvm.org/D77350	2020-04-07 14:08:52 -07:00
River Riddle	722f909f7a	[mlir][Pass][NFC] Replace usages of ModulePass with OperationPass<ModuleOp> ModulePass doesn't provide any special utilities and thus doesn't give enough benefit to warrant a special pass class. This revision replaces all usages with the more general OperationPass. Differential Revision: https://reviews.llvm.org/D77339	2020-04-07 14:08:52 -07:00
Uday Bondhugula	7023f4b4cb	[MLIR] Introduce std.alloca op Introduce the alloca op for stack memory allocation. When converting to the LLVM dialect, this is lowered to an llvm.alloca. Refactor the std to llvm conversion for alloc op to reuse with alloca. Drop useAlloca option with alloc op lowering. Differential Revision: https://reviews.llvm.org/D76602	2020-04-07 15:45:07 +05:30
Nicolas Vasilache	8f229989d5	[mlir][Linalg] Add a linalg.tensor_reshape to operate on tensors Summary: This revision adds a tensor_reshape operation that operates on tensors. In the tensor world the constraints are less stringent and we can allow more arbitrary dynamic reshapes, as long as they are contractions. The expansion of a dynamic dimension into multiple dynamic dimensions is under-specified and is punted on for now. Differential Revision: https://reviews.llvm.org/D77360	2020-04-06 11:19:17 -04:00
Alexander Belyaev	51e3709c2b	[MLIR] Don't insert YieldOp for non-void loop.for by default. The ForOp::build ensures that there is a block terminator which is great for the default use case when there are no iter_args and loop.for returns no results. In non-zero results case we always need to call replaceOpWithNewOp which is not the nicest thing in the world. We can stop inserting YieldOp when iter_args is non-empty. IfOp::build already behaves similarly.	2020-04-05 11:48:22 +02:00
Kazuaki Ishizaki	5aacce3db2	[mlir] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77473	2020-04-05 11:30:30 +09:00
Alex Zinenko	340e1b2077	[mlir] LoopToStandard conversion: support "if/else" with results Summary: A recent extension allowed the `loop.if` operation to return results yielded by its regions. However, such operations could not be lowered to a CFG of standard operations because it would have required to modify the argument list of a block, which is not allowed in a conversion pattern. Now that the conversion infrastructure supports block creation, use it to create a block with an argument list that dominates the operations following the `loop.if` and forward the results as arguments of this block. Depends On D77416 Differential Revision: https://reviews.llvm.org/D77418	2020-04-03 23:49:03 +02:00
Nicolas Vasilache	e33a636e26	[mlir][Linalg] Employ finer-grained control of C interface emission Summary: Linalg makes it possible to interface codegen with externally precompiled HPC libraries. The mechanism to allow such interop uses a normalized ABI and the emission of C interface wrappers. The mechanism controlling these C interface emission is too aggressive and makes it very easy to obtained undefined symbols for external function (e.g. the ones coming from libm). This revision uses the newly introduced llvm.emit_c_interface function attribute which allows controlling this behavior at a function granularity. As a consequence LinalgToLLVM does not need to activate the C wrapper emission when adding the StdToLLVM patterns. Differential Revision: https://reviews.llvm.org/D77364	2020-04-03 16:14:53 -04:00
Denis Khalikov	0718e3ae31	[mlir][vulkan-runner] Add support for 3D memrefs. Summary: Add support for 3D memrefs in mlir-vulkan-runner and simple test. Differential Revision: https://reviews.llvm.org/D77157	2020-04-03 15:10:40 +03:00
Nicolas Vasilache	add9f1a5dc	[mlir][LLVM] Finer-grained control for C interface emission C interface emission is controlled by a flag and has coarse granularity. With this coarse control, interfaces are emitted for all external functions. This makes is easy to get undefined symbols. This revision adds support for controlling per-function emission with an "emit_c_interface" attribute.	2020-04-02 13:07:10 -04:00
Alex Zinenko	802bb8b5c2	[mlir] StandardToLLVM conversion: remove dead code This code is unused since `04ed07bc17`, but it was not removed in that commit.	2020-04-02 18:52:40 +02:00
MaheshRavishankar	9b31e595d7	[mlir] Modify GPU to SPIR-V conversion to respect spv.interface_var_abi attributes if it exists already. Differential Revision: https://reviews.llvm.org/D77195	2020-04-01 09:34:49 -07:00
River Riddle	9a277af2d4	[mlir][Pass] Add support for generating pass utilities via tablegen This revision adds support for generating utilities for passes such as options/statistics/etc. that can be inferred from the tablegen definition. This removes additional boilerplate from the pass, and also makes it easier to remove the reliance on the pass registry to provide certain things(e.g. the pass argument). Differential Revision: https://reviews.llvm.org/D76659	2020-04-01 02:10:46 -07:00
River Riddle	3dddd8969f	[mlir][Pass] Move the registration of conversion passes to tablegen This removes the need to statically register conversion passes, and also puts all of the conversions within one centralized file. Differential Revision: https://reviews.llvm.org/D76658	2020-04-01 02:10:46 -07:00
Hanhan Wang	69ddee1d2a	[mlir][Linalg] Introduce linalg.pooling_min/max/sum op. Summary: Performs an N-D pooling operation similarly to the description in the TF documentation: https://www.tensorflow.org/api_docs/python/tf/nn/pool Different from the description, this operation doesn't perform on batch and channel. It only takes tensors of rank `N`. ``` output[x[0], ..., x[N-1]] = REDUCE_{z[0], ..., z[N-1]} input[ x[0] * strides[0] - pad_before[0] + dilation_rate[0]z[0], ... x[N-1]strides[N-1] - pad_before[N-1] + dilation_rate[N-1]*z[N-1] ], ``` The required optional arguments are: - strides: an i64 array specifying the stride (i.e. step) for window loops. - dilations: an i64 array specifying the filter upsampling/input downsampling rate - padding: an i64 array of pairs (low, high) specifying the number of elements to pad along a dimension. If strides or dilations attributes are missing then the default value is one for each of the input dimensions. Similarly, padding values are zero for both low and high in each of the dimensions, if not specified. Differential Revision: https://reviews.llvm.org/D76414	2020-03-31 21:21:54 -07:00
Uday Bondhugula	5f9bf3f656	[MLIR][NFC] Move test/Transforms/lower-affine.mlir -> test/Conversion Move lower-affine.mlir from test/Transforms to test/Conversion/AffineToStandard/. Other related NFC. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77008	2020-03-31 23:34:50 +05:30
Aaron Smith	6dab806712	[mlir] Add exp2 conversion to llvm.intr.exp2	2020-03-29 01:23:08 -07:00
Kazuaki Ishizaki	e5a8512655	[mlir] NFC: fix trivial typo in source files Summary: fix trivial typos in the source files Reviewers: mravishankar, antiagainst, nicolasvasilache, herhut, rriddle, aartbik Reviewed By: antiagainst, rriddle Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, bader, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76876	2020-03-28 10:12:49 +09:00
Stephan Herhut	ac9d742bbe	[MLIR][LLVM] Make index type bitwidth configurable. This change adds a new option to the StandardToLLVM lowering to configure the bitwidth of the index type independently of the target architecture's pointer size. Differential revision: https://reviews.llvm.org/D76353	2020-03-27 12:42:54 +01:00
Denis Khalikov	8f4ab8c7d7	[mlir][vulkan-runner] Add support for 2D memref. Summary: This patch adds support for 2D memref in mlir-vulkan-runner. Differential Revision: https://reviews.llvm.org/D76737	2020-03-27 13:59:17 +03:00
Alex Zinenko	c16c07d4b9	[mlir] StandardToLLVM: use template aliases instead of dummy classes Multiple operation conversions from the Standard dialect to the LLVM dialect are trivial one-to-one conversions that use only the pattern defined in base utility classes such as OneToOneConvertToLLVMPattern and VectorConvertToLLVMPattern. Use template aliases ("using" declarations) instead of creating derived classes without new functionality.	2020-03-27 11:14:44 +01:00
Alex Zinenko	04ed07bc17	[mlir] StandardToLLVM: clean up conversion patterns for vector operations Summary: Provide a public VectorConvertToLLVMPattern utility class to implement conversions with automatic unrolling of operation on multidimensional vectors to lists of operations on single-dimensional vectors when lowering to the LLVM dialect. Drop the template-based check on the number of operands since the actual implementation does not depend on the operand number anymore. This check only creates spurious concepts (UnaryOpLowering, BinaryOpLowering, etc). Differential Revision: https://reviews.llvm.org/D76865	2020-03-26 18:24:10 +01:00
Alex Zinenko	987fbae0ad	[mlir] StandardToLLVM: make one-to-one convresion pattern publicly available Summary: The Standard-to-LLVM dialect convresion has a set of utility classes that simplify conversions, including patterns that provide one-to-one conversion operation conversion with optional result packing. Expose these classes in a public header so that conversions other than Standard-to-LLVM (e.g. vectors, or LLVM-based intrinsics) could also use them. Since the patterns are implemented as class templates and in order to keep the code size limited, keep the implementation private by resorting to op identifiers instead of template-based builders. Differential Revision: https://reviews.llvm.org/D76864	2020-03-26 18:24:07 +01:00
Marcel Koester	2b529a396d	[mlir] Removed TanHOp lowering from ConvertStandardToLLVM since there is no reasonable TanH representation in LLVM. Summary: The current ConvertStandardToLLVM phase lowers the standard TanHOp to function calls to external tanh symbols. However, this leads to misunderstandings since these external symbols are not defined anywhere. This commit removes the TanHOp lowering functionality from ConvertStandardToLLVM, adapts the LowerGpuOpsToNVVMOps and LowerGpuOpsToROCDLOps passes and adjusts the affected test cases. Reviewers: mravishankar, herhut Subscribers: jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75509	2020-03-25 16:43:45 +01:00
MaheshRavishankar	46bb6613a3	[mlir][GPU] Use StructAttr to drive lowering from loop.parallel to gpu.launch Current implementation of lowering from loop.parallel to gpu.launch uses a DictionaryAttr to specify the mapping. Moving this attribute to be auto-generated from specification as a StructAttr. This simplifies a lot the logic of looking up and creating this attribute. Differential Revision: https://reviews.llvm.org/D76165	2020-03-24 16:16:55 -07:00
Hanhan Wang	58cdb8bff0	[mlir][StandardToSPIRV] Add support for lowering unary ops Differential Revision: https://reviews.llvm.org/D76661	2020-03-24 09:16:10 -04:00
Benjamin Kramer	b6732056a4	Make helpers static. NFC.	2020-03-24 13:43:00 +01:00
Uday Bondhugula	56e1c20bfd	[MLIR][NFC] rename ConvertStandardToLLVM, ConvertLoopToStandard to drop Convert prefix This is in line with the convention agreed on https://llvm.discourse.group/t/rfc-canonical-file-paths-to-dialects/621 Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76583	2020-03-23 08:24:11 +05:30
River Riddle	e8f5c072f6	[mlir] Move the testing pass for GpuKernelToCubin to the test/ directory Summary: This removes the static pass registration, and also cleans up some lingering technical debt. Differential Revision: https://reviews.llvm.org/D76554	2020-03-22 03:38:09 -07:00
River Riddle	e9482ed194	[mlir] Move several static cl::opts to be pass options instead. This removes the reliance on global options, and also simplifies the pass registration. Differential Revision: https://reviews.llvm.org/D76552	2020-03-22 03:16:21 -07:00
Rob Suderman	e708471395	[mlir][NFC] Cleanup AffineOps directory structure Summary: Change AffineOps Dialect structure to better group both IR and Tranforms. This included extracting transforms directly related to AffineOps. Also move AffineOps to Affine. Differential Revision: https://reviews.llvm.org/D76161	2020-03-20 14:23:43 -07:00
Nicolas Vasilache	462db62053	[mlir][AVX512] Start a primitive AVX512 dialect The Vector Dialect [document](https://mlir.llvm.org/docs/Dialects/Vector/) discusses the vector abstractions that MLIR supports and the various tradeoffs involved. One of the layer that is missing in OSS atm is the Hardware Vector Ops (HWV) level. This revision proposes an AVX512-specific to add a new Dialect/Targets/AVX512 Dialect that would directly target AVX512-specific intrinsics. Atm, we rely too much on LLVM’s peephole optimizer to do a good job from small insertelement/extractelement/shufflevector. In the future, when possible, generic abstractions such as VP intrinsics should be preferred. The revision will allow trading off HW-specific vs generic abstractions in MLIR. Differential Revision: https://reviews.llvm.org/D75987	2020-03-20 14:11:57 -04:00
River Riddle	2c1ba63ede	[mlir] Change missed usage PatternMatchResult to LogicalResult	2020-03-18 21:22:24 -07:00
Rob Suderman	cd1212deff	[mlir] Introduced CallOp Dialect Conversion Summary: Utility to perform CallOp Dialect conversion, specifically handling cases where an argument type has changed and the corresponding CallOp needs to be updated. Differential Revision: https://reviews.llvm.org/D76326	2020-03-18 20:07:38 -07:00
Lei Zhang	73431a492b	[mlir][spirv] Consolidate std.constant to spv.constant conversions This commit merges the DRR pattern for std.constant to spv.constant conversion into the C++ OpConversionPattern. This allows us to have remove the DRR pattern file. Along the way, this commit enhanced std.constant to spv.constant conversion to consider type conversions, which means converting the underlying attributes if necessary. Differential Revision: https://reviews.llvm.org/D76246	2020-03-18 20:11:05 -04:00
Lei Zhang	ffd4583c6a	[mlir][spirv] Change standard op patterns to consider type conversion Previously we have a few patterns that were written with DRR. DRR at the moment does not work nicely with dialect conversion framework. It generates normal RewritePatterns, while the dialect conversion framework requires ConversionPatterns to take into consideration the type conversion. So this commit starts to change existing DRR patterns for standard ops to OpConversionPattern to incorporate the SPIR-V type conversion. All patterns are converted except the one for constant ops, which will happen in a subsequent commit. Differential Revision: https://reviews.llvm.org/D76245	2020-03-18 20:11:05 -04:00
Lei Zhang	58df5e6d9a	[mlir][spirv] Plumbing target environment into type converter This commit unifies target environment queries into a new wrapper class spirv::TargetEnv and shares across various places needing the functionality. We still create multiple instances of TargetEnv though given the parent components (type converters, passes, conversion targets) have different lifetimes. In the meantime, LowerABIAttributesPass is updated to take into consideration the target environment, which requires updates to tests to provide that. Differential Revision: https://reviews.llvm.org/D76242	2020-03-18 20:11:05 -04:00
Lei Zhang	3b35f9d8b5	[mlir][spirv] Use memref memory space for storage class Previously in SPIRVTypeConverter, we always convert memref types to StorageBuffer regardless of their memory spaces. This commit fixes that to let the conversion to look into memory space properly. For this purpose, a mapping between SPIR-V storage class and memref memory space is introduced. The mapping is arbitary decided at the moment and the hope is that we can leverage string memory space later to be more clear. Now spv.interface_var_abi cannot contain storage class unless it's attached to a scalar value, where we need the storage class as side channel information. Verifications and tests are properly adjusted. Differential Revision: https://reviews.llvm.org/D76241	2020-03-18 20:11:04 -04:00
River Riddle	3145427dd7	[mlir][NFC] Replace all usages of PatternMatchResult with LogicalResult This also replaces usages of matchSuccess/matchFailure with success/failure respectively. Differential Revision: https://reviews.llvm.org/D76313	2020-03-17 20:21:32 -07:00
Nicolas Vasilache	2fae7878d5	[mlir][Vector] Mostly-NFC - Restructure options for lowering to LLVM Matrix Intrinsics Summary: This revision restructures the calling of vector transforms to make it more flexible to ask for lowering through LLVM matrix intrinsics. This also makes sure we bail out in degenerate cases (i.e. 1) in which LLVM complains about not being able to scalarize. Differential Revision: https://reviews.llvm.org/D76266	2020-03-17 22:58:02 -04:00
Rob Suderman	4d60f47b08	[mlir][NFC] Renamed VectorOps to Vector Summary: Renamed VectorOps to Vector to avoid the redundant Ops suffix. Differential Revision: https://reviews.llvm.org/D76317	2020-03-17 15:28:08 -07:00
Alex Zinenko	e119980f3f	[mlir] LLVM dialect: move ensureDistinctSuccessors out of std->LLVM conversion MLIR supports terminators that have the same successor block with different block operands, which cannot be expressed in the LLVM's phi-notation as the block identifier is used to tell apart the predecessors. This limitation can be worked around by branching to a new block instead, with this new block unconditionally branching to the original successor and forwarding the argument. Until now, this transformation was performed during the conversion from the Standard to the LLVM dialect. This does not scale well to multiple dialects targeting the LLVM dialect as all of them would have to be aware of this limitation and perform the preparatory transformation. Instead, do it as a separate pass and run it immediately before the translation. Differential Revision: https://reviews.llvm.org/D75619	2020-03-17 15:22:14 +01:00
Denis Khalikov	bfb2ce0256	[mlir][vulkan-runner] Use C-compatible wrapper emission. A memref argument is converted into a pointer-to-struct argument of type `{T, T, i64, i64[N], i64[N]}*` in the wrapper function, where T is the converted element type and N is the memref rank. Differential Revision: https://reviews.llvm.org/D76059	2020-03-17 07:54:41 -04:00
River Riddle	bd5941b9ce	[mlir] Remove the PatternState class and simplify PatternMatchResult. Summary: PatternState was a mechanism to pass state between the match and rewrite calls of a RewritePattern. With the rise of matchAndRewrite, this class is unused and unnecessary. This revision removes PatternState and simplifies PatternMatchResult to just be a LogicalResult. A future revision will replace all usages of PatternMatchResult/matchSuccess/matchFailure with LogicalResult equivalents. Differential Revision: https://reviews.llvm.org/D76202	2020-03-16 17:55:54 -07:00
Nicolas Vasilache	bbf3ef8541	[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply Summary: This revision adds lowering of vector.contract to llvm.intr.matrix_multiply. Note that there is currently a mismatch between the MLIR vector dialect which expects row-major layout and the LLVM matrix intrinsics which expect column major layout. As a consequence, we currently only match a vector.contract with indexing maps that express column-major matrix multiplication. Other cases would require additional transposes and it is better to wait for LLVM intrinsics to provide a per-operation attribute that would specify which layout is expected. A separate integration test, not submitted to MLIR core, has independently verified that correct execution occurs on a 2x2x2 matrix multiplication. Differential Revision: https://reviews.llvm.org/D76014	2020-03-13 16:33:23 -04:00
aartbik	a213ece30b	[mlir] [VectorOps,LinAlg] Remove direct LLVM lowering for vector.broadcast Summary: The direct lowering of vector.broadcast into LLVM has been replaced by progressive lowering into elementary vector ops. This also required a small refactoring of a llvm.mlir test that used a direct vector.broadcast operator (just to define a matmul). Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76143	2020-03-13 11:42:51 -07:00
Lei Zhang	3148f10b17	[mlir][spirv] Use spv.vce in spv.module and wire up (de)serialization This commits changes the definition of spv.module to use the #spv.vce attribute for specifying (version, capabilities, extensions) triple so that we can have better API and custom assembly form. Since now we have proper modelling of the triple, (de)serialization is wired up to use them. With the new UpdateVCEPass, we don't need to manually specify the required extensions and capabilities anymore when creating a spv.module. One just need to call UpdateVCEPass before serialization to get the needed version/extensions/capabilities. Differential Revision: https://reviews.llvm.org/D75872	2020-03-12 19:37:45 -04:00
River Riddle	0ddba0bd59	[mlir][SideEffects] Replace HasNoSideEffect with the memory effect interfaces. HasNoSideEffect can now be implemented using the MemoryEffectInterface, removing the need to check multiple things for the same information. This also removes an easy foot-gun for users as 'Operation::hasNoSideEffect' would ignore operations that dynamically, or recursively, have no side effects. This also leads to an immediate improvement in some of the existing users, such as DCE, now that they have access to more information. Differential Revision: https://reviews.llvm.org/D76036	2020-03-12 14:26:15 -07:00
aartbik	078776a679	[mlir] [VectorOps] Progressively lower vector.outerproduct to LLVM Summary: This replaces the direct lowering of vector.outerproduct to LLVM with progressive lowering into elementary vectors ops to avoid having the similar lowering logic at several places. NOTE1: with the new progressive rule, the lowered llvm is slightly more elaborate than with the direct lowering, but the generated assembly is just as optimized; still if we want to stay closer to the original, we should add a "broadcast on extract" to shuffle rewrite (rather than special cases all the lowering steps) NOTE2: the original outerproduct lowering code should now be removed but some linalg test work directly on vector and contain some dead code, so this requires another CL Reviewers: nicolasvasilache, andydavis1 Reviewed By: nicolasvasilache, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75956	2020-03-12 13:45:42 -07:00
Christian Sigg	fc421d7ca3	[MLIR] Remove all-reduce lowering from GPU to NVVM. Use in-dialect lowering instead. Reviewers: herhut, mravishankar Reviewed By: herhut Subscribers: merge_guards_bot, jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73794	2020-03-11 15:17:54 +01:00
Valentin Clement	c7380995f8	[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering Summary: This patch add some builtin operation for the gpu.all_reduce ops. - for Integer only: `and`, `or`, `xor` - for Float and Integer: `min`, `max` This is useful for higher level dialect like OpenACC or OpenMP that can lower to the GPU dialect. Differential Revision: https://reviews.llvm.org/D75766	2020-03-11 14:07:04 +01:00
Stephan Herhut	f6790a1c63	Revert "[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering" Attribution to original author got lost.	2020-03-11 14:07:04 +01:00
Lei Zhang	5b0c60c58e	[mlir][vulkan-runner] Use std::make_tuple to create tuple	2020-03-10 16:21:35 -04:00
Stephan Herhut	2eff566b07	[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering Summary: This patch add some builtin operation for the gpu.all_reduce ops. - for Integer only: `and`, `or`, `xor` - for Float and Integer: `min`, `max` This is useful for higher level dialect like OpenACC or OpenMP that can lower to the GPU dialect. Differential Revision: https://reviews.llvm.org/D75766	2020-03-10 21:09:06 +01:00
Denis Khalikov	1090a83069	[mlir][vulkan-runner] Update mlir-vulkan-runner execution driver. * Adds GpuLaunchFuncToVulkanLaunchFunc conversion pass. * Moves a serialization of the `spirv::Module` from LaunchFuncToVulkanCalls pass to newly created pass. * Updates LaunchFuncToVulkanCalls instrumentation pass, adds `initVulkan` and `deinitVulkan` runtime calls. * Adds `bindResource` call to bind specifc resource by the given descriptor set and descriptor binding. * Eliminates static construction and desctruction of `VulkanRuntimeManager`. Differential Revision: https://reviews.llvm.org/D75192	2020-03-10 15:58:31 -04:00
Mehdi Amini	f80c6d8dec	Fix MLIR build when NVPTX backend is not configured in The GPUToCUDA conversion needs to conditionally link it in.	2020-03-10 04:11:49 +00:00
Nicolas Vasilache	63b683a816	[mlir][Vector] Add a vector.matrix_multiply op on 1-D vectors Summary: This op mirrors the llvm.intr counterpart and allows lowering + type conversions in a progressive fashion. Differential Revision: https://reviews.llvm.org/D75775	2020-03-09 13:34:03 -04:00
Stephen Neuendorffer	9f979d7ad5	[MLIR] Fixes for BUILD_SHARED_LIBS=on Differential Revision: https://reviews.llvm.org/D75308	2020-03-06 13:25:18 -08:00
Stephen Neuendorffer	4594d0e943	[MLIR] Move from add_dependencies() to DEPENDS add_llvm_library and add_llvm_executable may need to create new targets with appropriate dependencies. As a result, it is not sufficient in some configurations (namely LLVM_BUILD_LLVM_DYLIB=on) to only call add_dependencies(). Instead, the explicit TableGen dependencies must be passed to add_llvm_library() or add_llvm_executable() using the DEPENDS keyword. Differential Revision: https://reviews.llvm.org/D74930	2020-03-06 13:25:17 -08:00
Stephen Neuendorffer	2488016bae	[MLIR] Remove redundant library dependencies In cmake, it is redundant to have a target list under target_link_libraries() and add_dependency(). This patch removes the redundant dependency from add_dependency(). Differential Revision: https://reviews.llvm.org/D74929	2020-03-06 10:12:31 -08:00
Stephen Neuendorffer	1c82dd39f9	[MLIR] Ensure that target_link_libraries() always has a keyword. CMake allows calling target_link_libraries() without a keyword, but this usage is not preferred when also called with a keyword, and has surprising behavior. This patch explicitly specifies a keyword when using target_link_libraries(). Differential Revision: https://reviews.llvm.org/D75725	2020-03-06 09:14:01 -08:00
Adrian Kuegel	86306df7dd	Extract common code to deal with multidimensional vectors. Summary: Also replace dyn_cast_or_null with dyn_cast when possible. Differential Revision: https://reviews.llvm.org/D75733	2020-03-06 13:54:54 +01:00
aartbik	0d924700a6	[mlir] [VectorOps] Merge VectorReduction/VectorReductionV2 into one Op Summary: Paying off some technical debt in VectorOps, where I introduced a special op for a fused accumulator into reduction to avoid some issues around printing and parsing an optional accumulator. This CL merges the two into one op again and does things the right way (still would be nice to have "assemblyFormat" for optional operands though....). Reviewers: nicolasvasilache, andydavis1, ftynse, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75699	2020-03-05 13:07:31 -08:00
River Riddle	cb1777127c	[mlir] Remove successor operands from the Operation class Summary: This revision removes all of the functionality related to successor operands on the core Operation class. This greatly simplifies a lot of handling of operands, as well as successors. For example, DialectConversion no longer needs a special "matchAndRewrite" for branching terminator operations.(Note, the existing method was also broken for operations with variadic successors!!) This also enables terminator operations to define their own relationships with successor arguments, instead of the hardcoded "pass-through" behavior that exists today. Differential Revision: https://reviews.llvm.org/D75318	2020-03-05 12:53:02 -08:00
River Riddle	988249a506	[mlir] Refactor a few users to no longer rely on the successor operand API of Operation. The existing API for successor operands on operations is in the process of being removed. This revision simplifies a later one that completely removes the existing API. Differential Revision: https://reviews.llvm.org/D75316	2020-03-05 12:51:59 -08:00
Frank Laub	cdc5cba721	[MLIR][Affine][NFC] Expose expandAffineMap Summary: Expose expandAffineMap so that it can be used by lowerings defined outside of MLIR core. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D75589	2020-03-04 14:17:17 -08:00
Alex Zinenko	aff6bf4ff8	[mlir] support conversion of parallel reduction loops to std Recently introduced support for converting sequential reduction loops to CFG of basic blocks in the Standard dialect makes it possible to perform a staged conversion of parallel reduction loops into a similar CFG by using sequential loops as an intermediate step. This is already the case for parallel loops without reduction, so extend the pattern to support an additional use case. Differential Revision: https://reviews.llvm.org/D75599	2020-03-04 16:37:17 +01:00
Adrian Kuegel	91acb5b3e1	Add rsqrt op to Standard dialect and lower it to LLVM dialect. Summary: This adds an rsqrt op to the standard dialect, and lowers it as 1 / sqrt to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D75353	2020-03-04 13:13:31 +01:00
Alex Zinenko	8ba8ab8c95	[mlir] support reductions in loop to std conversion Summary: Introduce support for converting loop.for operations with loop-carried values to a CFG in the standard dialect. This is achieved by passing loop-carried values as block arguments to the loop condition block. This block dominates both the loop body and the block immediately following the loop, so the arguments of this block are remain visible there. Differential Revision: https://reviews.llvm.org/D75513	2020-03-03 18:21:13 +01:00
Stephan Herhut	10ec1860a8	[MLIR][GPU] Add error checking to loop.parallel to gpu transform. Summary: Instead of crashing on malformed input, the pass now produces error messages. Differential Revision: https://reviews.llvm.org/D75468	2020-03-03 13:29:09 +01:00
Stephan Herhut	d17428d951	[MLIR][GPU] fix loop trip count computation in LoopsToGPU Summary: Added brackets to fix the loop trip count computation. The brackets ensure the bounds are subtracted before we divide the result by the step of the loop. Differential Revision: https://reviews.llvm.org/D75449	2020-03-02 15:53:33 +01:00
Stephen Neuendorffer	798e661567	Revert "[MLIR] Move from using target_link_libraries to LINK_LIBS for llvm libraries." This reverts commit `7a6c689771`. This breaks the build with cmake 3.13.4, but succeeds with cmake 3.15.3	2020-02-29 11:52:08 -08:00
Stephen Neuendorffer	0810acc7f6	Revert "[MLIR] Remove redundant library dependencies" This reverts commit `c4c8fbde64`.	2020-02-29 11:52:08 -08:00
Stephen Neuendorffer	d675df0379	Revert "[MLIR] Move from add_dependencies() to DEPENDS" This reverts commit `31e07d716a`.	2020-02-29 11:52:08 -08:00
Stephen Neuendorffer	bc991500ac	Revert "[MLIR] Fixes for BUILD_SHARED_LIBS=on" This reverts commit `777e97cc1a`.	2020-02-29 11:09:21 -08:00
Stephen Neuendorffer	777e97cc1a	[MLIR] Fixes for BUILD_SHARED_LIBS=on Differential Revision: https://reviews.llvm.org/D75308	2020-02-29 10:47:28 -08:00
Stephen Neuendorffer	31e07d716a	[MLIR] Move from add_dependencies() to DEPENDS add_llvm_library and add_llvm_executable may need to create new targets with appropriate dependencies. As a result, it is not sufficient in some configurations (namely LLVM_BUILD_LLVM_DYLIB=on) to only call add_dependencies(). Instead, the explicit TableGen dependencies must be passed to add_llvm_library() or add_llvm_executable() using the DEPENDS keyword. Differential Revision: https://reviews.llvm.org/D74930	2020-02-29 10:47:27 -08:00
Stephen Neuendorffer	c4c8fbde64	[MLIR] Remove redundant library dependencies In cmake, it is redundant to have a target list under target_link_libraries() and add_dependency(). This patch removes the redundant dependency from add_dependency(). Differential Revision: https://reviews.llvm.org/D74929	2020-02-29 10:47:27 -08:00
Stephen Neuendorffer	7a6c689771	[MLIR] Move from using target_link_libraries to LINK_LIBS for llvm libraries. When compiling libLLVM.so, add_llvm_library() manipulates the link libraries being used. This means that when using add_llvm_library(), we need to pass the list of libraries to be linked (using the LINK_LIBS keyword) instead of using the standard target_link_libraries call. This is preparation for properly dealing with creating libMLIR.so as well. Differential Revision: https://reviews.llvm.org/D74864	2020-02-29 10:47:26 -08:00
Stephen Neuendorffer	dc1056a3f1	Revert "[MLIR] Move from using target_link_libraries to LINK_LIBS for llvm libraries." This reverts commit `2f265e3528`.	2020-02-28 14:13:30 -08:00
Stephen Neuendorffer	fed2acc7f5	Revert "[MLIR] Remove redundant library dependencies" This reverts commit `e1cb15c8f9`.	2020-02-28 14:06:20 -08:00
Tim Shen	0d65000e11	[MLIR] Add llvm.mlir.cast op for semantic preserving cast between dialect types. Summary: See discussion here: https://llvm.discourse.group/t/rfc-dialect-type-cast-op/538/11 Reviewers: ftynse Subscribers: bixia, sanjoy.google, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Differential Revision: https://reviews.llvm.org/D75141	2020-02-28 12:20:23 -08:00
Tim Shen	2a00ae3984	[MLIR] Add LLVMConversionTarget as a customization point. NFC. This is in preparation for the next patch D75141. The purpose is to provide a single place where LLVM dialect registers its ops as legal/illegal. Reviewers: ftynse, mravishankar, herhut Subscribers: jholewinski, bixia, sanjoy.google, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Differential Revision: https://reviews.llvm.org/D75140	2020-02-28 12:20:23 -08:00
Stephen Neuendorffer	67f2a43cf8	Revert "[MLIR] Move from add_dependencies() to DEPENDS" This reverts commit `8a2b86b2c2`.	2020-02-28 12:17:40 -08:00
Stephen Neuendorffer	29c6721be2	Revert "[MLIR] Fixes for BUILD_SHARED_LIBS=on" This reverts commit `c767dc9394`.	2020-02-28 12:17:39 -08:00
Stephen Neuendorffer	c767dc9394	[MLIR] Fixes for BUILD_SHARED_LIBS=on Differential Revision: https://reviews.llvm.org/D75308	2020-02-28 11:35:19 -08:00
Stephen Neuendorffer	8a2b86b2c2	[MLIR] Move from add_dependencies() to DEPENDS add_llvm_library and add_llvm_executable may need to create new targets with appropriate dependencies. As a result, it is not sufficient in some configurations (namely LLVM_BUILD_LLVM_DYLIB=on) to only call add_dependencies(). Instead, the explicit TableGen dependencies must be passed to add_llvm_library() or add_llvm_executable() using the DEPENDS keyword. Differential Revision: https://reviews.llvm.org/D74930	2020-02-28 11:35:18 -08:00
Stephen Neuendorffer	e1cb15c8f9	[MLIR] Remove redundant library dependencies In cmake, it is redundant to have a target list under target_link_libraries() and add_dependency(). This patch removes the redundant dependency from add_dependency(). Differential Revision: https://reviews.llvm.org/D74929	2020-02-28 11:35:18 -08:00
Stephen Neuendorffer	2f265e3528	[MLIR] Move from using target_link_libraries to LINK_LIBS for llvm libraries. When compiling libLLVM.so, add_llvm_library() manipulates the link libraries being used. This means that when using add_llvm_library(), we need to pass the list of libraries to be linked (using the LINK_LIBS keyword) instead of using the standard target_link_libraries call. This is preparation for properly dealing with creating libMLIR.so as well. Differential Revision: https://reviews.llvm.org/D74864	2020-02-28 11:35:17 -08:00
Stephen Neuendorffer	c07fb9e016	[MLIR] Refactor library handling for conversions. Collect a list of conversion libraries in cmake, so we don't have to list these explicitly in most binaries. Differential Revision: https://reviews.llvm.org/D75222	2020-02-28 11:35:17 -08:00
Adrian Kuegel	39e1c1fa9e	Add GPU lowerings for the different log ops. Summary: This adds GPU lowerings for log, log10 and log2. Reviewers: mravishankar, herhut Subscribers: jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75239	2020-02-27 15:25:02 +01:00
Alex Zinenko	7d91fd23df	[mlir] NFC: update documentation in ConvertLinalgToLLVM The documentation was describing an obsolete version of the transformation.	2020-02-25 15:16:14 +01:00
Stephan Herhut	5e6d724633	[MLIR][GPU] Properly model step in parallel loop to gpu conversion. Summary: The original patch had TODOs to add support for step computations, which this commit addresses. The computations are expressed using affine expressions so that the affine canonicalizers can simplify the full bound and index computations. Also cleans up the code a little and exposes the pass in the header file. Differential Revision: https://reviews.llvm.org/D75052	2020-02-25 14:22:50 +01:00
Stephan Herhut	7a7eacc797	[MLIR][GPU] Implement a simple greedy loop mapper. Summary: The mapper assigns annotations to loop.parallel operations that are compatible with the loop to gpu mapping pass. The outermost loop uses the grid dimensions, followed by block dimensions. All remaining loops are mapped to sequential loops. Differential Revision: https://reviews.llvm.org/D74963	2020-02-25 11:42:42 +01:00
Frank Laub	fe210a1ff2	[MLIR] Add std.atomic_rmw op Summary: The RFC for this op is here: https://llvm.discourse.group/t/rfc-add-std-atomic-rmw-op/489 The std.atmomic_rmw op provides a way to support read-modify-write sequences with data race freedom. It is intended to be used in the lowering of an upcoming affine.atomic_rmw op which can be used for reductions. A lowering to LLVM is provided with 2 paths: - Simple patterns: llvm.atomicrmw - Everything else: llvm.cmpxchg Differential Revision: https://reviews.llvm.org/D74401	2020-02-24 16:54:21 -08:00
Rob Suderman	69d757c0e8	Move StandardOps/Ops.h to StandardOps/IR/Ops.h Summary: NFC - Moved StandardOps/Ops.h to a StandardOps/IR dir to better match surrounding directories. This is to match other dialects, and prepare for moving StandardOps related transforms in out for Transforms and into StandardOps/Transforms. Differential Revision: https://reviews.llvm.org/D74940	2020-02-21 11:58:47 -08:00
Nagy Mostafa	bc7b26c333	[MLIR] Allow Loop dialect IfOp and ForOp to define values This patch implements the RFCs proposed here: https://llvm.discourse.group/t/rfc-modify-ifop-in-loop-dialect-to-yield-values/463 https://llvm.discourse.group/t/rfc-adding-operands-and-results-to-loop-for/459/19. It introduces the following changes: - All Loop Ops region, except for ReduceOp, terminate with a YieldOp. - YieldOp can have variadice operands that is used to return values out of IfOp and ForOp regions. - Change IfOp and ForOp syntax and representation to define values. - Add unit-tests and update .td documentation. - YieldOp is a terminator to loop.for/if/parallel - YieldOp custom parser and printer Lowering is not supported at the moment, and will be in a follow-up PR. Thanks. Reviewed By: bondhugula, nicolasvasilache, rriddle Differential Revision: https://reviews.llvm.org/D74174	2020-02-21 10:05:32 -08:00
Lei Zhang	35b685270b	[mlir] Add a signedness semantics bit to IntegerType Thus far IntegerType has been signless: a value of IntegerType does not have a sign intrinsically and it's up to the specific operation to decide how to interpret those bits. For example, std.addi does two's complement arithmetic, and std.divis/std.diviu treats the first bit as a sign. This design choice was made some time ago when we did't have lots of dialects and dialects were more rigid. Today we have much more extensible infrastructure and different dialect may want different modelling over integer signedness. So while we can say we want signless integers in the standard dialect, we cannot dictate for others. Requiring each dialect to model the signedness semantics with another set of custom types is duplicating the functionality everywhere, considering the fundamental role integer types play. This CL extends the IntegerType with a signedness semantics bit. This gives each dialect an option to opt in signedness semantics if that's what they want and helps code sharing. The parser is modified to recognize `si[1-9][0-9]` and `ui[1-9][0-9]` as signed and unsigned integer types, respectively, leaving the original `i[1-9][0-9]*` to continue to mean no indication over signedness semantics. All existing dialects are not affected (yet) as this is a feature to opt in. More discussions can be found at: https://groups.google.com/a/tensorflow.org/d/msg/mlir/XmkV8HOPWpo/7O4X0Nb_AQAJ Differential Revision: https://reviews.llvm.org/D72533	2020-02-21 09:16:54 -05:00
Denis Khalikov	896ee361a6	[mlir][spirv] Add mlir-vulkan-runner Add an initial version of mlir-vulkan-runner execution driver. A command line utility that executes a MLIR file on the Vulkan by translating MLIR GPU module to SPIR-V and host part to LLVM IR before JIT-compiling and executing the latter. Differential Revision: https://reviews.llvm.org/D72696	2020-02-19 11:37:26 -05:00
Alex Zinenko	d97d409277	[mlir] NFC: use ValueRange for BlockArgument in ConvertStandardToLLVM When the conversion was implemented, ValueRange did not support BlockArguments the code materialized a vector. This is no longer necessary.	2020-02-19 17:26:30 +01:00
Tim Shen	f581e655ec	[MLIR] Add std.assume_alignment op. Reviewers: ftynse, nicolasvasilache, andydavis1 Subscribers: bixia, sanjoy.google, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74378	2020-02-18 17:55:07 -08:00
River Riddle	0d7ff220ed	[mlir] Refactor TypeConverter to add conversions without inheritance Summary: This revision refactors the TypeConverter class to not use inheritance to add type conversions. It instead moves to a registration based system, where conversion callbacks are added to the converter with `addConversion`. This method takes a conversion callback, which must be convertible to any of the following forms(where `T` is a class derived from `Type`: * Optional<Type> (T type) - This form represents a 1-1 type conversion. It should return nullptr or `llvm::None` to signify failure. If `llvm::None` is returned, the converter is allowed to try another conversion function to perform the conversion. * Optional<LogicalResult>(T type, SmallVectorImpl<Type> &results) - This form represents a 1-N type conversion. It should return `failure` or `llvm::None` to signify a failed conversion. If the new set of types is empty, the type is removed and any usages of the existing value are expected to be removed during conversion. If `llvm::None` is returned, the converter is allowed to try another conversion function to perform the conversion. When attempting to convert a type, the TypeConverter walks each of the registered converters starting with the one registered most recently. Differential Revision: https://reviews.llvm.org/D74584	2020-02-18 16:17:48 -08:00
Alex Zinenko	870c1fd4c8	[mlir] NFC: rename LLVMOpLowering to ConvertToLLVMPattern This better reflects the nature of the class and matches the current naming scheme. Differential Revision: https://reviews.llvm.org/D74774	2020-02-18 22:19:58 +01:00
Alex Zinenko	0f04384daf	[mlir] NFC: Rename LLVMOpLowering::lowering to LLVMOpLowering::typeConverter The existing name is an artifact dating back to the times when we did not have a dedicated TypeConverter infrastructure. It is also confusing with with the name of classes using it. Differential revision: https://reviews.llvm.org/D74707	2020-02-18 15:57:10 +01:00
Benjamin Kramer	5fc5c7db38	Strength reduce vectors into arrays. NFCI.	2020-02-17 15:37:35 +01:00
Mehdi Amini	850cb135a3	Do not build the CUBIN conversion pass when NVPTX Backend isn't configured This pass would currently build, but fail to run when this backend isn't linked in. On the other hand, we'd like it to initialize only the NVPTX backend, which isn't possible if we continue to build it without the backend available. Instead of building a broken configuration, let's skip building the pass entirely. Differential Revision: https://reviews.llvm.org/D74592	2020-02-14 09:33:12 +00:00
Alex Zinenko	39cb2a8fc7	[mlir] Fix argument attribute attribute reassignment in ConvertStandardToLLVM The commit switching the calling convention for memrefs (`5a1778057`) inadvertently introduced a bug in the function argument attribute conversion: due to incorrect indexing of function arguments it was not assigning the attributes to the arguments beyond those generated from the first original argument. This was not caught in the commit since the test suite does have a test for converting multi-argument functions with argument attributes. Fix the bug and add relevant tests.	2020-02-14 10:22:33 +01:00
Eric Christopher	f3b933266a	Remove unused lambda argument.	2020-02-13 17:24:55 -08:00
aartbik	b21c799952	[mlir] [VectorOps] Initial framework for progressively lowering vector.contract Summary: This sets the basic framework for lowering vector.contract progressively into simpler vector.contract operations until a direct vector.reduction operation is reached. More details will be filled out progressively as well. Reviewers: nicolasvasilache Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74520	2020-02-13 15:07:57 -08:00
Denis Khalikov	a062a3ed7f	[mlir][spirv] Add ConvertGpuLaunchFuncToVulkanCallsPass Implement a pass to convert gpu.launch_func op into a sequence of Vulkan runtime calls. The Vulkan runtime API surface is huge so currently we don't expose separate external functions in IR for each of them, instead we expose a few external functions to wrapper libraries which manages Vulkan runtime. Differential Revision: https://reviews.llvm.org/D74549	2020-02-13 14:10:07 -05:00
Stephan Herhut	715783d415	[MLIR][GPU] Implement initial mapping from loop.parallel to gpu.launch. Summary: To unblock other work, this implements basic lowering based on mapping attributes that have to be provided on all loop.parallel. The lowering does not yet support reduce. Differential Revision: https://reviews.llvm.org/D73893	2020-02-13 16:54:16 +01:00
Tobias Gysi	4f865b7794	[mlir] support creating memref descriptors from static shape with non-zero offset This patch adapts the method MemRefDescriptor::fromStaticShape to support static non-zero offsets. The updated method uses the getStridesAndOffset method to extract strides and offset. The patch also adapts the test cases since sizes and strides are now set in forward instead of reverse order. Differential Revision: https://reviews.llvm.org/D74474	2020-02-12 22:40:49 +01:00
Pierre Oechsel	fd11cda251	[mlir] StdToLLVM: Add error when the sourceMemRef of a subview is not a llvm type. A memref_cast casting to a memref with a non identity map can't be lowered to llvm. Take the following case: ``` func @invalid_memref_cast(%arg0: memref<?x?xf64>) { %c1 = constant 1 : index %c0 = constant 0 : index %5 = memref_cast %arg0 : memref<?x?xf64> to memref<?x?xf64, #map1> %25 = std.subview %5[%c0, %c0][%c1, %c1][] : memref<?x?xf64, #map1> to memref<?x?xf64, #map1> return } ``` When lowering the subview mlir was assuming `%5` to have an llvm type (which is not the case as mlir failed to lower the memref_cast). Differential Revision: https://reviews.llvm.org/D74466	2020-02-12 15:13:18 +01:00
Lei Zhang	d3e7816d85	[mlir][spirv] Introduce spv.func Thus far we have been using builtin func op to model SPIR-V functions. It was because builtin func op used to have special treatment in various parts of the core codebase (e.g., pass pipelines, etc.) and it's easy to bootstrap the development of the SPIR-V dialect. But nowadays with general op concepts and region support we don't have such limitations and it's time to tighten the SPIR-V dialect for completeness. This commits introduces a spv.func op to properly model SPIR-V functions. Compared to builtin func op, it can provide the following benefits: * We can control the full op so we can integrate SPIR-V information bits (e.g., function control) in a more integrated way and define our own assembly form and enforcing better verification. * We can have a better dialect and library boundary. At the current moment only functions are modelled with an external op. With this change, all ops modelling SPIR-V concpets will be spv.* ops and registered to the SPIR-V dialect. * We don't need to special-case func op anymore when creating ConversionTarget declaring SPIR-V dialect as legal. This is quite important given we'll see more and more conversions in the future. In the process, bumps a few FuncOp methods to the FunctionLike trait. Differential Revision: https://reviews.llvm.org/D74226	2020-02-12 07:46:43 -05:00
Mehdi Amini	7b635880ab	Fix MLIR build when the NVPTX target isn't configured Differential Revision: https://reviews.llvm.org/D74472	2020-02-12 12:38:45 +00:00
aartbik	e83b7b99da	[mlir] [VectorOps] Implement vector.reduce operation Summary: This new operation operates on 1-D vectors and forms the bridge between vector.contract and llvm intrinsics for vector reductions. Reviewers: nicolasvasilache, andydavis1, ftynse Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74370	2020-02-11 11:31:59 -08:00
Diego Caballero	696f80736b	[mlir] Turn flags in ConvertStandardToLLVM into pass flags Follow-up on D72802. Turn -convert-std-to-llvm-use-alloca and -convert-std-to-llvm-bare-ptr-memref-call-conv into pass flags of LLVMLoweringPass. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D73912	2020-02-11 10:28:30 -08:00
Stephan Herhut	890d5e2dd2	[MLIR][GPU] Disallow llvm tanh intrinsics when lowering to NVVM/ROCm. Summary: The lowering to NVVM and ROCm handles tanh operations differently by mapping them to NVVM/ROCm specific intrinsics. This conflicts with the lowering to LLVM, which uses the default llvm intrinsic. This change declares the LLVM intrinsics to be illegal, hence disallowing the correspondign rewrite. Differential Revision: https://reviews.llvm.org/D74389	2020-02-11 15:09:30 +01:00
Lei Zhang	50aeeed8a2	[mlir][spirv] Use spv.entry_point_abi in GPU to SPIR-V conversions We have spv.entry_point_abi for specifying the local workgroup size. It should be decorated onto input gpu.func ops to drive the SPIR-V CodeGen to generate the proper SPIR-V module execution mode. Compared to using command-line options for specifying the configuration, using attributes also has the benefits that 1) we are now able to use different local workgroup for different entry points and 2) the tests contains the configuration directly. Differential Revision: https://reviews.llvm.org/D74012	2020-02-10 16:24:48 -05:00
Nicolas Vasilache	75394e1301	[mlir][EDSC] Almost NFC - Refactor and untangle EDSC dependencies This CL refactors EDSCs to layer them better and break unnecessary dependencies. After this refactoring, the top-level EDSC target only depends on IR but not on Dialects anymore and each dialect has its own EDSC directory. This simplifies the layering and breaks cyclic dependencies. In particular, the declarative builder + folder are made explicit and are now confined to Linalg. As the refactoring occurred, certain classes and abstractions that were not paying for themselves have been removed. Differential Revision: https://reviews.llvm.org/D74302	2020-02-10 12:10:41 -05:00
Tobias Gysi	1555d7f729	[mlir] subview op lowering for target memrefs with const offset The current standard to llvm conversion pass lowers subview ops only if dynamic offsets are provided. This commit extends the lowering with a code path that uses the constant offset of the target memref for the subview op lowering (see Example 3 of the subview op definition for an example) if no dynamic offsets are provided. Differential Revision: https://reviews.llvm.org/D74280	2020-02-10 17:35:17 +01:00
Alex Zinenko	5a1778057f	[mlir] use unpacked memref descriptors at function boundaries The existing (default) calling convention for memrefs in standard-to-LLVM conversion was motivated by interfacing with LLVM IR produced from C sources. In particular, it passes a pointer to the memref descriptor structure when calling the function. Therefore, the descriptor is allocated on stack before the call. This convention leads to several problems. PR44644 indicates a problem with stack exhaustion when calling functions with memref-typed arguments in a loop. Allocating outside of the loop may lead to concurrent access problems in case the loop is parallel. When targeting GPUs, the contents of the stack-allocated memory for the descriptor (passed by pointer) needs to be explicitly copied to the device. Using an aggregate type makes it impossible to attach pointer-specific argument attributes pertaining to alignment and aliasing in the LLVM dialect. Change the default calling convention for memrefs in standard-to-LLVM conversion to transform a memref into a list of arguments, each of primitive type, that are comprised in the memref descriptor. This avoids stack allocation for ranked memrefs (and thus stack exhaustion and potential concurrent access problems) and simplifies the device function invocation on GPUs. Provide an option in the standard-to-LLVM conversion to generate auxiliary wrapper function with the same interface as the previous calling convention, compatible with LLVM IR porduced from C sources. These auxiliary functions pack the individual values into a descriptor structure or unpack it. They also handle descriptor stack allocation if necessary, serving as an allocation scope: the memory reserved by `alloca` will be freed on exiting the auxiliary function. The effect of this change on MLIR-generated only LLVM IR is minimal. When interfacing MLIR-generated LLVM IR with C-generated LLVM IR, the integration only needs to require auxiliary functions and change the function name to call the wrapper function instead of the original function. This also opens the door to forwarding aliasing and alignment information from memrefs to LLVM IR pointers in the standrd-to-LLVM conversion.	2020-02-10 15:03:43 +01:00
MaheshRavishankar	aaddca1efd	[mlir][GPUToSPIRV] Modify the lowering of gpu.block_dim to be consistent with Vulkan SPEC The existing lowering of gpu.block_dim added a global variable with the WorkGroupSize decoration. This raises an error within Vulkan/SPIR-V validation since Vulkan requires this to have a constant initializer. This is not yet supported in SPIR-V dialect. Changing the lowering to return the workgroup size as a constant value instead, obtained from spv.entry_point_abi attribute gets around the issue for now. The validation goes through since the workgroup size is specified using spv.execution_mode operation.	2020-02-08 22:30:03 -08:00
Nicolas Vasilache	681f929f59	[mlir][VectorOps] Introduce a `vector.fma` op that works on n-D vectors and lowers to `llvm.intrin.fmuladd` Summary: The `vector.fma` operation is portable enough across targets that we do not want to keep it wrapped under `vector.outerproduct` and `llvm.intrin.fmuladd`. This revision lifts the op into the vector dialect and implements the lowering to LLVM by using two patterns: 1. a pattern that lowers from n-D to (n-1)-D by unrolling when n > 2 2. a pattern that converts from 1-D to the proper LLVM representation Reviewers: ftynse, stellaraccident, aartbik, dcaballe, jsetoain, tetuante Reviewed By: aartbik Subscribers: fhahn, dcaballe, merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74075	2020-02-07 15:44:53 -05:00
Nicolas Vasilache	499ad45877	[mlir][VectorOps] Expose and use llvm.intrin.fma* Summary: This revision exposes the portable `llvm.fma` intrinsic in LLVMOps and uses it in lieu of `llvm.fmuladd` when lowering the `vector.outerproduct` op to LLVM. This guarantees proper `fma` instructions will be emitted if the target ISA supports it. `llvm.fmuladd` does not have this guarantee in its semantics, despite evidence that the proper x86 instructions are emitted. For more details, see https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic. Reviewers: ftynse, aartbik, dcaballe, fhahn Reviewed By: aartbik Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74219	2020-02-07 15:38:40 -05:00
aartbik	e52414b1ae	[mlir][VectorOps] Generalized vector.print to i32/i64 Summary: Lowering to LLVM IR was restricted to float/double. This CL also adds the integral values. Reviewers: andydavis1, nicolasvasilache, ftynse Reviewed By: nicolasvasilache, ftynse Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74179	2020-02-07 09:25:30 -08:00
OuHangKresnik	5c3b34930c	[mlir] Add AffineMaxOp Differential Revision: https://reviews.llvm.org/D73848	2020-02-06 10:26:50 +01:00
Stephan Herhut	921d4e7c8d	[MLIR][GPU] Fix build files for mlir-opt. The recent refactoring of build files broke building with the MIR CUDA integration enabled. This fixes it by adding some additional dependencies to mlir-opt. Differential Revision: https://reviews.llvm.org/D74041	2020-02-05 17:13:48 +00:00
Lei Zhang	13b197c7d1	[mlir][spirv] Add dialect-specific attribute for target environment We were using normal dictionary attribute for target environment specification. It becomes cumbersome with more and more fields. This commit changes the modelling to a dialect-specific attribute, where we can have control over its storage and assembly form. Differential Revision: https://reviews.llvm.org/D73959	2020-02-04 21:33:13 -05:00
Stephen Neuendorffer	d7cbef2714	[MLIR] Fixes for shared library dependencies. Summary: This patch is a step towards enabling BUILD_SHARED_LIBS=on, which builds most libraries as DLLs instead of statically linked libraries. The main effect of this is that incremental build times are greatly reduced, since usually only one library need be relinked in response to isolated code changes. The bulk of this patch is fixing incorrect usage of cmake, where library dependencies are listed under add_dependencies rather than under target_link_libraries or under the LINK_LIBS tag. Correct usage should be like this: add_dependencies(MLIRfoo MLIRfooIncGen) target_link_libraries(MLIRfoo MLIRlib1 MLIRlib2) A separate issue is that in cmake, dependencies between static libraries are automatically included in dependencies. In the above example, if MLIBlib1 depends on MLIRlib2, then it is sufficient to have only MLIRlib1 in the target_link_libraries. When compiling with shared libraries, it is necessary to have both MLIRlib1 and MLIRlib2 specified if MLIRfoo uses symbols from both. Reviewers: mravishankar, antiagainst, nicolasvasilache, vchuravy, inouehrs, mehdi_amini, jdoerfert Reviewed By: nicolasvasilache, mehdi_amini Subscribers: Joonsoo, merge_guards_bot, jholewinski, mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73653	2020-02-04 08:56:37 -08:00
Alex Zinenko	e0ea706a59	[mlir] ConvertStandardToLLVM: do not rely on command line options internally The patterns for converting `std.alloc` and `std.dealoc` can be configured to use `llvm.alloca` instead of calling `malloc` and `free`. This configuration has been only possible through a command-line flag, despite the presence of a (misleading) parameter in the pass constructor. Use the parameter instead and only initalize it from the command line flags if the pass is constructed from the mlir-opt registration.	2020-02-03 13:50:41 +01:00
Alex Zinenko	f3fa4a34b6	[mlir] Drop customization hooks from StandardToLLVM conversion Summary: These hooks were originally introduced to support passes deriving the StandardToLLVM conversion, in particular converting types from different dialects to LLVM types in a single-step conversion. They are no longer in use since the pass and conversion infrastructure has evolved sufficiently to make defining new passes with exactly the same functionality simple through the use of populate* functions, conversion targets and type converters. Remove the hooks. Any users of this hooks can call the dialect conversion infrastructure directly instead, which is likely to require less LoC than these hooks. Differential Revision: https://reviews.llvm.org/D73795	2020-02-03 13:26:17 +01:00
Alexander Belyaev	3dcc1fc61b	[MLIR][Linalg] Lower linalg.generic to ploops. Differential Revision: https://reviews.llvm.org/D73684	2020-02-03 11:52:23 +01:00
Stephan Herhut	283b5e733d	[MLIR] Make gpu.launch implicitly capture uses of values defined above. Summary: In the original design, gpu.launch required explicit capture of uses and passing them as operands to the gpu.launch operation. This was motivated by infrastructure restrictions rather than design. This change lifts the requirement and removes the concept of kernel arguments from gpu.launch. Instead, the kernel outlining transformation now does the explicit capturing. This is a breaking change for users of gpu.launch. Differential Revision: https://reviews.llvm.org/D73769	2020-02-03 10:08:48 +01:00
Diego Caballero	e5aaf30cf1	[mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialect Summary: This patch introduces an alternative calling convention for MemRef function arguments in LLVM dialect. It converts MemRef function arguments to LLVM bare pointers to the MemRef element type instead of creating a MemRef descriptor. Bare pointers are then promoted to a MemRef descriptors at the beginning of the function. This calling convention is only enabled with a flag. Reviewers: ftynse, bondhugula, nicolasvasilache, rriddle, mehdi_amini Reviewed By: ftynse, rriddle, mehdi_amini Subscribers: Joonsoo, flaub, merge_guards_bot, jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72802	2020-01-31 15:19:38 -08:00
aartbik	c8fc76a99b	[mlir] [VectorOps] fixed bug in vector.insert_strided_slice lowering Summary: Rationale: When lowering to LLVM for different rank insert (n vs k), the offset arrays needs to drop one dimension (becomes n-1), but the strides array needs to be preserved (remains k). With regression test. Note that this example was actually in the documentation, so extra important to do it right :-) Reviewers: nicolasvasilache, andydavis1, ftynse Reviewed By: nicolasvasilache, ftynse Subscribers: Joonsoo, merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73733	2020-01-31 11:29:46 -08:00
Alex Zinenko	23ccc055c7	[mlir] Remove the dependency of StdToLLVM on LoopToStd This is a leftover of a temporary state where loop operations were in Standard dialect.	2020-01-31 19:47:33 +01:00
Lei Zhang	df71000d7d	[mlir][spirv] Convert linalg.generic for reduction to SPIR-V ops This commit adds a pattern to lower linalg.generic for reduction to spv.GroupNonUniform* ops. Right now this only supports integer reduction on 1-D input memref. Shader entry point ABI is queried to make sure that the input memref's shape matches the local workgroup's invocation configuration. This makes sure that the workload fits in one local workgroup so that we can leverage SPIR-V group non-uniform operations. linglg.generic is a structured op that preserves the right level of information. It is easier to recognize reduction at this level than performing analysis on loops. This commit also exposes `getElementPtr` in SPIRVLowering.h given that it's a generally useful utility function. Differential Revision: https://reviews.llvm.org/D73437	2020-01-31 09:37:04 -05:00
Stephan Herhut	84695dd4d7	Fix conversion of loops to GPU with no block/thread dimensions. Summary: The current code assumes that one always maps at least one loop to block dimensions and at least one loop to thread dimensions. If either is not the case, a loop would get mapped twice. Differential Revision: https://reviews.llvm.org/D73685	2020-01-31 11:00:28 +01:00
Tim Shen	3ccaac3cdd	[mlir] Add MemRefTypeBuilder and refactor some MemRefType::get(). The refactored MemRefType::get() calls all intend to clone from another memref type, with some modifications. In fact, some calls dropped memory space during the cloning. Migrate them to the cloning API so that nothing gets dropped if they are not explicitly listed. It's close to NFC but not quite, as it helps with propagating memory spaces in some places. Differential Revision: https://reviews.llvm.org/D73296	2020-01-30 23:30:46 -08:00
Lubomir Litchev	fcabccd3d9	[MLIR] Add the sqrt operation to mlir. Summary: Add and pipe through the sqrt operation for Standard and LLVM dialects. Reviewers: nicolasvasilache, ftynse Reviewed By: ftynse Subscribers: frej, ftynse, merge_guards_bot, flaub, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73571	2020-01-30 08:07:38 -08:00
Julian Gross	addc27bc43	Changed wrong ROCDL instructions in GPU lowering. Summary: In the scope of the lowering phase from GPU to ROCDL, the intructions for the conversion patterns seems to be wrong. According to https://github.com/ROCm-Developer-Tools/HIP/blob/master/include/hip/hcc_detail/math_fwd.h the instructions need two underscores in the beginning instead of one. Reviewers: nicolasvasilache, herhut, rriddle Reviewed By: herhut, rriddle Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73535	2020-01-30 15:37:00 +01:00
Stephan Herhut	2692751895	Add 'gpu.terminator' operation. Summary: The 'gpu.terminator' operation is used as the terminator for the regions of gpu.launch. This is to disambugaute them from the return operation on 'gpu.func' functions. This is a breaking change and users of the gpu dialect will need to adapt their code when producting 'gpu.launch' operations. Reviewers: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73620	2020-01-30 12:41:41 +01:00
Benjamin Kramer	adcd026838	Make llvm::StringRef to std::string conversions explicit. This is how it should've been and brings it more in line with std::string_view. There should be no functional change here. This is mostly mechanical from a custom clang-tidy check, with a lot of manual fixups. It uncovers a lot of minor inefficiencies. This doesn't actually modify StringRef yet, I'll do that in a follow-up.	2020-01-28 23:25:25 +01:00
Stephan Herhut	fdcecefe30	Add lowering for loop.parallel to cfg. Summary: This also removes the explicit pattern for loop.terminator to ensure that the terminator is only erased if the parent op is rewritten. Reductions are not yet supported. Reviewers: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73348	2020-01-28 11:55:51 +01:00
Alex Zinenko	8ed47b7430	[mlir] NFC: use ValueRange in AffineToStandard conversion ValueRange is a more flexible way of passing around ranges of Values that avoids Value vector materialization in affine expression expansion.	2020-01-28 11:54:52 +01:00
Julian Gross	664d2f5bad	Add tanh lowering from Standard dialect to NVVM and ROCDL. Summary: The tanh lowering from Standard dialect to NVVM and ROCDL was not working. The conversion pattern are inserted in the lowering files. The test cases for the lowerings were added in the test files. Reviewers: nicolasvasilache, ftynse, herhut Reviewed By: ftynse, herhut Subscribers: merge_guards_bot, ftynse, jholewinski, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73471	2020-01-28 11:01:10 +01:00
Alex Zinenko	6895a1c37e	[mlir] NFC: use doxygen-style comments in AffineToStandard.cpp	2020-01-28 10:22:30 +01:00
River Riddle	aff4ed7326	[mlir][NFC] Update Operation::getResultTypes to use ArrayRef<Type> instead of iterator_range. Summary: The new internal representation of operation results now allows for accessing the result types to be more efficient. Changing the API to ArrayRef is more efficient and removes the need to explicitly materialize vectors in several places. Differential Revision: https://reviews.llvm.org/D73429	2020-01-27 19:57:48 -08:00
Kern Handa	74df89f67f	[NFC][mlir][linalg] Merge Utils/Intrinsics.h into EDSC/Intrinsics.h Differential Revision: https://reviews.llvm.org/D73377	2020-01-27 22:32:11 +01:00
Alex Zinenko	51ba5b528a	[mlir] add lowering from affine.min to std Summary: Affine minimum computation will be used in tiling transformation. The implementation is mostly boilerplate as we already lower the minimum in the upper bound of an affine loop. Differential Revision: https://reviews.llvm.org/D73488	2020-01-27 22:30:52 +01:00
aartbik	459cf6e500	[mlir] [VectorOps] Lowering of vector.extract/insert_slices to LLVM IR Summary: Uses progressive lowering to convert vector.extract_slices and vector_insert_slices to equivalent vector operations that can be subsequently lowered into LLVM. Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache, rriddle Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72808	2020-01-27 10:35:48 -08:00
Lei Zhang	09f9deaff2	[mlir][spirv] NFC: simplify load/store builder call sites This commit introduces default values for load/store builders to simplify builder call sites. Differential Revision: https://reviews.llvm.org/D73419	2020-01-26 10:45:42 -05:00
Lei Zhang	91d6655a29	[mlir][spirv] NFC: expose builtin func op conversion pattern This commit exposes the func op conversion pattern via a new `populateBuiltinFuncToSPIRVPatterns` function from the standard to SPIR-V conversion passs. This is structurally better given that func op belongs to the builtin dialect. More importantly, this makes the pattern reusable to other dialect to SPIR-V dialect conversion as other dialect can well adopt builtin func op instead of having its own. Besides, it's very common to use func ops as test wrappers in lit tests, so test passes will need to handle func ops too. Differential Revision: https://reviews.llvm.org/D73421	2020-01-26 10:42:06 -05:00
Mehdi Amini	308571074c	Mass update the MLIR license header to mention "Part of the LLVM project" This is an artifact from merging MLIR into LLVM, the file headers are now aligned with the rest of the project.	2020-01-26 03:58:30 +00:00
Benjamin Kramer	90c01357b8	[mlir] Shrink-wrap anonymous namespaces around the classes it's supposed to enclose. NFC. The coding standards prefer smaller anonymous namespaces with free functions just being static and in the global namespace.	2020-01-23 11:47:20 +01:00
Denis Khalikov	4460cb5bcd	[mlir][spirv] Add lowering for composite std.constant. Add lowering for constant operation with ranked tensor type to spv.constant with spv.array type. Differential Revision: https://reviews.llvm.org/D73022	2020-01-22 08:25:00 -05:00
Denis Khalikov	3023352a7d	[mlir][spirv] Simplify scalar type size calculation. Simplify scalar type size calculation and reject boolean memrefs. Differential Revision: https://reviews.llvm.org/D72999	2020-01-21 12:15:37 -05:00
Tres Popp	9a52ea5cf9	Create a gpu.module operation for the GPU Dialect. Summary: This is based on the use of code constantly checking for an attribute on a model and instead represents the distinct operaion with a different op. Instead, this op can be used to provide better filtering. Reverts "Revert "[mlir] Create a gpu.module operation for the GPU Dialect."" This reverts commit ac446302ca4145cdc89f377c0c364c29ee303be5 after fixing internal Google issues. This additionally updates ROCDL lowering to use the new gpu.module. Reviewers: herhut, mravishankar, antiagainst, nicolasvasilache Subscribers: jholewinski, mgorny, mehdi_amini, jpienaar, burmako, shauheen, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits, mravishankar, rriddle, antiagainst, bkramer Tags: #llvm Differential Revision: https://reviews.llvm.org/D72921	2020-01-21 14:05:03 +01:00
Kazuaki Ishizaki	fc817b09e2	[mlir] NFC: Fix trivial typos in comments Differential Revision: https://reviews.llvm.org/D73012	2020-01-20 03:17:03 +00:00
Rainer Orth	002ec79f97	[mlir] NFC: Rename index_t to index_type mlir currently fails to build on Solaris: /vol/llvm/src/llvm-project/dist/mlir/lib/Conversion/VectorToLoops/ConvertVectorToLoops.cpp:78:20: error: reference to 'index_t' is ambiguous IndexHandle zero(index_t(0)), one(index_t(1)); ^ /usr/include/sys/types.h:103:16: note: candidate found by name lookup is 'index_t' typedef short index_t; ^ /vol/llvm/src/llvm-project/dist/mlir/include/mlir/EDSC/Builders.h:27:8: note: candidate found by name lookup is 'mlir::edsc::index_t' struct index_t { ^ and many more. Given that POSIX reserves all identifiers ending in `_t` 2.2.2 The Name Space <https://pubs.opengroup.org/onlinepubs/9699919799/functions/V2_chap02.html>, it seems quite unwise to use such identifiers in user code, even more so without a distinguished prefix. The following patch fixes this by renaming `index_t` to `index_type`. cases. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D72619	2020-01-18 22:10:46 +01:00
Denis Khalikov	29779894af	[mlir][spirv] Add lowering from `loop.if` to `spv.selection` When lowering `loop.if` to `spv.selection` we explicitly create a selection header block before the control flow diverges and a merge block where control flow subsequently converges. Differential Revision: https://reviews.llvm.org/D72836	2020-01-17 12:04:12 -05:00
Benjamin Kramer	0133cc60e4	Revert "[mlir] Create a gpu.module operation for the GPU Dialect." This reverts commit `4624a1e8ac`. Causing problems downstream.	2020-01-15 17:52:17 +01:00
Nicolas Vasilache	7741de9435	[mlir][Linalg] NFC - Cleanup Linalg Pass locations and namespacing Summary: This diff moves the conversion pass declaration closer to its definition and makes the namespacing of passes consistent with the rest of the infrastructure (i.e. `mlir::linalg::createXXXPass` -> `mlir::createXXXPass`). Reviewers: ftynse, jpienaar, mehdi_amini Subscribers: rriddle, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72766	2020-01-15 11:06:28 -05:00
Nicolas Vasilache	89b395fe79	[mlir][EDSC] Refactor dependencies involving EDSCs. Summary: This diff removes the dependency of LinalgOps and VectorOps on EDSCs. Reviewers: jpienaar, ftynse Reviewed By: ftynse Subscribers: merge_guards_bot, mgorny, mehdi_amini, rriddle, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72481	2020-01-15 09:34:29 -05:00
Lei Zhang	47c6ab2b97	[mlir][spirv] Properly support SPIR-V conversion target This commit defines a new SPIR-V dialect attribute for specifying a SPIR-V target environment. It is a dictionary attribute containing the SPIR-V version, supported extension list, and allowed capability list. A SPIRVConversionTarget subclass is created to take in the target environment and sets proper dynmaically legal ops by querying the op availability interface of SPIR-V ops to make sure they are available in the specified target environment. All existing conversions targeting SPIR-V is changed to use this SPIRVConversionTarget. It probes whether the input IR has a `spv.target_env` attribute, otherwise, it uses the default target environment: SPIR-V 1.0 with Shader capability and no extra extensions. Differential Revision: https://reviews.llvm.org/D72256	2020-01-14 19:18:42 -05:00
Benjamin Kramer	df186507e1	Make helper functions static or move them into anonymous namespaces. NFC.	2020-01-14 14:06:37 +01:00
Tres Popp	4624a1e8ac	[mlir] Create a gpu.module operation for the GPU Dialect. Summary: This is based on the use of code constantly checking for an attribute on a model and instead represents the distinct operaion with a different op. Instead, this op can be used to provide better filtering. Reviewers: herhut, mravishankar, antiagainst, rriddle Reviewed By: herhut, antiagainst, rriddle Subscribers: liufengdb, aartbik, jholewinski, mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72336	2020-01-14 12:05:47 +01:00
Benjamin Kramer	a2cd4fe6bf	Unbreak the mlir build after `202ab273e6`	2020-01-13 19:18:43 +01:00
Julian Gross	202ab273e6	[mlir] Added missing GPU lowering ops. Summary: This diff adds missing GPU lowering ops to MLIR. Reviewers: herhut, pifon2a, ftynse Tags: #pre-merge_beta_testing, #llvm Differential Revision: https://reviews.llvm.org/D72439	2020-01-13 17:10:54 +01:00
River Riddle	2bdf33cc4c	[mlir] NFC: Remove Value::operator* and Value::operator-> now that Value is properly value-typed. Summary: These were temporary methods used to simplify the transition. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D72548	2020-01-11 08:54:39 -08:00
Eric Schweitz	016bf03ef6	[mlir] add a missing dependency for Linalg conversion We were seeing some occasional build failures that would come and go. It appeared to be this missing dependence. Differential Revision: https://reviews.llvm.org/D72419	2020-01-09 23:00:41 +01:00
Nicolas Vasilache	2d515e49d8	[mlir][VectorOps] Implement insert_strided_slice conversion Summary: This diff implements the progressive lowering of insert_strided_slice. Two cases appear: 1. when the source and dest vectors have different ranks, extract the dest subvector at the proper offset and reduce to case 2. 2. when they have the same rank N: a. if the source and dest type are the same, the insertion is trivial: just forward the source b. otherwise, iterate over all N-1 D subvectors and create an extract/insert_strided_slice/insert replacement, reducing the problem to vecotrs of the same N-1 rank. This combines properly with the other conversion patterns to lower all the way to LLVM. Reviewers: ftynse, rriddle, AlexEichenberger, andydavis1, tetuante, nicolasvasilache Reviewed By: andydavis1 Subscribers: merge_guards_bot, mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72317	2020-01-09 03:13:01 -05:00
Nicolas Vasilache	65678d9384	[mlir][VectorOps] Implement strided_slice conversion Summary: This diff implements the progressive lowering of strided_slice to either: 1. extractelement + insertelement for the 1-D case 2. extract + optional strided_slice + insert for the n-D case. This combines properly with the other conversion patterns to lower all the way to LLVM. Appropriate tests are added. Reviewers: ftynse, rriddle, AlexEichenberger, andydavis1, tetuante Reviewed By: andydavis1 Subscribers: merge_guards_bot, mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72310	2020-01-09 03:03:51 -05:00
Nicolas Vasilache	766ce87e9b	[mlir][Linalg] Lower linalg.reshape to LLVM for the static case Summary: This diff adds lowering of the linalg.reshape op to LLVM. A new descriptor is created with fields initialized as follows: 1. allocatedPTr, alignedPtr and offset are copied from the source descriptor 2. sizes are copied from the static destination shape 3. strides are copied from the static strides collected with `getStridesAndOffset` Only the static case in which the target view conforms to strided memref semantics is supported. Other cases are left for future work and will be added on a per-need basis. Reviewers: ftynse, mravishankar Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72316	2020-01-08 13:07:41 -05:00
Denis Khalikov	eac01f63a6	[mlir][spirv] Add lowering for std.fpext, std.fptrunc, std.sitofp. Differential Revision: https://reviews.llvm.org/D72137	2020-01-07 22:13:07 -05:00
Lei Zhang	dab2921f77	Revert "[mlir][spirv] Add lowering for std.fpext, std.fptrunc, std.sitofp." This reverts commit `7e7f849a6d` because it recorded the wrong commit author.	2020-01-07 22:11:17 -05:00
Denis Khalikov	dd495e8a87	[mlir][spirv] Add lowering for std cmp ops. Differential Revision: https://reviews.llvm.org/D72296	2020-01-07 21:51:51 -05:00
Denis Khalikov	9883b14cd1	[mlir][spirv] Add lowering for standard bit ops Differential Revision: https://reviews.llvm.org/D72205	2020-01-07 21:45:54 -05:00
Lei Zhang	7e7f849a6d	[mlir][spirv] Add lowering for std.fpext, std.fptrunc, std.sitofp. Differential Revision: https://reviews.llvm.org/D72137	2020-01-07 21:28:49 -05:00
Nicolas Vasilache	2140a973f2	[mlir][Linalg] Extend generic ops to allow tensors Summary: This diff adds support to allow `linalg.generic` and `linalg.indexed_generic` to take tensor input and output arguments. The subset of output tensor operand types must appear verbatim in the result types after an arrow. The parser, printer and verifier are extended to accomodate this behavior. The Linalg operations now support variadic ranked tensor return values. This extension exhibited issues with the current handling of NativeCall in RewriterGen.cpp. As a consequence, an explicit cast to `SmallVector<Value, 4>` is added in the proper place to support the new behavior (better suggestions are welcome). Relevant cleanups and name uniformization are applied. Relevant invalid and roundtrip test are added. Reviewers: mehdi_amini, rriddle, jpienaar, antiagainst, ftynse Subscribers: burmako, shauheen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72022	2020-01-02 13:54:57 -05:00
Fangrui Song	eeef50b1fe	[mlir] Fix -Wrange-loo-analysis warnings for (const auto &x : llvm::zip(..., ...)) -> for (auto x : llvm::zip(..., ...)) The return type of zip() is a wrapper that wraps a tuple of references. > warning: loop variable 'p' is always a copy because the range of type 'detail::zippy<detail::zip_shortest, ArrayRef<long> &, ArrayRef<long> &>' does not return a reference [-Wrange-loop-analysis]	2020-01-01 16:06:04 -08:00
Tung Le Duc	e5957ac3d7	[mlir] Fix the wrong computation of dynamic strides for lowering AllocOp to LLVM Leftover change from before the MLIR merge, reviewed at accepted at https://github.com/tensorflow/mlir/pull/338.	2019-12-28 23:33:28 +01:00
MaheshRavishankar	c3d3569d4c	[mlir] Convert std.and/std.or ops to spv.LogicalAnd/spv.LogicalOr The conversion from std.and/std.or to spv.LogicalAnd/spv.LogicalOr is only valid for boolean (i1) types. Modify BinaryOpPattern in StandardToSPIRV.td to allow limiting the type of the operands for which the pattern is applied. Differential Revision: https://reviews.llvm.org/D71881	2019-12-27 11:33:17 -08:00
River Riddle	21610e6651	Refactor the way that pass options are specified. This change refactors pass options to be more similar to how statistics are modeled. More specifically, the options are specified directly on the pass instead of in a separate options class. (Note that the behavior and specification for pass pipelines remains the same.) This brings about several benefits: * The specification of options is much simpler * The round-trip format of a pass can be generated automatically * This gives a somewhat deeper integration with "configuring" a pass, which we could potentially expose to users in the future. PiperOrigin-RevId: 286953824	2019-12-23 16:48:22 -08:00
River Riddle	e62a69561f	NFC: Replace ValuePtr with Value and remove it now that Value is value-typed. ValuePtr was a temporary typedef during the transition to a value-typed Value. PiperOrigin-RevId: 286945714	2019-12-23 16:36:53 -08:00
River Riddle	5d5bd2e1da	Change the `notifyRootUpdated` API to be transaction based. This means that in-place, or root, updates need to use explicit calls to `startRootUpdate`, `finalizeRootUpdate`, and `cancelRootUpdate`. The major benefit of this change is that it enables in-place updates in DialectConversion, which simplifies the FuncOp pattern for example. The major downside to this is that the cases that may modify an operation in-place will need an explicit cancel on the failure branches(assuming that they started an update before attempting the transformation). PiperOrigin-RevId: 286933674	2019-12-23 16:26:15 -08:00
Mehdi Amini	56222a0694	Adjust License.txt file to use the LLVM license PiperOrigin-RevId: 286906740	2019-12-23 15:33:37 -08:00
River Riddle	35807bc4c5	NFC: Introduce new ValuePtr/ValueRef typedefs to simplify the transition to Value being value-typed. This is an initial step to refactoring the representation of OpResult as proposed in: https://groups.google.com/a/tensorflow.org/g/mlir/c/XXzzKhqqF_0/m/v6bKb08WCgAJ This change will make it much simpler to incrementally transition all of the existing code to use value-typed semantics. PiperOrigin-RevId: 286844725	2019-12-22 22:00:23 -08:00
Manuel Freiberger	22954a0e40	Add integer bit-shift operations to the standard dialect. Rename the 'shlis' operation in the standard dialect to 'shift_left'. Add tests for this operation (these have been missing so far) and add a lowering to the 'shl' operation in the LLVM dialect. Add also 'shift_right_signed' (lowered to LLVM's 'ashr') and 'shift_right_unsigned' (lowered to 'lshr'). The original plan was to name these operations 'shift.left', 'shift.right.signed' and 'shift.right.unsigned'. This works if the operations are prefixed with 'std.' in MLIR assembly. Unfortunately during import the short form is ambigous with operations from a hypothetical 'shift' dialect. The best solution seems to omit dots in standard operations for now. Closes tensorflow/mlir#226 PiperOrigin-RevId: 286803388	2019-12-22 10:02:13 -08:00
Aart Bik	1d47564a53	[VectorOps] unify vector dialect "subscripts" PiperOrigin-RevId: 286650682	2019-12-20 15:33:04 -08:00
Christian Sigg	42d46b4efa	Add gpu.shuffle op. This will allow us to lower most of gpu.all_reduce (when all_reduce doesn't exist in the target dialect) within the GPU dialect, and only do target-specific lowering for the shuffle op. PiperOrigin-RevId: 286548256	2019-12-20 02:52:52 -08:00
River Riddle	7b3adda8f4	Move the specializations of VectorTransferRewriter::matchAndRewrite back into the anonymous namespace. This appeases the GCC bug related to specializations in a different namespace. PiperOrigin-RevId: 286234667	2019-12-18 11:53:57 -08:00
Marcel Koester	6054610bbe	Added LLVM ops and lowering phases from standard dialect for FAbs, FCeil, Cos, FNeg, CopySign. Added test cases for the newly added LLVM operations and lowering features. Closes tensorflow/mlir#300 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/300 from dfki-jugr:std_to_llvm da6168bbc1a369ae2e99ad3881fdddd82f075dd4 PiperOrigin-RevId: 286231169	2019-12-18 11:42:43 -08:00
Aart Bik	d9b500d3bb	[VectorOps] Add vector.print definition, with lowering support Examples: vector.print %f : f32 vector.print %x : vector<4xf32> vector.print %y : vector<3x4xf32> vector.print %z : vector<2x3x4xf32> LLVM lowering replaces these with fully unrolled calls into a small runtime support library that provides some basic printing operations (single value, opening closing bracket, comma, newline). PiperOrigin-RevId: 286230325	2019-12-18 11:31:34 -08:00
River Riddle	2666b97314	NFC: Cleanup non-conforming usages of namespaces. * Fixes use of anonymous namespace for static methods. * Uses explicit qualifiers(mlir::) instead of wrapping the definition with the namespace. PiperOrigin-RevId: 286222654	2019-12-18 10:46:48 -08:00
Uday Bondhugula	47034c4bc5	Introduce prefetch op: affine -> std -> llvm intrinsic Introduce affine.prefetch: op to prefetch using a multi-dimensional subscript on a memref; similar to affine.load but has no effect on semantics, but only on performance. Provide lowering through std.prefetch, llvm.prefetch and map to llvm's prefetch instrinsic. All attributes reflected through the lowering - locality hint, rw, and instr/data cache. affine.prefetch %0[%i, %j + 5], false, 3, true : memref<400x400xi32> Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#225 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/225 from bondhugula:prefetch 4c3b4e93bc64d9a5719504e6d6e1657818a2ead0 PiperOrigin-RevId: 286212997	2019-12-18 10:00:04 -08:00
River Riddle	4562e389a4	NFC: Remove unnecessary 'llvm::' prefix from uses of llvm symbols declared in `mlir` namespace. Aside from being cleaner, this also makes the codebase more consistent. PiperOrigin-RevId: 286206974	2019-12-18 09:29:20 -08:00
Alex Zinenko	40ef46fba4	Harden the requirements to memory attribution types in gpu.func When memory attributions are present in `gpu.func`, require that they are of memref type and live in memoryspaces 3 and 5 for workgroup and private memory attributions, respectively. Adapt the conversion from the GPU dialect to the NVVM dialect to drop the private memory space from attributions as NVVM is able to model them as local `llvm.alloca`s in the default memory space. PiperOrigin-RevId: 286161763	2019-12-18 03:38:55 -08:00
River Riddle	74278dd01e	NFC: Use TypeSwitch to simplify existing code. PiperOrigin-RevId: 286066371	2019-12-17 14:57:41 -08:00
Alex Zinenko	42b3fe8335	Make it possible to override the lowering of MemRef to the LLVM dialect. NFC. The lowering of MemRef types to the LLVM dialect is connected to the underlying runtime representation of structured memory buffers. It has changed several times in the past and reached the current state of a LLVM structured-typed descriptor containing two pointers and all sizes. In several reported use cases, a different, often simpler, lowering scheme is required. For example, lowering statically-shaped memrefs to bare LLVM pointers to simplify aliasing annotation. Split the pattern population functions into those include memref-related operations and the remaining ones. Users are expected to extend TypeConverter::convertType to handle the memref types differently. PiperOrigin-RevId: 286030610	2019-12-17 12:10:04 -08:00
Alex Zinenko	0bdc72d2df	StdToLLVM conversion: drop getMemRefElementType utility function This function has become redundant with MemRefDescriptor::getElementType and is no longer necessary. Use the MemRefDescriptor pervasively to concentrate descriptor-related logic in one place and drop the utility function. PiperOrigin-RevId: 286024168	2019-12-17 11:43:59 -08:00
Mahesh Ravishankar	80ec474a65	Add atomic operations to SPIR-V dialect. Some changes to the dialect generation script to allow specification of different base class to derive from in ODS. PiperOrigin-RevId: 285859230	2019-12-16 15:05:51 -08:00
Alex Zinenko	6273fa0c6a	Plug gpu.func into the GPU lowering pipelines This updates the lowering pipelines from the GPU dialect to lower-level dialects (NVVM, SPIRV) to use the recently introduced gpu.func operation instead of a standard function annotated with an attribute. In particular, the kernel outlining is updated to produce gpu.func instead of std.func and the individual conversions are updated to consume gpu.funcs and disallow standard funcs after legalization, if necessary. The attribute "gpu.kernel" is preserved in the generic syntax, but can also be used with the custom syntax on gpu.funcs. The special kind of function for GPU allows one to use additional features such as memory attribution. PiperOrigin-RevId: 285822272	2019-12-16 12:12:48 -08:00
Alex Zinenko	ed749b7689	Make "LowerToCFG" an operation pass The conversion from the Loops dialect to the Standard dialect, also known as loop-to-cfg lowering, has been historically a function pass. It can be required on non-Standard function Ops, in particular the recently introduced GPU functions. Make the conversion an operation pass instead of a function pass. PiperOrigin-RevId: 285814560	2019-12-16 11:36:02 -08:00
Aart Bik	cd5dab8ad7	[VectorOps] Add [insert/extract]element definition together with lowering to LLVM Similar to insert/extract vector instructions but (1) work on 1-D vectors only (2) allow for a dynamic index %c3 = constant 3 : index %0 = vector.insertelement %arg0, %arg1[%c : index] : vector<4xf32> %1 = vector.extractelement %arg0[%c3 : index] : vector<4xf32> PiperOrigin-RevId: 285792205	2019-12-16 09:52:46 -08:00
Alex Zinenko	0684aa9a8b	Make memref promotion during std->LLVM lowering the default calling convention During the conversion from the standard dialect to the LLVM dialect, memref-typed arguments are promoted from registers to memory and passed into functions by pointer. This had been introduced into the lowering to work around the abesnce of calling convention modeling in MLIR to enable better interoperability with LLVM IR generated from C, and has been exerciced for several months. Make this promotion the default calling covention when converting to the LLVM dialect. This adds the documentation, simplifies the code and makes the conversion consistent across function operations and function types used in other places, e.g. in high-order functions or attributes, which would not follow the same rule previously. PiperOrigin-RevId: 285751280	2019-12-16 05:17:14 -08:00
Nicolas Vasilache	7923abd357	Add a layer of EDSC for linalg.GenericOp This will be evolved into a simple programming model for custom ops and custom layers in followup CLs. This CL also deletes the obsolete tablegen's reference-impl.td that was using EDSCs. PiperOrigin-RevId: 285459545	2019-12-13 16:57:57 -08:00
Christian Sigg	8846557672	Fix maskAndClamp in gpu.all_reduce. The clamp value determines the returned predicate. Previously, the clamp value was fixed to 31 and the predicate was therefore always true. This is incorrect for partial warp reductions, but went unnoticed because the returned values happened to be zero (but it could be anything). PiperOrigin-RevId: 285343160	2019-12-13 15:28:58 -08:00
Aart Bik	1c81adf362	[VectorOps] Add lowering of vector.shuffle to LLVM IR For example, a shuffle %1 = vector.shuffle %arg0, %arg1 [0 : i32, 1 : i32] : vector<2xf32>, vector<2xf32> becomes a direct LLVM shuffle 0 = llvm.shufflevector %arg0, %arg1 [0 : i32, 1 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> but %1 = vector.shuffle %a, %b[1 : i32, 0 : i32, 2: i32] : vector<1x4xf32>, vector<2x4xf32> becomes the more elaborate (note the index permutation that drives argument selection for the extract operations) %0 = llvm.mlir.undef : !llvm<"[3 x <4 x float>]"> %1 = llvm.extractvalue %arg1[0] : !llvm<"[2 x <4 x float>]"> %2 = llvm.insertvalue %1, %0[0] : !llvm<"[3 x <4 x float>]"> %3 = llvm.extractvalue %arg0[0] : !llvm<"[1 x <4 x float>]"> %4 = llvm.insertvalue %3, %2[1] : !llvm<"[3 x <4 x float>]"> %5 = llvm.extractvalue %arg1[1] : !llvm<"[2 x <4 x float>]"> %6 = llvm.insertvalue %5, %4[2] : !llvm<"[3 x <4 x float>]"> PiperOrigin-RevId: 285268164	2019-12-12 14:11:56 -08:00
Nicolas Vasilache	782ae29678	Retire !linalg.buffer type - NFC This type is not used anymore now that Linalg view and subview have graduated to std and that alignment is supported on alloc. PiperOrigin-RevId: 285213424	2019-12-12 10:03:57 -08:00
Ehsan Toosi	f7bffad5a7	Added lowering of `std.tanh` to llvm function call to `tanh` and `tanhf`. Closes tensorflow/mlir#312 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/312 from dfki-ehna:tanh 9e89b072ff91ff390ad739501745114feb3ac856 PiperOrigin-RevId: 285205674	2019-12-12 09:25:15 -08:00
Christian Sigg	9b85582682	Automated rollback of commit `f68ac464d8` PiperOrigin-RevId: 285162061	2019-12-12 03:48:38 -08:00
Christian Sigg	f68ac464d8	Switch from shfl.bfly to shfl.down. Both work for the current use case, but the latter allows implementing prefix sums and is a little easier to understand for partial warps. PiperOrigin-RevId: 285145287	2019-12-12 01:28:01 -08:00
Nicolas Vasilache	9dfa84a269	Add std.log* and llvm.intr.log* that correspond to the LLVMIR intrinsics PiperOrigin-RevId: 285073483	2019-12-11 15:25:34 -08:00
Mahesh Ravishankar	652fc261d7	Expose a convenience function to add interface attributes to a function. PiperOrigin-RevId: 285036647	2019-12-11 12:21:42 -08:00
Denis Khalikov	d968f9696d	[spirv] Add lowering for std.fdiv, std.frem, std.fsub Closes tensorflow/mlir#313 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/313 from denis0x0D:sandbox/lowering_std_farith 41715070a74d13bfa9401957478978c1bb8006c0 PiperOrigin-RevId: 285023586	2019-12-11 11:17:35 -08:00
Christian Sigg	c5fb4c1303	NFC: Fix naming inconsistency: FuncOpLowering -> GPUFuncOpLowering. Remove nested anonymous namespace. PiperOrigin-RevId: 284987357	2019-12-11 08:24:58 -08:00
Stephan Herhut	b96f86daaf	Add a function to get lowering patterns from GPU to NVVM. This enables combining the patterns with other patterns into larger lowerings. PiperOrigin-RevId: 284979271	2019-12-11 07:14:33 -08:00
Aart Bik	9826fe5c9f	[VectorOps] Add lowering of vector.insert to LLVM IR For example, an insert %0 = vector.insert %arg0, %arg1[3 : i32] : f32 into vector<4xf32> becomes %0 = llvm.mlir.constant(3 : i32) : !llvm.i32 %1 = llvm.insertelement %arg0, %arg1[%0 : !llvm.i32] : !llvm<"<4 x float>"> A more elaborate example, inserting an element in a higher dimension vector %0 = vector.insert %arg0, %arg1[3 : i32, 7 : i32, 15 : i32] : f32 into vector<4x8x16xf32> becomes %0 = llvm.extractvalue %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> %1 = llvm.mlir.constant(15 : i32) : !llvm.i32 %2 = llvm.insertelement %arg0, %0[%1 : !llvm.i32] : !llvm<"<16 x float>"> %3 = llvm.insertvalue %2, %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> PiperOrigin-RevId: 284882443	2019-12-10 17:12:49 -08:00
Alex Zinenko	ac4873322f	Drop Markdown style annotations These come from a non-standard extenion that is not available on Github, so it only clutters the documentation source with {.mlir} or {.ebnf} tags. PiperOrigin-RevId: 284733003	2019-12-10 03:00:57 -08:00
Mahesh Ravishankar	4a62019eb8	Add lowering for module with gpu.kernel_module attribute. The existing GPU to SPIR-V lowering created a spv.module for every function with gpu.kernel attribute. A better approach is to lower the module that the function lives in (which has the attribute gpu.kernel_module) to a spv.module operation. This better captures the host-device separation modeled by GPU dialect and simplifies the lowering as well. PiperOrigin-RevId: 284574688	2019-12-09 09:52:21 -08:00
Kazuaki Ishizaki	ae05cf27c6	Minor spelling tweaks Closes tensorflow/mlir#304 PiperOrigin-RevId: 284568358	2019-12-09 09:23:48 -08:00
Uday Bondhugula	a63f6e0bf9	Replace spurious SmallVector constructions with ValueRange Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#305 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/305 from bondhugula:value_range 21d1fae73f549e3c8e72b60876eff1b864cea39c PiperOrigin-RevId: 284541027	2019-12-09 06:26:33 -08:00
River Riddle	d6ee6a0310	Update the builder API to take ValueRange instead of ArrayRef<Value *> This allows for users to provide operand_range and result_range in builder.create<> calls, instead of requiring an explicit copy into a separate data structure like SmallVector/std::vector. PiperOrigin-RevId: 284360710	2019-12-07 10:35:41 -08:00
River Riddle	9d1a0c72b4	Add a new ValueRange class. This class represents a generic abstraction over the different ways to represent a range of Values: ArrayRef<Value >, operand_range, result_range. This class will allow for removing the many instances of explicit SmallVector<Value , N> construction. It has the same memory cost as ArrayRef, and only suffers cost from indexing(if+elsing the different underlying representations). This change only updates a few of the existing usages, with more to be changed in followups; e.g. 'build' API. PiperOrigin-RevId: 284307996	2019-12-06 20:07:23 -08:00
Mahesh Ravishankar	6500b7e0c0	NFC: Separate implementation and definition in ConvertStandardToSPIRV.cpp PiperOrigin-RevId: 284274326	2019-12-06 15:26:17 -08:00
Alex Zinenko	e96150eb46	Replace custom getBody method with an ODS-generated in gpu::LaunchOp PiperOrigin-RevId: 284262981	2019-12-06 14:29:25 -08:00
Aart Bik	d37f27251f	[VecOps] Rename vector.[insert\|extract]element to just vector.[insert\|extract] Since these operations lower to [insert\|extract][element\|value] at LLVM dialect level, neither element nor value would correctly reflect the meaning. PiperOrigin-RevId: 284240727	2019-12-06 12:39:25 -08:00
Alex Zinenko	be3ed14658	LLVM::GlobalOp: take address space as builder argument Accept the address space of the global as a builder argument when constructing an LLVM::GlobalOp instance. This decreases the reliance of LLVM::GlobalOp users on the internal name of the attribute used for this purpose. Update several uses of the address space in GPU to NVVM conversion. PiperOrigin-RevId: 284233254	2019-12-06 12:01:46 -08:00
Aart Bik	b36aaeafb1	[VectorOps] Add lowering of vector.broadcast to LLVM IR For example, a scalar broadcast %0 = vector.broadcast %x : f32 to vector<2xf32> return %0 : vector<2xf32> which expands scalar x into vector [x,x] by lowering to the following LLVM IR dialect to implement the duplication over the leading dimension. %0 = llvm.mlir.undef : !llvm<"<2 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.insertelement %x, %0[%1 : !llvm.i64] : !llvm<"<2 x float>"> %3 = llvm.shufflevector %2, %0 [0 : i32, 0 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> return %3 : vector<2xf32> In the trailing dimensions, the operand is simply "passed through", unless a more elaborate "stretch" is required. For example %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> return %0 : vector<4xf32> becomes %0 = llvm.mlir.undef : !llvm<"<4 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.extractelement %arg0[%1 : !llvm.i64] : !llvm<"<1 x float>"> %3 = llvm.mlir.constant(0 : index) : !llvm.i64 %4 = llvm.insertelement %2, %0[%3 : !llvm.i64] : !llvm<"<4 x float>"> %5 = llvm.shufflevector %4, %0 [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>"> llvm.return %5 : !llvm<"<4 x float>"> PiperOrigin-RevId: 284219926	2019-12-06 11:02:29 -08:00
Alex Zinenko	e216a72ab8	Add conversions of GPU func with memory attributions to LLVM/NVVM GPU functions use memory attributions, a combination of Op attributes and region arguments, to specify function-wide buffers placed in workgroup or private memory spaces. Introduce a lowering pattern for GPU functions to be converted to LLVM functions taking into account memory attributions. Workgroup attributions get transformed into module-level globals with unique names derived from function names. Private attributions get converted into llvm.allocas inside the function body. In both cases, we inject at the beginning of the function the IR that obtains the raw pointer to the data and populates a MemRef descriptor based on the MemRef type of buffer, making attributions compose with the rest of the MemRef lowering and transparent for use with std.load and std.store. While using raw pointers instead of descriptors might have been more efficient, it is better implemented as a canonicalization or a separate transformation so that non-attribution memrefs could also benefit from it. PiperOrigin-RevId: 284208396	2019-12-06 10:08:43 -08:00
Kazuaki Ishizaki	84a6182ddd	minor spelling tweaks Closes tensorflow/mlir#290 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/290 from kiszk:spelling_tweaks_201912 9d9afd16a723dd65754a04698b3976f150a6054a PiperOrigin-RevId: 284169681	2019-12-06 05:59:30 -08:00
nmostafa	daff60cd68	Add UnrankedMemRef Type Closes tensorflow/mlir#261 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/261 from nmostafa:nmostafa/unranked 96b6e918f6ed64496f7573b2db33c0b02658ca45 PiperOrigin-RevId: 284037040	2019-12-05 13:13:20 -08:00
Mahesh Ravishankar	4d61a79db4	Allow specification of the workgroup size for GPUToSPIRV lowering. SPIR-V/Vulkan spec requires the workgroups size to be specified with the spv.ExecutionMode operation. This was hard-wired to be set to a particular value. It is now changed to be configurable by clients of the pass or of the patterns that implement the lowering from GPU to SPIRV. PiperOrigin-RevId: 284017482	2019-12-05 11:31:57 -08:00
Nicolas Vasilache	b3f7cf80a7	Add a CL option to Standard to LLVM lowering to use alloca instead of malloc/free. In the future, a more configurable malloc and free interface should be used and exposed via extra parameters to the `createLowerToLLVMPass`. Until requirements are gathered, a simple CL flag allows generating code that runs successfully on hardware that cannot use the stdlib. PiperOrigin-RevId: 283833424	2019-12-04 14:16:00 -08:00
Nicolas Vasilache	5c0c51a997	Refactor dependencies to expose Vector transformations as patterns - NFC This CL refactors some of the MLIR vector dependencies to allow decoupling VectorOps, vector analysis, vector transformations and vector conversions from each other. This makes the system more modular and allows extracting VectorToVector into VectorTransforms that do not depend on vector conversions. This refactoring exhibited a bunch of cyclic library dependencies that have been cleaned up. PiperOrigin-RevId: 283660308	2019-12-03 17:52:10 -08:00
Mahesh Ravishankar	c5ba37b6ae	Add a pass to legalize operations before lowering to SPIR-V. Not all StandardOps can be lowered to SPIR-V. For example, subview op implementation requires use of pointer bitcasts which is not valid according to SPIR-V spec (or at least is ambiguous about it). Such ops need to be removed/transformed before lowering to SPIR-V. The SPIRVLegalizationPass is added a place where such legalizations can be added. Current implementation folds the subview ops with load/stores so that the lowering itself does not have to convert a subview op. PiperOrigin-RevId: 283642981	2019-12-03 16:06:17 -08:00
Mahesh Ravishankar	353fb2bd38	Convert MemRefType to a linearized array in SPIR-V lowering. The SPIR-V lowering used nested !spv.arrays to represented multi-dimensional arrays, with the hope that in-conjunction with the layout annotations, the shape and layout of memref can be represented directly. It is unclear though how portable this representation will end up being. It will rely on driver compilers implementing complex index computations faithfully. A more portable approach is to use linearized arrays to represent memrefs and explicitly instantiate all the index computation in SPIR-V. This gives added benefit that we can further optimize the generated code in MLIR before generating the SPIR-V binary. PiperOrigin-RevId: 283571167	2019-12-03 10:21:16 -08:00
Alex Zinenko	993e79e9bd	Fix ViewOp to have at most one offset operand As described in the documentation, ViewOp is expected to take an optional dynamic offset followed by a list of dynamic sizes. However, the ViewOp parser did not include a check for the offset being a single value and accepeted a list of values instead. Furthermore, several tests have been exercising the wrong syntax of a ViewOp, passing multiple values to the dyanmic stride list, which was not caught by the parser. The trailing values could have been erronously interpreted as dynamic sizes. This is likely due to resyntaxing of the ViewOp, with the previous syntax taking the list of sizes before the offset. Update the tests to use the syntax with the offset preceding the sizes. Worse, the conversion of ViewOp to the LLVM dialect assumed the wrong order of operands with offset in the trailing position, and erronously relied on the permissive parsing that interpreted trailing dynamic offset values as leading dynamic sizes. Fix the lowering to use the correct order of operands. PiperOrigin-RevId: 283532506	2019-12-03 06:23:04 -08:00
Stephan Herhut	2125c0e3a8	Extend conversion of SubViewOp to llvm to also support cases where size and stride are constant (i.e., there are no size and stride operands). We recently added canonicalization that rewrites constant size and stride operands to SubViewOp into static information in the type, so these patterns now occur during code generation. PiperOrigin-RevId: 283524688	2019-12-03 05:11:49 -08:00
Alex Zinenko	fdbb99cd62	Add linkage support to LLVMFuncOp A recent commit introduced the Linkage attribute to the LLVM dialect and used it in the Global Op. Also use it in LLVMFuncOp. As per LLVM Language Reference, if the linkage attribute is omitted, the function is assumed to have external linkage. PiperOrigin-RevId: 283493299	2019-12-03 00:26:44 -08:00
Lei Zhang	0d22a3fdc8	NFC: Update std.subview op to use AttrSizedOperandSegments This turns a few manually written helper methods into auto-generated ones. PiperOrigin-RevId: 283339617	2019-12-02 07:52:00 -08:00
Alexander Belyaev	9630fcbc52	Lower linalg.indexed_generic with libcall to LLVM. PiperOrigin-RevId: 283328994	2019-12-02 06:30:52 -08:00
Alex Zinenko	d5e627f84b	Introduce Linkage attribute to the LLVM dialect LLVM IR supports linkage on global objects such as global variables and functions. Introduce the Linkage attribute into the LLVM dialect, backed by an integer storage. Use this attribute on LLVM::GlobalOp and make it mandatory. Implement parsing/printing of the attribute and conversion to LLVM IR. See tensorflow/mlir#277. PiperOrigin-RevId: 283309328	2019-12-02 03:28:10 -08:00
Lei Zhang	a4d7650230	[spirv] NFC: Add getZero() and getOne() static method to ConstantOp Getting constant zero or one is very common so it merits a special handy method on spirv::ConstantOp itself. PiperOrigin-RevId: 282832572	2019-11-27 14:13:01 -08:00
Mahesh Ravishankar	03620fa70a	Misc changes to lowering to SPIR-V. These changes to SPIR-V lowering while adding support for lowering SUbViewOp, but are not directly related. - Change the lowering of MemRefType to !spv.ptr<!spv.struct<!spv.array<...>[offset]>, ..> This is consistent with the Vulkan spec. - To enable testing a simple pattern of lowering functions is added to ConvertStandardToSPIRVPass. This is just used to convert the type of the arguments of the function. The added function lowering itself is not meant to be the way functions are eventually lowered into SPIR-V dialect. PiperOrigin-RevId: 282589644	2019-11-26 10:11:34 -08:00
Nicolas Vasilache	174076a157	Use vector.InsertStridedSlice in Vector -> Vector unrolling This CL uses the recently added op to finish the implementation of Vector -> Vector unrolling by replacing the "fake join op" by a series of InsertStridedSliceOp. Test is updated accordingly PiperOrigin-RevId: 282451126	2019-11-25 15:56:37 -08:00
Mahesh Ravishankar	f87b2fd41b	NFC: Actually expose the implementation of createGPUToSPIRVLoweringPass. A mismatch in the function declaration and function definition, prevented the implementation of the createGPUToSPIRVLoweringPass from being exposed. PiperOrigin-RevId: 282419815	2019-11-25 13:19:53 -08:00
Mahesh Ravishankar	bd485afda0	Introduce attributes that specify the final ABI for a spirv::ModuleOp. To simplify the lowering into SPIR-V, while still respecting the ABI requirements of SPIR-V/Vulkan, split the process into two 1) While lowering a function to SPIR-V (when the function is an entry point function), allow specifying attributes on arguments and function itself that describe the ABI of the function. 2) Add a pass that materializes the ABI described in the function. Two attributes are needed. 1) Attribute on arguments of the entry point function that describe the descriptor_set, binding, storage class, etc, of the spv.globalVariable this argument will be replaced by 2) Attribute on function that specifies workgroup size, etc. (for now only workgroup size). Add the pass -spirv-lower-abi-attrs to materialize the ABI described by the attributes. This change makes the SPIRVBasicTypeConverter class unnecessary and is removed, further simplifying the SPIR-V lowering path. PiperOrigin-RevId: 282387587	2019-11-25 11:19:56 -08:00
River Riddle	b8ee563449	NFC: Remove unnecessarily guarded tablegen includes. Support for including a file multiple times was added in tablegen, removing the need for these extra guards. This is because we already insert c/c++ style header guards within each of the specific .td files. PiperOrigin-RevId: 282076728	2019-11-22 18:01:57 -08:00
Jean-Michel Gorius	104777d8e6	Unify vector op names with other dialects. Change vector op names from VectorFooOp to Vector_FooOp and from vector::VectorFooOp to vector::FooOp. Closes tensorflow/mlir#257 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/257 from Kayjukh:master dfc3a0e04114885aaec8740d5951d6984d6e1577 PiperOrigin-RevId: 281967461	2019-11-22 08:24:49 -08:00
Nicolas Vasilache	6755543af5	Move Linalg Transforms that are actually Conversions - NFC PiperOrigin-RevId: 281844602	2019-11-21 15:41:32 -08:00
Mahesh Ravishankar	19212105dd	Changes to SubViewOp to make it more amenable to canonicalization. The current SubViewOp specification allows for either all offsets, shape and stride to be dynamic or all of them to be static. There are opportunities for more fine-grained canonicalization based on which of these are static. For example, if the sizes are static, the result memref is of static shape. The specification of SubViewOp is modified to allow on or more of offsets, shapes and strides to be statically specified. The verification is updated to ensure that the result type of the subview op is consistent with which of these are static and which are dynamic. PiperOrigin-RevId: 281560457	2019-11-20 12:32:51 -08:00
Nicolas Vasilache	fa14d4f6ab	Implement unrolling of vector ops to finer-grained vector ops as a pattern. This CL uses the pattern rewrite infrastructure to implement a simple VectorOps -> VectorOps legalization strategy to unroll coarse-grained vector operations into finer grained ones. The transformation is written using local pattern rewrites to allow composition with other rewrites. It proceeds by iteratively introducing fake cast ops and cleaning canonicalizing or lowering them away where appropriate. This is an example of writing transformations as compositions of local pattern rewrites that should enable us to make them significantly more declarative. PiperOrigin-RevId: 281555100	2019-11-20 11:49:36 -08:00
Christian Sigg	f868adafee	Make type and rank explicit in mcuMemHostRegister function. Fix registered size of indirect MemRefType kernel arguments. PiperOrigin-RevId: 281362940	2019-11-19 13:13:02 -08:00
Alex Zinenko	8961d8e32f	Change conversion CLI flag from -lower-to-llvm to -convert-std-to-llvm The command-line flag name `lower-to-llvm` for the pass performing dialect conversion from the Standard dialect to the LLVM dialect is misleading and inconsistent with most of the conversion passses. It leads the user to believe that there are no restrictions on what can be converted, while in fact only a subset of the Standard dialect can be converted (with operations from other dialects converted by separate passes). Use `convert-std-to-llvm` that better reflects what the pass does and is consistent with most other conversions. PiperOrigin-RevId: 281238797	2019-11-19 00:34:51 -08:00
Alex Zinenko	062dd406b1	ConvertStandardToLLVM: replace assertion with graceful failure The assertion was introduced in the early days of dialect conversion infrastructure when we had the matching function separate from the rewriting function. The infrastructure evolved to have a common matchAndRewrite function and the separate matching function was dropped without chaning the rewriting that became matchAndRewrite. This has led to assertion being triggered. Return a matchFailure instead of failing an assertion on unsupported types. Closes tensorflow/mlir#230 PiperOrigin-RevId: 281113741	2019-11-18 11:26:24 -08:00
Nicolas Vasilache	9732bb533c	Standardize all VectorOps class names to be prefixed by Vector - NFC This improves consistency and will concretely avoid collisions between VectorExtractElementOp and ExtractElementOp when they are included in the same transforms / rewrites. PiperOrigin-RevId: 281101588	2019-11-18 10:39:07 -08:00
Alex Zinenko	b8dc3fd812	Rename CLI flags -lower-gpu-ops-to--ops to -convert-gpu-to- This makes the flags consistent with the naming scheme used elsewhere in the codebase for dialect conversions. PiperOrigin-RevId: 281027517	2019-11-18 02:43:10 -08:00
Lei Zhang	a0986bf43d	NFC: Convert CmpIPredicate in StandardOps to use EnumAttr This turns several hand-written functions to auto-generated ones. PiperOrigin-RevId: 280684326	2019-11-15 10:17:31 -08:00
Nicolas Vasilache	0b271b7dfe	Refactor the LowerVectorTransfers pass to use the RewritePattern infra - NFC This is step 1/n in refactoring infrastructure along the Vector dialect to make it ready for retargetability and composable progressive lowering. PiperOrigin-RevId: 280529784	2019-11-14 15:40:07 -08:00
Mahesh Ravishankar	a78bd84cf8	NFC: Refactor Dialect Conversion targeting SPIR-V. Refactoring the conversion from StandardOps/GPU dialect to SPIR-V dialect: 1) Move the SPIRVTypeConversion and SPIRVOpLowering class into SPIR-V dialect. 2) Add header files that expose functions to add patterns for the dialects to SPIR-V lowering, as well as a pass that does the dialect to SPIR-V lowering. 3) Make SPIRVOpLowering derive from OpLowering class. PiperOrigin-RevId: 280486871	2019-11-14 12:34:54 -08:00
Alex Zinenko	e0a0ac4b00	Add CMakeLists.txt for AffineToStandard conversion PiperOrigin-RevId: 280470142	2019-11-14 11:28:29 -08:00
Alex Zinenko	971b8dd4d8	Move Affine to Standard conversion to lib/Conversion This is essentially a dialect conversion and conceptually belongs to conversions. PiperOrigin-RevId: 280460034	2019-11-14 10:35:21 -08:00
Alex Zinenko	b34a861d5a	Make positions of elements in MemRef descriptor private Previous commits removed all uses of LLVMTypeConverter::k*PosInMemRefDescriptor outside of the MemRefDescriptor class. These numbers are an implementation detail and can be hidden under a layer of more semantic APIs. PiperOrigin-RevId: 280442444	2019-11-14 09:17:38 -08:00
Alex Zinenko	bf5916e7a4	Use MemRefDescriptor in Vector-to-LLVM convresion Following up on the consolidation of MemRef descriptor conversion, update Vector-to-LLVM conversion to use the helper class that abstracts away the implementation details of the MemRef descriptor. This also makes the types of the attributes in emitted llvm.insert/extractelement operations consistently i64 instead of a mix of index and i64. PiperOrigin-RevId: 280441451	2019-11-14 09:05:42 -08:00
MLIR Team	62d5b1de45	Adapt code to LLVM API updates. PiperOrigin-RevId: 280431812	2019-11-14 08:27:19 -08:00
Nicolas Vasilache	f2b6ae9991	Move VectorOps to Tablegen - (almost) NFC This CL moves VectorOps to Tablegen and cleans up the implementation. This is almost NFC but 2 changes occur: 1. an interface change occurs in the padding value specification in vector_transfer_read: the value becomes non-optional. As a shortcut we currently use %f0 for all paddings. This should become an OpInterface for vectorization in the future. 2. the return type of vector.type_cast is trivial and simplified to `memref<vector<...>>` Relevant roundtrip and invalid tests that used to sit in core are moved to the vector dialect. The op documentation is moved to the .td file. PiperOrigin-RevId: 280430869	2019-11-14 08:15:23 -08:00
Alex Zinenko	7c28de4aef	Use MemRefDescriptor in Linalg-to-LLVM conversion Following up on the consolidation of MemRef descriptor conversion, update Linalg-to-LLVM conversion to use the helper class that abstracts away the implementation details of the MemRef descriptor. This required MemRefDescriptor to become publicly visible. Since this conversion is heavily EDSC-based, introduce locally an additional wrapper that uses builder and location pointed to by the EDSC context while emitting descriptor manipulation operations. PiperOrigin-RevId: 280429228	2019-11-14 08:04:10 -08:00
Alex Zinenko	ee5c2256ef	Concentrate memref descriptor manipulation logic in one place Memref descriptor is becoming increasingly complex. Memrefs are manipulated by multiple standard instructions, each of which has a non-trivial lowering to the LLVM dialect. This leads to verbose code that manipulates the descriptors exposing the internals of insert/extractelement opreations. Implement a wrapper class that contains a memref descriptor and provides semantically named methods that build the primitive IR operations instead. PiperOrigin-RevId: 280371225	2019-11-14 00:49:12 -08:00
River Riddle	d985c74883	NFC: Refactor block signature conversion to not erase the original arguments. This refactors the implementation of block signature(type) conversion to not insert fake cast operations to perform the type conversion, but to instead create a new block containing the proper signature. This has the benefit of enabling the use of pre-computed analyses that rely on mapping values. It also leads to a much cleaner implementation overall. The major user facing change is that applySignatureConversion will now replace the entry block of the region, meaning that blocks generally shouldn't be cached over calls to applySignatureConversion. PiperOrigin-RevId: 280226936	2019-11-13 10:27:53 -08:00
Mahesh Ravishankar	2be53603e9	Add operations needed to support lowering of AffineExpr to SPIR-V. Lowering of CmpIOp, DivISOp, RemISOp, SubIOp and SelectOp to SPIR-V dialect enables the lowering of operations generated by AffineExpr -> StandardOps conversion into the SPIR-V dialect. PiperOrigin-RevId: 280039204	2019-11-12 13:20:06 -08:00
Mahesh Ravishankar	9d985141ef	Make legality check in GPU->SPIR-V lowering of FuncOp kernel specific. Existing check that sets FuncOp to be dynamically legal was just checking that the types of the argument are SPIR-V compatible. Since the current conversion from GPU to SPIR-V does not handle lowering non-kernel functions, change the legality check to verify that the FuncOp has the gpu.kernel attribute and has void(void) return type. PiperOrigin-RevId: 280032782	2019-11-12 12:52:53 -08:00
Mahesh Ravishankar	104af84f4c	Add Conversion to lower loop::ForOp to spirv::LoopOp. loop::ForOp can be lowered to the structured control flow represented by spirv::LoopOp by making the continue block of the spirv::LoopOp the loop latch and the merge block the exit block. The resulting spirv::LoopOp has a single back edge from the continue to header block, and a single exit from header to merge. PiperOrigin-RevId: 280015614	2019-11-12 11:33:27 -08:00
Nicolas Vasilache	51de3f688e	Add LLVM lowering of std.subview A followup CL will replace usage of linalg.subview by std.subview. PiperOrigin-RevId: 279961981	2019-11-12 07:23:18 -08:00
Nicolas Vasilache	f51a155337	Add support for alignment attribute in std.alloc. This CL adds an extra pointer to the memref descriptor to allow specifying alignment. In a previous implementation, we used 2 types: `linalg.buffer` and `view` where the buffer type was the unit of allocation/deallocation/alignment and `view` was the unit of indexing. After multiple discussions it was decided to use a single type, which conflates both, so the memref descriptor now needs to carry both pointers. This is consistent with the [RFC-Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ). PiperOrigin-RevId: 279959463	2019-11-12 07:06:54 -08:00
Nicolas Vasilache	72040bf7c8	Update Linalg to use std.view Now that a view op has graduated to the std dialect, we can update Linalg to use it and remove ops that have become obsolete. As a byproduct, the linalg buffer and associated ops can also disappear. PiperOrigin-RevId: 279073591	2019-11-07 06:33:10 -08:00
Nicolas Vasilache	7f6c6084b5	Add lowering of std.view to LLVM This CL ports the lowering of linalg.view to the newly introduced std.view. Differences in implementation relate to std.view having slightly different semantics: 1. a static or dynamic offset can be specified. 2. the size of the (contiguous) shape is passed instead of a range. 3. static size and stride information is extracted from the memref type rather than the range. Besides these differences, lowering behaves the same. A future CL will update Linalg to use this unified infrastructure. PiperOrigin-RevId: 278948853	2019-11-06 15:06:16 -08:00

... 6 7 8 9 10 ...

891 Commits