llvm-project

Commit Graph

Author	SHA1	Message	Date
Aart Bik	54759cefdb	[mlir] [VectorOps] changes to printing support for integers (1) simplify integer printing logic by always using 64-bit print (2) add index support (since vector<16xindex> is planned to be added) (3) adjust naming convention print_x -> printX Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D88436	2020-09-28 11:43:31 -07:00
Aart Bik	b8880f5f97	[mlir] [VectorOps] generalize printing support for integers This generalizes printing beyond just i1,i32,i64 and also accounts for signed and unsigned interpretation in the output. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D88290	2020-09-25 04:52:21 -07:00
Artur Bialas	396e7f4548	[mlir][SCFToGPU] LaunchOp propagate optional attributes Allow propagating optional user defined attributes during SCF to GPU conversion. Gives opportunity to use user defined attributes in the further lowering. For example setting subgroup size, or other options for GPU dispatch. This does not break backward compatibility and does not require new attributes, just allow passing optional ones. Differential Revision: https://reviews.llvm.org/D88203	2020-09-25 09:21:16 +02:00
Sean Silva	9ed1e5873c	[mlir][shape] Start a pass that lowers shape constraints. This pass converts shape.cstr_* ops to eager (side-effecting) error-handling code. After that conversion is done, the witnesses are trivially satisfied and are replaced with `shape.const_witness true`. Differential Revision: https://reviews.llvm.org/D87941	2020-09-24 12:25:30 -07:00
Alexander Belyaev	56ffb8d169	[mlir] Stop allowing LLVMType Int arguments for GPULaunchFuncOp. Conversion to LLVM becomes confusing and incorrect if someone tries to lower STD -> LLVM and only then GPULaunchFuncOp to LLVM separately. Although it is technically allowed now, it works incorrectly because of the argument promotion. The correct way to use this conversion pattern is to add to the STD->LLVM patterns before running the pass. Differential Revision: https://reviews.llvm.org/D88147	2020-09-24 11:16:23 +02:00
Nicolas Vasilache	ed229132f1	[mlir][Linalg] Uniformize linalg.generic with named ops. This revision allows representing a reduction at the level of linalg on tensors for generic ops by uniformizing with the named ops approach.	2020-09-22 04:13:22 -04:00
Benjamin Kramer	2d76274b99	[mlir][VectorOps] Loosen restrictions on vector.reduction types LLVM can deal with any integer or float type, don't arbitrarily restrict it to f32/f64/i32/i64. Differential Revision: https://reviews.llvm.org/D88010	2020-09-21 12:45:23 +02:00
Hanhan Wang	1909b6ac0d	[mlir][StandardToSPIRV] Handle vector of i1 case for lowering zexti to SPIR-V. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D87887	2020-09-18 07:07:22 -07:00
Nicolas Vasilache	93fd30bac3	[mlir][Linalg] Evolve named ops to use assembly form and support linalg on tensors. This revision allows representing a reduction at the level of linalg on tensors for named ops. When a structured op has a reduction and returns tensor(s), new conventions are added and documented. As an illustration, the syntax for a `linalg.matmul` writing into a buffer is: ``` linalg.matmul ins(%a, %b : memref<?x?xf32>, tensor<?x?xf32>) outs(%c : memref<?x?xf32>) ``` , whereas the syntax for a `linalg.matmul` returning a new tensor is: ``` %d = linalg.matmul ins(%a, %b : tensor<?x?xf32>, memref<?x?xf32>) init(%c : memref<?x?xf32>) -> tensor<?x?xf32> ``` Other parts of linalg will be extended accordingly to allow mixed buffer/tensor semantics in the presence of reductions.	2020-09-18 06:14:30 -04:00
Jakub Lichman	347d59b16c	[mlir][Linalg] Convolution tiling added to ConvOp vectorization pass ConvOp vectorization supports now only convolutions of static shapes with dimensions of size either 3(vectorized) or 1(not) as underlying vectors have to be of static shape as well. In this commit we add support for convolutions of any size as well as dynamic shapes by leveraging existing matmul infrastructure for tiling of both input and kernel to sizes accepted by the previous version of ConvOp vectorization. In the future this pass can be extended to take "tiling mask" as a user input which will enable vectorization of user specified dimensions. Differential Revision: https://reviews.llvm.org/D87676	2020-09-17 09:39:41 +00:00
Alex Zinenko	967c7b6936	[mlir] check for failures when packing function sigunatures in std->llvm conversion When packing function results into a structure during the standard-to-llvm dialect conversion, do not assume the conversion was successful and propagate nullptr as error state. Fixes PR45184. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D87605	2020-09-15 12:30:44 +02:00
Lubomir Litchev	ef7a255c03	Add support for casting elements in vectors for certain Std dialect type conversion operations. Added support to the Std dialect cast operations to do casts in vector types when feasible. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D87410	2020-09-14 07:45:46 -07:00
Alex Zinenko	5cac85c931	[mlir] Check for type conversion success in std->llvm function conversion Type converter may fail and return nullptr on unconvertible types. The function conversion did not include a check and was attempting to use a nullptr type to construct an LLVM function, leading to a crash. Add a check and return early. The rest of the call stack propagates errors properly. Fixes PR47403. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D87075	2020-09-14 13:16:42 +02:00
Sean Silva	84a6da67e6	[mlir] Fix some edge cases around 0-element TensorFromElementsOp This introduces a builder for the more general case that supports zero elements (where the element type can't be inferred from the ValueRange, since it might be empty). Also, fix up some cases in ShapeToStandard lowering that hit this. It happens very easily when dealing with shapes of 0-D tensors. The SameOperandsAndResultElementType is redundant with the new TypesMatchWith and prevented having zero elements. Differential Revision: https://reviews.llvm.org/D87492	2020-09-11 10:58:35 -07:00
aartbik	3c42c0dcf6	[mlir] [VectorOps] Enable 32-bit index optimizations Rationale: After some discussion we decided that it is safe to assume 32-bit indices for all subscripting in the vector dialect (it is unlikely the dialect will be used; or even work; for such long vectors). So rather than detecting specific situations that can exploit 32-bit indices with higher parallel SIMD, we just optimize it by default, and let users that don't want it opt-out. Reviewed By: nicolasvasilache, bkramer Differential Revision: https://reviews.llvm.org/D87404	2020-09-10 00:26:27 -07:00
Frederik Gossen	5106a8b8f8	[MLIR][Shape] Lower `shape_of` to `dynamic_tensor_from_elements` Take advantage of the new `dynamic_tensor_from_elements` operation in `std`. Instead of stack-allocated memory, we can now lower directly to a single `std` operation. Differential Revision: https://reviews.llvm.org/D86935	2020-09-09 07:55:13 +00:00
Frederik Gossen	133322d2e3	[MLIR][Standard] Update `tensor_from_elements` assembly format Remove the redundant parenthesis that are used for none of the other operation formats. Differential Revision: https://reviews.llvm.org/D86287	2020-09-09 07:45:46 +00:00
Benjamin Kramer	239eff502b	[mlir][VectorOps] Redo the scalar loop emission in VectoToSCF to pad instead of clipping This replaces the select chain for edge-padding with an scf.if that performs the memory operation when the index is in bounds and uses the pad value when it's not. For transfer_write the same mechanism is used, skipping the store when the index is out of bounds. The integration test has a bunch of cases of how I believe this should work. Differential Revision: https://reviews.llvm.org/D87241	2020-09-08 11:15:25 +02:00
Jakub Lichman	67b37f571c	[mlir] Conv ops vectorization pass In this commit a new way of convolution ops lowering is introduced. The conv op vectorization pass lowers linalg convolution ops into vector contractions. This lowering is possible when conv op is first tiled by 1 along specific dimensions which transforms it into dot product between input and kernel subview memory buffers. This pass converts such conv op into vector contraction and does all necessary vector transfers that make it work. Differential Revision: https://reviews.llvm.org/D86619	2020-09-08 08:47:42 +00:00
Nicolas Vasilache	9be6178449	[mlir][Vector] Make VectorToSCF deterministic Differential Revision: https://reviews.llvm.org/D87273	2020-09-08 04:18:22 -04:00
Frederik Gossen	a70f2eb3e3	[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings. Merge the two lowering passes because they are not useful by themselves. The new pass lowers to `std` and `scf` is considered an auxiliary dialect. See also https://llvm.discourse.group/t/conversions-with-multiple-target-dialects/1541/12 Differential Revision: https://reviews.llvm.org/D86779	2020-09-07 14:39:37 +00:00
David Truby	973800dc7c	Revert "[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings." This reverts commit `15acdd7543`.	2020-09-07 13:37:32 +01:00
Nicolas Vasilache	1c849ec40a	[MLIR] Fix Win test due to partial order of CHECK directives Differential Revision: https://reviews.llvm.org/D87230	2020-09-07 08:14:35 -04:00
Frederik Gossen	15acdd7543	[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings. Merge the two lowering passes because they are not useful by themselves. The new pass lowers to `std` and `scf` is considered an auxiliary dialect. See also https://llvm.discourse.group/t/conversions-with-multiple-target-dialects/1541/12 Differential Revision: https://reviews.llvm.org/D86779	2020-09-07 12:12:36 +00:00
Nicolas Vasilache	8d64df9f13	[mlir][Vector] Revisit VectorToSCF. Vector to SCF conversion still had issues due to the interaction with the natural alignment derived by the LLVM data layout. One traditional workaround is to allocate aligned. However, this does not always work for vector sizes that are non-powers of 2. This revision implements a more portable mechanism where the intermediate allocation is always a memref of elemental vector type. AllocOp is extended to use the natural LLVM DataLayout alignment for non-scalar types, when the alignment is not specified in the first place. An integration test is added that exercises the transfer to scf.for + scalar lowering with a 5x5 transposition. Differential Revision: https://reviews.llvm.org/D87150	2020-09-07 05:19:43 -04:00
aartbik	060c9dd1cc	[mlir] [VectorOps] Improve SIMD compares with narrower indices When allowed, use 32-bit indices rather than 64-bit indices in the SIMD computation of masks. This runs up to 2x and 4x faster on a number of AVX2 and AVX512 microbenchmarks. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D87116	2020-09-03 21:43:38 -07:00
Lei Zhang	8d420fb3a0	[spirv][nfc] Simplify resource limit with default values These deafult values are gotten from Vulkan required limits. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D87090	2020-09-03 13:29:26 -04:00
Benjamin Kramer	dfb7b3fe02	[mlir][VectorOps] Fall back to a loop when accessing a vector from a strided memref The scalar loop is slow but correct. Differential Revision: https://reviews.llvm.org/D87082	2020-09-03 16:05:38 +02:00
Jakub Lichman	f5ed22f09d	[mlir][VectorToSCF] 128 byte alignment of alloc ops Added 128 byte alignment to alloc ops created in VectorToSCF pass. 128b alignment was already introduced to this pass but not to all alloc ops. This commit changes that by adding 128b alignment to the remaining ops. The point of specifying alignment is to prevent possible memory alignment errors on weakly tested architectures. Differential Revision: https://reviews.llvm.org/D86454	2020-09-02 12:37:35 +00:00
Kiran Chandramohan	875074c8a9	[OpenMP][MLIR] Conversion pattern for OpenMP to LLVM Adding a conversion pattern for the parallel Operation. This will help the conversion of parallel operation with standard dialect to parallel operation with llvm dialect. The type conversion of the block arguments in a parallel region are controlled by the pattern for the parallel Operation. Without this pattern, a parallel Operation with block arguments cannot be converted from standard to LLVM dialect. Other OpenMP operations without regions are marked as legal. When translation of OpenMP operations with regions are added then patterns for these operations can also be added. Also uses all the standard to llvm patterns. Patterns of other dialects can be added later if needed. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D86273	2020-08-27 19:32:15 +01:00
George Mitenkov	d48b84eb8a	[MLIR][GPUToSPIRV] Passing gpu module name to SPIR-V module This patch allows to pass the gpu module name to SPIR-V module during conversion. This has many benefits as we can lookup converted to SPIR-V kernel in the symbol table. In order to avoid symbol conflicts, `"__spv__"` is added to the gpu module name to form the new one. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86384	2020-08-27 09:19:24 +03:00
George Mitenkov	e850558cdc	[MLIR][SPIRVToLLVM] Added a hook for descriptor set / binding encoding This patch introduces a hook to encode descriptor set and binding number into `spv.globalVariable`'s symbolic name. This allows to preserve this information, and at the same time legalize the global variable for the conversion to LLVM dialect. This is required for `mlir-spirv-cpu-runner` to convert kernel arguments into LLVM. Also, a couple of some nits added: - removed unused comment - changed to a capital letter in the comment Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86515	2020-08-27 08:27:42 +03:00
Kazuaki Ishizaki	40cbb2484d	[mlir] NFC: fix typo in FileCheck prefix CHECL -> CHECK Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D86550	2020-08-26 03:12:14 +09:00
Thomas Raoux	36ee9a322a	[mlir][GPUToVulkan] Fix signature of bindMemRef function for f16 Binding MemRefs of f16 needs special handling as the type is not supported on CPU. There was a bug in the type used. Differential Revision: https://reviews.llvm.org/D86328	2020-08-21 10:48:00 -07:00
George Mitenkov	dc693a036d	[MLIR][SPIRVToLLVM] Removed std to llvm patterns from the conversion Removed the Standard to LLVM conversion patterns that were previously pulled in for testing purposes. This helps to separate the conversion to LLVM dialect of the MLIR module with both SPIR-V and Standard dialects in it (particularly helpful for SPIR-V cpu runner). Also, tests were changed accordingly. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86285	2020-08-21 00:26:33 +03:00
Mars Saxman	d34df52377	Implement FPToUI and UIToFP ops in standard dialect Add the unsigned complements to the existing FPToSI and SIToFP operations in the standard dialect, with one-to-one lowerings to the corresponding LLVM operations. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85557	2020-08-19 22:49:09 +02:00
Jakub Lichman	8dace28f92	[mlir][VectorToSCF] Bug in TransferRead lowering fixed If Memref has rank > 1 this pass emits N-1 loops around TransferRead op and transforms the op itself to 1D read. Since vectors must have static shape while memrefs don't the pass emits if condition to prevent out of bounds accesses in case some memref dimension is smaller than the corresponding dimension of targeted vector. This logic is fine but authors forgot to apply `permutation_map` on loops upper bounds and thus if condition compares induction variable to incorrect loop upper bound (dimension of the memref) in case `permutation_map` is not identity map. This commit aims to fix that.	2020-08-19 15:34:34 +00:00
Rob Suderman	5556575230	Added std.floor operation to match std.ceil There should be an equivalent std.floor op to std.ceil. This includes matching lowerings for SPIRV, NVVM, ROCDL, and LLVM. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85940	2020-08-18 10:25:32 -07:00
George Mitenkov	cc98a0fbe4	[MLIR][SPIRVToLLVM] Additional conversions for spirv-runner This patch adds more op/type conversion support necessary for `spirv-runner`: - EntryPoint/ExecutionMode: currently removed since we assume having only one kernel function in the kernel module. - StorageBuffer storage class is now supported. We are not concerned with multithreading so this is fine for now. - Type conversion enhanced, now regular offsets and strides for structs and arrays are supported (based on `VulkanLayoutUtils`). - Support of `spc.AccessChain` that is modelled with GEP op in LLVM dialect. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86109	2020-08-18 19:09:59 +03:00
Alex Zinenko	9c4825ce28	[mlir] do not use llvm.cmpxchg with floats According to the LLVM Language Reference, 'cmpxchg' accepts integer or pointer types. Several MLIR tests were using it with floats as it appears possible to programmatically construct and print such an instruction, but it cannot be parsed back. Use integers instead. Depends On D85899 Reviewed By: flaub, rriddle Differential Revision: https://reviews.llvm.org/D85900	2020-08-17 15:44:23 +02:00
Alex Zinenko	168213f91c	[mlir] Move data layout from LLVMDialect to module Op attributes Legacy implementation of the LLVM dialect in MLIR contained an instance of llvm::Module as it was required to parse LLVM IR types. The access to the data layout of this module was exposed to the users for convenience, but in practice this layout has always been the default one obtained by parsing an empty layout description string. Current implementation of the dialect no longer relies on wrapping LLVM IR types, but it kept an instance of DataLayout for compatibility. This effectively forces a single data layout to be used across all modules in a given MLIR context, which is not desirable. Remove DataLayout from the LLVM dialect and attach it as a module attribute instead. Since MLIR does not yet have support for data layouts, use the LLVM DataLayout in string form with verification inside MLIR. Introduce the layout when converting a module to the LLVM dialect and keep the default "" description for compatibility. This approach should be replaced with a proper MLIR-based data layout when it becomes available, but provides an immediate solution to compiling modules with different layouts, e.g. for GPUs. This removes the need for LLVMDialectImpl, which is also removed. Depends On D85650 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D85652	2020-08-17 15:12:36 +02:00
Alex Zinenko	339eba0805	[mlir] do not emit bitcasts between structs in StandardToLLVM The convresion of memref cast operaitons from the Standard dialect to the LLVM dialect has been emitting bitcasts from a struct type to itself. Beyond being useless, such casts are invalid as bitcast does not operate on aggregate types. This kept working by accident because LLVM IR bitcast construction API skips the construction if types are equal before it verifies that the types are acceptable in a bitcast. Do not emit such bitcasts, the memref cast that only adds/erases size information is in fact a noop on the current descriptor as it always contains dynamic values for all sizes. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D85899	2020-08-14 11:33:10 +02:00
George Mitenkov	2ad7e1a301	[MLIR][SPIRVToLLVM] Conversion for global and addressof Inital conversion of `spv._address_of` and `spv.globalVariable`. In SPIR-V, the global returns a pointer, whereas in LLVM dialect the global holds an actual value. This difference is handled by `spv._address_of` and `llvm.mlir.addressof`ops that both return a pointer. Moreover, only current invocation is in conversion's scope. Reviewed By: antiagainst, mravishankar Differential Revision: https://reviews.llvm.org/D84626	2020-08-12 09:41:14 +03:00
Christian Sigg	0d4b7adb82	[MLIR] Make gpu.launch_func rewrite pattern part of the LLVM lowering pass. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D85073	2020-08-10 19:28:30 +02:00
Thomas Raoux	68330ee0a9	[mlir][vector] Relax transfer_read/transfer_write restriction on memref operand Relax the verifier for transfer_read/transfer_write operation so that it can take a memref with a different element type than the vector being read/written. This is based on the discourse discussion: https://llvm.discourse.group/t/memref-cast/1514 Differential Revision: https://reviews.llvm.org/D85244	2020-08-10 08:57:48 -07:00
Konrad Dobros	9414a71aaa	[mlir][spirv] Add correct handling of Kernel and Addresses capabilities This change adds initial support needed to generate OpenCL compliant SPIRV. If Kernel capability is declared then memory model becomes OpenCL. If Addresses capability is declared then addressing model becomes Physical64. Additionally for Kernel capability interface variable ABI attributes are not generated as entry point function is expected to have normal arguments. Differential Revision: https://reviews.llvm.org/D85196	2020-08-07 12:29:21 -07:00
aartbik	c3c95b9c80	[mlir] [VectorOps] Improve lowering of extract_strided_slice (and friends like shape_cast) Using a shuffle for the last recursive step in progressive lowering not only results in much more compact IR, but also more efficient code (since the backend is no longer confused on subvector aliasing for longer vectors). E.g. the following %f = vector.shape_cast %v0: vector<1024xf32> to vector<32x32xf32> yields much better x86-64 code that runs 3x faster than the original. Reviewed By: bkramer, nicolasvasilache Differential Revision: https://reviews.llvm.org/D85482	2020-08-07 09:21:05 -07:00
Alexander Belyaev	3effc35015	[mlir] Lower DimOp to LLVM for unranked memrefs. Differential Revision: https://reviews.llvm.org/D85361	2020-08-06 11:46:11 +02:00
aartbik	39379916a7	[mlir] [VectorOps] Add masked load/store operations to Vector dialect The intrinsics were already supported and vector.transfer_read/write lowered direclty into these operations. By providing them as individual ops, however, clients can used them directly, and it opens up progressively lowering transfer operations at higher levels (rather than direct lowering to LLVM IR as done now). Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D85357	2020-08-05 16:45:24 -07:00
Lei Zhang	0d03b3901d	[mlir][StandardToSPIRV] Use spv.UMod for index re-calculation Per Vulkan's SPIR-V environment spec: "While the OpSRem and OpSMod instructions are supported by the Vulkan environment, they require non-negative values and thus do not enable additional functionality beyond what OpUMod provides." The `getOffsetForBitwidth` function is used for lowering std.load and std.store, whose indices are of `index` type and cannot be negative. So we should be okay to use spv.UMod directly here to be exact. Also made the comment explicit about the assumption. Differential Revision: https://reviews.llvm.org/D83714	2020-08-05 14:52:04 -04:00
Lei Zhang	48378a32af	[spirv] Fix bitwidth emulation for Workgroup storage class If Int16 is not available, 16-bit integers inside Workgroup storage class should be emulated via 32-bit integers. This was previously broken because the capability querying logic was incorrectly intercepting all storage classes where it meant to only handle interface storage classes. Adjusted where we return to fix this. Differential Revision: https://reviews.llvm.org/D85308	2020-08-05 14:44:03 -04:00
Alexander Belyaev	bc7456fd8a	[mlir] Fix rank bitwidth in UnrankedMemRefType conversion. Differential Revision: https://reviews.llvm.org/D85300	2020-08-05 18:35:23 +02:00
Alex Zinenko	bdb9295664	[mlir] Fix convert-to-llvmir.mlir test broken due to syntax change The syntax of the LLVM dialect types changed between the time the code was written and it was submitted, leading to a test failure. Update the syntax.	2020-08-05 13:29:35 +02:00
Arpith C. Jacob	fab4b59961	[mlir] Conversion of ViewOp with memory space to LLVM. Handle the case where the ViewOp takes in a memref that has an memory space. Reviewed By: ftynse, bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D85048	2020-08-05 12:19:52 +02:00
Alexander Belyaev	a3d427d30c	[mlir] Lower RankOp to LLVM for unranked memrefs. Differential Revision: https://reviews.llvm.org/D85273	2020-08-05 12:13:43 +02:00
George Mitenkov	e739648cfa	[MLIR][SPIRVToLLVM] Conversion pattern for loop op This patch introduces a conversion of `spv.loop` to LLVM dialect. Similarly to `spv.selection`, op's control attributes are not mapped to LLVM yet and therefore the conversion fails if the loop control is not `None`. Also, all blocks within the loop should be reachable in order for conversion to succeed. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D84245	2020-08-05 10:33:54 +03:00
aartbik	e8dcf5f87d	[mlir] [VectorOps] Add expand/compress operations to Vector dialect Introduces the expand and compress operations to the Vector dialect (important memory operations for sparse computations), together with a first reference implementation that lowers to the LLVM IR dialect to enable running on CPU (and other targets that support the corresponding LLVM IR intrinsics). Reviewed By: reidtatge Differential Revision: https://reviews.llvm.org/D84888	2020-08-04 12:00:42 -07:00
George Mitenkov	b9266f81bc	[MLIR][SPIRVToLLVM] Indentation and style fix in tests Second patch with test fixes. Redundant `%{{.*}} = ` removed, label checks added, tabs converted to spaces and some namings are changed to match the convention. Fixed tests: - constant-op-to-llvm - func-ops-to-llvm (renamed) - memory-ops-to-llvm - misc-ops-to-llvm - module-ops-to-llvm - shift-ops-to-llvm (renamed) - spirv-types-to-llvm-invalid (renamed) Reviewed By: ftynse, rriddle Differential Revision: https://reviews.llvm.org/D85206	2020-08-04 20:53:20 +03:00
Alex Zinenko	ec1f4e7c3b	[mlir] switch the modeling of LLVM types to use the new mechanism A new first-party modeling for LLVM IR types in the LLVM dialect has been developed in parallel to the existing modeling based on wrapping LLVM `Type *` instances. It resolves the long-standing problem of modeling identified structure types, including recursive structures, and enables future removal of LLVMContext and related locking mechanisms from LLVMDialect. This commit only switches the modeling by (a) renaming LLVMTypeNew to LLVMType, (b) removing the old implementaiton of LLVMType, and (c) updating the tests. It is intentionally minimal. Separate commits will remove the infrastructure built for the transition and update API uses where appropriate. Depends On D85020 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85021	2020-08-04 14:29:25 +02:00
George Mitenkov	f003b28363	[MLIR][SPIRVToLLVM] Indentation and style fix in tests This is a first patch that sweeps over tests to fix indentation (tabs to spaces). It also adds label checks and removes redundant matching of `%{{.*}} = `. The following tests have been fixed: - arithmetic-ops-to-llvm - bitwise-ops-to-llvm - cast-ops-to-llvm - comparison-ops-to-llvm - logical-ops-to-llvm (renamed to match the rest) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85181	2020-08-04 14:30:49 +03:00
Frederik Gossen	11492be9d7	[MLIR][Shape] Lower `shape.broadcast` to `scf` Differential Revision: https://reviews.llvm.org/D85027	2020-08-03 08:20:14 +00:00
Thomas Raoux	59156bad03	[mlir][spirv] Add support for converting memref of vector to SPIR-V This allow declaring buffers and alloc of vectors so that we can support vector load/store. Differential Revision: https://reviews.llvm.org/D84982	2020-07-30 15:05:40 -07:00
Alexander Belyaev	6b8c641d8e	[mlir] NFC: Expose `getElementPtrType` and `getSizes` methods of AllocOpLowering. Differential Revision: https://reviews.llvm.org/D84917	2020-07-30 20:18:29 +02:00
Stephan Herhut	85defd23aa	[mlir][shape] Use memref of index in shape lowering Now that we can have a memref of index type, we no longer need to materialize shapes in i64 and then index_cast. Differential Revision: https://reviews.llvm.org/D84938	2020-07-30 15:12:43 +02:00
Stephan Herhut	e12db3ed99	[mlir] Allow index as element type of memref Differential Revision: https://reviews.llvm.org/D84934	2020-07-30 14:35:22 +02:00
Frederik Gossen	a97940d4e0	[MLIR][Shape] Limit `shape.rank` lowering to its extent tensor variant When lowering to the standard dialect, we currently support only the extent tensor variant of the shape.rank operation. This change lets the conversion pattern fail in a well-defined manner. Differential Revision: https://reviews.llvm.org/D84852	2020-07-30 11:43:08 +00:00
George Mitenkov	1880532036	[MLIR][SPIRVToLLVM] Conversion of GLSL ops to LLVM intrinsics This patch introduces new intrinsics in LLVM dialect: - `llvm.intr.floor` - `llvm.intr.maxnum` - `llvm.intr.minnum` - `llvm.intr.smax` - `llvm.intr.smin` These intrinsics correspond to SPIR-V ops from GLSL extended instruction set (`spv.GLSL.Floor`, `spv.GLSL.FMax`, `spv.GLSL.FMin`, `spv.GLSL.SMax` and `spv.GLSL.SMin` respectively). Also conversion patterns for them were added. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84661	2020-07-30 11:22:44 +03:00
George Mitenkov	3aab320557	[MLIR][SPIRVToLLVM] Conversion for inverse sqrt and tanh This is a second patch on conversion of GLSL ops to LLVM dialect. It introduces patterns to convert `spv.InverseSqrt` and `spv.Tanh`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84633	2020-07-30 10:50:48 +03:00
George Mitenkov	647e9a54c7	[MLIR][SPIRVToLLVM] Conversion patterns for GLSL ops This is the first patch that adds support for GLSL extended instruction set ops. These are direct conversions, apart from `spv.Tan` that is lowered to `sin() / cos()`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84627	2020-07-30 10:20:11 +03:00
Frederik Gossen	5fc34fafa7	[MLIR][Shape] Limit shape to SCF lowering patterns to their supported types Differential Revision: https://reviews.llvm.org/D84444	2020-07-29 14:54:09 +00:00
Frederik Gossen	6673c6cd82	[MLIR][Shape] Limit shape to standard lowerings to their supported types The lowering does not support all types for its source operations. This change makes the patterns fail in a well-defined manner. Differential Revision: https://reviews.llvm.org/D84443	2020-07-29 13:56:52 +00:00
Frederik Gossen	b6b9d3ea85	[MLIR][Shape] Remove type conversion from lowering to standard Operating on indices and extent tensors directly, the type conversion is no longer needed for the supported cases. Differential Revision: https://reviews.llvm.org/D84442	2020-07-29 10:48:05 +00:00
Stephan Herhut	5d9f33aaa0	[MLIR][Shape] Add conversion for missing ops to standard This adds conversions for const_size and to_extent_tensor. Also, cast-like operations are now folded away if the source and target types are the same. Differential Revision: https://reviews.llvm.org/D84745	2020-07-29 12:46:18 +02:00
George Mitenkov	1f4aa30a4f	[MLIR][SPIRVToLLVM] Branch weights support for BranchConditional conversion Conversion of `spv.BranchConditional` now supports branch weights that are mapped to weights vector in `llvm.cond_br`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84657	2020-07-29 10:11:10 +03:00
George Mitenkov	b1e398920f	[MLIR][SPIRVToLLVM] Support of volatile/nontemporal memory access in load/store This patch adds support of Volatile and Nontemporal memory accesses to `spv.Load` and `spv.Store`. These attributes are modelled with a `volatile` and `nontemporal` flags. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84739	2020-07-29 08:45:40 +03:00
MaheshRavishankar	e5608cacfd	[mlir][GPUToSPIRV] Add a test pass to set workgroup size for kernel functions. This allows using command line flags to lowere from GPU to SPIR-V. The pass added is only for testing/example purposes. Most uses cases will need more fine-grained control on setting workgroup sizes for kernel functions. Differential Revision: https://reviews.llvm.org/D84619	2020-07-28 12:10:30 -07:00
Frederik Gossen	dfcc09890a	[MLIR][Shape] Lower `shape.const_shape` to `tensor_from_elements` Differential Revision: https://reviews.llvm.org/D82848	2020-07-28 15:40:55 +00:00
Christian Sigg	c64c04bbaa	Clean up cuda-runtime-wrappers API. Do not return error code, instead return created resource handles or void. Error reporting is done by the library function. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D84660	2020-07-28 16:34:08 +02:00
MaheshRavishankar	fbe911ee75	[mlir][AffineToStandard] Make LowerAffine pass Op-agnostic. The LowerAffine psas was a FunctionPass only for legacy reasons. Making this Op-agnostic allows it to be used from command line when affine expressions are within operations other than `std.func`. Differential Revision: https://reviews.llvm.org/D84590	2020-07-27 12:14:17 -07:00
Jacques Pienaar	595d214f47	[mlir][shape] Further operand and result type generalization Previous changes generalized some of the operands and results. Complete a larger group of those to simplify progressive lowering. Also update some of the declarative asm form due to generalization. Tried to keep it mostly mechanical.	2020-07-25 21:41:31 -07:00
George Mitenkov	8be0371eb7	[MLIR][SPIRVToLLVM] Conversion of load and store SPIR-V ops This patch introduces conversion pattern for `spv.Store` and `spv.Load`. Only op with `Function` Storage Class is supported at the moment because `spv.GlobalVariable` has not been introduced yet. If the op has memory access attribute, then there are the following cases. If the access is `Aligned`, add alignment to the op builder. Otherwise the conversion fails as other cases are not supported yet because they require additional attributes for `llvm.store`/`llvm.load` ops: e.g. `volatile` and `nontemporal`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84236	2020-07-24 16:31:45 +03:00
Frederik Gossen	783a351785	[MLIR][Shape] Allow `shape.mul` to operate in indices Differential Revision: https://reviews.llvm.org/D84437	2020-07-24 13:25:40 +00:00
George Mitenkov	5c98631391	[MLIR][SPIRVToLLVM] Conversion of SPIR-V variable op The patch introduces the conversion pattern for function-level `spv.Variable`. It is modelled as `llvm.alloca` op. If initialized, then additional store instruction is used. Note that there is no initialization for arrays and structs since constants of these types are not supported in LLVM dialect yet. Also, at the moment initialisation is only possible via `spv.constant` (since `spv.GlobalVariable` conversion is not implemented yet). The input code has some scoping is not taken into account and will be addressed in a different patch. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D84224	2020-07-24 15:49:55 +03:00
Frederik Gossen	bb442bb51a	[MLIR][Shape] Remove deprecated and unused lowerings This concerns `from/to_extent_tensor`, `size_to_index`, `index_to_size`, and `const_size` conversion patterns. The new lowering will work directly on indices and extent tensors. The shape and size values will allow for error values but are not yet supported by the dialect conversion. Differential Revision: https://reviews.llvm.org/D84436	2020-07-24 11:19:36 +00:00
Frederik Gossen	5984d74139	[MLIR][Shape] Allow `get_extent` to operate on extent tensors and indices Differential Revision: https://reviews.llvm.org/D84435	2020-07-24 11:13:17 +00:00
Frederik Gossen	7f600da828	[MLIR][Shape] Allow `shape.any` to operate on extent tensors Differential Revision: https://reviews.llvm.org/D84433	2020-07-24 11:03:10 +00:00
Frederik Gossen	23a65648c0	[MLIR][Shape] Allow `shape.rank` to operate on extent tensors Differential Revision: https://reviews.llvm.org/D84429	2020-07-24 10:43:39 +00:00
Frederik Gossen	d4e4d5d780	[MLIR][Shape] Allow for `shape_of` to return extent tensors The operation `shape.shape_of` now returns an extent tensor `tensor<?xindex>` in cases when no error are possible. All consuming operation will eventually accept both, shapes and extent tensors. Differential Revision: https://reviews.llvm.org/D84160	2020-07-24 08:40:40 +00:00
Frederik Gossen	0e1a42efd8	[MLIR][Shape] Allow `shape.get_extent` to operate on extent tensors `shape.get_extent` now accepts extent tensors `tensor<?xindex>` as an argument. Differential Revision: https://reviews.llvm.org/D84158	2020-07-24 08:34:37 +00:00
Frederik Gossen	fb1e571687	[MLIR][Standard] Add default lowering for `assert` The default lowering of `assert` calls `abort` in case the assertion is violated. The failure message is ignored but should be used by custom lowerings that can assume more about their environment. Differential Revision: https://reviews.llvm.org/D83886	2020-07-24 08:31:12 +00:00
River Riddle	4589dd924d	[mlir][DialectConversion] Enable deeper integration of type conversions This revision adds support for much deeper type conversion integration into the conversion process, and enables auto-generating cast operations when necessary. Type conversions are now largely automatically managed by the conversion infra when using a ConversionPattern with a provided TypeConverter. This removes the need for patterns to do type cast wrapping themselves and moves the burden to the infra. This makes it much easier to perform partial lowerings when type conversions are involved, as any lingering type conversions will be automatically resolved/legalized by the conversion infra. To support this new integration, a few changes have been made to the type materialization API on TypeConverter. Materialization has been split into three separate categories: * Argument Materialization: This type of materialization is used when converting the type of block arguments when calling `convertRegionTypes`. This is useful for contextually inserting additional conversion operations when converting a block argument type, such as when converting the types of a function signature. * Source Materialization: This type of materialization is used to convert a legal type of the converter into a non-legal type, generally a source type. This may be called when uses of a non-legal type persist after the conversion process has finished. * Target Materialization: This type of materialization is used to convert a non-legal, or source, type into a legal, or target, type. This type of materialization is used when applying a pattern on an operation, but the types of the operands have not yet been converted. Differential Revision: https://reviews.llvm.org/D82831	2020-07-23 19:40:31 -07:00
aartbik	1485fd295b	[mlir] [VectorOps] Improve scatter/gather CPU performance Replaced the linearized address with the proper LLVM way of defining vector of base + indices in SIMD style. This yields much better code. Some prototype results with microbencmarking sparse matrix x vector with 50% sparsity (about 2-3x faster): LINEARIZED IMPROVED GFLOPS sdot saxpy sdot saxpy 16x16 1.6 1.4 4.4 2.1 32x32 1.7 1.6 5.8 5.9 64x64 1.7 1.7 6.4 6.4 128x128 1.7 1.7 5.9 5.9 256x256 1.6 1.6 6.1 6.0 512x512 1.4 1.4 4.9 4.7 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D84368	2020-07-22 23:47:36 -07:00
aartbik	19dbb230a2	[mlir] [VectorOps] Add scatter/gather operations to Vector dialect Introduces the scatter/gather operations to the Vector dialect (important memory operations for sparse computations), together with a first reference implementation that lowers to the LLVM IR dialect to enable running on CPU (and other targets that support the corresponding LLVM IR intrinsics). The operations can be used directly where applicable, or can be used during progressively lowering to bring other memory operations closer to hardware ISA support for a gather/scatter. The semantics of the operation closely correspond to those of the corresponding llvm intrinsics. Note that the operation allows for a dynamic index vector (which is important for sparse computations). However, this first reference lowering implementation "serializes" the address computation when base + index_vector is converted to a vector of pointers. Exploring how to use SIMD properly during these step is TBD. More general memrefs and idiomatic versions of striding are also TBD. Reviewed By: arpith-jacob Differential Revision: https://reviews.llvm.org/D84039	2020-07-21 10:57:40 -07:00
George Mitenkov	61dd481f11	[MLIR][LLVMDialect] SelectionOp conversion pattern This patch introduces conversion pattern for `spv.selection` op. The conversion can only be applied to selection with all blocks being reachable. Moreover, selection with control attributes "Flatten" and "DontFlatten" is not supported. Since the `PatternRewriter` hook for block merging has not been implemented for `ConversionPatternRewriter`, merge and continue blocks are kept separately. Reviewed By: antiagainst, ftynse Differential Revision: https://reviews.llvm.org/D83860	2020-07-21 17:11:46 +03:00
George Mitenkov	05d3160c9c	[MLIR][SPIRVToLLVM] Conversion of SPIR-V branch ops This patch introduces conversion for `spv.Branch` and `spv.BranchConditional` ops. Branch weigths for `spv.BranchConditional` are not supported at the moment, and conversion in this case fails. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D83784	2020-07-21 10:52:20 +03:00
Frederik Gossen	71e7a37e7e	[MLIR][Shape] Allow `shape.rank` to accept extent tensors `tensor?xindex>` Differential Revision: https://reviews.llvm.org/D84156	2020-07-20 14:47:19 +00:00
Yash Jain	3382b7177f	[MLIR] Add lowering for affine.parallel to scf.parallel Add lowering conversion from affine.parallel to scf.parallel. Differential Revision: https://reviews.llvm.org/D83239	2020-07-18 13:13:49 +05:30
Nicolas Vasilache	cc0a58d7cd	[mlir][Vector] Fix masking logic in VectorToSCF Summary: The logic was conservative but inverted: cases that should remain unmasked became 1-D masked. Differential Revision: https://reviews.llvm.org/D84051	2020-07-17 13:24:07 -04:00
Frederik Gossen	aca7b8dd63	[MLIR][Shape] Lower `shape.shape_eq` to `scf` Lower `shape.shape_eq` to the `scf` (and `std`) dialect. For now, this lowering is limited to extent tensor operands. Differential Revision: https://reviews.llvm.org/D82530	2020-07-16 14:44:29 +00:00
Frederik Gossen	67391a7045	[MLIR] Lower `shape.reduce` to `scf.for` only when argument is `tensor<?xindex>` To make it clear when shape error values cannot occur the shape operations can operate on extent tensors. This change updates the lowering for `shape.reduce` accordingly. Differential Revision: https://reviews.llvm.org/D83944	2020-07-16 13:55:48 +00:00
Frederik Gossen	0eb50e614c	[MLIR][Shape] Allow `shape.reduce` to operate on extent tensors Allow `shape.reduce` to take both `shape.shape` and `tensor<?xindex>` as an argument. Differential Revision: https://reviews.llvm.org/D83943	2020-07-16 13:53:37 +00:00
George Mitenkov	be15284ef6	[MLIR][StdToSPIRV] Fixed a typo in ops conversion tests Fixed a typo in `std-ops-to-spitv.mlir` test. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D83791	2020-07-14 21:46:07 +03:00
Nicolas Vasilache	affbc0cd1c	[mlir] Add alignment attribute to LLVM memory ops and use in vector.transfer Summary: The native alignment may generally not be used when lowering a vector.transfer to the underlying load/store operation. This revision fixes the unmasked load/store alignment to match that of the masked path. Differential Revision: https://reviews.llvm.org/D83684	2020-07-13 17:35:20 -04:00
Lei Zhang	4ba45a778a	[mlir][StandardToSPIRV] Fix conversion for signed remainder Per the Vulkan's SPIR-V environment spec, "for the OpSRem and OpSMod instructions, if either operand is negative the result is undefined." So we cannot directly use spv.SRem/spv.SMod if either operand can be negative. Emulate it via spv.UMod. Because the emulation uses spv.SNegate, this commit also defines spv.SNegate. Differential Revision: https://reviews.llvm.org/D83679	2020-07-13 16:15:31 -04:00
Benjamin Kramer	3bffe6022c	[mlir][VectorOps] Lower vector.fma to llvm.fmuladd instead of llvm.fma Summary: These are semantically equivalent, but fmuladd allows decaying the op into fmul+fadd if there is no fma instruction available. llvm.fma lowers to scalar calls to libm fmaf, which is a lot slower. Reviewers: nicolasvasilache, aartbik, ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, Kayjukh, jurahul, msifontes Tags: #mlir Differential Revision: https://reviews.llvm.org/D83666	2020-07-13 12:26:03 +02:00
Frederik Gossen	9df6afbb5c	[MLIR][Shape] Lower `shape.any` Lower `shape.any` to its first operand. Differential Revision: https://reviews.llvm.org/D83123	2020-07-13 08:30:05 +00:00
Nicolas Vasilache	22c8a08fd8	[mlir][Vector] Fold chains of ExtractOp This revision adds folding to ExtractOp by simply concatenating the position attributes.	2020-07-10 09:32:02 -04:00
George Mitenkov	eb6b7c5d4f	[MLIR][SPIRVToLLVM] Conversion of SPIR-V struct type without offset This patch introduces type conversion for SPIR-V structs. Since handling offset case requires thorough testing, it was left out for now. Hence, only structs with no offset are currently supported. Also, structs containing member decorations cannot be translated. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D83403	2020-07-10 10:15:45 +03:00
George Mitenkov	28cd3cbc12	[MLIR][SPIRVToLLVM] Conversion of SPIR-V array, runtime array, and pointer types This patch adds type conversion for 4 SPIR-V types: array, runtime array, pointer and struct. This conversion is integrated using a separate function `populateSPIRVToLLVMTypeConversion()` that adds new type conversions. At the moment, this is a basic skeleton that allows to perfom conversion from SPIR-V array, runtime array and pointer types to LLVM typesystem. There is no support of array strides or storage classes. These will be supported on the case by case basis. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D83399	2020-07-09 18:11:45 +03:00
George Mitenkov	7a4e39b326	[MLIR][SPIRVToLLVM] Implementation of spv.BitFieldSExtract and spv.BitFieldUExtract patterns This patch adds conversion patterns for `spv.BitFieldSExtract` and `spv.BitFieldUExtract`. As in the patch for `spv.BitFieldInsert`, `offset` and `count` have to be broadcasted in vector case and casted to match the type of the base. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D82640	2020-07-08 12:37:37 +03:00
George Mitenkov	00580349c3	[MLIR][SPIRVToLLVM] Miscellaneous ops conversion: select, fmul and undef This patch introduces 3 new direct conversions for SPIR-V ops: - `spv.Select` - `spv.Undef` - `spv.FMul` that was skipped in the patch with arithmetic ops Differential Revision: https://reviews.llvm.org/D83291	2020-07-08 11:06:04 +03:00
River Riddle	9db53a1827	[mlir][NFC] Remove usernames and google bug numbers from TODO comments. These were largely leftover from when MLIR was a google project, and don't really follow LLVM guidelines.	2020-07-07 01:40:52 -07:00
Nicolas Vasilache	bd87c6bce1	[mlir][Vector] Add custom slt / SCF.if folding to VectorToSCF scf.if currently lacks folding on true / false conditionals. Such foldings are a bit more involved than can be addressed immediately. This revision introduces an eager folding for lowering vector.transfer operations in the presence of unrolling. Differential revision: https://reviews.llvm.org/D83146	2020-07-06 08:21:21 -04:00
George Mitenkov	1cfaaf6455	[MLIR][SPIRVToLLVM] Convert spv.constant scalars and vectors This patch introduces conversion pattern for `spv.constant` with scalar and vector types. There is a special case when the constant value is a signed/unsigned integer (vector of integers). Since LLVM dialect does not have signedness semantics, the types had to be converted to signless ints. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D82936	2020-07-02 14:26:58 -04:00
George Mitenkov	8119a374bc	[MLIR][SPIRVToLLVM] SPIR-V function call conversion pattern Added conversion pattern for SPIR-V `FunctionCallOp`. Based on specification, it returns no results or a single result, so can be mapped directly to LLVM dialect's `llvm.call`. Reviewed By: antiagainst, ftynse Differential Revision: https://reviews.llvm.org/D83030	2020-07-02 12:38:27 -04:00
George Mitenkov	03fe7eb16f	[MLIR][SPIRVToLLVM] Implementation of spv.BitFieldInsert pattern This patch introduces conversion pattern for `spv.BitFiledInsert` op, as well as some utility functions to facilitate code reading. Since `spv.BitFiledInsert` may take both vector and integer operands, this case was specifically handled by broadcasting values (`count` and `offset` here) to vectors. Moreover, the types had to be converted to same bitwidth in order to conform with LLVM dialect rules. This was done with `zext` when extending (Note that `count` and `offset` are treated as unsigned) and `trunc` in the opposite case. For the latter one, truncation is safe since the op is defined only when `count`/`offset`/their sum is less than the bitwidth of the result. This introduces a natural bound of the value of 64, which can be expressed as `i8`. Reviewed By: antiagainst, ftynse Differential Revision: https://reviews.llvm.org/D82639	2020-07-02 12:19:12 -04:00
Thomas Raoux	0670f855a7	[mlir][spirv] Add support for lowering scf.for scf/if with return value This allow lowering to support scf.for and scf.if with results. As right now spv region operations don't have return value the results are demoted to Function memory. We create one allocation per result right before the region and store the yield values in it. Then we can load back the value from allocation to be able to use the results. Differential Revision: https://reviews.llvm.org/D82246	2020-07-01 17:08:08 -07:00
George Mitenkov	3819789be6	[MLIR][SPIRVToLLVM] Added Bitcast conversion pattern Added conversion pattern and tests for `spv.Bitcast` op. This one has a direct mapping in LLVM dialect so `DirectConversionPattern` was used. Differential Revision: https://reviews.llvm.org/D82748	2020-06-29 20:32:48 -04:00
George Mitenkov	cd1bc5c15d	[MLIR][SPIRVToLLVM] Convert bitwise and logical not This patch introduces new conversion patterns for bit and logical negation op: `spv.Not` and `spv.LogicalNot`. They are implemented by applying xor on the operand and mask with all bits set. Differential Revision: https://reviews.llvm.org/D82637	2020-06-29 19:16:50 -04:00
Tobias Gysi	10643c9ad8	[mlir] make the bitwidth of device side index computations configurable (reland) Summary: The patch makes the index type lowering of the GPU to NVVM/ROCDL conversion configurable. It introduces a pass option that controls the bitwidth used when lowering index computations and uses the LowerToLLVMOptions structure to control the Standard to LLVM lowering. This commit fixes a use-after-free bug introduced by the reverted commit `d10b1a3`. It implements the following changes: - Added a getDefaultOptions method to the LowerToLLVMOptions struct that returns a reference to statically allocated default options. - Use the getDefaultOptions method to provide default LowerToLLVMOptions (instead of an initializer list). - Added comments to clarify the required lifetime of the LowerToLLVMOptions Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D82475	2020-06-29 12:22:39 +02:00
Alex Zinenko	cba733edf5	[mlir] LLVM dialect: use addressof instead of constant to create function pointers `llvm.mlir.constant` was originally introduced as an LLVM dialect counterpart to `std.constant`. As such, it was supporting "function pointer" constants derived from the symbol name. This is different from `std.constant` that allows for creation of a "function" constant since MLIR, unlike LLVM IR, supports this. Later, `llvm.mlir.addressof` was introduced as an Op that obtains a constant pointer to a global in the LLVM dialect. It naturally extends to functions (in LLVM IR, functions are globals) and should be used for defining "function pointer" values instead. Fixes PR46344. Differential Revision: https://reviews.llvm.org/D82667	2020-06-29 12:21:33 +02:00
Frederik Gossen	d876e3202a	[MLIR][Shape] Lower `shape.get_extent` to `extract_element` when possible When the origin of a shape is an extent tensor the operation `get_extent` can be lowered directly to `extract_element`. This choice circumvents the necessity to materialize the shape in memory. Differential Revision: https://reviews.llvm.org/D82645	2020-06-29 08:39:22 +00:00
Frederik Gossen	76d72c941d	[MLIR][Shape] Lower `shape.get_extent` to `std.dim` when possible When the shape is derived from a tensor argument the shape extent can be derived directly from that tensor with `std.dim`. This lowering pattern circumvents the necessity to materialize the shape in memory. Differential Revision: https://reviews.llvm.org/D82644	2020-06-29 08:38:22 +00:00
aartbik	ceb1b327b5	[mlir] [VectorOps] Add the ability to mark FP reductions with "reassociate" attribute Rationale: In general, passing "fastmath" from MLIR to LLVM backend is not supported, and even just providing such a feature for experimentation is under debate. However, passing fine-grained fastmath related attributes on individual operations is generally accepted. This CL introduces an option to instruct the vector-to-llvm lowering phase to annotate floating-point reductions with the "reassociate" fastmath attribute, which allows the LLVM backend to use SIMD implementations for such constructs. Oher lowering passes can start using this mechanism right away in cases where reassociation is allowed. Benefit: For some microbenchmarks on x86-avx2, speedups over 20 were observed for longer vector (due to cleaner, spill-free and SIMD exploiting code). Usage: mlir-opt --convert-vector-to-llvm="reassociate-fp-reductions" Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D82624	2020-06-26 11:03:14 -07:00
George Mitenkov	c8295de4a6	[MLIR][SPIRVToLLVM] Conversion for bitrverse and bitcount ops Implemented conversion for `spv.BitReverse` and `spv.BitCount`. Since ODS generates builders in a different way for LLVM dialect intrinsics, I added attributes to build method in `DirectConversionPattern` class. The tests for these ops are in `bitwise-ops-to-llvm.mlir`. Differential Revision: https://reviews.llvm.org/D82286	2020-06-26 10:30:52 -04:00
Alex Zinenko	6323065fd6	[mlir] support returning unranked memrefs Initially, unranked memref descriptors in the LLVM dialect were designed only to be passed into functions. An assertion was guarding against returning unranked memrefs from functions in the standard-to-LLVM conversion. This is insufficient for functions that wish to return an unranked memref such that the caller does not know the rank in advance, and hence cannot allocate the descriptor and pass it in as an argument. Introduce a calling convention for returning unranked memref descriptors as follows. An unranked memref descriptor always points to a ranked memref descriptor stored on stack of the current function. When an unranked memref descriptor is returned from a function, the ranked memref descriptor it points to is copied to dynamically allocated memory, the ownership of which is transferred to the caller. The caller is responsible for deallocating the dynamically allocated memory and for copying the pointed-to ranked memref descriptor onto its stack. Provide default lowerings for std.return, std.call and std.indirect_call that maintain the conversion defined above. This convention is additionally exercised by a runtime test to guard against memory errors. Differential Revision: https://reviews.llvm.org/D82647	2020-06-26 15:37:37 +02:00
Frederik Gossen	e34b88309e	[MLIR][Shape] Lower `shape_of` for unranked tensors Lower `shape_of` for unranked tensors. Materializes shape in stack-allocated memory. Differential Revision: https://reviews.llvm.org/D82196	2020-06-25 08:50:45 +00:00
Frederik Gossen	24debf5a76	[MLIR][Shape] Lower `shape.rank` Lower `shape.rank` to standard dialect. A shape's size is the same as the extent of the first and only dimension of the `tensor<?xindex>` it is represented by. Differential Revision: https://reviews.llvm.org/D82080	2020-06-25 08:44:06 +00:00
George Mitenkov	b5c24c24a4	[MLIR][SPIRVToLLVM] Implementation of SPIR-V module conversion pattern This patch introduces conversion patterns for `spv.module` and `spv._module_end`. SPIR-V module is converted into `ModuleOp`. This will play a role of enclosing scope to LLVM ops. At the moment, SPIR-V module attributes (such as memory model, etc) are ignored. Differential Revision: https://reviews.llvm.org/D82468	2020-06-24 20:42:50 -04:00
Tobias Gysi	2ff6fad700	Revert "[mlir] make the bitwidth of device side index computations configurable" This reverts commit `d10b1a38a7`.	2020-06-23 19:21:36 +02:00
George Mitenkov	a4dc61344f	[MLIR][SPIRVToLLVM] Implementation of spv.func conversion, and return ops This patch provides an implementation for `spv.func` conversion. The pattern is populated in a separate method added to the pass. At the moment, the type signature conversion only includes the supported types. The conversion pattern also matches SPIR-V function control attributes to LLVM function attributes. Those are modelled as `passthrough` attributes in LLVM dialect. The following mapping are used: - None: no attributes passed - Inline: `alwaysinline` seems to be the right equivalent (`inlinehint` is semantically weaker in my opinion) - DontInline: `noinline` - Pure and Const: I think those can be modelled as `readonly` and `readnone` attributes respectively. Also, 2 patterns added for return ops conversion (`spv.Return` for void return and `spv.ReturnValue` for a single value return). Differential Revision: https://reviews.llvm.org/D81931	2020-06-23 11:34:11 -04:00
HazemAbdelhafez	02022ff2e3	[mlir][spirv] Enhance AccessChainOp index type handling This patch extends the AccessChainOp index type handling to be able to deal with all Integer type indices (i.e., all bit-widths and signedness symantics). There were two ways of achieving this: 1- Backward compatible: The new way of handling the indices will assume that an index type is i32 by default if not specified in the assembly format, this way all the old tests would pass correctly. 2- Enforce the format: This unifies the spv.AccessChain Op format and all the old tests had to be updated to reflect this change or else they fail. I picked option-2 to unify the Op format and avoid having optional index-type fields that can lead to somewhat confusing tests format and multiple representations for the same Op with undocumented assumption that an index is i32 unless stated. Nonetheless, reverting to option-1 should be straightforward if preferred or needed. Differential Revision: https://reviews.llvm.org/D81763	2020-06-22 10:11:33 -04:00
Tobias Gysi	d10b1a38a7	[mlir] make the bitwidth of device side index computations configurable The patch makes the index type lowering of the GPU to NVVM/ROCDL conversion configurable. It introduces a pass option that controls the bitwidth used when lowering index computations. Differential Revision: https://reviews.llvm.org/D80285	2020-06-22 11:43:37 +02:00
Thomas Raoux	670455c77d	[mlir][spirv] Legalize subviewop when used with vector transfer Subview operations are not natively supported downstream in the spirv path. This change allows removing subview when used by vector transfer the same way we already do it when they are used by LoadOp/StoreOp Differential Revision: https://reviews.llvm.org/D82106	2020-06-19 17:33:15 -07:00
aartbik	0d82ab7885	[mlir] [VectorOps] Improve vector.constant_mask lowering Use direct vector constants for the 1-D case. This approach scales much better than generating elaborate insertion operations that are eventually folded into a constant. We could of course generalize the 1-D case to higher ranks, but this simplification already helps in scaling some microbenchmarks that would formerly crash on the intermediate IR length. Reviewed By: reidtatge Differential Revision: https://reviews.llvm.org/D82144	2020-06-19 10:40:08 -07:00
Frederik Gossen	ac3e5c4d93	[MLIR][Shape] Lower `shape.shape_of` to standard dialect Lower `shape.shape_of` to standard dialect. This lowering supports statically and dynamically shaped tensors. Support for unranked tensors will be added as part of the lowering to `scf`. Differential Revision: https://reviews.llvm.org/D82098	2020-06-19 15:21:13 +00:00
aartbik	c9eeeb3871	[mlir] [VectorOps] remove print_i1 from runtime support library Summary: The "i1" (viz. bool) type does not have a proper equivalent on the "C" size. So, to avoid any ABIs issues, we simply use print_i32 on an i32 value of one or zero for true and false. This has the added advantage that one less function needs to be implemented when porting the runtime support library. Reviewers: ftynse, bkramer, nicolasvasilache Reviewed By: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, msifontes Tags: #mlir Differential Revision: https://reviews.llvm.org/D82048	2020-06-18 11:07:43 -07:00
George Mitenkov	771b788687	[MLIR][SPIRVToLLVM] Support cast ops, some logical ops, UModOp Added support of simple logical ops: `LogicalAnd`, `LogicalOr`, `LogicalEqual` and `LogicalNotEqual`. Added a missing conversion for `UMod` op. Also, implemented SPIR-V cast ops conversion. There are 4 simple case where there is a clear equivalent in LLVM (e.g. `ConvertFToS` is `fptosi`). For `FConvert`, `SConvert` and `UConvert` we distinguish between truncation and extension based on the bit width of the operand. Differential Revision: https://reviews.llvm.org/D81812	2020-06-17 17:46:45 -04:00
Frederik Gossen	0990f1a3ad	[MLIR][Standard] Lower `std.dim` with dynamic dimension operand to LLVM Implement the missing lowering from `std.dim` to the LLVM dialect in case of a dynamic dimension. Differential Revision: https://reviews.llvm.org/D81834	2020-06-16 20:57:42 +00:00
Jacques Pienaar	2b41bc5a8b	[mlir][shape] Update test case to new op asm format	2020-06-15 09:04:54 -07:00
Alexander Belyaev	3813f24e97	[mlir][shape] Add a pattern to rewrite `shape.reduce` as `scf.for`. Differential Revision: https://reviews.llvm.org/D81694	2020-06-15 17:54:50 +02:00
Frederik Gossen	361f664850	[MLIR][Standard] Add documentation for `std.dim` and fix test cases Apply post-commit suggestions (see https://reviews.llvm.org/D81551). Add documentation, simplify, and fix test cases. Differential Revision: https://reviews.llvm.org/D81722	2020-06-15 10:40:36 +00:00
Alexander Belyaev	cd320446f4	[mlir][shape] Lower Shape `ConstSizeOp` to Standard `ConstantOp`. Differential Revision: https://reviews.llvm.org/D81735	2020-06-15 10:42:05 +02:00
George Mitenkov	cf2b4d5cb6	[MLIR][SPIRVToLLVM] Implemented shift conversion pattern This patch has shift ops conversion implementation. In SPIR-V dialect, `Shift` and `Base` may have different bit width. On the contrary, in LLVM dialect both `Base` and `Shift` have to be of the same bit width. This leads to the following cases: - if `Base` has the same bit width as `Shift`, the conversion is straightforward. - if `Base` has a greater bit width than `Shift`, shift is sign/zero extended first. Then the extended value is passed to the shift. - otherwise the conversion is considered to be illegal. Differential Revision: https://reviews.llvm.org/D81546	2020-06-12 19:04:30 -04:00
Mehdi Amini	95371ce9c2	Enable FileCheck -enable-var-scope by default in MLIR test This option avoids to accidentally reuse variable across -LABEL match, it can be explicitly opted-in by prefixing the variable name with $ Differential Revision: https://reviews.llvm.org/D81531	2020-06-12 00:43:09 +00:00
George Mitenkov	fc148a4c88	[MLIR][SPIRVToLLVM] Added conversion for SPIR-V comparison ops Implemented `FComparePattern` and `IComparePattern` classes that provide conversion of SPIR-V comparison ops (such as `spv.FOrdGreaterThanEqual` and others) to LLVM dialect. Also added tests in `comparison-ops-to-llvm.mlir`. Differential Revision: https://reviews.llvm.org/D81487	2020-06-11 18:46:17 -04:00
jerryyin	2abad3433f	[mlir][rocdl] Adding vector to ROCDL dialect lowering * Created the vector to ROCDL lowering pass * The lowering pass lowers vector transferOps to rocdl mubufOps * Added unit test and functional test	2020-06-11 14:28:13 +00:00
George Mitenkov	d93d8fcdec	[MLIR][SPIRVToLLVM] Implemented conversion for arithmetic ops and 3 bitwise ops. Following the previous revision `D81100`, this commit implements a templated class that would provide conversion patterns for “straightforward” SPIR-V ops into LLVM dialect. Templating allows to abstract away from concrete implementation for each specific op. Those are mainly binary operations. Currently supported and tested ops are: - Arithmetic ops: `IAdd`, `ISub`, `IMul`, `FAdd`, `FSub`, `FMul`, `FDiv`, `FNegate`, `SDiv`, `SRem` and `UDiv` - Bitwise ops: `BitwiseAnd`, `BitwiseOr`, `BitwiseXor` The implementation relies on `SPIRVToLLVMConversion` class that makes use of `OpConversionPattern`. Differential Revision: https://reviews.llvm.org/D81305	2020-06-10 19:10:31 -04:00
Mehdi Amini	83d920c72a	Fix MLIR test: -dump-input-on-failure is no longer a valid option	2020-06-10 15:58:58 +00:00
Frederik Gossen	904f91db5f	[MLIR][Standard] Make the `dim` operation index an operand. Allow for dynamic indices in the `dim` operation. Rather than an attribute, the index is now an operand of type `index`. This allows to apply the operation to dynamically ranked tensors. The correct lowering of dynamic indices remains to be implemented. Differential Revision: https://reviews.llvm.org/D81551	2020-06-10 13:54:47 +00:00
Mehdi Amini	d31c9e5a46	Change filecheck default to dump input on failure Having the input dumped on failure seems like a better default: I debugged FileCheck tests for a while without knowing about this option, which really helps to understand failures. Remove `-dump-input-on-failure` and the environment variable FILECHECK_DUMP_INPUT_ON_FAILURE which are now obsolete. Differential Revision: https://reviews.llvm.org/D81422	2020-06-09 18:57:46 +00:00
Stephan Herhut	2c8afe1298	[mlir][gpu] Add support for f16 when lowering to nvvm intrinsics Summary: The NVVM target only provides implementations for tanh etc. on f32 and f64 operands. To also support f16, we now insert operations to extend to f32 and truncate back to f16 around the intrinsic call. Differential Revision: https://reviews.llvm.org/D81473	2020-06-09 19:33:45 +02:00
George Mitenkov	fda5192d4f	[MLIR][SPIRVToLLVM] Add skeleton for SPIR-V to LLVM dialect conversion These commits set up the skeleton for SPIR-V to LLVM dialect conversion. I created SPIR-V to LLVM pass, registered it in Passes.td, InitAllPasses.h. Added a pattern for `spv.BitwiseAndOp` and tests for it. Integer, float and vector types are converted through LLVMTypeConverter. Differential Revision: https://reviews.llvm.org/D81100	2020-06-08 18:22:42 -04:00
Alexander Belyaev	80be54c08f	[mlir] Lower Shape binary ops (AddOp, MulOp) to Standard. Differential Revision: https://reviews.llvm.org/D81344	2020-06-08 17:48:01 +02:00
Frederik Gossen	970bb4a291	[MLIR] Add `to/from_extent_tensor` lowering to the standard dialect The operations `to_extent_tensor` and `from_extent_tensor` become no-ops when lowered to the standard dialect. This is possible with a lowering from `shape.shape` to `tensor<?xindex>`. Differential Revision: https://reviews.llvm.org/D81162	2020-06-08 09:38:18 +00:00
Frederik Gossen	867bc41e85	[MLIR] Add type conversion for `shape.shape` Convert `shape.shape` to `tensor<?xindex>` when lowering the `shape` to the `std` dialect. Differential Revision: https://reviews.llvm.org/D81161	2020-06-08 09:34:03 +00:00
Nicolas Vasilache	247e185dd5	[mlir][Vector] Move temporary alloc to top of the function alloca when lowering vector_transfers Recently introduced allocation hoisting is quite conservative on the cases when it triggers. This revision makes it such that the allocations for vector transfer lowerings are hoisted to the top of the function. This should be revisited in the context of parallelism and is a temporary workaround. Differential Revision: https://reviews.llvm.org/D81253	2020-06-05 08:45:52 -04:00
River Riddle	c0cd1f1c5c	[mlir] Refactor BoolAttr to be a special case of IntegerAttr This simplifies a lot of handling of BoolAttr/IntegerAttr. For example, a lot of places currently have to handle both IntegerAttr and BoolAttr. In other places, a decision is made to pick one which can lead to surprising results for users. For example, DenseElementsAttr currently uses BoolAttr for i1 even if the user initialized it with an Array of i1 IntegerAttrs. Differential Revision: https://reviews.llvm.org/D81047	2020-06-04 16:41:24 -07:00
Diego Caballero	5c990d6994	[mlir] Add support for bf16 to StandardToLLVM conversion Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D81127	2020-06-04 14:36:36 -07:00
Thomas Raoux	661235e126	[mlir][gpu] Add subgroup Id/Size/Num to GPU dialect Add SubgroupId, SubgroupSize and NumSubgroups to GPU dialect ops and add the lowering of those ops to SPIRV. Differential Revision: https://reviews.llvm.org/D81042	2020-06-04 10:52:40 -07:00
Hanhan Wang	0b025d2733	[mlir][StandardToSPIRV] Handle i1 case for lowering std.zexti to SPIR-V. Differential Revision: https://reviews.llvm.org/D80965	2020-06-03 15:01:18 -07:00
Frederik Gossen	3713314bfa	[MLIR] Shape to standard dialect lowering Add a new pass to lower operations from the `shape` to the `std` dialect. The conversion applies only to the `size_to_index` and `index_to_size` operations and affected types. Other patterns will be added as needed. Differential Revision: https://reviews.llvm.org/D81091	2020-06-03 16:17:03 +00:00
MaheshRavishankar	2bcd1927dd	[mlir][SCFToGPU] Remove conversions from scf.for to gpu.launch. Keeping in the affine.for to gpu.launch conversions, which should probably be the affine.parallel to gpu.launch conversion as well. Differential Revision: https://reviews.llvm.org/D80747	2020-06-01 23:06:20 -07:00
Nicolas Vasilache	5f9e0466f2	[mlir][Vector] Fix vector.transfer alignment calculation https://reviews.llvm.org/D79246 introduces alignment propagation for vector transfer operations. Unfortunately, the alignment calculation is incorrect and can result in crashes. This revision fixes the calculation by using the natural alignment of the memref elemental type, instead of the resulting vector type. If more alignment is desired, it can be done in 2 ways: 1. use a proper vector.type_cast to transform a memref<axbxcxdxf32> into a memref<axbxvector<cxdxf32>> giving a natural alignment of vector<cxdxf32> 2. add an alignment attribute to vector transfer operations and propagate it. With this change the alignment in the relevant tests goes down from 128 to 4. Lastly, a few minor cleanups are performed and the custom `isMinorIdentityMap` is deprecated. Differential Revision: https://reviews.llvm.org/D80734	2020-05-28 17:58:51 -04:00
Wen-Heng (Jack) Chung	061fb8eb2d	[mlir][gpu][mlir-cuda-runner] Refactor ConvertKernelFuncToCubin to be generic. Make ConvertKernelFuncToCubin pass to be generic: - Rename to ConvertKernelFuncToBlob. - Allow specifying triple, target chip, target features. - Initializing LLVM backend is supplied by a callback function. - Lowering process from MLIR module to LLVM module is via another callback. - Change mlir-cuda-runner to adopt the revised pass. - Add new tests for lowering to ROCm HSA code object (HSACO). - Tests for CUDA and ROCm are kept in separate directories. Differential Revision: https://reviews.llvm.org/D80142	2020-05-28 09:08:28 -05:00
aartbik	c295a65da4	[mlir] [VectorOps] Add 'vector.flat_transpose' operation Summary: Provides a representation of the linearized LLVM instrinsic. With tests and lowering implementation to LLVM IR dialect. Prepares better lowering for 2-D vector.transpose. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, dcaballe Reviewed By: ftynse, dcaballe Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80419	2020-05-27 11:09:48 -07:00
MaheshRavishankar	4d6f44f5f0	[mlir][spirv] Lower allocation/deallocations of workgroup memory. This allocation of a workgroup memory is lowered to a spv.globalVariable. Only static size allocation with element type being int or float is handled. The lowering does account for the element type that are not supported in the lowered spv.module based on the extensions/capabilities and adjusts the number of elements to get the same byte length. Differential Revision: https://reviews.llvm.org/D80411	2020-05-27 09:53:16 -07:00
Wen-Heng (Jack) Chung	2cbbc266ec	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-21 08:53:47 -05:00
Mehdi Amini	5c3ebd7725	Revert "[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass." This reverts commit `cdb6f05e2d`. The build is broken with: You have called ADD_LIBRARY for library obj.MLIRGPUtoCUDATransforms without any source files. This typically indicates a problem with your CMakeLists.txt file	2020-05-21 03:44:35 +00:00
Wen-Heng (Jack) Chung	cdb6f05e2d	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-20 16:11:48 -05:00
MaheshRavishankar	0e88eb5c51	[mlir][spirv] Adapt subview legalization to the updated op semantics. The subview semantics changes recently to allow for more natural representation of constant offsets and strides. The legalization of subview op for lowering to SPIR-V needs to account for this. Also change the linearization to use the strides from the affine map of a memref. Differential Revision: https://reviews.llvm.org/D80270	2020-05-20 12:00:21 -07:00
Nicolas Vasilache	7c3c5b11b1	[mlir][Vector] Add option to fully unroll for VectorTransfer to SCF lowering Summary: Previously, the only support partial lowering from vector transfers to SCF was going through loops. This requires a dedicated allocation and extra memory roundtrips because LLVM aggregates cannot be indexed dynamically (for more details see the [deep-dive](https://mlir.llvm.org/docs/Dialects/Vector/#deeperdive)). This revision allows specifying full unrolling which removes this additional roundtrip. This should be used carefully though because full unrolling will spill, negating the benefits of removing the interim alloc in the first place. Proper heuristics are left for a later time. Differential Revision: https://reviews.llvm.org/D80100	2020-05-20 11:02:13 -04:00
Alex Zinenko	a7d88a9038	[mlir] SCFToStandard: support any ops in and around the control flow ops Originally, the SCFToStandard conversion only declared Ops from the Standard dialect as legal after conversion. This is undesirable as it would fail the conversion if the SCF ops contained ops from any other dialect. Furthermore, this would be problematic for progressive lowering of `scf.parallel` to `scf.for` after `ensureRegionTerminator` is made aware of the pattern rewriting infrastructure because it creates temporary `scf.yield` operations declared illegal. Change the legalization target to declare any op other than `scf.for`, `scf.if` and `scf.parallel` legal. Differential Revision: https://reviews.llvm.org/D80137	2020-05-20 16:12:05 +02:00
Alex Zinenko	eab4a199d1	[mlir] NFC: rename tests related to SCF dialect from Loops to SCF The dialect and conversions from/to it were renamed in previous commits. Differential Revision: https://reviews.llvm.org/D80216	2020-05-20 15:08:23 +02:00
Hanhan Wang	520a570268	[mlir][StandardToSPIRV] Fix signedness issue in bitwidth emulation. Summary: Previously, after applying the mask, a negative number would convert to a positive number because the sign flag was forgotten. This patch adds two more shift operations to do the sign extension. This assumes that we're using two's complement. This patch applies sign extension unconditionally when loading a unspported integer width, and it relies the pattern to do the casting because the signedness semantic is carried by operator itself. Differential Revision: https://reviews.llvm.org/D79753	2020-05-19 11:00:01 -07:00
Nicolas Vasilache	1870e787af	[mlir][Vector] Add an optional "masked" boolean array attribute to vector transfer operations Summary: Vector transfer ops semantic is extended to allow specifying a per-dimension `masked` attribute. When the attribute is false on a particular dimension, lowering to LLVM emits unmasked load and store operations. Differential Revision: https://reviews.llvm.org/D80098	2020-05-18 11:52:08 -04:00
Nicolas Vasilache	36cdc17f8c	[mlir][Vector] Make minor identity permutation map optional in transfer op printing and parsing Summary: This revision makes the use of vector transfer operatons more idiomatic by allowing to omit and inferring the permutation_map. Differential Revision: https://reviews.llvm.org/D80092	2020-05-18 11:41:27 -04:00
Alex Zinenko	4ead2cf76c	[mlir] Rename conversions involving ex-Loop dialect to mention SCF The following Conversions are affected: LoopToStandard -> SCFToStandard, LoopsToGPU -> SCFToGPU, VectorToLoops -> VectorToSCF. Full file paths are affected. Additionally, drop the 'Convert' prefix from filenames living under lib/Conversion where applicable. API names and CLI options for pass testing are also renamed when applicable. In particular, LoopsToGPU contains several passes that apply to different kinds of loops (`for` or `parallel`), for which the original names are preserved. Differential Revision: https://reviews.llvm.org/D79940	2020-05-15 10:45:11 +02:00
MaheshRavishankar	0b3e478b10	[mlir][GPUToSPIRV] Use default ABI only when none of the arguments have abi attributes. To ensure there is no conflict, use the default ABI only when none of the arguments have the spv.interface_var_abi attribute. This also implies that if one of the arguments has a spv.interface_var_abi attribute, all of them should have it as well. Differential Revision: https://reviews.llvm.org/D77232	2020-05-14 21:48:51 -07:00
Nicolas Vasilache	f1b972041a	[mlir][Linalg] Start a LinalgToStandard pass and move conversion to library calls. This revision starts decoupling the include the kitchen sink behavior of Linalg to LLVM lowering by inserting a -convert-linalg-to-std pass. The lowering of linalg ops to function calls was previously lowering to memref descriptors by having both linalg -> std and std -> LLVM patterns in the same rewrite. When separating this step, a new issue occurred: the layout is automatically type-erased by this process. This revision therefore introduces memref casts to perform these type erasures explicitly. To connect everything end-to-end, the LLVM lowering of MemRefCastOp is relaxed because it is artificially more restricted than the op semantics. The op semantics already guarantee that source and target MemRefTypes are cast-compatible. An invalid lowering test now becomes valid and is removed. Differential Revision: https://reviews.llvm.org/D79468	2020-05-15 00:24:03 -04:00
Diego Caballero	bc5565f9ea	[mlir][Affine] Introduce affine.vector_load and affine.vector_store This patch adds `affine.vector_load` and `affine.vector_store` ops to the Affine dialect and lowers them to `vector.transfer_read` and `vector.transfer_write`, respectively, in the Vector dialect. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D79658	2020-05-14 13:17:58 -07:00
Alex Zinenko	60f443bb3b	[mlir] Change dialect namespace loop->scf All ops of the SCF dialect now use the `scf.` prefix instead of `loop.`. This is a part of dialect renaming. Differential Revision: https://reviews.llvm.org/D79844	2020-05-13 19:20:21 +02:00
MaheshRavishankar	49e6c19100	[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. Differential Revision: https://reviews.llvm.org/D79505	2020-05-12 23:15:25 -07:00
aartbik	fb2c4d50f1	[mlir] [VectorOps] Implement vector.constant_mask lowering to LLVM IR Summary: Makes this operation runnable on CPU by generating MLIR instructions that are eventually folded into an LLVM IR constant for the mask. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, andydavis1 Reviewed By: nicolasvasilache, ftynse, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79815	2020-05-12 19:44:23 -07:00
Nicolas Vasilache	63c0e72b2f	[mlir] Revisit std.subview handling of static information. The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. Lowering to LLVM is updated, simplified and now supports all cases. A mixed static-dynamic mode test that wouldn't previously lower is added. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Differential Revision: https://reviews.llvm.org/D79662	2020-05-12 20:04:44 -04:00
Sam McCall	691e826995	Revert "[mlir] Revisit std.subview handling of static information." This reverts commit `80d133b24f`. Per Stephan Herhut: The canonicalizer pattern that was added creates forms of the subview op that cannot be lowered. This is shown by failing Tensorflow XLA tests such as: tensorflow/compiler/xla/service/mlir_gpu/tests:abs.hlo.test Will provide more details offline, they rely on logs from private CI.	2020-05-12 15:18:50 +02:00
Hanhan Wang	756d6959d7	[mlir][StandardToSPIRV] Add support for lowering index_cast to SPIR-V. Differential Revision: https://reviews.llvm.org/D79644	2020-05-11 15:41:25 -07:00
Nicolas Vasilache	80d133b24f	[mlir] Revisit std.subview handling of static information. Summary: The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Reviewers: ftynse, mravishankar, antiagainst, rriddle!, andydavis1, timshen, asaadaldien, stellaraccident Reviewed By: mravishankar Subscribers: aartbik, bondhugula, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, bader, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79662	2020-05-11 17:44:24 -04:00
Reid Tatge	334a4159ec	[mlir][Vector] NFC - Rename vector.strided_slice into vector.extract_strided_slice Differential Revision: https://reviews.llvm.org/D79734	2020-05-11 14:21:10 -07:00
Nicolas Vasilache	6ed61a26c2	[mlir] Simplify and better document std.view semantics This [discussion](https://llvm.discourse.group/t/viewop-isnt-expressive-enough/991/2) raised some concerns with ViewOp. In particular, the handling of offsets is incorrect and does not match the op description. Note that with an elemental type change, offsets cannot be part of the type in general because sizeof(srcType) != sizeof(dstType). Howerver, offset is a poorly chosen term for this purpose and is renamed to byte_shift. Additionally, for all intended purposes, trying to support non-identity layouts for this op does not bring expressive power but rather increases code complexity. This revision simplifies the existing semantics and implementation. This simplification effort is voluntarily restrictive and acts as a stepping stone towards supporting richer semantics: treat the non-common cases as YAGNI for now and reevaluate based on concrete use cases once a round of simplification occurred. Differential revision: https://reviews.llvm.org/D79541	2020-05-11 12:29:23 -04:00
Hanhan Wang	3f07cab312	[mlir][StandardToLLVM] Add support for lowering FPToSIOp to LLVM. Summary: Depends On D79374 Differential Revision: https://reviews.llvm.org/D79455	2020-05-11 01:29:24 -07:00
Hanhan Wang	ac691c4fe7	[mlir][StandardToSPIRV] Add support for lowering FPToSIOp to SPIR-V. Summary: Depends On D79373 Differential Revision: https://reviews.llvm.org/D79374	2020-05-11 01:27:54 -07:00
Frederik Gossen	5d5f61fc89	[MLIR] Add complex addition and substraction to the standard dialect Complex addition and substraction are the first two binary operations on complex numbers. Remaining operations will follow the same pattern. Differential Revision: https://reviews.llvm.org/D79479	2020-05-08 09:54:18 +00:00
Wen-Heng (Jack) Chung	a23f190213	[mlir][vector] set alignment when lowering transfer_read and transfer_write. When emitting masked load / store, set alignment from data layout. Differential Revision: https://reviews.llvm.org/D79246	2020-05-07 11:44:25 +02:00
Alexander Belyaev	b79751e83d	[MLIR] Add conversion from AtomicRMWOp -> GenericAtomicRMWOp. Adding this pattern reduces code duplication. There is no need to have a custom implementation for lowering to llvm.cmpxchg. Differential Revision: https://reviews.llvm.org/D78753	2020-05-05 10:32:13 +02:00
Hanhan Wang	5d10613b6e	[mlir][StandardToSPIRV] Emulate bitwidths not supported for store op. Summary: As D78974, this patch implements the emulation for store op. The emulation is done with atomic operations. E.g., if the storing value is i8, rewrite the StoreOp to: 1) load a 32-bit integer 2) clear 8 bits in the loading value 3) store 32-bit value back 4) load a 32-bit integer 5) modify 8 bits in the loading value 6) store 32-bit value back The step 1 to step 3 are done by AtomicAnd as one atomic step, and the step 4 to step 6 are done by AtomicOr as another atomic step. Differential Revision: https://reviews.llvm.org/D79272	2020-05-04 15:18:44 -07:00
Frederik Gossen	031265ad8a	[MLIR] Add complex numbers to standard dialect Add `CreateComplexOp`, `ReOp`, and `ImOp` to the standard dialect. This is the first step to support complex numbers. Differential Revision: https://reviews.llvm.org/D79159	2020-05-04 14:04:28 +00:00
Wen-Heng (Jack) Chung	bc23c1d85e	[mlir][rocdl] add rocdl.barier op. - Add rocdl.barrier op. - Lower gpu.barier to rocdl.barrier in -convert-gpu-to-rocdl. Differential Revision: https://reviews.llvm.org/D79126	2020-05-04 10:35:01 +02:00
Wen-Heng (Jack) Chung	a581c6f8cd	[mlir][vector] add tests for type_cast taking non-zero addrspace Add tests for vector.type_cast that takes memrefs on non-zero addrspaces. Differential Revision: https://reviews.llvm.org/D79099	2020-05-04 10:31:12 +02:00
MaheshRavishankar	43b89ecdb9	[mlir] Add sine operation to Standard dialect. Also add lowering of sine operation to SPIR-V dialect. Differential Revision: https://reviews.llvm.org/D79102	2020-04-30 22:15:42 -07:00

... 2 3 4 5 6 ...

581 Commits