llvm-project

Commit Graph

Author	SHA1	Message	Date
Lei Zhang	5299843c31	[mlir][spirv] Add control for non-32-bit scalar type emulation Non-32-bit scalar types require special hardware support that may not exist on all GPUs. This is reflected in SPIR-V as that non-32-bit scalar types require special capabilities or extensions. Previously when there is a non-32-bit type and no native support, we unconditionally emulate it with 32-bit ones. This isn't good given that it can have implications over ABI and data layout consistency. This commit introduces an option to control whether to use 32-bit types to emulate. Differential Revision: https://reviews.llvm.org/D100059	2021-04-08 08:19:47 -04:00
Tobias Gysi	b614ada0e8	[mlir] add support for index type in vectors. The patch enables the use of index type in vectors. It is a prerequisite to support vectorization for indexed Linalg operations. This refactoring became possible due to the newly introduced data layout infrastructure. The data layout of a module defines the bitwidth of the index type needed to verify bitcasts and similar vector operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D99948	2021-04-08 08:17:13 +00:00
Matthias Springer	65a3f28939	[mlir] Add "mask" operand to vector.transfer_read/write. Also factors out out-of-bounds mask generation from vector.transfer_read/write into a new MaterializeTransferMask pattern. Differential Revision: https://reviews.llvm.org/D100001	2021-04-07 21:33:13 +09:00
Rob Suderman	0312b25df0	[mlir][tosa] Add tosa.table lowering to linalg.generic Table op lowering to linalg.generic for both i8 (behaves like a gather) and a pair of gathers with a quantized interpolation. Differential Revision: https://reviews.llvm.org/D99756	2021-04-06 13:57:18 -07:00
Alex Zinenko	7dc7790ec5	[mlir] Fix support for lowering non-32-bit affine reductions. The existing implementation was always creating 32-bit constants for floating-point and integer reductions regardless of the actual type, which resulted in invalid IR being generated for any types other than f32 and i32 when lowering affine.parallel to SCF. Use the actual type instead. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D99942	2021-04-06 14:00:15 +02:00
Rob Suderman	eb1b55c652	[mlir][tosa] Add tosa.reduce_any and tosa.reduce_all linalg lowering Added lowerings for Tosa's reduce boolean operations. This includes a fix to maintain the output rank of reduce operations. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D99228	2021-04-02 14:32:18 -07:00
Lei Zhang	6dd07fa513	[mlir][spirv] Add utilities for push constant value This commit add utility functions for creating push constant storage variable and loading values from it. Along the way, performs some clean up: * Deleted `setABIAttrs`, which is just a 4-liner function with one user. * Moved `SPIRVConverstionTarget` into `mlir` namespace, to be consistent with `SPIRVTypeConverter` and `LLVMConversionTarget`. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D99725	2021-04-02 07:51:07 -04:00
natashaknk	a879a1b034	[mlir][tosa] Add tosa.reciprocal and tosa.sigmoid lowerings Lowering reciprocal and sigmoid elementwise operations to linalg dialect. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D99676	2021-03-31 14:21:03 -07:00
Matthias Springer	95f8135043	[mlir] Change vector.transfer_read/write "masked" attribute to "in_bounds". This is in preparation for adding a new "mask" operand. The existing "masked" attribute was used to specify dimensions that may be out-of-bounds. Such transfers can be lowered to masked load/stores. The new "in_bounds" attribute is used to specify dimensions that are guaranteed to be within bounds. (Semantics is inverted.) Differential Revision: https://reviews.llvm.org/D99639	2021-03-31 18:04:22 +09:00
Mehdi Amini	973ddb7d6e	Define a `NoTerminator` traits that allows operations with a single block region to not provide a terminator In particular for Graph Regions, the terminator needs is just a historical artifact of the generalization of MLIR from CFG region. Operations like Module don't need a terminator, and before Module migrated to be an operation with region there wasn't any needed. To validate the feature, the ModuleOp is migrated to use this trait and the ModuleTerminator operation is deleted. This patch is likely to break clients, if you're in this case: - you may iterate on a ModuleOp with `getBody()->without_terminator()`, the solution is simple: just remove the ->without_terminator! - you created a builder with `Builder::atBlockTerminator(module_body)`, just use `Builder::atBlockEnd(module_body)` instead. - you were handling ModuleTerminator: it isn't needed anymore. - for generic code, a `Block::mayNotHaveTerminator()` may be used. Differential Revision: https://reviews.llvm.org/D98468	2021-03-25 03:59:03 +00:00
Rob Suderman	f5ba3eea67	[mlir][tosa] Add tosa.bitwise_not lowering to constant and xor Lowering of bitwise_not to linalg dialect using a xor operation with a constant of all-bits-one. Differential Revision: https://reviews.llvm.org/D99221	2021-03-24 17:27:27 -07:00
Alex Zinenko	b3386a734e	[mlir] introduce data layout entry for index type Index type is an integer type of target-specific bitwidth present in many MLIR operations (loops, memory accesses). Converting values of this type to fixed-size integers has always been problematic. Introduce a data layout entry to specify the bitwidth of `index` in a given layout scope, defaulting to 64 bits, which is a commonly used assumption, e.g., in constants. Port builtin-to-LLVM type conversion to use this data layout entry when converting `index` type and untie it from pointer size. This is particularly relevant for GPU targets. Keep a possibility to forcibly override the index type in lowerings. Depends On D98525 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98937	2021-03-24 15:13:42 +01:00
Vladislav Vinogradov	18a2f479bf	[mlir][NFC] Replace `getMemorySpaceAsInt` with `getMemorySpace` where possible Use new `MemRefType::getMemorySpace` method with generic Attribute in cases, where there is no specific logic around the memory space. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99154	2021-03-24 13:23:59 +03:00
Rob Suderman	28e6420744	[mlir][tosa] Add tosa.argmax to linalg lowering Tosa's argmax lowering is representable as a linalg.indexed_generic operation. Include the lowering to this type for both integer and floating point types. Differential Revision: https://reviews.llvm.org/D99137	2021-03-23 16:06:55 -07:00
Rob Suderman	4157a079af	[mlir][tosa] Add tosa.pad to linalg.pad operation Lowers from tosa's pad op to the linalg equivalent for floating, integer, and quantized values. Differential Revision: https://reviews.llvm.org/D98990	2021-03-23 14:15:48 -07:00
River Riddle	76f3c2f3f3	[mlir][Pattern] Add better support for using interfaces/traits to match root operations in rewrite patterns To match an interface or trait, users currently have to use the `MatchAny` tag. This tag can be quite problematic for compile time for things like the canonicalizer, as the `MatchAny` patterns may get applied to every operation. This revision adds better support by bucketing interface/trait patterns based on which registered operations have them registered. This means that moving forward we will only attempt to match these patterns to operations that have this interface registered. Two simplify defining patterns that match traits and interfaces, two new utility classes have been added: OpTraitRewritePattern and OpInterfaceRewritePattern. Differential Revision: https://reviews.llvm.org/D98986	2021-03-23 14:05:33 -07:00
Rob Suderman	2d72b675d5	[mlir][tosa] Add tosa.tile to linalg.generic lowering Tiling operations are generic operations with modified indexing. Updated to to linalg lowerings to perform this lowering. Differential Revision: https://reviews.llvm.org/D99113	2021-03-23 13:13:54 -07:00
natashaknk	e20911b5c0	[mlir][tosa] Add tosa.matmul and tosa.fully_connected lowering Adds lowerings for matmul and fully_connected. Only supports 2D tensors for inputs and weights, and 1D tensors for bias. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D99211	2021-03-23 13:09:53 -07:00
Chris Lattner	79d7f618af	Rename FrozenRewritePatternList -> FrozenRewritePatternSet; NFC. This nicely aligns the naming with RewritePatternSet. This type isn't as widely used, but we keep a using declaration in to help with downstream consumption of this change. Differential Revision: https://reviews.llvm.org/D99131	2021-03-22 17:40:45 -07:00
Chris Lattner	dc4e913be9	[PatternMatch] Big mechanical rename OwningRewritePatternList -> RewritePatternSet and insert -> add. NFC This doesn't change APIs, this just cleans up the many in-tree uses of these names to use the new preferred names. We'll keep the old names around for a couple weeks to help transitions. Differential Revision: https://reviews.llvm.org/D99127	2021-03-22 17:20:50 -07:00
Rob Suderman	d7c44a5c78	[mlir][tosa] Fix tosa.mul to use tosa.apply_scale Multiply-shift requires wider compute types or CPU specific code to avoid premature truncation, apply_shift fixes this issue Also, Tosa's mul op supports different input / output types. Added path that sign-extends input values to int-32 values before multiplying. Differential Revision: https://reviews.llvm.org/D99011	2021-03-22 11:01:35 -07:00
Chris Lattner	1d909c9a35	Remove the extraneous MLIRContext argument from populateWithGenerated. NFC.	2021-03-21 10:38:35 -07:00
Chris Lattner	3a506b31a3	Change OwningRewritePatternList to carry an MLIRContext with it. This updates the codebase to pass the context when creating an instance of OwningRewritePatternList, and starts removing extraneous MLIRContext parameters. There are many many more to be removed. Differential Revision: https://reviews.llvm.org/D99028	2021-03-21 10:06:31 -07:00
Rob Suderman	e990fa2170	[mlir][tosa] Add tosa.reverse lowering to linalg.generic Reverse lowers to a linalg.generic op by reversing the read order in the index map. Differential Revision: https://reviews.llvm.org/D98997	2021-03-19 21:46:47 -07:00
Rob Suderman	47286fc530	[mlir][tosa] Add tosa.cast to linalg lowering Handles lowering from the tosa CastOp to the equivalent linalg lowering. It includes support for interchange between bool, int, and floating point. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98828	2021-03-19 11:48:37 -07:00
Rob Suderman	1b7498120d	[mlir][tosa] Add tosa.logical_* to linalg lowerings Adds lowerings for logical_* boolean operations. Each of these ops only operate on booleans allowing simple lowerings. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D98910	2021-03-19 11:30:42 -07:00
Christian Sigg	a5f9cda173	[mlir] Rename gpu-to-llvm pass implementation file Also remove populate patterns function and binary annotation name option. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98930	2021-03-19 13:58:13 +01:00
Christian Sigg	74ffe8dc59	[mlir] Remove ConvertKernelFuncToBlob All users have been converted to gpu::SerializeToBlobPass. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98928	2021-03-19 09:33:47 +01:00
Rob Suderman	286a9d467e	[mlir][tosa] Add lowering for tosa.rescale to linalg.generic This adds a tosa.apply_scale operation that handles the scaling operation common to quantized operatons. This scalar operation is lowered in TosaToStandard. We use a separate ApplyScale factorization as this is a replicable pattern within TOSA. ApplyScale can be reused within pool/convolution/mul/matmul for their quantized variants. Tests are added to both tosa-to-standard and tosa-to-linalg-on-tensors that verify each pass is correct. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D98753	2021-03-18 16:14:05 -07:00
Rob Suderman	5627564fe0	[mlir][tosa] Add tosa.concat to subtensor inserts lowering Includes lowering for tosa.concat with indice computation with subtensor insert operations. Includes tests along two different indices. Differential Revision: https://reviews.llvm.org/D98813	2021-03-18 15:59:07 -07:00
thomasraoux	44f24f3996	[mlir] Fix build failure due to `1a572f4`	2021-03-18 14:58:32 -07:00
thomasraoux	1a572f4509	[mlir] Add vector op support to cuda-runner including vector.print Differential Revision: https://reviews.llvm.org/D97346	2021-03-18 13:03:08 -07:00
Rob Suderman	f4bb076a44	[mlir][tosa] Add tosa.slice to std.subtensor lowering Lowering to subtensor is added for tosa.slice operator. Differential Revision: https://reviews.llvm.org/D98825	2021-03-17 17:28:18 -07:00
Vladislav Vinogradov	fee9054232	[mlir][ODS] Support specialized Attribute class for Enums Add a feature to `EnumAttr` definition to generate specialized Attribute class for the particular enumeration. This class will inherit `StringAttr` or `IntegerAttr` and will override `classof` and `getValue` methods. With this class the enumeration predicate can be checked with simple RTTI calls (`isa`, `dyn_cast`) and it will return the typed enumeration directly instead of raw string/integer. Based on the following discussion: https://llvm.discourse.group/t/rfc-add-enum-attribute-decorator-class/2252 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97836	2021-03-17 16:44:24 +03:00
Stephan Herhut	5837fdc4cc	[mlir][llvm] Pass struct results as parameter in c wrapper Returning structs directly in LLVM does not necessarily align with the C ABI of the platform. This might happen to work on Linux but for small structs this breaks on Windows. With this change, the wrappers work platform independently. Differential Revision: https://reviews.llvm.org/D98725	2021-03-17 12:58:52 +01:00
Gaurav Shukla	8e3075c2b0	[MLIR] Fix lowering of Affine IfOp in the presence of yield values. This commit fixes the lowering of `Affine.IfOp` to `SCF.IfOp` in the presence of yield values. These changes have been made as a part of `-lower-affine` pass. Differential Revision: https://reviews.llvm.org/D98760	2021-03-17 16:33:32 +05:30
River Riddle	1f13963ec1	[mlir][pdl] Cast the OperationPosition to Position to fix MSVC miscompile If we don't cast, MSVC picks an overload that hasn't been defined yet(not sure why) and miscompiles.	2021-03-16 16:11:14 -07:00
Eugene Zhulenev	74f6138bd9	[mlir] Add lowering from math::Log1p to LLVM [mlir] Add lowering from math::Log1p to LLVM Reviewed By: cota Differential Revision: https://reviews.llvm.org/D98662	2021-03-16 15:59:09 -07:00
River Riddle	3a833a0e0e	[mlir][PDL] Add support for variadic operands and results in the PDL Interpreter This revision extends the PDL Interpreter dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl_interp.check_types : Compare a range of types with a known range. * pdl_interp.create_types : Create a constant range of types. * pdl_interp.get_operands : Get a range of operands from an operation. * pdl_interp.get_results : Get a range of results from an operation. * pdl_interp.switch_types : Switch on a range of types. This revision handles adding support in the interpreter dialect and the conversion from PDL to PDLInterp. Support for variadic operands and results in the bytecode will be added in a followup revision. Differential Revision: https://reviews.llvm.org/D95722	2021-03-16 13:20:19 -07:00
River Riddle	02c4c0d5b2	[mlir][pdl] Remove CreateNativeOp in favor of a more general ApplyNativeRewriteOp. This has a numerous amount of benefits, given the overly clunky nature of CreateNativeOp: * Users can now call into arbitrary rewrite functions from inside of PDL, allowing for more natural interleaving of PDL/C++ and enabling for more of the pattern to be in PDL. * Removes the need for an additional set of C++ functions/registry/etc. The new ApplyNativeRewriteOp will use the same PDLRewriteFunction as the existing RewriteOp. This reduces the API surface area exposed to users. This revision also introduces a new PDLResultList class. This class is used to provide results of native rewrite functions back to PDL. We introduce a new class instead of using a SmallVector to simplify the work necessary for variadics, given that ranges will require some changes to the structure of PDLValue. Differential Revision: https://reviews.llvm.org/D95720	2021-03-16 13:20:18 -07:00
River Riddle	242762c9a3	[mlir][pdl] Restructure how results are represented. Up until now, results have been represented as additional results to a pdl.operation. This is fairly clunky, as it mismatches the representation of the rest of the IR constructs(e.g. pdl.operand) and also isn't a viable representation for operations returned by pdl.create_native. This representation also creates much more difficult problems when factoring in support for variadic result groups, optional results, etc. To resolve some of these problems, and simplify adding support for variable length results, this revision extracts the representation for results out of pdl.operation in the form of a new `pdl.result` operation. This operation returns the result of an operation at a given index, e.g.: ``` %root = pdl.operation ... %result = pdl.result 0 of %root ``` Differential Revision: https://reviews.llvm.org/D95719	2021-03-16 13:20:18 -07:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00
Alex Zinenko	a776942ba1	[mlir] squash LLVM_AVX512 dialect into AVX512 The dialect separation was introduced to demarkate ops operating in different type systems. This is no longer the case after the LLVM dialect has migrated to using built-in vector types, so the original reason for separation is no longer valid. Squash the two dialects into one. The code size decrease isn't quite large: the ops originally in LLVM_AVX512 are preserved because they match LLVM IR intrinsics specialized for vector element bitwidth. However, it is still conceptually beneficial to have only one dialect. I originally considered to use Tablegen multiclasses to define both the type-polymorphic op and its two intrinsic-related instantiations, but decided against it given both the complexity of the required Tablegen input and its dissimilarity with the rest of ODS-defined ops, both potentially resulting in very poor maintainability. Depends On D98327 Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98328	2021-03-10 13:07:26 +01:00
Christian Sigg	4d295cf5b5	[mlir] Add base class for GpuKernelToBlobPass Instead of configuring kernel-to-cubin/rocdl lowering through callbacks, introduce a base class that target-specific passes can derive from. Put the base class in GPU/Transforms, according to the discussion in D98203. The mlir-cuda-runner will go away shortly, and the mlir-rocdl-runner as well at some point. I therefore kept the existing code path working and will remove it in a separate step. Depends On D98168 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98279	2021-03-10 12:14:43 +01:00
Christian Sigg	840ff84d33	[mlir] Default for gpu-binary-annotation option. Provide default for gpuBinaryAnnotation so that we don't need to specify it in tests. The annotation likely only needs to be target specific if we want to lower to e.g. both CUDA and ROCDL. Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D98168	2021-03-09 21:01:50 +01:00
Mehdi Amini	038f2a337d	Move LLVM::FMFAttr definition to TableGen (NFC) This is using the new Attribute storage generation support in TableGen to define the LLVM FastMathFlags. Differential Revision: https://reviews.llvm.org/D98007	2021-03-09 05:29:54 +00:00
Rob Suderman	cb3542e1ca	[MLIR][TOSA] Added lowerings for Reduce operations to Linalg Lowerings for min, max, prod, and sum reduction operations on int and float values. This includes reduction tests for both cases. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97893	2021-03-08 10:57:19 -08:00
Benjamin Kramer	42c195f0ec	[mlir][Shape] Allow shape.split_at to return extent tensors and lower it to std.subtensor split_at can return an error if the split index is out of bounds. If the user knows that the index can never be out of bounds it's safe to use extent tensors. This has a straight-forward lowering to std.subtensor. Differential Revision: https://reviews.llvm.org/D98177	2021-03-08 16:48:05 +01:00
KareemErgawy-TomTom	3fb384d50e	[MLIR][SPIRV] Rename `spv.selection` to `spv.mlir.selection`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98014	2021-03-06 16:05:31 +01:00
Lei Zhang	bb6f5c8314	[mlir][spirv] Convert tensor.extract for very small tensors Normally tensors will be stored in buffers before converting to SPIR-V, given that is how a large amount of data is sent to the GPU. However, SPIR-V supports converting from tensors directly too. This is for the cases where the tensor just contains a small amount of elements and it makes sense to directly inline them as a small data array in the shader. To handle this, internally the conversion might create new local variables. SPIR-V consumers in GPU drivers may or may not optimize that away. So this has implications over register pressure. Therefore, a threshold is used to control when the patterns should kick in. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D98052	2021-03-06 08:03:36 -05:00
Matthias Springer	acce0ea70c	[mlir][AVX512] Add mask.compress to AVX512 dialect. Adds mask.compress to the AVX512 dialect and defines a lowering to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D97611	2021-03-06 10:02:48 +09:00
Alex Zinenko	6410ee0d09	[mlir] Squash LLVM_ArmNeon dialect into ArmNeon The two dialects are largely redundant. The former was introduced as a mirror of the latter operating on LLVM dialect types. This is no longer necessary since the LLVM dialect operates on built-in types. Combine the two dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98060	2021-03-05 23:33:32 +01:00
KareemErgawy-TomTom	29812a6195	[MLIR][SPIRV] Rename `spv.loop` to `spv.mlir.loop`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97918	2021-03-05 15:44:30 -05:00
KareemErgawy-TomTom	c74eb466d2	[MLIR][SPIRV] Rename `spv.globalVariable` to `spv.GlobalVariable`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97919	2021-03-04 16:24:59 -05:00
KareemErgawy-TomTom	5abdca47b3	[MLIR][SPIRV] Rename `spv.constant` to `spv.Constant`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from `spv.camelCase` to `spv.CamelCase` everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97917	2021-03-04 16:15:56 -05:00
Alex Zinenko	19db802e7b	[mlir] make implementations of translation to LLVM IR interfaces private There is no need for the interface implementations to be exposed, opaque registration functions are sufficient for all users, similarly to passes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97852	2021-03-04 09:16:32 +01:00
River Riddle	e07c968a6d	[mlir][pdl][NFC] Rename InputOp to OperandOp This better matches the actual IR concept that is being modeled, and is consistent with how the rest of PDL is structured. Differential Revision: https://reviews.llvm.org/D95718	2021-03-03 15:48:00 -08:00
River Riddle	3dfa86149e	[mlir][IR] Refactor the internal implementation of Value The current implementation of Value involves a pointer int pair with several different kinds of owners, i.e. BlockArgumentImpl, Operation , TrailingOpResult. This design arose from the desire to save memory overhead for operations that have a very small number of results (generally 0-2). There are, unfortunately, many problematic aspects of the current implementation that make Values difficult to work with or just inefficient. Operation result types are stored as a separate array on the Operation. This is very inefficient for many reasons: we use TupleType for multiple results, which can lead to huge amounts of memory usage if multi-result operations change types frequently(they do). It also means that simple methods like Value::getType/Value::setType now require complex logic to get to the desired type. Value only has one pointer bit free, severely limiting the ability to use it in things like PointerUnion/PointerIntPair. Given that we store the kind of a Value along with the "owner" pointer, we only leave one bit free for users of Value. This creates situations where we end up nesting PointerUnions to be able to use Value in one. As noted above, most of the methods in Value need to branch on at least 3 different cases which is both inefficient, possibly error prone, and verbose. The current storage of results also creates problems for utilities like ValueRange/TypeRange, which want to efficiently store base pointers to ranges (of which Operation isn't really useful as one). This revision greatly simplifies the implementation of Value by the introduction of a new ValueImpl class. This class contains all of the state shared between all of the various derived value classes; i.e. the use list, the type, and the kind. This shared implementation class provides several large benefits: * Most of the methods on value are now branchless, and often one-liners. * The "kind" of the value is now stored in ValueImpl instead of Value This frees up all of Value's pointer bits, allowing for users to take full advantage of PointerUnion/PointerIntPair/etc. It also allows for storing more operation results as "inline", 6 now instead of 2, freeing up 1 word per new inline result. * Operation result types are now stored in the result, instead of a side array This drops the size of zero-result operations by 1 word. It also removes the memory crushing use of TupleType for operations results (which could lead up to hundreds of megabytes of "dead" TupleTypes in the context). This also allowed restructured ValueRange, making it simpler and one word smaller. This revision does come with two conceptual downsides: * Operation::getResultTypes no longer returns an ArrayRef<Type> This conceptually makes some usages slower, as the iterator increment is slightly more complex. * OpResult::getOwner is slightly more expensive, as it now requires a little bit of arithmetic From profiling, neither of the conceptual downsides have resulted in any perceivable hit to performance. Given the advantages of the new design, most compiles are slightly faster. Differential Revision: https://reviews.llvm.org/D97804	2021-03-03 14:33:37 -08:00
Benjamin Kramer	73cb58dc48	[mlir][Shape] Lower cstr_eq to shape_eq + assert Differential Revision: https://reviews.llvm.org/D97860	2021-03-03 17:22:28 +01:00
Benjamin Kramer	24acadef8a	[mlir][Shape] Make shape_eq nary This gets rid of a dubious shape_eq %a, %a fold, that folds shape_eq even if %a is not an Attribute. Differential Revision: https://reviews.llvm.org/D97728	2021-03-03 16:26:40 +01:00
Vladislav Vinogradov	37eca08e5b	[mlir][NFC] Rename `MemRefType::getMemorySpace` to `getMemorySpaceAsInt` Just a pure method renaming. It is a preparation step for replacing "memory space as raw integer" with more generic "memory space as attribute", which will be done in separate commit. The `MemRefType::getMemorySpace` method will return `Attribute` and become the main API, while `getMemorySpaceAsInt` will be declared as deprecated and will be replaced in all in-tree dialects (also in separate commits). Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D97476	2021-03-02 11:08:54 +03:00
Stella Stamenova	801067f4c0	[mlir][lldb] Fix several gcc warnings in mlir and lldb These warnings are raised when compiling with gcc due to either having too few or too many commas, or in the case of lldb, the possibility of a nullptr. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97586	2021-03-01 13:48:22 -08:00
Rob Suderman	087bc20fe4	[MLIR][TOSA] Lower tosa.transpose to linalg.generic Lowers the transpose operation to a generic linalg op when permutations is a constant value. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97508	2021-03-01 11:09:49 -08:00
Rob Suderman	16abacaea9	[MLIR][TOSA] Resubmit Tosa to Standard/SCF Lowerings (const, if, while)" Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Resubmission of https://reviews.llvm.org/D97518 with ASAN fixes. Differential Revision: https://reviews.llvm.org/D97529	2021-02-26 17:44:12 -08:00
Rob Suderman	f685c9ac86	[MLIR][TOSA] Lower tosa.identity and tosa.identitiyn to linalg Both identity ops can be loweried by replacing their results with their inputs. We keep this as a linalg lowering as other backends may choose to create copies. Differential Revision: https://reviews.llvm.org/D97517	2021-02-26 15:45:07 -08:00
Aart Bik	df5ccf5a94	[mlir][vector] add higher dimensional support to gather/scatter Similar to mask-load/store and compress/expand, the gather and scatter operation now allow for higher dimension uses. Note that to support the mixed-type index, the new syntax is: vector.gather %base [%i,%j] [%kvector] .... The first client of this generalization is the sparse compiler, which needs to define scatter and gathers on dense operands of higher dimensions too. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97422	2021-02-26 14:20:19 -08:00
Rob Suderman	caccddc52a	[MLIR][TOSA] Lower tosa.reshape to linalg.reshape Lowering from the tosa.reshape op to linalg.reshape. For same-rank or non-collapsed/expanded cases two linalg.reshapes are inserted. Differential Revision: https://reviews.llvm.org/D97439	2021-02-26 12:57:57 -08:00
Benjamin Kramer	4941fef9c4	[mlir] Silence some deprecation warnings after `dffc487b07`	2021-02-26 15:15:56 +01:00
Marius Brehler	56774bdda5	[mlir] Replace deprecated 'getAttrs' 'getAttrs' has been explicitly marked deprecated. This patch refactors to use Operation::getAttrs(). Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D97546	2021-02-26 14:52:40 +01:00
Christian Sigg	dffc487b07	[mlir] Mark OpState::removeAttr() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97530	2021-02-26 12:04:41 +01:00
Rob Suderman	c47aa3c8de	Revert [MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) This reverts commit `a813e9be5b`. Results in an ASAN failure due to bypassing rewriter. Differential Revision: https://reviews.llvm.org/D97518	2021-02-25 18:05:16 -08:00
Rob Suderman	a813e9be5b	[MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97352	2021-02-25 14:35:21 -08:00
Christian Sigg	8c074cb0b7	[mlir] Mark OpState::getAttrs() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97464	2021-02-25 20:54:42 +01:00
Christian Sigg	f03826f896	Pass GPU events instead of streams across async regions. Lower !gpu.async.tokens returned from async.execute regions to events instead of streams. Make !gpu.async.token returned from !async.execute single-use. This allows creating one event per use and destroying them without leaking or ref-counting. Technically we only need this for stream/event-based lowering. I kept the code separate from the rest of the gpu-async-region pass so that we can make this optional or move to a separate pass as needed. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D96965	2021-02-25 13:18:18 +01:00
Lei Zhang	5f8a80882b	[mlir] Add constBuilderCall to TypeAttr to simplify builders Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97344	2021-02-24 13:04:03 -05:00
Christian Sigg	eb8d6af5e4	[mlir] Specify cuda-runner pass pipeline as command line options. The cuda-runner registers two pass pipelines for nested passes, so that we don't have to use verbose textual pass pipeline specification. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97091	2021-02-24 14:36:52 +01:00
River Riddle	ddd556f10e	[mlir][pdl] Fix bug when ordering predicates We should be ordering predicates with higher primary/secondary sums first, but we are currently ordering them last. This allows for predicates more frequently encountered to be checked first. Differential Revision: https://reviews.llvm.org/D95715	2021-02-22 19:02:48 -08:00
Andrew Pritchard	08c681f645	Perform memory accesses in the same addrspace as the corresponding memref. It's not necessarily the case on all architectures that all memory is addressable in addrspace 0, so casting the pointer to addrspace 0 is liable to cause problems. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D96380	2021-02-18 12:36:16 -08:00
natashaknk	25b4a6a7f0	[MLIR][TOSA] Add lowering from TOSA to Linalg for math-based and elementwise ops This patch adds lowering to Linalg for the following TOSA ops: negate, rsqrt, mul, select, clamp and reluN and includes support for signless integer and floating point types Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D96924	2021-02-18 12:10:10 -08:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Eugene Zhulenev	519f5917b4	[mlir] Add fma operation to std dialect Will remove `vector.fma` operation in the followup CLs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96801	2021-02-17 10:06:01 -08:00
Hanhan Wang	c80484e16e	[mlir][StandardToSPIRV] Add support for lowering trunci to SPIR-V to i1 types. Add a pattern to converting some value to a boolean. spirv.S/UConvert does not work on i1 types. Thus, the pattern is lowered to cmpi + select. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96851	2021-02-17 07:23:41 -08:00
Adrian Kuegel	07cc77187a	Lower math.expm1 to intrinsics in the GPUToNVVM and GPUToROCDL conversions. This adds the lowering for expm1 for GPU backends. Differential Revision: https://reviews.llvm.org/D96756	2021-02-16 10:23:42 +01:00
Tres Popp	3842d4b679	Make shape.is_broadcastable/shape.cstr_broadcastable nary This corresponds with the previous work to make shape.broadcast nary. Additionally, simplify the ConvertShapeConstraints pass. It now doesn't lower an implicit shape.is_broadcastable. This is still the same in combination with shape-to-standard when the 2 passes are used in either order. Differential Revision: https://reviews.llvm.org/D96401	2021-02-15 16:05:32 +01:00
Mehdi Amini	aa4e466caa	[mlir][Linalg] Improve region support in Linalg ops This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. This reverts commit `3f22547fd1` and reland `973e133b76` with a workaround for a gcc bug that does not accept lambda default parameters: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59949 Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 19:11:24 +00:00
Diego Caballero	656674a7c4	[mlir][Vector] Align gather/scatter/expand/compress API Align the vector gather/scatter/expand/compress API with the vector load/store/maskedload/maskedstore API. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D96396	2021-02-12 20:48:38 +02:00
Diego Caballero	ee66e43a96	[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops This patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect [1]. These operations model contiguous vector loads and stores from/to memory. Their semantics are similar to the 'affine.vector_load' and 'affine.vector_store' counterparts but without the affine constraints. The most relevant feature is that these new vector operations may perform a vector load/store on memrefs with a non-vector element type, unlike 'std.load' and 'std.store' ops. This opens the representation to model more generic vector load/store scenarios: unaligned vector loads/stores, perform scalar and vector memory access on the same memref, decouple memory allocation constraints from memory accesses, etc [1]. These operations will also facilitate the progressive lowering of both Affine vector loads/stores and Vector transfer reads/writes for those that read/write contiguous slices from/to memory. In particular, this patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect, implements their lowering to the LLVM dialect, and changes the lowering of 'affine.vector_load' and 'affine.vector_store' ops to the new vector ops. The lowering of Vector transfer reads/writes will be implemented in the future, probably as an independent pass. The API of 'vector.maskedload' and 'vector.maskedstore' has also been changed slightly to align it with the transfer read/write ops and the vector new ops. This will improve reusability among all these operations. For example, the lowering of 'vector.load', 'vector.store', 'vector.maskedload' and 'vector.maskedstore' to the LLVM dialect is implemented with a single template conversion pattern. [1] https://llvm.discourse.group/t/memref-type-and-data-layout/ Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96185	2021-02-12 20:48:37 +02:00
Mehdi Amini	3f22547fd1	Revert "[mlir][Linalg] Improve region support in Linalg ops." This reverts commit `973e133b76`. It triggers an issue in gcc5 that require investigation, the build is broken with: /tmp/ccdpj3B9.s: Assembler messages: /tmp/ccdpj3B9.s:5821: Error: symbol `_ZNSt17_Function_handlerIFvjjEUljjE2_E9_M_invokeERKSt9_Any_dataOjS6_' is already defined /tmp/ccdpj3B9.s:5860: Error: symbol `_ZNSt14_Function_base13_Base_managerIUljjE2_E10_M_managerERSt9_Any_dataRKS3_St18_Manager_operation' is already defined	2021-02-12 18:15:51 +00:00
Nicolas Vasilache	973e133b76	[mlir][Linalg] Improve region support in Linalg ops. This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 14:51:03 +00:00
Benjamin Kramer	530d6ea97b	[mlir][spirv] Lower sexti -> SConvert	2021-02-12 15:04:12 +01:00
Alex Zinenko	4c4876c314	[mlir] Use target-specific GPU kernel attributes in lowering pipelines Until now, the GPU translation to NVVM or ROCDL intrinsics relied on the presence of the generic `gpu.kernel` attribute to attach additional LLVM IR metadata to the relevant functions. This would be problematic if each dialect were to handle the conversion of its own options, which is the intended direction for the translation infrastructure. Introduce `nvvm.kernel` and `rocdl.kernel` in addition to `gpu.kernel` and base translation on these new attributes instead. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D96591	2021-02-12 14:09:24 +01:00
Stephan Herhut	4348d8ab7f	[mlir][math] Split off the math dialect. This does not split transformations, yet. Those will be done as future clean ups. Differential Revision: https://reviews.llvm.org/D96272	2021-02-12 10:55:12 +01:00
Nicolas Vasilache	5bc4f8846c	s[mlir] Tighten computation of inferred SubView result type. The AffineMap in the MemRef inferred by SubViewOp may have uncompressed symbols which result in type mismatch on otherwise unused symbols. Make the computation of the AffineMap compress those unused symbols which results in better canonical types. Additionally, improve the error message to report which inferred type was expected. Differential Revision: https://reviews.llvm.org/D96551	2021-02-11 22:38:16 +00:00
Nicolas Vasilache	e332c22cdf	[mlir][LLVM] NFC - Refactor a lookupOrCreateFn to reuse common function creation. Differential revision: https://reviews.llvm.org/D96488	2021-02-11 15:52:33 +00:00
Stephan Herhut	33a58c1c5c	[mlir][gpu] Allow all dialects in SCF to GPU conversion. With the standard dialect being split up, the set of dialects that are used when converting to GPU is growing. This change modifies the SCFToGpu pass to allow all operations inside launch bodies. Differential Revision: https://reviews.llvm.org/D96480	2021-02-11 10:02:26 +01:00
Rob Suderman	c19a412809	[MLIR][TOSA] Tosa elementwise broadcasting Added support for broadcasting size-1 dimensions for TOSA elemtnwise operations. Differential Revision: https://reviews.llvm.org/D96190	2021-02-10 15:28:18 -08:00
Tres Popp	f30f347da1	[mlir][shape] Generalize broadcast to a variadic number of shapes Previously broadcast was a binary op. Now it can support more inputs. This has been changed in such a way that for now, this is an NFC for all broadcast operations that were previously legal. Differential Revision: https://reviews.llvm.org/D95777	2021-02-10 08:31:28 +01:00
Tres Popp	c2c83e97c3	Revert "Revert "Reorder MLIRContext location in BuiltinAttributes.h"" This reverts commit `511dd4f438` along with a couple fixes. Original message: Now the context is the first, rather than the last input. This better matches the rest of the infrastructure and makes it easier to move these types to being declaratively specified. Phabricator: https://reviews.llvm.org/D96111	2021-02-08 10:39:58 +01:00
Tres Popp	511dd4f438	Revert "Reorder MLIRContext location in BuiltinAttributes.h" This reverts commit `7827753f98`.	2021-02-08 09:32:42 +01:00
Tres Popp	7827753f98	Reorder MLIRContext location in BuiltinAttributes.h Now the context is the first, rather than the last input. This better matches the rest of the infrastructure and makes it easier to move these types to being declaratively specified. Differential Revision: https://reviews.llvm.org/D96111	2021-02-08 09:28:09 +01:00
Lei Zhang	9f622b3d5d	[mlir][spirv] Add more vector conversion patterns This patch introduces a few more straightforward patterns to convert vector ops operating on 1-4 element vectors to their corresponding SPIR-V counterparts. This patch also enables converting vector<1xT> to T. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D96042	2021-02-05 09:11:16 -05:00
Alex Zinenko	1b101038dc	[mlir] Turn Linalg to LLVM into a partial conversion Historically, Linalg To LLVM conversion subsumed numerous other conversions, including (affine) loop lowerings to CFG and conversions from the Standard and Vector dialects to the LLVM dialect. This was due to the insufficient support for partial conversions in the infrastructure that essentially required conversions that involve type change (in this case, !linalg.range to !llvm.struct) to be performed in a single conversion sweep. This is no longer the case so remove the subsumed conversions and run them as separate passes when necessary. Depends On D95317 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96008	2021-02-05 14:31:19 +01:00
River Riddle	e21adfa32d	[mlir] Mark LogicalResult as LLVM_NODISCARD This makes ignoring a result explicit by the user, and helps to prevent accidental errors with dropped results. Marking LogicalResult as no discard was always the intention from the beginning, but got lost along the way. Differential Revision: https://reviews.llvm.org/D95841	2021-02-04 15:10:10 -08:00
Nicolas Vasilache	f4ac9f0334	[mlir][Linalg] Drop SliceOp This op is subsumed by rank-reducing SubViewOp and has become useless. Differential revision: https://reviews.llvm.org/D95317	2021-02-04 11:22:01 +00:00
Alex Zinenko	ba87f99168	[mlir] make vector to llvm conversion truly partial Historically, the Vector to LLVM dialect conversion subsumed the Standard to LLVM dialect conversion patterns. This was necessary because the conversion infrastructure did not have sufficient support for reconciling type conversions. This support is now available. Only keep the patterns related to the Vector dialect in the Vector to LLVM conversion and require type casts operations to be inserted if necessary. These casts will be removed by following conversions if possible. Update integration tests to also run the Standard to LLVM conversion. There is a significant amount of test churn, which is due to (a) unnecessarily strict tests in VectorToLLVM and (b) many patterns actually targeting Standard dialect ops instead of LLVM dialect ops leading to tests actually exercising a Vector->Standard->LLVM conversion. This churn is a good illustration of the reason to make the conversion partial: now the tests only check the code in the Vector to LLVM conversion and will not be randomly broken by changes in Standard to LLVM conversion. Arguably, it may be possible to extract Vector to Standard patterns into a separate pass, but given the ongoing splitting of the Standard dialect, such pass will be short-lived and will require further refactoring. Depends On D95626 Reviewed By: nicolasvasilache, aartbik Differential Revision: https://reviews.llvm.org/D95685	2021-02-04 11:33:24 +01:00
Christian Sigg	8d73bee4ed	[mlir] Add gpu async integration test. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D94421	2021-02-03 21:45:23 +01:00
Diego Caballero	cf5c517c05	[mlir][Vector] Add lowering to LLVM for vector.bitcast Add the conversion pattern for vector.bitcast to lower it to the LLVM Dialect. Reviewed By: ThomasRaoux, aartbik Differential Revision: https://reviews.llvm.org/D95579	2021-02-03 01:19:20 +02:00
Christian Sigg	5b3881691f	[mlir] Delay adding the __resume function The __resume function trips up LLVM's 'X86 DAG->DAG Instruction Selection' unless optimizations are disabled. Only adding the __resume function when it's needed allows lowering through AsyncToLLVM and LLVM without '-O0' as long as the coroutine functionality is not used. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D95868	2021-02-02 20:02:54 +01:00
Nicolas Vasilache	49c9c3a59e	[mlir][Standard] Extend n-D vector lowering to LLVM to [s\|z]exti ops. [s\|z]exti ops do not have the same operand and result type. As a consequence, the lowering of the n-D vector form needs to be relaxed a bit. This revision additionally performs a few NFC renamings of variables to make them more intuitive. Differential Revision: https://reviews.llvm.org/D95760	2021-02-02 07:45:50 +00:00
natashaknk	21724ddcb7	[MLIR][TOSA] Comparison based elementwise operations for tosa-to-linalg Comitted log, exp, maximum, minimum, comparison, ceil and floor conversions from TOSA to LinAlg. Support for signless integer and floating point. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D95839	2021-02-01 21:37:52 -08:00
Alex Zinenko	d6be277347	[mlir] turn complex-to-llvm into a partial conversion It is no longer necessary to also convert other "standard" ops along with the complex dialect: the element types are now built-in integers or floating point types, and the top-level cast between complex and struct is automatically inserted and removed in progressive lowering. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D95625	2021-01-28 19:14:01 +01:00
Nicolas Vasilache	7e6fe5c48a	[mlir] Fix subview verifier. The subview verifier in the rank-reduced case is plainly skipping verification when the resulting type is a memref with empty affine map. This is generally incorrect. Instead, form the actual expected rank-reduced MemRefType that takes into account the projections of 1's dimensions. Then, check the canonicalized expected rank-reduced type against the canonicalized candidate type. Differential Revision: https://reviews.llvm.org/D95316	2021-01-28 13:55:39 +00:00
Nicolas Vasilache	5133673df4	[mlir] Extend semantic of OffsetSizeAndStrideOpInterface. OffsetSizeAndStrideOpInterface now have the ability to specify only a leading subset of offset, sizes, strides operands/attributes. The size of that leading subset must be limited by the corresponding entry in `getArrayAttrMaxRanks` to avoid overflows. Missing trailing dimensions are assumed to span the whole range (i.e. [0 .. dim)). This brings more natural semantics to slice-like op on top of subview and is a simplifies to removing all uses of SliceOp in dependent projects. Differential revision: https://reviews.llvm.org/D95441	2021-01-27 09:02:35 +00:00
Alex Zinenko	91bd1156f3	[mlir] drop unused statics	2021-01-26 13:30:45 +01:00
Eugene Zhulenev	25f80e16d1	[mlir] Async: add a separate pass to lower from async to async.coro and async.runtime Depends On D95000 Move async.execute outlining and async -> async.runtime lowering into the separate Async transformation pass Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95311	2021-01-26 03:33:20 -08:00
Matthias Springer	90ebc489de	Add vp2intersect to AVX512 dialect. Adds vp2intersect to the AVX512 dialect and defines a lowering to the LLVM dialect. Author: Matthias Springer <springerm@google.com> Differential Revision: https://reviews.llvm.org/D95301	2021-01-26 07:32:26 +00:00
Eugene Zhulenev	d37b5393e8	[mlir:Async] Use LLVM coro operations in async.coro lowering Instead of using llvm.call operations to call LLVM coro intrinsics use Coro operations from the LLVM dialect. (This was reviewed as a part of https://reviews.llvm.org/D94923 but was lost in arc land from local branch) Differential Revision: https://reviews.llvm.org/D95405	2021-01-25 16:42:11 -08:00
Eugene Zhulenev	9c53b8e52e	[mlir:Async] Add intermediate async.coro and async.runtime operations to simplify Async to LLVM lowering [NFC] No new functionality, mostly a cleanup and one more abstraction level between Async and LLVM IR. Instead of lowering from Async to LLVM coroutines and Async Runtime API in one shot, do it progressively via async.coro and async.runtime operations. 1. Lower from async to async.runtime/coro (e.g. async.execute to function with coro setup and runtime calls) 2. Lower from async.runtime/coro to LLVM intrinsics and runtime API calls Intermediate coro/runtime operations will allow to run transformations on a higher level IR and do not try to match IR based on the LLVM::CallOp properties. Although async.coro is very close to LLVM coroutines, it is not exactly the same API, instead it is optimized for usability in async lowering, and misses a lot of details that are present in @llvm.coro intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94923	2021-01-25 14:04:33 -08:00
Lei Zhang	e27197f360	[mlir][spirv] Define spv.IsNan/spv.IsInf and add lowerings spv.Ordered/spv.Unordered are meant for OpenCL Kernel capability. For Vulkan Shader capability, we should use spv.IsNan to check whether a number is NaN. Add a new pattern for converting `std.cmpf ord\|uno` to spv.IsNan and bumped the pattern converting to spv.Ordered/spv.Unordered to a higher benefit. The SPIR-V target environment will properly select between these two patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95237	2021-01-22 13:09:33 -05:00
Hanhan Wang	2cb130f766	[mlir][StandardToSPIRV] Add support for lowering uitofp to SPIR-V - Extend spirv::ConstantOp::getZero/One to handle float, vector of int, and vector of float. - Refactor ZeroExtendI1Pattern to use getZero/One methods. - Add one more test for lowering std.zexti which extends vector<4xi1> to vector<4xi64>. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D95120	2021-01-21 22:20:32 -08:00
MaheshRavishankar	615167c9f7	[mlir]][SPIRV] Define OrderedOp and UnorderedOp and add lowerings from Standard. Define OrderedOp and UnorderedOp instructions in SPIR-V and convert cmpf operations with `ord` and `uno` tag to these instructions respectively. Differential Revision: https://reviews.llvm.org/D95098	2021-01-21 07:56:44 -08:00
Frederik Gossen	4ef38f9c12	Add log1p lowering from standard to ROCDL intrinsics Differential Revision: https://reviews.llvm.org/D95129	2021-01-21 14:02:48 +01:00
Frederik Gossen	294e2544c9	Add log1p lowering from standard to NVVM intrinsics Differential Revision: https://reviews.llvm.org/D95130	2021-01-21 14:00:38 +01:00
Alexander Belyaev	fc58bfd02f	[mlir] Remove complex ops from Standard dialect. `complex` dialect should be used instead. https://llvm.discourse.group/t/rfc-split-the-complex-dialect-from-std/2496/2 Differential Revision: https://reviews.llvm.org/D95077	2021-01-21 10:34:26 +01:00
Alexander Belyaev	b1e1bbae0e	[mlir] Add ComplexDialect to SCF->GPU pass.	2021-01-20 21:18:09 +01:00
Sean Silva	be7352c00d	[mlir][splitting std] move 2 more ops to `tensor` - DynamicTensorFromElementsOp - TensorFromElements Differential Revision: https://reviews.llvm.org/D94994	2021-01-19 13:49:25 -08:00
Lei Zhang	3a56a96664	[mlir][spirv] Define spv.GLSL.Fma and add lowerings Also changes some rewriter.create + rewriter.replaceOp calls into rewriter.replaceOpWithNewOp calls. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D94965	2021-01-19 09:14:21 -05:00
Alexander Belyaev	11f4c58c15	[mlir] Add `complex.abs`, `complex.div` and `complex.mul` to ComplexOps. Differential Revision: https://reviews.llvm.org/D94911	2021-01-19 12:09:59 +01:00
Alexander Belyaev	d0cb0d30a4	[mlir] Add Complex dialect. Differential Revision: https://reviews.llvm.org/D94764	2021-01-15 19:58:10 +01:00
Tres Popp	5cf2696317	[mlir] Remove TosaToLinalg dependency on all Passes TosaToLinalg was depending on its header file indirectly through Passes.h rather than directly. This removes that indirection. Differential Revision: https://reviews.llvm.org/D94706	2021-01-14 21:08:32 +01:00
Rob Suderman	1d973b7ded	[MLIR][TOSA] First lowerings from Tosa to Linalg Initial commit to add support for lowering from TOSA to Linalg. The focus is on the essential infrastructure for these lowerings and integration with existing passes. Includes lowerings for a subset of operations including: abs, add, sub, pow, and, or, xor, left shift, right shift, tanh Lit tests are used to validate correctness. Differential Revision: https://reviews.llvm.org/D94247	2021-01-14 11:24:23 -08:00
Alex Zinenko	bd30a796fc	[mlir] use built-in vector types instead of LLVM dialect types when possible Continue the convergence between LLVM dialect and built-in types by using the built-in vector type whenever possible, that is for fixed vectors of built-in integers and built-in floats. LLVM dialect vector type is still in use for pointers, less frequent floating point types that do not have a built-in equivalent, and scalable vectors. However, the top-level `LLVMVectorType` class has been removed in favor of free functions capable of inspecting both built-in and LLVM dialect vector types: `LLVM::getVectorElementType`, `LLVM::getNumVectorElements` and `LLVM::getFixedVectorType`. Additional work is necessary to design an implemented the extensions to built-in types so as to remove the `LLVMFixedVectorType` entirely. Note that the default output format for the built-in vectors does not have whitespace around the `x` separator, e.g., `vector<4xf32>` as opposed to the LLVM dialect vector type format that does, e.g., `!llvm.vec<4 x fp128>`. This required changing the FileCheck patterns in several tests. Reviewed By: mehdi_amini, silvas Differential Revision: https://reviews.llvm.org/D94405	2021-01-12 10:04:28 +01:00
Christian Sigg	195728c75a	[mlir] Add structural conversion to async dialect lowering. Lowering of async dialect uses a fixed type converter and therefore does not support lowering non-standard types. This revision adds a structural conversion so that non-standard types in `!async.value`s can be lowered to LLVM before lowering the async dialect itself. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D94404	2021-01-11 20:36:49 +01:00
Christian Sigg	d59ddba777	[mlir] Fix gpu-to-llvm lowering for gpu.alloc with dynamic sizes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94402	2021-01-11 15:55:48 +01:00
Christian Sigg	4fe7b16ae3	[mlir] Remove unnecessary llvm.mlir.cast in AsyncToLLVM lowering. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94400	2021-01-11 14:41:07 +01:00
Adrian Kuegel	af339f89a1	Remove redundant casts. Differential Revision: https://reviews.llvm.org/D94305	2021-01-11 08:51:47 +01:00
Lei Zhang	7c3ae48fe8	[mlir][spirv] Replace SPIRVOpLowering with OpConversionPattern The dialect conversion framework was enhanced to handle type conversion automatically. OpConversionPattern already contains a pointer to the TypeConverter. There is no need to duplicate it in a separate subclass. This removes the only reason for a SPIRVOpLowering subclass. It adapts to use core infrastructure and simplifies the code. Also added a utility function to OpConversionPattern for getting TypeConverter as a certain subclass. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D94080	2021-01-09 08:04:53 -05:00
Aart Bik	a57def30f5	[mlir][vector] generalized masked l/s and compressed l/s with indices Adding the ability to index the base address brings these operations closer to the transfer read and write semantics (with lowering advantages), ensures more consistent use in vector MLIR code (easier to read), and reduces the amount of code duplication to lower memrefs into base addresses considerably (making codegen less error-prone). Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D94278	2021-01-08 13:59:34 -08:00
River Riddle	e45840f4af	[mlir][PDL] Use ODS for defining PDL types This removes the need to define these classes and their parser/printers in C++. Differential Revision: https://reviews.llvm.org/D94135	2021-01-08 12:32:28 -08:00
Alex Zinenko	dd5165a920	[mlir] replace LLVM dialect float types with built-ins Continue the convergence between LLVM dialect and built-in types by replacing the bfloat, half, float and double LLVM dialect types with their built-in counterparts. At the API level, this is a direct replacement. At the syntax level, we change the keywords to `bf16`, `f16`, `f32` and `f64`, respectively, to be compatible with the built-in type syntax. The old keywords can still be parsed but produce a deprecation warning and will be eventually removed. Depends On D94178 Reviewed By: mehdi_amini, silvas, antiagainst Differential Revision: https://reviews.llvm.org/D94179	2021-01-08 17:38:12 +01:00
Alex Zinenko	2230bf99c7	[mlir] replace LLVMIntegerType with built-in integer type The LLVM dialect type system has been closed until now, i.e. did not support types from other dialects inside containers. While this has had obvious benefits of deriving from a common base class, it has led to some simple types being almost identical with the built-in types, namely integer and floating point types. This in turn has led to a lot of larger-scale complexity: simple types must still be converted, numerous operations that correspond to LLVM IR intrinsics are replicated to produce versions operating on either LLVM dialect or built-in types leading to quasi-duplicate dialects, lowering to the LLVM dialect is essentially required to be one-shot because of type conversion, etc. In this light, it is reasonable to trade off some local complexity in the internal implementation of LLVM dialect types for removing larger-scale system complexity. Previous commits to the LLVM dialect type system have adapted the API to support types from other dialects. Replace LLVMIntegerType with the built-in IntegerType plus additional checks that such types are signless (these are isolated in a utility function that replaced `isa<LLVMType>` and in the parser). Temporarily keep the possibility to parse `!llvm.i32` as a synonym for `i32`, but add a deprecation notice. Reviewed By: mehdi_amini, silvas, antiagainst Differential Revision: https://reviews.llvm.org/D94178	2021-01-07 19:48:31 +01:00
Kazuaki Ishizaki	f88fab5006	[mlir] NFC: fix trivial typos fix typo under include and lib directories Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D94220	2021-01-08 02:10:12 +09:00
Alex Zinenko	a7cbc32a91	[mlir] remove a use of deprecated OpState::setAttr	2021-01-07 14:20:36 +01:00
Ivan Butygin	c1d58c2b00	[mlir] Add fastmath flags support to some LLVM dialect ops Add fastmath enum, attributes to some llvm dialect ops, `FastmathFlagsInterface` op interface, and `translateModuleToLLVMIR` support. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D92485	2021-01-07 14:00:09 +01:00
Christian Sigg	badc7606b0	[mlir] Remove a number of methods from mlir::OpState that just forward to mlir::Operation. All call sites have been converted in previous changes.	2021-01-06 21:36:38 +01:00
KareemErgawy-TomTom	f60e0a91fb	[MLIR][SPIRV] Add `UnsignedOp` trait. This commit adds a new trait that can be attached to ops that have unsigned semantics. TODO: - Check if other places in code can use the new attribute (possibly in this patch). - Add a similar `SignedOp` attribute (in a new patch). Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D94068	2021-01-06 15:28:41 +01:00
Alex Zinenko	c69c9e0f0f	[mlir] Remove LLVMType, LLVM dialect types now derive Type directly BEGIN_PUBLIC [mlir] Remove LLVMType, LLVM dialect types now derive Type directly This class has become a simple `isa` hook with no proper functionality. Removing will allow us to eventually make the LLVM dialect type infrastructure open, i.e., support non-LLVM types inside container types, which itself will make the type conversion more progressive. Introduce a call `LLVM::isCompatibleType` to be used instead of `isa<LLVMType>`. For now, this is strictly equivalent. END_PUBLIC Depends On D93681 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93713	2021-01-05 17:36:54 +01:00
Chris Lattner	9eb3e564d3	[ODS] Make the getType() method on a OneResult instruction return a specific type. Implement Bug 46698, making ODS synthesize a getType() method that returns a specific C++ class for OneResult methods where we know that class. This eliminates a common source of casts in things like: myOp.getType().cast<FIRRTLType>().getPassive() because we know that myOp always returns a FIRRTLType. This also encourages op authors to type their results more tightly (which is also good for verification). I chose to implement this by splitting the OneResult trait into itself plus a OneTypedResult trait, given that many things are using `hasTrait<OneResult>` to conditionalize various logic. While this changes makes many many ops get more specific getType() results, it is generally drop-in compatible with the previous behavior because 'x.cast<T>()' is allowed when x is already known to be a T. The one exception to this is that we need declarations of the types used by ops, which is why a couple headers needed additional #includes. I updated a few things in tree to remove the now-redundant `.cast<>`'s, but there are probably many more than can be removed. Differential Revision: https://reviews.llvm.org/D93790	2020-12-26 13:52:40 -08:00
Eugene Zhulenev	61422c8b66	[mlir] Async: add support for lowering async value operands to LLVM Depends On D93592 Add support for `async.execute` async value unwrapping operands: ``` %token = async.execute(%async_value as %unwrapped : !async.value<!my.type>) { ... async.yield } ``` Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D93598	2020-12-25 02:25:20 -08:00
Eugene Zhulenev	621ad468d9	[mlir] Async: lowering async.value to LLVM 1. Add new methods to Async runtime API to support yielding async values 2. Add lowering from `async.yield` with value payload to the new runtime API calls `async.value` lowering requires that payload type is convertible to LLVM and supported by `llvm.mlir.cast` (DialectCast) operation. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D93592	2020-12-25 02:23:48 -08:00
Lei Zhang	930c74f12d	[mlir][spirv] NFC: rename SPIR-V conversion files for consistency This commit renames various SPIR-V related conversion files for consistency. It drops the "Convert" prefix to various files and fixes various comment headers. Reviewed By: hanchung, ThomasRaoux Differential Revision: https://reviews.llvm.org/D93489	2020-12-23 14:36:46 -05:00
Lei Zhang	a16fbff17d	[mlir][spirv] Create a pass for testing SCFToSPIRV patterns Previously all SCF to SPIR-V conversion patterns were tested as the -convert-gpu-to-spirv pass. That obscured the structure we want. This commit fixed it. Reviewed By: ThomasRaoux, hanchung Differential Revision: https://reviews.llvm.org/D93488	2020-12-23 14:31:55 -05:00
Lei Zhang	42980a789d	[mlir][spirv] Convert functions returning one value Reviewed By: hanchung, ThomasRaoux Differential Revision: https://reviews.llvm.org/D93468	2020-12-23 13:27:31 -05:00
Alex Zinenko	7ed9cfc7b1	[mlir] Remove static constructors from LLVMType LLVMType contains numerous static constructors that were initially introduced for API compatibility with LLVM. Most of these merely forward to arguments to `SpecificType::get` (MLIR defines classes for all types, unlike LLVM IR), while some introduce subtle semantics differences due to different modeling of MLIR types (e.g., structs are not auto-renamed in case of conflicts). Furthermore, these constructors don't match MLIR idioms and actively prevent us from making the LLVM dialect type system more open. Remove them and use `SpecificType::get` instead. Depends On D93680 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93681	2020-12-23 13:12:47 +01:00
Christian Sigg	19a0d0a40c	[mlir] Rename ConvertToLLVMPattern::isSupportedMemRefType() to isConvertibleAndHasIdentityMaps(). Reviewed By: ftynse, herhut Differential Revision: https://reviews.llvm.org/D93752	2020-12-23 12:23:29 +01:00
Christian Sigg	8451d4872e	[mlir] NFC: Remove ConvertToLLVMPattern::getDataPtr(). All call sites use getStridedElementPtr() now. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D93751	2020-12-23 11:35:01 +01:00
Chris Lattner	75a3f326c3	[IR] Add an ImplicitLocOpBuilder helper class for building IR with the same loc. One common situation is to create a lot of IR at a well known location, e.g. when doing a big rewrite from one dialect to another where you're expanding ops out into lots of other ops. For these sorts of situations, it is annoying to pass the location into every create call. As we discused in a few threads on the forum, a way to help with this is to produce a new sort of builder that holds a location and provides it to each of the create<> calls automatically. This patch implements an ImplicitLocOpBuilder class that does this. We've had good experience with this in the CIRCT project, and it makes sense to upstream to MLIR. I picked a random pass to adopt it to show the impact, but I don't think there is any particular need to force adopt it in the codebase. Differential Revision: https://reviews.llvm.org/D93717	2020-12-22 14:47:33 -08:00
Alex Zinenko	8de43b926f	[mlir] Remove instance methods from LLVMType LLVMType contains multiple instance methods that were introduced initially for compatibility with LLVM API. These methods boil down to `cast` followed by type-specific call. Arguably, they are mostly used in an LLVM cast-follows-isa anti-pattern. This doesn't connect nicely to the rest of the MLIR infrastructure and actively prevents it from making the LLVM dialect type system more open, e.g., reusing built-in types when appropriate. Remove such instance methods and replaces their uses with apporpriate casts and methods on derived classes. In some cases, the result may look slightly more verbose, but most cases should actually use a stricter subtype of LLVMType anyway and avoid the isa/cast. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93680	2020-12-22 23:34:54 +01:00
Christian Sigg	df6cbd37f5	[mlir] Lower gpu.memcpy to GPU runtime calls. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D93204	2020-12-22 22:49:19 +01:00
Prateek Gupta	3e07b0b9d3	[MLIR] Fix lowering of affine operations with return values This commit addresses the issue of lowering affine.for and affine.parallel having return values. Relevant test cases are also added. Signed-off-by: Prateek Gupta <prateek@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D93090	2020-12-22 21:44:31 +05:30
Thomas Raoux	26c8f9081b	[mlir[[vector] Extend Transfer read/write ops to support tensor types. Transfer_ops can now work on both buffers and tensor. Right now, lowering of the tensor case is not supported yet. Differential Revision: https://reviews.llvm.org/D93500	2020-12-21 08:55:04 -08:00
River Riddle	fc5cf50e89	[mlir] Remove the MutableDictionaryAttr class This class used to serve a few useful purposes: * Allowed containing a null DictionaryAttr * Provided some simple mutable API around a DictionaryAttr The first of which is no longer an issue now that there is much better caching support for attributes in general, and a cache in the context for empty dictionaries. The second results in more trouble than it's worth because it mutates the internal dictionary on every action, leading to a potentially large number of dictionary copies. NamedAttrList is a much better alternative for the second use case, and should be modified as needed to better fit it's usage as a DictionaryAttrBuilder. Differential Revision: https://reviews.llvm.org/D93442	2020-12-17 17:18:42 -08:00
Sean Silva	129d6e554e	[mlir] Move `std.tensor_cast` -> `tensor.cast`. This is almost entirely mechanical. Differential Revision: https://reviews.llvm.org/D93357	2020-12-17 16:06:56 -08:00
River Riddle	1b97cdf885	[mlir][IR][NFC] Move context/location parameters of builtin Type::get methods to the start of the parameter list This better matches the rest of the infrastructure, is much simpler, and makes it easier to move these types to being declaratively specified. Differential Revision: https://reviews.llvm.org/D93432	2020-12-17 13:01:36 -08:00
Lei Zhang	0117865412	[mlir][spirv] NFC: Shuffle code around to better follow convention This commit shuffles SPIR-V code around to better follow MLIR convention. Specifically, * Created IR/, Transforms/, Linking/, and Utils/ subdirectories and moved suitable code inside. * Created SPIRVEnums.{h\|cpp} for SPIR-V C/C++ enums generated from SPIR-V spec. Previously they are cluttered inside SPIRVTypes.{h\|cpp}. * Fixed include guards in various header files (both .h and .td). * Moved serialization tests under test/Target/SPIRV. * Renamed TableGen backend -gen-spirv-op-utils into -gen-spirv-attr-utils as it is only generating utility functions for attributes. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D93407	2020-12-17 11:03:26 -05:00
Tres Popp	c77ea40528	[mlir] Add std.pow lowering to LLVMIR Differential Revision: https://reviews.llvm.org/D93311	2020-12-15 18:54:29 +01:00
Tres Popp	9adc64539f	[mlir] Add std.powf to ROCDL lowering. Differential Revision: https://reviews.llvm.org/D93313	2020-12-15 18:47:49 +01:00
Tres Popp	e04785b131	[mlir] Add NVVM lowering for std.pow Differential Revision: https://reviews.llvm.org/D93303	2020-12-15 18:28:23 +01:00
Javier Setoain	aece4e2793	[mlir][ArmSVE][RFC] Add an ArmSVE dialect This revision starts an Arm-specific ArmSVE dialect discussed in the discourse RFC thread: https://llvm.discourse.group/t/rfc-vector-dialects-neon-and-sve/2284 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92172	2020-12-14 21:35:01 +00:00
ergawy	ecab63894b	[MLIR][SPIRV] Refactoring serialization and deserialization This commit splits SPIR-V's serialization and deserialization code into separate libraries. The motiviation being that the serializer is used more often the deserializer and therefore lumping them together unnecessarily increases binary size for the most common case. This commit also moves these libraries into the Target/ directory to follow MLIR convention. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91548	2020-12-14 12:28:16 -05:00
Frederik Gossen	75d9a46090	[MLIR] Add atan and atan2 lowerings to CUDA intrinsics Differential Revision: https://reviews.llvm.org/D93124	2020-12-14 10:45:28 +01:00
Frederik Gossen	1c6bc2c0b5	[MLIR] Add lowerings for atan and atan2 to ROCDL intrinsics Differential Revision: https://reviews.llvm.org/D93123	2020-12-14 10:43:19 +01:00
Christian Sigg	1ffc1aaa09	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove those methods from OpState. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93098	2020-12-13 09:58:16 +01:00
Sean Silva	444822d77a	Revert "Revert "[mlir] Start splitting the `tensor` dialect out of `std`."" This reverts commit `0d48d265db`. This reapplies the following commit, with a fix for CAPI/ir.c: [mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 14:30:50 -08:00
Sean Silva	0d48d265db	Revert "[mlir] Start splitting the `tensor` dialect out of `std`." This reverts commit `cab8dda90f`. I mistakenly thought that CAPI/ir.c failure was unrelated to this change. Need to debug it.	2020-12-11 14:15:41 -08:00
Sean Silva	cab8dda90f	[mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 13:50:55 -08:00
Nicolas Vasilache	7310501f74	[mlir][ArmNeon][RFC] Add a Neon dialect This revision starts an Arm-specific ArmNeon dialect discussed in the [discourse RFC thread](https://llvm.discourse.group/t/rfc-vector-dialects-neon-and-sve/2284). Differential Revision: https://reviews.llvm.org/D92171	2020-12-11 13:49:40 +00:00
Adrian Kuegel	9122070563	[mlir] Expose target configuration for lowering to ROCDL. Differential Revision: https://reviews.llvm.org/D93028	2020-12-11 13:20:53 +01:00
Adrian Kuegel	ada4c7a351	Add rsqrt lowering from standard to ROCDL. Add a lowering for rsqrt from standard dialect to ROCDL. Differential Revision: https://reviews.llvm.org/D93011	2020-12-11 13:18:57 +01:00
Rahul Joshi	563879b6f9	[NFC] Use ConvertOpToLLVMPattern instead of ConvertToLLVMPattern. - use ConvertOpToLLVMPattern to avoid explicit casting and in most cases the constructor can be reused to save a few lines of code. Differential Revision: https://reviews.llvm.org/D92989	2020-12-10 09:33:43 -08:00
Adrian Kuegel	09f717b929	Add sqrt lowering from standard to ROCDL Add a lowering for sqrt from standard dialect to ROCDL. Differential Revision: https://reviews.llvm.org/D92921	2020-12-10 09:47:37 +01:00
Christian Sigg	0bf4a82a5a	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove the corresponding methods from OpState. Reviewed By: silvas, rriddle Differential Revision: https://reviews.llvm.org/D92878	2020-12-09 12:11:32 +01:00
Frederik Gossen	d536569009	[MLIR] Expose target configuration for lowering to NVVM Differential Revision: https://reviews.llvm.org/D92871	2020-12-09 10:29:38 +01:00
Tres Popp	111ae220a3	[mlir] Use rewriting infrastructure in AsyncToLLVM This is needed so a listener hears all changes during the dialect conversion to allow correct rollbacks upon failure. Differential Revision: https://reviews.llvm.org/D92685	2020-12-08 17:30:01 +01:00
Frederik Gossen	b4750f58d8	Add sqrt lowering from standard to NVVM Differential Revision: https://reviews.llvm.org/D92850	2020-12-08 17:08:27 +01:00
Frederik Gossen	bb7d43e7d5	Add rsqrt lowering from standard to NVVM Differential Revision: https://reviews.llvm.org/D92838	2020-12-08 14:33:58 +01:00
Christian Sigg	dcec2ca5bd	Remove typeConverter from ConvertToLLVMPattern and use the existing one in ConversionPattern. ftynse Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D92564	2020-12-04 14:27:16 +01:00
River Riddle	09f7a55fad	[mlir][Types][NFC] Move all of the builtin Type classes to BuiltinTypes.h This is part of a larger refactoring the better congregates the builtin structures under the BuiltinDialect. This also removes the problematic "standard" naming that clashes with the "standard" dialect, which is not defined within IR/. A temporary forward is placed in StandardTypes.h to allow time for downstream users to replaced references. Differential Revision: https://reviews.llvm.org/D92435	2020-12-03 18:02:10 -08:00
Aart Bik	c95acf052b	[mlir][vector][avx512] move avx512 lowering pass into general vector lowering A separate AVX512 lowering pass does not compose well with the regular vector lowering pass. As such, it is at risk of code duplication and lowering inconsistencies. This change removes the separate AVX512 lowering pass and makes it an "option" in the regular vector lowering pass (viz. vector dialect "augmented" with AVX512 dialect). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92614	2020-12-03 17:23:46 -08:00
River Riddle	672cc75cce	[mlir][IR] Remove references to BuiltinOps from IR/ There isn't a good reason for anything within IR to specifically reference any of the builtin operations. The only place that had a good reason in the past was AsmPrinter, but the behavior there doesn't need to hardcode ModuleOp anymore. Differential Revision: https://reviews.llvm.org/D92448	2020-12-03 15:47:01 -08:00
Christian Sigg	c4a0405902	Add `Operation* OpState::operator->()` to provide more convenient access to members of Operation. Given that OpState already implicit converts to Operator*, this seems reasonable. The alternative would be to add more functions to OpState which forward to Operation. Reviewed By: rriddle, ftynse Differential Revision: https://reviews.llvm.org/D92266	2020-12-02 15:46:20 +01:00
Christian Sigg	e9e45b3887	[mlir] Fix bad rebase landed in `acb69f3b7c`. Differential Revision: https://reviews.llvm.org/D92265	2020-11-28 13:57:01 +01:00
Christian Sigg	acb69f3b7c	[mlir] Change ConvertOpToLLVMPattern::matchAndRewrite argument to concrete operand type. Reviewed By: herhut, ftynse Differential Revision: https://reviews.llvm.org/D92111	2020-11-28 13:09:25 +01:00
Christian Sigg	5535696c38	[mlir] Add gpu.allocate, gpu.deallocate ops with LLVM lowering to runtime function calls. The ops are very similar to the std variants, but support async GPU execution. gpu.alloc does not currently support an alignment attribute, and the new ops do not have canonicalizers/folders like their std siblings do. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91698	2020-11-27 09:40:59 +01:00
Alex Zinenko	119545f433	[mlir] Add conversion from SCF parallel loops to OpenMP Introduce a conversion pass from SCF parallel loops to OpenMP dialect constructs - parallel region and workshare loop. Loops with reductions are not supported because the OpenMP dialect cannot model them yet. The conversion currently targets only one level of parallelism, i.e. only one top-level `omp.parallel` operation is produced even if there are nested `scf.parallel` operations that could be mapped to `omp.wsloop`. Nested parallelism support is left for future work. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D91982	2020-11-24 21:12:56 +01:00
Nicolas Vasilache	a8de412f51	[mlir] NFC - Expose an OffsetSizeAndStrideOpInterface This revision will make it easier to create new ops base on the strided memref abstraction outside of the std dialect. OffsetSizeAndStrideOpInterface is an interface for ops that allow specifying mixed dynamic and static offsets, sizes and strides variadic operands. Ops that implement this interface need to expose the following methods: 1. `getArrayAttrRanks` to specify the length of static integer attributes. 2. `offsets`, `sizes` and `strides` variadic operands. 3. `static_offsets`, resp. `static_sizes` and `static_strides` integer array attributes. The invariants of this interface are: 1. `static_offsets`, `static_sizes` and `static_strides` have length exactly `getArrayAttrRanks()`[0] (resp. [1], [2]). 2. `offsets`, `sizes` and `strides` have each length at most `getArrayAttrRanks()`[0] (resp. [1], [2]). 3. if an entry of `static_offsets` (resp. `static_sizes`, `static_strides`) is equal to a special sentinel value, namely `ShapedType::kDynamicStrideOrOffset` (resp. `ShapedType::kDynamicSize`, `ShapedType::kDynamicStrideOrOffset`), then the corresponding entry is a dynamic offset (resp. size, stride). 4. a variadic `offset` (resp. `sizes`, `strides`) operand must be present for each dynamic offset (resp. size, stride). This interface is useful to factor out common behavior and provide support for carrying or injecting static behavior through the use of the static attributes. Differential Revision: https://reviews.llvm.org/D92011	2020-11-24 14:42:47 +00:00
Alex Zinenko	f7d033f4d8	[mlir] Support WsLoopOp in OpenMP to LLVM dialect conversion It is a simple conversion that only requires to change the region argument types, generalize it from ParallelOp. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D91989	2020-11-23 23:28:02 +01:00
Alex Zinenko	1ec60862d7	[mlir] Avoid cloning ops in SCF parallel conversion to CFG The existing implementation of the conversion from SCF Parallel operation to SCF "for" loops in order to further convert those loops to branch-based CFG has been cloning the loop and reduction body operations into the new loop because ConversionPatternRewriter was missing support for moving blocks while replacing their arguments. This functionality now available, use it to implement the conversion and avoid cloning operations, which may lead to doubling of the IR size during the conversion. In addition, this fixes an issue with converting nested SCF "if" conditionals present in "parallel" operations that would cause the conversion infrastructure to stop because of the repeated application of the pattern converting "newly" created "if"s (which were in fact just moved). Arguably, this should be fixed at the infrastructure level and this fix is a workaround. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91955	2020-11-23 14:01:22 +01:00
Ella Ma	1756d67934	[llvm][clang][mlir] Add checks for the return values from Target::createXXX to prevent protential null deref All these potential null pointer dereferences are reported by my static analyzer for null smart pointer dereferences, which has a different implementation from `alpha.cplusplus.SmartPtr`. The checked pointers in this patch are initialized by Target::createXXX functions. When the creator function pointer is not correctly set, a null pointer will be returned, or the creator function may originally return a null pointer. Some of them may not make sense as they may be checked before entering the function, but I fixed them all in this patch. I submit this fix because 1) similar checks are found in some other places in the LLVM codebase for the same return value of the function; and, 2) some of the pointers are dereferenced before they are checked, which may definitely trigger a null pointer dereference if the return value is nullptr. Reviewed By: tejohnson, MaskRay, jpienaar Differential Revision: https://reviews.llvm.org/D91410	2020-11-21 21:04:12 -08:00
Eugene Zhulenev	13ab072b25	[mlir] AsynToLLVM: do no use op->getOperands() in conversion patterns Differential Revision: https://reviews.llvm.org/D91910	2020-11-21 04:57:26 -08:00
Eugene Zhulenev	a86a9b5ef7	[mlir] Automatic reference counting for Async values + runtime support for ref counted objects Depends On D89963 Automatic reference counting algorithm outline: 1. `ReturnLike` operations forward the reference counted values without modifying the reference count. 2. Use liveness analysis to find blocks in the CFG where the lifetime of reference counted values ends, and insert `drop_ref` operations after the last use of the value. 3. Insert `add_ref` before the `async.execute` operation capturing the value, and pairing `drop_ref` before the async body region terminator, to release the captured reference counted value when execution completes. 4. If the reference counted value is passed only to some of the block successors, insert `drop_ref` operations in the beginning of the blocks that do not have reference coutned value uses. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D90716	2020-11-20 03:08:44 -08:00
Alex Zinenko	9bb5bff570	[mlir] Add an assertion on creating an Operation with null result types Null types are commonly used as an error marker. Catch them in the constructor of Operation if they are present in the result type list, as otherwise this could lead to further surprising behavior when querying op result types. Fix AsyncToLLVM and StandardToLLVM that were using null types when constructing operations. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91770	2020-11-19 22:28:38 +01:00
River Riddle	65fcddff24	[mlir][BuiltinDialect] Resolve comments from D91571 * Move ops to a BuiltinOps.h * Add file comments	2020-11-19 11:12:49 -08:00
Stella Stamenova	332710e704	[mlir] Add a missing dependency to LinalgToLLVM Generate passes.h before trying to use it Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D91750	2020-11-19 10:30:40 -08:00
Tres Popp	74170a3aef	Use rewriter in SCFToSPIRV conversion. Additionally, clear a data structure to ensure a proper state if multiple conversion attempts are needed. Differential Revision: https://reviews.llvm.org/D91791	2020-11-19 17:50:14 +01:00
Christian Sigg	8b97e17d16	[mlir] Simplify code generated by ConvertToLLVMPattern::getStridedElementPtr(). Make the interface match the one of ConvertToLLVMPattern::getDataPtr() (to be removed in a separate change). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91599	2020-11-18 11:52:09 +01:00
Christian Sigg	bedaad4495	[mlir] Simplify std.alloc lowering to LLVM. std.alloc only supports memrefs with identity layout, which means we can simplify the lowering to LLVM and compute strides only from (static and dynamic) sizes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91549	2020-11-17 18:55:34 +01:00
ergawy	9793edd5bf	[MLIR][SPIRV] Rename `spv._address_of` to `spv.mlir.addressof` This commit does the renaming mentioned in the title in order to bring `spv` dialect closer to the MLIR naming conventions. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91609	2020-11-17 12:12:27 -05:00
Christian Sigg	43ede0e2a7	[mlir] Remove unused ConvertToLLVMPattern::linearizeSubscripts(). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91594	2020-11-17 17:25:45 +01:00
River Riddle	73ca690df8	[mlir][NFC] Remove references to Module.h and Function.h These includes have been deprecated in favor of BuiltinDialect.h, which contains the definitions of ModuleOp and FuncOp. Differential Revision: https://reviews.llvm.org/D91572	2020-11-17 00:55:47 -08:00
Rahul Joshi	b7382ed3fe	[MLIR] Extend Symbol verification to reject public symbol declarations. - Extend the Symbol interface with `isDeclaration` to identify operations that declare a symbol as opposed to define it. - Extend verification to disallow public declarations as per the discussion in https://llvm.discourse.group/t/rfc-symbol-definition-declaration-x-visibility-checks/2140 - Adopt the new interface for `FuncOp` and fix test and code to not have/create public function declarations. Differential Revision: https://reviews.llvm.org/D91456	2020-11-16 16:05:32 -08:00
Christian Sigg	04481f26fa	[mlir] Require std.alloc() ops to have canonical layout during LLVM lowering. The current code allows strided layouts, but the number of elements allocated is ambiguous. It could be either the number of elements in the shape (the current implementation), or the amount of elements required to not index out-of-bounds with the given maps (which would require evaluating the layout map). If we require the canonical layouts, the two will be the same. Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D91523	2020-11-16 17:29:36 +01:00
Hanhan Wang	47fd19f22e	[mlir][StandardToSPIRV] Extend support for lowering cmpi to SPIRV. The logic of vector on boolean was missed. This patch adds the logic and test on it. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D91403	2020-11-16 06:51:05 -08:00
Rahul Joshi	d843755220	[NFC] Refactor function declaration addition in AsyncToLLVM - Extract repeated code into helper function/lambdas. Differential Revision: https://reviews.llvm.org/D91453	2020-11-13 12:53:19 -08:00
Eugene Zhulenev	c30ab6c2a3	[mlir] Transform scf.parallel to scf.for + async.execute Depends On D89958 1. Adds `async.group`/`async.awaitall` to group together multiple async tokens/values 2. Rewrite scf.parallel operation into multiple concurrent async.execute operations over non overlapping subranges of the original loop. Example: ``` scf.for (%i, %j) = (%lbi, %lbj) to (%ubi, %ubj) step (%si, %sj) { "do_some_compute"(%i, %j): () -> () } ``` Converted to: ``` %c0 = constant 0 : index %c1 = constant 1 : index // Compute blocks sizes for each induction variable. %num_blocks_i = ... : index %num_blocks_j = ... : index %block_size_i = ... : index %block_size_j = ... : index // Create an async group to track async execute ops. %group = async.create_group scf.for %bi = %c0 to %num_blocks_i step %c1 { %block_start_i = ... : index %block_end_i = ... : index scf.for %bj = %c0 t0 %num_blocks_j step %c1 { %block_start_j = ... : index %block_end_j = ... : index // Execute the body of original parallel operation for the current // block. %token = async.execute { scf.for %i = %block_start_i to %block_end_i step %si { scf.for %j = %block_start_j to %block_end_j step %sj { "do_some_compute"(%i, %j): () -> () } } } // Add produced async token to the group. async.add_to_group %token, %group } } // Await completion of all async.execute operations. async.await_all %group ``` In this example outer loop launches inner block level loops as separate async execute operations which will be executed concurrently. At the end it waits for the completiom of all async execute operations. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D89963	2020-11-13 04:02:56 -08:00
Stephan Herhut	5da2423bc0	[mlir][gpu] Only transform mapped parallel loops to GPU. This exposes a hook to configure legality of operations such that only `scf.parallel` operations that have mapping attributes are marked as illegal. Consequently, the transformation can now also be applied to mixed forms. Differential Revision: https://reviews.llvm.org/D91340	2020-11-13 09:15:17 +01:00
Rahul Joshi	5883c4b470	[MLIR] Fix standard -> LLVM conversion to fail for unsupported memref element type. - Move isSupportedMemRefType() to ConvertToLLVMPatterns and check if the memref element type is supported there. Differential Revision: https://reviews.llvm.org/D91374	2020-11-12 17:06:05 -08:00
Adrian Kuegel	a719eef73e	MLIR: Remove TanhOp from ops list. It caused a build failure.	2020-11-11 14:58:55 +01:00
Adrian Kuegel	5248047c93	MLIR: add SinOp Lowering to __ocml_sin_f32 and __ocml_sin_f64 This mimics the recent similar patch for GPUToNVVM. Differential Revision: https://reviews.llvm.org/D91252	2020-11-11 14:38:23 +01:00
George Mitenkov	de3ad5bb09	[MLIR][SPIRVToLLVM] Enhanced conversion for execution mode This patch introduces a new conversion pattern for `spv.ExecutionMode`. `spv.ExecutionMode` may contain important information about the entry point, which we want to preserve. For example, `LocalSize` provides information about the work-group size that can be reused. Hence, the pattern for entry-point ops changes to the following: - `spv.EntryPoint` is still simply removed - Info from `spv.ExecutionMode` is used to create a global struct variable, which looks like: ``` struct { int32_t executionMode; int32_t values[]; // optional values }; ``` Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D89989	2020-11-10 18:33:54 +03:00
Artur Bialas	3035e676a3	[mlir][spirv] Add VectorInsertDynamicOp and vector.insertelement lowering VectorInsertDynamicOp in SPIRV dialect conversion from vector.insertelement to spirv VectorInsertDynamicOp Differential Revision: https://reviews.llvm.org/D90927	2020-11-10 09:49:12 +01:00
Alexander Belyaev	9d02e0e38d	[mlir][std] Add ExpandOps pass. The pass combines patterns of ExpandAtomic, ExpandMemRefReshape, StdExpandDivs passes. The pass is meant to legalize STD for conversion to LLVM. Differential Revision: https://reviews.llvm.org/D91082	2020-11-09 21:58:28 +01:00
Rahul Joshi	e29cb0908b	[MLIR] Fix GCC build failure	2020-11-09 11:57:52 -08:00
Rahul Joshi	a97e357e8e	[MLIR] Support `global_memref` and `get_global_memref` in standard -> LLVM conversion. - Convert `global_memref` to LLVM::GlobalOp. - Convert `get_global_memref` to a memref descriptor with a pointer to the first element of the global stashed in it. - Extend unit test and a mlir-cpu-runner test to validate the generated LLVM IR. Differential Revision: https://reviews.llvm.org/D90803	2020-11-09 10:54:21 -08:00
George Mitenkov	89eed79c1f	[MLIR][SPIRVToLLVM] Added module name conversion Since SPIR-V module has an optional name, this patch makes a change to pass it to `ModuleOp` during conversion. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90904	2020-11-07 12:27:44 +03:00
Artur Bialas	f9dca1039a	[mlir][spirv] Add VectorExtractDynamicOp and vector.extractelement lowering VectorExtractDynamicOp in SPIRV dialect conversion from vector.extractelement to spirv VectorExtractDynamicOp Differential Revision: https://reviews.llvm.org/D90679	2020-11-05 08:26:54 +01:00
Rahul Joshi	8c2025cc61	[MLIR] Refactor memref type -> LLVM Type conversion - Eliminate duplicated information about mapping from memref -> its descriptor fields by consolidating that mapping in two functions: getMemRefDescriptorFields and getUnrankedMemRefDescriptorFields. - Change convertMemRefType() and convertUnrankedMemRefType() to use these functions. - Remove convertMemrefSignature and convertUnrankedMemrefSignature. Differential Revision: https://reviews.llvm.org/D90707	2020-11-04 10:32:56 -08:00
Mehdi Amini	c7994bd939	Switch from C-style comments `/* ... /` to C++ style `//` (NFC) This is mostly a scripted update, it may not be perfect. function replace() { FROM=$1 TO=$2 git grep "$FROM" $REPO_PATH \|cut -f 1 -d : \| sort -u \| \ while read file; do sed -i "s#$FROM#$TO#" $file ; done } replace '\|\===----------------------------------------------------------------------===\\|$' '//===----------------------------------------------------------------------===//' replace '^/\ =' '//==' replace '^/\=' '//=' replace '^\\\=' '//=' replace '^\|\' '//' replace ' \\|$' '' replace '=\\\$' '=//' replace '== \/$' '===//' replace '==\/$' '==//' replace '^/\\$.$\/$' '///\1' replace '^/\$.$\/$' '//\1' replace '//============================================================================//' '//===----------------------------------------------------------------------===//' Differential Revision: https://reviews.llvm.org/D90732	2020-11-04 18:11:13 +00:00
Alex Zinenko	8475fa6ed6	[mlir] Add a simpler lowering pattern for WhileOp representing a do-while loop When the "after" region of a WhileOp is merely forwarding its arguments back to the "before" region, i.e. WhileOp is a canonical do-while loop, a simpler CFG subgraph that omits the "after" region with its extra branch operation can be produced. Loop rotation from general "while" to "if { do-while }" is left for a future canonicalization pattern when it becomes necessary. Differential Revision: https://reviews.llvm.org/D90604	2020-11-04 09:43:13 +01:00
Alex Zinenko	4c0e255c98	[mlir] Add lowering to CFG for WhileOp The lowering is a straightforward inlining of the "before" and "after" regions connected by (conditional) branches. This plugs the WhileOp into the progressive lowering scheme. Future commits may choose to target WhileOp instead of CFG when lowering ForOp. Differential Revision: https://reviews.llvm.org/D90603	2020-11-04 09:43:13 +01:00
Alexander Belyaev	9925168576	[mlir] Convert `memref_reshape` to LLVM. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D90377	2020-11-03 11:39:08 +01:00
Tres Popp	d05d42199f	[mlir] Add partial lowering of shape.cstr_broadcastable. Because cstr operations allow more instruction reordering than asserts, we only lower cstr_broadcastable to std ops with cstr_require. This ensures that the more drastic lowering to asserts can happen specifically with the user's desire. Differential Revision: https://reviews.llvm.org/D89325	2020-11-03 09:57:23 +01:00
Eugene Zhulenev	f507aa17b7	[mlir] Implement lowering to LLVM of async.execute ops with token dependencies Add support for lowering `async.execute` operations with token dependencies Example: ``` %dep = ... : !async.token %token = async.execute[%dep] { ... } ``` Token dependencies lowered to `async.await` operations inside the outline coroutine body. Reviewed By: herhut, mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D89958	2020-10-30 05:59:03 -07:00
Tres Popp	511484f27d	[mlir] Add lowering for IsBroadcastable to Std dialect. Differential Revision: https://reviews.llvm.org/D90407	2020-10-30 10:44:27 +01:00
Christian Sigg	fce99e5f73	[mlir][gpu] Handle async in gpu.launch_func lowering. For the synchronous case, destroy the stream after synchronization. Sneak in a unrelated change to report why the gpu.wait conversion pattern didn't match. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89933	2020-10-29 22:16:42 +01:00
Christian Sigg	97b351a827	[mlir][gpu] Fix leaked stream and module when lowering gpu.launch_func to runtime calls. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D90370	2020-10-29 08:40:51 +01:00
Alexander Belyaev	7a996027b9	[mlir] Convert memref_reshape to memref_reinterpret_cast. Differential Revision: https://reviews.llvm.org/D90235	2020-10-28 21:15:32 +01:00
Kazuaki Ishizaki	41b09f4eff	[mlir] NFC: fix trivial typos fix typos in comments and documents Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D90089	2020-10-29 04:05:22 +09:00
Qingyi Liu	1ec893c574	MLIR: add SinOp Lowering to __nv_sinf and __nv_sin Added lowering rule from `SinOp` to `__nv_sinf` and `__nv_sin` Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90147	2020-10-28 14:15:26 +01:00
River Riddle	3fffffa882	[mlir][Pattern] Add a new FrozenRewritePatternList class This class represents a rewrite pattern list that has been frozen, and thus immutable. This replaces the uses of OwningRewritePatternList in pattern driver related API, such as dialect conversion. When PDL becomes more prevalent, this API will allow for optimizing a set of patterns once without the need to do this per run of a pass. Differential Revision: https://reviews.llvm.org/D89104	2020-10-26 18:01:06 -07:00
River Riddle	b6eb26fd0e	[mlir][NFC] Move around the code related to PatternRewriting to improve layering There are several pieces of pattern rewriting infra in IR/ that really shouldn't be there. This revision moves those pieces to a better location such that they are easier to evolve in the future(e.g. with PDL). More concretely this revision does the following: * Create a Transforms/GreedyPatternRewriteDriver.h and move the applyandFold methods there. The definitions for these methods are already in Transforms/ so it doesn't make sense for the declarations to be in IR. Create a new lib/Rewrite library and move PatternApplicator there. This new library will be focused on applying rewrites, and will also include compiling rewrites with PDL. Differential Revision: https://reviews.llvm.org/D89103	2020-10-26 18:01:06 -07:00
River Riddle	b99bd77162	[mlir][Pattern] Refactor the Pattern class into a "metadata only" class The Pattern class was originally intended to be used for solely matching operations, but that use never materialized. All of the pattern infrastructure uses RewritePattern, and the infrastructure for pure matching(Matchers.h) is implemented inline. This means that this class isn't a useful abstraction at the moment, so this revision refactors it to solely encapsulate the "metadata" of a pattern. The metadata includes the various state describing a pattern; benefit, root operation, etc. The API on PatternApplicator is updated to now operate on `Pattern`s as nothing special from `RewritePattern` is necessary. This refactoring is also necessary for the upcoming use of PDL patterns alongside C++ rewrite patterns. Differential Revision: https://reviews.llvm.org/D86258	2020-10-26 18:01:06 -07:00
River Riddle	8a1ca2cd34	[mlir] Add a conversion pass between PDL and the PDL Interpreter Dialect The conversion between PDL and the interpreter is split into several different parts. ** The Matcher: The matching section of all incoming pdl.pattern operations is converted into a predicate tree and merged. Each pattern is first converted into an ordered list of predicates starting from the root operation. A predicate is composed of three distinct parts: * Position - A position refers to a specific location on the input DAG, i.e. an existing MLIR entity being matched. These can be attributes, operands, operations, results, and types. Each position also defines a relation to its parent. For example, the operand `[0] -> 1` has a parent operation position `[0]` (the root). * Question - A question refers to a query on a specific positional value. For example, an operation name question checks the name of an operation position. * Answer - An answer is the expected result of a question. For example, when matching an operation with the name "foo.op". The question would be an operation name question, with an expected answer of "foo.op". After the predicate lists have been created and ordered(based on occurrence of common predicates and other factors), they are formed into a tree of nodes that represent the branching flow of a pattern match. This structure allows for efficient construction and merging of the input patterns. There are currently only 4 simple nodes in the tree: * ExitNode: Represents the termination of a match * SuccessNode: Represents a successful match of a specific pattern * BoolNode/SwitchNode: Branch to a specific child node based on the expected answer to a predicate question. Once the matcher tree has been generated, this tree is walked to generate the corresponding interpreter operations. ** The Rewriter: The rewriter portion of a pattern is generated in a very straightforward manor, similarly to lowerings in other dialects. Each PDL operation that may exist within a rewrite has a mapping into the interpreter dialect. The code for the rewriter is generated within a FuncOp, that is invoked by the interpreter on a successful pattern match. Referenced values defined in the matcher become inputs the generated rewriter function. An example lowering is shown below: ```mlir // The following high level PDL pattern: pdl.pattern : benefit(1) { %resultType = pdl.type %inputOperand = pdl.input %root, %results = pdl.operation "foo.op"(%inputOperand) -> %resultType pdl.rewrite %root { pdl.replace %root with (%inputOperand) } } // is lowered to the following: module { // The matcher function takes the root operation as an input. func @matcher(%arg0: !pdl.operation) { pdl_interp.check_operation_name of %arg0 is "foo.op" -> ^bb2, ^bb1 ^bb1: pdl_interp.return ^bb2: pdl_interp.check_operand_count of %arg0 is 1 -> ^bb3, ^bb1 ^bb3: pdl_interp.check_result_count of %arg0 is 1 -> ^bb4, ^bb1 ^bb4: %0 = pdl_interp.get_operand 0 of %arg0 pdl_interp.is_not_null %0 : !pdl.value -> ^bb5, ^bb1 ^bb5: %1 = pdl_interp.get_result 0 of %arg0 pdl_interp.is_not_null %1 : !pdl.value -> ^bb6, ^bb1 ^bb6: // This operation corresponds to a successful pattern match. pdl_interp.record_match @rewriters::@rewriter(%0, %arg0 : !pdl.value, !pdl.operation) : benefit(1), loc([%arg0]), root("foo.op") -> ^bb1 } module @rewriters { // The inputs to the rewriter from the matcher are passed as arguments. func @rewriter(%arg0: !pdl.value, %arg1: !pdl.operation) { pdl_interp.replace %arg1 with(%arg0) pdl_interp.return } } } ``` Differential Revision: https://reviews.llvm.org/D84580	2020-10-26 18:01:06 -07:00
Alexander Belyaev	d6ab0474c6	[mlir] Convert MemRefReinterpretCastOp to LLVM. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D90033	2020-10-26 20:13:17 +01:00
George Mitenkov	cae4067ec1	[MLIR][mlir-spirv-cpu-runner] A pass to emulate a call to kernel in LLVM This patch introduces a pass for running `mlir-spirv-cpu-runner` - LowerHostCodeToLLVMPass. This pass emulates `gpu.launch_func` call in LLVM dialect and lowers the host module code to LLVM. It removes the `gpu.module`, creates a sequence of global variables that are later linked to the varables in the kernel module, as well as a series of copies to/from them to emulate the memory transfer to/from the host or to/from the device sides. It also converts the remaining Standard dialect into LLVM dialect, emitting C wrappers. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86112	2020-10-26 08:11:04 -04:00
Lei Zhang	36ce915ac5	Revert "Revert "[mlir] Convert from Async dialect to LLVM coroutines"" This reverts commit `4986d5eaff` with proper patches to CMakeLists.txt: - Add MLIRAsync as a dependency to MLIRAsyncToLLVM - Add Coroutines as a dependency to MLIRExecutionEngine	2020-10-22 15:23:11 -04:00
Mehdi Amini	4986d5eaff	Revert "[mlir] Convert from Async dialect to LLVM coroutines" This reverts commit `a8b0ae3bdd` and commit `f8fcff5a9d`. The build with SHARED_LIBRARY=ON is broken.	2020-10-22 19:12:19 +00:00
Eugene Zhulenev	f8fcff5a9d	[mlir] Convert from Async dialect to LLVM coroutines Lower from Async dialect to LLVM by converting async regions attached to `async.execute` operations into LLVM coroutines (https://llvm.org/docs/Coroutines.html): 1. Outline all async regions to functions 2. Add LLVM coro intrinsics to mark coroutine begin/end 3. Use MLIR conversion framework to convert all remaining async types and ops to LLVM + Async runtime function calls All `async.await` operations inside async regions converted to coroutine suspension points. Await operation outside of a coroutine converted to the blocking wait operations. Implement simple runtime to support concurrent execution of coroutines. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89292	2020-10-22 06:30:46 -07:00
Thomas Raoux	ac2cf07195	[spirv] Fix legalize standard to spir-v for transfer ops Forward missing attributes when creating the new transfer op otherwise the builder would use default values. Differential Revision: https://reviews.llvm.org/D89907	2020-10-21 13:56:01 -07:00
Christian Sigg	3ac561d8c3	[mlir][gpu] Add lowering to LLVM for `gpu.wait` and `gpu.wait async`. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89686	2020-10-21 18:20:42 +02:00
Tres Popp	72d5ac90b9	[mlir] Use affine dim instead of symbol in SCFToGPU lowering. This still satisfies the constraints required by the affine dialect and gives more flexibility in what iteration bounds can be used when loewring to the GPU dialect. Differential Revision: https://reviews.llvm.org/D89782	2020-10-20 11:56:34 +02:00
Sean Silva	57211fd239	[mlir] Use dynamic_tensor_from_elements in shape.broadcast conversion Now, convert-shape-to-std doesn't internally create memrefs, which was previously a bit of a layering violation. The conversion to memrefs should logically happen as part of bufferization. Differential Revision: https://reviews.llvm.org/D89669	2020-10-19 15:51:46 -07:00
Nicolas Vasilache	af5be38a01	[mlir][Linalg] Make a Linalg CodegenStrategy available. This revision adds a programmable codegen strategy from linalg based on staged rewrite patterns. Testing is exercised on a simple linalg.matmul op. Differential Revision: https://reviews.llvm.org/D89374	2020-10-14 11:11:26 +00:00
Alexander Belyaev	323fd11df7	[mlir][nfc] Add a func to compute numElements of a shape in Std -> LLVM. For some reason the variable `cumulativeSizeInBytes` in `getCumulativeSizeInBytes` was actually storing number of elements. I decided to fix it and refactor the function a bit. Differential Revision: https://reviews.llvm.org/D89336	2020-10-13 21:41:25 +02:00
Christian Sigg	01dc85c173	[mlir][gpu] Adding gpu runtime wrapper functions for async execution. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89037	2020-10-12 14:07:27 +02:00
Nicolas Vasilache	422aaf31da	[mlir][Linalg] Add named Linalg ops on tensor to buffer support. This revision introduces support for buffer allocation for any named linalg op. To avoid template instantiating many ops, a new ConversionPattern is created to capture the LinalgOp interface. Some APIs are updated to remain consistent with MLIR style: `OwningRewritePatternList * -> OwningRewritePatternList &` `BufferAssignmentTypeConverter * -> BufferAssignmentTypeConverter &` Differential revision: https://reviews.llvm.org/D89226	2020-10-12 11:20:23 +00:00
Tres Popp	8178e41dc1	[mlir] Type erase inputs to select statements in shape.broadcast lowering. This is required or broadcasting with operands of different ranks will lead to failures as the select op requires both possible outputs and its output type to be the same. Differential Revision: https://reviews.llvm.org/D89134	2020-10-11 21:58:06 +02:00
Nicolas Vasilache	e0dc3dba3b	[mlir][Linalg] NFC - Cleanup explicitly instantiated paterns 1/n - LinalgToStandard.cpp This revision belongs to a series of patches that reduce reliance of Linalg transformations on templated rewrite and conversion patterns. Instead, this uses a MatchAnyTag pattern for the vast majority of cases and dispatches internally. Differential Revision: https://reviews.llvm.org/D89133	2020-10-09 19:41:41 +00:00
Jakub Lichman	e547b1e243	[mlir] Rank reducing subview conversion to LLVM This commit adjusts SubViewOp lowering to take rank reduction into account. Differential Revision: https://reviews.llvm.org/D88883	2020-10-08 13:47:22 +00:00
Nicolas Vasilache	30e6033b45	[mlir][Linalg] Add TensorsToBuffers support for Constant ops. This revision also inserts an end-to-end test that lowers tensors to buffers all the way to executable code on CPU. Differential revision: https://reviews.llvm.org/D88998	2020-10-08 13:15:45 +00:00
Christian Sigg	cc83dc191c	Import llvm::StringSwitch into mlir namespace. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D88971	2020-10-08 11:39:24 +02:00
Amara Emerson	322d0afd87	[llvm][mlir] Promote the experimental reduction intrinsics to be first class intrinsics. This change renames the intrinsics to not have "experimental" in the name. The autoupgrader will handle legacy intrinsics. Relevant ML thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html Differential Revision: https://reviews.llvm.org/D88787	2020-10-07 10:36:44 -07:00
Tobias Gysi	149dc94c1d	[mlir] fix the types used during the generation of the kernel param array The patch fixes the types used to access the elements of the kernel parameter structure from a pointer to the structure to a pointer to the actual parameter type. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D88959	2020-10-07 16:18:46 +02:00
Thomas Raoux	6e557bc405	[mlir][spirv] Add Vector to SPIR-V conversion pass Add conversion pass for Vector dialect to SPIR-V dialect and add some simple conversion pattern for vector.broadcast, vector.insert, vector.extract. Differential Revision: https://reviews.llvm.org/D88761	2020-10-06 11:53:23 -07:00
George Mitenkov	b81bedf714	[MLIR][SPIRVToLLVM] Conversion for composite extract and insert A pattern to convert `spv.CompositeInsert` and `spv.CompositeExtract`. In LLVM, there are 2 ops that correspond to each instruction depending on the container type. If the container type is a vector type, then the result of conversion is `llvm.insertelement` or `llvm.extractelement`. If the container type is an aggregate type (i.e. struct, array), the result of conversion is `llvm.insertvalue` or `llvm.extractvalue`. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D88205	2020-10-06 11:46:25 +03:00
Mehdi Amini	afd729edee	Add definition for static constexpr member (NFC) Fix the build for some toolchain and config.	2020-10-05 16:56:27 +00:00
Christian Sigg	665371d0b2	[mlir] Split alloc-like op LLVM lowerings into base and separate derived classes. The previous code did the lowering to alloca, malloc, and aligned_malloc in a single class with different code paths that are somewhat difficult to follow. This change moves the common code to a base class and has a separte derived class per lowering target that contains the specifics. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D88696	2020-10-05 17:36:01 +02:00
Benjamin Kramer	6e2b267d1c	Promote transpose from linalg to standard dialect While affine maps are part of the builtin memref type, there is very limited support for manipulating them in the standard dialect. Add transpose to the set of ops to complement the existing view/subview ops. This is a metadata transformation that encodes the transpose into the strides of a memref. I'm planning to use this when lowering operations on strided memrefs, using the transpose to remove the stride without adding a dependency on linalg dialect. Differential Revision: https://reviews.llvm.org/D88651	2020-10-05 10:58:20 +02:00
Diego Caballero	a611f9a5c6	[mlir] Fix call op conversion in bare-ptr calling convention We hit an llvm_unreachable related to unranked memrefs for call ops with scalar types. Removing the llvm_unreachable since the conversion should gracefully bail out in the presence of unranked memrefs. Adding tests to verify that. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D88709	2020-10-02 08:48:21 -07:00
Geoffrey Martin-Noble	d4e889f1f5	Remove `Ops` suffix from dialect library names Dialects include more than just ops, so this suffix is outdated. Follows discussion in https://llvm.discourse.group/t/rfc-canonical-file-paths-to-dialects/621 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D88530	2020-09-30 18:00:44 -07:00
Diego Caballero	a89fc12653	[mlir] Support return and call ops in bare-ptr calling convention This patch adds support for the 'return' and 'call' ops to the bare-ptr calling convention. These changes also align the bare-ptr calling convention code with the latest changes in the default calling convention and reduce the amount of customization code needed. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D87724	2020-09-29 12:00:47 -07:00
Sean Silva	a975be0e00	[mlir][shape] Make conversion passes more consistent. - use select-ops to make the lowering simpler - change style of FileCheck variables names to be consistent - change some variable names in the code to be more explicit Differential Revision: https://reviews.llvm.org/D88258	2020-09-28 14:55:42 -07:00
Aart Bik	e9628955f5	[mlir] [VectorOps] Relaxed restrictions on vector.reduction types even more Recently, restrictions on vector reductions were made more relaxed by accepting any width signless integer and floating-point. This CL relaxes the restriction even more by including unsigned and signed integers. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D88442	2020-09-28 13:38:03 -07:00
Aart Bik	54759cefdb	[mlir] [VectorOps] changes to printing support for integers (1) simplify integer printing logic by always using 64-bit print (2) add index support (since vector<16xindex> is planned to be added) (3) adjust naming convention print_x -> printX Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D88436	2020-09-28 11:43:31 -07:00
Rahul Joshi	2d128b04d9	[NFC] Fix build warnings	2020-09-25 09:35:41 -07:00
Aart Bik	b8880f5f97	[mlir] [VectorOps] generalize printing support for integers This generalizes printing beyond just i1,i32,i64 and also accounts for signed and unsigned interpretation in the output. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D88290	2020-09-25 04:52:21 -07:00
Artur Bialas	396e7f4548	[mlir][SCFToGPU] LaunchOp propagate optional attributes Allow propagating optional user defined attributes during SCF to GPU conversion. Gives opportunity to use user defined attributes in the further lowering. For example setting subgroup size, or other options for GPU dispatch. This does not break backward compatibility and does not require new attributes, just allow passing optional ones. Differential Revision: https://reviews.llvm.org/D88203	2020-09-25 09:21:16 +02:00
Sean Silva	9ed1e5873c	[mlir][shape] Start a pass that lowers shape constraints. This pass converts shape.cstr_* ops to eager (side-effecting) error-handling code. After that conversion is done, the witnesses are trivially satisfied and are replaced with `shape.const_witness true`. Differential Revision: https://reviews.llvm.org/D87941	2020-09-24 12:25:30 -07:00
Rahul Joshi	08e4f07852	[MLIR][NFC] Adopt use of TypeRange in build() methods. - Use TypeRange instead of ArrayRef<Type> where possible. - Change some of the custom builders to also use TypeRange Differential Revision: https://reviews.llvm.org/D87944	2020-09-23 09:07:57 -07:00
Nicolas Vasilache	ed229132f1	[mlir][Linalg] Uniformize linalg.generic with named ops. This revision allows representing a reduction at the level of linalg on tensors for generic ops by uniformizing with the named ops approach.	2020-09-22 04:13:22 -04:00
Christian Sigg	9ba3b7449d	[MLIR] Fix typo and expand gpu.host_register description. See comments in https://reviews.llvm.org/D85631. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D86214	2020-09-21 13:44:39 +02:00
Benjamin Kramer	2d76274b99	[mlir][VectorOps] Loosen restrictions on vector.reduction types LLVM can deal with any integer or float type, don't arbitrarily restrict it to f32/f64/i32/i64. Differential Revision: https://reviews.llvm.org/D88010	2020-09-21 12:45:23 +02:00
Hanhan Wang	1909b6ac0d	[mlir][StandardToSPIRV] Handle vector of i1 case for lowering zexti to SPIR-V. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D87887	2020-09-18 07:07:22 -07:00
Alex Zinenko	967c7b6936	[mlir] check for failures when packing function sigunatures in std->llvm conversion When packing function results into a structure during the standard-to-llvm dialect conversion, do not assume the conversion was successful and propagate nullptr as error state. Fixes PR45184. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D87605	2020-09-15 12:30:44 +02:00
Alex Zinenko	5cac85c931	[mlir] Check for type conversion success in std->llvm function conversion Type converter may fail and return nullptr on unconvertible types. The function conversion did not include a check and was attempting to use a nullptr type to construct an LLVM function, leading to a crash. Add a check and return early. The rest of the call stack propagates errors properly. Fixes PR47403. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D87075	2020-09-14 13:16:42 +02:00
Sean Silva	84a6da67e6	[mlir] Fix some edge cases around 0-element TensorFromElementsOp This introduces a builder for the more general case that supports zero elements (where the element type can't be inferred from the ValueRange, since it might be empty). Also, fix up some cases in ShapeToStandard lowering that hit this. It happens very easily when dealing with shapes of 0-D tensors. The SameOperandsAndResultElementType is redundant with the new TypesMatchWith and prevented having zero elements. Differential Revision: https://reviews.llvm.org/D87492	2020-09-11 10:58:35 -07:00
Eugene Burmako	5638df1950	Introduce linalg.vecmat This patch adds a new named structured op to accompany linalg.matmul and linalg.matvec. We needed it for our codegen, so I figured it would be useful to add it to Linalg. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D87292	2020-09-10 18:48:14 +02:00
Christian Sigg	3a577f5446	Rename MemRefDescriptor::getElementType() to MemRefDescriptor::getElementPtrType(). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D87284	2020-09-09 11:45:39 +02:00
Frederik Gossen	5106a8b8f8	[MLIR][Shape] Lower `shape_of` to `dynamic_tensor_from_elements` Take advantage of the new `dynamic_tensor_from_elements` operation in `std`. Instead of stack-allocated memory, we can now lower directly to a single `std` operation. Differential Revision: https://reviews.llvm.org/D86935	2020-09-09 07:55:13 +00:00
Benjamin Kramer	51d30c3429	[mlir][VectorOps] Fix more GCC5 weirdness VectorToSCF.cpp:515:47: error: specialization of 'template<class TransferOpTy> mlir::LogicalResult mlir::VectorTransferRewriter<TransferOpTy>::matchAndRewrite(mlir::Operation*, mlir::PatternRewriter&) const' in different namespace [-fpermissive]	2020-09-08 15:41:39 +02:00
Benjamin Kramer	df63eedef6	[mlir][VectorOps] Put back anonymous namespace to work around GCC5 bug. VectorToSCF.cpp:241:61: error: specialization of 'template<class ConcreteOp> mlir::LogicalResult {anonymous}::NDTransferOpHelper<ConcreteOp>::doReplace()' in different namespace [-fpermissive]	2020-09-08 14:03:30 +02:00
Benjamin Kramer	307dc7b236	[mlir][VectorOps] Clean up outdated comments. NFCI. While there - De-templatify code that can use function_ref - Make BoundCaptures usable when they're const - Address post-submit review comment (static function into global namespace)	2020-09-08 12:02:00 +02:00
Benjamin Kramer	239eff502b	[mlir][VectorOps] Redo the scalar loop emission in VectoToSCF to pad instead of clipping This replaces the select chain for edge-padding with an scf.if that performs the memory operation when the index is in bounds and uses the pad value when it's not. For transfer_write the same mechanism is used, skipping the store when the index is out of bounds. The integration test has a bunch of cases of how I believe this should work. Differential Revision: https://reviews.llvm.org/D87241	2020-09-08 11:15:25 +02:00
Nicolas Vasilache	9be6178449	[mlir][Vector] Make VectorToSCF deterministic Differential Revision: https://reviews.llvm.org/D87273	2020-09-08 04:18:22 -04:00
Frederik Gossen	a70f2eb3e3	[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings. Merge the two lowering passes because they are not useful by themselves. The new pass lowers to `std` and `scf` is considered an auxiliary dialect. See also https://llvm.discourse.group/t/conversions-with-multiple-target-dialects/1541/12 Differential Revision: https://reviews.llvm.org/D86779	2020-09-07 14:39:37 +00:00
David Truby	973800dc7c	Revert "[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings." This reverts commit `15acdd7543`.	2020-09-07 13:37:32 +01:00
Frederik Gossen	15acdd7543	[MLIR][Shape] Merge `shape` to `std`/`scf` lowerings. Merge the two lowering passes because they are not useful by themselves. The new pass lowers to `std` and `scf` is considered an auxiliary dialect. See also https://llvm.discourse.group/t/conversions-with-multiple-target-dialects/1541/12 Differential Revision: https://reviews.llvm.org/D86779	2020-09-07 12:12:36 +00:00
Frederik Gossen	136eb79a88	[MLIR][Standard] Add `dynamic_tensor_from_elements` operation With `dynamic_tensor_from_elements` tensor values of dynamic size can be created. The body of the operation essentially maps the index space to tensor elements. Declare SCF operations in the `scf` namespace to avoid name clash with the new `std.yield` operation. Resolve ambiguities between `linalg/shape/std/scf.yield` operations. Differential Revision: https://reviews.llvm.org/D86276	2020-09-07 11:44:43 +00:00
Nicolas Vasilache	8d64df9f13	[mlir][Vector] Revisit VectorToSCF. Vector to SCF conversion still had issues due to the interaction with the natural alignment derived by the LLVM data layout. One traditional workaround is to allocate aligned. However, this does not always work for vector sizes that are non-powers of 2. This revision implements a more portable mechanism where the intermediate allocation is always a memref of elemental vector type. AllocOp is extended to use the natural LLVM DataLayout alignment for non-scalar types, when the alignment is not specified in the first place. An integration test is added that exercises the transfer to scf.for + scalar lowering with a 5x5 transposition. Differential Revision: https://reviews.llvm.org/D87150	2020-09-07 05:19:43 -04:00
Benjamin Kramer	0c2a4d3c1c	[mlir][VectorOps] Simplify code. NFCI.	2020-09-04 11:10:20 +02:00
aartbik	060c9dd1cc	[mlir] [VectorOps] Improve SIMD compares with narrower indices When allowed, use 32-bit indices rather than 64-bit indices in the SIMD computation of masks. This runs up to 2x and 4x faster on a number of AVX2 and AVX512 microbenchmarks. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D87116	2020-09-03 21:43:38 -07:00
Benjamin Kramer	dfb7b3fe02	[mlir][VectorOps] Fall back to a loop when accessing a vector from a strided memref The scalar loop is slow but correct. Differential Revision: https://reviews.llvm.org/D87082	2020-09-03 16:05:38 +02:00
Jakub Lichman	f5ed22f09d	[mlir][VectorToSCF] 128 byte alignment of alloc ops Added 128 byte alignment to alloc ops created in VectorToSCF pass. 128b alignment was already introduced to this pass but not to all alloc ops. This commit changes that by adding 128b alignment to the remaining ops. The point of specifying alignment is to prevent possible memory alignment errors on weakly tested architectures. Differential Revision: https://reviews.llvm.org/D86454	2020-09-02 12:37:35 +00:00
Benjamin Kramer	2bf491c729	[mlir][VectorOps] Fail fast when a strided memref is passed to vector_transfer Otherwise we'll silently miscompile things. Differential Revision: https://reviews.llvm.org/D86951	2020-09-02 10:34:36 +02:00
River Riddle	431bb8b318	[mlir][ODS] Use c++ types for integer attributes of fixed width when possible. Unsigned and Signless attributes use uintN_t and signed attributes use intN_t, where N is the fixed width. The 1-bit variants use bool. Differential Revision: https://reviews.llvm.org/D86739	2020-09-01 13:43:32 -07:00
Kiran Chandramohan	875074c8a9	[OpenMP][MLIR] Conversion pattern for OpenMP to LLVM Adding a conversion pattern for the parallel Operation. This will help the conversion of parallel operation with standard dialect to parallel operation with llvm dialect. The type conversion of the block arguments in a parallel region are controlled by the pattern for the parallel Operation. Without this pattern, a parallel Operation with block arguments cannot be converted from standard to LLVM dialect. Other OpenMP operations without regions are marked as legal. When translation of OpenMP operations with regions are added then patterns for these operations can also be added. Also uses all the standard to llvm patterns. Patterns of other dialects can be added later if needed. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D86273	2020-08-27 19:32:15 +01:00
Benjamin Kramer	fddf543e6e	[MLIR][GPUToSPIRV] Fix use-after-free. Found by asan.	2020-08-27 17:57:11 +02:00
George Mitenkov	d48b84eb8a	[MLIR][GPUToSPIRV] Passing gpu module name to SPIR-V module This patch allows to pass the gpu module name to SPIR-V module during conversion. This has many benefits as we can lookup converted to SPIR-V kernel in the symbol table. In order to avoid symbol conflicts, `"__spv__"` is added to the gpu module name to form the new one. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86384	2020-08-27 09:19:24 +03:00
George Mitenkov	e850558cdc	[MLIR][SPIRVToLLVM] Added a hook for descriptor set / binding encoding This patch introduces a hook to encode descriptor set and binding number into `spv.globalVariable`'s symbolic name. This allows to preserve this information, and at the same time legalize the global variable for the conversion to LLVM dialect. This is required for `mlir-spirv-cpu-runner` to convert kernel arguments into LLVM. Also, a couple of some nits added: - removed unused comment - changed to a capital letter in the comment Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86515	2020-08-27 08:27:42 +03:00
Thomas Raoux	6a3c69e918	[mlir][spirv] Infer converted type of scf.for from the init value Instead of using the TypeConverter infer the value of the alloca created based on the init value. This will allow some ambiguous types like multidimensional vectors to be converted correctly. Differential Revision: https://reviews.llvm.org/D86582	2020-08-25 23:35:01 -07:00
Thomas Raoux	36ee9a322a	[mlir][GPUToVulkan] Fix signature of bindMemRef function for f16 Binding MemRefs of f16 needs special handling as the type is not supported on CPU. There was a bug in the type used. Differential Revision: https://reviews.llvm.org/D86328	2020-08-21 10:48:00 -07:00
George Mitenkov	dc693a036d	[MLIR][SPIRVToLLVM] Removed std to llvm patterns from the conversion Removed the Standard to LLVM conversion patterns that were previously pulled in for testing purposes. This helps to separate the conversion to LLVM dialect of the MLIR module with both SPIR-V and Standard dialects in it (particularly helpful for SPIR-V cpu runner). Also, tests were changed accordingly. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86285	2020-08-21 00:26:33 +03:00
Mars Saxman	d34df52377	Implement FPToUI and UIToFP ops in standard dialect Add the unsigned complements to the existing FPToSI and SIToFP operations in the standard dialect, with one-to-one lowerings to the corresponding LLVM operations. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85557	2020-08-19 22:49:09 +02:00
Jakub Lichman	aeb338cc3e	[mlir][VectorToSCF] Fix of broken build - missing link to MLIRLinalgUtils	2020-08-19 17:28:49 +00:00
Jakub Lichman	8dace28f92	[mlir][VectorToSCF] Bug in TransferRead lowering fixed If Memref has rank > 1 this pass emits N-1 loops around TransferRead op and transforms the op itself to 1D read. Since vectors must have static shape while memrefs don't the pass emits if condition to prevent out of bounds accesses in case some memref dimension is smaller than the corresponding dimension of targeted vector. This logic is fine but authors forgot to apply `permutation_map` on loops upper bounds and thus if condition compares induction variable to incorrect loop upper bound (dimension of the memref) in case `permutation_map` is not identity map. This commit aims to fix that.	2020-08-19 15:34:34 +00:00
Benjamin Kramer	b98e25b6d7	Make helpers static. NFC.	2020-08-19 16:00:03 +02:00
Mehdi Amini	f9dc2b7079	Separate the Registration from Loading dialects in the Context This changes the behavior of constructing MLIRContext to no longer load globally registered dialects on construction. Instead Dialects are only loaded explicitly on demand: - the Parser is lazily loading Dialects in the context as it encounters them during parsing. This is the only purpose for registering dialects and not load them in the context. - Passes are expected to declare the dialects they will create entity from (Operations, Attributes, or Types), and the PassManager is loading Dialects into the Context when starting a pipeline. This changes simplifies the configuration of the registration: a compiler only need to load the dialect for the IR it will emit, and the optimizer is self-contained and load the required Dialects. For example in the Toy tutorial, the compiler only needs to load the Toy dialect in the Context, all the others (linalg, affine, std, LLVM, ...) are automatically loaded depending on the optimization pipeline enabled. To adjust to this change, stop using the existing dialect registration: the global registry will be removed soon. 1) For passes, you need to override the method: virtual void getDependentDialects(DialectRegistry &registry) const {} and registery on the provided registry any dialect that this pass can produce. Passes defined in TableGen can provide this list in the dependentDialects list field. 2) For dialects, on construction you can register dependent dialects using the provided MLIRContext: `context.getOrLoadDialect<DialectName>()` This is useful if a dialect may canonicalize or have interfaces involving another dialect. 3) For loading IR, dialect that can be in the input file must be explicitly registered with the context. `MlirOptMain()` is taking an explicit registry for this purpose. See how the standalone-opt.cpp example is setup: mlir::DialectRegistry registry; registry.insert<mlir::standalone::StandaloneDialect>(); registry.insert<mlir::StandardOpsDialect>(); Only operations from these two dialects can be in the input file. To include all of the dialects in MLIR Core, you can populate the registry this way: mlir::registerAllDialects(registry); 4) For `mlir-translate` callback, as well as frontend, Dialects can be loaded in the context before emitting the IR: context.getOrLoadDialect<ToyDialect>() Differential Revision: https://reviews.llvm.org/D85622	2020-08-19 01:19:03 +00:00
Mehdi Amini	e75bc5c791	Revert "Separate the Registration from Loading dialects in the Context" This reverts commit `d14cf45735`. The build is broken with GCC-5.	2020-08-19 01:19:03 +00:00
Mehdi Amini	d14cf45735	Separate the Registration from Loading dialects in the Context This changes the behavior of constructing MLIRContext to no longer load globally registered dialects on construction. Instead Dialects are only loaded explicitly on demand: - the Parser is lazily loading Dialects in the context as it encounters them during parsing. This is the only purpose for registering dialects and not load them in the context. - Passes are expected to declare the dialects they will create entity from (Operations, Attributes, or Types), and the PassManager is loading Dialects into the Context when starting a pipeline. This changes simplifies the configuration of the registration: a compiler only need to load the dialect for the IR it will emit, and the optimizer is self-contained and load the required Dialects. For example in the Toy tutorial, the compiler only needs to load the Toy dialect in the Context, all the others (linalg, affine, std, LLVM, ...) are automatically loaded depending on the optimization pipeline enabled. To adjust to this change, stop using the existing dialect registration: the global registry will be removed soon. 1) For passes, you need to override the method: virtual void getDependentDialects(DialectRegistry &registry) const {} and registery on the provided registry any dialect that this pass can produce. Passes defined in TableGen can provide this list in the dependentDialects list field. 2) For dialects, on construction you can register dependent dialects using the provided MLIRContext: `context.getOrLoadDialect<DialectName>()` This is useful if a dialect may canonicalize or have interfaces involving another dialect. 3) For loading IR, dialect that can be in the input file must be explicitly registered with the context. `MlirOptMain()` is taking an explicit registry for this purpose. See how the standalone-opt.cpp example is setup: mlir::DialectRegistry registry; registry.insert<mlir::standalone::StandaloneDialect>(); registry.insert<mlir::StandardOpsDialect>(); Only operations from these two dialects can be in the input file. To include all of the dialects in MLIR Core, you can populate the registry this way: mlir::registerAllDialects(registry); 4) For `mlir-translate` callback, as well as frontend, Dialects can be loaded in the context before emitting the IR: context.getOrLoadDialect<ToyDialect>() Differential Revision: https://reviews.llvm.org/D85622	2020-08-18 23:23:56 +00:00
Mehdi Amini	d84fe55e0d	Revert "Separate the Registration from Loading dialects in the Context" This reverts commit `e1de2b7550`. Broke a build bot.	2020-08-18 22:16:34 +00:00
Mehdi Amini	e1de2b7550	Separate the Registration from Loading dialects in the Context This changes the behavior of constructing MLIRContext to no longer load globally registered dialects on construction. Instead Dialects are only loaded explicitly on demand: - the Parser is lazily loading Dialects in the context as it encounters them during parsing. This is the only purpose for registering dialects and not load them in the context. - Passes are expected to declare the dialects they will create entity from (Operations, Attributes, or Types), and the PassManager is loading Dialects into the Context when starting a pipeline. This changes simplifies the configuration of the registration: a compiler only need to load the dialect for the IR it will emit, and the optimizer is self-contained and load the required Dialects. For example in the Toy tutorial, the compiler only needs to load the Toy dialect in the Context, all the others (linalg, affine, std, LLVM, ...) are automatically loaded depending on the optimization pipeline enabled. To adjust to this change, stop using the existing dialect registration: the global registry will be removed soon. 1) For passes, you need to override the method: virtual void getDependentDialects(DialectRegistry &registry) const {} and registery on the provided registry any dialect that this pass can produce. Passes defined in TableGen can provide this list in the dependentDialects list field. 2) For dialects, on construction you can register dependent dialects using the provided MLIRContext: `context.getOrLoadDialect<DialectName>()` This is useful if a dialect may canonicalize or have interfaces involving another dialect. 3) For loading IR, dialect that can be in the input file must be explicitly registered with the context. `MlirOptMain()` is taking an explicit registry for this purpose. See how the standalone-opt.cpp example is setup: mlir::DialectRegistry registry; mlir::registerDialect<mlir::standalone::StandaloneDialect>(); mlir::registerDialect<mlir::StandardOpsDialect>(); Only operations from these two dialects can be in the input file. To include all of the dialects in MLIR Core, you can populate the registry this way: mlir::registerAllDialects(registry); 4) For `mlir-translate` callback, as well as frontend, Dialects can be loaded in the context before emitting the IR: context.getOrLoadDialect<ToyDialect>()	2020-08-18 21:14:39 +00:00
Rob Suderman	5556575230	Added std.floor operation to match std.ceil There should be an equivalent std.floor op to std.ceil. This includes matching lowerings for SPIRV, NVVM, ROCDL, and LLVM. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85940	2020-08-18 10:25:32 -07:00
George Mitenkov	cc98a0fbe4	[MLIR][SPIRVToLLVM] Additional conversions for spirv-runner This patch adds more op/type conversion support necessary for `spirv-runner`: - EntryPoint/ExecutionMode: currently removed since we assume having only one kernel function in the kernel module. - StorageBuffer storage class is now supported. We are not concerned with multithreading so this is fine for now. - Type conversion enhanced, now regular offsets and strides for structs and arrays are supported (based on `VulkanLayoutUtils`). - Support of `spc.AccessChain` that is modelled with GEP op in LLVM dialect. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86109	2020-08-18 19:09:59 +03:00
Jakub Lichman	a4b8c2de1d	[mlir] VectorToSCF bug in setAllocAtFunctionEntry fixed. The function makes too strong assumption regarding parent FuncOp which gets broken when FuncOp is first lowered to llvm function. In this fix we generalize the assumption to allocation scope and add assertion to produce user friendly message in case our assumption is broken. Differential Revision: https://reviews.llvm.org/D86086	2020-08-18 07:12:40 +00:00
Alex Zinenko	168213f91c	[mlir] Move data layout from LLVMDialect to module Op attributes Legacy implementation of the LLVM dialect in MLIR contained an instance of llvm::Module as it was required to parse LLVM IR types. The access to the data layout of this module was exposed to the users for convenience, but in practice this layout has always been the default one obtained by parsing an empty layout description string. Current implementation of the dialect no longer relies on wrapping LLVM IR types, but it kept an instance of DataLayout for compatibility. This effectively forces a single data layout to be used across all modules in a given MLIR context, which is not desirable. Remove DataLayout from the LLVM dialect and attach it as a module attribute instead. Since MLIR does not yet have support for data layouts, use the LLVM DataLayout in string form with verification inside MLIR. Introduce the layout when converting a module to the LLVM dialect and keep the default "" description for compatibility. This approach should be replaced with a proper MLIR-based data layout when it becomes available, but provides an immediate solution to compiling modules with different layouts, e.g. for GPUs. This removes the need for LLVMDialectImpl, which is also removed. Depends On D85650 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D85652	2020-08-17 15:12:36 +02:00
Mehdi Amini	25ee851746	Revert "Separate the Registration from Loading dialects in the Context" This reverts commit `2056393387`. Build is broken on a few bots	2020-08-15 09:21:47 +00:00
Mehdi Amini	2056393387	Separate the Registration from Loading dialects in the Context This changes the behavior of constructing MLIRContext to no longer load globally registered dialects on construction. Instead Dialects are only loaded explicitly on demand: - the Parser is lazily loading Dialects in the context as it encounters them during parsing. This is the only purpose for registering dialects and not load them in the context. - Passes are expected to declare the dialects they will create entity from (Operations, Attributes, or Types), and the PassManager is loading Dialects into the Context when starting a pipeline. This changes simplifies the configuration of the registration: a compiler only need to load the dialect for the IR it will emit, and the optimizer is self-contained and load the required Dialects. For example in the Toy tutorial, the compiler only needs to load the Toy dialect in the Context, all the others (linalg, affine, std, LLVM, ...) are automatically loaded depending on the optimization pipeline enabled. Differential Revision: https://reviews.llvm.org/D85622	2020-08-15 08:07:31 +00:00
Mehdi Amini	ba92dadf05	Revert "Separate the Registration from Loading dialects in the Context" This was landed by accident, will reland with the right comments addressed from the reviews. Also revert dependent build fixes.	2020-08-15 07:35:10 +00:00
Mehdi Amini	ebf521e784	Separate the Registration from Loading dialects in the Context This changes the behavior of constructing MLIRContext to no longer load globally registered dialects on construction. Instead Dialects are only loaded explicitly on demand: - the Parser is lazily loading Dialects in the context as it encounters them during parsing. This is the only purpose for registering dialects and not load them in the context. - Passes are expected to declare the dialects they will create entity from (Operations, Attributes, or Types), and the PassManager is loading Dialects into the Context when starting a pipeline. This changes simplifies the configuration of the registration: a compiler only need to load the dialect for the IR it will emit, and the optimizer is self-contained and load the required Dialects. For example in the Toy tutorial, the compiler only needs to load the Toy dialect in the Context, all the others (linalg, affine, std, LLVM, ...) are automatically loaded depending on the optimization pipeline enabled.	2020-08-14 09:40:27 +00:00
Alex Zinenko	339eba0805	[mlir] do not emit bitcasts between structs in StandardToLLVM The convresion of memref cast operaitons from the Standard dialect to the LLVM dialect has been emitting bitcasts from a struct type to itself. Beyond being useless, such casts are invalid as bitcast does not operate on aggregate types. This kept working by accident because LLVM IR bitcast construction API skips the construction if types are equal before it verifies that the types are acceptable in a bitcast. Do not emit such bitcasts, the memref cast that only adds/erases size information is in fact a noop on the current descriptor as it always contains dynamic values for all sizes. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D85899	2020-08-14 11:33:10 +02:00
River Riddle	65277126bf	[mlir][Type] Remove the remaining usages of Type::getKind in preparation for its removal This revision removes all of the lingering usages of Type::getKind. A consequence of this is that FloatType is now split into 4 derived types that represent each of the possible float types(BFloat16Type, Float16Type, Float32Type, and Float64Type). Other than this split, this revision is NFC. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D85566	2020-08-12 19:33:58 -07:00
George Mitenkov	2ad7e1a301	[MLIR][SPIRVToLLVM] Conversion for global and addressof Inital conversion of `spv._address_of` and `spv.globalVariable`. In SPIR-V, the global returns a pointer, whereas in LLVM dialect the global holds an actual value. This difference is handled by `spv._address_of` and `llvm.mlir.addressof`ops that both return a pointer. Moreover, only current invocation is in conversion's scope. Reviewed By: antiagainst, mravishankar Differential Revision: https://reviews.llvm.org/D84626	2020-08-12 09:41:14 +03:00
Thomas Raoux	0de60b550b	[mlir] Fix mlir build break due to warning when NDEBUG is not set	2020-08-10 15:35:02 -07:00
Christian Sigg	2c48e3629c	[MLIR] Adding gpu.host_register op and lower it to a runtime call. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D85631	2020-08-10 22:46:17 +02:00
Christian Sigg	0d4b7adb82	[MLIR] Make gpu.launch_func rewrite pattern part of the LLVM lowering pass. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D85073	2020-08-10 19:28:30 +02:00
Thomas Raoux	68330ee0a9	[mlir][vector] Relax transfer_read/transfer_write restriction on memref operand Relax the verifier for transfer_read/transfer_write operation so that it can take a memref with a different element type than the vector being read/written. This is based on the discourse discussion: https://llvm.discourse.group/t/memref-cast/1514 Differential Revision: https://reviews.llvm.org/D85244	2020-08-10 08:57:48 -07:00
Konrad Dobros	9414a71aaa	[mlir][spirv] Add correct handling of Kernel and Addresses capabilities This change adds initial support needed to generate OpenCL compliant SPIRV. If Kernel capability is declared then memory model becomes OpenCL. If Addresses capability is declared then addressing model becomes Physical64. Additionally for Kernel capability interface variable ABI attributes are not generated as entry point function is expected to have normal arguments. Differential Revision: https://reviews.llvm.org/D85196	2020-08-07 12:29:21 -07:00
aartbik	c3c95b9c80	[mlir] [VectorOps] Improve lowering of extract_strided_slice (and friends like shape_cast) Using a shuffle for the last recursive step in progressive lowering not only results in much more compact IR, but also more efficient code (since the backend is no longer confused on subvector aliasing for longer vectors). E.g. the following %f = vector.shape_cast %v0: vector<1024xf32> to vector<32x32xf32> yields much better x86-64 code that runs 3x faster than the original. Reviewed By: bkramer, nicolasvasilache Differential Revision: https://reviews.llvm.org/D85482	2020-08-07 09:21:05 -07:00
Alex Zinenko	87a89e0f77	[mlir] Remove llvm::LLVMContext and llvm::Module from mlir::LLVMDialectImpl Original modeling of LLVM IR types in the MLIR LLVM dialect had been wrapping LLVM IR types and therefore required the LLVMContext in which they were created to outlive them, which was solved by placing the LLVMContext inside the dialect and thus having the lifetime of MLIRContext. This has led to numerous issues caused by the lack of thread-safety of LLVMContext and the need to re-create LLVM IR modules, obtained by translating from MLIR, in different LLVM contexts to enable parallel compilation. Similarly, llvm::Module had been introduced to keep track of identified structure types that could not be modeled properly. A recent series of commits changed the modeling of LLVM IR types in the MLIR LLVM dialect so that it no longer wraps LLVM IR types and has no dependence on LLVMContext and changed the ownership model of the translated LLVM IR modules. Remove LLVMContext and LLVM modules from the implementation of MLIR LLVM dialect and clean up the remaining uses. The only part of LLVM IR that remains necessary for the LLVM dialect is the data layout. It should be moved from the dialect level to the module level and replaced with an MLIR-based representation to remove the dependency of the LLVMDialect on LLVM IR library. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85445	2020-08-07 14:30:31 +02:00
Alex Zinenko	db1c197bf8	[mlir] take LLVMContext in MLIR-to-LLVM-IR translation Due to the original type system implementation, LLVMDialect in MLIR contains an LLVMContext in which the relevant objects (types, metadata) are created. When an MLIR module using the LLVM dialect (and related intrinsic-based dialects NVVM, ROCDL, AVX512) is converted to LLVM IR, it could only live in the LLVMContext owned by the dialect. The type system no longer relies on the LLVMContext, so this limitation can be removed. Instead, translation functions now take a reference to an LLVMContext in which the LLVM IR module should be constructed. The caller of the translation functions is responsible for ensuring the same LLVMContext is not used concurrently as the translation no longer uses a dialect-wide context lock. As an additional bonus, this change removes the need to recreate the LLVM IR module in a different LLVMContext through printing and parsing back, decreasing the compilation overhead in JIT and GPU-kernel-to-blob passes. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D85443	2020-08-07 14:22:30 +02:00
Christian Sigg	45676a8936	[MLIR] Change GpuLaunchFuncToGpuRuntimeCallsPass to wrap a RewritePattern with the same functionality. The RewritePattern will become one of several, and will be part of the LLVM conversion pass (instead of a separate pass following LLVM conversion). Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D84946	2020-08-06 11:55:46 +02:00
Alexander Belyaev	3effc35015	[mlir] Lower DimOp to LLVM for unranked memrefs. Differential Revision: https://reviews.llvm.org/D85361	2020-08-06 11:46:11 +02:00
Alex Zinenko	5446ec8507	[mlir] take MLIRContext instead of LLVMDialect in getters of LLVMType's Historical modeling of the LLVM dialect types had been wrapping LLVM IR types and therefore needed access to the instance of LLVMContext stored in the LLVMDialect. The new modeling does not rely on that and only needs the MLIRContext that is used for uniquing, similarly to other MLIR types. Change LLVMType::get<Kind>Ty functions to take `MLIRContext ` instead of `LLVMDialect ` as first argument. This brings the code base closer to completely removing the dependence on LLVMContext from the LLVMDialect, together with additional support for thread-safety of its use. Depends On D85371 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85372	2020-08-06 11:05:40 +02:00
Alex Zinenko	d3a9807674	[mlir] Remove most uses of LLVMDialect::getModule This prepares for the removal of llvm::Module and LLVMContext from the mlir::LLVMDialect. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85371	2020-08-06 10:54:30 +02:00
aartbik	39379916a7	[mlir] [VectorOps] Add masked load/store operations to Vector dialect The intrinsics were already supported and vector.transfer_read/write lowered direclty into these operations. By providing them as individual ops, however, clients can used them directly, and it opens up progressively lowering transfer operations at higher levels (rather than direct lowering to LLVM IR as done now). Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D85357	2020-08-05 16:45:24 -07:00
Alex Zinenko	b2ab375d1f	[mlir] use the new stateful LLVM type translator by default Previous type model in the LLVM dialect did not support identified structure types properly and therefore could use stateless translations implemented as free functions. The new model supports identified structs and must keep track of the identified structure types present in the target context (LLVMContext or MLIRContext) to avoid creating duplicate structs due to LLVM's type auto-renaming. Expose the stateful type translation classes and use them during translation, storing the state as part of ModuleTranslation. Drop the test type translation mechanism that is no longer necessary and update the tests to exercise type translation as part of the main translation flow. Update the code in vector-to-LLVM dialect conversion that relied on stateless translation to use the new class in a stateless manner. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85297	2020-08-06 00:36:33 +02:00
Lei Zhang	0d03b3901d	[mlir][StandardToSPIRV] Use spv.UMod for index re-calculation Per Vulkan's SPIR-V environment spec: "While the OpSRem and OpSMod instructions are supported by the Vulkan environment, they require non-negative values and thus do not enable additional functionality beyond what OpUMod provides." The `getOffsetForBitwidth` function is used for lowering std.load and std.store, whose indices are of `index` type and cannot be negative. So we should be okay to use spv.UMod directly here to be exact. Also made the comment explicit about the assumption. Differential Revision: https://reviews.llvm.org/D83714	2020-08-05 14:52:04 -04:00

... 5 6 7 8 9 ...

1318 Commits