llvm-project

Commit Graph

Author	SHA1	Message	Date
Mahesh Ravishankar	ea56025f1e	Initial implementation to translate kernel fn in GPU Dialect to SPIR-V Dialect This CL adds an initial implementation for translation of kernel function in GPU Dialect (used with a gpu.launch_kernel) op to a spv.Module. The original function is translated into an entry function. Most of the heavy lifting is done by adding TypeConversion and other utility functions/classes that provide most of the functionality to translate from Standard Dialect to SPIR-V Dialect. These are intended to be reusable in implementation of different dialect conversion pipelines. Note : Some of the files for have been renamed to be consistent with the norm used by the other Conversion frameworks. PiperOrigin-RevId: 260759165	2019-07-30 11:55:55 -07:00
Alex Zinenko	60965b4612	Move GPU dialect to {lib,include/mlir}/Dialect Per tacit agreement, individual dialects should now live in lib/Dialect/Name with headers in include/mlir/Dialect/Name and tests in test/Dialect/Name. PiperOrigin-RevId: 259896851	2019-07-25 00:41:17 -07:00
Mahesh Ravishankar	2ad92b6e50	Add a utility function to populate StdOp to SPIRV Conversion Patterns The function populateStdOpsToSPIRVPatterns appends the conversion patterns automatically generated from StdOpsToSPIRVConversion.td to a list of patterns PiperOrigin-RevId: 259677890	2019-07-23 22:38:51 -07:00
MLIR Team	8cb82c9478	Add sitofp to the standard dialect Conversion from integers (window or input size, padding etc) to floating point is required to express many ML kernels, for example average pooling. PiperOrigin-RevId: 259575284	2019-07-23 11:23:40 -07:00
River Riddle	3edbd8bf80	NFC: Update the LoopToStd conversion patterns to use RewritePattern instead of ConversionPattern. These patterns don't require type changes so they don't need to be using ConversionPattern. PiperOrigin-RevId: 259393151	2019-07-22 13:22:49 -07:00
River Riddle	00bdc8e070	Refactor region type signature conversion to be explicit via patterns. This cl enforces that the conversion of the type signatures for regions, and thus their entry blocks, is handled via ConversionPatterns. A new hook 'applySignatureConversion' is added to the ConversionPatternRewriter to perform the desired conversion on a region. This also means that the handling of rewriting the signature of a FuncOp is moved to a pattern. A default implementation is provided via 'mlir::populateFuncOpTypeConversionPattern'. This removes the hacky implicit 'dynamically legal' status of FuncOp that was present previously, and leaves it up to the user to decide when/how to convert the signature of a function. PiperOrigin-RevId: 259161999	2019-07-20 19:06:07 -07:00
Lei Zhang	9291868960	Place generated StandardOps to SPIR-V patterns in anonymous namespace This avoids polluting the mlir namespace. PiperOrigin-RevId: 258826497	2019-07-19 11:40:06 -07:00
River Riddle	8b447b6cad	NFC: Expose a ConversionPatternRewriter for use with ConversionPatterns. This specific PatternRewriter will allow for exposing hooks in the future that are only useful for the conversion framework, e.g. type conversions. PiperOrigin-RevId: 258818122	2019-07-19 11:40:00 -07:00
River Riddle	9e3c2650d2	Refactor the conversion of block argument types in DialectConversion. This cl begins a large refactoring over how signature types are converted in the DialectConversion infrastructure. The signatures of blocks are now converted on-demand when an operation held by that block is being converted. This allows for handling the case where a region is created as part of a pattern, something that wasn't possible previously. This cl also generalizes the region signature conversion used by FuncOp to work on any region of any operation. This generalization allows for removing the 'apply*Conversion' functions that were specific to FuncOp/ModuleOp. The implementation currently uses a new hook on TypeConverter, 'convertRegionSignature', but this should ideally be removed in favor of using Patterns. That depends on adding support to the PatternRewriter used by ConversionPattern to allow applying signature conversions to regions, which should be coming in a followup. PiperOrigin-RevId: 258645733	2019-07-19 11:38:45 -07:00
Nicolas Vasilache	0002e2964d	Move affine.for and affine.if to ODS As the move to ODS is made, body and region names across affine and loop dialects are uniformized. PiperOrigin-RevId: 258416590	2019-07-16 13:45:47 -07:00
River Riddle	2b9855b5b4	Refactor DialectConversion to support different conversion modes. Users generally want several different modes of conversion. This cl refactors DialectConversion to provide two: * Partial (applyPartialConversion) - This mode allows for illegal operations to exist in the IR, and does not fail if an operation fails to be legalized. * Full (applyFullConversion) - This mode fails if any operation is not properly legalized to the conversion target. This allows for ensuring that the IR after a conversion only contains operations legal for the target. PiperOrigin-RevId: 258412243	2019-07-16 13:45:41 -07:00
Lei Zhang	d36dd94c75	NFC: Move SPIR-V dialect to Dialect/ subdirectory PiperOrigin-RevId: 258345603	2019-07-16 13:45:09 -07:00
Nicolas Vasilache	e78ea03b24	Replace linalg.for by loop.for With the introduction of the Loop dialect, uses of the `linalg.for` operation can now be subsumed 1-to-1 by `loop.for`. This CL performs the replacement and tests are updated accordingly. PiperOrigin-RevId: 258322565	2019-07-16 13:44:57 -07:00
River Riddle	2087bf6386	Remove lowerAffineConstructs and lowerControlFlow in favor of providing patterns. These methods don't compose well with the rest of conversion framework, and create artificial breaks in conversion. Replace these methods with two(populateAffineToStdConversionPatterns and populateLoopToStdConversionPatterns respectively) that populate a list of patterns to perform the same behavior. PiperOrigin-RevId: 258219277	2019-07-16 13:44:45 -07:00
Alex Zinenko	ec82e1c907	Decouple LLVM dialect from Standard dialect Due to the absence of ODS support for enum attributes, the implementation of the LLVM dialect `icmp` operation was reusing the comparison predicate from the Standard dialect, creating an avoidable library dependency. With ODS support and ICmpPredicate attribute recently introduced, the dependency is no longer justified. Update the Standard to LLVM convresion to also convert the CmpIPredicate into LLVM::ICmpPredicate and remove the unnecessary includes. Note that the MLIRLLVMIR library did not explicitly depend on MLIRStandardOps, requiring dependees of MLIRLLVMIR to also depend on MLIRStandardOps, which should no longer be the case. PiperOrigin-RevId: 258148456	2019-07-16 13:43:31 -07:00
Nicolas Vasilache	cca53e8527	Extract std.for std.if and std.terminator in their own dialect These ops should not belong to the std dialect. This CL extracts them in their own dialect and updates the corresponding conversions and tests. PiperOrigin-RevId: 258123853	2019-07-16 13:43:18 -07:00
River Riddle	8e349a48b6	Remove the 'region' field from OpBuilder. This field wasn't updated as the insertion point changed, making it potentially dangerous given the multi-level of MLIR(e.g. 'createBlock' would always insert the new block in 'region'). This also allows for building an OpBuilder with just a context. PiperOrigin-RevId: 257829135	2019-07-12 17:42:41 -07:00
Nicolas Vasilache	cab671d166	Lower affine control flow to std control flow to LLVM dialect This CL splits the lowering of affine to LLVM into 2 parts: 1. affine -> std 2. std -> LLVM The conversions mostly consists of splitting concerns between the affine and non-affine worlds from existing conversions. Short-circuiting of affine `if` conditions was never tested or exercised and is removed in the process, it can be reintroduced later if needed. LoopParametricTiling.cpp is updated to reflect the newly added ForOp::build. PiperOrigin-RevId: 257794436	2019-07-12 08:44:28 -07:00
Alex Zinenko	2178467dca	LoopsToGPU: use PassRegistration with constructor PassRegistration with an optional constructor was introduced after the LoopsToGPUPass, which resorted to deriving one pass from another as a means of accepting options supplied as command-line arguments. Use PassRegistration with constructor instead of defining a derived pass for LoopsToGPU. Also rename the pass to better reflect its current nature. PiperOrigin-RevId: 257786923	2019-07-12 08:44:14 -07:00
River Riddle	9dbef0bf96	Rename FunctionAttr to SymbolRefAttr. This allows for the attribute to hold symbolic references to other operations than FuncOp. This also allows for removing the dependence on FuncOp from the base Builder. PiperOrigin-RevId: 257650017	2019-07-12 08:43:42 -07:00
River Riddle	6da343ecfc	NFC: Replace Module::getNamedFunction with lookupSymbol<FuncOp>. This allows for removing the last direct reference to FuncOp from ModuleOp. PiperOrigin-RevId: 257498296	2019-07-12 08:43:03 -07:00
River Riddle	b3e28fca53	NFC: Remove Function::getModule. There is already a more general 'getParentOfType' method, and 'getModule' is likely to be misused as functions get placed within different regions than ModuleOp. PiperOrigin-RevId: 257442243	2019-07-12 08:42:21 -07:00
River Riddle	fec20e590f	NFC: Rename Module to ModuleOp. Module is a legacy name that only exists as a typedef of ModuleOp. PiperOrigin-RevId: 257427248	2019-07-10 10:11:21 -07:00
River Riddle	6b6dc59f30	Update ModuleOp::create(...) to take a Location instead of a context. This allows for giving a Module a more interesting location than 'Unknown'. PiperOrigin-RevId: 257310117	2019-07-10 10:11:00 -07:00
River Riddle	8c44367891	NFC: Rename Function to FuncOp. PiperOrigin-RevId: 257293379	2019-07-10 10:10:53 -07:00
Alex Zinenko	80e2871087	Extend AffineToGPU to support Linalg loops Extend the utility that converts affine loop nests to support other types of loops by abstracting away common behavior through templates. This also slightly simplifies the existing Affine to GPU conversion by always passing in the loop step as an additional kernel argument even though it is a known constant. If it is used, it will be propagated into the loop body by the existing canonicalization pattern and can be further constant-folded, otherwise it will be dropped by canonicalization. This prepares for the common loop abstraction that will be used for converting to GPU kernels, which is conceptually close to Linalg loops, while maintaining the existing conversion operational. PiperOrigin-RevId: 257172216	2019-07-09 05:26:50 -07:00
River Riddle	626b8b6a5d	NFC: Remove `Module::getFunctions` in favor of a general `getOps<T>`. Modules can now contain more than just Functions, this just updates the iteration API to reflect that. The 'begin'/'end' methods have also been updated to iterate over opaque Operations. PiperOrigin-RevId: 257099084	2019-07-08 18:28:17 -07:00
Lei Zhang	891a7911c2	Add dependencies for standard ops to SPIR-V conversion PiperOrigin-RevId: 257026374	2019-07-08 12:40:21 -07:00
River Riddle	ce502af9cd	NFC: Remove the various "::getFunction" methods. These methods assume that a function is a valid builtin top-level operation, and removing these methods allows for decoupling FuncOp and IR/. Utility "getParentOfType" methods have been added to Operation/OpState to allow for querying the first parent operation of a given type. PiperOrigin-RevId: 257018913	2019-07-08 12:40:08 -07:00
Stephan Herhut	e8b21a75f8	Add an mlir-cuda-runner tool. This tool allows to execute MLIR IR snippets written in the GPU dialect on a CUDA capable GPU. For this to work, a working CUDA install is required and the build has to be configured with MLIR_CUDA_RUNNER_ENABLED set to 1. PiperOrigin-RevId: 256551415	2019-07-04 07:53:54 -07:00
Stephan Herhut	1bcaa3185d	Add missing mlir:: namespace in definition of createConvertToLLVMIRPass. PiperOrigin-RevId: 256546769	2019-07-04 07:53:31 -07:00
Alex Zinenko	9a1b6fec79	Make ConvertStandardToLLVMPass extendable with other patterns Extend the LLVM lowering pass to accept callbacks that construct an instance of (a subclass of) LLVMTypeConverter and populate a list of conversion patterns. These callbacks will be called when the pass processes a module and their results will be used to set up the dialect conversion infrastructure. Clients can now provide additional conversion patterns to avoid the need of materializing type conversions between LLVM and other types. PiperOrigin-RevId: 256532415	2019-07-04 07:53:19 -07:00
Lei Zhang	0782b37936	NFC: Move Standard to SPIR-V conversion to lib/Conversion PiperOrigin-RevId: 256271759	2019-07-03 14:35:42 -07:00
River Riddle	206e55cc16	NFC: Refactor Module to be value typed. As with Functions, Module will soon become an operation, which are value-typed. This eases the transition from Module to ModuleOp. A new class, OwningModuleRef is provided to allow for owning a reference to a Module, and will auto-delete the held module on destruction. PiperOrigin-RevId: 256196193	2019-07-02 16:43:36 -07:00
River Riddle	54cd6a7e97	NFC: Refactor Function to be value typed. Move the data members out of Function and into a new impl storage class 'FunctionStorage'. This allows for Function to become value typed, which will greatly simplify the transition of Function to FuncOp(given that FuncOp is also value typed). PiperOrigin-RevId: 255983022	2019-07-01 11:39:00 -07:00
Alex Zinenko	d046b2ddec	Expose AffineToGPUPass for use with PassManager Originally, AffineToGPUPass was created and registered in the source file mainly for testing purposes. Provide a factory function that constructs AffineToGPU pass to make it usable in pass pipelines. PiperOrigin-RevId: 255902831	2019-07-01 09:55:24 -07:00
Stephan Herhut	630119f84f	Add a pass that inserts getters for all cubins found via nvvm.cubin annotations. Getters are required as there are currently no global constants in MLIR and this is an easy way to unblock CUDA execution while waiting for those. PiperOrigin-RevId: 255169002	2019-06-26 05:33:11 -07:00
Stephan Herhut	c72c6c3907	Make GPU to CUDA transformations independent of CUDA runtime. The actual transformation from PTX source to a CUDA binary is now factored out, enabling compiling and testing the transformations independently of a CUDA runtime. MLIR has still to be built with NVPTX target support for the conversions to be built and tested. PiperOrigin-RevId: 255167139	2019-06-26 05:16:37 -07:00
River Riddle	a4c3a6455c	Move the emitError/Warning/Remark utility methods out of MLIRContext and into the mlir namespace. Now that Locations are attributes, they have direct access to the MLIR context. This allows for simplifying error emission by removing unnecessary context lookups. PiperOrigin-RevId: 255112791	2019-06-25 21:32:23 -07:00
Alex Zinenko	2628641b23	GPUtoNVVM: adjust integer bitwidth when lowering special register ops GPU dialect operations (launch and launch_func) use `index` type for thread and block index values inside the kernel, for compatibility with affine loops. NVVM dialect operations, following the NVVM intrinsics, use `!llvm.i32` type, which does not necessarily have the same bit width as the lowered `index` type. Optionally sign-extend (indices are signed) or truncate the result of the NVVM dialect operation to the bit width of the lowered `index` type before passing it to other operations. This behavior is consistent with `std.index_cast`. We cannot use the latter since we are targeting LLVM dialect types directly, rather than standard integer types. PiperOrigin-RevId: 254980868	2019-06-25 09:21:26 -07:00
Stephan Herhut	10f320f7c0	Add gpu::GPUDialect::isKernel helper. Also some mild cleanup of the kernel to cubin conversion pass. PiperOrigin-RevId: 254959303	2019-06-25 09:20:40 -07:00
Alex Zinenko	f35d0c8570	NVVM target: emit nvvm.annotations for kernel functions PTX backend in LLVM expects additional module-level metadata `!nvvm.annotations` that lists functions that can be used as GPU kernels. Generate this metadata based on the `gpu.kernel` attribute attached to functions. This attribute is added automatically by the kernel outlining pass in the GPU dialect lowering flow. PiperOrigin-RevId: 254957345	2019-06-25 09:19:27 -07:00
River Riddle	9764ae3f24	Refactor the TypeConverter to support more robust type conversions: * Support for 1->0 type mappings, i.e. when the argument is being removed. * Reordering types when converting a type signature. * Adding new inputs when converting a type signature. This cl also lays down the initial foundation for supporting 1->N type mappings, but full support will come in a followup. Moving forward, function signature changes will be driven by populating a SignatureConversion instance. This class contains all of the necessary information for adding/removing/remapping function signatures; e.g. addInputs, addResults, remapInputs, etc. PiperOrigin-RevId: 254064665	2019-06-19 23:08:33 -07:00
Stephan Herhut	9d81081d90	Add a pass that translates GPU.launch_func into a series of runtime calls. This does not map the calls to the CUDA libary directly but uses a slim wrapper ABI on top that has more convenient types for code generation and is stable. Such ABI is expected to be provided by the actual runner. PiperOrigin-RevId: 253983833	2019-06-19 23:07:43 -07:00
Alex Zinenko	14e2f4a22b	Fix GPUToNVVM naming: NNVM should have been NVVM Rename `createLowerGpuOpsToNNVMOpsPass` to `createLowerGpuOpsToNVVMOpsPass`. PiperOrigin-RevId: 253801577	2019-06-19 23:06:36 -07:00
Alex Zinenko	b9beff0384	Make examples/Linalg3 depend on the new standard to LLVM conversion library. PiperOrigin-RevId: 253767820	2019-06-19 23:05:57 -07:00
Stephan Herhut	e0596a4d63	Use llvm::StringSwitch in lowering of GPU ops to NVVM ops. PiperOrigin-RevId: 253767688	2019-06-19 23:05:48 -07:00
Stephan Herhut	893374bfa2	Add a pass that translates a CUDA kernel function (tagged with nvvm.kernel) to a CUBIN blob for execution on CUDA GPUs. This is a first in a series of patches to build a simple CUDA runner to allow experimenting with MLIR code on GPUs. PiperOrigin-RevId: 253758915	2019-06-19 23:05:37 -07:00
Alex Zinenko	f218519cc2	Introduce std.index_cast and its lowering+translation to LLVM Index types integers of platform-specific bit width. They are used to index memrefs and as loop induction variables, however they could not be obtained from an integer until now, making it virtually impossible to express indirect accesses (given that memrefs of indices are not allowed) or data-dependent loops. Introduce `std.index_cast` to transform indices into integers and vice versa. The semantics of this cast is to sign-extend when casting to a wider integer, and to truncate when casting to a narrower integer. It belongs to StandardOps because both types it operates on are standard types, and because its results are likely to be used in std.load and std.store. Introduce llvm.sext, llvm.zext and llvm.trunc operations to the LLVM dialect. Provide the conversion of `std.index_cast` to llvm.sext or llvm.trunc, depending on the actual bitwidth of `index` known during the conversion. PiperOrigin-RevId: 253624100	2019-06-19 23:04:01 -07:00
Alex Zinenko	4291ae7431	Factor Region::getUsedValuesDefinedAbove into Transforms/RegionUtils Arguably, this function is only useful for transformations and should not pollute the main IR. Also make sure it accepts a the resulting container by-reference instead of returning it. PiperOrigin-RevId: 253622981	2019-06-19 23:03:51 -07:00

1 2

53 Commits