This simplifies defining expected-* directives when multiple directives apply to the next or previous line. @below applies the directive to the next non-designator line, i.e. the next line that does not contain an expected-* designator. @above applies to the previous non-designator line.
Examples:
// Expect a remark on the next line that does not contain a designator.
// expected-remark@below {{remark on function below}}
// expected-remark@below {{another remark on function below}}
func @bar(%a : f32)
// Expect a remark on the previous line that does not contain a designator.
func @baz(%a : f32)
// expected-remark@above {{remark on function above}}
// expected-remark@above {{another remark on function above}}
PiperOrigin-RevId: 276369085
The ExecutionEngine was updated recently to only take the LLVM dialect as
input. Memrefs are no longer expected in the signature of the entry point
function by the executor so there is no need to allocate and free them. The
code in MemRefUtils is therefore dead and furthermore out of sync with the
recent evolution of memref type to support strides. Drop it.
PiperOrigin-RevId: 276272302
Previously DRR assumed attributes to appear after operands. This matched the
previous requirement in ODS, but that changed some time ago. Fix
DRR to also support interleaved operands and attributes.
PiperOrigin-RevId: 275983485
We will use block arguments as the way to model SPIR-V OpPhi in
the SPIR-V dialect.
This CL also adds a few useful helper methods to both ops to
get the block arguments.
Also added tests for branch weight (de)serialization.
PiperOrigin-RevId: 275960797
nvvm.shfl.sync.bfly optionally returns a predicate indicating whether the source lane was active. Support for this was added to clang in https://reviews.llvm.org/D68892.
Add an optional 'pred' unit attribute to the instruction to return this predicate. Specify this attribute in the partial warp reduction so we don't need to manually compute the predicate.
PiperOrigin-RevId: 275616564
Refactor the implementation to be much cleaner by adding a `make_second_range` utility that walks the `second` value of a range of pairs.
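A minimal sketch of what such a utility can look like, built on llvm::map_range; this is illustrative, not necessarily the actual implementation:
```
#include "llvm/ADT/STLExtras.h"
#include <utility>

// Adapts a range of pairs into a range over each pair's `second` member.
template <typename RangeT>
auto make_second_range(RangeT &&range) {
  return llvm::map_range(
      std::forward<RangeT>(range),
      [](auto &pair) -> decltype((pair.second)) { return pair.second; });
}
```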
PiperOrigin-RevId: 275598985
This allows dialect-specific attributes to be attached to func results (or, more specifically, to results of FunctionLike ops).
For example:
```
func @f() -> (i32 {my_dialect.some_attr = 3})
```
This attaches my_dialect.some_attr with value 3 to the first result of func @f.
Another more complex example:
```
func @g() -> (i32, f32 {my_dialect.some_attr = "foo", other_dialect.some_other_attr = [1,2,3]}, i1)
```
Here, the second result has two attributes attached.
PiperOrigin-RevId: 275564165
This allows mixing linalg operations with vector transfer operations (with additional modifications to affine ops) and is a step towards solving tensorflow/mlir#189.
PiperOrigin-RevId: 275543361
A VectorTypeCastOp can only be used to lower between statically sized contiguous memrefs of scalar type and memrefs of a matching vector type. The sizes and strides are thus fully static and easy to determine.
A relevant test is added.
This is a step towards solving tensorflow/mlir#189.
PiperOrigin-RevId: 275538981
This CL adds support for loop.for operations in EDSC and adds a test.
This will be used in a followup commit to implement lowering of vector_transfer ops so that it works more generally and is not subject to affine constraints.
PiperOrigin-RevId: 275349796
This CL creates a new Linalg promotion pass that operates on SubViewOp and decouples it from Linalg tiling. This is mostly moving code around.
PiperOrigin-RevId: 275329213
Add a canonicalization pattern for spv.selection operation.
Convert spv.selection operation to spv.Select based on
simple pattern.
Closes tensorflow/mlir#183
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/183 from denis0x0D:sandbox/canon_select 43d04d923272dd60b9da39f70bdbc51a5168db62
PiperOrigin-RevId: 275312748
'_' is used frequently enough as a word separator in symbols.
We should allow it in dialect symbols when considering pretty printing.
Also updated LangRef.md regarding pretty form.
PiperOrigin-RevId: 275312494
Previously when we bind a symbol to an op in DRR, it means to capture
the op's result(s) and later references will be expanded to result(s).
This means for ops without result, we are replacing the symbol with
nothing. This CL treats non-result op capturing and referencing as a
special case to mean the op itself.
PiperOrigin-RevId: 275269702
It's usually hard to understand what went wrong if mlir-tblgen
crashes on some input. This CL adds a few useful LLVM_DEBUG
statements so that we can use mlir-tblgen -debug to figure
out the culprit for a crash.
PiperOrigin-RevId: 275253532
We just need to implement a few interface hooks to DialectInlinerInterface
and CallOpInterface to gain the benefits of an inliner. :)
Right now only supports some trivial cases:
* Inlining single block with spv.Return/spv.ReturnValue
* Inlining multi block with spv.Return
* Inlining spv.selection/spv.loop without return ops
More advanced cases will require block argument and Phi support.
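As a rough sketch (assumed shapes, not the actual code), the dialect-side hook can be as small as the following; a real implementation would encode the legality cases listed above instead of returning true unconditionally:
```
#include "mlir/Transforms/InliningUtils.h"

struct SPIRVInlinerInterface : public mlir::DialectInlinerInterface {
  using DialectInlinerInterface::DialectInlinerInterface;

  // Placeholder: report whether `src` may be inlined into `dest`.
  bool isLegalToInline(mlir::Region *dest, mlir::Region *src,
                       mlir::BlockAndValueMapping &valueMapping) const final {
    return true;
  }
};
```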
PiperOrigin-RevId: 275151132
This Chapter now introduces and makes use of the Interface concept
in MLIR to demonstrate ShapeInference.
Closes tensorflow/mlir#191
PiperOrigin-RevId: 275085151
Makes the spv.module generated by the GPU to SPIR-V conversion SPIR-V
spec compliant (validated using spirv-val from Vulkan tools).
1) Separate out the VulkanLayoutUtils from
DecorateSPIRVCompositeTypeLayoutPass to make it reusable within the
Type converter in SPIR-V lowering infrastructure. This is used to
compute the layout of the !spv.struct used in global variable type
description.
2) Set the capabilities of the spv.module to Shader (needed for use of
the Logical memory model) and its extensions to
SPV_KHR_storage_buffer_storage_class (needed for use of the StorageBuffer
storage class)
PiperOrigin-RevId: 275081486
In addition to specifying the type of accumulation through the 'op' attribute, the accumulation can now also be specified as arbitrary code region.
Adds a gpu.yield op to specify the result of the accumulation.
Also support more types (integers) and accumulations (mul).
PiperOrigin-RevId: 275065447
The current SignatureConversion framework (part of DialectConversion)
allows remapping input arguments to a function from 1->0, 1->1 or
1->many arguments during conversion. Another case is where the
argument itself is dropped, but its uses are remapped to another
Value*.
An example of this is: The Vulkan/SPIR-V spec requires entry functions
to be of type void(void). The GPU -> SPIR-V conversion implemented
this without having the DialectConversion framework track the
remapping, which led to some undefined behavior. The changes here
address that.
PiperOrigin-RevId: 275059656
b843cc5d5a introduced a new op LICM transformation and a LoopLike interface,
but missed the CMake aspects of it. This should fix the build.
PiperOrigin-RevId: 275038533
The SpecId decoration is the handle for providing external specialization.
Similar to descriptor set and binding on global variables, we directly
bake it into assembly parsing and printing.
PiperOrigin-RevId: 274893879
This CL adds a missing lowering for splat of multi-dimensional vectors.
Additional support is also added to the runtime utils library to allow printing memrefs with such vectors.
PiperOrigin-RevId: 274794723
Python bindings currently provide a makeScalarType function that
constructs one of the predefined types. It was implemented in the bindings
directly to circumvent the absence of a standalone type parsing function. Now
that mlir::parseType has been made available, rely on the core parsing
procedure to construct types from strings in the bindings.
This change includes a library reshuffling that splits out "CoreAPIs"
implementing the binding helper APIs into a separate library and makes that
dependent on the Parser library.
PiperOrigin-RevId: 274794516
The value defined in a loop was not being used and the function producing it
re-evaluated instead. Use the value to avoid both the warning and the
re-evaluation.
PiperOrigin-RevId: 274794459
When dealing with regions, or other patterns that need to generate temporary operations, it is useful to be able to replace operations other than the root op being matched. Before this PR, these operations would still be considered for legalization, meaning that the conversion would either fail, erroneously need to mark these ops as legal, or add unnecessary patterns.
PiperOrigin-RevId: 274598513
When the implementation of the strided memref [RFC](https://groups.google.com/a/tensorflow.org/forum/#!msg/mlir/MaL8m2nXuio/1scRqZa6AQAJ) landed, linalg started using this type instead of the now retired !linalg.view.
As static and partially static cases appear, the stride information needs to be maintained properly. In particular, the result type of the subview op was generally incorrect.
This CL fixes the issue by computing a return type that:
1. always has dynamic sizes, which is generally the only correct way to construct a subview in the absence of data padding and/or code versioning.
2. has the same strides as the base strided memref.
Point 1. above can be further refined but will need further analysis and canonicalization to optimize the particular case where:
1. The base memref has static size along a given dimension.
2. The subview size can be statically derived (e.g. after canonicalization).
3. *And* the subview size is an even divisor of the base memref size.
This 3rd constraint is well-known in the case of tiled layouts that don't assume implicit padding: the boundary tile may be only partial and has size given by `problem_size % tile_size`.
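For concreteness, the boundary-tile arithmetic is (a sketch; the helper name is made up):
```
#include <cstdint>

// Full tiles have size `tileSize`; the boundary tile has size
// `problemSize % tileSize` when the division is not even.
int64_t boundaryTileSize(int64_t problemSize, int64_t tileSize) {
  int64_t rem = problemSize % tileSize;
  return rem == 0 ? tileSize : rem;
}
```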
Tests are updated as appropriate.
PiperOrigin-RevId: 274578624
This fixes an omission that prevented Linalg from lowering generic ops whose regions operate on ops in the VectorOps dialect.
To achieve this we simply need to `populateVectorToLLVMConversionPatterns` in the conversion.
Relevant tests are added.
PiperOrigin-RevId: 274577325
Originally, the lowering of `alloc` operations has been computing the number of
bytes to allocate when lowering based on the properties of MLIR type. This does
not take into account type legalization that happens when compiling LLVM IR
down to target assembly. This legalization can widen the type, potentially
leading to out-of-bounds accesses to `alloc`ed data due to mismatches between
address computation that takes the widening into account and allocation that
does not. Use the LLVM IR's equivalent of `sizeof` to compute the number of
bytes to be allocated:
```
%0 = getelementptr %type* null, %indexType 0
%1 = ptrtoint %type* %0 to %indexType
```
adapted from
http://nondot.org/sabre/LLVMNotes/SizeOf-OffsetOf-VariableSizedStructs.txt
PiperOrigin-RevId: 274159900
Similarly to `llvm.mlir.undef`, this auxiliary operation creates an SSA value
that corresponds to `null` in LLVM IR. This operation is necessary to model
sizeof(<...>) behavior when allocating memory.
PiperOrigin-RevId: 274158760
- dropping what looks like outdated code left over from some of the previous
updates
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#179
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/179 from bondhugula:llfix 2a72ea441fe1b3924802273ffbe9870afeb90f91
PiperOrigin-RevId: 274158273
On failure, the IR is likely to be in an invalid state, meaning the custom printer for some operations may now crash. Using the generic op form prevents this from happening.
PiperOrigin-RevId: 274104146
This CL adds support for generating a .mlir file containing a reproducer for crashes and failures that happen during pass execution. The reproducer contains a comment detailing the configuration of the pass manager (e.g. the textual description of the pass pipeline that the pass manager was executing), along with the original input module.
Example Output:
```
// configuration: -pass-pipeline='func(cse, canonicalize), inline'
// note: verifyPasses=false
module {
  ...
}
```
PiperOrigin-RevId: 274088134
In Standard to LLVM dialect conversion, the binary op conversion pattern
implicitly assumed some operands were of LLVM IR dialect type. This is not
necessarily true, for example if the ops that produce those operands did not
match the existing conversion patterns. Check if all operands are of LLVM IR
dialect type and, if not, fail to match the binary op pattern.
Closes tensorflow/mlir#168
PiperOrigin-RevId: 274063207
Translation to LLVM expects the entry module to have only specific types of ops
that correspond to LLVM IR entities allowed in a module. Currently those are
restricted to functions and globals. Introduce an additional check at the
module level. Inside individual functions, the check for supported Ops is
already performed, but it accepts all LLVM dialect Ops and wouldn't be
immediately applicable at the module level.
PiperOrigin-RevId: 274058651
The lowering is specified as a pattern and is done only if the result
is a SPIR-V scalar type or vector type.
ConstantOp with an index return type needs special handling
since the SPIR-V dialect does not have index types. Based on the bitwidth
of the attribute value, either i32 or i64 is chosen.
Other constant lowerings are left as a TODO.
PiperOrigin-RevId: 274056805
This will allow for inlining newly devirtualized calls, as well as give a more accurate cost model (when we have one). Currently canonicalization will only run for nodes that have no child edges, as the child nodes may be erased during canonicalization. We can support this in the future, but it requires more intricate deletion tracking.
PiperOrigin-RevId: 274011386
When an operation with regions gets replaced, we currently require that all of the remaining nested operations are still converted even though they are going to be replaced when the rewrite is finished. This CL adds tracking for a minimal set of operations that are known to be "dead". This allows for ignoring the legalization of operations that won't survive after conversion.
PiperOrigin-RevId: 274009003
This function-like operation allows one to define functions that have wrapped
LLVM IR function type, in particular variadic functions. The operation was
added in parallel to the existing lowering flow; this commit only switches the
flow to use it.
Using a custom function type makes the LLVM IR dialect type system more
consistent and avoids complex conversion rules for functions that previously
had to use the built-in function type instead of a wrapped LLVM IR dialect type
and perform conversions during the analysis.
PiperOrigin-RevId: 273910855
Allow printing out pipelines in a format that is as close as possible to the
textual pass pipeline format. Individual passes can override the print function
in order to format any options that may have been used to construct that pass.
PiperOrigin-RevId: 273813627
The lowering infrastructure needs to be enhanced to lower into a
spv.Module that is consistent with the SPIR-V spec. The following
changes are needed:
1) The Vulkan/SPIR-V validation rules dictate that entry functions have a
signature of void(void). This requires changes to the function
signature conversion infrastructure within the dialect conversion
framework. When an argument is dropped from the original function
signature, a function can be specified that when invoked will return
the value to use as a replacement for the argument from the original
function.
2) Some changes to the type converter to make the converted type
consistent with the Vulkan/SPIR-V validation rules:
a) Add support for converting dynamically shaped tensors to
spv.rtarray type.
b) Make the global variable of type !spv.ptr<!spv.struct<...>>
3) Generate the entry point operation for the kernel functions and
automatically compute all the interface variables needed
PiperOrigin-RevId: 273784229
This PR is a stepping stone towards supporting generic multi-store
source loop nests in affine loop fusion. It extends the algorithm to
support fusion of multi-store loop nests that:
1. have only one store that writes to a function-local live-out, and
2. have remaining stores that are involved in loop nest self dependences
or have no dependences within the function.
Closes tensorflow/mlir#162
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/162 from dcaballe:dcaballe/multi-output-fusion 7fb7dec6fe8b45f5ce176f018bfe37b256420c45
PiperOrigin-RevId: 273773907
Currently the SameOperandsAndResultShape trait allows operands to have tensor<*xf32> and tensor<2xf32> but doesn't allow tensor<?xf32> and tensor<10xf32>.
Also, use the updated shape compatibility helper function in the TensorCastOp::areCastCompatible method.
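A sketch of the relaxed compatibility rule (the helper name is hypothetical):
```
#include "mlir/IR/StandardTypes.h"
#include "llvm/ADT/STLExtras.h"

// Shapes are compatible if either side is unranked, or ranks match and
// each dimension pair is equal or has at least one dynamic size.
static bool areShapesCompatible(mlir::ShapedType a, mlir::ShapedType b) {
  if (!a.hasRank() || !b.hasRank())
    return true;
  if (a.getRank() != b.getRank())
    return false;
  for (auto dims : llvm::zip(a.getShape(), b.getShape())) {
    int64_t lhs = std::get<0>(dims), rhs = std::get<1>(dims);
    if (lhs != rhs && !mlir::ShapedType::isDynamic(lhs) &&
        !mlir::ShapedType::isDynamic(rhs))
      return false;
  }
  return true;
}
```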
PiperOrigin-RevId: 273658336
This enhances the symbol table utility methods to handle the case where an unknown operation may define a symbol table. When walking symbols, we now collect all symbol uses before allowing the user to iterate. This prevents the user from assuming that all symbols are actually known before performing a transformation.
PiperOrigin-RevId: 273651963
This allows individual passes to define options structs and for these options to be parsed per instance of the pass while building the pass pipeline from the command line provided textual specification.
The user can specify these per-instance pipeline options like so:
```
struct MyPassOptions : public PassOptions<MyPassOptions> {
  Option<int> exampleOption{*this, "flag-name", llvm::cl::desc("...")};
  List<int> exampleListOption{*this, "list-flag-name", llvm::cl::desc("...")};
};
static PassRegistration<MyPass, MyPassOptions> pass("my-pass", "description");
```
PiperOrigin-RevId: 273650140
The restriction that symbols can only have identifier names is arbitrary, and artificially limits the names that a symbol may have. This change adds support for parsing and printing symbols that don't fit in the 'bare-identifier' grammar by printing the reference in quotes, e.g. @"0_my_reference" can now be used as a symbol name.
PiperOrigin-RevId: 273644768
This matches what the runtime library expects.
Closes tensorflow/mlir#171
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/171 from deven-amd:deven-rocdl-device-func-i64 80762629a8c34e844ebdc542b34dd783990db9db
PiperOrigin-RevId: 273640767
Add a pass to decorate the composite types used by
composite objects in the StorageBuffer, PhysicalStorageBuffer,
Uniform, and PushConstant storage classes with layout information.
Closes tensorflow/mlir#156
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/156 from denis0x0D:sandbox/layout_info_decoration 7c50840fd38ca169a2da7ce9886b52b50c868b84
PiperOrigin-RevId: 273634140
This is similar to the `inlineRegionBefore` hook, except the original blocks are unchanged. The region to be cloned *must* not have been modified during the conversion process at the point of cloning, i.e. it must belong to an operation that has yet to be converted, or the operation that is currently being converted.
PiperOrigin-RevId: 273622533
- bodies would previously appear in the order (i, i+3, i+2, i+1) instead of
(i, i+1, i+2, i+3), for example, for factor 4.
- clean up hardcoded test cases
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#170
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/170 from bondhugula:ujam b66b405b2b1894a03b376952e32a9d0292042665
PiperOrigin-RevId: 273613131
MLIR uses symbol references to model references to many global entities, such as functions/variables/etc. Before this change, there was no way to actually reason about the uses of such entities. This change provides a walker for symbol references (via SymbolTable::walkSymbolUses), as well as 'use_empty' support (via SymbolTable::symbol_use_empty). It also resolves some deficiencies in the LangRef definition of SymbolRefAttr, namely the restrictions on where a SymbolRefAttr can be stored, ArrayAttr and DictionaryAttr, and the relationship with operations containing the SymbolTable trait.
PiperOrigin-RevId: 273549331
During the conversion, both the original and the converted function may coexist
in the module and have the same symbol name. There is no guarantee which of the
two will be found by the symbol lookup. Avoid returning the result of the
library function lookup when lowering Linalg to Standard or LLVM. Use the
symbol reference instead. After the conversion completes, only one symbol will
remain and the Ops using SymbolRefAttrs will be referring to the correct one.
PiperOrigin-RevId: 273510079
Originally, we were attaching attributes containing CUBIN blobs to the kernel
function called by `gpu.launch_func`. This kernel is now contained in a nested
module that is used as a compilation unit. Attach compiled CUBIN blobs to the
module rather than to the function since we were compiling the module. This
also avoids duplication of the attribute on multiple kernels within the same
module.
PiperOrigin-RevId: 273497303
Originally, the CUBIN getter function was introduced as a mechanism to
circumvent the absence of globals in the LLVM dialect. It would allocate memory
and populate it with the CUBIN data. LLVM dialect now supports globals and they
are already used to store CUBIN data, making the getter function a trivial
address computation of a global. Emit the address computation directly at the
place of `gpu.launch_func` instead of putting it in a function and calling it.
This simplifies the conversion flow and prepares it for using the
DialectConversion infrastructure.
PiperOrigin-RevId: 273496221
Now that the accessor function is a trivial getter of the global variable, it
makes less sense to have the getter generation as a separate pass. Move the
getter generation into the lowering of `gpu.launch_func` to CUDA calls. This
change is mostly code motion, but the process can be simplified further by
generating the addressof in place instead of using a call. This will be done
in a follow-up.
PiperOrigin-RevId: 273492517
The kernel function called by gpu.launch_func is now placed into an isolated
nested module during the outlining stage to simplify separate compilation.
Until recently, modules did not have names and could not be referenced. This
limitation was circumvented by introducing a stub kernel with the same name at
the same nesting level as the module containing the actual kernel. This
relation is only effective in one direction: from actual kernel function to its
launch_func "caller".
Leverage the recently introduced symbol name attributes on modules to refer to
a specific nested module from `gpu.launch_func`. This removes the implicit
connection between the identically named stub and kernel functions. It also
enables support for `gpu.launch_func`s to call different kernels located in the
same module.
PiperOrigin-RevId: 273491891
Some modules may have extremely large ElementsAttrs, which makes debugging involving IR dumping extremely slow and painful. This change adds a flag that will elide ElementsAttrs with a "large" (as defined by the user) number of elements by printing "..." instead of the element data.
PiperOrigin-RevId: 273413100
Since MLIR integer types don't make a distinction between signed vs
unsigned integers, during deserialization of SPIR-V binaries, the
OpBitcast might result in a cast from/to the same type. Do not add a
spv.Bitcast operation to the spv.module in these cases.
PiperOrigin-RevId: 273381887
This allows for controlling the behavior of the AsmPrinter programmatically, instead of relying exclusively on cl::opt flags. This will also allow for more fine-tuned control of printing behavior per callsite, instead of being applied globally.
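For example, a callsite can now opt in programmatically (a sketch assuming the OpPrintingFlags-style API described here):
```
#include "mlir/IR/Operation.h"
#include "llvm/Support/raw_ostream.h"

void printWithElision(mlir::Operation *op) {
  // Configure printing at the callsite instead of via global cl::opt flags.
  mlir::OpPrintingFlags flags;
  flags.elideLargeElementsAttrs(/*largeElementLimit=*/16);
  op->print(llvm::errs(), flags);
}
```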
PiperOrigin-RevId: 273368361
The SPIR-V spec recommends all OpUndef instructions be generated at
module level. For the SPIR-V dialect it's better for UndefOp to produce
an SSA value for use with other instructions. If UndefOp is to be used
at module level, it cannot produce an SSA value (use of this SSA value
within FuncOp would need implicit capture). To satisfy needs of the
SPIR-V spec while making it simpler to represent UndefOp in the SPIR-V
dialect, the serialization is updated to create OpUndef instructions
at module scope.
PiperOrigin-RevId: 273355526
The structured selection/loop's entry block does not have arguments.
If the function's header block is also part of the structured control
flow, we cannot just simply erase it because it may contain arguments
matching the function signature and used by the cloned blocks. Instead,
turn it into a block only containing a spv.Branch op.
Also, we can directly emit instructions for the spv.selection header
block to the block containing the spv.selection op. This eliminates
unnecessary branches in the SPIR-V blob.
Added a test for nested spv.loop.
PiperOrigin-RevId: 273351424
Now that linalg.view and strided memrefs are unified, there is no reason to
disallow AllocOp in alias analysis. This CL adds support for AllocOp, which allows writing shorter tests that do not require explicitly creating a view for
each operation.
PiperOrigin-RevId: 273303060
Add a new `typeDescription` field (`description` was already used by the base constraint class) to types, to allow writing longer descriptions about a type being defined. This allows for providing additional information/rationale for a defined type. This currently uses `description` as the heading/name for the type in the generated documentation.
PiperOrigin-RevId: 273299332
See RFC: https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/xE2IzfhE3Wg.
OpaqueLoc stores two pointers: one points to a data structure that is external to MLIR, and the other is unique for each type and represents the type id of that data structure. OpaqueLoc also stores an optional location that can be used if the first one is not suitable.
OpaqueLoc is managed similarly to FileLineColLoc. It is passed around by MLIR transformations and can be used in compound locations like CallSiteLoc.
PiperOrigin-RevId: 273266510
This allows confirming that a scalar argument has the same element type as a shaped one. It's easy to validate a type is shaped on its own if that's desirable, so this shouldn't make that use case harder. This matches the behavior of other traits that operate on element type (e.g. AllElementTypesMatch). Also this makes the code simpler because now we just use getElementTypeOrSelf.
Verified that all uses in core already check the type is shaped in another way.
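The core of the check then reduces to something like this (a sketch; `sameElementType` is a made-up name):
```
#include "mlir/IR/TypeUtilities.h"

// getElementTypeOrSelf returns the element type for shaped types and the
// type itself otherwise, so scalar and shaped operands compare uniformly.
static bool sameElementType(mlir::Type a, mlir::Type b) {
  return mlir::getElementTypeOrSelf(a) == mlir::getElementTypeOrSelf(b);
}
```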
PiperOrigin-RevId: 273068507
Use `getParentOfType<FunctionOp>()` instead of `cast<FuncOp>(getParentOp())`
to avoid crash when return ops are used inside spv.selection/spv.loop.
PiperOrigin-RevId: 273006041
Adding support for OpUndef instruction. Updating the dialect
generation script to fix a few bugs in the instruction spec
generation.
PiperOrigin-RevId: 272975685
Add builder functions for spv._address_of, spv.EntryPoint,
spv.ExecutionMode and spv.Load to make it easier to create these
operations.
Fix a minor bug in printing of spv.EntryPoint
Add a utility function to get the attribute name associated with a
decoration.
PiperOrigin-RevId: 272952846
Certain lowering patterns were reported as [missing](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/dkdmHa77sSQ).
This CL adds them and allows Linalg/roundtrip.mlir and Linalg/loops.mlir to lower to LLVM directly. Those 2 tests are updated to additionally check that the direct lowering to LLVM does not crash.
The following points, left as TODOs still need to be addressed for correct end-to-end execution:
1. the lowering for ConvOp needs to pass attributes such as strides and dilations; the external library call needs to support it.
2. the lowering for GenericOp needs to support lowering to loops as a DialectConversion pattern. This is blocked on the DialectConversion infrastructure accepting an OperationFolder.
PiperOrigin-RevId: 272878131
The GPUIndexIntrinsicOpLowering template is currently used by the code in both the GPUToNVVM and GPUToROCDL dirs.
Moving it to a common location to remove code duplication.
Closes tensorflow/mlir#163
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/163 from deven-amd:deven-refactor-gpu-index-ops-lowering b8dc2a5f5353df196039b6ff2ad42106028693ed
PiperOrigin-RevId: 272863297
Some dialects have implicit conversions inherent in their modeling, meaning that a call may have a different type than the type that the callable expects. To support this, a hook is added to the dialect interface that allows for materializing conversion operations during inlining when there is a mismatch. A hook is also added to the callable interface to allow for introspecting the expected result types.
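A hedged sketch of such a hook; the signature and `MyCastOp` are assumptions based on the description above, not the actual API:
```
#include "mlir/Transforms/InliningUtils.h"

struct MyDialectInlinerInterface : public mlir::DialectInlinerInterface {
  using DialectInlinerInterface::DialectInlinerInterface;

  // Materialize an explicit conversion op when a call operand's type does
  // not match what the callee expects. `MyCastOp` is a placeholder.
  mlir::Operation *materializeCallConversion(mlir::OpBuilder &builder,
                                             mlir::Value *input,
                                             mlir::Type resultType,
                                             mlir::Location loc) const final {
    return builder.create<MyCastOp>(loc, resultType, input);
  }
};
```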
PiperOrigin-RevId: 272814379
This allows for the inliner to work on arbitrary call operations. The updated inliner will also work bottom-up through the callgraph enabling support for multiple levels of inlining.
PiperOrigin-RevId: 272813876
The first dim length of the axisStats attribute should equal the slice size
of the input argument when split along the axis dimension.
PiperOrigin-RevId: 272798042
This CL implements the last remaining bit of the [strided memref proposal](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
The syntax is a bit more explicit than what was originally proposed and resembles:
`memref<?x?xf32, offset: 0 strides: [?, 1]>`
Nonnegative strides and offsets are currently supported. Future extensions will include negative strides.
This also gives a concrete example of syntactic sugar for the [RFC: Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/-wKHANzDNTg).
The underlying implementation still uses AffineMap layout.
PiperOrigin-RevId: 272717437
Module names are optional, so it makes more sense to take and return an optional
any time the name is involved. Also update the language reference to reflect
the module names.
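For illustration (accessor spelling assumed), callers now handle the unnamed case explicitly:
```
#include "mlir/IR/Module.h"
#include "llvm/Support/raw_ostream.h"

void printModuleName(mlir::ModuleOp module) {
  // getName returns an empty optional for unnamed modules, e.g. the
  // implicit top-level module.
  if (auto name = module.getName())
    llvm::outs() << "module @" << *name << "\n";
  else
    llvm::outs() << "unnamed module\n";
}
```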
PiperOrigin-RevId: 272684698
Modules are now Ops and, as such, can be nested. They do not produce an SSA
value so there is no possibility to refer to them in the IR. Introduce support
for symbol names attached to the module Op so that it can be referred to using
SymbolRefAttrs. The name is optional, for example the implicit top-level module
does not have a name.
PiperOrigin-RevId: 272671600
This makes the name of the conversion pass more consistent with the naming
scheme, since it actually converts from the Loop dialect to the Standard
dialect rather than working with arbitrary control flow operations.
PiperOrigin-RevId: 272612112
As specified in the MLIR language reference and rationale documents, `memref`
types should not be allowed to have `index` as element types. As observed in
https://groups.google.com/a/tensorflow.org/forum/#!msg/mlir/P49hVWqTMNc/nW89a4i_AgAJ
this restriction was lifted when canonicalization unit tests for affine
operations were introduced, without sufficient motivation to lift the
restriction itself. The test in question can be trivially rewritten (return
the value from a function instead of storing it to prevent DCE from removing
the producer operation) and the restriction put back in place.
If `memref<...x index>` is relevant for some use cases, the relaxation of the
type system can be implemented separately with appropriate modifications to the
documentation.
PiperOrigin-RevId: 272607043
This also adds coverage with a missing test, which uncovered a bug in the conditional for testing whether an offset is dynamic or not.
PiperOrigin-RevId: 272505798
Similar to spv.loop, spv.selection is another op for modelling
SPIR-V structured control flow. It covers both OpBranchConditional
and OpSwitch with OpSelectionMerge.
Instead of having a `spv.SelectionMerge` op to directly model
selection merge instruction for indicating the merge target,
we use regions to delimit the boundary of the selection: the
merge target is the next op following the `spv.selection` op.
This way it's easier to discover all blocks belonging to
the selection and it plays nicer with the MLIR system.
PiperOrigin-RevId: 272475006
This is a follow-up to the PR tensorflow/mlir#146 which introduced the ROCDL Dialect. This PR introduces a pass to lower GPU Dialect to the ROCDL Dialect. As with the previous PR, this one builds on the work done by @whchung, and addresses most of the review comments in the original PR.
Closes tensorflow/mlir#154
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/154 from deven-amd:deven-lower-gpu-to-rocdl 809893e08236da5ab6a38e3459692fa04247773d
PiperOrigin-RevId: 272390729
This exposes hooks for accessing internal dominance nodes, and updating the internal DFS numbers.
Closes tensorflow/mlir#151
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/151 from schweitzpgi:dominance_hooks 69d14214a423b816cbd59feffcacdd02f3b5f921
PiperOrigin-RevId: 272287352
A recent ABI compatibility change affected the conversion from standard
CallOp/CallIndirectOp to LLVM::CallOp by changing its signature. In order to
analyze the signature, the code was looking up the callee symbol in the module.
This is incorrect since, during the conversion, the module may contain both the
original and the converted function op that have the same symbol name. There is
no strict guarantee on which of the two symbols will be found by the lookup.
The conversion was not failing because the type legalizer converts the LLVM
types to themselves making the original and the converted function signatures
ultimately produce the same type.
Instead of looking up the function signature to get the list of result types,
use the types of the CallOp/CallIndirectOp results which must match those of
the function in valid IR. These types are guaranteed to be the original,
unconverted types when converting the operation. Furthermore, this avoids the
need to perform a lookup of a symbol name in the module which may be expensive.
Finally, propagate attributes as-is from the original op to the converted op
since they share the attribute name for the callee of direct calls and the rest
of attributes are not affected by the conversion. This removes the need for
additional contortions between direct and indirect calls to extract the name of
the optional callee attribute only to insert it back. This also prevents the
conversion from unintentionally dropping the other attributes of the op.
PiperOrigin-RevId: 272218871
This CL finishes the implementation of the Linalg + Affine type unification of the [strided memref RFC](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
As a consequence, the !linalg.view type, linalg::DimOp, linalg::LoadOp and linalg::StoreOp can now disappear and Linalg can use standard types everywhere.
PiperOrigin-RevId: 272187165
Perform the second reduce only with the first warp. This requires an additional __syncthreads(), but doesn't need special handling when the last warp is small. This simplifies support for block sizes that are not a multiple of 32.
Supporting partial warp reduce will be done in a separate CL.
PiperOrigin-RevId: 272168917
For the cases where there are multiple levels of nested pass managers, the parent thread ID is not enough to distinguish the parent of a given pass pipeline. Passing in the parent pass gives an exact anchor point.
PiperOrigin-RevId: 272105461
According to the SPIR-V spec:
"Length is the number of elements in the array. It must be at least 1."
Closes tensorflow/mlir#160
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/160 from denis0x0D:sandbox/array_len 0840dc0986ad0088a3aa7d5d8d3e97d489377ed9
PiperOrigin-RevId: 272094669
Add DeclareOpInterfaceFunctions to enable specifying whether OpInterfaceMethods
for an OpInterface should be generated automatically. This avoids needing to
declare the extra methods, while also allowing function declarations to be added by way of traits/inheritance.
Most of this change is mechanical/extracting classes to be reusable.
PiperOrigin-RevId: 272042739
This CL finishes the implementation of the lowering part of the [strided memref RFC](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
Strided memrefs correspond conceptually to the following templated C++ struct:
```
template <typename Elem, size_t Rank>
struct {
Elem *ptr;
int64_t offset;
int64_t sizes[Rank];
int64_t strides[Rank];
};
```
The linearization procedure for address calculation for strided memrefs is the same as for linalg views:
`base_offset + SUM_i index_i * stride_i`.
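Spelled out in code, the same formula reads (a sketch mirroring the formula, not the actual lowering):
```
#include <cstdint>
#include "llvm/ADT/ArrayRef.h"

// base_offset + SUM_i index_i * stride_i
int64_t linearIndex(int64_t offset, llvm::ArrayRef<int64_t> indices,
                    llvm::ArrayRef<int64_t> strides) {
  int64_t addr = offset;
  for (size_t i = 0, e = indices.size(); i < e; ++i)
    addr += indices[i] * strides[i];
  return addr;
}
```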
The following CL will unify Linalg and Standard by removing !linalg.view in favor of strided memrefs.
PiperOrigin-RevId: 272033399
Add operations corresponding to OpLogicalAnd, OpLogicalNot,
OpLogicalEqual, OpLogicalNotEqual and OpLogicalOr instructions in
SPIR-V dialect. This needs changes to the class hierarchy in SPIR-V
TableGen files to split SPIRVLogicalOp into SPIRVLogicalUnaryOp and
SPIRVLogicalBinaryOp. All derived classes of SPIRVLogicalOp are
updated accordingly.
Update the spirv dialect generation script to:
1) Allow specifying the base class to use for instruction spec generation
and, separately, the file name to generate the specification in.
2) Use the existing descriptions for operations.
3) Update define_inst.sh to also invoke define_opcode.sh to define
the corresponding SPIR-V instruction opcode enum.
PiperOrigin-RevId: 272014876
MemRefType::getStrides uses AffineExpr::walk, which operates in post-order from the leaves. In order to compute strides properly, it needs to escape on terminal nodes and analyze binary ops only. This did not work for an AffineExpr that consists of a single term (i.e. without a binary op).
This CL fixes the corner case and adds relevant tests.
PiperOrigin-RevId: 271975746
Use OpInterfaces to add an interface for ops defining a return type function.
This change does not use this trait in any meaningful way, I'll use it in a
follow up to generalize and unify some of the op type traits/constraints. Also,
currently the infer type function can only be manually specified in C++, that should rather be the fallback in future.
PiperOrigin-RevId: 271883746
The generated build methods have the result type before the arguments (operands and attributes, which are also now adjacent in the explicit create method). This also results in changing the create method's ordering to match most build methods' ordering.
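Illustrative shape of a generated build method after the reordering (op and parameter names are hypothetical):
```
// Result types come first, then operands, then attributes.
static void build(mlir::Builder *builder, mlir::OperationState &state,
                  mlir::Type resultType, mlir::Value *lhs, mlir::Value *rhs,
                  mlir::IntegerAttr predicate);
```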
PiperOrigin-RevId: 271755054
This is more consistent with other dump methods. Otherwise successive Value dumps are concatenated on the same line, hurting readability.
PiperOrigin-RevId: 271669846
- also remove stale terminology/references in docs
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#148
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/148 from bondhugula:cleanup e846b641a3c2936e874138aff480a23cdbf66591
PiperOrigin-RevId: 271618279
The strided MemRef RFC discusses a normalized descriptor and interaction with library calls (https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
Lowering of nested LLVM structs as value types does not play nicely with externally compiled C/C++ functions due to ABI issues.
Solving the ABI problem in general is a very complex task that most likely involves taking
a dependency on clang, which we do not want at the moment.
A simple workaround is to pass pointers to memref descriptors at function boundaries, which this CL implements.
PiperOrigin-RevId: 271591708
linalg_integration_test.mlir and simple.mlir were temporarily disabled due to an OSS-only failure.
The issue is that, once created, an llvm::Error must be explicitly checked before it can be discarded or overwritten.
This CL fixes the issue and re-enables the tests.
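For reference, the llvm::Error contract in question: an Error must be checked or consumed before it is destroyed or overwritten, e.g.:
```
#include "llvm/Support/Error.h"
#include "llvm/Support/raw_ostream.h"

void handle(llvm::Error err) {
  // Destroying an unchecked llvm::Error aborts in assert-enabled builds;
  // testing it marks it checked, and any failure must then be consumed.
  if (err)
    llvm::logAllUnhandledErrors(std::move(err), llvm::errs(), "error: ");
}
```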
PiperOrigin-RevId: 271589651
This commit introduces the ROCDL Dialect (i.e. the ROCDL ops + the code to lower those ROCDL ops to LLVM intrinsics/functions). Think of ROCDL Dialect as analogous to the NVVM Dialect, but for AMD GPUs. This patch contains just the essentials needed to get a simple example up and running. We expect to make further additions to the ROCDL Dialect.
This is the first of 3 commits, the follow-up will be:
* add a pass that lowers GPU Dialect to ROCDL Dialect
* add a "mlir-rocm-runner" utility
Closes tensorflow/mlir#146
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/146 from deven-amd:deven-rocdl-dialect e78e8005c75a78912631116c78dc844fcc4b0de9
PiperOrigin-RevId: 271511259