llvm-project

Commit Graph

Author	SHA1	Message	Date
Tres Popp	4261b026ad	Revert "[mlir] Canonicalization and folding of shape.cstr_broadcastable" This reverts commit `6aab709459`. Some users have failing builds with ShapeCanonicalization.td, so revert for now.	2020-06-06 11:17:44 +02:00
Tres Popp	12e31f6e40	Revert "[mlir] Folding and canonicalization of shape.cstr_eq" This reverts commit `0a554e607f`. Some users have build failures when building ShapeCanonicalization.td, so revert changes that created and rely on it.	2020-06-06 11:08:41 +02:00
Diego Caballero	7d59f49bda	[mlir] Fix representation of BF16 constants This patch is a follow-up on https://reviews.llvm.org/D81127 BF16 constants were represented as 64-bit floating point values due to the lack of support for BF16 in APFloat. APFloat was recently extended to support BF16 so this patch is fixing the BF16 constant representation to be 16-bit. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D81218	2020-06-05 17:43:06 -07:00
Nicolas Vasilache	b54a4d0f8f	[mlir][Linalg] NFC - Make useFullTileBuffersByDefault option take a boolean.	2020-06-05 17:44:29 -04:00
Nicolas Vasilache	b6c88549bc	[mlir] Fix spurious f64 -> f16 change in CPU runner test	2020-06-05 17:23:21 -04:00
Nicolas Vasilache	eb7db879af	[mlir][test][CPU] Reduce the size of mlir-cpu-runner-tests Two tests regularly show up on the long tail when testing MLIR. This revision reduces their size.	2020-06-05 13:47:29 -04:00
Nicolas Vasilache	b56bf30d3c	[mlir][Vector] Add folding of memref_cast into vector_transfer ops Summary: This revision adds a common folding pattern that starts appearing on vector_transfer ops. Differential Revision: https://reviews.llvm.org/D81281	2020-06-05 13:27:00 -04:00
Jacques Pienaar	b0921f68e1	[mlir] Add verify method to adaptor This allows verifying op-indepent attributes (e.g., attributes that do not require the op to have been created) before constructing an operation. These include checking whether required attributes are defined or constraints on attributes (such as I32 attribute). This is not perfect (e.g., if one had a disjunctive constraint where one part relied on the op and the other doesn't, then this would not try and extract the op independent from the op dependent). The next step is to move these out to a trait that could be verified earlier than in the generated method. The first use case is for inferring the return type while constructing the op. At that point you don't have an Operation yet and that ends up in one having to duplicate the same checks, e.g., verify that attribute A is defined before querying A in shape function which requires that duplication. Instead this allows one to invoke a method to verify all the traits and, if this is checked first during verification, then all other traits could use attributes knowing they have been verified. It is a little bit funny to have these on the adaptor, but I see the adaptor as a place to collect information about the op before the op is constructed (e.g., avoiding stringly typed accessors, verifying what is possible to verify before the op is constructed) while being cheap to use even with constructed op (so layer of indirection between the op constructed/being constructed). And from that point of view it made sense to me. Differential Revision: https://reviews.llvm.org/D80842	2020-06-05 09:47:37 -07:00
Julian Lettner	99d6e05e71	[lit] Improve naming of test result categories Improve consistency when printing test results: Previously we were using different labels for group names (the header for the list of, e.g., failing tests) and summary count lines. For example, "Failing Tests"/"Unexpected Failures". This commit changes lit to label things consistently. Improve wording of labels: When talking about individual test results, the first word in "Unexpected Failures", "Expected Passes", and "Individual Timeouts" is superfluous. Some labels contain the word "Tests" and some don't. Let's simplify the names. Before: ``` Failing Tests (1): ... Expected Passes : 3 Unexpected Failures: 1 ``` After: ``` Failed Tests (1): ... Passed: 3 Failed: 1 ``` Reviewed By: ldionne Differential Revision: https://reviews.llvm.org/D77708	2020-06-05 08:14:42 -07:00
Wen-Heng (Jack) Chung	2fd6403a6d	[mlir][gpu] Introduce mlir-rocm-runner. Summary: `mlir-rocm-runner` is introduced in this commit to execute GPU modules on ROCm platform. A small wrapper to encapsulate ROCm's HIP runtime API is also inside the commit. Due to behavior of ROCm, raw pointers inside memrefs passed to `gpu.launch` must be modified on the host side to properly capture the pointer values addressable on the GPU. LLVM MC is used to assemble AMD GCN ISA coming out from `ConvertGPUKernelToBlobPass` to binary form, and LLD is used to produce a shared ELF object which could be loaded by ROCm HIP runtime. gfx900 is the default target be used right now, although it could be altered via an option in `mlir-rocm-runner`. Future revisions may consider using ROCm Agent Enumerator to detect the right target on the system. Notice AMDGPU Code Object V2 is used in this revision. Future enhancements may upgrade to AMDGPU Code Object V3. Bitcode libraries in ROCm-Device-Libs, which implements math routines exposed in `rocdl` dialect are not yet linked, and is left as a TODO in the logic. Reviewers: herhut Subscribers: mgorny, tpr, dexonsmith, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #mlir, #llvm Differential Revision: https://reviews.llvm.org/D80676	2020-06-05 09:46:39 -05:00
HazemAbdelhafez	cc2349e3cf	[MLIR][SPIRV] Support flat, location, and noperspective decorations Add support for flat, location, and noperspective decorations in the serializer and deserializer to be able to process basic shader files for graphics applications. Differential Revision: https://reviews.llvm.org/D80837	2020-06-05 08:55:22 -04:00
Nicolas Vasilache	247e185dd5	[mlir][Vector] Move temporary alloc to top of the function alloca when lowering vector_transfers Recently introduced allocation hoisting is quite conservative on the cases when it triggers. This revision makes it such that the allocations for vector transfer lowerings are hoisted to the top of the function. This should be revisited in the context of parallelism and is a temporary workaround. Differential Revision: https://reviews.llvm.org/D81253	2020-06-05 08:45:52 -04:00
Nicolas Vasilache	6953cf6502	[mlir][Linalg] Add a hoistRedundantVectorTransfers helper function This revision adds a helper function to hoist vector.transfer_read / vector.transfer_write pairs out of immediately enclosing scf::ForOp iteratively, if the following conditions are true: 1. The 2 ops access the same memref with the same indices. 2. All operands are invariant under the enclosing scf::ForOp. 3. No uses of the memref either dominate the transfer_read or are dominated by the transfer_write (i.e. no aliasing between the write and the read across the loop) To improve hoisting opportunities, call the `moveLoopInvariantCode` helper function on the candidate loop above which to hoist. Hoisting the transfers results in scf::ForOp yielding the value that originally transited through memory. This revision additionally exposes `moveLoopInvariantCode` as a helper in LoopUtils.h and updates SliceAnalysis to support return scf::For values and allow hoisting across multiple scf::ForOps. Differential Revision: https://reviews.llvm.org/D81199	2020-06-05 06:50:24 -04:00
Alexander Belyaev	04fb2b6123	[Mlir] Implement printer, parser, verifier and builder for shape.reduce. Differential Revision: https://reviews.llvm.org/D81186	2020-06-05 11:25:32 +02:00
Tres Popp	655e08ceeb	[mlir] Canonicalization of shape.assuming Summary: This will inline the region to a shape.assuming in the case that the input witness is found to be statically true. Differential Revision: https://reviews.llvm.org/D80302	2020-06-05 11:00:20 +02:00
Tres Popp	0a554e607f	[mlir] Folding and canonicalization of shape.cstr_eq In the case of all inputs being constant and equal, cstr_eq will be replaced with a true_witness. Differential Revision: https://reviews.llvm.org/D80303	2020-06-05 11:00:20 +02:00
Tres Popp	6aab709459	[mlir] Canonicalization and folding of shape.cstr_broadcastable This allows replacing of this op with a true witness in the case of both inputs being const_shapes and being found to be broadcastable. Differential Revision: https://reviews.llvm.org/D80304	2020-06-05 11:00:19 +02:00
Tres Popp	4a255bbd29	[mlir] Add folding for shape.any If any input to shape.any is a const_shape, shape.any can be replaced with that input. Differential Revision: https://reviews.llvm.org/D80305	2020-06-05 11:00:19 +02:00
Tres Popp	6b3a5bff93	[mlir] Folding of shape.assuming_all This allows assuming_all to be replaced when all inputs are known to be statically passing witnesses. Differential Revision: https://reviews.llvm.org/D80306	2020-06-05 11:00:19 +02:00
Tres Popp	1c3e38d98c	[mlir] Add a shape op that returns a constant witness This will later be used during canonicalization and folding steps to replace statically known passing constraints. Differential Revision: https://reviews.llvm.org/D80307	2020-06-05 11:00:19 +02:00
Alexander Belyaev	5a675f0552	[Mlir] Add assembly format for `shape.mul`. Differential Revision: https://reviews.llvm.org/D81194	2020-06-05 10:55:54 +02:00
Uday Bondhugula	0f6999af88	[MLIR] Update linalg.conv lowering to use affine load in the absence of padding Update linalg to affine lowering for convop to use affine load for input whenever there is no padding. It had always been using std.loads because max in index functions (needed for non-zero padding if not materializing zeros) couldn't be represented in the non-zero padding cases. In the future, the non-zero padding case could also be made to use affine - either by materializing or using affine.execute_region. The latter approach will not impact the scf/std output obtained after lowering out affine. Differential Revision: https://reviews.llvm.org/D81191	2020-06-05 12:28:30 +05:30
River Riddle	c0cd1f1c5c	[mlir] Refactor BoolAttr to be a special case of IntegerAttr This simplifies a lot of handling of BoolAttr/IntegerAttr. For example, a lot of places currently have to handle both IntegerAttr and BoolAttr. In other places, a decision is made to pick one which can lead to surprising results for users. For example, DenseElementsAttr currently uses BoolAttr for i1 even if the user initialized it with an Array of i1 IntegerAttrs. Differential Revision: https://reviews.llvm.org/D81047	2020-06-04 16:41:24 -07:00
Nicolas Vasilache	3463d9835b	[mlir][Linalg] Add a hoistViewAllocOps helper function This revision adds a helper function to hoist alloc/dealloc pairs and alloca op out of immediately enclosing scf::ForOp if both conditions are true: 1. all operands are defined outside the loop. 2. all uses are ViewLikeOp or DeallocOp. This is now considered Linalg-specific and will be generalized on a per-need basis. Differential Revision: https://reviews.llvm.org/D81152	2020-06-04 18:59:03 -04:00
Diego Caballero	5c990d6994	[mlir] Add support for bf16 to StandardToLLVM conversion Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D81127	2020-06-04 14:36:36 -07:00
Thomas Raoux	661235e126	[mlir][gpu] Add subgroup Id/Size/Num to GPU dialect Add SubgroupId, SubgroupSize and NumSubgroups to GPU dialect ops and add the lowering of those ops to SPIRV. Differential Revision: https://reviews.llvm.org/D81042	2020-06-04 10:52:40 -07:00
Hanhan Wang	0b025d2733	[mlir][StandardToSPIRV] Handle i1 case for lowering std.zexti to SPIR-V. Differential Revision: https://reviews.llvm.org/D80965	2020-06-03 15:01:18 -07:00
Hanhan Wang	27fca57546	[mlir][Linalg] Add support for fusion between indexed_generic ops and tensor_reshape ops Summary: The fusion for tensor_reshape is embedding the information to indexing maps, thus the exising pattenr also works for indexed_generic ops. Depends On D80347 Differential Revision: https://reviews.llvm.org/D80348	2020-06-03 14:59:47 -07:00
Hanhan Wang	cc11ceda16	[mlir][Linalg] Add support for fusion between indexed_generic ops and generic ops on tensors. Summary: Different from the fusion between generic ops, indices are involved. In this context, we need to re-map the indices for producer since the fused op is built on consumer's perspective. This patch supports all combination of the fusion between indexed_generic ops and generic ops, which includes tests case: 1) generic op as producer and indexed_generic op as consumer. 2) indexed_generic op as producer and generic op as consumer. 3) indexed_generic op as producer and indexed_generic op as consumer. Differential Revision: https://reviews.llvm.org/D80347	2020-06-03 14:58:43 -07:00
aartbik	6391da98f4	[mlir] [VectorOps] Use 'vector.flat_transpose' for 2-D 'vector.tranpose' Summary: Progressive lowering of vector.transpose into an operation that is closer to an intrinsic, and thus the hardware ISA. Currently under the common vector transform testing flag, as we prepare deploying this transformation in the LLVM lowering pipeline. Reviewers: nicolasvasilache, reidtatge, andydavis1, ftynse Reviewed By: nicolasvasilache, ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm, #mlir Differential Revision: https://reviews.llvm.org/D80772	2020-06-03 14:55:50 -07:00
Frederik Gossen	3713314bfa	[MLIR] Shape to standard dialect lowering Add a new pass to lower operations from the `shape` to the `std` dialect. The conversion applies only to the `size_to_index` and `index_to_size` operations and affected types. Other patterns will be added as needed. Differential Revision: https://reviews.llvm.org/D81091	2020-06-03 16:17:03 +00:00
Nicolas Vasilache	e349fb70a2	[mlir][Linalg] NFC - Make markers use Identifier instead of StringRef Summary: This removes string ownership worries by putting everything into the context and allows more constructing identifiers programmatically. Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul Tags: #mlir Differential Revision: https://reviews.llvm.org/D81027	2020-06-03 05:52:32 -04:00
Diego Caballero	8a418e5f8e	[mlir][Affine] Enable fusion of loops with vector loads/stores This patch enables affine loop fusion for loops with affine vector loads and stores. For that, we only had to use affine memory op interfaces in LoopFusionUtils.cpp and Utils.cpp so that vector loads and stores are also taken into account. Reviewed By: andydavis1, ftynse Differential Revision: https://reviews.llvm.org/D80971	2020-06-03 01:26:22 +03:00
HazemAbdelhafez	915e55c910	[mlir][spirv] Add support for matrix type This commit adds basic matrix type support to the SPIR-V dialect including type definition, IR assembly, parsing, printing, and (de)serialization. Differential Revision: https://reviews.llvm.org/D80594	2020-06-02 16:30:58 -04:00
Alex Zinenko	5c5dafc534	[mlir] support materialization for 1-1 type conversions Dialect conversion infrastructure supports 1->N type conversions by requiring individual conversions to provide facilities to generate operations retrofitting N values into 1 of the original type when N > 1. This functionality can also be used to materialize explicit "cast"-like operations, but it did not support 1->1 type conversions until now. Modify TypeConverter to support materialization of cast operations for 1-1 conversions. This also makes materialization specification more extensible following the same pattern as type conversions. Instead of overloading a virtual function, users or subclasses of TypeConversion can now register type-specific materialization callbacks that will be called in order for the given type. Differential Revision: https://reviews.llvm.org/D79729	2020-06-02 13:48:33 +02:00
Ehsan Toosi	3f6a35e3ff	[mlir] Introduce CallOp converter for buffer placement Add BufferAssignmentCallOpConverter as a pattern rewriter for Buffer Placement. It matches the signature of the caller operation with the callee after rewriting the callee with FunctionAndBlockSignatureConverter. Differential Revision: https://reviews.llvm.org/D80785	2020-06-02 11:35:24 +02:00
MaheshRavishankar	2bcd1927dd	[mlir][SCFToGPU] Remove conversions from scf.for to gpu.launch. Keeping in the affine.for to gpu.launch conversions, which should probably be the affine.parallel to gpu.launch conversion as well. Differential Revision: https://reviews.llvm.org/D80747	2020-06-01 23:06:20 -07:00
Thomas Raoux	c652c306a6	[mlir][spirv] Clean up coop matrix assembly declaration. Address code review feedback and use declarative assembly format. Differential Revision: https://reviews.llvm.org/D80687	2020-05-29 16:37:35 -07:00
Nicolas Vasilache	9534192c3b	[mlir][Linalg] Make contraction vectorization use vector transfers This revision replaces the load + vector.type_cast by appropriate vector transfer operations. These play more nicely with other vector abstractions and canonicalization patterns and lower to load/store with or without masks when appropriate. Differential Revision: https://reviews.llvm.org/D80809	2020-05-29 15:03:46 -04:00
Anchu Rajendran	dbb5979d15	[MLIR][OpenMP] Defined master operation in OpenMP Dialect Summary: Implemented the basic changes for defining master operation in OpenMP. It uses the generic parser and printer. Reviewed By: kiranchandramohan, ftynse Differential Revision: https://reviews.llvm.org/D80689	2020-05-29 22:46:02 +05:30
Nicolas Vasilache	1ee114322c	[mlir][Linalg][Vector] Add forwarding patterns between linalg.copy and vector.transfer This revision adds custom rewrites for patterns that arise during linalg structured ops vectorization. These patterns allow the composition of linalg promotion, vectorization and removal of redundant copies. The patterns are voluntarily limited and restrictive atm. More robust behavior will be implemented once more powerful side effect modeling and analyses are available on view/subview. On the transfer_read side, the following pattern is rewritten: ``` %alloc = ... [optional] %view = std.view %alloc ... %subView = subview %allocOrView ... [optional] linalg.fill(%allocOrView, %cst) ... ... linalg.copy(%in, %subView) ... vector.transfer_read %allocOrView[...], %cst ... ``` into ``` [unchanged] %alloc = ... [unchanged] [optional] %view = std.view %alloc ... [unchanged] [unchanged] %subView = subview %allocOrView ... ... vector.transfer_read %in[...], %cst ... ``` On the transfer_write side, the following pattern is rewriten: ``` %alloc = ... [optional] %view = std.view %alloc ... %subView = subview %allocOrView... ... vector.transfer_write %..., %allocOrView[...] linalg.copy(%subView, %out) ``` Differential Revision: https://reviews.llvm.org/D80728	2020-05-29 08:08:34 -04:00
Nicolas Vasilache	aa93659c9f	[mlir][SCF] Add utility to clone an scf.ForOp while appending new yield values. This utility factors out the machinery required to add iterArgs and yield values to an scf.ForOp. Differential Revision: https://reviews.llvm.org/D80656	2020-05-29 07:28:17 -04:00
Ehsan Toosi	7a3a253585	[MLIR][BufferPlacement] Support functions that return Memref typed results Buffer placement can now operates on functions that return buffers. These buffers escape from the deallocation phase of buffer placement. Differential Revision: https://reviews.llvm.org/D80696	2020-05-29 11:03:22 +02:00
Marius Brehler	b0b2507717	[mlir] Add test to check if standalone dialect is registered Summary: Add a test to check if the standalone dialect is registered within standalone-opt. Similar to the mlir-opt commandline.mlir test. Reviewers: Kayjukh, stephenneuendorffer Reviewed By: Kayjukh Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80764	2020-05-29 00:34:34 +02:00
Nicolas Vasilache	5f9e0466f2	[mlir][Vector] Fix vector.transfer alignment calculation https://reviews.llvm.org/D79246 introduces alignment propagation for vector transfer operations. Unfortunately, the alignment calculation is incorrect and can result in crashes. This revision fixes the calculation by using the natural alignment of the memref elemental type, instead of the resulting vector type. If more alignment is desired, it can be done in 2 ways: 1. use a proper vector.type_cast to transform a memref<axbxcxdxf32> into a memref<axbxvector<cxdxf32>> giving a natural alignment of vector<cxdxf32> 2. add an alignment attribute to vector transfer operations and propagate it. With this change the alignment in the relevant tests goes down from 128 to 4. Lastly, a few minor cleanups are performed and the custom `isMinorIdentityMap` is deprecated. Differential Revision: https://reviews.llvm.org/D80734	2020-05-28 17:58:51 -04:00
Marius Brehler	3bff62d45f	[mlir] Extend standalone example by standalone-translate Extend the standalone by standalone-translate, based on mlir-translate. Differential Revision: https://reviews.llvm.org/D80737	2020-05-28 14:07:55 -07:00
MaheshRavishankar	2b0c8546ac	[mlir][Linalg] Add pass to remove unit-extent dims from tensor operands of Generic ops. Unit-extent dimensions are typically used for achieving broadcasting behavior. The pattern added (along with canonicalization patterns added previously) removes the use of unit-extent dimensions, and instead uses a more canonical representation of the computation. This new pattern is not added as a canonicalization for now since it entails adding additional reshape operations. A pass is added to exercise these patterns, along with an API entry to populate a patterns list with these patterns. Differential Revision: https://reviews.llvm.org/D79766	2020-05-28 11:06:47 -07:00
Alex Zinenko	72ede60b75	[mlir][GPU] Link relevant LLVM components in GPUCommon instead of test D80142 restructured MLIR-to-GPU-binary conversion to support multiple targets. It also modified cmake files to link relevant LLVM components in test/lib, which broke shared-library builds, and likely made the conversions unusable outside mlir-opt (or other tools that link in test library targets). Link these components to GPUCommon instead. Differential Revision: https://reviews.llvm.org/D80739	2020-05-28 20:01:54 +02:00
Jacques Pienaar	fefe4366c3	[mlir] Use ValueRange instead of ArrayRef<Value> This allows constructing operand adaptor from existing op (useful for commonalizing verification as I want to do in a follow up). I also add ability to use member initializers for the generated adaptor constructors for convenience. Differential Revision: https://reviews.llvm.org/D80667	2020-05-28 09:05:24 -07:00
Wen-Heng (Jack) Chung	061fb8eb2d	[mlir][gpu][mlir-cuda-runner] Refactor ConvertKernelFuncToCubin to be generic. Make ConvertKernelFuncToCubin pass to be generic: - Rename to ConvertKernelFuncToBlob. - Allow specifying triple, target chip, target features. - Initializing LLVM backend is supplied by a callback function. - Lowering process from MLIR module to LLVM module is via another callback. - Change mlir-cuda-runner to adopt the revised pass. - Add new tests for lowering to ROCm HSA code object (HSACO). - Tests for CUDA and ROCm are kept in separate directories. Differential Revision: https://reviews.llvm.org/D80142	2020-05-28 09:08:28 -05:00

1 2 3 4 5 ...

2228 Commits