llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	9235e597a4	[mlir][bufferize] Fix missing copies when writing to a buffer in a loop Writes into tensors that are definied outside of a repetitive region, but with the write happening inside of the repetitive region were previously not considered conflicts. This was incorrect. E.g.: ``` %0 = ... : tensor<?xf32> scf.for ... { "reading_op"(%0) : tensor<?xf32> %1 = "writing_op"(%0) : tensor<?xf32> -> tensor<?xf32> ... } ``` In the above example, "writing_op" should be out-of-place. This commit fixes the bufferization for any op that declares its repetitive semantics via RegionBranchOpInterface.	2022-04-20 18:51:06 +09:00
jacquesguan	61baf2ffa7	[mlir][Vector] Add check of supported reduction kind for ScanOp. This patch adds check of supported reduction kind for ScanOp to avoid using and/or/xor for floating point type. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123977	2022-04-20 02:42:19 +00:00
Mehdi Amini	8608ed1441	Apply clang-tidy fixes for llvm-twine-local in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-20 00:39:10 +00:00
John Demme	6b0bed7ea5	[MLIR] [Python] Add a method to clear live operations map Introduce a method on PyMlirContext (and plumb it through to Python) to invalidate all of the operations in the live operations map and clear it. Since Python has no notion of private data, an end-developer could reach into some 3rd party API which uses the MLIR Python API (that is behaving correctly with regard to holding references) and grab a reference to an MLIR Python Operation, preventing it from being deconstructed out of the live operations map. This allows the API developer to clear the map when it calls C++ code which could delete operations, protecting itself from its users. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123895	2022-04-19 15:14:09 -07:00
Krzysztof Drewniak	ddc2eb0ada	[mlir] Adds getUpperBound() to LoopLikeInterface. getUpperBound is analogous to getLowerBound(), except for the upper bound, and is used in range analysis. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124020	2022-04-19 19:56:44 +00:00
Alex Zinenko	0eb403ad1b	[mlir][transform] Introduce transform.sequence op Sequence is an important transform combination primitive that just indicates transform ops being applied in a row. The simplest version requires fails immediately if any transformation in the sequence fails. Introducing this operation allows one to start placing transform IR within other IR. Depends On D123135 Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123664	2022-04-19 21:41:02 +02:00
Ashay Rane	25c218be36	[MLIR] Add function to create BFloat16 array attribute This patch adds a new function `mlirDenseElementsAttrBFloat16Get()`, which accepts the shaped type, the number of BFloat16 values, and a pointer to an array of BFloat16 values, each of which is a `uint16_t` value. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D123981	2022-04-19 19:27:06 +00:00
Mehdi Amini	83892d76f4	Print custom assembly on pass failure by default The printer is now resilient to invalid IR and will already automatically fallback to the generic form on invalid IR. Using the generic printer on pass failure was a conservative option before the printer was made failsafe. Reviewed By: lattner, rriddle, jpienaar, bondhugula Differential Revision: https://reviews.llvm.org/D123915	2022-04-19 17:29:08 +00:00
Mehdi Amini	2d6335421f	Apply clang-tidy fixes for llvm-qualified-auto in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-19 17:20:57 +00:00
Mehdi Amini	f9735be7e2	Apply clang-tidy fixes for performance-unnecessary-value-param in ControlFlowInterfaces.cpp (NFC)	2022-04-19 17:20:57 +00:00
Arnab Dutta	12f55cac69	[MLIR][GPU] Add canonicalizer for gpu.memcpy Fold away gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D121279	2022-04-19 17:54:00 +05:30
Marius Brehler	2ba865903d	[mlir][emitc] Add test for invalid type Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D123503	2022-04-19 11:03:56 +02:00
Matthias Springer	a3005a406e	[mlir][interfaces] Fix infinite loop in insideMutuallyExclusiveRegions This function was missing a termination condition.	2022-04-19 16:28:52 +09:00
Mehdi Amini	4e01184ad5	Apply clang-tidy fixes for performance-unnecessary-value-param in JitRunner.cpp (NFC)	2022-04-19 07:23:12 +00:00
Mehdi Amini	722a3a58e2	Apply clang-tidy fixes for performance-for-range-copy in MemRefOps.cpp (NFC)	2022-04-19 07:23:12 +00:00
Matthias Springer	0f4ba02db3	[mlir][interfaces] Add helpers for detecting recursive regions Add helper functions to check if an op may be executed multiple times based on RegionBranchOpInterface. Differential Revision: https://reviews.llvm.org/D123789	2022-04-19 16:13:32 +09:00
Michael Kruse	2d92ee97f1	Reapply "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `af0285122f`. The test "libomp::loop_dispatch.c" on builder openmp-gcc-x86_64-linux-debian fails from time-to-time. See #54969. This patch is unrelated.	2022-04-18 21:56:47 -05:00
Mehdi Amini	1881d6fc80	Apply clang-tidy fixes for performance-unnecessary-copy-initialization in MathOps.cpp (NFC)	2022-04-19 00:47:58 +00:00
Mehdi Amini	6120bd4703	Apply clang-tidy fixes for performance-for-range-copy in ElementwiseOpFusion.cpp (NFC)	2022-04-19 00:47:58 +00:00
Groverkss	15650b320b	[MLIR][Presburger] Remove inheritence in MultiAffineFunction This patch removes inheritence of MultiAffineFunction from IntegerPolyhedron and instead makes IntegerPolyhedron as a member. This patch removes virtualization in MultiAffineFunction and also removes unnecessary functions inherited from IntegerPolyhedron. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123921	2022-04-19 01:25:13 +05:30
Michael Kruse	af0285122f	Revert "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `9ec501da76`. It may have caused the openmp-gcc-x86_64-linux-debian buildbot to fail. https://lab.llvm.org/buildbot/#/builders/4/builds/20377	2022-04-18 14:38:31 -05:00
Michael Kruse	9ec501da76	[OpenMP] Refactor OMPScheduleType enum. The OMPScheduleType enum stores the constants from libomp's internal sched_type in kmp.h and are used by several kmp API functions. The enum values have an internal structure, namely each scheduling algorithm (e.g.) exists in four variants: unordered, orderend, normerge unordered, and nomerge ordered. This patch (basically a followup to D114940) splits the "ordered" and "nomerge" bits into separate flags, as was already done for the "monotonic" and "nonmonotonic", so we can apply bit flags operations on them. It also now contains all possible combinations according to kmp's sched_type. Deriving of the OMPScheduleType enum from clause parameters has been moved form MLIR's OpenMPToLLVMIRTranslation.cpp to OpenMPIRBuilder to make available for clang as well. Since the primary purpose of the flag is the binary interface to libomp, it has been made more private to LLVMFrontend. The primary interface for generating worksharing-loop using OpenMPIRBuilder code becomes `applyWorkshareLoop` which derives the OMPScheduleType automatically and calls the appropriate emitter function. While this is mostly a NFC refactor, it still applies the following functional changes: * The logic from OpenMPToLLVMIRTranslation to derive the OMPScheduleType also applies to clang. Most notably, it now applies the nonmonotonic flag for non-static schedules by default. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was previously not applied if the simd modifier was used. I assume this was a bug, since the effect was due to `loop.schedule_modifier()` returning `mlir::omp::ScheduleModifier::none` instead of `llvm::Optional::None`. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was set even if ordered was specified, in breach to what the comment before citing the OpenMP specification says. I assume this was an oversight. The ordered flag with parameter was not considered in this patch. Changes will need to be made (e.g. adding/modifying function parameters) when support for it is added. The lengthy names of the enum values can be discussed, for the moment this is avoiding reusing previously existing enum value names such as `StaticChunked` to avoid confusion. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D123403	2022-04-18 14:03:17 -05:00
River Riddle	58ceae9561	[mlir:NFC] Remove the forward declaration of FuncOp in the mlir namespace FuncOp has been moved to the `func` namespace for a little over a month, the using directive can be dropped now.	2022-04-18 12:01:55 -07:00
Mehdi Amini	21b251624b	Apply clang-tidy fixes for readability-identifier-naming in GPUDialect.cpp (NFC)	2022-04-18 18:15:30 +00:00
Mehdi Amini	c817628b67	Apply clang-tidy fixes for readability-simplify-boolean-expr in TypeConverter.cpp (NFC)	2022-04-18 18:15:30 +00:00
Jacques Pienaar	f4085c57dd	[mlir] Fix two AttributeParser aborts Reproducers that resulted in triggering the following asserts mlir::NamedAttribute::NamedAttribute(mlir::StringAttr, mlir::Attribute) mlir/lib/IR/Attributes.cpp:29:3 consumeToken mlir/lib/Parser/Parser.h:126 Differential Revision: https://reviews.llvm.org/D122240	2022-04-18 09:30:35 -07:00
Groverkss	4ffd0b6fde	[MLIR][Presburger] Make IntegerRelation::mergeLocalIds not delete duplicates This patch modifies mergeLocalIds to not delete duplicate local ids in `this` relation. This allows the ordering of the final local ids for `this` to be determined more easily, which is generally required when other objects refer to these local ids. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D123866	2022-04-18 10:15:35 +05:30
jacquesguan	5479044bfc	[mlir][Vector] Fold transpose splat to splat with transposed type. This revision folds transpose splat to a new splat with the transposed vector type. For a splat, there is no need to actually do transpose for it, it would be more effective to just build a new splat as the result. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123765	2022-04-18 03:00:17 +00:00
Mehdi Amini	d98481a1e7	Revert "[MLIR] Provide a way to print ops in custom form on pass failure" This reverts commit `daabcf5f04`. This patch still had on-going discussion that should be closed before committing.	2022-04-17 18:55:09 +00:00
Uday Bondhugula	daabcf5f04	[MLIR] Provide a way to print ops in custom form on pass failure The generic form of the op is too verbose and in some cases not readable. On pass failure, ops have been so far printed in generic form to provide a (stronger) guarantee that the IR print succeeds. However, in a large number of pass failure cases, the IR is still valid and the custom printers for the ops will succeed. In fact, readability is highly desirable post pass failure. This revision provides an option to print ops in their custom/pretty-printed form on IR failure -- this option is unsafe and there is no guarantee it will succeed. It's disabled by default and can be turned on only if needed. Differential Revision: https://reviews.llvm.org/D123893	2022-04-17 20:10:40 +05:30
Valentin Clement	bd514967aa	[mlir][CSE] Add ability to remove commutative operations This patch takes advantage of the Commutative trait on operation to remove identical commutative operations where the operands are swapped. The second operation below can be removed since `arith.addi` is commutative. ``` %1 = arith.addi %a, %b : i32 %2 = arith.addi %b, %a : i32 ``` Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123492	2022-04-16 21:09:47 +02:00
River Riddle	0f304ef017	[mlir] Add asserts when changing various MLIRContext configurations This helps to prevent tsan failures when users inadvertantly mutate the context in a non-safe way. Differential Revision: https://reviews.llvm.org/D112021	2022-04-15 21:49:03 -07:00
Mogball	b1d3faab7a	[NFC] fix cmake build	2022-04-16 00:56:38 +00:00
Mogball	fa26c7ff4b	[mlir] Refactor LICM into a utility LICM is refactored into a utility that is application on any region. The implementation is moved to Transform/Utils.	2022-04-16 00:37:07 +00:00
Stella Stamenova	353f0a8e43	Revert "[mlir] Refactor LICM into a utility" This reverts commit `3131f80824`. This commit broke the Windows mlir bot: https://lab.llvm.org/buildbot/#/builders/13/builds/19745	2022-04-15 17:09:16 -07:00
Mogball	3131f80824	[mlir] Refactor LICM into a utility LICM is refactored into a utility that is application on any region. The implementation is moved to Transform/Utils.	2022-04-15 22:07:01 +00:00
Arjun P	ef8b2a7cea	[MLIR][Presburger] addSymbolicCut: fix the integral symbols heuristic to match the docs Previously this checked if the entire symbolic numerator was divisible by the denominator, which is never the case when this function is called. Fixed this to check only the non-const coefficients in the numerator, which was what was intended and documented. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123592	2022-04-15 20:34:06 +01:00
Thomas Raoux	b4bcef05b7	[mlir][vector] Fix bug in extractFromBroadcast folding extract was incorrectly folded when the source was coming from a broadcast that was both adding new rank and broadcasting the inner dimension. Differential Revision: https://reviews.llvm.org/D123867	2022-04-15 19:21:45 +00:00
Arjun P	69c1a35488	[MLIR][Presburger][Simplex] moveRowUnknownToColumn: support the row sample value being zero When the sample value is zero, everything is the same except that failure to pivot does not imply emptiness. So, leave it to the user to mark as empty if necessary, if they know the sample value is strictly negative. This is needed for an upcoming symbolic lexmin heuristic. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123604	2022-04-15 20:15:21 +01:00
William S. Moses	9a8bb4bc63	[NFC] Update comments	2022-04-15 14:33:13 -04:00
Mogball	3430ae1e7b	[mlir] Update LICM to support Graph Regions Changes the algorithm of LICM to support graph regions (no guarantee of topologically sorted order). Also fixes an issue where ops with recursive side effects and regions would not be hoisted if any nested ops used operands that were defined within the nested region. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122465	2022-04-15 17:30:27 +00:00
William S. Moses	ed499ddcda	[MLIR] Fix operation clone Operation clone is currently faulty. Suppose you have a block like as follows: ``` (%x0 : i32) { %x1 = f(%x0) return %x1 } ``` The test case we have is that we want to "unroll" this, in which we want to change this to compute `f(f(x0))` instead of just `f(x0)`. We do so by making a copy of the body at the end of the block and set the uses of the argument in the copy operations with the value returned from the original block. This is implemented as follows: 1) map to the block arguments to the returned value (`map[x0] = x1`). 2) clone the body Now for this small example, this works as intended and we get the following. ``` (%x0 : i32) { %x1 = f(%x0) %x2 = f(%x1) return %x2 } ``` This is because the current logic to clone `x1 = f(x0)` first looks up the arguments in the map (which finds `x0` maps to `x1` from the initialization), and then sets the map of the result to the cloned result (`map[x1] = x2`). However, this fails if `x0` is not an argument to the op, but instead used inside the region, like below. ``` (%x0 : i32) { %x1 = f() { yield %x0 } return %x1 } ``` This is because cloning an op currently first looks up the args (none), sets the map of the result (`map[%x1] = %x2`), and then clones the regions. This results in the following, which is clearly illegal: ``` (%x0 : i32) { %x1 = f() { yield %x0 } %x2 = f() { yield %x2 } return %x2 } ``` Diving deeper, this is partially due to the ordering (how this PR fixes it), as well as how region cloning works. Namely it will first clone with the mapping, and then it will remap all operands. Since the ordering above now has a map of `x0 -> x1` and `x1 -> x2`, we end up with the incorrect behavior here. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122531	2022-04-15 13:09:13 -04:00
Alex Zinenko	6c5ae8e974	[mlir] Support opaque types in LLVM IR -> MLIR translation LLVM IR is moving towards adoption of opaque pointer types. These require extra information to be passed when constructing some operations, in particular GEP and Alloca. Adapt the builders of said operations and modify the translation code to handle both opaque and non-opaque pointers. This incidentally adds the translation for Alloca alignment and fixes the translation of struct-related GEP indices that must be constant. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123792	2022-04-15 17:51:31 +02:00
Lei Zhang	4db65e279b	[mlir][vector] Reorder elementwise(transpose) Similar to the existing pattern for reodering cast(transpose), this makes transpose following transpose and increases the chance of embedding the transposition inside contraction op. Actually cast ops are just special instances of elementwise ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123596	2022-04-15 09:05:35 -04:00
Mehdi Amini	7a80912dd8	Apply clang-tidy fixes for modernize-use-default-member-init in ControlFlowSinkUtils.cpp (NFC)	2022-04-15 08:01:04 +00:00
Chia-hung Duan	5232c5c5d4	[mlir] Fix verification order of nested ops. In order to increase parallism, certain ops with regions and have the IsIsolatedFromAbove trait will have their verification delayed. That means the region verifier may access the invalid ops and may lead to a crash. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122771	2022-04-15 04:41:10 +00:00
Thomas Raoux	59058c441a	[mlir][vector] Add operations used for Vector distribution Add vector op warp_execute_on_lane_0 that will be used to do incremental vector distribution in order to target warp level vector programming for architectures with GPU-like SIMT programming model. The idea behing the op is discussed further on discourse: https://discourse.llvm.org/t/vector-vector-distribution-large-vector-to-small-vector/1983/23 Differential Revision: https://reviews.llvm.org/D123703	2022-04-15 03:47:52 +00:00
Jacques Pienaar	35bd41916b	[mlir] Add assert to fail with more info (NFC) This would have assert before during tensor type construction with opaque error, assert and fail earlier now.	2022-04-14 19:54:12 -07:00
Thomas Raoux	894a591cf6	[mlir][nvgpu] Move mma.sync and ldmatrix in nvgpu dialect Move gpu operation mma.sync and ldmatrix in nvgpu as they are specific to nvidia target. Differential Revision: https://reviews.llvm.org/D123824	2022-04-14 23:44:52 +00:00
Mehdi Amini	72e2b4e7a7	Apply clang-tidy fixes for modernize-use-default-member-init in PDLLServer.cpp (NFC)	2022-04-14 22:59:14 +00:00
Mehdi Amini	db6796dfa8	Apply clang-tidy fixes for modernize-use-default-member-init in SparseTensorUtils.cpp (NFC)	2022-04-14 22:59:14 +00:00
Thomas Raoux	4c564940a1	[mlir][nvgpu] Add NVGPU dialect (architectural specific gpu dialect) This introduce a new dialect for vendro specific ptx operations. This also adds the first operation ldmatrix as an example. More operations will be added in follow up patches. This new dialect is meant to be a bridge between GPU and Vector dialectis and NVVM dialect. This is based on the RFC proposed here: https://discourse.llvm.org/t/rfc-add-nv-gpu-dialect-hw-specific-extension-of-gpu-dialect-for-nvidia-gpus/61466/8 Differential Revision: https://reviews.llvm.org/D123266	2022-04-14 16:33:46 +00:00
Lei Zhang	e54236dfb5	[mlir][vector] Cast away leading one dims for insert ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123621	2022-04-14 08:57:32 -04:00
Lei Zhang	bc408afbfe	[mlir][vector] Fold splat constant transpose Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123595	2022-04-14 08:51:25 -04:00
Alex Zinenko	d064c4801c	[mlir] Introduce Transform dialect This dialect provides operations that can be used to control transformation of the IR using a different portion of the IR. It refers to the IR being transformed as payload IR, and to the IR guiding the transformation as transform IR. The main use case for this dialect is orchestrating fine-grain transformations on individual operations or sets thereof. For example, it may involve finding loop-like operations with specific properties (e.g., large size) in the payload IR, applying loop tiling to those and only those operations, and then applying loop unrolling to the inner loops produced by the previous transformations. As such, it is not intended as a replacement for the pass infrastructure, nor for the pattern rewriting infrastructure. In the most common case, the transform IR will be processed and applied to payload IR by a pass. Transformations expressed by the transform dialect may be implemented using the pattern infrastructure or any other relevant MLIR component. This dialect is designed to be extensible, that is, clients of this dialect are allowed to inject additional operations into this dialect using the newly introduced in this patch `TransformDialectExtension` mechanism. This allows the dialect to avoid a dependency on the implementation of the transformation as well as to avoid introducing dialect-specific transform dialects. See https://discourse.llvm.org/t/rfc-interfaces-and-dialects-for-precise-ir-transformation-control/60927. Reviewed By: nicolasvasilache, Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123135	2022-04-14 13:48:45 +02:00
Alex Zinenko	09141f1adf	[mlir] Split intrinsics out of LLVMOps.td Move the operations that correspond to LLVM IR intrinsics in a separate .td file. This makes it easier to maintain the intrinsics and decreases the compile time of LLVMDialect.cpp by ~25%. Depends On D123310 Reviewed By: wsmoses, jacquesguan Differential Revision: https://reviews.llvm.org/D123315	2022-04-14 13:23:31 +02:00
Alex Zinenko	2366a43b3c	[mlir] initial support for opaque pointers in the LLVM dialect LLVM IR has introduced and is moving forward with the concept of opaque pointers, i.e. pointer types that are not carrying around the pointee type. Instead, memory-related operations indicate the type of the data being accessed through the opaque pointer. Introduce the initial support for opaque pointers in the LLVM dialect: - `LLVMPointerType` to support omitting the element type; - alloca/load/store/gep to support opaque pointers in their operands and results; this requires alloca and gep to store the element type as an attribute; - memory-related intrinsics to support opaque pointers in their operands; - translation to LLVM IR for the ops above is no longer using methods deprecated in LLVM API due to the introduction of opaque pointers. Unlike LLVM IR, MLIR can afford to support both opaque and non-opaque pointers at the same time and simplify the transition. Translation to LLVM IR of MLIR that involves opaque pointers requires the LLVMContext to be configured to always use opaque pointers. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123310	2022-04-14 13:23:29 +02:00
Mehdi Amini	35f48edb91	Apply clang-tidy fixes for llvm-qualified-auto in VectorTransforms.cpp (NFC)	2022-04-14 09:42:37 +00:00
Mehdi Amini	2d06420879	Apply clang-tidy fixes for performance-for-range-copy in SCF.cpp (NFC)	2022-04-14 09:42:36 +00:00
Arnab Dutta	392d55c1e2	[MLIR][GPU] Add canonicalization patterns for folding simple gpu.wait ops. * Fold away redundant %t = gpu.wait async + gpu.wait [%t] pairs. * Fold away %t = gpu.wait async ... ops when %t has no uses. * Fold away gpu.wait [] ops. * In case of %t1 = gpu.wait async [%t0], replace all uses of %t1 with %t0. Differential Revision: https://reviews.llvm.org/D121878	2022-04-14 12:30:55 +05:30
Stella Laurenzo	61352a580a	[mlir] Introduce ml_program dialect. Differential Revision: https://reviews.llvm.org/D120203	2022-04-13 21:38:14 -07:00
Arjun P	094ad0667c	[MLIR][Presburger] change some `push_back`s to `emplace_back`s	2022-04-14 04:27:21 +01:00
Arjun P	ef95a6e827	[MLIR][Presburger] change some post-increments/decrements to pre-increments/decrements	2022-04-14 04:26:29 +01:00
Mehdi Amini	93b23c8439	Apply clang-tidy fixes for llvm-else-after-return in SCF.cpp (NFC)	2022-04-13 23:14:44 +00:00
Mehdi Amini	f19e90bc1f	Apply clang-tidy fixes for readability-identifier-naming in OpenMPDialect.cpp (NFC)	2022-04-13 23:14:44 +00:00
Christopher Bate	77d2c815f5	[MLIR][GPU] Add GPU ops nvvm.mma.sync, nvvm.mma.ldmatrix, lane_id This change adds three new operations to the GPU dialect: gpu.mma.sync, gpu.mma.ldmatrix, and gpu.lane_id. The former two are meant to target the lower level nvvm.mma.sync and nvvm.ldmatrix instructions, respectively. Lowerings are added for the new GPU operations for conversion to NVVM. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123647	2022-04-13 22:50:07 +00:00
Thomas Raoux	5b1b7108c8	[mlir][vector] Add unrolling pattern for TransposeOp Support unrolling for vector.transpose following the same interface as other vector unrolling ops. Differential Revision: https://reviews.llvm.org/D123688	2022-04-13 19:44:16 +00:00
Mogball	60e34f8ddd	[mlir][ods] Remove StrEnumAttr StrEnumAttr has been deprecated in favour of EnumAttr, a solution based on AttrDef (https://reviews.llvm.org/D115181). This patch removes StrEnumAttr, along with all the custom ODS logic required to handle it. See https://discourse.llvm.org/t/psa-stop-using-strenumattr-do-use-enumattr/5710 on how to transition to EnumAttr. In short, ``` // Before def MyEnumAttr : StrEnumAttr<"MyEnum", "", [ StrEnumAttrCase<"A">, StrEnumAttrCase<"B"> ]>; // After (pick an integer enum of your choice) def MyEnum : I32EnumAttr<"MyEnum", "", [ I32EnumAttrCase<"A", 0>, I32EnumAttrCase<"B", 1> ]> { // Don't generate a C++ class! We want to use the AttrDef let genSpecializedAttr = 0; } // Define the AttrDef def MyEnum : EnumAttr<MyDialect, MyEnum, "my_enum">; ``` Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D120834	2022-04-13 17:49:02 +00:00
Jacques Pienaar	5a236e69f2	[mlir][pdll] Include string in PDLLServer.h	2022-04-13 10:31:06 -07:00
Jacques Pienaar	a9d7598ef8	[mlir][pdll] Rename extra dir flag Not sure why this is overlapping, need to repro locally, but just rename to something more explicit instead.	2022-04-13 10:00:10 -07:00
Jacques Pienaar	aa9da11f5d	[mlir][pdll] Add extra-dirs for LSP includes. Enable specifying additional include directories to search. This is consistent with what one can do with clangd (although there it is more general compilation options) and Python LSP. We would in general expect these to be provided by compilation database equivalent. Differential Revision: https://reviews.llvm.org/D123474	2022-04-13 09:41:45 -07:00
Shraiysh Vaishay	f0ba32d666	[mlir][LLVM-IR] Added support for global variable attributes This patch adds thread_local to llvm.mlir.global and adds translation for dso_local and addr_space to and from LLVM IR. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D123412	2022-04-13 08:31:01 +05:30
jacquesguan	605fc89a61	[mlir][Arithmetic] Add common constant folder function for type cast ops. This revision replaces current type cast constant folder with a new common type cast constant folder function template. It will cover all former folder and support fold the constant splat and vector. Differential Revision: https://reviews.llvm.org/D123489	2022-04-13 02:11:59 +00:00
Mahesh Ravishankar	b40e901333	[mlir][Linalg] Allow collapsing subset of the reassociations when fusing by collapsing. This change generalizes the fusion of `tensor.expand_shape` -> `linalg.generic` op by collapsing to handle cases where only a subset of the reassociations specified in the `tensor.expand_shape` are valid to be collapsed. The method that does the collapsing is refactored to allow it to be a generic utility when required. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D123153	2022-04-12 18:56:32 +00:00
Shraiysh Vaishay	b18e82186f	[mlir][OpenMP] Added omp.task This patch adds tasking construct according to Section 2.10.1 of OpenMP 5.0 Reviewed By: peixin, kiranchandramohan, abidmalikwaterloo Differential Revision: https://reviews.llvm.org/D123575	2022-04-12 23:55:47 +05:30
Groverkss	20aedb148b	[MLIR][Presburger] Remove inheritance from PresburgerSpace in IntegerRelation, PresburgerRelation and PWMAFunction This patch removes inheritence from PresburgerSpace in IntegerRelation and instead makes it a member of these classes. This is required for three reasons: - It prevents implicit casting to PresburgerSpace. - Not all functions of PresburgerSpace need to be exposed by the deriving classes. - IntegerRelation and IntegerPolyhedron are defined in a PresburgerSpace. It makes more sense for the space to be a member instead of them inheriting from a space. Reviewed By: arjunp, ftynse Differential Revision: https://reviews.llvm.org/D123585	2022-04-12 22:48:52 +05:30
Arjun P	0ac213667d	[MLIR][Presburger] normalizeDiv: add assert that denom > 0	2022-04-12 13:06:53 +01:00
jacquesguan	83bd4fe2e8	[mlir][Math] Replace some constant folder functions with common folder functions. Differential Revision: https://reviews.llvm.org/D123485	2022-04-12 11:34:47 +00:00
Arjun P	4aeb2a57f4	[MLIR][Presburger][Simplex] addSymbolicCut: don't add symbol div if denom is 1 This is unncessary, so we remove it as an optimization. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123540	2022-04-12 12:27:27 +01:00
Andrzej Warzynski	fb16ed258c	[mlir] Prefix pass manager options with `mlir-` With this change, there's going to be a clear distinction between LLVM and MLIR pass maanger options (e.g. `-mlir-print-after-all` vs `-print-after-all`). This change is desirable from the point of view of projects that depend on both LLVM and MLIR, e.g. Flang. For consistency, all pass manager options in MLIR are prefixed with `mlir-`, even options that don't have equivalents in LLVM . Differential Revision: https://reviews.llvm.org/D123495	2022-04-12 09:32:44 +00:00
Matthias Springer	fa087b4352	[mlir][scf][bufferize][NFC] Lookup buffer using helper function Lookup iter_arg buffers using `lookupBuffer` instead of always creating a new `ToMemrefOp`. Also cast all yielded buffers (if necessary), regardless of whether they are an equivalent buffer or a new allocation. Note: This should have been part of D123369. Differential Revision: https://reviews.llvm.org/D123383	2022-04-12 18:09:30 +09:00
Mehdi Amini	eeef40a6cd	Apply clang-tidy fixes for performance-unnecessary-value-param in LLVMDialect.cpp (NFC)	2022-04-12 07:47:04 +00:00
Mehdi Amini	e188ad8bf0	Apply clang-tidy fixes for performance-unnecessary-value-param in SplitReduction.cpp (NFC)	2022-04-12 07:47:04 +00:00
Mehdi Amini	30846d2916	Guard copy of std::function to llvm::function_ref (fix crash) This is a footgun: assigning a null std::function to a function_ref does not yield a null function_ref...	2022-04-12 07:44:04 +00:00
Mehdi Amini	07db69effe	Use std::function instead of function_ref in MLIR JitRunner This fixes an ASAN failure.	2022-04-12 07:28:19 +00:00
Mehdi Amini	6b7e6ea489	Revert "Fix CUDA runtime wrapper for GPU mem alloc/free to async" This reverts commit `b4117fede2`. This broke one of the MLIR bot, a test is failing.	2022-04-12 06:50:27 +00:00
Mehdi Amini	3c9e09036c	Apply clang-tidy fixes for readability-identifier-naming in LinalgOps.cpp (NFC)	2022-04-12 05:30:01 +00:00
Mehdi Amini	47286e17ed	Apply clang-tidy fixes for performance-for-range-copy in LinalgOps.cpp (NFC)	2022-04-12 05:30:01 +00:00
Mehdi Amini	0149a437e2	Apply clang-tidy fixes for llvm-qualified-auto in LinalgOps.cpp (NFC)	2022-04-12 04:39:58 +00:00
Mehdi Amini	bf62a4b9c5	Apply clang-tidy fixes for performance-move-const-arg in ArithmeticOps.cpp (NFC)	2022-04-12 04:39:57 +00:00
Uday Bondhugula	56245cc18c	[MLIR] NFC. Address clang-tidy warning in AffineOps.cpp NFC. Address clang-tidy warning in AffineOps.cpp.	2022-04-12 09:58:29 +05:30
Uday Bondhugula	b4117fede2	Fix CUDA runtime wrapper for GPU mem alloc/free to async Switch CUDA runtime wrapper for GPU mem alloc/free to async. The semantics of the GPU dialect ops (gpu.alloc/dealloc) and the wrappers it lowered to (gpu-to-llvm) was for the async versions -- however, this was being incorrectly mapped to cuMemAlloc/cuMemFree instead of cuMemAllocAsync/cuMemFreeAsync. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D123482	2022-04-12 09:04:02 +05:30
Peixin-Qiao	139c5b8a96	[MLIR][OpenMP] Add support for threadprivate directive This supports the threadprivate directive in OpenMP dialect following the OpenMP 5.1 [2.21.2] standard. Also lowering to LLVM IR using OpenMP IRBduiler. Reviewed By: kiranchandramohan, shraiysh, arnamoy10 Differential Revision: https://reviews.llvm.org/D123350	2022-04-12 10:15:15 +08:00
jacquesguan	920527f4b5	[mlir][NFC] Remove some redundant code. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123487	2022-04-12 02:10:04 +00:00
Eugene Zhulenev	b35b9e307f	[mlir] Add msan memory unpoisoning macros to mlir ExecutionEngine Adding annotations on as-needed bases, currently only for memrefCopy, but in general all C API functions that take pointers to memory allocated/initialized inside the jit-compiled code must be annotated, to be able to run with msan. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D123557	2022-04-11 18:58:28 -07:00
Arthur Eubanks	7ccd026cf2	Reland [mlir] Remove uses of LLVM's legacy pass manager Use the new pass manager. This also removes the ability to run arbitrary sets of passes. Not sure if this functionality is used, but it doesn't seem to be tested. No need to initialize passes outside of constructing the PassBuilder with the new pass manager. Reland: Fixed custom calls to `-lower-matrix-intrinsics` in integration tests by replacing them with `-O0 -enable-matrix`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D123425	2022-04-11 16:53:32 -07:00
Arthur Eubanks	c3efb75061	Revert "[mlir] Remove uses of LLVM's legacy pass manager" This reverts commit `b0f7f6f78d`. Causes test failures: https://lab.llvm.org/buildbot#builders/61/builds/24879	2022-04-11 16:45:19 -07:00
Mahesh Ravishankar	2291705d2b	[mlir][Linalg] Split `populateElementwiseOpsFusionPatterns`. The method to add elementwise ops fusion patterns pulls in many other patterns by default. The patterns to pull in along with the elementwise op fusion should be upto the caller. Split the method to pull in just the elementwise ops fusion pattern. Other cleanup changes include - Move the pattern for constant folding of generic ops (currently only constant folds transpose) into a separate file, cause it is not related to fusion - Drop the uber LinalgElementwiseFusionOptions. With the populateElementwiseOpsFusionPatterns being split, this has no utility now. - Drop defaults for the control function. - Fusion of splat constants with generic ops doesnt need a control function. It is always good to do. Differential Revision: https://reviews.llvm.org/D123236	2022-04-11 23:37:23 +00:00
Arthur Eubanks	b0f7f6f78d	[mlir] Remove uses of LLVM's legacy pass manager Use the new pass manager. This also removes the ability to run arbitrary sets of passes. Not sure if this functionality is used, but it doesn't seem to be tested. No need to initialize passes outside of constructing the PassBuilder with the new pass manager. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D123425	2022-04-11 16:18:45 -07:00
Mehdi Amini	f74569b37c	Apply clang-tidy fixes for llvm-qualified-auto in AffineOps.cpp (NFC)	2022-04-11 23:14:47 +00:00

1 2 3 4 5 ...

8225 Commits