llvm-project

Commit Graph

Author	SHA1	Message	Date
Min-Yih Hsu	e927a336a5	[mlir][LLVMIR] Add support for translating FCmp & FP constants This patch add supports for translating FCmp and more kinds of FP constants in addition to 32 & 64-bit ones. However, we can't express ppc_fp128 constants right now because the semantics for its underlying APFloat is `S_PPCDoubleDouble` but mlir::FloatType doesn't support such semantics right now. Differential Revision: https://reviews.llvm.org/D124630	2022-05-02 16:22:35 -07:00
Raghu Maddhipatla	c685f82126	[mlir][OpenMP] Add omp.cancel and omp.cancellationpoint. Reviewed By: kiranchandramohan, peixin, shraiysh Differential Revision: https://reviews.llvm.org/D123828	2022-05-02 12:23:38 -05:00
Eugene Zhulenev	38d0df5577	[mlir] CRunnerUtils: qualify UnrankedMemRefType to avoid collisions with mlir::UnrankedMemRefType When CRunnerUtils included together with MLIR IR headers, it can lead to compilation errors. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124744	2022-05-02 10:19:02 -07:00
Shraiysh Vaishay	a60fda59dc	[mlir][OpenMP] Restrict types for omp.parallel args This patch restricts the value of `if` clause expression to an I1 value. It also restricts the value of `num_threads` clause expression to an I32 value. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D124142	2022-05-02 14:17:34 +05:30
Alex Zinenko	946311b893	[mlir] support isa/cast/dyn_cast<Operation >(operation) This enables one to write generic code that can be instantiated for both specific operation classes and the common base class without specialization. Examples include functions that take/return ops, such as: ```mlir template <typename FnTy> void applyIf(FnTy &&lambda, ...) { for (Operation op : ...) { auto specific = dyn_cast<function_traits<FnTy>::template arg_t<0>>(op); if (specific) lambda(specific); } } ``` that would otherwise need to rely on template specialization to support lambdas that take specific operations and those that take `Operation *`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124675	2022-05-02 10:07:09 +02:00
River Riddle	3c75228991	[mlir:PDLInterp] Refactor the implementation of result type inferrence The current implementation uses a discrete "pdl_interp.inferred_types" operation, which acts as a "fake" handle to a type range. This op is used as a signal to pdl_interp.create_operation that types should be inferred. This is terribly awkward and clunky though: * This op doesn't have a byte code representation, and its conversion to bytecode kind of assumes that it is only used in a certain way. The current lowering is also broken and seemingly untested. * Given that this is a different operation, it gives off the assumption that it can be used multiple times, or that after the first use the value contains the inferred types. This isn't the case though, the resultant type range can never actually be used as a type range. This commit refactors the representation by removing the discrete InferredTypesOp, and instead adds a UnitAttr to pdl_interp.CreateOperation that signals when the created operations should infer their types. This leads to a much much cleaner abstraction, a more optimal bytecode lowering, and also allows for better error handling and diagnostics when a created operation doesn't actually support type inferrence. Differential Revision: https://reviews.llvm.org/D124587	2022-05-01 12:25:05 -07:00
Arjun P	ebbfe0136e	[MLIR][Presburger] subtraction: add support for divs defined by equalties Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D124668	2022-04-30 14:45:13 +01:00
Chris Lattner	d85eb4e2d6	[AsmParser] Introduce a new "Argument" abstraction + supporting logic MLIR has a common pattern for "arguments" that uses syntax like `%x : i32 {attrs} loc("sourceloc")` which is implemented in adhoc ways throughout the codebase. The approach this uses is verbose (because it is implemented with parallel arrays) and inconsistent (e.g. lots of things drop source location info). Solve this by introducing OpAsmParser::Argument and make addRegion (which sets up BlockArguments for the region) take it. Convert the world to propagating this down. This means that we correctly capture and propagate source location information in a lot more cases (e.g. see the affine.for testcase example), and it also simplifies much code. Differential Revision: https://reviews.llvm.org/D124649	2022-04-29 12:19:34 -07:00
Vitaly Buka	f735b3a2b0	[mlir] Prevent argStorage relocations This fixes msan reports like https://reviews.llvm.org/P8285 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124576	2022-04-29 11:09:32 -07:00
Matthias Springer	3c2a74a3ae	[mlir][linalg][transform] Add TileOp to transform dialect This commit adds a tiling op to the transform dialect as an external op. Differential Revision: https://reviews.llvm.org/D124661	2022-04-29 21:35:31 +09:00
River Riddle	651d9f70ed	[mlir:PDLL] Fix the import of native constraints from ODS We weren't properly returning the result of the constraint, which leads to errors when actually trying to use the generated C++. Differential Revision: https://reviews.llvm.org/D124586	2022-04-28 12:58:00 -07:00
River Riddle	ebb1e900d3	[mlir:PDLL] Fix error handling of eof within a string literal We currently aren't handling this properly, and in the case of a string block just crash. This commit adds proper error handling and detection for eof. Differential Revision: https://reviews.llvm.org/D124585	2022-04-28 12:58:00 -07:00
River Riddle	32bf1f1d57	[mlir:LSP] Improve conversion between SourceMgr and LSP locations SourceMgr generally uses 1-based locations, whereas the LSP is zero based. This commit corrects this conversion and also enhances the conversion from SMLoc to SMRange to support string tokens. Differential Revision: https://reviews.llvm.org/D124584	2022-04-28 12:58:00 -07:00
River Riddle	9613a850b6	[mlir:PDL] Rework errors for pdl.operations with non-inferrable results We currently emit an error during verification if a pdl.operation with non-inferrable results is used within a rewrite. This allows for catching some errors during compile time, but is slightly broken. For one, the verification at the PDL level assumes that all dialects have been loaded, which is true at run time, but may not be true when the PDL is generated (such as via PDLL). This commit fixes this by not emitting the error if the operation isn't registered, i.e. it uses the `mightHave` variant of trait/interface methods. Secondly, we currently don't verify when a pdl.operation has no explicit results, but the operation being created is known to expect at least one. This commit adds a heuristic error to detect these cases when possible and fail. We can't always capture when the user made an error, but we can capture the most common case where the user expected an operation to infer its result types (when it actually isn't possible). Differential Revision: https://reviews.llvm.org/D124583	2022-04-28 12:58:00 -07:00
River Riddle	d4381b3f93	[mlir:PDL] Fix a syntax ambiguity in pdl.attribute pdl.attribute currently has a syntax ambiguity that leads to the incorrect parsing of pdl.attribute operations with locations that don't also have a constant value. For example: ``` pdl.attribute loc("foo") ``` The above IR is treated as being a pdl.attribute with a constant value containing the location, `loc("foo")`, which is incorrect. This commit changes the syntax to use `= <constant-value>` to clearly distinguish when the constant value is present, as opposed to just trying to parse an attribute. Differential Revision: https://reviews.llvm.org/D124582	2022-04-28 12:57:59 -07:00
River Riddle	92a836da07	[mlir] Attach InferTypeOpInterface on SameOperandsAndResultType operations when possible This allows for inferring the result types of operations in certain situations by using the type of an operand. This commit allowed for automatically supporting type inference for many more operations with no additional effort, e.g. nearly all Arithmetic operations now support result type inferrence with no additional changes. Differential Revision: https://reviews.llvm.org/D124581	2022-04-28 12:57:59 -07:00
River Riddle	1bd1edaf40	[mlir:ODS] Support using attributes in AllTypesMatch to automatically add InferTypeOpInterface This allows for using attribute types in result type inference for use with InferTypeOpInterface. This was a TODO before, but it isn't much additional work to properly support this. After this commit, arith::ConstantOp can now have its InferTypeOpInterface implementation automatically generated. Differential Revision: https://reviews.llvm.org/D124580	2022-04-28 12:57:59 -07:00
Chris Lattner	99499c3ea7	[OpAsmParser] Simplify logic for requiredOperandCount in parseOperandList. I would ideally like to eliminate 'requiredOperandCount' as a bit of verification that should be in the client side, but it is much more widely used than I expected. Just tidy some pieces up around it given we can't drop it immediately. NFC. Differential Revision: https://reviews.llvm.org/D124629	2022-04-28 12:05:10 -07:00
Jacques Pienaar	9a4472c56c	[mlir] Add basic tree-sitter grammar file tree-sitter grammar file that tries to closely matches LangRef (it could use some tweaking and cleanup, but kept fairly basic). Also updated LangRef in places where found some issues while doing the nearly direct transcription. This only adds a grammar file, not all the other parts (npm etc) that accompanies it. Those I'll propose for separate repo like we do for vscode extension. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124352	2022-04-28 11:42:46 -07:00
Chris Lattner	5dedf911de	[AsmParser] Rework logic around "region argument parsing" The asm parser had a notional distinction between parsing an operand (like "%foo" or "%4#3") and parsing a region argument (which isn't supposed to allow a result number like #3). Unfortunately the implementation has two problems: 1) It didn't actually check for the result number and reject it. parseRegionArgument and parseOperand were identical. 2) It had a lot of machinery built up around it that paralleled operand parsing. This also was functionally identical, but also had some subtle differences (e.g. the parseOptional stuff had a different result type). I thought about just removing all of this, but decided that the missing error checking was important, so I reimplemented it with a `allowResultNumber` flag on parseOperand. This keeps the codepaths unified and adds the missing error checks. Differential Revision: https://reviews.llvm.org/D124470	2022-04-28 11:12:44 -07:00
Vitaly Buka	6e1ac68a0c	[mlir] Don't iterate mutable user list executeOp.operandsMutable().append(asyncTokens) in addAsyncDependencyAfter can resize and invalidate iterators. Fixes reports like https://reviews.llvm.org/P8286 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D124577	2022-04-28 08:59:55 -07:00
Vitaly Buka	9f235a88f1	[mlir][msan] Don't access destroyed node	2022-04-28 08:58:27 -07:00
Marius Brehler	84fe39a45b	[mlir][emitc] Add a cast op This adds a cast operation that allows to perform an explicit type conversion. The cast op is emitted as a C-style cast. It can be applied to integer, float, index and EmitC types. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D123514	2022-04-28 15:50:59 +00:00
Vitaly Buka	0d70bc990b	[mlir][msan][test] Disable jit tests I am going to enable MLIR test on msan bot https://lab.llvm.org/buildbot/#/builders/sanitizer-x86_64-linux-bootstrap-msan Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D124574	2022-04-28 08:50:13 -07:00
Marius Brehler	50d648b40e	[mlir][emitc] Replace !emitc.opaque pointers Replaces using !emitc.opaque pointers which using !emitc.ptr types.	2022-04-28 15:20:39 +00:00
Marius Brehler	39dd29736f	[mlir][emitc] Disallow !emitc.opaque pointers Fordbids to express pointer via the `!emitc.opaque` type. Point the user to use the `!emitc.ptr` type instead. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D124002	2022-04-28 15:08:21 +00:00
Lei Zhang	bbffece383	[mlir][spirv] Remove layout decoration on unneeded storage classes Per SPIR-V validation rules, explict layout decorations are only needed for StorageBuffer, PhysicalStorageBuffer, Uniform, and PushConstant storage classes. (And even that is for Shader capabilities). So we don't need such decorations on the rest. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124543	2022-04-28 08:18:23 -04:00
Lei Zhang	8854b73606	[mlir][spirv] Convert memref.alloca to spv.Variable Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124542	2022-04-28 08:13:40 -04:00
Javier Setoain	6301574206	[mlir][SparseTensor] Enable VLA ops in index value generation Current index value generation uses fixed-length vector ops, this patch adds an alterantive codegen path compatible with scalable vectors by using `LLVM::StepVectorOp`. Differential Revision: https://reviews.llvm.org/D124454	2022-04-28 09:39:07 +01:00
Aart Bik	ccd047cba4	[mlir][sparse] optimize COO index handling By using a shared index pool, we reduce the footprint of each "Element" in the COO scheme and, in addition, reduce the overhead of allocating indices (trading many allocations of vectors for allocations in a single vector only). When the capacity is known, this means all allocation can be done in advance. This is a big win. For example, reading matrix SK-2005, with dimensions 50,636,154 x 50,636,154 and 1,949,412,601 nonzero elements improves as follows (time in ms), or about 3.5x faster overall ``` SK-2005 before after speedup --------------------------------------------- read 305,086.65 180,318.12 1.69 sort 2,836,096.23 510,492.87 5.56 pack 364,485.67 312,009.96 1.17 --------------------------------------------- TOTAL 3,505,668.56 1,002,820.95 3.50 ``` Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D124502	2022-04-27 10:20:47 -07:00
Min-Yih Hsu	a75657d66a	[mlir][LLVMIR] Do not cache llvm::Constant into instMap Constants in MLIR are not globally unique, unlike that in LLVM IR. Therefore, reusing previous-translated constants might cause the user operations not being dominated by the constant (because the previous-translated ones can be placed in arbitrary place) This indeed misses some opportunities where we actually can reuse a previous-translated constants, but verbosity is not our first priority here. Differential Revision: https://reviews.llvm.org/D124404	2022-04-27 09:43:49 -07:00
Min-Yih Hsu	ea9bcb8b27	[mlir][LLVMIR] Do not cache Instruction generated on-the-fly More specifically, the llvm::Instruction generated by llvm::ConstantExpr::getAsInstruction. Such Instruction will be deleted right away, but it's possible that when getAsInstruction is called again, it will create a new Instruction that has the same address with the one we just deleted. Thus, we shouldn't keep it in the `instMap` to avoid a conflicting index that triggers an assertion in processInstruction. Differential Revision: https://reviews.llvm.org/D124402	2022-04-27 09:42:59 -07:00
Min-Yih Hsu	00fcf9e95a	[mlir][LLVMIR] Add support for importing struct-type ConstantAggregate(Zero) And move importer test files from `test/Target/LLVMIR` into `test/Target/LLVMIR/Import`. We simply translate struct-type ConstantAggregate(Zero) into a serious of `llvm.insertvalue` operations against a `llvm.undef` root. Note that this doesn't affect the original logics on translating vector/array-type ConstantAggregate values. Differential Revision: https://reviews.llvm.org/D124399	2022-04-27 09:42:26 -07:00
Mathieu Fehr	88bc24a7e3	[mlir] Allow setting operation legality with an OperationName This is necessary to handle conversions of operations defined at runtime in extensible dialects. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124353	2022-04-27 08:54:51 -07:00
Lei Zhang	d137c05fc9	[mlir][spirv] Add conversion from vector.reduction Only supports addition and multiplication for now; other cases to be implemented. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124380	2022-04-27 10:29:46 -04:00
Lei Zhang	38e802a09d	[mlir][spirv] Allow converting from index type in unsigned ops `index` type is converted to `i32` in SPIR-V. This is fine to support for all signed/unsigned ops. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124451	2022-04-27 10:13:50 -04:00
Mathieu Fehr	9e0b553359	[mlir] Add extensible dialects Depends on D104534 Add support for extensible dialects, which are dialects that can be extended at runtime with new operations and types. These operations and types cannot at the moment implement traits or interfaces. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104554	2022-04-26 19:48:22 -07:00
River Riddle	71aad31c0b	[mlir:PDLL] Use normalized paths in compilation database test This fixes issues with the compilation database when the file path isn't in the correct form.	2022-04-26 19:45:30 -07:00
River Riddle	021b254547	[mlir:PDLL] Fix build on windows related to different file paths This fixes issues with the compilation database when the file path isn't in the correct form.	2022-04-26 19:40:41 -07:00
River Riddle	41d2c6df5c	[mlir][PDLL-LSP] Add code completion for include file paths This allows for providing completion results for include directive file paths by searching the set of include directories for the current file. Differential Revision: https://reviews.llvm.org/D124112	2022-04-26 18:33:17 -07:00
River Riddle	09af7fefc8	[mlir][PDLL] Add document link and hover support to mlir-pdll-lsp-server This allows for navigating to included files on click, and also provides hover information about the include file (similarly to clangd). Differential Revision: https://reviews.llvm.org/D124077	2022-04-26 18:33:17 -07:00
River Riddle	fb5a59f6e1	[mlir][PDLL] Add initial support for a PDLL compilation database The compilation database acts in a similar way to the compilation database (compile_commands.json) used by clang-tidy, i.e. it provides additional information about the compilation of project files to help the language server. The main piece of information provided by the PDLL compilation database in this commit is the set of include directories used when processing the input .pdll file. This allows for the server to properly process .pdll files that use includes anchored by the include directories set up in the build system. The structure of the textual form of a compilation database is a yaml file containing documents of the following form: ``` --- !FileInfo: filepath: <string> - Absolute file path of the file. includes: <string> - Semi-colon delimited list of include directories. ``` This commit also adds support to cmake for automatically generating a `pdll_compile_commands.yml` file at the top-level of the build directory. Differential Revision: https://reviews.llvm.org/D124076	2022-04-26 18:33:17 -07:00
River Riddle	597fde54a8	[mlir][PDLL] Add support for generating PDL patterns from PDLL at build time This essentially sets up mlir-pdll to function in a similar manner to mlir-tblgen. Aside from the boilerplate of configuring CMake and setting up a basic initial test, two new options are added to mlir-pdll to mirror options provided by tblgen: * -d This option generates a dependency file (i.e. a set of build time dependencies) while processing the input file. * --write-if-changed This option only writes to the output file if the data would have changed, which for the build system prevents unnecesarry rebuilds if the file was touched but not actually changed. Differential Revision: https://reviews.llvm.org/D124075	2022-04-26 18:33:16 -07:00
River Riddle	b3fc0fa84a	[mlir][PDLL] Don't use the result of `Constraint::getDefName()` when uniquing In the case of anonymous defs this may return the name of the base def class, which can lead to two different defs with the same name (which hits an assert). This commit adds a new `getUniqueDefName` method that returns a unique name for the constraint. Differential Revision: https://reviews.llvm.org/D124074	2022-04-26 18:33:16 -07:00
Aart Bik	33e8ab8ea0	[mlir][sparse] support pattern-only matrices from Matrix Market We simply set nonzero entries to the value "1" in this case. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D124475	2022-04-26 15:50:21 -07:00
Michael Kruse	ff289feeba	[OpenMPIRBuilder] Remove ContinuationBB argument from Body callback. The callback is expected to create a branch to the ContinuationBB (sometimes called FiniBB in some lambdas) argument when finishing. This creates problems: 1. The InsertPoint used for CodeGenIP does not need to be the end of a block. If it is not, a naive callback will insert a branch instruction into the middle of the block. 2. The BasicBlock the CodeGenIP is pointing to may or may not have a terminator. There is an conflict where to branch to if the block already has a terminator. 3. Some API functions work only with block having a terminator. Some workarounds have been used to insert a temporary terminator that is removed again. 4. Some callbacks are sensitive to whether the BasicBlock has a terminator or not. This creates a callback ordering problem where different callback may have different behaviour depending on whether a previous callback created a terminator or not. The problem also exists for FinalizeCallbackTy where some callbacks do create branch to another "continue" block, but unlike BodyGenCallbackTy does not receive the target as argument. This is not addressed in this patch. With this patch, the callback receives an CodeGenIP into a BasicBlock where to insert instructions. If it has to insert control flow, it can split the block at that position as needed but otherwise no separate ContinuationBB is needed. In particular, a callback can be empty without breaking the emitted IR. If the caller needs the control flow to branch to a specific target, it can insert the branch instruction itself and pass an InsertPoint before the terminator to the callback. Certain frontends such as Clang may expect the current IRBuilder position to be at the end of a basic block. In this case its callbacks must split the block at CodeGenIP before setting the IRBuilder position such that the instructions after CodeGenIP are moved to another basic block and before returning create a new branch instruction to the split block. Some utility functions such as `splitBB` are supporting correct splitting of BasicBlocks, independent of whether they have a terminator or not, returning/setting the InsertPoint of an IRBuilder to the end of split predecessor block, and optionally omitting creating a branch to the split successor block to be added later. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D118409	2022-04-26 16:35:01 -05:00
Yi Zhang	e1318078a4	Support non identity layout map for reshape ops in MemRefToLLVM lowering This change borrows the ideas from `computeExpanded/CollapsedLayoutMap` and computes the dynamic strides at runtime for the memref descriptors. Differential Revision: https://reviews.llvm.org/D124001	2022-04-26 13:03:53 -04:00
Alex Zinenko	b84f95fe53	[mlir] Fix -Wunused-private-field in the Transform dialect Classes derived from TransformState::Extension may need access to the parent state.	2022-04-26 14:05:24 +02:00
Alex Zinenko	2b985a7ae8	[mlir] Add a title to the Transform Dialect doc	2022-04-26 13:04:41 +02:00
Krzysztof Drewniak	d35f7f254f	[mlir] Allow data flow analysis of non-control flow branch arguments This commit adds the visitNonControlFlowArguments method to DataFlowAnalysis, allowing analyses to provide lattice values for the arguments to a RegionSuccessor block that aren't directly tied to an op's inputs. For example, integer range interface can use this method to infer bounds for the step values in loops. This method has a default implementation that keeps the old behavior of assigning a pessimistic fixedpoint state to all such arguments. Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D124021	2022-04-25 20:19:34 +00:00
jfurtek	c4caa90b15	[mlir][tblgen] Generate builders with inferred return types and unwrapped attributes This diff causes mlir-tblgen to generate code for an additional builder for an operation argument with a return type that can be inferred AND an attribute in the argument list can be "unwrapped." (Previously, the unwrapped build function was only generated for builders with explicit return types in separate or aggregate form.) As an example, this builder might be used by code that creates operations that implement the `SameOperandsAndResultType` interface. A test case was created. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D124043	2022-04-25 19:00:44 +00:00
Jeremy Furtek	a266a21000	[mlir][ods] Extend the EnumAttr tablegen class to support BitEnum attributes This diff allows the EnumAttr class to be used for bit enum attributes (in addition to previously supported integer enum attributes). While integer and bit enum attributes share many common implementation aspects, parsing bit enum values requires a separate implementation. This is accomplished by creating empty parser and printer strings in the EnumAttrInfo record, and having derived classes (specific to bit and integer enums) override with an appropriate parser/printer string. To support existing bit enums that may use a vertical bar separator, the parser is modified to support the \| token. Tests were added for bit enums alongside integer enums. Future diffs for fastmath attributes in the arithmetic dialect will use these changes. (resubmission of earlier abaondoned diff, updated to reflect subsequent changes in the repository) Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D123880	2022-04-25 19:00:00 +00:00
jfurtek	4e5dee2f30	[mlir][ods] Add tablegen field for concise printing of BitEnum attributes This diff introduces a tablegen field for bit enum attributes (`printBitEnumPrimaryGroups`) to control printing when the enum uses "group" cases. An example would be an implementation that uses a `fastmath` enum value as an alias for individual fastmath flags. The proposed field would allow printing of simply `fast` for the enum value, instead of the more verbose list that would include `fast` as well as the individual flags (e.g. `reassoc,nnan, ninf,nsz,arcp,contract,afn,fast`). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123871	2022-04-25 18:48:35 +00:00
Markus Böck	12a2716953	[mlir][LLVM] Support opaque pointers in `llvm.mlir.addressof` The verifier of llvm.mlir.addressof did not properly account for opaque pointers, that is, the pointer type not having an element type equal to the type of the referenced global or function. This patch fixes that by skipping the test for the element type if the pointer is opaque. Differential Revision: https://reviews.llvm.org/D124333	2022-04-25 12:23:16 +02:00
Alex Zinenko	4c807f2f57	[mlir][vector] insert `alloca`s outside of loops After https://reviews.llvm.org/D119743 added the `AutomaticAllocationScope` trait to loop-like constructs, the vector transfer full/partial splitting pass started inserting allocations for temporaries within the closest loop rather than the closest function (or other allocation scope such as `async.execute`). While this is correct as long as the lowered code takes care of automatic deallocation at the end of each iteration of the loop, this interferes with downstream optimizations that expect `alloca`s to be at the function level. Step over loops when looking for the closest allocation scope in vector transfer full/partial splitting pass thus restoring the original behavior. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124366	2022-04-25 10:49:09 +02:00
Markus Böck	34312f1f0c	[mlir][LLVM] Support opaque pointers in data layout entries This is likely preferable to having it crash if one were to specify an opaque pointer type, and the actual element type is unused either way. Differential Revision: https://reviews.llvm.org/D124334	2022-04-25 09:14:33 +02:00
Nick Kreeger	4620032ee3	Revert "[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options." This reverts commit `d59cf901cb`. Build fails on NVIDIA Sparse tests: https://lab.llvm.org/buildbot/#/builders/61/builds/25447	2022-04-23 20:14:48 -05:00
Nick Kreeger	d59cf901cb	[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options. The SparseTensor passes currently use opaque numbers for the CLI, despite using an enum internally. This patch exposes the enums instead of numbered items that are matched back to the enum. Fixes GitHub issue #53389 Reviewed by: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D123876	2022-04-23 19:16:57 -05:00
Matthias Springer	48b8edac1c	[mlir][bufferize][NFC] Remove old references to Comprehensive Bufferize Differential Revision: https://reviews.llvm.org/D124324	2022-04-23 18:01:05 +09:00
Matthias Springer	940a3f6b3d	[mlir][bufferize][NFC] Clean up test cases Run `one-shot-bufferize` instead of `linalg-comprehensive-module-bufferize` and move some test cases to their respective dialects. Differential Revision: https://reviews.llvm.org/D124323	2022-04-23 18:00:55 +09:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
River Riddle	f3ebf828dc	[mlir] Generate Dialect constructors in .cpp instead of .h By generating in the .h file, we were forcing dialects to include a lot of additional header files because: * Fields of the dialect, e.g. std::unique_ptr<>, were unable to use forward declarations. * Dependent dialects are loaded in the constructor, requiring the full definition of each dependent dialect (which, depending on the file structure of the dialect, may include the operations). By generating in the .cpp we get much faster builds, and also better align with the rest of the code base. Fixes #55044 Differential Revision: https://reviews.llvm.org/D124297	2022-04-23 00:44:54 -07:00
Markus Böck	8ed2bd1e74	[mlir][LLVM] Fix `DataLayoutTypeInterface` for opqaue pointers with non-default address space As a fallback mechanism, if no entry was supplied for a given address space, the size or alignment for a pointer type with the default address space is returned instead. This code currently crashes with opaque pointers, as it tries to construct a typed pointer type from the opaque pointer type, leading to a null pointer dereference when fetching the element type. This patch fixes the issue by handling the opaque pointer cases explicitly. Differential Revision: https://reviews.llvm.org/D124290	2022-04-23 00:10:31 +02:00
Markus Böck	bab3d3778d	[mlir][LLVM] Fix crash when using opaque pointers in function signatures Using opaque pointers in function signatures leads to an attempt to recursively convert all types, including sub types in LLVM types. In the case of LLVM pointers, it may not have a subtype aka element type if it is opaque which would then lead to a null pointer dereference. Differential Revision: https://reviews.llvm.org/D124291	2022-04-23 00:10:31 +02:00
Yi Zhang	1cddcfdc3c	Fix CollapsedLayoutMap for dim size 1 case This change fixes `CollapsedLayoutMap` for cases where the collapsed dims are size 1. The cases where inner most dims are size 1 and noncontiguous can be represented by the strided form and therefore can be allowed. For such cases, the new stride should be of the next entry in an association whose dimension is not size 1. If the next entry is dynamic, it's not possible to decide which stride to use at compilation time and the stride is set to dynamic. Differential Revision: https://reviews.llvm.org/D124137	2022-04-22 17:48:24 -04:00
Alex Zinenko	40a8bd635b	[mlir] use side effects in the Transform dialect Currently, the sequence of Transform dialect operations only supports a single use of each operand (verified by the `transform.sequence` operation). This was originally motivated by the need to guard against accessing a payload IR operation associated with a transform IR value after this operation has likely been rewritten by a transformation. However, not all Transform dialect operations rewrite payload IR, in particular the "navigation" operation such as `transform.pdl_match` do not. Introduce memory effects to the Transform dialect operations to describe their effect on the payload IR and the mapping between payload IR opreations and transform IR values. Use these effects to replace the single-use rule, allowing repeated reads and disallowing use-after-free, where operations with the "free" effect are considered to "consume" the transform IR value and rewrite the corresponding payload IR operations). As an additional improvement, this enables code motion transformation on the transform IR itself. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124181	2022-04-22 23:29:11 +02:00
Okwan Kwon	ee285faed2	[mlir] Do not bubble up extract slice when it is rank-reducing. The bubble up logic was written by assuming the slice operation is always a normal slice that outputs a tensor with the same rank. Differential Revision: https://reviews.llvm.org/D124283	2022-04-22 12:21:47 -07:00
cpillmayer	3e8560f890	[MLIR] Add option to print users of an operation as comment in the printer This allows printing the users of an operation as proposed in the git issue #53286. To be able to refer to operations with no result, these operations are assigned an ID in SSANameState. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124048	2022-04-22 18:58:10 +00:00
Jacques Pienaar	9bae20b528	[mlir] Add shape.func Add shape func op for use (primarily) in shape function_library op. Allows setting default dialect for some simpler authoring. This is a minimal version of the ops needed. Differential Revision: https://reviews.llvm.org/D124055	2022-04-22 11:35:35 -07:00
Lei Zhang	6f28fd0bf7	[mlir][vector] Fold 1-element reduction into extract or arith ops If there is only one single element in the vector, then we can just extract the element to compute the final result. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124129	2022-04-22 14:24:46 -04:00
Matthias Springer	d6dab38ae4	[mlir][bufferize][NFC] Add function boundary bufferization flag to BufferizationOptions This makes the API easier to use. Also allows us to check for incorrect API usage for easier debugging. Differential Revision: https://reviews.llvm.org/D124265	2022-04-23 01:11:37 +09:00
Matthias Springer	b0b19fae81	[mlir][bufferize][NFC] Rewrite op filter logic The `hasFilter` field is not needed. Instead, the filter accepts ops by default if no ALLOW rule was specified. Differential Revision: https://reviews.llvm.org/D124264	2022-04-23 00:25:24 +09:00
Lei Zhang	fc760c0260	[mlir][vector] Fold cancelling vector.shape_cast(vector.broadcast) vector.broadcast can inject all size one dimensions. If it's followed by a vector.shape_cast to the original type, we can cancel the op pair, like cancelling consecutive shape_cast ops. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124094	2022-04-22 08:58:26 -04:00
Matthias Springer	494505f39f	[mlir][bufferize][NFC] Move SCF test cases to SCF dialect Differential Revision: https://reviews.llvm.org/D124249	2022-04-22 20:35:20 +09:00
Matthias Springer	e07a7fd5c0	[mlir][bufferization] Move ModuleBufferization to bufferization dialect * Move Module Bufferization to the bufferization dialect. The implementation is split into `OneShotModuleBufferize.cpp` and `FuncBufferizableOpInterfaceImpl.cpp`, so that the external model implementation can be easily moved to the func dialect in the future. * Split and clean up test cases. A few test cases are still remaining in Linalg and will be updated separately. * `linalg.inplaceable` is renamed to `bufferization.writable` to accurately reflect its current usage. * Attributes and their verifiers are moved from the Linalg dialect to the Bufferization dialect. * Expand documentation. * Add a new flag to One-Shot Bufferize to allow for function boundary bufferization. Differential Revision: https://reviews.llvm.org/D122229	2022-04-22 19:37:28 +09:00
Matthias Springer	bd1d87e3d1	[mlir][bufferization][NFC] Remove layout post processing step The layout postprocessing step was removed and is now part of the FuncOp bufferization. If the user specified a certain layout map for a tensor function arg, use that layout map directly when bufferizing the function signature. Previously, the bufferization used a generic layout map for every tensor function arg and then updated function signatures and CallOps in a separate step. Differential Revision: https://reviews.llvm.org/D122228	2022-04-22 18:49:47 +09:00
Matthias Springer	70777d967f	[mlir][bufferize][NFC] Move FuncOp bufferization to BufferizableOpInterface impl FuncOps are now less special. They must still be analyzed + bufferized in a certain order, but they are now bufferized same as other ops that have a region: Bufferize the op first (`bufferize` interface method), then bufferize the region body with other bufferization patterns. In the case of FuncOps, the function signature is bufferized together with ReturnOps. Similar to how, e.g., scf.for ops are bufferized together with scf.yield ops. This change is essentially a reimplementation of the FuncOp bufferization, but mostly NFC from a user's perspective (apart from error messages). This change is in preparation of moving the code to the bufferization dialect. Differential Revision: https://reviews.llvm.org/D123214	2022-04-22 18:47:12 +09:00
Matthias Springer	d820acdde1	[mlir][bufferize][NFC] Use custom walk instead of GreedyPatternRewriter The bufferization driver was previously using a GreedyPatternRewriter. This was problematic because bufferization must traverse ops top-to-bottom. The GreedyPatternRewriter was previously configured via `useTopDownTraversal`, but this was a hack; this API was just meant for performance improvements and should not affect the result of the rewrite. BEGIN_PUBLIC No public commit message needed. END_PUBLIC Differential Revision: https://reviews.llvm.org/D123618	2022-04-22 18:23:09 +09:00
jacquesguan	9b32886e7e	[mlir][Arithmetic] Use common constant fold function in RemSI and RemUI to cover splat. This patch replaces current fold function with the common constant fold funtion in order to cover the situation of constant splat. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D124236	2022-04-22 09:20:18 +00:00
jacquesguan	abc17a6751	[mlir][Arithmetic] Use matchPattern to simplify code. This patch replaces some code with matchPattern and move them before the constant folder function in order to avoid redundant invoking. Differential Revision: https://reviews.llvm.org/D124235	2022-04-22 08:42:51 +00:00
Adrian Kuegel	a74e5a89b9	[mlir] Move isGuaranteedCollapsible to CollapseShapeOp (NFC). It seems more natural than to have it as a static method of ExpandShapeOp. Also fix a typo ("the the" -> "the"). Differential Revision: https://reviews.llvm.org/D124234	2022-04-22 10:31:25 +02:00
Will Dietz	bb8c8751cf	[MLIR] prefer /bin/sh over /bin/bash for simple test scripts These scripts do not appear to require bash, and while /bin/sh is not guaranteed either it's more commonly available. Fixes tests on NixOS and in certain sandbox build environments. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124205	2022-04-21 20:25:17 -05:00
Amy Zhuang	5bd4bcfc04	[mlir] Modify SuperVectorize to generate select op->combiner op Insert the select op before the combiner op when vectorizing a reduction loop that needs a mask, so the vectorized reduction loop can pass isLoopParallel check and be transformed correctly in later passes. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D124047	2022-04-21 17:09:13 -07:00
Mahesh Ravishankar	0c090dcc8a	[mlir][Linalg] Deprecate legacy reshape + generic op folding patterns. These patterns have been superceded by the fusion by collapsing patterns. Differential Revision: https://reviews.llvm.org/D124145	2022-04-21 22:25:23 +00:00
Chris Lattner	31c8abc3f1	[AsmParser/Printer] Rework sourceloc support for function arguments. When Location tracking support for block arguments was added, we discussed various approaches to threading support for this through function-like argument parsing. At the time, we added a parallel array of locations that could hold this. It turns out that that approach was verbose and error prone, roughly no one adopted it. This patch takes a different approach, adding an optional source locator to the UnresolvedOperand class. This fits much more naturally into the standard structure we use for representing locators, and gives all the function like dialects locator support for free (e.g. see the test adding an example for the LLVM dialect). Differential Revision: https://reviews.llvm.org/D124188	2022-04-21 12:43:36 -07:00
Frederik Gossen	673e9828be	[MLIR] Fix iteration counting in greedy pattern application Previously, checking that a fix point is reached was counted as a full iteration. As this "iteration" never changes the IR, this seems counter- intuitive. Differential Revision: https://reviews.llvm.org/D123641	2022-04-21 15:17:28 -04:00
Alex Zinenko	0edb262d91	[mlir] enable doc generation for the transform dialect	2022-04-21 18:52:08 +02:00
Fangrui Song	ae46b3e01f	Revert D121279 "[MLIR][GPU] Add canonicalizer for gpu.memcpy" This reverts commit `12f55cac69`. Causes miscompile. Will follow up with a reproduce.	2022-04-21 08:55:13 -07:00
Alex Zinenko	30f22429d3	[mlir] Connect Transform dialect to PDL This introduces a pair of ops to the Transform dialect that connect it to PDL patterns. Transform dialect relies on PDL for matching the Payload IR ops that are about to be transformed. For this purpose, it provides a container op for patterns, a "pdl_match" op and transform interface implementations that call into the pattern matching infrastructure. To enable the caching of compiled patterns, this also provides the extension mechanism for TransformState. Extensions allow one to store additional information in the TransformState and thus communicate it between different Transform dialect operations when they are applied. They can be added and removed when applying transform ops. An extension containing a symbol table in which the pattern names are resolved and a pattern compilation cache is introduced as the first client. Depends On D123664 Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124007	2022-04-21 16:23:10 +02:00
Markus Böck	850b2c6b3c	[mlir] Fix `Region`s `takeBody` method if the region is not empty The current implementation of takeBody first clears the Region, before then taking ownership of the blocks of the other regions. The issue here however, is that when clearing the region, it does not take into account references of operations to each other. In particular, blocks are deleted from front to back, and operations within a block are very likely to be deleted despite still having uses, causing an assertion to trigger [0]. This patch fixes that issue by simply calling dropAllReferences()before clearing the blocks. [0] `9a8bb4bc63/mlir/lib/IR/Operation.cpp (L154)` Differential Revision: https://reviews.llvm.org/D123913	2022-04-21 15:32:59 +02:00
Markus Böck	a41aaf166f	[mlir] Make `Regions`s `cloneInto` multithread-readable Prior to this patch, `cloneInto` would do a simple walk over the blocks and contained operations and clone and map them as it encounters them. As finishing touch it then remaps any successor and operands it has remapped during that process. This is generally fine, but sadly leads to a lot of uses of both operations and blocks from the source region, in the cloned operations in the target region. Those uses lead to writes in the use-def list of the operations, making `cloneInto` never thread safe. This patch reimplements `cloneInto` in three steps to avoid ever creating any extra uses on elements in the source region: * It first creates the mapping of all blocks and block operands * It then clones all operations to create the mapping of all operation results, but does not yet clone any regions or set the operands * After all operation results have been mapped, it now sets the operations operands and clones their regions. That way it is now possible to call `cloneInto` from multiple threads if the Region or Operation is isolated-from-above. This allows creating copies of functions or to use `mlir::inlineCall` with the same source region from multiple threads. In the general case, the method is thread-safe if through cloning, no new uses of `Value`s from outside the cloned Operation/Region are created. This can be ensured by mapping any outside operands via the `BlockAndValueMapping` to `Value`s owned by the caller thread. While I was at it, I also reworked the `clone` method of `Operation` a little bit and added a proper options class to avoid having a `cloneWithoutRegionsAndOperands` method, and be more extensible in the future. `cloneWithoutRegions` is now also a simple wrapper that calls `clone` with the proper options set. That way all the operation cloning code is now contained solely within `clone`. Differential Revision: https://reviews.llvm.org/D123917	2022-04-21 13:43:00 +02:00
Uday Bondhugula	f47a38f517	Add async dependencies support for gpu.launch op Add async dependencies support for gpu.launch op: this allows specifying a list of async tokens ("streams") as dependencies for the launch. Update the GPU kernel outlining pass lowering to propagate async dependencies from gpu.launch to gpu.launch_func op. Previously, a new stream was being created and destroyed for a kernel launch. The async deps support allows the kernel launch to be serialized on an existing stream. Differential Revision: https://reviews.llvm.org/D123499	2022-04-21 16:25:59 +05:30
Nimish Mishra	00c511b351	Added lowering support for atomic read and write constructs This patch adds lowering support for atomic read and write constructs. Also added is pointer modelling code to allow FIR pointer like types to be inferred and converted while lowering. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D122725 Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com>	2022-04-21 12:19:13 +05:30
River Riddle	0fd3a1ce60	[mlir][NFC] Update remaining textual references of un-namespaced `func` operations The special case parsing of operations in the `func` dialect is being removed, and operations will require the dialect namespace prefix.	2022-04-20 22:17:31 -07:00
River Riddle	cda6aa78f8	[mlir][NFC] Update textual references of `func` to `func.func` in Transform tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	a4936cb3e8	[mlir][NFC] Update textual references of `func` to `func.func` in Pass/Target tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	63237cddc1	[mlir][NFC] Update textual references of `func` to `func.func` in tool/runner tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	6a99d29022	[mlir][NFC] Update textual references of `func` to `func.func` in IR/Interface tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	87db8e4439	[mlir][NFC] Update textual references of `func` to `func.func` in Integration tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00
River Riddle	c48e3a13f3	[mlir][NFC] Update textual references of `func` to `func.func` in Tensor/Tosa/Vector tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00
River Riddle	2c7836ef15	[mlir][NFC] Update textual references of `func` to `func.func` in SPIRV tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00
River Riddle	fb35cd3baf	[mlir][NFC] Update textual references of `func` to `func.func` in SparseTensor tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00
River Riddle	0254b0bcf0	[mlir][NFC] Update textual references of `func` to `func.func` in LLVM/Math/MemRef/NVGPU/OpenACC/OpenMP/Quant/SCF/Shape tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:28 -07:00
River Riddle	92d38adb83	[mlir][NFC] Update textual references of `func` to `func.func` in Linalg tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:28 -07:00
River Riddle	412b8850f6	[mlir][NFC] Update textual references of `func` to `func.func` in Bufferization/Complex/EmitC/CF/Func/GPU tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:28 -07:00
River Riddle	5e7dea225b	[mlir][NFC] Update textual references of `func` to `func.func` in AMX/Arithmetic/ArmSVE/Async tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:28 -07:00
River Riddle	227ed2f448	[mlir][NFC] Update textual references of `func` to `func.func` in Affine/ tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:27 -07:00
River Riddle	3028bf740e	[mlir][NFC] Update textual references of `func` to `func.func` in Conversion/ tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:27 -07:00
River Riddle	ccaabff131	[mlir][NFC] Update textual references of `func` to `func.func` in Analysis/ tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:27 -07:00
River Riddle	8ae83bb8be	[mlir][NFC] Update textual references of `func` to `func.func` in ODS documentation The special case parsing of `func` operations is being removed.	2022-04-20 22:17:26 -07:00
River Riddle	2310ced874	[mlir][NFC] Update textual references of `func` to `func.func` in examples+python scripts The special case parsing of `func` operations is being removed.	2022-04-20 22:17:26 -07:00
Shraiysh Vaishay	88bb2521b0	[mlir][OpenMP] Add checks and tests for hint clause and fix empty hint This patch handles empty hint value for critical and atomic constructs. This also adds checks and tests for hint clause on atomic constructs. Reviewed By: peixin, kiranchandramohan, NimishMishra Differential Revision: https://reviews.llvm.org/D123186	2022-04-21 07:31:03 +05:30
Mehdi Amini	02eac667ed	Improve invalid-ir-print-after-failure.mlir to show the effects of -mlir-print-assume-verified (NFC)	2022-04-20 20:40:41 +00:00
Matthias Springer	8544523dcb	[mlir][tensor] Promote extract(from_elements(...)) to folding pattern Differential Revision: https://reviews.llvm.org/D123617	2022-04-20 23:47:42 +09:00
gysit	407b351da2	[mlir][linalg] Add ods-gen helper to simplify the build methods. Add a helper used to implement the build methods generated by ods-gen. The change reduces code size and compilation time since all structured op builders use the same build method. The change reduces the LinalgOps.cpp compilation time from 10.2s to 9.8s (debug build). Depends On D123987 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D124003	2022-04-20 13:14:38 +00:00
gysit	17721b6915	[mlir][linalg] Avoid template methods for parsing and printing. The revision avoids template methods for parsing and printing that are replicated for every named operation. Instead, the new methods take a regionBuilder argument. The revision reduces the compile time of LinalgOps.cpp from 11.2 to 10.2 seconds (debug build). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D123987	2022-04-20 13:06:31 +00:00
Uday Bondhugula	d7565de6cc	[MLIR] NFC. Drop trailing white space in GPU async ops print NFC. Drop trailing end of line white space in GPU async ops' printer whenever the list of async deps is empty. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D123754	2022-04-20 17:56:53 +05:30
Uday Bondhugula	d423fc3724	Add RegionBranchOpInterface on affine.for op Add RegionBranchOpInterface on affine.for op so that transforms relying on RegionBranchOpInterface can support affine.for. E.g.: buffer-deallocation pass. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D123568	2022-04-20 17:46:07 +05:30
Matthias Springer	9235e597a4	[mlir][bufferize] Fix missing copies when writing to a buffer in a loop Writes into tensors that are definied outside of a repetitive region, but with the write happening inside of the repetitive region were previously not considered conflicts. This was incorrect. E.g.: ``` %0 = ... : tensor<?xf32> scf.for ... { "reading_op"(%0) : tensor<?xf32> %1 = "writing_op"(%0) : tensor<?xf32> -> tensor<?xf32> ... } ``` In the above example, "writing_op" should be out-of-place. This commit fixes the bufferization for any op that declares its repetitive semantics via RegionBranchOpInterface.	2022-04-20 18:51:06 +09:00
Ingo Müller	1fe1f913c5	[mlir][docs] Add missing parentheses in example code on walkers. `getFunction` was missing parentheses. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D123999	2022-04-20 09:43:41 +00:00
jacquesguan	61baf2ffa7	[mlir][Vector] Add check of supported reduction kind for ScanOp. This patch adds check of supported reduction kind for ScanOp to avoid using and/or/xor for floating point type. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123977	2022-04-20 02:42:19 +00:00
jacquesguan	590a38920f	[mlir][LLVMIR] Add vector predication type cast intrinsic ops. This patch adds vector predication type cast intrinsic ops. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123996	2022-04-20 02:11:14 +00:00
Mehdi Amini	8608ed1441	Apply clang-tidy fixes for llvm-twine-local in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-20 00:39:10 +00:00
John Demme	6b0bed7ea5	[MLIR] [Python] Add a method to clear live operations map Introduce a method on PyMlirContext (and plumb it through to Python) to invalidate all of the operations in the live operations map and clear it. Since Python has no notion of private data, an end-developer could reach into some 3rd party API which uses the MLIR Python API (that is behaving correctly with regard to holding references) and grab a reference to an MLIR Python Operation, preventing it from being deconstructed out of the live operations map. This allows the API developer to clear the map when it calls C++ code which could delete operations, protecting itself from its users. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123895	2022-04-19 15:14:09 -07:00
Krzysztof Drewniak	ddc2eb0ada	[mlir] Adds getUpperBound() to LoopLikeInterface. getUpperBound is analogous to getLowerBound(), except for the upper bound, and is used in range analysis. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124020	2022-04-19 19:56:44 +00:00
Alex Zinenko	0eb403ad1b	[mlir][transform] Introduce transform.sequence op Sequence is an important transform combination primitive that just indicates transform ops being applied in a row. The simplest version requires fails immediately if any transformation in the sequence fails. Introducing this operation allows one to start placing transform IR within other IR. Depends On D123135 Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123664	2022-04-19 21:41:02 +02:00
Ashay Rane	25c218be36	[MLIR] Add function to create BFloat16 array attribute This patch adds a new function `mlirDenseElementsAttrBFloat16Get()`, which accepts the shaped type, the number of BFloat16 values, and a pointer to an array of BFloat16 values, each of which is a `uint16_t` value. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D123981	2022-04-19 19:27:06 +00:00
Mehdi Amini	83892d76f4	Print custom assembly on pass failure by default The printer is now resilient to invalid IR and will already automatically fallback to the generic form on invalid IR. Using the generic printer on pass failure was a conservative option before the printer was made failsafe. Reviewed By: lattner, rriddle, jpienaar, bondhugula Differential Revision: https://reviews.llvm.org/D123915	2022-04-19 17:29:08 +00:00
Mehdi Amini	2d6335421f	Apply clang-tidy fixes for llvm-qualified-auto in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-19 17:20:57 +00:00
Mehdi Amini	f9735be7e2	Apply clang-tidy fixes for performance-unnecessary-value-param in ControlFlowInterfaces.cpp (NFC)	2022-04-19 17:20:57 +00:00
Arnab Dutta	12f55cac69	[MLIR][GPU] Add canonicalizer for gpu.memcpy Fold away gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D121279	2022-04-19 17:54:00 +05:30
Marius Brehler	2ba865903d	[mlir][emitc] Add test for invalid type Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D123503	2022-04-19 11:03:56 +02:00
Matthias Springer	a3005a406e	[mlir][interfaces] Fix infinite loop in insideMutuallyExclusiveRegions This function was missing a termination condition.	2022-04-19 16:28:52 +09:00
Mehdi Amini	4e01184ad5	Apply clang-tidy fixes for performance-unnecessary-value-param in JitRunner.cpp (NFC)	2022-04-19 07:23:12 +00:00
Mehdi Amini	722a3a58e2	Apply clang-tidy fixes for performance-for-range-copy in MemRefOps.cpp (NFC)	2022-04-19 07:23:12 +00:00
Matthias Springer	0f4ba02db3	[mlir][interfaces] Add helpers for detecting recursive regions Add helper functions to check if an op may be executed multiple times based on RegionBranchOpInterface. Differential Revision: https://reviews.llvm.org/D123789	2022-04-19 16:13:32 +09:00
Michael Kruse	2d92ee97f1	Reapply "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `af0285122f`. The test "libomp::loop_dispatch.c" on builder openmp-gcc-x86_64-linux-debian fails from time-to-time. See #54969. This patch is unrelated.	2022-04-18 21:56:47 -05:00
Mehdi Amini	1881d6fc80	Apply clang-tidy fixes for performance-unnecessary-copy-initialization in MathOps.cpp (NFC)	2022-04-19 00:47:58 +00:00
Mehdi Amini	6120bd4703	Apply clang-tidy fixes for performance-for-range-copy in ElementwiseOpFusion.cpp (NFC)	2022-04-19 00:47:58 +00:00
Groverkss	15650b320b	[MLIR][Presburger] Remove inheritence in MultiAffineFunction This patch removes inheritence of MultiAffineFunction from IntegerPolyhedron and instead makes IntegerPolyhedron as a member. This patch removes virtualization in MultiAffineFunction and also removes unnecessary functions inherited from IntegerPolyhedron. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123921	2022-04-19 01:25:13 +05:30
Michael Kruse	af0285122f	Revert "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `9ec501da76`. It may have caused the openmp-gcc-x86_64-linux-debian buildbot to fail. https://lab.llvm.org/buildbot/#/builders/4/builds/20377	2022-04-18 14:38:31 -05:00
Michael Kruse	9ec501da76	[OpenMP] Refactor OMPScheduleType enum. The OMPScheduleType enum stores the constants from libomp's internal sched_type in kmp.h and are used by several kmp API functions. The enum values have an internal structure, namely each scheduling algorithm (e.g.) exists in four variants: unordered, orderend, normerge unordered, and nomerge ordered. This patch (basically a followup to D114940) splits the "ordered" and "nomerge" bits into separate flags, as was already done for the "monotonic" and "nonmonotonic", so we can apply bit flags operations on them. It also now contains all possible combinations according to kmp's sched_type. Deriving of the OMPScheduleType enum from clause parameters has been moved form MLIR's OpenMPToLLVMIRTranslation.cpp to OpenMPIRBuilder to make available for clang as well. Since the primary purpose of the flag is the binary interface to libomp, it has been made more private to LLVMFrontend. The primary interface for generating worksharing-loop using OpenMPIRBuilder code becomes `applyWorkshareLoop` which derives the OMPScheduleType automatically and calls the appropriate emitter function. While this is mostly a NFC refactor, it still applies the following functional changes: * The logic from OpenMPToLLVMIRTranslation to derive the OMPScheduleType also applies to clang. Most notably, it now applies the nonmonotonic flag for non-static schedules by default. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was previously not applied if the simd modifier was used. I assume this was a bug, since the effect was due to `loop.schedule_modifier()` returning `mlir::omp::ScheduleModifier::none` instead of `llvm::Optional::None`. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was set even if ordered was specified, in breach to what the comment before citing the OpenMP specification says. I assume this was an oversight. The ordered flag with parameter was not considered in this patch. Changes will need to be made (e.g. adding/modifying function parameters) when support for it is added. The lengthy names of the enum values can be discussed, for the moment this is avoiding reusing previously existing enum value names such as `StaticChunked` to avoid confusion. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D123403	2022-04-18 14:03:17 -05:00
River Riddle	58ceae9561	[mlir:NFC] Remove the forward declaration of FuncOp in the mlir namespace FuncOp has been moved to the `func` namespace for a little over a month, the using directive can be dropped now.	2022-04-18 12:01:55 -07:00
Mehdi Amini	b968c59050	Improve terminator doc in MLIR LangRef Reviewed By: rriddle, bondhugula Differential Revision: https://reviews.llvm.org/D123886	2022-04-18 18:43:30 +00:00
Mehdi Amini	21b251624b	Apply clang-tidy fixes for readability-identifier-naming in GPUDialect.cpp (NFC)	2022-04-18 18:15:30 +00:00
Mehdi Amini	c817628b67	Apply clang-tidy fixes for readability-simplify-boolean-expr in TypeConverter.cpp (NFC)	2022-04-18 18:15:30 +00:00
Jacques Pienaar	f4085c57dd	[mlir] Fix two AttributeParser aborts Reproducers that resulted in triggering the following asserts mlir::NamedAttribute::NamedAttribute(mlir::StringAttr, mlir::Attribute) mlir/lib/IR/Attributes.cpp:29:3 consumeToken mlir/lib/Parser/Parser.h:126 Differential Revision: https://reviews.llvm.org/D122240	2022-04-18 09:30:35 -07:00
Groverkss	4ffd0b6fde	[MLIR][Presburger] Make IntegerRelation::mergeLocalIds not delete duplicates This patch modifies mergeLocalIds to not delete duplicate local ids in `this` relation. This allows the ordering of the final local ids for `this` to be determined more easily, which is generally required when other objects refer to these local ids. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D123866	2022-04-18 10:15:35 +05:30
jacquesguan	5479044bfc	[mlir][Vector] Fold transpose splat to splat with transposed type. This revision folds transpose splat to a new splat with the transposed vector type. For a splat, there is no need to actually do transpose for it, it would be more effective to just build a new splat as the result. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123765	2022-04-18 03:00:17 +00:00
Chris Lattner	cac19f4141	[LogicalResult.h] Move ParseResult to the bottom of file and fix comment, NFC. This was review feedback that I missed in the phab review: https://reviews.llvm.org/D123760	2022-04-17 15:34:26 -07:00
Chris Lattner	81b2dc548b	[Support] Move ParseResult from OpDefinition.h to LogicalResult.h This class is a helper for 'parser-like' use cases of LogicalResult where the implicit conversion to bool is tolerable. It is used by the operation asmparsers, but is more generic functionality that is closely aligned with LogicalResult. Hoist it up to LogicalResult.h to make it more accessible. This is part of Issue #54884 Differential Revision: https://reviews.llvm.org/D123760	2022-04-17 15:18:33 -07:00
Mehdi Amini	d98481a1e7	Revert "[MLIR] Provide a way to print ops in custom form on pass failure" This reverts commit `daabcf5f04`. This patch still had on-going discussion that should be closed before committing.	2022-04-17 18:55:09 +00:00
Uday Bondhugula	daabcf5f04	[MLIR] Provide a way to print ops in custom form on pass failure The generic form of the op is too verbose and in some cases not readable. On pass failure, ops have been so far printed in generic form to provide a (stronger) guarantee that the IR print succeeds. However, in a large number of pass failure cases, the IR is still valid and the custom printers for the ops will succeed. In fact, readability is highly desirable post pass failure. This revision provides an option to print ops in their custom/pretty-printed form on IR failure -- this option is unsafe and there is no guarantee it will succeed. It's disabled by default and can be turned on only if needed. Differential Revision: https://reviews.llvm.org/D123893	2022-04-17 20:10:40 +05:30
Jacques Pienaar	bdabe505f4	[mlir][docs] Add missing directory separator	2022-04-16 21:59:18 -07:00
Valentin Clement	bd514967aa	[mlir][CSE] Add ability to remove commutative operations This patch takes advantage of the Commutative trait on operation to remove identical commutative operations where the operands are swapped. The second operation below can be removed since `arith.addi` is commutative. ``` %1 = arith.addi %a, %b : i32 %2 = arith.addi %b, %a : i32 ``` Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123492	2022-04-16 21:09:47 +02:00
Mehdi Amini	709868707c	Apply clang-tidy fixes for readability-identifier-naming in OpFormatGen.cpp (NFC)	2022-04-16 06:36:12 +00:00
Mehdi Amini	cdbd9c9bab	Apply clang-tidy fixes for llvm-qualified-auto in OpFormatGen.cpp (NFC)	2022-04-16 06:36:11 +00:00
River Riddle	0f304ef017	[mlir] Add asserts when changing various MLIRContext configurations This helps to prevent tsan failures when users inadvertantly mutate the context in a non-safe way. Differential Revision: https://reviews.llvm.org/D112021	2022-04-15 21:49:03 -07:00
Mogball	b1d3faab7a	[NFC] fix cmake build	2022-04-16 00:56:38 +00:00
Mehdi Amini	94244bda9d	Apply clang-tidy fixes for readability-identifier-naming in mlir-parser-fuzzer.cpp (NFC)	2022-04-16 00:44:51 +00:00
Mehdi Amini	fc9e440942	Apply clang-tidy fixes for readability-identifier-naming in DummyParserFuzzer.cpp (NFC)	2022-04-16 00:44:51 +00:00
Mogball	fa26c7ff4b	[mlir] Refactor LICM into a utility LICM is refactored into a utility that is application on any region. The implementation is moved to Transform/Utils.	2022-04-16 00:37:07 +00:00
Stella Stamenova	353f0a8e43	Revert "[mlir] Refactor LICM into a utility" This reverts commit `3131f80824`. This commit broke the Windows mlir bot: https://lab.llvm.org/buildbot/#/builders/13/builds/19745	2022-04-15 17:09:16 -07:00
Mogball	3131f80824	[mlir] Refactor LICM into a utility LICM is refactored into a utility that is application on any region. The implementation is moved to Transform/Utils.	2022-04-15 22:07:01 +00:00
River Riddle	ac860240ad	[mlir][NFC] Cleanup the TestClone pass Fix variable naming convention and cleanup a clang-tidy warning.	2022-04-15 12:57:07 -07:00
River Riddle	31c88660ab	[mlir] Remove the use of FilterTypes for template metaprogramming This technique results in an explosion in compile time, resulting from a huge number of std::tuple/concat instatiations. This technique is replaced by simpler metaprogramming and results in a signficant reduction in compile time. A local debug/asan build saw a 4x speed up in the processing of ArithmeticOps.h.inc, and given the nature of this change every dialect should see similar reductions in compile time. Differential Revision: https://reviews.llvm.org/D123360	2022-04-15 12:57:07 -07:00
Arjun P	ef8b2a7cea	[MLIR][Presburger] addSymbolicCut: fix the integral symbols heuristic to match the docs Previously this checked if the entire symbolic numerator was divisible by the denominator, which is never the case when this function is called. Fixed this to check only the non-const coefficients in the numerator, which was what was intended and documented. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123592	2022-04-15 20:34:06 +01:00
Thomas Raoux	b4bcef05b7	[mlir][vector] Fix bug in extractFromBroadcast folding extract was incorrectly folded when the source was coming from a broadcast that was both adding new rank and broadcasting the inner dimension. Differential Revision: https://reviews.llvm.org/D123867	2022-04-15 19:21:45 +00:00
Arjun P	69c1a35488	[MLIR][Presburger][Simplex] moveRowUnknownToColumn: support the row sample value being zero When the sample value is zero, everything is the same except that failure to pivot does not imply emptiness. So, leave it to the user to mark as empty if necessary, if they know the sample value is strictly negative. This is needed for an upcoming symbolic lexmin heuristic. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123604	2022-04-15 20:15:21 +01:00
William S. Moses	0df963e817	[MLIR][ClonePass] Attempt fix for anonymous pass name	2022-04-15 15:14:20 -04:00
rdzhabarov	3ef4099a61	[mlir] Fix BUILD issues and dependencies. Differential Revision: https://reviews.llvm.org/D123868	2022-04-15 19:05:02 +00:00
William S. Moses	9a8bb4bc63	[NFC] Update comments	2022-04-15 14:33:13 -04:00
Mogball	3430ae1e7b	[mlir] Update LICM to support Graph Regions Changes the algorithm of LICM to support graph regions (no guarantee of topologically sorted order). Also fixes an issue where ops with recursive side effects and regions would not be hoisted if any nested ops used operands that were defined within the nested region. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122465	2022-04-15 17:30:27 +00:00
William S. Moses	ed499ddcda	[MLIR] Fix operation clone Operation clone is currently faulty. Suppose you have a block like as follows: ``` (%x0 : i32) { %x1 = f(%x0) return %x1 } ``` The test case we have is that we want to "unroll" this, in which we want to change this to compute `f(f(x0))` instead of just `f(x0)`. We do so by making a copy of the body at the end of the block and set the uses of the argument in the copy operations with the value returned from the original block. This is implemented as follows: 1) map to the block arguments to the returned value (`map[x0] = x1`). 2) clone the body Now for this small example, this works as intended and we get the following. ``` (%x0 : i32) { %x1 = f(%x0) %x2 = f(%x1) return %x2 } ``` This is because the current logic to clone `x1 = f(x0)` first looks up the arguments in the map (which finds `x0` maps to `x1` from the initialization), and then sets the map of the result to the cloned result (`map[x1] = x2`). However, this fails if `x0` is not an argument to the op, but instead used inside the region, like below. ``` (%x0 : i32) { %x1 = f() { yield %x0 } return %x1 } ``` This is because cloning an op currently first looks up the args (none), sets the map of the result (`map[%x1] = %x2`), and then clones the regions. This results in the following, which is clearly illegal: ``` (%x0 : i32) { %x1 = f() { yield %x0 } %x2 = f() { yield %x2 } return %x2 } ``` Diving deeper, this is partially due to the ordering (how this PR fixes it), as well as how region cloning works. Namely it will first clone with the mapping, and then it will remap all operands. Since the ordering above now has a map of `x0 -> x1` and `x1 -> x2`, we end up with the incorrect behavior here. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122531	2022-04-15 13:09:13 -04:00
jfurtek	bed8212157	[mlir][ods][NFC] Move enum attribute definitions from OpBase.td to EnumAttr.td This diff moves `EnumAttr` tablegen definitions (specifically, `IntEnumAttr` and `BitEnumAttr`-related classes) from `OpBase.td` to `EnumAttr.td`. No functionality is changed. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123551	2022-04-15 16:51:14 +00:00
Alex Zinenko	6c5ae8e974	[mlir] Support opaque types in LLVM IR -> MLIR translation LLVM IR is moving towards adoption of opaque pointer types. These require extra information to be passed when constructing some operations, in particular GEP and Alloca. Adapt the builders of said operations and modify the translation code to handle both opaque and non-opaque pointers. This incidentally adds the translation for Alloca alignment and fixes the translation of struct-related GEP indices that must be constant. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123792	2022-04-15 17:51:31 +02:00
Lei Zhang	4db65e279b	[mlir][vector] Reorder elementwise(transpose) Similar to the existing pattern for reodering cast(transpose), this makes transpose following transpose and increases the chance of embedding the transposition inside contraction op. Actually cast ops are just special instances of elementwise ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123596	2022-04-15 09:05:35 -04:00
Mehdi Amini	4197475eb0	Apply clang-tidy fixes for readability-identifier-naming in TestTypes.cpp (NFC)	2022-04-15 08:01:04 +00:00
Mehdi Amini	7a80912dd8	Apply clang-tidy fixes for modernize-use-default-member-init in ControlFlowSinkUtils.cpp (NFC)	2022-04-15 08:01:04 +00:00
Chia-hung Duan	5232c5c5d4	[mlir] Fix verification order of nested ops. In order to increase parallism, certain ops with regions and have the IsIsolatedFromAbove trait will have their verification delayed. That means the region verifier may access the invalid ops and may lead to a crash. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122771	2022-04-15 04:41:10 +00:00
Thomas Raoux	59058c441a	[mlir][vector] Add operations used for Vector distribution Add vector op warp_execute_on_lane_0 that will be used to do incremental vector distribution in order to target warp level vector programming for architectures with GPU-like SIMT programming model. The idea behing the op is discussed further on discourse: https://discourse.llvm.org/t/vector-vector-distribution-large-vector-to-small-vector/1983/23 Differential Revision: https://reviews.llvm.org/D123703	2022-04-15 03:47:52 +00:00
Jacques Pienaar	35bd41916b	[mlir] Add assert to fail with more info (NFC) This would have assert before during tensor type construction with opaque error, assert and fail earlier now.	2022-04-14 19:54:12 -07:00
jacquesguan	3d79c52f31	[mlir][LLVMIR] Add more vector predication intrinsic ops. This revision adds vector predication select, merge and load/store intrinsic ops. Differential Revision: https://reviews.llvm.org/D123477	2022-04-15 02:13:42 +00:00
Mehdi Amini	ffdba713f5	Fix MLIR website generation	2022-04-15 01:28:30 +00:00
Thomas Raoux	894a591cf6	[mlir][nvgpu] Move mma.sync and ldmatrix in nvgpu dialect Move gpu operation mma.sync and ldmatrix in nvgpu as they are specific to nvidia target. Differential Revision: https://reviews.llvm.org/D123824	2022-04-14 23:44:52 +00:00
Mehdi Amini	72e2b4e7a7	Apply clang-tidy fixes for modernize-use-default-member-init in PDLLServer.cpp (NFC)	2022-04-14 22:59:14 +00:00
Mehdi Amini	db6796dfa8	Apply clang-tidy fixes for modernize-use-default-member-init in SparseTensorUtils.cpp (NFC)	2022-04-14 22:59:14 +00:00
Bixia Zheng	cb6f8d77a2	[mlir][sparse][taco] Use the SparseCompiler from python/tools. Copy the implementation of SparseCompiler from python/tools to taco/tools until we have a common place to install it. Modify TACO to use this SparseCompiler for compilation and jitting. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D123696	2022-04-14 15:17:18 -07:00
Thomas Raoux	4c564940a1	[mlir][nvgpu] Add NVGPU dialect (architectural specific gpu dialect) This introduce a new dialect for vendro specific ptx operations. This also adds the first operation ldmatrix as an example. More operations will be added in follow up patches. This new dialect is meant to be a bridge between GPU and Vector dialectis and NVVM dialect. This is based on the RFC proposed here: https://discourse.llvm.org/t/rfc-add-nv-gpu-dialect-hw-specific-extension-of-gpu-dialect-for-nvidia-gpus/61466/8 Differential Revision: https://reviews.llvm.org/D123266	2022-04-14 16:33:46 +00:00
Alex Zinenko	e5a5e00825	[mlir] fix compiler warnings -Wsign-compare and -Wunsued-value in the recently introduced code.	2022-04-14 15:26:50 +02:00
Lei Zhang	e54236dfb5	[mlir][vector] Cast away leading one dims for insert ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123621	2022-04-14 08:57:32 -04:00
Lei Zhang	bc408afbfe	[mlir][vector] Fold splat constant transpose Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123595	2022-04-14 08:51:25 -04:00
Alex Zinenko	d064c4801c	[mlir] Introduce Transform dialect This dialect provides operations that can be used to control transformation of the IR using a different portion of the IR. It refers to the IR being transformed as payload IR, and to the IR guiding the transformation as transform IR. The main use case for this dialect is orchestrating fine-grain transformations on individual operations or sets thereof. For example, it may involve finding loop-like operations with specific properties (e.g., large size) in the payload IR, applying loop tiling to those and only those operations, and then applying loop unrolling to the inner loops produced by the previous transformations. As such, it is not intended as a replacement for the pass infrastructure, nor for the pattern rewriting infrastructure. In the most common case, the transform IR will be processed and applied to payload IR by a pass. Transformations expressed by the transform dialect may be implemented using the pattern infrastructure or any other relevant MLIR component. This dialect is designed to be extensible, that is, clients of this dialect are allowed to inject additional operations into this dialect using the newly introduced in this patch `TransformDialectExtension` mechanism. This allows the dialect to avoid a dependency on the implementation of the transformation as well as to avoid introducing dialect-specific transform dialects. See https://discourse.llvm.org/t/rfc-interfaces-and-dialects-for-precise-ir-transformation-control/60927. Reviewed By: nicolasvasilache, Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123135	2022-04-14 13:48:45 +02:00
Alex Zinenko	09141f1adf	[mlir] Split intrinsics out of LLVMOps.td Move the operations that correspond to LLVM IR intrinsics in a separate .td file. This makes it easier to maintain the intrinsics and decreases the compile time of LLVMDialect.cpp by ~25%. Depends On D123310 Reviewed By: wsmoses, jacquesguan Differential Revision: https://reviews.llvm.org/D123315	2022-04-14 13:23:31 +02:00
Alex Zinenko	2366a43b3c	[mlir] initial support for opaque pointers in the LLVM dialect LLVM IR has introduced and is moving forward with the concept of opaque pointers, i.e. pointer types that are not carrying around the pointee type. Instead, memory-related operations indicate the type of the data being accessed through the opaque pointer. Introduce the initial support for opaque pointers in the LLVM dialect: - `LLVMPointerType` to support omitting the element type; - alloca/load/store/gep to support opaque pointers in their operands and results; this requires alloca and gep to store the element type as an attribute; - memory-related intrinsics to support opaque pointers in their operands; - translation to LLVM IR for the ops above is no longer using methods deprecated in LLVM API due to the introduction of opaque pointers. Unlike LLVM IR, MLIR can afford to support both opaque and non-opaque pointers at the same time and simplify the transition. Translation to LLVM IR of MLIR that involves opaque pointers requires the LLVMContext to be configured to always use opaque pointers. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123310	2022-04-14 13:23:29 +02:00
Mehdi Amini	35f48edb91	Apply clang-tidy fixes for llvm-qualified-auto in VectorTransforms.cpp (NFC)	2022-04-14 09:42:37 +00:00
Mehdi Amini	2d06420879	Apply clang-tidy fixes for performance-for-range-copy in SCF.cpp (NFC)	2022-04-14 09:42:36 +00:00
Uday Bondhugula	3766ca75f8	[MLIR] Fix missing return statement warning in PatternMatch.h Fix missing return statement warning in PatternMatch.h. NFC. ``` mlir/include/mlir/IR/PatternMatch.h:983:3: warning: no return statement in function returning non-void [-Wreturn-type] ``` Differential Revision: https://reviews.llvm.org/D123756	2022-04-14 13:03:21 +05:30
Arnab Dutta	392d55c1e2	[MLIR][GPU] Add canonicalization patterns for folding simple gpu.wait ops. * Fold away redundant %t = gpu.wait async + gpu.wait [%t] pairs. * Fold away %t = gpu.wait async ... ops when %t has no uses. * Fold away gpu.wait [] ops. * In case of %t1 = gpu.wait async [%t0], replace all uses of %t1 with %t0. Differential Revision: https://reviews.llvm.org/D121878	2022-04-14 12:30:55 +05:30
Stella Laurenzo	61352a580a	[mlir] Introduce ml_program dialect. Differential Revision: https://reviews.llvm.org/D120203	2022-04-13 21:38:14 -07:00

... 2 3 4 5 6 ...

11290 Commits