llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	c9d5f3418a	Cleanup SuperVectorization dialect printing and parsing. On the read side, ``` %3 = vector_transfer_read %arg0, %i2, %i1, %i0 {permutation_map: (d0, d1, d2)->(d2, d0)} : (memref<?x?x?xf32>, index, index, index) -> vector<32x256xf32> ``` becomes: ``` %3 = vector_transfer_read %arg0[%i2, %i1, %i0] {permutation_map: (d0, d1, d2)->(d2, d0)} : memref<?x?x?xf32>, vector<32x256xf32> ``` On the write side, ``` vector_transfer_write %0, %arg0, %c3, %c3 {permutation_map: (d0, d1)->(d0)} : vector<128xf32>, memref<?x?xf32>, index, index ``` becomes ``` vector_transfer_write %0, %arg0[%c3, %c3] {permutation_map: (d0, d1)->(d0)} : vector<128xf32>, memref<?x?xf32> ``` Documentation will be cleaned up in a followup commit that also extracts a proper .md from the top of the file comments. PiperOrigin-RevId: 241021879	2019-03-29 17:56:42 -07:00
Feng Liu	a38792f7d1	remove the const quantifier before temp variable PiperOrigin-RevId: 240997262	2019-03-29 17:56:27 -07:00
Nicolas Vasilache	f93a5be65f	Make createMaterializeVectorsPass take a vectorSize parameter - NFC This CL allows the programmatic control of the target hardware vector size when creating a MaterializeVectorsPass. This is useful for registering passes for the tutorial. PiperOrigin-RevId: 240996136	2019-03-29 17:56:12 -07:00
Feng Liu	5303587448	[TableGen] Support benefit score in pattern definition. A integer number can be specified in the pattern definition and used as the adjustment to the default benefit score in the generated rewrite pattern C++ definition. PiperOrigin-RevId: 240994192	2019-03-29 17:55:55 -07:00
Nicolas Vasilache	094ca64ab0	Refactor vectorization patterns This CL removes the reliance of the vectorize pass on the specification of a `fastestVaryingDim` parameter. This parameter is a restriction meant to more easily target a particular loop/memref combination for vectorization and is mainly used for testing. This also had the side-effect of restricting vectorization patterns to only the ones in which all memrefs were contiguous along the same loop dimension. This simple restriction prevented matmul to vectorize in 2-D. this CL removes the restriction and adds the matmul test which vectorizes in 2-D along the parallel loops. Support for reduction loops is left for future work. PiperOrigin-RevId: 240993827	2019-03-29 17:55:36 -07:00
River Riddle	8a0622c986	[PassManager] Add a utility class, PrettyStackTraceParallelDiagnosticEntry, to emit any queued up diagnostics in the event of a crash when multi-threading. PiperOrigin-RevId: 240986566	2019-03-29 17:54:51 -07:00
MLIR Team	9d30b36aaf	Enable input-reuse fusion to search function arguments for fusion candidates (takes care of a TODO, enables another tutorial test case). PiperOrigin-RevId: 240979894	2019-03-29 17:54:36 -07:00
River Riddle	106dd08e99	Change the vectorizer test pass to output via diagnostics instead of llvm::outs. This allows for the output to be deterministic when multi-threading is enabled. PiperOrigin-RevId: 240905858	2019-03-29 17:54:21 -07:00
Jacques Pienaar	e7111fd62c	Address some errors from g++ These fail with: could not convert ‘module’ from ‘llvm::orc::ThreadSafeModule’ to ‘llvm::Expected<llvm::orc::ThreadSafeModule>’ PiperOrigin-RevId: 240892583	2019-03-29 17:53:36 -07:00
River Riddle	76181a7b38	Remove the LowerEDSCTestPass. Most of the tests have been ported to be unit-tests and this pass is problematic in the way it depends on TableGen-generated files. This pass is also non-deterministic during multi-threading and a blocker to turning it on by default. PiperOrigin-RevId: 240889154	2019-03-29 17:53:05 -07:00
River Riddle	909a63d8bf	Tidy up a few comments and error messages related to parsing multi-result operations. PiperOrigin-RevId: 240876306	2019-03-29 17:52:51 -07:00
Jacques Pienaar	cd0b925dc2	Remove extra qualification PiperOrigin-RevId: 240875432	2019-03-29 17:52:36 -07:00
Alex Zinenko	85bbde483d	LLVM IR Dialect: separate the conversion tool from the conversion pass Originally, the conversion to the LLVM IR dialect had been implemented as pass. The common conversion infrastructure was factored into DialectConversion from which the conversion pass inherited. The conversion being a pass is undesirable for callers that only need the conversion done, for example as a part of sequence of conversions or outside the pass manager infrastructure. Split the LLVM IR Dialect conversion into the conversion proper and the conversion pass, where the latter contains the former instead of inheriting. NFC. PiperOrigin-RevId: 240874740	2019-03-29 17:52:20 -07:00
Alex Zinenko	3173a63f3f	Dialect Conversion: convert regions of operations when cloning them Dialect conversion currently clones the operations that did not match any pattern. This includes cloning any regions that belong to these operations. Instead, apply conversion recursively to the nested regions. Note that if an operation matched one of the conversion patterns, it is up to the pattern rewriter to fill in the regions of the converted operation. This may require calling back to the converter and is left for future work. PiperOrigin-RevId: 240872410	2019-03-29 17:52:04 -07:00
River Riddle	01140bd137	Change the muli-return syntax for operations. The name of the operation result now contains the number of results that it refers to if the number of results is greater than 1. Example: %call:2 = call @multi_return() : () -> (f32, i32) use(%calltensorflow/mlir#0, %calltensorflow/mlir#1) This cl also adds parser support for uniquely named result values. This means that a test writer can now write something like: %foo, %bar = call @multi_return() : () -> (f32, i32) use(%foo, %bar) Note: The printer will still print the collapsed form. PiperOrigin-RevId: 240860058	2019-03-29 17:51:32 -07:00
MLIR Team	9d9675fc8f	Remove overly conservative check in LoopFusion pass (enables fusion in tutorial example). PiperOrigin-RevId: 240859227	2019-03-29 17:51:16 -07:00
River Riddle	07c1a96abf	[PassManager] Define a ParallelDiagnosticHandler to ensure that diagnostics are still produced in a deterministic order when multi-threading. PiperOrigin-RevId: 240817922	2019-03-29 17:50:59 -07:00
River Riddle	213b8d4d3b	Rename InstOperand to OpOperand. PiperOrigin-RevId: 240814651	2019-03-29 17:50:41 -07:00
Dimitrios Vytiniotis	79bd6badb2	Remove global LLVM CLI variables from library code Plus move parsing code into the MLIR CPU runner binary. PiperOrigin-RevId: 240786709	2019-03-29 17:50:23 -07:00
River Riddle	af9760fe18	Replace remaining usages of the Instruction class with Operation. PiperOrigin-RevId: 240777521	2019-03-29 17:50:04 -07:00
Nicolas Vasilache	4dc7af9da8	Make vectorization aware of loop semantics Now that we have a dependence analysis, we can check that loops are indeed parallel and make vectorization correct. PiperOrigin-RevId: 240682727	2019-03-29 17:49:30 -07:00
Nicolas Vasilache	c3742d20b5	Give the Vectorize pass a virtualVectorSize argument. This CL allows vectorization to be called and configured in other ways than just via command line arguments. This allows triggering vectorization programmatically. PiperOrigin-RevId: 240638208	2019-03-29 17:48:12 -07:00
River Riddle	3a845be7d1	Add support for multi-threaded pass timing. When multi-threading is enabled in the pass manager the meaning of the display slightly changes. First, a new timing column is added, `User Time`, that displays the total time spent across all threads. Secondly, the `Wall Time` column displays the longest individual time spent amongst all of the threads. This means that the `Wall Time` column will continue to give an indicator on the perceived time, or clock time, whereas the `User Time` will display the total cpu time. Example: $ mlir-opt foo.mlir -experimental-mt-pm -cse -canonicalize -convert-to-llvmir -pass-timing ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0078 seconds ---User Time--- ---Wall Time--- --- Name --- 0.0175 ( 88.3%) 0.0055 ( 70.4%) Function Pipeline 0.0018 ( 9.3%) 0.0006 ( 8.1%) CSE 0.0013 ( 6.3%) 0.0004 ( 5.8%) (A) DominanceInfo 0.0017 ( 8.7%) 0.0006 ( 7.1%) FunctionVerifier 0.0128 ( 64.6%) 0.0039 ( 50.5%) Canonicalizer 0.0011 ( 5.7%) 0.0004 ( 4.7%) FunctionVerifier 0.0004 ( 2.1%) 0.0004 ( 5.2%) ModuleVerifier 0.0010 ( 5.3%) 0.0010 ( 13.4%) LLVMLowering 0.0009 ( 4.3%) 0.0009 ( 11.0%) ModuleVerifier 0.0198 (100.0%) 0.0078 (100.0%) Total PiperOrigin-RevId: 240636269	2019-03-29 17:47:41 -07:00
River Riddle	99b87c9707	Replace usages of Instruction with Operation in the Transforms/ directory. PiperOrigin-RevId: 240636130	2019-03-29 17:47:26 -07:00
Mehdi Amini	3518122e86	Simplify API uses of `getContext()` (NFC) The Pass base class is providing a convenience getContext() accessor. PiperOrigin-RevId: 240634961	2019-03-29 17:47:11 -07:00
Jacques Pienaar	b0244b66a5	Fix include path in test pass. PiperOrigin-RevId: 240628260	2019-03-29 17:46:41 -07:00
Jacques Pienaar	b15ac2d999	Initialize std::atomic directly. Avoids error in OSS build: error: copying variable of type 'std::atomic<unsigned int>' invokes deleted constructor PiperOrigin-RevId: 240618765	2019-03-29 17:46:26 -07:00
Jacques Pienaar	ed4fa52b4a	Add missing numeric header for std::accumulate. PiperOrigin-RevId: 240593135	2019-03-29 17:45:42 -07:00
Alex Zinenko	e2f9079a71	LLVM IR Conversion: support zero-dimensional memrefs The spec allows zero-dimensional memrefs to exist and treats them essentially as single-element buffers. Unlike single-dimensional memrefs of static shape <1xTy>, zero-dimensional memrefs do not require indices to access the only element they store. Add support of zero-dimensional memrefs to the LLVM IR conversion. In particular, such memrefs are converted into bare pointers, and accesses to them are converted to bare loads and stores, without the overhead of `getelementptr %buffer, 0`. PiperOrigin-RevId: 240579456	2019-03-29 17:45:26 -07:00
Alex Zinenko	5c285f228c	LLVM IR Conversion: keep LLVM dialect types as is during conversion When converting to the LLVM IR Dialect, it is possible for the input IR to contain LLVM IR Dialect operation and/or types, for example, some functions may have been coverted to the LLVM IR Dialect already, or may have been created using this dialect directly. Make sure that type conversion keeps LLVM IR Dialect types unmodified and does not error out. Operations are already kept as is. PiperOrigin-RevId: 240574972	2019-03-29 17:45:11 -07:00
River Riddle	9c08540690	Replace usages of Instruction with Operation in the /Analysis directory. PiperOrigin-RevId: 240569775	2019-03-29 17:44:56 -07:00
Alex Zinenko	5a5bba0279	Introduce affine terminator Due to legacy reasons (ML/CFG function separation), regions in affine control flow operations require contained blocks not to have terminators. This is inconsistent with the notion of the block and may complicate code motion between regions of affine control operations and other regions. Introduce `affine.terminator`, a special terminator operation that must be used to terminate blocks inside affine operations and transfers the control back to he region enclosing the affine operation. For brevity and readability reasons, allow `affine.for` and `affine.if` to omit the `affine.terminator` in their regions when using custom printing and parsing format. The custom parser injects the `affine.terminator` if it is missing so as to always have it present in constructed operations. Update transformations to account for the presence of terminator. In particular, most code motion transformation between loops should leave the terminator in place, and code motion between loops and non-affine blocks should drop the terminator. PiperOrigin-RevId: 240536998	2019-03-29 17:44:24 -07:00
River Riddle	af45236c70	Add experimental support for multi-threading the pass manager. This adds support for running function pipelines on functions across multiple threads, and is guarded by an off-by-default flag 'experimental-mt-pm'. There are still quite a few things that need to be done before multi-threading is ready for general use(e.g. pass-timing), but this allows for those things to be tested in a multi-threaded environment. PiperOrigin-RevId: 240489002	2019-03-29 17:44:08 -07:00
Jacques Pienaar	c6b294ac7b	Include numeric header for std::accumulate. PiperOrigin-RevId: 240462910	2019-03-29 17:43:52 -07:00
River Riddle	f9d91531df	Replace usages of Instruction with Operation in the /IR directory. This is step 2/N to renaming Instruction to Operation. PiperOrigin-RevId: 240459216	2019-03-29 17:43:37 -07:00
Feng Liu	c489f50e6f	Add a trait to set the result type by attribute Before this CL, the result type of the pattern match results need to be as same as the first operand type, operand broadcast type or a generic tensor type. This CL adds a new trait to set the result type by attribute. For example, the TFL_ConstOp can use this to set the output type to its value attribute. PiperOrigin-RevId: 240441249	2019-03-29 17:43:06 -07:00
River Riddle	9ffdc930c0	Rename the Instruction class to Operation. This just renames the class, usages of Instruction will still refer to a typedef in the interim. This is step 1/N to renaming Instruction to Operation. PiperOrigin-RevId: 240431520	2019-03-29 17:42:50 -07:00
River Riddle	97db10d413	Add a utility Instruction::getDialect method to return the dialect an operation is associated with, or nullptr if the associated dialect has not been registered. PiperOrigin-RevId: 240402300	2019-03-29 17:42:19 -07:00
River Riddle	bee7b53031	Update the canonicalization patterns for AffineApply and AffineForOp to use matchAndRewrite. PiperOrigin-RevId: 240398220	2019-03-29 17:42:03 -07:00
Alex Zinenko	a7215a9032	Allow creating standalone Regions Currently, regions can only be constructed by passing in a `Function` or an `Instruction` pointer referencing the parent object, unlike `Function`s or `Instruction`s themselves that can be created without a parent. It leads to a rather complex flow in operation construction where one has to create the operation first before being able to work with its regions. It may be necessary to work with the regions before the operation is created. In particular, in `build` and `parse` functions that are executed _before_ the operation is created in cases where boilerplate region manipulation is required (for example, inserting the hypothetical default terminator in affine regions). Allow creating standalone regions. Such regions are meant to own a list of blocks and transfer them to other regions on demand. Each instruction stores a fixed number of regions as trailing objects and has ownership of them. This decreases the size of the Instruction object for the common case of instructions without regions. Keep this behavior intact. To allow some flexibility in construction, make OperationState store an owning vector of regions. When the Builder creates an Instruction from OperationState, the bodies of the regions are transferred into the instruction-owned regions to minimize copying. Thus, it becomes possible to fill standalone regions with blocks and move them to an operation when it is constructed, or move blocks from a region to an operation region, e.g., for inlining. PiperOrigin-RevId: 240368183	2019-03-29 17:40:59 -07:00
Chris Lattner	46ade282c8	Make FunctionPass::getFunction() return a reference to the function, instead of a pointer. This makes it consistent with all the other methods in FunctionPass, as well as with ModulePass::getModule(). NFC. PiperOrigin-RevId: 240257910	2019-03-29 17:40:44 -07:00
River Riddle	5f3b914a6e	Replace remaining usages of "Op::operator->" with "." and remove it. PiperOrigin-RevId: 240210336	2019-03-29 17:40:25 -07:00
River Riddle	96ebde9cfd	Replace usages of "Op::operator->" with ".". This is step 2/N of removing the temporary operator-> method as part of the de-const transition. PiperOrigin-RevId: 240200792	2019-03-29 17:40:09 -07:00
River Riddle	5de726f493	Refactor the Pattern framework to allow for combined match/rewrite patterns. This is done by adding a new 'matchAndRewrite' function to RewritePattern that performs the match and rewrite in one step. The default behavior simply calls into the existing 'match' and 'rewrite' functions. The 'PatternMatcher' class has now been specialized for RewritePatterns and has been rewritten to make use of the new matchAndRewrite functionality. This combined match/rewrite functionality allows simplifying the majority of existing RewritePatterns, as they do not benefit from separate match and rewrite functions. Some of the existing canonicalization patterns in StandardOps have been modified to take advantage of this functionality. PiperOrigin-RevId: 240187856	2019-03-29 17:39:35 -07:00
River Riddle	af1abcc80b	Replace usages of "operator->" with "." for the AffineOps. Note: The "operator->" method is a temporary helper for the de-const transition and is gradually being phased out. PiperOrigin-RevId: 240179439	2019-03-29 17:39:19 -07:00
River Riddle	832567b379	NFC: Rename the 'for' operation in the AffineOps dialect to 'affine.for' and set the namespace of the AffineOps dialect to 'affine'. PiperOrigin-RevId: 240165792	2019-03-29 17:39:03 -07:00
Lei Zhang	8f5fa56623	[TableGen] Consolidate constraint related concepts Previously we have multiple mechanisms to specify op definition and match constraints: TypeConstraint, AttributeConstraint, Type, Attr, mAttr, mAttrAnyOf, mPat. These variants are not added because there are so many distinct cases we need to model; essentially, they are all carrying a predicate. It's just an artifact of implementation. It's quite confusing for users to grasp these variants and choose among them. Instead, as the OpBase TableGen file, we need to strike to provide an unified mechanism. Each dialect has the flexibility to define its own aliases if wanted. This CL removes mAttr, mAttrAnyOf, mPat. A new base class, Constraint, is added. Now TypeConstraint and AttrConstraint derive from Constraint. Type and Attr further derive from TypeConstraint and AttrConstraint, respectively. Comments are revised and examples are added to make it clear how to use constraints. PiperOrigin-RevId: 240125076	2019-03-29 17:38:46 -07:00
Mehdi Amini	bb621a5596	Using getContext() instead of getInstruction()->getContext() on Operation (NFC) PiperOrigin-RevId: 240088209	2019-03-29 17:38:29 -07:00
River Riddle	63e8725bc2	Update some of the derived type classes to use getImpl instead of a static_cast. PiperOrigin-RevId: 240084937	2019-03-29 17:38:14 -07:00
Chris Lattner	e510de0305	Various small cleanups to the code, mostly removing const_cast's. PiperOrigin-RevId: 240083489	2019-03-29 17:37:58 -07:00

1 2 3 4 5 ...

1018 Commits