llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	c9d5f3418a	Cleanup SuperVectorization dialect printing and parsing. On the read side, ``` %3 = vector_transfer_read %arg0, %i2, %i1, %i0 {permutation_map: (d0, d1, d2)->(d2, d0)} : (memref<?x?x?xf32>, index, index, index) -> vector<32x256xf32> ``` becomes: ``` %3 = vector_transfer_read %arg0[%i2, %i1, %i0] {permutation_map: (d0, d1, d2)->(d2, d0)} : memref<?x?x?xf32>, vector<32x256xf32> ``` On the write side, ``` vector_transfer_write %0, %arg0, %c3, %c3 {permutation_map: (d0, d1)->(d0)} : vector<128xf32>, memref<?x?xf32>, index, index ``` becomes ``` vector_transfer_write %0, %arg0[%c3, %c3] {permutation_map: (d0, d1)->(d0)} : vector<128xf32>, memref<?x?xf32> ``` Documentation will be cleaned up in a followup commit that also extracts a proper .md from the top of the file comments. PiperOrigin-RevId: 241021879	2019-03-29 17:56:42 -07:00
Feng Liu	a38792f7d1	remove the const quantifier before temp variable PiperOrigin-RevId: 240997262	2019-03-29 17:56:27 -07:00
Nicolas Vasilache	f93a5be65f	Make createMaterializeVectorsPass take a vectorSize parameter - NFC This CL allows the programmatic control of the target hardware vector size when creating a MaterializeVectorsPass. This is useful for registering passes for the tutorial. PiperOrigin-RevId: 240996136	2019-03-29 17:56:12 -07:00
Feng Liu	5303587448	[TableGen] Support benefit score in pattern definition. An integer number can be specified in the pattern definition and used as the adjustment to the default benefit score in the generated rewrite pattern C++ definition. PiperOrigin-RevId: 240994192	2019-03-29 17:55:55 -07:00
Nicolas Vasilache	094ca64ab0	Refactor vectorization patterns This CL removes the reliance of the vectorize pass on the specification of a `fastestVaryingDim` parameter. This parameter is a restriction meant to more easily target a particular loop/memref combination for vectorization and is mainly used for testing. This also had the side-effect of restricting vectorization patterns to only the ones in which all memrefs were contiguous along the same loop dimension. This simple restriction prevented matmul from vectorizing in 2-D. This CL removes the restriction and adds the matmul test which vectorizes in 2-D along the parallel loops. Support for reduction loops is left for future work. PiperOrigin-RevId: 240993827	2019-03-29 17:55:36 -07:00
River Riddle	3ddd0411d0	Slight rewording of TupleType rationale. PiperOrigin-RevId: 240991400	2019-03-29 17:55:21 -07:00
River Riddle	d16213bf66	Update the QuickstartRewrites document to include information about the new 'matchAndRewrite' functionality in RewritePatterns. PiperOrigin-RevId: 240987764	2019-03-29 17:55:05 -07:00
River Riddle	8a0622c986	[PassManager] Add a utility class, PrettyStackTraceParallelDiagnosticEntry, to emit any queued up diagnostics in the event of a crash when multi-threading. PiperOrigin-RevId: 240986566	2019-03-29 17:54:51 -07:00
MLIR Team	9d30b36aaf	Enable input-reuse fusion to search function arguments for fusion candidates (takes care of a TODO, enables another tutorial test case). PiperOrigin-RevId: 240979894	2019-03-29 17:54:36 -07:00
River Riddle	106dd08e99	Change the vectorizer test pass to output via diagnostics instead of llvm::outs. This allows for the output to be deterministic when multi-threading is enabled. PiperOrigin-RevId: 240905858	2019-03-29 17:54:21 -07:00
MLIR Team	dd0029e4f6	Support for type constraints across operand and results -- PiperOrigin-RevId: 240905555	2019-03-29 17:54:06 -07:00
Tatiana Shpeisman	65a5f73ab3	Fixed a few instances of inconsistent grammar. PiperOrigin-RevId: 240896336	2019-03-29 17:53:50 -07:00
Jacques Pienaar	e7111fd62c	Address some errors from g++ These fail with: could not convert ‘module’ from ‘llvm::orc::ThreadSafeModule’ to ‘llvm::Expected<llvm::orc::ThreadSafeModule>’ PiperOrigin-RevId: 240892583	2019-03-29 17:53:36 -07:00
Jacques Pienaar	b633fcf9c0	Add README file for MLIR. PiperOrigin-RevId: 240889350	2019-03-29 17:53:21 -07:00
River Riddle	76181a7b38	Remove the LowerEDSCTestPass. Most of the tests have been ported to be unit-tests and this pass is problematic in the way it depends on TableGen-generated files. This pass is also non-deterministic during multi-threading and a blocker to turning it on by default. PiperOrigin-RevId: 240889154	2019-03-29 17:53:05 -07:00
River Riddle	909a63d8bf	Tidy up a few comments and error messages related to parsing multi-result operations. PiperOrigin-RevId: 240876306	2019-03-29 17:52:51 -07:00
Jacques Pienaar	cd0b925dc2	Remove extra qualification PiperOrigin-RevId: 240875432	2019-03-29 17:52:36 -07:00
Alex Zinenko	85bbde483d	LLVM IR Dialect: separate the conversion tool from the conversion pass Originally, the conversion to the LLVM IR dialect had been implemented as a pass. The common conversion infrastructure was factored into DialectConversion from which the conversion pass inherited. The conversion being a pass is undesirable for callers that only need the conversion done, for example as part of a sequence of conversions or outside the pass manager infrastructure. Split the LLVM IR Dialect conversion into the conversion proper and the conversion pass, where the latter contains the former instead of inheriting. NFC. PiperOrigin-RevId: 240874740	2019-03-29 17:52:20 -07:00
Alex Zinenko	3173a63f3f	Dialect Conversion: convert regions of operations when cloning them Dialect conversion currently clones the operations that did not match any pattern. This includes cloning any regions that belong to these operations. Instead, apply conversion recursively to the nested regions. Note that if an operation matched one of the conversion patterns, it is up to the pattern rewriter to fill in the regions of the converted operation. This may require calling back to the converter and is left for future work. PiperOrigin-RevId: 240872410	2019-03-29 17:52:04 -07:00
Nicolas Vasilache	abe881d565	NFC - Handle IndexedValue corner case Implicit conversions don't play nicely in expressions such as: `C() = A(i) * B(i)`. Make `C()` return an IndexedValue instead of casting to ValueHandle. This prevents double capture errors and is useful for the tutorial. PiperOrigin-RevId: 240863223	2019-03-29 17:51:48 -07:00
River Riddle	01140bd137	Change the multi-return syntax for operations. The name of the operation result now contains the number of results that it refers to if the number of results is greater than 1. Example: %call:2 = call @multi_return() : () -> (f32, i32) use(%call#0, %call#1) This CL also adds parser support for uniquely named result values. This means that a test writer can now write something like: %foo, %bar = call @multi_return() : () -> (f32, i32) use(%foo, %bar) Note: The printer will still print the collapsed form. PiperOrigin-RevId: 240860058	2019-03-29 17:51:32 -07:00
MLIR Team	9d9675fc8f	Remove overly conservative check in LoopFusion pass (enables fusion in tutorial example). PiperOrigin-RevId: 240859227	2019-03-29 17:51:16 -07:00
River Riddle	07c1a96abf	[PassManager] Define a ParallelDiagnosticHandler to ensure that diagnostics are still produced in a deterministic order when multi-threading. PiperOrigin-RevId: 240817922	2019-03-29 17:50:59 -07:00
River Riddle	213b8d4d3b	Rename InstOperand to OpOperand. PiperOrigin-RevId: 240814651	2019-03-29 17:50:41 -07:00
Dimitrios Vytiniotis	79bd6badb2	Remove global LLVM CLI variables from library code Plus move parsing code into the MLIR CPU runner binary. PiperOrigin-RevId: 240786709	2019-03-29 17:50:23 -07:00
River Riddle	af9760fe18	Replace remaining usages of the Instruction class with Operation. PiperOrigin-RevId: 240777521	2019-03-29 17:50:04 -07:00
Nicolas Vasilache	31442a66ef	Cleanup vectorize_1d.mlir test - NFC This CL splits a large monolithic test function into smaller ones that are each CHECK-LABEL'd PiperOrigin-RevId: 240684979	2019-03-29 17:49:45 -07:00
Nicolas Vasilache	4dc7af9da8	Make vectorization aware of loop semantics Now that we have a dependence analysis, we can check that loops are indeed parallel and make vectorization correct. PiperOrigin-RevId: 240682727	2019-03-29 17:49:30 -07:00
River Riddle	21547ace87	Update the multi-threaded pass timing to not assume that total time will be different from user time. PiperOrigin-RevId: 240681618	2019-03-29 17:49:14 -07:00
Jacques Pienaar	8f1e744169	Move test of trait using dialect ops, to dialects of ops. PiperOrigin-RevId: 240680010	2019-03-29 17:48:59 -07:00
MLIR Team	b8874c679f	Small edit for clarity. ("Zero dimensions" reads to me as "rank of zero.") PiperOrigin-RevId: 240664300	2019-03-29 17:48:44 -07:00
Jacques Pienaar	d7e386cea9	Move TF dialect test to dialect. PiperOrigin-RevId: 240646586	2019-03-29 17:48:28 -07:00
Nicolas Vasilache	c3742d20b5	Give the Vectorize pass a virtualVectorSize argument. This CL allows vectorization to be called and configured in other ways than just via command line arguments. This allows triggering vectorization programmatically. PiperOrigin-RevId: 240638208	2019-03-29 17:48:12 -07:00
Lei Zhang	d5524388ab	[TableGen] Change names for Builder* and OperationState* parameters to avoid collision The `Builder*` parameter is unused in both generated build() methods so that we can leave it unnamed. Changed stand-alone parameter build() to take `_tblgen_state` instead of `result` to allow `result` to avoid having name collisions with op operand, attribute, or result. PiperOrigin-RevId: 240637700	2019-03-29 17:47:57 -07:00
River Riddle	3a845be7d1	Add support for multi-threaded pass timing. When multi-threading is enabled in the pass manager the meaning of the display slightly changes. First, a new timing column is added, `User Time`, that displays the total time spent across all threads. Secondly, the `Wall Time` column displays the longest individual time spent amongst all of the threads. This means that the `Wall Time` column will continue to give an indicator on the perceived time, or clock time, whereas the `User Time` will display the total cpu time. Example: $ mlir-opt foo.mlir -experimental-mt-pm -cse -canonicalize -convert-to-llvmir -pass-timing ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0078 seconds ---User Time--- ---Wall Time--- --- Name --- 0.0175 ( 88.3%) 0.0055 ( 70.4%) Function Pipeline 0.0018 ( 9.3%) 0.0006 ( 8.1%) CSE 0.0013 ( 6.3%) 0.0004 ( 5.8%) (A) DominanceInfo 0.0017 ( 8.7%) 0.0006 ( 7.1%) FunctionVerifier 0.0128 ( 64.6%) 0.0039 ( 50.5%) Canonicalizer 0.0011 ( 5.7%) 0.0004 ( 4.7%) FunctionVerifier 0.0004 ( 2.1%) 0.0004 ( 5.2%) ModuleVerifier 0.0010 ( 5.3%) 0.0010 ( 13.4%) LLVMLowering 0.0009 ( 4.3%) 0.0009 ( 11.0%) ModuleVerifier 0.0198 (100.0%) 0.0078 (100.0%) Total PiperOrigin-RevId: 240636269	2019-03-29 17:47:41 -07:00
River Riddle	99b87c9707	Replace usages of Instruction with Operation in the Transforms/ directory. PiperOrigin-RevId: 240636130	2019-03-29 17:47:26 -07:00
Mehdi Amini	3518122e86	Simplify API uses of `getContext()` (NFC) The Pass base class is providing a convenience getContext() accessor. PiperOrigin-RevId: 240634961	2019-03-29 17:47:11 -07:00
Mehdi Amini	7641900d2f	Allow to mutate the type of MLIR Value in-place This avoids trashing memory by cloning and replaceAllUsesWith when performing type inference. PiperOrigin-RevId: 240631137	2019-03-29 17:46:56 -07:00
Jacques Pienaar	b0244b66a5	Fix include path in test pass. PiperOrigin-RevId: 240628260	2019-03-29 17:46:41 -07:00
Jacques Pienaar	b15ac2d999	Initialize std::atomic directly. Avoids error in OSS build: error: copying variable of type 'std::atomic<unsigned int>' invokes deleted constructor PiperOrigin-RevId: 240618765	2019-03-29 17:46:26 -07:00
Mehdi Amini	a5f253a335	Add a method to swap the type of a function in-place This is motivated by the need to translate functions across dialects, which requires morphing their type, as well as the Toy tutorial part on interprocedural shape inference. The alternative is cloning the function, but it is heavy and it seems like an arbitrary restriction to forbid morphing the function type. PiperOrigin-RevId: 240615755	2019-03-29 17:46:11 -07:00
Jacques Pienaar	810e95b861	Use dereference instead of implicit conversion for IndexedValue to Value*. Avoids ambiguous constructor error on some compilers. PiperOrigin-RevId: 240606838	2019-03-29 17:45:56 -07:00
Jacques Pienaar	ed4fa52b4a	Add missing numeric header for std::accumulate. PiperOrigin-RevId: 240593135	2019-03-29 17:45:42 -07:00
Alex Zinenko	e2f9079a71	LLVM IR Conversion: support zero-dimensional memrefs The spec allows zero-dimensional memrefs to exist and treats them essentially as single-element buffers. Unlike single-dimensional memrefs of static shape <1xTy>, zero-dimensional memrefs do not require indices to access the only element they store. Add support of zero-dimensional memrefs to the LLVM IR conversion. In particular, such memrefs are converted into bare pointers, and accesses to them are converted to bare loads and stores, without the overhead of `getelementptr %buffer, 0`. PiperOrigin-RevId: 240579456	2019-03-29 17:45:26 -07:00
Alex Zinenko	5c285f228c	LLVM IR Conversion: keep LLVM dialect types as is during conversion When converting to the LLVM IR Dialect, it is possible for the input IR to contain LLVM IR Dialect operations and/or types, for example, some functions may have been converted to the LLVM IR Dialect already, or may have been created using this dialect directly. Make sure that type conversion keeps LLVM IR Dialect types unmodified and does not error out. Operations are already kept as is. PiperOrigin-RevId: 240574972	2019-03-29 17:45:11 -07:00
River Riddle	9c08540690	Replace usages of Instruction with Operation in the /Analysis directory. PiperOrigin-RevId: 240569775	2019-03-29 17:44:56 -07:00
Nicolas Vasilache	04b925f1b8	Port api-test::tile_2d to the edsc::Builder API The AST-based EDSCs implementation will be retired soon, this test was missing from the builders API. PiperOrigin-RevId: 240547453	2019-03-29 17:44:40 -07:00
Alex Zinenko	5a5bba0279	Introduce affine terminator Due to legacy reasons (ML/CFG function separation), regions in affine control flow operations require contained blocks not to have terminators. This is inconsistent with the notion of the block and may complicate code motion between regions of affine control operations and other regions. Introduce `affine.terminator`, a special terminator operation that must be used to terminate blocks inside affine operations and transfers the control back to the region enclosing the affine operation. For brevity and readability reasons, allow `affine.for` and `affine.if` to omit the `affine.terminator` in their regions when using custom printing and parsing format. The custom parser injects the `affine.terminator` if it is missing so as to always have it present in constructed operations. Update transformations to account for the presence of terminator. In particular, most code motion transformation between loops should leave the terminator in place, and code motion between loops and non-affine blocks should drop the terminator. PiperOrigin-RevId: 240536998	2019-03-29 17:44:24 -07:00
River Riddle	af45236c70	Add experimental support for multi-threading the pass manager. This adds support for running function pipelines on functions across multiple threads, and is guarded by an off-by-default flag 'experimental-mt-pm'. There are still quite a few things that need to be done before multi-threading is ready for general use (e.g. pass-timing), but this allows for those things to be tested in a multi-threaded environment. PiperOrigin-RevId: 240489002	2019-03-29 17:44:08 -07:00
Jacques Pienaar	c6b294ac7b	Include numeric header for std::accumulate. PiperOrigin-RevId: 240462910	2019-03-29 17:43:52 -07:00


1268 Commits