llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	d4921f4a96	Address Performance issue in NestedMatcher A performance issue was reported due to the usage of NestedMatcher in ComposeAffineMaps. The main culprit was the ubiquitous copies that were occurring when appending even a single element in `matchOne`. This CL generally simplifies the implementation and removes one level of indirection by getting rid of auxiliary storage as well as simplifying the API. The users of the API are updated accordingly. The implementation was tested on a heavily unrolled example with ComposeAffineMaps and is now close in performance with an implementation based on stateless InstWalker. As a reminder, the whole ComposeAffineMaps pass is slated to disappear but the bug report was very useful as a stress test for NestedMatchers. Lastly, the following cleanups reported by @aminim were addressed: 1. make NestedPatternContext scoped within runFunction rather than at the Pass level. This was caused by a previous misunderstanding of Pass lifetime; 2. use defensive assertions in the constructor of NestedPatternContext to make it clear a unique such locally scoped context is allowed to exist. PiperOrigin-RevId: 231781279	2019-03-29 16:04:07 -07:00
Nicolas Vasilache	35200435e7	Address cleanups from previous CL This CL addresses some cleanups that were leftover after an incorrect rebase: 1. use StringSwitch 2. use // NOLINTNEXTLINE 3. remove a dead line of code PiperOrigin-RevId: 231726640	2019-03-29 16:03:53 -07:00
MLIR Team	1e85191d07	Fix ASAN issue: snapshot edge list before loop which can modify this list. PiperOrigin-RevId: 231686040	2019-03-29 16:03:38 -07:00
MLIR Team	d7c824451f	LoopFusion: insert the source loop nest slice at a depth in the destination loop nest which preserves dependences (above any loop carried or other dependences). This is accomplished by updating the maximum destination loop depth based on dependence checks between source loop nest loads and stores which access the memref on which the source loop nest has a store op. In addition, prevent fusing in source loop nests which write to memrefs which escape or are live out. PiperOrigin-RevId: 231684492	2019-03-29 16:03:23 -07:00
River Riddle	a642bb1779	Update tests using affine maps to not rely on specific map numbers in the output IR. This is necessary to remove the dependency on ForInst not numbering the AffineMap bounds it has custom formatting for. PiperOrigin-RevId: 231634812	2019-03-29 16:03:08 -07:00
Uday Bondhugula	44064d5b3b	3000x speed improvement on compose-affine-maps by dropping NestedMatcher for a trivial inst walker :-) (reduces pass time from several minutes non-terminating to 120ms) - (fixes b/123541184) - use a simple 7-line inst walker to collect affine_apply op's instead of the nested matcher; -compose-affine-maps pass runs in 120ms now instead of 5 minutes + (non-terminating / out of memory) - on a realistic test case that is 20,000 lines 12-d loop nest - this CL is also pushing for simple existing/standard patterns unless there is a real efficiency issue (OTOH, fixing nested matcher to address this issue requires cl/231400521) - the improvement is from swapping out the nested walker as opposed to from a bug or anything else that this CL changes - update stale comment PiperOrigin-RevId: 231623619	2019-03-29 16:02:53 -07:00
River Riddle	b6928c945c	Standardize the spelling of debug info to "debuginfo" in opt flags. PiperOrigin-RevId: 231610337	2019-03-29 16:02:38 -07:00
Lei Zhang	66647a313a	[tablegen] Use tblgen:: classes for NamedAttribute and Operand fields This is another step towards hiding raw TableGen API calls. PiperOrigin-RevId: 231580827	2019-03-29 16:02:23 -07:00
Lei Zhang	b7d2e32c84	[doc] Use table to list all attributes For each attribute, list its MLIR type and description. PiperOrigin-RevId: 231580353	2019-03-29 16:02:08 -07:00
Lei Zhang	726dc08e4d	[doc] Generate more readable description for attributes This CL added "description" field to AttrConstraint and Attr, like what we have for type classes. PiperOrigin-RevId: 231579853	2019-03-29 16:01:53 -07:00
Lei Zhang	18219caeb2	[doc] Generate more readable description for operands This CL mandated TypeConstraint and Type to provide descriptions and fixed various subclasses and definitions to provide so. The purpose is to enforce good documentation; using empty string as the default just invites oversight. PiperOrigin-RevId: 231579629	2019-03-29 16:01:38 -07:00
River Riddle	994111238b	Fold CallIndirectOp to CallOp when the callee operand is a known constant function. PiperOrigin-RevId: 231511697	2019-03-29 16:01:23 -07:00
Jacques Pienaar	b52dd7f788	Use formatv for the error instead of string stream. PiperOrigin-RevId: 231507680	2019-03-29 16:01:08 -07:00
Lei Zhang	a759cf3190	Include op results in generated TensorFlow/TFLite op docs * Emitted result lists for ops. * Changed to allow empty summary and description for ops. * Avoided indenting description to allow proper MarkDown rendering of formatting markers inside description content. * Used fixed width font for operand/attribute names. * Massaged TensorFlow op docs and generated dialect op doc. PiperOrigin-RevId: 231427574	2019-03-29 16:00:53 -07:00
Uday Bondhugula	c0e9e5eb07	Fix getFullMemRefAsRegion() and FlatAffineConstraints::reset PiperOrigin-RevId: 231426734	2019-03-29 16:00:39 -07:00
Lei Zhang	c224a518f5	TableGen: Use DAG for op results Similar to op operands and attributes, use DAG to specify operation's results. This will allow us to provide names and matchers for outputs. Also Defined `outs` as a marker to indicate the start of op result list. PiperOrigin-RevId: 231422455	2019-03-29 16:00:22 -07:00
MLIR Team	a0f3db4024	Support fusing loop nests which require insertion into a new instruction Block position while preserving dependences, opening up additional fusion opportunities. - Adds SSA Value edges to the data dependence graph used in the loop fusion pass. PiperOrigin-RevId: 231417649	2019-03-29 16:00:04 -07:00
Lei Zhang	1dfc3ac5ce	Prefix Operator getter methods with "get" to be consistent PiperOrigin-RevId: 231416230	2019-03-29 15:59:46 -07:00
River Riddle	755538328b	Recommit: Define an AffineOps dialect as well as an AffineIfOp operation. Replace all instances of IfInst with AffineIfOp and delete IfInst. PiperOrigin-RevId: 231342063	2019-03-29 15:59:30 -07:00
Nicolas Vasilache	39d81f246a	Introduce python bindings for MLIR EDSCs This CL also introduces a set of python bindings using pybind11. The bindings are exercised using a `test_py2andpy3.py` test suite that works for both python 2 and 3. `test_py3.py` on the other hand uses the more idiomatic, python 3 only "PEP 3132 -- Extended Iterable Unpacking" to implement a rank and type-agnostic copy with transposition. Because python assignment is by reference, we cannot easily make the assignment operator use the same type of sugaring as in C++; i.e. the following: ```cpp Stmt block = edsc::Block({ For(ivs, zeros, shapeA, ones, { C[ivs] = IA[ivs] + IB[ivs] })}); ``` has no equivalent in the native Python EDSCs at this time. However, the sugaring can be built as a simple DSL in python and is left as future work. PiperOrigin-RevId: 231337667	2019-03-29 15:59:14 -07:00
Nicolas Vasilache	0f9436e56a	Move google-mlir to google_mlir Python modules cannot be defined under a directory that has a `-` character in its name inside of Google code. Rename to `google_mlir` which circumvents this limitation. PiperOrigin-RevId: 231329321	2019-03-29 15:42:55 -07:00
Nicolas Vasilache	ae772b7965	Automated rollback of changelist 231318632. PiperOrigin-RevId: 231327161	2019-03-29 15:42:38 -07:00
Jacques Pienaar	ad637f3cce	Enable using constant attribute as matchers. Update to allow constant attribute values to be used to match or as result in rewrite rule. Define variable ctx in the matcher to allow matchers to refer to the context of the operation being matched. PiperOrigin-RevId: 231322019	2019-03-29 15:42:23 -07:00
River Riddle	5ecef2b3f6	Define an AffineOps dialect as well as an AffineIfOp operation. Replace all instances of IfInst with AffineIfOp and delete IfInst. PiperOrigin-RevId: 231318632	2019-03-29 15:42:08 -07:00
Nicolas Vasilache	cacf05892e	Add a C API for EDSCs in other languages + python This CL adds support for calling EDSCs from other languages than C++. Following the LLVM convention this CL: 1. declares simple opaque types and a C API in mlir-c/Core.h; 2. defines the implementation directly in lib/EDSC/Types.cpp and lib/EDSC/MLIREmitter.cpp. Unlike LLVM however the nomenclature for these types and API functions is not well-defined, naming suggestions are most welcome. To avoid the need for conversion functions, Types.h and MLIREmitter.h include mlir-c/Core.h and provide constructors and conversion operators between the mlir::edsc type and the corresponding C type. In this first commit, mlir-c/Core.h only contains the types for the C API to allow EDSCs to work from Python. This includes both a minimal set of core MLIR types (mlir_context_t, mlir_type_t, mlir_func_t) as well as the EDSC types (edsc_mlir_emitter_t, edsc_expr_t, edsc_stmt_t, edsc_indexed_t). This can be restructured in the future as concrete needs arise. For now, the API only supports: 1. scalar types; 2. memrefs of scalar types with static or symbolic shapes; 3. functions with input and output of these types. The C API is not complete wrt ownership semantics. This is in large part due to the fact that python bindings are written with Pybind11 which allows very idiomatic C++ bindings. An effort is made to write a large chunk of these bindings using the C API but some C++isms are used where the design benefits from this simplification. A fully isolated C API will make more sense once we also integrate with another language like Swift and have enough use cases to drive the design. Lastly, this CL also fixes a bug in mlir::ExecutionEngine where the order of declaration of llvmContext and the JIT resulted in an improper order of destructors (which used to crash before the fix). PiperOrigin-RevId: 231290250	2019-03-29 15:41:53 -07:00
Lei Zhang	eb753f4aec	Add tblgen::Pattern to model Patterns defined in TableGen Similar to other tblgen:: abstractions, tblgen::Pattern hides the native TableGen API and provides a nicer API that is more coherent with the TableGen definitions. PiperOrigin-RevId: 231285143	2019-03-29 15:41:38 -07:00
Jacques Pienaar	0fbf4ff232	Define mAttr in terms of AttrConstraint. * Matching an attribute and specifying an attribute constraint is the same thing executionally, so represent it such. * Extract AttrConstraint helper to match TypeConstraint and use that where mAttr was previously used in RewriterGen. PiperOrigin-RevId: 231213580	2019-03-29 15:41:23 -07:00
Nicolas Vasilache	1a5287d594	Replace too obscure usage of functional::map by declare + reserve + loop. Cleanup a usage of functional::map that is deemed too obscure in `reindexAffineIndices`. Also fix a stale comment in `reindexAffineIndices`. PiperOrigin-RevId: 231211184	2019-03-29 15:41:08 -07:00
Jacques Pienaar	8c7f106e53	Add value member to constant attribute specification base. String specification of the default value is the common case so just make it so. PiperOrigin-RevId: 231204081	2019-03-29 15:40:53 -07:00
Chris Lattner	b42bea215a	Change AffineApplyOp to produce a single result, simplifying the code that works with it, and updating the g3docs. PiperOrigin-RevId: 231120927	2019-03-29 15:40:38 -07:00
River Riddle	36babbd781	Change the ForInst induction variable to be a block argument of the body instead of the ForInst itself. This is a necessary step in converting ForInst into an operation. PiperOrigin-RevId: 231064139	2019-03-29 15:40:23 -07:00
Nicolas Vasilache	0e7a8a9027	Drop AffineMap::Null and IntegerSet::Null Addresses b/122486036 This CL addresses some leftover crumbs in AffineMap and IntegerSet by removing the Null method and cleaning up the constructors. As the ::Null uses were tracked down, opportunities appeared to untangle some of the Parsing logic and make it explicit where AffineMap/IntegerSet have ambiguous syntax. Previously, ambiguous cases were hidden behind the implicit pointer values of AffineMap* and IntegerSet* that were passed as function parameters. Depending on the values of those pointers one of 3 behaviors could occur. This parsing logic convolution is one of the rare cases where I would advocate for code duplication. The more proper fix would be to make the syntax unambiguous or to allow some lookahead. PiperOrigin-RevId: 231058512	2019-03-29 15:40:08 -07:00
Nicolas Vasilache	81c7f2e2f3	Cleanup resource management and rename recursive matchers This CL follows up on a memory leak issue related to SmallVector growth that escapes the BumpPtrAllocator. The fix is to properly use ArrayRef and placement new to define away the issue. The following renaming is also applied: 1. MLFunctionMatcher -> NestedPattern 2. MLFunctionMatches -> NestedMatch As a consequence all allocations are now guaranteed to live on the BumpPtrAllocator. PiperOrigin-RevId: 231047766	2019-03-29 15:39:53 -07:00
River Riddle	75c21e1de0	Wrap cl::opt flags within passes in a category with the pass name. This improves the help output of tools like mlir-opt. Example: dma-generate options: -dma-fast-mem-capacity - Set fast memory space ... -dma-fast-mem-space=<uint> - Set fast memory space ... loop-fusion options: -fusion-compute-tolerance=<number> - Fractional increase in ... -fusion-maximal - Enables maximal loop fusion loop-tile options: -tile-size=<uint> - Use this tile size for ... loop-unroll options: -unroll-factor=<uint> - Use this unroll factor ... -unroll-full - Fully unroll loops -unroll-full-threshold=<uint> - Unroll all loops with ... -unroll-num-reps=<uint> - Unroll innermost loops ... loop-unroll-jam options: -unroll-jam-factor=<uint> - Use this unroll jam factor ... PiperOrigin-RevId: 231019363	2019-03-29 15:39:38 -07:00
Chris Lattner	146ad7cf43	Finish removing multi-result affine maps from the testsuite, and disable them. PiperOrigin-RevId: 231014261	2019-03-29 15:39:23 -07:00
Feng Liu	ebac3528d0	Add an option to improve the readability of the printed MLIR debuginfo Use `-mlir-pretty-debuginfo` if the user wants line breaks between different callsite lines. The print results before and after this CL are shown in the tests. PiperOrigin-RevId: 231013812	2019-03-29 15:39:08 -07:00
Uday Bondhugula	fb679fc2b5	Drop unused result from affine map in test case - NFC PiperOrigin-RevId: 231008044	2019-03-29 15:38:53 -07:00
Chris Lattner	607d1c2ca7	More updates of tests to move towards single result affine maps. PiperOrigin-RevId: 230991929	2019-03-29 15:38:38 -07:00
Uday Bondhugula	b4a1443508	Update replaceAllMemRefUsesWith to generate single result affine_apply's for index remapping - generate a sequence of single result affine_apply's for the index remapping (instead of one multi result affine_apply) - update dma-generate and loop-fusion test cases; while on this, change test cases to use single result affine apply ops - some fusion comment fix/cleanup PiperOrigin-RevId: 230985830	2019-03-29 15:38:23 -07:00
Nicolas Vasilache	629f5b7fcb	Add a simple arity-agnostic invocation of JIT-compiled functions. This is useful to call generic function with unspecified number of arguments e.g. when interfacing with ML frameworks. PiperOrigin-RevId: 230974736	2019-03-29 15:38:08 -07:00
Uday Bondhugula	b588d58c5f	Update createAffineComputationSlice to generate single result affine maps - Update createAffineComputationSlice to generate a sequence of single result affine apply ops instead of one multi-result affine apply - update pipeline-data-transfer test case; while on this, also update the test case to use only single result affine maps, and make it more robust to change. PiperOrigin-RevId: 230965478	2019-03-29 15:37:53 -07:00
River Riddle	c3424c3c75	Allow operations to hold a blocklist and add support for parsing/printing a block list for verbose printing. PiperOrigin-RevId: 230951462	2019-03-29 15:37:37 -07:00
Alex Zinenko	6d37a255e2	Generic dialect conversion pass exercised by LLVM IR lowering This commit introduces a generic dialect conversion/lowering/legalization pass and illustrates it on StandardOps->LLVMIR conversion. It partially reuses the PatternRewriter infrastructure and adds the following functionality: - an actual pass; - non-default pattern constructors; - one-to-many rewrites; - rewriting terminators with successors; - not applying patterns iteratively (unlike the existing greedy rewrite driver); - ability to change function signature; - ability to change basic block argument types. The latter two things required, given the existing API, to create new functions in the same module. Eventually, this should converge with the rest of PatternRewriter. However, we may want to keep two pass versions: "heavy" with function/block argument conversion and "light" that only touches operations. This pass creates new functions within a module as a means to change function signature, then creates new blocks with converted argument types in the new function. Then, it traverses the CFG in DFS-preorder to make sure defs are converted before uses in the dominated blocks. The generic pass has a minimal interface with two hooks: one to fill in the set of patterns, and another one to convert types for functions and blocks. The patterns are defined as separate classes that can be table-generated in the future. The LLVM IR lowering pass partially inherits from the existing LLVM IR translator, in particular for type conversion. It defines a conversion pattern template, instantiated for different operations, and is a good candidate for tablegen. The lowering does not yet support loads and stores and is not connected to the translator as it would have broken the existing flows. Future patches will add missing support before switching the translator in a single patch. PiperOrigin-RevId: 230951202	2019-03-29 15:37:23 -07:00
Mehdi Amini	d9ce382fc9	Use a unique_ptr instead of manual deletion for PIMPL idiom (NFC) PiperOrigin-RevId: 230930254	2019-03-29 15:37:07 -07:00
Lei Zhang	ba1715f407	Pull TableGen op argument definitions into their own files PiperOrigin-RevId: 230923050	2019-03-29 15:36:52 -07:00
Lei Zhang	2de5e9fd19	Support op removal patterns in TableGen This CL adds a new marker, replaceWithValue, to indicate that no new result op is generated by applying a pattern. Instead, the matched DAG is replaced by an existing SSA value. Converted the tf.Identity converter to use the pattern. PiperOrigin-RevId: 230922323	2019-03-29 15:36:37 -07:00
Uday Bondhugula	95f19d558c	Fix return value logic / error reporting in -dma-generate PiperOrigin-RevId: 230906158	2019-03-29 15:36:23 -07:00
Alex Zinenko	5a4403787f	Simple CPU runner This implements a simple CPU runner based on LLVM Orc JIT. The base functionality is provided by the ExecutionEngine class that compiles and links the module, and provides an interface for obtaining function pointers to the JIT-compiled MLIR functions and for invoking those functions directly. Since function pointers need to be cast to the correct pointer type, the ExecutionEngine wraps LLVM IR functions obtained from MLIR into a helper function with the common signature `void (void **)` where the single argument is interpreted as a list of pointers to the actual arguments passed to the function, eventually followed by a pointer to the result of the function. Additionally, the ExecutionEngine is set up to resolve library functions to those available in the current process, enabling support for, e.g., simple C library calls. For integration purposes, this also provides a simplistic runtime for memref descriptors as expected by the LLVM IR code produced by MLIR translation. In particular, memrefs are transformed into LLVM structs (can be mapped to C structs) with a pointer to the data, followed by dynamic sizes. This implementation only supports statically-shaped memrefs of type float, but can be extended if necessary. Provide a binary for the runner and a test that exercises it. PiperOrigin-RevId: 230876363	2019-03-29 15:36:08 -07:00
MLIR Team	5c5739d42b	Change the dependence check in the loop fusion pass to use the MLIR instruction list ordering (instead of the dependence graph node id ordering). This breaks the overloading of dependence graph node ids as both edge endpoints and instruction list position. PiperOrigin-RevId: 230849232	2019-03-29 15:35:53 -07:00
Uday Bondhugula	f94b15c247	Update dma-generate: update for multiple load/store op's per memref - introduce a way to compute union using symbolic rectangular bounding boxes - handle multiple load/store op's to the same memref by taking a union of the regions - command-line argument to provide capacity of the fast memory space - minor change to replaceAllMemRefUsesWith to not generate affine_apply if the supplied index remap was identity PiperOrigin-RevId: 230848185	2019-03-29 15:35:38 -07:00
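
Commit 1e85191d07 above fixes an ASAN issue by snapshotting an edge list before a loop that can modify it. The general pattern can be sketched as follows. This is a minimal illustration, not the code from that commit: `Edge` and `redirectEdges` are hypothetical names, and `std::vector` stands in for whatever container the loop fusion pass actually uses.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a dependence-graph edge; not MLIR's actual type.
struct Edge {
  int from;
  int to;
};

// For every edge that starts at `oldNode`, appends a matching edge that
// starts at `newNode`. Returns the number of edges appended.
// The push_back below may reallocate `edges`, which would invalidate any
// iterator into it, so the loop runs over a copy taken beforehand.
std::size_t redirectEdges(std::vector<Edge> &edges, int oldNode, int newNode) {
  assert(oldNode != newNode && "redirecting a node to itself is not useful");
  const std::vector<Edge> snapshot = edges; // Taken before any mutation.
  std::size_t added = 0;
  for (const Edge &e : snapshot) {
    if (e.from != oldNode)
      continue;
    edges.push_back({newNode, e.to}); // Safe: the loop does not iterate `edges`.
    ++added;
  }
  return added;
}
```

Iterating `edges` directly with a range-based `for` while calling `push_back` on it is undefined behavior once the vector reallocates, which is the kind of error ASAN reports as a heap-use-after-free. The copy costs one allocation per call, which is usually acceptable for short edge lists.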
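
Commit 5a4403787f above describes wrapping each function in a helper with the common signature `void (void **)`, where the argument is a list of pointers to the actual arguments followed by a pointer to the result. The sketch below shows that calling convention in plain C++, assuming a hand-written wrapper. `addScaled`, `addScaledPacked`, `PackedFn`, and `callThroughPacked` are hypothetical names for illustration; the real ExecutionEngine generates its wrappers in LLVM IR rather than C++.

```cpp
#include <cassert>

// Stand-in for a JIT-compiled function with an arbitrary signature.
float addScaled(float a, float b, int scale) { return (a + b) * scale; }

// Helper with the uniform signature `void (void **)`: each slot points to one
// argument, and the final slot points to storage for the result.
void addScaledPacked(void **args) {
  assert(args != nullptr && "expected an array of argument pointers");
  float a = *static_cast<float *>(args[0]);
  float b = *static_cast<float *>(args[1]);
  int scale = *static_cast<int *>(args[2]);
  *static_cast<float *>(args[3]) = addScaled(a, b, scale);
}

// Every wrapped function shares this pointer type, whatever its arity.
using PackedFn = void (*)(void **);

// Caller side: pack pointers to the arguments and to the result slot,
// then invoke the wrapper through the uniform pointer type.
float callThroughPacked(PackedFn fn, float a, float b, int scale) {
  float result = 0.0f;
  void *args[] = {&a, &b, &scale, &result};
  fn(args);
  return result;
}
```

With this convention the caller needs only one function-pointer type, so it never has to cast a JIT-resolved address to a signature it cannot know at compile time. The trade-off is that argument types are no longer checked: the wrapper and the caller must agree on the slot layout.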

986 Commits