Add doc for declarative rewrite rules

This doc serves as a manual for table-driven declarative rewrite rules. It lists all the details regarding supported mechanisms. PiperOrigin-RevId: 267761702
2019-09-07 05:25:39 -07:00 · 2019-09-07 05:25:39 -07:00 · 6e5d1b9d62
parent 06398f32f6
commit 6e5d1b9d62
2 changed files with 610 additions and 51 deletions
--- a/mlir/g3doc/DeclarativeRewrites.md
+++ b/mlir/g3doc/DeclarativeRewrites.md
@ -0,0 +1,610 @@
 # Table-driven Declarative Rewrite Rule (DRR)
 In addition to subclassing the `mlir::RewritePattern` C++ class, MLIR also
 supports defining rewrite rules in a declarative manner. Similar to
 [Op Definition Specification](OpDefinitions.md) (ODS), this is achieved via
 [TableGen][TableGen], which is a language to maintain records of domain-specific
 information. The rewrite rules are specified concisely in a TableGen record,
 which will be expanded into an equivalent `mlir::RewritePattern` subclass at
 compiler build time.
 This manual explains in detail all of the available mechanisms for defining
 rewrite rules in such a declarative manner. It aims to be a specification
 instead of a tutorial. Please refer to
 [Quickstart tutorial to adding MLIR graph rewrite](QuickstartRewrites.md) for
 the latter.
 Given that declarative rewrite rules depend on op definition specification, this
 manual assumes knowledge of the [ODS](OpDefinitions.md) doc.
 ## Benefits
 Compared to the hand-written C++ classes, this declarative approach has several
 benefits, including but not limited to:
 *   **Being declarative**: The pattern creator just needs to state the rewrite
    pattern declaratively, without worrying about the concrete C++ methods to
    call.
 *   **Removing boilerplate and showing the very essense the the rewrite**:
    `mlir::RewritePattern` is already good at hiding boilerplate for defining a
    rewrite rule. But we still need to write the class and function structures
    required by the C++ programming language, inspect ops for matching, and call
    op `build()` methods for constructing. These statements are typically quite
    simple and similar, so they can be further condensed with auto-generation.
    Because we reduce the boilerplate to the bare minimum, the declarative
    rewrite rule will just contain the very essense of the rewrite. This makes
    it very easy to understand the pattern.
 ## Strengths and Limitations
 The declarative rewrite rule is **operation-based**: it describes a rule to
 match against a directed acyclic graph (DAG) of operations and generate DAGs of
 operations. This gives DRR both its strengths and limitations: it is good at
 expressing op to op conversions, but not that well suited for, say, converting
 an op into a loop nest.
 Per the current implementation, DRR also does not have good support for regions
 in general.
 ## Rule Definition
 The core construct for defining a rewrite rule is defined in
 [`OpBase.td`][OpBase] as
 ```tblgen
 class Pattern<
    dag sourcePattern, list<dag> resultPatterns,
    list<dag> additionalConstraints = [],
    dag benefitsAdded = (addBenefit 0)>;
 ```
 A declarative rewrite rule contains two main components:
 *   A _source pattern_, which is used for matching a DAG of operations.
 *   One or more _result patterns_, which are used for generating DAGs of
    operations to replace the matched DAG of operations.
 We allow multiple result patterns to support
 [multi-result ops](#supporting-multi-result-ops) and
 [auxiliary ops](#supporting-auxiliary-ops), but frequently we just want to
 convert one DAG of operations to another DAG of operations. There is a handy
 wrapper of `Pattern`, `Pat`, which takes a single result pattern:
 ```tblgen
 class Pat<
    dag sourcePattern, dag resultPattern,
    list<dag> additionalConstraints = [],
    dag benefitsAdded = (addBenefit 0)> :
  Pattern<sourcePattern, [resultPattern], additionalConstraints, benefitAdded>;
 ```
 Each pattern is specified as a TableGen `dag` object with the syntax of
 `(operator arg0, arg1, ...)`.
 `operator` is typically an MLIR op, but it can also be other
 [directives](#special-directives). `argN` is for matching (if used in source
 pattern) or generating (if used in result pattern) the `N`-th argument for
 `operator`. If the `operator` is some MLIR operation, it means the `N`-th
 argument as specified in the `arguments` list of the op's definition.
 Therefore, we say op argument specification in pattern is **position-based**:
 the position where they appear matters.
 `argN` can be a `dag` object itself, thus we can have nested `dag` tree to model
 the def-use relationship between ops.
 ### Source pattern
 The source pattern is for matching a DAG of operations. Arguments in the `dag`
 object are intended to **capture** the op arguments. They can also be used to
 **further limit** the match criteria. The capturing is done by specifying a
 symbol starting with the `$` sign, while further constraints are introduced by
 specifying a `TypeConstraint` (for an operand) or a `AttrConstraint` (for an
 attribute).
 #### Binding op arguments and limiting the match
 For example,
 ```tblgen
 def AOp : Op<"a_op"> {
    let arguments = (ins
      AnyType:$a_input,
      AnyAttr:$a_attr
    );
    let results = (outs
      AnyType:$a_output
    );
 }
 def : Pat<(AOp $input, F32Attr:$attr), ...>;
 ```
 In the above, we are matching an `AOp` whose `$input` can be anything valid as
 defined by the op and whose `$attr` must be a float attribute. If the match
 succeeds, we bind the `$input` symbol to the op's only input (`$a_input`) and
 `$attr` to the only attribute (`$a_attr`); we can reference them using `$input`
 and `$attr` in result patterns and additional constraints.
 The pattern is position-based: the symbol names used for capturing here do not
 need to match with the op definition as shown in the above example. As another
 example, the pattern can be written as ` def : Pat<(AOp $a, F32Attr:$b), ...>;`
 and use `$a` and `$b` to refer to the captured input and attribute. But using
 the ODS name directly in the pattern is also allowed.
 Also note that we only need to add `TypeConstraint` or `AttributeConstraint`
 when we need to further limit the match criteria. If all valid cases to the op
 are acceptable, then we can leave the constraint unspecified.
 #### Matching DAG of operations
 To match an DAG of ops, use nested `dag` objects:
 ```tblgen
 def BOp : Op<"b_op"> {
    let arguments = (ins);
    let results = (outs
      AnyType:$b_output
    );
 }
 def : Pat<(AOp (BOp), $attr), ...>;
 ```
 The above pattern matches an `AOp` whose only operand is generated by a `BOp`,
 that is, the following MLIR code:
 ```mlir
 %0 = "b_op"() : () -> (...)
 %1 = "a_op"(%0) {attr: ...} : () -> (...)
 ```
 #### Binding op results
 To bind a symbol to the results of a matched op for later reference, attach the
 symbol to the op itself:
 ```tblgen
 def : Pat<(AOp (BOp:$b_result), $attr), ...>;
 ```
 The above will bind `$b_result` to the matched `BOp`'s result. (There are more
 details regarding multi-result ops, which is covered
 [later](#supporting-multi-result-ops).)
 ### Result pattern
 The result pattern is for generating a DAG of operations. Arguments in the `dag`
 object are intended to **reference** values captured in the source pattern and
 potentially **apply transformations**.
 #### Referencing bound symbols
 For example,
 ```tblgen
 def COp : Op<"c_op"> {
    let arguments = (ins
      AnyType:$c_input,
      AnyAttr:$c_attr
    );
    let results = (outs
      AnyType:$c_output
    );
 }
 def : Pat<(AOp $input, $attr), (COp $input, $attr)>;
 ```
 In the above, `AOp`'s only operand and attribute are bound to `$input` and
 `$attr`, respectively. We then reference them in the result pattern for
 generating the `COp` by passing them in as arguments to `COp`'s `build()`
 method.
 We can also reference symbols bound to matched op's results:
 ```tblgen
 def : Pat<(AOp (BOp:$b_result) $attr), (COp $b_result $attr)>;
 ```
 In the above, we are using `BOp`'s result for building `COp`.
 #### Building operations
 Given that `COp` was specified with table-driven op definition, there will be
 several `build()` methods generated for it. One of them has a separate argument
 in the signature for each argument appearing in the op's `arguments` list:
 `void COp::build(..., Value *input, Attribute attr)`. The pattern in the above
 calls this `build()` method for constructing the `COp`.
 In general, arguments in the the result pattern will be passed directly to the
 `build()` method to leverage the auto-generated `build()` method, list them in
 the pattern by following the exact same order as the ODS `arguments` definition.
 Otherwise, a custom `build()` method that matches the argument list is required.
 #### Generating DAG of operations
 `dag` objects can be nested to generate a DAG of operations:
 ```tblgen
 def : Pat<(AOp $input, $attr), (COp (BOp), $attr)>;
 ```
 In the above, we generate a `BOp`, and then use its result to generate the `COp`
 to replace the matched `AOp`.
 #### Binding op results
 In the result pattern, we can bind to the result(s) of an newly built op by
 attaching symbols to the op. (But we **cannot** bind to op arguments given that
 they are referencing previously bound symbols.) This is useful for reusing
 newly created results where suitable. For example,
 ```tblgen
 def DOp : Op<"d_op"> {
    let arguments = (ins
      AnyType:$d_input1,
      AnyType:$d_input2,
    );
    let results = (outs
      AnyType:$d_output
    );
 }
 def : Pat<(AOp $input, $ignored_attr), (DOp (BOp:$b_result) $b_result)>;
 ```
 In this pattern, a `AOp` is matched and replaced with a `DOp` whose two operands
 are from the result of a single `BOp`. This is only possible by binding the
 result of the `BOp` to a name and reuse it for the second operand of the `DOp`
 #### `NativeCodeCall`: transforming the generated op
 Sometimes the captured arguments are not exactly what we want so they cannot be
 directly fed in as arguments to build the new op. For such cases, we can apply
 transformations on the arguments by calling into C++ helper functions. This is
 achieved by `NativeCodeCall`.
 For example, if we want to catpure some op's attributes and group them as an
 array attribute to construct a new op:
 ```tblgen
 def TwoAttrOp : Op<"two_attr_op"> {
    let arguments = (ins
      AnyAttr:$op_attr1,
      AnyAttr:$op_attr2
    );
    let results = (outs
      AnyType:$op_output
    );
 }
 def OneAttrOp : Op<"one_attr_op"> {
    let arguments = (ins
      ArrayAttr:$op_attr
    );
    let results = (outs
      AnyType:$op_output
    );
 }
 ```
 We can write a C++ helper function:
 ```c++
 Attribute createArrayAttr(Builder &builder, Attribute a, Attribute b) {
  return builder.getArrayAttr({a, b});
 }
 ```
 And then write the pattern as:
 ```tblgen
 def createArrayAttr : NativeCodeCall<"createArrayAttr($_builder, $0, $1)">;
 def : Pat<(TwoAttrOp $attr1, $attr2),
          (OneAttrOp (createArrayAttr $attr1, $attr2))>;
 ```
 And make sure the generated C++ code from the above pattern has access to the
 definition of the C++ helper function.
 In the above example, we are using a string to specialize the `NativeCodeCall`
 template. The string can be an arbitrary C++ expression that evaluates into
 some C++ object expected at the `NativeCodeCall` site (here it would be
 expecting an array attribute). Typically the string should be a function call.
 ##### `NativeCodeCall` placeholders
 In `NativeCodeCall`, we can use placeholders like `$_builder`, `$N`. The former
 is called _special placeholder_, while the latter is called _positional
 placeholder_.
 `NativeCodeCall` right now only supports two special placeholders: `$_builder`
 and `$_self`:
 *   `$_builder` will be replaced by the current `mlir::PatternRewriter`.
 *   `$_self` will be replaced with the entity `NativeCodeCall` is attached to.
 We have seen how `$_builder` can be used in the above; it allows us to pass a
 `mlir::Builder` (`mlir::PatternRewriter` is a subclass of `mlir::OpBuilder`,
 which is a subclass of `mlir::Builder`) to the C++ helper function to use the
 handy methods on `mlir::Builder`.
 `$_self` is useful when we want to write something in the form of
 `NativeCodeCall<"...">:$symbol`. For example, if we want to reverse the previous
 example and decompose the array attribute into two attributes:
 ```tblgen
 class getNthAttr<int n> : NativeCodeCall<"$_self.getValue()[" # n # "]">;
 def : Pat<(OneAttrOp $attr),
          (TwoAttrOp (getNthAttr<0>:$attr), (getNthAttr<1>:$attr)>;
 ```
 In the above, `$_self` is substitutated by the attribute bound by `$attr`, which
 is `OnAttrOp`'s array attribute.
 Positional placeholders will be substituted by the `dag` object parameters at
 the `NativeCodeCall` use site. For example, if we define `SomeCall :
 NativeCodeCall<"someFn($1, $2, $0)">` and use it like `(SomeCall $in0, $in1,
 $in2)`, then this will be translated into C++ call `someFn($in1, $in2, $in0)`.
 ##### Customizing entire op building
 `NativeCodeCall` is not only limited to transforming arguments for building an
 op; it can also used to specify how to build an op entirely. An example:
 If we have a C++ function for building an op:
 ```c++
 Operation *createMyOp(OpBuilder builder, Value *input, Attribute attr);
 ```
 We can wrap it up and invoke it like:
 ```tblgen
 def createMyOp : NativeCodeCall<"createMyOp($_builder, $0, $1)">;
 def : Pat<(... $input, $attr), (createMyOp $input, $attr)>;
 ```
 ### Supporting auxiliary ops
 A declarative rewrite rule supports multiple result patterns. One of the purpose
 is to allow generating _auxiliary ops_. Auxiliary ops are operations used for
 building the replacement ops; but they are not directly used for replacement
 themselves.
 For the case of uni-result ops, if there are multiple result patterns, only the
 value generated from the last result pattern will be used to replace the matched
 root op's result; all other result patterns will be considered as generating
 auxiliary ops.
 Normally we want to specify ops as nested `dag` objects if their def-use
 relationship can be expressed in the way that an op's result can feed as the
 argument to consuming op. But that is not always possible. For example, if we
 want to allocate memory and store some computation (in pseudocode):
 ```mlir
 %dst = addi %lhs, %rhs
 ```
 into
 ```mlir
 %shape = shape %lhs
 %mem = alloc %shape
 %sum = addi %lhs, %rhs
 store %mem, %sum
 %dst = load %mem
 ```
 We cannot fit in with just one result pattern given `store` does not return a
 value. Instead we can use multiple result patterns:
 ```tblgen
 def : Pattern<(AddIOp $lhs, $rhs),
              [(StoreOp (AllocOp:$mem (ShapeOp %lhs)), (AddIOp $lhs, $rhs)),
               (LoadOp $mem)];
 ```
 In the above we use the first result pattern to generate the first four ops, and
 use the last pattern to generate the last op, which is used to replace the
 matched op.
 ### Supporting multi-result ops
 Multi-result ops bring extra complexity to declarative rewrite rules. We use
 TableGen `dag` objects to represent ops in patterns; there is no native way to
 indicate that an op generates multiple results. The approach adopted is based
 on **naming convention**: a `__N` suffix is added to a symbol to indicate the
 `N`-th result.
 #### `__N` suffix
 The `__N` sufix is specifying the `N`-th result as a whole (which can be
 [variadic](#supporting-variadic-ops)). For example, we can bind a symbol to some
 multi-result op and reference a specific result later:
 ```tblgen
 def ThreeResultOp : Op<"three_result_op"> {
    let arguments = (ins ...);
    let results = (outs
      AnyTensor:$op_output1,
      AnyTensor:$op_output2,
      AnyTensor:$op_output3
    );
 }
 def : Pattern<(ThreeResultOp:$results ...),
              [(... $results__0), ..., (... $results__2), ...]>;
 ```
 In the above pattern we bind `$results` to all the results generated by
 `ThreeResultOp` and references its `$input1` and `$input3` later in the result
 patterns.
 We can also bind a symbol and reference one of its specific result at the same
 time, which is typically useful when generating multi-result ops:
 ```tblgen
 // TwoResultOp has similar definition as ThreeResultOp, but only has two
 // results.
 def : Pattern<(TwoResultOp ...),
              [(ThreeResultOp:$results__2, ...),
               (replaceWithValue $results__0)]>;
 ```
 In the above, we created a `ThreeResultOp` and bind `results` to its results,
 and uses its last result (`$output3`) and first result (`$output1`) to replace
 the `TwoResultOp`'s two results, respectively.
 #### Replacing multi-result ops
 The above example also shows how to replace a matched multi-result op.
 To replace a `N`-result op, the result patterns must generate at least `N`
 declared values (see [Declared vs. actual value](#declared-vs-actual-value) for
 definition). If there are more than `N` declared values generated, only the
 last `N` declared values will be used to replace the matched op. Note that
 because of the existence of multi-result op, one result pattern **may** generate
 multiple declared values. So it means we do not necessarily need `N` result
 patterns to replace an `N`-result op. For example, to replace an op with three
 results, you can have
 ```tblgen
 // ThreeResultOp/TwoResultOp/OneResultOp generates three/two/one result(s),
 // respectively.
 // Replace each result with a result generated from an individual op.
 def : Pattern<(ThreeResultOp ...),
              [(OneResultOp ...), (OneResultOp ...), (OneResultOp ...)]>;
 // Replace the first two results with two results generated from the same op.
 def : Pattern<(ThreeResultOp ...),
              [(TwoResultOp ...), (OneResultOp ...)]>;
 // Replace all three results with three results generated from the same op.
 def : Pat<(ThreeResultOp ...), (ThreeResultOp ...)>;
 def : Pattern<(ThreeResultOp ...),
              [(AuxiliaryOp ...), (ThreeResultOp ...)]>;
 ```
 But using a single op to serve as both auxiliary op and replacement op is
 forbidden, i.e., the following is not allowed because that the first
 `TwoResultOp` generates two results but only the second result is used for
 replacing the matched op's result:
 ```tblgen
 def : Pattern<(ThreeResultOp ...),
              [(TwoResultOp ...), (TwoResultOp ...)]>;
 ```
 ### Supporting variadic ops
 #### Declared vs. actual value
 Before going into details on variadic op support, we need to define a few terms
 regarding an op's values.
 *   _Value_: either an operand or a result
 *   _Declared operand/result/value_: an operand/result/value statically declared
    in ODS of the op
 *   _Actual operand/result/value_: an operand/result/value of an op instance at
    runtime
 The above terms are needed because ops can have multiple results, and some of the
 results can also be variadic. For example,
 ```tblgen
 def MultiVariadicOp : Op<"multi_variadic_op"> {
    let arguments = (ins
      AnyTensor:$input1,
      Variadic<AnyTensor>:$input2,
      AnyTensor:$input3
    );
    let results = (outs
      AnyTensor:$output1,
      Variadic<AnyTensor>:$output2,
      AnyTensor:$output3
    );
 }
 ```
 We say the above op has 3 declared operands and 3 declared results. But at
 runtime, an instance can have 3 values corresponding to `$input2` and 2 values
 correspond to `$output2`; we say it has 5 actual operands and 4 actual
 results. A variadic operand/result is a considered as a declared value that can
 correspond to multiple actual values.
 [TODO]
 ### Supplying additional constraints
 Constraints can be placed on op arguments when matching. But sometimes we need
 to also place constraints on the matched op's results or sometimes need to limit
 the matching with some constraints that cover both the arugments and the
 results. The third parameter to `Pattern` (and `Pat`) is for this purpose.
 For example, we can write
 ```tblgen
 def HasNoUseOf: Constraint<
    CPred<"$_self->use_begin() == $_self->use_end()">, "has no use">;
 def HasSameElementType : Constraint<
    CPred<"$0.cast<ShapedType>().getElementType() == "
          "$1.cast<ShapedType>().getElementType()">,
    "has same element type">;
 def : Pattern<(TwoResultOp:$results $input),
              [(...), (...)],
              [(F32Tensor:$results__0), (HasNoUseOf:$results__1),
               (HasSameElementShape $results__0, $input)]>;
 ```
 You can
 *   Use normal `TypeConstraint`s on previous bound symbols (the first result of
    `TwoResultOp` must be a float tensor);
 *   Define new `Constraint` for previous bound symbols (the second result of
    `TwoResultOp` must has no use);
 *   Apply constraints on multiple bound symbols (`$input` and `TwoResultOp`'s
    first result must have the same element type).
 ### Adjusting benefits
 The benefit of a `Pattern` is an integer value indicating the benfit of matching
 the pattern. It determines the priorities of patterns inside the pattern rewrite
 driver. A pattern with a higher benefit is applied before one with a lower
 benefit.
 In DRR, a rule is set to have a benefit of the number of ops in the source
 pattern. This is based on the heuristics and assumptions that:
 *   Larger matches are more beneficial than smaller ones.
 *   If a smaller one is applied first the larger one may not apply anymore.
 The forth parameter to `Pattern` (and `Pat`) allows to manually tweak a
 pattern's benefit. Just supply `(addBenefit N)` to add `N` to the benefit value.
 ## Special directives
 [TODO]
 [TableGen]: https://llvm.org/docs/TableGen/index.html
 [OpBase]: https://github.com/tensorflow/mlir/blob/master/include/mlir/IR/OpBase.td
--- a/mlir/g3doc/OpDefinitions.md
+++ b/mlir/g3doc/OpDefinitions.md
@ -930,57 +930,6 @@ the shape function. The reference implementation is general and can support the
 arbitrary computations needed to specify output shapes.
 # Rewrite pattern description
 TODO: Move this section to a dedicated doc for graph rewrites
 MLIR aims to support many graph transformations across multiple levels of
 representation using declarative patterns. These patterns can be expressed using
 TableGen as well as dynamically (TBD).
 ## Op DAG pattern rewrites
 The most direct pattern supported in MLIR is rewrites of the form `(dag of
 operations) -> (dag of operations)` along with constraints (on operands and
 operations). The matchers require both dialects being matched between to be
 included in the same TableGen file. Hence pattern matching is normally defined
 in either a separate file that imports both. Matchers are defined in terms of
 the TableGen instances rather than mnemonics to allow for better error checking
 and verification generation.
 ```tablegen
 def : Pat<(TF_LeakyReluOp $arg, F32Attr:$a), (TFL_LeakyReluOp $arg, $a)>;
 def : Pat<(TF_ReluOp (TF_AddOp $lhs, $rhs)), (TFL_AddOp $lhs, $rhs, TFL_AF_Relu)>;
 def : Pat<(TF_BiasAddOp F32Tensor:$l, F32Tensor:$r),
          (TFL_AddOp $l, $r, TFL_AF_None)>;
 ```
 In the above examples it was shown how to construct matching rules between two
 dialects (TensorFlow and TensorFlowLite), showing matching arguments (attributes
 and operands) as well as matching a DAG pattern of multiple input operations to
 single output.
 1.  Matchers can be partially specified on the input (e.g., not all arguments
    constrained) and so multiple matchers can match the same set of nodes. The
    most discriminative matcher (as determined by the number of
    constrained/matching terms) will be selected, if two patterns are equally
    discriminative then an error will be reported.
 1.  Matchers between dialects have to be completely specified on the output
    (i.e., there can be no unspecified attributes of the op generated).
 1.  Operands and attributes can be further constrained from the op definition
    (e.g., bias add rule only matches the case where both Tensors have F32
    elements).
    1.  Attributes can be transformed by transform rules to produce an attribute
        of a type different than the type matched.
 TODO: Add constraints on the matching rules.
 TODO: Describe the generation of benefit metric given pattern.
 [TableGen]: https://llvm.org/docs/TableGen/index.html
 [TableGenIntro]: https://llvm.org/docs/TableGen/LangIntro.html
 [TableGenRef]: https://llvm.org/docs/TableGen/LangRef.html