2019-09-07 20:25:39 +08:00
|
|
|
# Table-driven Declarative Rewrite Rule (DRR)
|
|
|
|
|
|
|
|
In addition to subclassing the `mlir::RewritePattern` C++ class, MLIR also
|
|
|
|
supports defining rewrite rules in a declarative manner. Similar to
|
|
|
|
[Op Definition Specification](OpDefinitions.md) (ODS), this is achieved via
|
|
|
|
[TableGen][TableGen], which is a language to maintain records of domain-specific
|
|
|
|
information. The rewrite rules are specified concisely in a TableGen record,
|
|
|
|
which will be expanded into an equivalent `mlir::RewritePattern` subclass at
|
|
|
|
compiler build time.
|
|
|
|
|
|
|
|
This manual explains in detail all of the available mechanisms for defining
|
|
|
|
rewrite rules in such a declarative manner. It aims to be a specification
|
|
|
|
instead of a tutorial. Please refer to
|
2020-04-19 12:51:03 +08:00
|
|
|
[Quickstart tutorial to adding MLIR graph
|
|
|
|
rewrite](Tutorials/QuickstartRewrites.md) for the latter.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
Given that declarative rewrite rules depend on op definition specification, this
|
|
|
|
manual assumes knowledge of the [ODS](OpDefinitions.md) doc.
|
|
|
|
|
|
|
|
## Benefits
|
|
|
|
|
|
|
|
Compared to the hand-written C++ classes, this declarative approach has several
|
|
|
|
benefits, including but not limited to:
|
|
|
|
|
|
|
|
* **Being declarative**: The pattern creator just needs to state the rewrite
|
|
|
|
pattern declaratively, without worrying about the concrete C++ methods to
|
|
|
|
call.
|
2019-09-26 02:57:13 +08:00
|
|
|
* **Removing boilerplate and showing the very essence of the rewrite**:
|
2019-09-07 20:25:39 +08:00
|
|
|
`mlir::RewritePattern` is already good at hiding boilerplate for defining a
|
|
|
|
rewrite rule. But we still need to write the class and function structures
|
|
|
|
required by the C++ programming language, inspect ops for matching, and call
|
|
|
|
op `build()` methods for constructing. These statements are typically quite
|
|
|
|
simple and similar, so they can be further condensed with auto-generation.
|
|
|
|
Because we reduce the boilerplate to the bare minimum, the declarative
|
2019-09-26 02:57:13 +08:00
|
|
|
rewrite rule will just contain the very essence of the rewrite. This makes
|
2019-09-07 20:25:39 +08:00
|
|
|
it very easy to understand the pattern.
|
|
|
|
|
|
|
|
## Strengths and Limitations
|
|
|
|
|
|
|
|
The declarative rewrite rule is **operation-based**: it describes a rule to
|
|
|
|
match against a directed acyclic graph (DAG) of operations and generate DAGs of
|
|
|
|
operations. This gives DRR both its strengths and limitations: it is good at
|
|
|
|
expressing op to op conversions, but not that well suited for, say, converting
|
|
|
|
an op into a loop nest.
|
|
|
|
|
2019-11-07 01:56:01 +08:00
|
|
|
Per the current implementation, DRR does not have good support for the following
|
|
|
|
features:
|
|
|
|
|
|
|
|
* Matching and generating ops with regions.
|
|
|
|
* Matching and generating ops with block arguments.
|
|
|
|
* Matching multi-result ops in nested patterns.
|
|
|
|
* Matching and generating variadic operand/result ops in nested patterns.
|
2019-12-04 20:58:12 +08:00
|
|
|
* Packing and unpacking variadic operands/results during generation.
|
2019-11-07 01:56:01 +08:00
|
|
|
* [`NativeCodeCall`](#native-code-call-transforming-the-generated-op)
|
|
|
|
returning more than one results.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
## Rule Definition
|
|
|
|
|
|
|
|
The core construct for defining a rewrite rule is defined in
|
|
|
|
[`OpBase.td`][OpBase] as
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
class Pattern<
|
|
|
|
dag sourcePattern, list<dag> resultPatterns,
|
|
|
|
list<dag> additionalConstraints = [],
|
|
|
|
dag benefitsAdded = (addBenefit 0)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
A declarative rewrite rule contains two main components:
|
|
|
|
|
|
|
|
* A _source pattern_, which is used for matching a DAG of operations.
|
|
|
|
* One or more _result patterns_, which are used for generating DAGs of
|
|
|
|
operations to replace the matched DAG of operations.
|
|
|
|
|
|
|
|
We allow multiple result patterns to support
|
|
|
|
[multi-result ops](#supporting-multi-result-ops) and
|
|
|
|
[auxiliary ops](#supporting-auxiliary-ops), but frequently we just want to
|
|
|
|
convert one DAG of operations to another DAG of operations. There is a handy
|
|
|
|
wrapper of `Pattern`, `Pat`, which takes a single result pattern:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
class Pat<
|
|
|
|
dag sourcePattern, dag resultPattern,
|
|
|
|
list<dag> additionalConstraints = [],
|
|
|
|
dag benefitsAdded = (addBenefit 0)> :
|
|
|
|
Pattern<sourcePattern, [resultPattern], additionalConstraints, benefitAdded>;
|
|
|
|
```
|
|
|
|
|
|
|
|
Each pattern is specified as a TableGen `dag` object with the syntax of
|
|
|
|
`(operator arg0, arg1, ...)`.
|
|
|
|
|
|
|
|
`operator` is typically an MLIR op, but it can also be other
|
|
|
|
[directives](#special-directives). `argN` is for matching (if used in source
|
|
|
|
pattern) or generating (if used in result pattern) the `N`-th argument for
|
|
|
|
`operator`. If the `operator` is some MLIR operation, it means the `N`-th
|
|
|
|
argument as specified in the `arguments` list of the op's definition.
|
|
|
|
Therefore, we say op argument specification in pattern is **position-based**:
|
|
|
|
the position where they appear matters.
|
|
|
|
|
|
|
|
`argN` can be a `dag` object itself, thus we can have nested `dag` tree to model
|
|
|
|
the def-use relationship between ops.
|
|
|
|
|
|
|
|
### Source pattern
|
|
|
|
|
|
|
|
The source pattern is for matching a DAG of operations. Arguments in the `dag`
|
|
|
|
object are intended to **capture** the op arguments. They can also be used to
|
|
|
|
**further limit** the match criteria. The capturing is done by specifying a
|
|
|
|
symbol starting with the `$` sign, while further constraints are introduced by
|
|
|
|
specifying a `TypeConstraint` (for an operand) or a `AttrConstraint` (for an
|
|
|
|
attribute).
|
|
|
|
|
|
|
|
#### Binding op arguments and limiting the match
|
|
|
|
|
|
|
|
For example,
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def AOp : Op<"a_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
AnyType:$a_input,
|
|
|
|
AnyAttr:$a_attr
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$a_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
def : Pat<(AOp $input, F32Attr:$attr), ...>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above, we are matching an `AOp` whose `$input` can be anything valid as
|
|
|
|
defined by the op and whose `$attr` must be a float attribute. If the match
|
|
|
|
succeeds, we bind the `$input` symbol to the op's only input (`$a_input`) and
|
|
|
|
`$attr` to the only attribute (`$a_attr`); we can reference them using `$input`
|
|
|
|
and `$attr` in result patterns and additional constraints.
|
|
|
|
|
|
|
|
The pattern is position-based: the symbol names used for capturing here do not
|
|
|
|
need to match with the op definition as shown in the above example. As another
|
2020-10-20 12:51:41 +08:00
|
|
|
example, the pattern can be written as `def : Pat<(AOp $a, F32Attr:$b), ...>;`
|
2019-09-07 20:25:39 +08:00
|
|
|
and use `$a` and `$b` to refer to the captured input and attribute. But using
|
2020-10-20 12:51:41 +08:00
|
|
|
the ODS name directly in the pattern is also allowed. Operands in the source
|
|
|
|
pattern can have the same name. This bounds one operand to the name while
|
|
|
|
verifying the rest are all equal.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
Also note that we only need to add `TypeConstraint` or `AttributeConstraint`
|
|
|
|
when we need to further limit the match criteria. If all valid cases to the op
|
|
|
|
are acceptable, then we can leave the constraint unspecified.
|
|
|
|
|
2019-12-02 23:54:23 +08:00
|
|
|
`$_` is a special symbol to mean ignore capturing an argument. For example,
|
|
|
|
`def : Pat<(AOp $_, $b), ...>` means only `$b` is interesting to capture and
|
|
|
|
will be referenced later in result patterns. It's still possible to place
|
|
|
|
additional constraints even if the symbol is not to be captured; for such case,
|
|
|
|
you can simply use just the `TypeConstraint` or `AttributeConstraint` without a
|
|
|
|
bound symbol, for example, `def : Pat<(AOp $a, F32Attr), ...>`.
|
|
|
|
|
2019-09-07 20:25:39 +08:00
|
|
|
#### Matching DAG of operations
|
|
|
|
|
2020-04-29 13:47:35 +08:00
|
|
|
To match a DAG of ops, use nested `dag` objects:
|
2019-09-07 20:25:39 +08:00
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
def BOp : Op<"b_op"> {
|
|
|
|
let arguments = (ins);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$b_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
|
|
def : Pat<(AOp (BOp), $attr), ...>;
|
|
|
|
```
|
|
|
|
|
|
|
|
The above pattern matches an `AOp` whose only operand is generated by a `BOp`,
|
|
|
|
that is, the following MLIR code:
|
|
|
|
|
|
|
|
```mlir
|
|
|
|
%0 = "b_op"() : () -> (...)
|
|
|
|
%1 = "a_op"(%0) {attr: ...} : () -> (...)
|
|
|
|
```
|
|
|
|
|
|
|
|
#### Binding op results
|
|
|
|
|
|
|
|
To bind a symbol to the results of a matched op for later reference, attach the
|
|
|
|
symbol to the op itself:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def : Pat<(AOp (BOp:$b_result), $attr), ...>;
|
|
|
|
```
|
|
|
|
|
|
|
|
The above will bind `$b_result` to the matched `BOp`'s result. (There are more
|
|
|
|
details regarding multi-result ops, which is covered
|
|
|
|
[later](#supporting-multi-result-ops).)
|
|
|
|
|
|
|
|
### Result pattern
|
|
|
|
|
|
|
|
The result pattern is for generating a DAG of operations. Arguments in the `dag`
|
|
|
|
object are intended to **reference** values captured in the source pattern and
|
|
|
|
potentially **apply transformations**.
|
|
|
|
|
|
|
|
#### Referencing bound symbols
|
|
|
|
|
|
|
|
For example,
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def COp : Op<"c_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
AnyType:$c_input,
|
|
|
|
AnyAttr:$c_attr
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$c_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
def : Pat<(AOp $input, $attr), (COp $input, $attr)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above, `AOp`'s only operand and attribute are bound to `$input` and
|
|
|
|
`$attr`, respectively. We then reference them in the result pattern for
|
|
|
|
generating the `COp` by passing them in as arguments to `COp`'s `build()`
|
|
|
|
method.
|
|
|
|
|
|
|
|
We can also reference symbols bound to matched op's results:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def : Pat<(AOp (BOp:$b_result) $attr), (COp $b_result $attr)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above, we are using `BOp`'s result for building `COp`.
|
|
|
|
|
|
|
|
#### Building operations
|
|
|
|
|
|
|
|
Given that `COp` was specified with table-driven op definition, there will be
|
2019-11-15 23:33:21 +08:00
|
|
|
several `build()` methods generated for it. One of them has aggregated
|
|
|
|
parameters for result types, operands, and attributes in the signature: `void
|
2019-12-24 06:45:01 +08:00
|
|
|
COp::build(..., ArrayRef<Type> resultTypes, Array<Value> operands,
|
2019-11-15 23:33:21 +08:00
|
|
|
ArrayRef<NamedAttribute> attr)`. The pattern in the above calls this `build()`
|
|
|
|
method for constructing the `COp`.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
2019-11-20 21:37:49 +08:00
|
|
|
In general, arguments in the result pattern will be passed directly to the
|
2019-09-07 20:25:39 +08:00
|
|
|
`build()` method to leverage the auto-generated `build()` method, list them in
|
|
|
|
the pattern by following the exact same order as the ODS `arguments` definition.
|
|
|
|
Otherwise, a custom `build()` method that matches the argument list is required.
|
|
|
|
|
2019-10-01 02:48:30 +08:00
|
|
|
Right now all ODS-generated `build()` methods require specifying the result
|
|
|
|
type(s), unless the op has known traits like `SameOperandsAndResultType` that
|
|
|
|
we can use to auto-generate a `build()` method with result type deduction.
|
|
|
|
When generating an op to replace the result of the matched root op, we can use
|
|
|
|
the matched root op's result type when calling the ODS-generated builder.
|
|
|
|
Otherwise (e.g., generating an [auxiliary op](#supporting-auxiliary-ops) or
|
|
|
|
generating an op with a nested result pattern), DRR will not be able to deduce
|
|
|
|
the result type(s). The pattern author will need to define a custom builder
|
|
|
|
that has result type deduction ability via `OpBuilder` in ODS. For example,
|
|
|
|
in the following pattern
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-11-15 23:33:21 +08:00
|
|
|
def : Pat<(AOp $input, $attr), (COp (AOp $input, $attr) $attr)>;
|
2019-10-01 02:48:30 +08:00
|
|
|
```
|
|
|
|
|
2019-11-15 23:33:21 +08:00
|
|
|
`AOp` is generated via a nested result pattern; DRR won't be able to deduce the
|
|
|
|
result type for it. A custom builder for `AOp` should be defined and it should
|
2019-12-06 21:58:59 +08:00
|
|
|
deduce the result type by itself. The builder should have the separate parameter
|
|
|
|
for each operand and attribute and deduce the result type internally by itself.
|
|
|
|
For example, for the above `AOp`, a possible builder is:
|
2019-10-01 02:48:30 +08:00
|
|
|
|
2019-11-15 23:33:21 +08:00
|
|
|
```c++
|
|
|
|
|
2020-04-23 22:02:46 +08:00
|
|
|
void AOp::build(OpBuilder &builder, OperationState &state,
|
2019-12-24 06:45:01 +08:00
|
|
|
Value input, Attribute attr) {
|
2019-11-15 23:33:21 +08:00
|
|
|
state.addOperands({input});
|
|
|
|
state.addAttribute("a_attr", attr);
|
|
|
|
Type type = ...; // Deduce result type here
|
|
|
|
state.addTypes({type});
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
Failing to define such a builder will result in an error at C++ compilation time
|
|
|
|
saying the call to `AOp::build()` cannot be resolved because of the number of
|
|
|
|
parameters mismatch.
|
2019-10-01 02:48:30 +08:00
|
|
|
|
2019-09-07 20:25:39 +08:00
|
|
|
#### Generating DAG of operations
|
|
|
|
|
|
|
|
`dag` objects can be nested to generate a DAG of operations:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def : Pat<(AOp $input, $attr), (COp (BOp), $attr)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above, we generate a `BOp`, and then use its result to generate the `COp`
|
|
|
|
to replace the matched `AOp`.
|
|
|
|
|
|
|
|
#### Binding op results
|
|
|
|
|
2019-09-26 02:57:13 +08:00
|
|
|
In the result pattern, we can bind to the result(s) of a newly built op by
|
2019-09-07 20:25:39 +08:00
|
|
|
attaching symbols to the op. (But we **cannot** bind to op arguments given that
|
|
|
|
they are referencing previously bound symbols.) This is useful for reusing
|
|
|
|
newly created results where suitable. For example,
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def DOp : Op<"d_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
AnyType:$d_input1,
|
|
|
|
AnyType:$d_input2,
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$d_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
def : Pat<(AOp $input, $ignored_attr), (DOp (BOp:$b_result) $b_result)>;
|
|
|
|
```
|
|
|
|
|
2019-12-06 21:58:59 +08:00
|
|
|
In this pattern, an `AOp` is matched and replaced with a `DOp` whose two
|
|
|
|
operands are from the result of a single `BOp`. This is only possible by binding
|
|
|
|
the result of the `BOp` to a name and reuse it for the second operand of the
|
|
|
|
`DOp`
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
#### `NativeCodeCall`: transforming the generated op
|
|
|
|
|
|
|
|
Sometimes the captured arguments are not exactly what we want so they cannot be
|
|
|
|
directly fed in as arguments to build the new op. For such cases, we can apply
|
|
|
|
transformations on the arguments by calling into C++ helper functions. This is
|
|
|
|
achieved by `NativeCodeCall`.
|
|
|
|
|
2019-09-26 02:57:13 +08:00
|
|
|
For example, if we want to capture some op's attributes and group them as an
|
2019-09-07 20:25:39 +08:00
|
|
|
array attribute to construct a new op:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
def TwoAttrOp : Op<"two_attr_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
AnyAttr:$op_attr1,
|
|
|
|
AnyAttr:$op_attr2
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$op_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
def OneAttrOp : Op<"one_attr_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
ArrayAttr:$op_attr
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyType:$op_output
|
|
|
|
);
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
We can write a C++ helper function:
|
|
|
|
|
|
|
|
```c++
|
|
|
|
Attribute createArrayAttr(Builder &builder, Attribute a, Attribute b) {
|
|
|
|
return builder.getArrayAttr({a, b});
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
And then write the pattern as:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def createArrayAttr : NativeCodeCall<"createArrayAttr($_builder, $0, $1)">;
|
|
|
|
|
|
|
|
def : Pat<(TwoAttrOp $attr1, $attr2),
|
|
|
|
(OneAttrOp (createArrayAttr $attr1, $attr2))>;
|
|
|
|
```
|
|
|
|
|
|
|
|
And make sure the generated C++ code from the above pattern has access to the
|
|
|
|
definition of the C++ helper function.
|
|
|
|
|
|
|
|
In the above example, we are using a string to specialize the `NativeCodeCall`
|
|
|
|
template. The string can be an arbitrary C++ expression that evaluates into
|
|
|
|
some C++ object expected at the `NativeCodeCall` site (here it would be
|
|
|
|
expecting an array attribute). Typically the string should be a function call.
|
|
|
|
|
2019-11-07 01:56:01 +08:00
|
|
|
Note that currently `NativeCodeCall` must return no more than one value or
|
|
|
|
attribute. This might change in the future.
|
|
|
|
|
2019-09-07 20:25:39 +08:00
|
|
|
##### `NativeCodeCall` placeholders
|
|
|
|
|
|
|
|
In `NativeCodeCall`, we can use placeholders like `$_builder`, `$N`. The former
|
|
|
|
is called _special placeholder_, while the latter is called _positional
|
|
|
|
placeholder_.
|
|
|
|
|
2020-08-12 05:06:16 +08:00
|
|
|
`NativeCodeCall` right now only supports three special placeholders:
|
|
|
|
`$_builder`, `$_loc`, and `$_self`:
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
* `$_builder` will be replaced by the current `mlir::PatternRewriter`.
|
2020-08-12 05:06:16 +08:00
|
|
|
* `$_loc` will be replaced by the fused location or custom location (as
|
|
|
|
determined by location directive).
|
2019-09-07 20:25:39 +08:00
|
|
|
* `$_self` will be replaced with the entity `NativeCodeCall` is attached to.
|
|
|
|
|
|
|
|
We have seen how `$_builder` can be used in the above; it allows us to pass a
|
|
|
|
`mlir::Builder` (`mlir::PatternRewriter` is a subclass of `mlir::OpBuilder`,
|
|
|
|
which is a subclass of `mlir::Builder`) to the C++ helper function to use the
|
|
|
|
handy methods on `mlir::Builder`.
|
|
|
|
|
|
|
|
`$_self` is useful when we want to write something in the form of
|
|
|
|
`NativeCodeCall<"...">:$symbol`. For example, if we want to reverse the previous
|
|
|
|
example and decompose the array attribute into two attributes:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2020-02-15 14:54:18 +08:00
|
|
|
class getNthAttr<int n> : NativeCodeCall<"$_self[" # n # "]">;
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
def : Pat<(OneAttrOp $attr),
|
|
|
|
(TwoAttrOp (getNthAttr<0>:$attr), (getNthAttr<1>:$attr)>;
|
|
|
|
```
|
|
|
|
|
2019-10-21 00:44:06 +08:00
|
|
|
In the above, `$_self` is substituted by the attribute bound by `$attr`, which
|
2020-03-30 16:20:08 +08:00
|
|
|
is `OneAttrOp`'s array attribute.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
Positional placeholders will be substituted by the `dag` object parameters at
|
|
|
|
the `NativeCodeCall` use site. For example, if we define `SomeCall :
|
|
|
|
NativeCodeCall<"someFn($1, $2, $0)">` and use it like `(SomeCall $in0, $in1,
|
|
|
|
$in2)`, then this will be translated into C++ call `someFn($in1, $in2, $in0)`.
|
|
|
|
|
|
|
|
##### Customizing entire op building
|
|
|
|
|
|
|
|
`NativeCodeCall` is not only limited to transforming arguments for building an
|
2019-09-26 02:57:13 +08:00
|
|
|
op; it can be also used to specify how to build an op entirely. An example:
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
If we have a C++ function for building an op:
|
|
|
|
|
|
|
|
```c++
|
2019-12-24 06:45:01 +08:00
|
|
|
Operation *createMyOp(OpBuilder builder, Value input, Attribute attr);
|
2019-09-07 20:25:39 +08:00
|
|
|
```
|
|
|
|
|
|
|
|
We can wrap it up and invoke it like:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def createMyOp : NativeCodeCall<"createMyOp($_builder, $0, $1)">;
|
|
|
|
|
|
|
|
def : Pat<(... $input, $attr), (createMyOp $input, $attr)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
### Supporting auxiliary ops
|
|
|
|
|
2019-09-26 02:57:13 +08:00
|
|
|
A declarative rewrite rule supports multiple result patterns. One of the
|
|
|
|
purposes is to allow generating _auxiliary ops_. Auxiliary ops are operations
|
|
|
|
used for building the replacement ops; but they are not directly used for
|
|
|
|
replacement themselves.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
For the case of uni-result ops, if there are multiple result patterns, only the
|
|
|
|
value generated from the last result pattern will be used to replace the matched
|
|
|
|
root op's result; all other result patterns will be considered as generating
|
|
|
|
auxiliary ops.
|
|
|
|
|
|
|
|
Normally we want to specify ops as nested `dag` objects if their def-use
|
|
|
|
relationship can be expressed in the way that an op's result can feed as the
|
|
|
|
argument to consuming op. But that is not always possible. For example, if we
|
|
|
|
want to allocate memory and store some computation (in pseudocode):
|
|
|
|
|
|
|
|
```mlir
|
|
|
|
%dst = addi %lhs, %rhs
|
|
|
|
```
|
|
|
|
|
|
|
|
into
|
|
|
|
|
|
|
|
```mlir
|
|
|
|
%shape = shape %lhs
|
|
|
|
%mem = alloc %shape
|
|
|
|
%sum = addi %lhs, %rhs
|
|
|
|
store %mem, %sum
|
|
|
|
%dst = load %mem
|
|
|
|
```
|
|
|
|
|
|
|
|
We cannot fit in with just one result pattern given `store` does not return a
|
|
|
|
value. Instead we can use multiple result patterns:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def : Pattern<(AddIOp $lhs, $rhs),
|
2020-03-30 16:20:08 +08:00
|
|
|
[(StoreOp (AllocOp:$mem (ShapeOp $lhs)), (AddIOp $lhs, $rhs)),
|
2019-09-07 20:25:39 +08:00
|
|
|
(LoadOp $mem)];
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above we use the first result pattern to generate the first four ops, and
|
|
|
|
use the last pattern to generate the last op, which is used to replace the
|
|
|
|
matched op.
|
|
|
|
|
|
|
|
### Supporting multi-result ops
|
|
|
|
|
|
|
|
Multi-result ops bring extra complexity to declarative rewrite rules. We use
|
|
|
|
TableGen `dag` objects to represent ops in patterns; there is no native way to
|
|
|
|
indicate that an op generates multiple results. The approach adopted is based
|
|
|
|
on **naming convention**: a `__N` suffix is added to a symbol to indicate the
|
|
|
|
`N`-th result.
|
|
|
|
|
|
|
|
#### `__N` suffix
|
|
|
|
|
2019-12-03 01:12:48 +08:00
|
|
|
The `__N` suffix is specifying the `N`-th result as a whole (which can be
|
2019-09-07 20:25:39 +08:00
|
|
|
[variadic](#supporting-variadic-ops)). For example, we can bind a symbol to some
|
|
|
|
multi-result op and reference a specific result later:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def ThreeResultOp : Op<"three_result_op"> {
|
|
|
|
let arguments = (ins ...);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyTensor:$op_output1,
|
|
|
|
AnyTensor:$op_output2,
|
|
|
|
AnyTensor:$op_output3
|
|
|
|
);
|
|
|
|
}
|
|
|
|
|
|
|
|
def : Pattern<(ThreeResultOp:$results ...),
|
|
|
|
[(... $results__0), ..., (... $results__2), ...]>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above pattern we bind `$results` to all the results generated by
|
|
|
|
`ThreeResultOp` and references its `$input1` and `$input3` later in the result
|
|
|
|
patterns.
|
|
|
|
|
|
|
|
We can also bind a symbol and reference one of its specific result at the same
|
|
|
|
time, which is typically useful when generating multi-result ops:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
// TwoResultOp has similar definition as ThreeResultOp, but only has two
|
|
|
|
// results.
|
|
|
|
|
|
|
|
def : Pattern<(TwoResultOp ...),
|
|
|
|
[(ThreeResultOp:$results__2, ...),
|
|
|
|
(replaceWithValue $results__0)]>;
|
|
|
|
```
|
|
|
|
|
|
|
|
In the above, we created a `ThreeResultOp` and bind `results` to its results,
|
|
|
|
and uses its last result (`$output3`) and first result (`$output1`) to replace
|
|
|
|
the `TwoResultOp`'s two results, respectively.
|
|
|
|
|
|
|
|
#### Replacing multi-result ops
|
|
|
|
|
|
|
|
The above example also shows how to replace a matched multi-result op.
|
|
|
|
|
2020-04-29 13:47:35 +08:00
|
|
|
To replace an `N`-result op, the result patterns must generate at least `N`
|
2019-09-07 20:25:39 +08:00
|
|
|
declared values (see [Declared vs. actual value](#declared-vs-actual-value) for
|
|
|
|
definition). If there are more than `N` declared values generated, only the
|
|
|
|
last `N` declared values will be used to replace the matched op. Note that
|
|
|
|
because of the existence of multi-result op, one result pattern **may** generate
|
|
|
|
multiple declared values. So it means we do not necessarily need `N` result
|
|
|
|
patterns to replace an `N`-result op. For example, to replace an op with three
|
|
|
|
results, you can have
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
// ThreeResultOp/TwoResultOp/OneResultOp generates three/two/one result(s),
|
|
|
|
// respectively.
|
|
|
|
|
|
|
|
// Replace each result with a result generated from an individual op.
|
|
|
|
def : Pattern<(ThreeResultOp ...),
|
|
|
|
[(OneResultOp ...), (OneResultOp ...), (OneResultOp ...)]>;
|
|
|
|
|
|
|
|
// Replace the first two results with two results generated from the same op.
|
|
|
|
def : Pattern<(ThreeResultOp ...),
|
|
|
|
[(TwoResultOp ...), (OneResultOp ...)]>;
|
|
|
|
|
|
|
|
// Replace all three results with three results generated from the same op.
|
|
|
|
def : Pat<(ThreeResultOp ...), (ThreeResultOp ...)>;
|
|
|
|
|
|
|
|
def : Pattern<(ThreeResultOp ...),
|
|
|
|
[(AuxiliaryOp ...), (ThreeResultOp ...)]>;
|
|
|
|
```
|
|
|
|
|
|
|
|
But using a single op to serve as both auxiliary op and replacement op is
|
|
|
|
forbidden, i.e., the following is not allowed because that the first
|
|
|
|
`TwoResultOp` generates two results but only the second result is used for
|
|
|
|
replacing the matched op's result:
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def : Pattern<(ThreeResultOp ...),
|
|
|
|
[(TwoResultOp ...), (TwoResultOp ...)]>;
|
|
|
|
```
|
|
|
|
|
|
|
|
### Supporting variadic ops
|
|
|
|
|
|
|
|
#### Declared vs. actual value
|
|
|
|
|
|
|
|
Before going into details on variadic op support, we need to define a few terms
|
|
|
|
regarding an op's values.
|
|
|
|
|
|
|
|
* _Value_: either an operand or a result
|
|
|
|
* _Declared operand/result/value_: an operand/result/value statically declared
|
|
|
|
in ODS of the op
|
|
|
|
* _Actual operand/result/value_: an operand/result/value of an op instance at
|
|
|
|
runtime
|
|
|
|
|
|
|
|
The above terms are needed because ops can have multiple results, and some of the
|
|
|
|
results can also be variadic. For example,
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2019-09-07 20:25:39 +08:00
|
|
|
def MultiVariadicOp : Op<"multi_variadic_op"> {
|
|
|
|
let arguments = (ins
|
|
|
|
AnyTensor:$input1,
|
|
|
|
Variadic<AnyTensor>:$input2,
|
|
|
|
AnyTensor:$input3
|
|
|
|
);
|
|
|
|
|
|
|
|
let results = (outs
|
|
|
|
AnyTensor:$output1,
|
|
|
|
Variadic<AnyTensor>:$output2,
|
|
|
|
AnyTensor:$output3
|
|
|
|
);
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
We say the above op has 3 declared operands and 3 declared results. But at
|
|
|
|
runtime, an instance can have 3 values corresponding to `$input2` and 2 values
|
|
|
|
correspond to `$output2`; we say it has 5 actual operands and 4 actual
|
|
|
|
results. A variadic operand/result is a considered as a declared value that can
|
|
|
|
correspond to multiple actual values.
|
|
|
|
|
|
|
|
[TODO]
|
|
|
|
|
|
|
|
### Supplying additional constraints
|
|
|
|
|
|
|
|
Constraints can be placed on op arguments when matching. But sometimes we need
|
|
|
|
to also place constraints on the matched op's results or sometimes need to limit
|
2019-09-26 02:57:13 +08:00
|
|
|
the matching with some constraints that cover both the arguments and the
|
2019-09-07 20:25:39 +08:00
|
|
|
results. The third parameter to `Pattern` (and `Pat`) is for this purpose.
|
|
|
|
|
|
|
|
For example, we can write
|
|
|
|
|
2020-01-01 01:52:19 +08:00
|
|
|
```tablegen
|
2020-01-12 00:54:04 +08:00
|
|
|
def HasNoUseOf: Constraint<CPred<"$_self.use_empty()">, "has no use">;
|
2019-09-07 20:25:39 +08:00
|
|
|
|
|
|
|
def HasSameElementType : Constraint<
|
|
|
|
CPred<"$0.cast<ShapedType>().getElementType() == "
|
|
|
|
"$1.cast<ShapedType>().getElementType()">,
|
|
|
|
"has same element type">;
|
|
|
|
|
|
|
|
def : Pattern<(TwoResultOp:$results $input),
|
|
|
|
[(...), (...)],
|
|
|
|
[(F32Tensor:$results__0), (HasNoUseOf:$results__1),
|
|
|
|
(HasSameElementShape $results__0, $input)]>;
|
|
|
|
```
|
|
|
|
|
|
|
|
You can
|
|
|
|
|
|
|
|
* Use normal `TypeConstraint`s on previous bound symbols (the first result of
|
|
|
|
`TwoResultOp` must be a float tensor);
|
|
|
|
* Define new `Constraint` for previous bound symbols (the second result of
|
|
|
|
`TwoResultOp` must has no use);
|
|
|
|
* Apply constraints on multiple bound symbols (`$input` and `TwoResultOp`'s
|
|
|
|
first result must have the same element type).
|
|
|
|
|
|
|
|
### Adjusting benefits
|
|
|
|
|
2019-09-26 02:57:13 +08:00
|
|
|
The benefit of a `Pattern` is an integer value indicating the benefit of matching
|
2019-09-07 20:25:39 +08:00
|
|
|
the pattern. It determines the priorities of patterns inside the pattern rewrite
|
|
|
|
driver. A pattern with a higher benefit is applied before one with a lower
|
|
|
|
benefit.
|
|
|
|
|
|
|
|
In DRR, a rule is set to have a benefit of the number of ops in the source
|
|
|
|
pattern. This is based on the heuristics and assumptions that:
|
|
|
|
|
|
|
|
* Larger matches are more beneficial than smaller ones.
|
|
|
|
* If a smaller one is applied first the larger one may not apply anymore.
|
|
|
|
|
|
|
|
|
2019-09-26 02:57:13 +08:00
|
|
|
The fourth parameter to `Pattern` (and `Pat`) allows to manually tweak a
|
2019-09-07 20:25:39 +08:00
|
|
|
pattern's benefit. Just supply `(addBenefit N)` to add `N` to the benefit value.
|
|
|
|
|
2020-04-07 22:44:19 +08:00
|
|
|
## Rewrite directives
|
2019-09-07 20:25:39 +08:00
|
|
|
|
2020-04-07 22:44:19 +08:00
|
|
|
### `location`
|
|
|
|
|
|
|
|
By default the C++ pattern expanded from a DRR pattern uses the fused location
|
|
|
|
of all source ops as the location for all generated ops. This is not always the
|
|
|
|
best location mapping relationship. For such cases, DRR provides the `location`
|
|
|
|
directive to provide finer control.
|
|
|
|
|
|
|
|
`location` is of the following syntax:
|
|
|
|
|
2020-04-29 13:47:35 +08:00
|
|
|
```tablegen
|
2020-04-07 22:44:19 +08:00
|
|
|
(location $symbol0, $symbol1, ...)
|
|
|
|
```
|
|
|
|
|
2020-04-10 08:18:34 +08:00
|
|
|
where all `$symbol` should be bound previously in the pattern and one optional
|
2020-04-29 13:47:35 +08:00
|
|
|
string may be specified as an attribute. The following locations are created:
|
2020-04-10 08:18:34 +08:00
|
|
|
|
|
|
|
* If only 1 symbol is specified then that symbol's location is used,
|
|
|
|
* If multiple are specified then a fused location is created;
|
|
|
|
* If no symbol is specified then string must be specified and a NamedLoc is
|
|
|
|
created instead;
|
2020-04-07 22:44:19 +08:00
|
|
|
|
|
|
|
`location` must be used as the last argument to an op creation. For example,
|
|
|
|
|
|
|
|
```tablegen
|
|
|
|
def : Pat<(LocSrc1Op:$src1 (LocSrc2Op:$src2 ...),
|
2020-04-10 08:18:34 +08:00
|
|
|
(LocDst1Op (LocDst2Op ..., (location $src2)), (location "outer"))>;
|
2020-04-07 22:44:19 +08:00
|
|
|
```
|
|
|
|
|
|
|
|
In the above pattern, the generated `LocDst2Op` will use the matched location
|
2020-04-10 08:18:34 +08:00
|
|
|
of `LocSrc2Op` while the root `LocDst1Op` node will used the named location
|
|
|
|
`outer`.
|
2020-04-07 22:44:19 +08:00
|
|
|
|
|
|
|
### `replaceWithValue`
|
|
|
|
|
|
|
|
The `replaceWithValue` directive is used to eliminate a matched op by replacing
|
|
|
|
all of it uses with a captured value. It is of the following syntax:
|
|
|
|
|
|
|
|
```tablegen
|
|
|
|
(replaceWithValue $symbol)
|
|
|
|
```
|
|
|
|
|
|
|
|
where `$symbol` should be a symbol bound previously in the pattern.
|
|
|
|
|
|
|
|
For example,
|
|
|
|
|
|
|
|
```tablegen
|
|
|
|
def : Pat<(Foo $input), (replaceWithValue $input)>;
|
|
|
|
```
|
|
|
|
|
|
|
|
The above pattern removes the `Foo` and replaces all uses of `Foo` with
|
|
|
|
`$input`.
|
2019-09-07 20:25:39 +08:00
|
|
|
|
2019-11-14 20:25:56 +08:00
|
|
|
## Debugging Tips
|
|
|
|
|
|
|
|
### Run `mlir-tblgen` to see the generated content
|
|
|
|
|
|
|
|
TableGen syntax sometimes can be obscure; reading the generated content can be
|
|
|
|
a very helpful way to understand and debug issues. To build `mlir-tblgen`, run
|
|
|
|
`cmake --build . --target mlir-tblgen` in your build directory and find the
|
|
|
|
`mlir-tblgen` binary in the `bin/` subdirectory. All the supported generators
|
|
|
|
can be found via `mlir-tblgen --help`.
|
|
|
|
|
|
|
|
To see the generated code, invoke `mlir-tblgen` with a specific generator by
|
|
|
|
providing include paths via `-I`. For example,
|
|
|
|
|
|
|
|
```sh
|
|
|
|
# To see all the C++ pattern rewrite classes
|
|
|
|
mlir-tblgen --gen-rewriters -I /path/to/mlir/include /path/to/input/td/file
|
|
|
|
```
|
|
|
|
|
2019-11-15 23:33:21 +08:00
|
|
|
### Compilation error: no matching member function for call to 'build'
|
|
|
|
|
2019-12-03 01:12:48 +08:00
|
|
|
This is because DRR is failing to call a `build()` method with result type
|
2019-11-15 23:33:21 +08:00
|
|
|
deduction ability. See [building operations](#building-operations) for more
|
|
|
|
details.
|
|
|
|
|
2019-09-07 20:25:39 +08:00
|
|
|
[TableGen]: https://llvm.org/docs/TableGen/index.html
|
2021-02-01 15:24:21 +08:00
|
|
|
[OpBase]: https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/IR/OpBase.td
|