2019-03-29 08:37:29 +08:00
# Multi-Level Intermediate Representation Overview
The MLIR project aims to define a common intermediate representation (IR) that
will unify the infrastructure required to execute high performance machine
learning models in TensorFlow and similar ML frameworks. This project will
include the application of HPC techniques, along with integration of search
algorithms like reinforcement learning. This project aims to reduce the cost to
bring up new hardware, and improve usability for existing TensorFlow users.
2019-04-03 23:28:29 +08:00
Note that this repository contains the core of the MLIR framework, the
Tensorflow compilers we are building on top of MLIR will be part of the
main Tensorflow repository soon.
2019-03-29 08:37:29 +08:00
## More resources
For more information on MLIR, please see:
2019-04-02 02:19:02 +08:00
* [The MLIR draft specification ](g3doc/LangRef.md ), which describes the IR
itself,
* [The MLIR rationale document ](g3doc/Rationale.md ), covering motivation
behind some decisions,
2019-03-29 08:37:29 +08:00
* previous external [talks ](#talks ),
2019-03-30 13:10:12 +08:00
or join the [MLIR mailing list ](https://groups.google.com/a/tensorflow.org/forum/#!forum/mlir ).
2019-04-04 00:40:08 +08:00
Please be mindful of the [TensorFlow Code of Conduct ](https://github.com/tensorflow/tensorflow/blob/master/CODE_OF_CONDUCT.md )
that pledges to foster an open and welcoming environment.
2019-03-29 08:37:29 +08:00
## What is MLIR for?
MLIR is intended to be a hybrid IR which can support multiple different
requirements in a unified infrastructure. For example, this includes:
* The ability to represent all TensorFlow graphs, including dynamic shapes,
the user-extensible op ecosystem, TensorFlow variables, etc.
* Optimizations and transformations typically done on a TensorFlow graph, e.g.
in Grappler.
* Quantization and other graph transformations done on a TensorFlow graph or
the TF Lite representation.
* Representation of kernels for ML operations in a form suitable for
optimization.
* Ability to host high-performance-computing-style loop optimizations across
kernels (fusion, loop interchange, tiling, etc), and transform memory
layouts of data.
* Code generation "lowering" transformations such as DMA insertion, explicit
cache management, memory tiling, and vectorization for 1D and 2D register
architectures.
* Ability to represent target-specific operations, e.g. the MXU on TPUs.
2019-03-29 09:28:14 +08:00
MLIR is a common IR which also supports hardware specific operations. Thus,
2019-03-29 08:37:29 +08:00
any investment into the infrastructure surrounding MLIR (e.g. the compiler
passes that work on it) should yield good returns; many targets can use that
infrastructure and will benefit from it.
MLIR is a powerful representation, but it also has non-goals. We do not try to
support low level machine code generation algorithms (like register allocation
and instruction scheduling). They are a better fit for lower level optimizers
(such as LLVM). Also, we do not intend MLIR to be a source language that
end-users would themselves write kernels in (analogous to CUDA C++). While we'd
love to see a kernel language happen someday, that will be an independent
project that compiles down to MLIR.
## Compiler Infrastructure {#compiler-infrastructure}
We benefitted from the experience gained building HLO, LLVM and SIL when
building MLIR. We will directly adopt existing best practices, e.g. writing and
2019-03-29 09:28:14 +08:00
maintaining an IR spec, building an IR verifier, providing the ability to dump
and parse MLIR files to text, writing extensive unit tests with the
[FileCheck ](https://llvm.org/docs/CommandGuide/FileCheck.html ) tool, and
building the infrastructure as a set of modular libraries that can be combined
in new ways. We plan to use the infrastructure developed by the XLA team for
2019-03-29 08:37:29 +08:00
performance analysis and benchmarking.
Other lessons have been incorporated and integrated into the design in subtle
ways. For example, LLVM has non-obvious design mistakes that prevent a
multithreaded compiler from working on multiple functions in an LLVM module at
the same time. MLIR solves these problems by having per-function constant pools
and by making references explicit with function_ref.
2019-03-30 13:10:12 +08:00
# Getting started with MLIR
2019-04-02 02:19:02 +08:00
MLIR has been tested on Linux and MacOS, with a recent clang or with gcc 7.
2019-03-30 13:10:12 +08:00
```
git clone https://github.com/llvm/llvm-project.git
2019-04-02 02:19:02 +08:00
cd llvm-projects/llvm/projects/
2019-03-30 13:10:12 +08:00
git clone https://github.com/tensorflow/mlir
cd ../../
mkdir build
cd build
2019-04-03 06:15:20 +08:00
cmake -G Ninja ../llvm/ -DLLVM_BUILD_EXAMPLES=ON
2019-03-30 13:10:12 +08:00
ninja check-mlir
```
2019-04-03 09:07:26 +08:00
As a starter, you may try [the tutorial ](g3doc/Tutorials/Toy/Ch-1.md ) on
building a compiler for a Toy language.
2019-03-29 08:37:29 +08:00
# MLIR talks {#talks}
* "[MLIR Primer: A Compiler Infrastructure for the End of Moore’ s Law](https://drive.google.com/file/d/1hUeAJXcAXwz82RXA5VtO5ZoH8cVQhrOK/view?usp=sharing)",
Chris Lattner & Jacques Pienaar, Google at
[Compilers for Machine Learning ](https://www.c4ml.org/ ) workshop at
[CGO 2019 ](http://cgo.org/cgo2019/ ).