# Multi-Level Intermediate Representation Overview

The MLIR project aims to define a common intermediate representation (IR) that
will unify the infrastructure required to execute high-performance machine
learning models in TensorFlow and similar ML frameworks. The project will
include the application of HPC techniques, along with the integration of search
algorithms such as reinforcement learning. It aims to reduce the cost of
bringing up new hardware and to improve usability for existing TensorFlow
users.

Note that this repository contains the core of the MLIR framework. The
TensorFlow compilers we are building on top of MLIR will be part of the main
TensorFlow repository soon.

# How to Contribute

Thank you for your interest in contributing to MLIR! Be sure to review the
[contribution guidelines](CONTRIBUTING.md) before you get started.

## More resources

For more information on MLIR, please see:

* [The MLIR draft specification](g3doc/LangRef.md), which describes the IR
  itself.
* [The MLIR rationale document](g3doc/Rationale.md), covering the motivation
  behind some design decisions.
* Previous external [talks](#mlir-talks).

Join the [MLIR mailing list](https://groups.google.com/a/tensorflow.org/forum/#!forum/mlir)
to hear about announcements and discussions.
Please be mindful of the [TensorFlow Code of Conduct](https://github.com/tensorflow/tensorflow/blob/master/CODE_OF_CONDUCT.md),
which pledges to foster an open and welcoming environment.

## What is MLIR for?

MLIR is intended to be a hybrid IR that can support multiple different
requirements in a unified infrastructure. For example, this includes:

* The ability to represent all TensorFlow graphs, including dynamic shapes,
  the user-extensible op ecosystem, TensorFlow variables, etc. (see the IR
  sketch after this list).
* Optimizations and transformations typically done on a TensorFlow graph,
  e.g. in Grappler.
* Quantization and other graph transformations done on a TensorFlow graph or
  the TF Lite representation.
* Representation of kernels for ML operations in a form suitable for
  optimization.
* Ability to host high-performance-computing-style loop optimizations across
  kernels (fusion, loop interchange, tiling, etc.) and to transform memory
  layouts of data.
* Code generation "lowering" transformations such as DMA insertion, explicit
  cache management, memory tiling, and vectorization for 1D and 2D register
  architectures.
* Ability to represent target-specific operations, e.g. the MXU on TPUs.
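
As an informal illustration of the first point above, the sketch below shows a
function over a dynamically shaped tensor in MLIR's textual form (see the
[LangRef](g3doc/LangRef.md) for the actual grammar). The `some_dialect.add` op
name is invented for this example; it uses the generic `"dialect.op"(...)`
syntax that accommodates an extensible op ecosystem:

```mlir
// The "?" dimensions are dynamic; the op name below is illustrative only.
func @double(%arg0: tensor<?x4xf32>) -> tensor<?x4xf32> {
  %0 = "some_dialect.add"(%arg0, %arg0) : (tensor<?x4xf32>, tensor<?x4xf32>) -> tensor<?x4xf32>
  return %0 : tensor<?x4xf32>
}
```
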
MLIR is a common IR that also supports hardware-specific operations. Thus, any
investment into the infrastructure surrounding MLIR (e.g. the compiler passes
that work on it) should yield good returns; many targets can use that
infrastructure and will benefit from it.

MLIR is a powerful representation, but it also has non-goals. We do not try to
support low-level machine code generation algorithms (like register allocation
and instruction scheduling). They are a better fit for lower-level optimizers
(such as LLVM). Also, we do not intend MLIR to be a source language that
end-users would themselves write kernels in (analogous to CUDA C++). While we
would love to see a kernel language happen someday, that will be an independent
project that compiles down to MLIR.

## Compiler infrastructure

We benefited from the experience gained building other IRs (HLO, LLVM, and SIL)
when building MLIR. We directly adopt existing best practices, e.g. writing and
maintaining an IR spec, building an IR verifier, providing the ability to dump
and parse MLIR files to text, writing extensive unit tests with the
[FileCheck](https://llvm.org/docs/CommandGuide/FileCheck.html) tool, and
building the infrastructure as a set of modular libraries that can be combined
in new ways. We plan to use the infrastructure developed by the XLA team for
performance analysis and benchmarking.
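
As a concrete (if hypothetical) sketch of such a FileCheck-driven test, an MLIR
regression test is typically a small `.mlir` file whose `RUN` line pipes it
through `mlir-opt` and checks the printed output; the function and `CHECK`
lines below are invented for illustration:

```mlir
// RUN: mlir-opt %s | FileCheck %s

// CHECK-LABEL: func @identity
func @identity(%arg0: f32) -> f32 {
  // CHECK: return %{{.*}} : f32
  return %arg0 : f32
}
```
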
Other lessons have been incorporated into the design in subtle ways. For
example, LLVM has non-obvious design mistakes that prevent a multithreaded
compiler from working on multiple functions in an LLVM module at the same time.
MLIR solves these problems by having per-function constant pools and by making
references explicit with `function_ref`.
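
As a minimal sketch of what explicit references look like in the textual IR
(the function names here are invented), a call site names its callee by symbol,
so neither function holds a pointer into the other's body and each can be
processed in isolation:

```mlir
func @callee(%arg0: f32) -> f32 {
  return %arg0 : f32
}

// The reference to @callee is a symbol name, not a pointer into another
// function's in-memory representation.
func @caller(%arg0: f32) -> f32 {
  %0 = call @callee(%arg0) : (f32) -> f32
  return %0 : f32
}
```
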
# Getting started with MLIR

The following instructions for compiling and testing MLIR assume that you have
`git`, [`ninja`](https://ninja-build.org/), and a working C++ toolchain. In the
future, we aim to align on the same level of platform support as
[LLVM](https://llvm.org/docs/GettingStarted.html#requirements). For now, MLIR
has been tested on Linux and macOS, with recent versions of clang and with
gcc 7.

```sh
git clone https://github.com/llvm/llvm-project.git
git clone https://github.com/tensorflow/mlir llvm-project/llvm/projects/mlir
mkdir llvm-project/build
cd llvm-project/build
cmake -G Ninja ../llvm -DLLVM_BUILD_EXAMPLES=ON -DLLVM_ENABLE_CXX1Y=Y -DLLVM_TARGETS_TO_BUILD="host"
cmake --build . --target check-mlir
```

To compile and test on Windows using Visual Studio 2017:

```bat
REM Run in a shell with the Visual Studio environment set up, e.g. by first
REM invoking "<visual-studio-install>\Auxiliary\Build\vcvarsall.bat" x64
git clone https://github.com/llvm/llvm-project.git
git clone https://github.com/tensorflow/mlir llvm-project\llvm\projects\mlir
mkdir llvm-project\build
cd llvm-project\build
cmake ..\llvm -G "Visual Studio 15 2017 Win64" -DLLVM_BUILD_EXAMPLES=ON -DLLVM_ENABLE_CXX1Y=Y -DLLVM_TARGETS_TO_BUILD="host" -DCMAKE_BUILD_TYPE=Release -Thost=x64
cmake --build . --target check-mlir
```

As a starter, you may try [the tutorial](g3doc/Tutorials/Toy/Ch-1.md) on
building a compiler for a Toy language.

# MLIR talks

* "[MLIR Primer: A Compiler Infrastructure for the End of Moore’s Law](https://ai.google/research/pubs/pub48035.pdf)"
  * Chris Lattner & Jacques Pienaar, Google, at the
    [Compilers for Machine Learning](https://www.c4ml.org/) workshop at
    [CGO 2019](http://cgo.org/cgo2019/)
* "[MLIR: Multi-Level Intermediate Representation for Compiler
  Infrastructure](https://llvm.org/devmtg/2019-04/talks.html#Keynote_1)"
  * Tatiana Shpeisman & Chris Lattner, Google, at
    [EuroLLVM 2019](https://llvm.org/devmtg/2019-04)
* "[Tutorial: Building a Compiler with MLIR](https://llvm.org/devmtg/2019-04/talks.html#Tutorial_1)"
  * Mehdi Amini, Jacques Pienaar & Nicolas Vasilache, Google, at
    [EuroLLVM 2019](https://llvm.org/devmtg/2019-04)