Jianzhou Zhao ea981165a4 [dfsan] Track field/index-level shadow values in variables		2020-12-09 19:38:35 +00:00

*************
* The problem
*************
See motivation examples in compiler-rt/test/dfsan/pair.cpp.

The current DFSan always uses a 16-bit shadow value for a variable of any type by combining the shadow values of all bytes of the variable. So it cannot distinguish two fields of a struct: each field's shadow value equals the combined shadow value of all fields. This introduces an overtaint issue.

Consider a parsing function
  std::pair<char*, int> get_token(char* p);
where p points to a buffer to parse, and the returned pair includes the next token and the pointer to the position in the buffer after the token. If the token is tainted, then both the returned pointer and int are tainted. If the parser keeps on using get_token for the rest of the parsing, all the following outputs are tainted because of the tainted pointer.

This CL is the first change to address the issue.

**************************
* The proposed improvement
**************************
Eventually all fields and indices have their own shadow values in variables and memory. For example, variables with type {i1, i3}, [2 x i1], {[2 x i4], i8}, [2 x {i1, i1}] have shadow values with type {i16, i16}, [2 x i16], {[2 x i16], i16}, [2 x {i16, i16}] correspondingly; variables with primary type still have shadow values i16.

*********************************
* A potential implementation plan
*********************************
The idea is to adopt the change incrementally.
1) This CL: support field-level accuracy at variables/args/ret in TLS mode; load/store/alloca still use combined shadow values. After the alloca promotion and SSA construction phases (>=-O1), we assume alloca and memory operations are reduced. So if struct variables do not relate to memory, their tracking is accurate at field level.
2) Support field-level accuracy at alloca.
3) Support field-level accuracy at load/store. Steps 2 and 3 should make O0 and real memory access work.
4) Support vector if necessary.
5) Support Args mode if necessary.
6) Support passing more accurate shadow values via custom functions if necessary.

***************
* About this CL
***************
The CL did the following:
1) Extended TLS arg/ret to work with aggregate types. This is similar to what MSan does.
2) Implemented how to map between an original type/value/zero-const and its shadow type/value/zero-const.
3) Extended (insert|extract)value to use field/index-level propagation.
4) For other instructions, propagation rules combine inputs by or. The CL converts between aggregate and primary shadow values in these cases.
5) Custom function interfaces also need such a conversion because all existing custom functions use i16. It is not yet clear whether custom functions need more accurate shadow propagation.
6) Added test cases for aggregate-type-related cases.

Reviewed-by: morehouse
Differential Revision: https://reviews.llvm.org/D92261
.github/workflows	[PATCH] github actions: Pepare main branch for being read/write	2020-12-06 22:16:36 -08:00
clang	Don't setup inalloca for swiftcc on i686-windows-msvc	2020-12-09 11:08:48 -08:00
clang-tools-extra	[clangd] Extract per-dir CDB cache to its own threadsafe class. NFC	2020-12-09 17:40:12 +01:00
compiler-rt	[dfsan] Track field/index-level shadow values in variables	2020-12-09 19:38:35 +00:00
debuginfo-tests	Include BuiltinAttributes.h in llvm-prettyprinters/gdb/mlir-support.cpp	2020-12-04 15:57:05 -06:00
flang	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove the corresponding methods from OpState.	2020-12-09 12:11:32 +01:00
libc	[libc] Raise x87 exceptions by synchronizing with "fwait".	2020-12-08 13:16:19 -08:00
libclc	libclc: Use find_package to find Python 3 and require it	2020-10-01 22:31:33 +02:00
libcxx	[libc++] Run back-deployment CI on macOS 10.15 instead of 10.14	2020-12-09 11:35:15 -05:00
libcxxabi	[libc++abi] Don't try calling __libcpp_aligned_free when aligned allocation is disabled	2020-12-01 17:45:14 -05:00
libunwind	[libunwind] Multiple preprocessor fixes on PowerPC*	2020-11-23 19:07:21 -08:00
lld	[COFF][LTO][NPM] Use NPM for LTO with ENABLE_EXPERIMENTAL_NEW_PASS_MANAGER	2020-12-09 08:53:50 -08:00
lldb	[lldb][NFC] Refactor _get_bool_config_skip_if_decorator	2020-12-09 20:02:06 +01:00
llvm	[dfsan] Track field/index-level shadow values in variables	2020-12-09 19:38:35 +00:00
mlir	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove the corresponding methods from OpState.	2020-12-09 12:11:32 +01:00
openmp	[libomptarget][amdgpu] Let default number of teams equal number of CUs	2020-12-09 19:35:34 +00:00
parallel-libs	Reapply "Try enabling -Wsuggest-override again, using add_compile_options instead of add_compile_definitions for disabling it in unittests/ directories."	2020-07-22 17:50:19 -07:00
polly	[Polly][CodeGen] Remove use of ScalarEvolution.	2020-12-07 15:21:51 -06:00
pstl	[pstl] Replace direct use of assert() with _PSTL_ASSERT	2020-11-02 18:35:54 -05:00
utils/arcanist	Fix arc lint's clang-format rule: only format the file we were asked to format.	2020-10-11 14:24:23 -07:00
.arcconfig	Set the target branch for `arc land` to main	2020-12-07 21:57:32 +00:00
.arclint	PR46997: don't run clang-format on clang's testcases.	2020-08-04 17:53:25 -07:00
.clang-format	…
.clang-tidy	- Update .clang-tidy to ignore parameters of main like functions for naming violations in clang and llvm directory	2020-01-31 16:49:45 +00:00
.git-blame-ignore-revs	NFC: Add whitespace-changing revisions to .git-blame-ignore-revs	2020-09-21 20:17:24 -04:00
.gitignore	[NFC] Adding pythonenv* to .gitignore	2020-09-03 22:42:27 -04:00
CONTRIBUTING.md	…
README.md	Revert "This is a test commit"	2020-10-21 09:34:15 +08:00

README.md

The LLVM Compiler Infrastructure

This directory and its sub-directories contain source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The README briefly describes how to get started with building LLVM. For more information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting Started with the LLVM System

Taken from https://llvm.org/docs/GettingStarted.html.

Overview

Welcome to the LLVM project!

The LLVM project has multiple components. The core of the project is itself called "LLVM". This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer. It also contains basic regression tests.

C-like languages use the Clang front end. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

The LLVM Getting Started documentation may be out of date. The Clang Getting Started page might have more accurate information.

This is an example work-flow and configuration to get and build the LLVM source:

Checkout LLVM (including related sub-projects like Clang):
- git clone https://github.com/llvm/llvm-project.git
- Or, on Windows, git clone --config core.autocrlf=false https://github.com/llvm/llvm-project.git
Configure and build LLVM and Clang:
- cd llvm-project
- mkdir build
- cd build
- cmake -G <generator> [options] ../llvm
  
  Some common build system generators are:
  - Ninja --- for generating Ninja build files. Most LLVM developers use Ninja.
  - Unix Makefiles --- for generating make-compatible parallel makefiles.
  - Visual Studio --- for generating Visual Studio projects and solutions.
  - Xcode --- for generating Xcode projects.
  Some common options:
  - -DLLVM_ENABLE_PROJECTS='...' --- semicolon-separated list of the LLVM sub-projects you'd like to additionally build. Can include any of: clang, clang-tools-extra, libcxx, libcxxabi, libunwind, lldb, compiler-rt, lld, polly, or debuginfo-tests.
    
    For example, to build LLVM, Clang, libcxx, and libcxxabi, use -DLLVM_ENABLE_PROJECTS="clang;libcxx;libcxxabi".
  - -DCMAKE_INSTALL_PREFIX=directory --- Replace directory with the full path where you want the LLVM tools and libraries to be installed (default /usr/local).
  - -DCMAKE_BUILD_TYPE=type --- Valid options for type are Debug, Release, RelWithDebInfo, and MinSizeRel. Default is Debug.
  - -DLLVM_ENABLE_ASSERTIONS=On --- Compile with assertion checks enabled (default is Yes for Debug builds, No for all other build types).
- cmake --build . [-- [options] <target>] or your build system specified above directly.
  - The default target (i.e. ninja or make) will build all of LLVM.
  - The check-all target (i.e. ninja check-all) will run the regression tests to ensure everything is in working order.
  - CMake will generate targets for each tool and library, and most LLVM sub-projects generate their own check-<project> target.
  - Running a serial build will be slow. To improve speed, try running a parallel build. That's done by default in Ninja; for make, use the option -j NNN, where NNN is the number of parallel jobs, e.g. the number of CPUs you have.
- For more information see CMake
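
The work-flow above can be sketched as a small script. This is only an illustration under stated assumptions: Ninja as the generator, a Release build, and clang plus lld as the extra projects are arbitrary choices, and the names DRY_RUN, run, and build_llvm are hypothetical helpers, not part of LLVM. It also assumes CMake 3.13 or newer, since it uses the -S/-B options in place of the mkdir and cd steps. By default it only prints each command.

```shell
#!/bin/sh
# Sketch only: GENERATOR, BUILD_TYPE, PROJECTS, DRY_RUN, run and build_llvm
# are illustrative choices, not names defined by LLVM.
DRY_RUN="${DRY_RUN:-1}"       # 1 = print the commands only, 0 = execute them
GENERATOR="Ninja"
BUILD_TYPE="Release"
PROJECTS="clang;lld"          # keep quoted: an unquoted ';' would end the command

# Print a command, then execute it unless this is a dry run.
run() {
  printf '+ %s\n' "$*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

# Clone, configure, build, and run the regression tests.
# -S/-B (CMake 3.13+) name the source and build directories explicitly.
build_llvm() {
  run git clone https://github.com/llvm/llvm-project.git &&
  run cmake -G "$GENERATOR" -S llvm-project/llvm -B llvm-project/build \
      "-DLLVM_ENABLE_PROJECTS=$PROJECTS" \
      "-DCMAKE_BUILD_TYPE=$BUILD_TYPE" \
      -DLLVM_ENABLE_ASSERTIONS=On &&
  run cmake --build llvm-project/build &&
  run cmake --build llvm-project/build --target check-all
}

build_llvm
```

Set DRY_RUN=0 to execute the steps for real. Quoting the whole -DLLVM_ENABLE_PROJECTS argument matters because the project list is separated by semicolons, which the shell would otherwise treat as command separators.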

Consult the Getting Started with LLVM page for detailed information on configuring and compiling LLVM. You can visit Directory Layout to learn about the layout of the source code tree.