llvm-project

Commit Graph

Author	SHA1	Message	Date
NAKAMURA Takumi	a0943a2e19	[Bazel] Add JITLink/COFFOptions.td (llvmorg-16-init-398-g88181375a3db)	2022-08-01 07:07:13 +09:00
Tue Ly	2ff187fbc9	[libc] Implement cosf function that is correctly rounded to all rounding modes. Implement cosf function that is correctly rounded to all rounding modes. Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master) on Ryzen 1700: Before this patch (not correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf CORE-MATH reciprocal throughput : 19.043 System LIBC reciprocal throughput : 26.328 LIBC reciprocal throughput : 30.955 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf --latency GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 49.995 System LIBC latency : 59.286 LIBC latency : 60.174 ``` After this patch (correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 19.072 System LIBC reciprocal throughput : 26.286 LIBC reciprocal throughput : 13.631 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf --latency GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 49.872 System LIBC latency : 59.468 LIBC latency : 56.119 ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D130644	2022-07-29 21:08:31 -04:00
Guillaume Chatelet	f72261508a	[libc][NFC] Use STL case for type_traits Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Starting with the type_traits header. Differential Revision: https://reviews.llvm.org/D130727	2022-07-29 09:57:03 +00:00
Daniele Vettorel	e7c004854d	Add `llvm-dwarfutil` to Bazel targets Adds support for building the `llvm-dwarfutil` tool with Bazel Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D130720	2022-07-28 19:53:37 +00:00
Christian Sigg	f983bdbdae	[MLIR] Fix bazel build after `7356404ace`.	2022-07-28 08:14:18 +02:00
Stella Laurenzo	7356404ace	[mlir] Delete most of the ops from the quant dialect. * https://discourse.llvm.org/t/rfc-removing-the-quant-dialect/3643/8 * Removes most ops. Leaves casts given final comment (can remove more in a followup). * There are a few uses in Tosa keeping some of the utilities alive. In a followup, I will probably elect to just move simplified versions of them into Tosa itself vs having this quasi-library dependency. Differential Revision: https://reviews.llvm.org/D120204	2022-07-27 17:50:42 -07:00
Tue Ly	15b9380dfd	[libc] Change sinf range reduction to mod pi/16 to be shared with cosf. Change `sinf` range reduction to mod pi/16 to be shared with `cosf`. Previously, `sinf` used range reduction `mod pi`, but this cannot be used to implement `cosf` since the minimax algorithm for `cosf` does not converge due to critical points at `pi/2`. In order to be able to share the same range reduction functions for both `sinf` and `cosf`, we change the range reduction to `mod pi/16` for the following reasons: - The table size is sufficiently small: 32 entries for `sin(k * pi/16)` with `k = 0..31`. It could be reduced to 16 entries if we treat the final sign separately, with an extra multiplication at the end. - The polynomials' degrees are reduced to 7/8 from 15, with extra computations to combine `sin` and `cos` with trig sum equality. - The number of exceptional cases reduced to 2 (with FMA) and 3 (without FMA). - The latency is reduced while maintaining similar throughput as before. Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D130629	2022-07-27 12:23:36 -04:00
Benjamin Kramer	1f5144cdbb	[bazel] Port `5caa941f68`	2022-07-27 16:12:58 +02:00
NAKAMURA Takumi	3e0b557002	[Bazel] Bump to v16.0.0, corresponding to llvmorg-16-init	2022-07-27 22:41:53 +09:00
Alex Zinenko	ea460b7ddb	[mlir] update Bazel for `e99fae8997`	2022-07-27 09:42:07 +00:00
Alexander Belyaev	6cfaab5692	[mlir] Sort the libraties in BUILD.bazel.	2022-07-26 16:32:40 +02:00
Alexander Belyaev	4825614a46	[mlir] Update bazel build.	2022-07-26 16:28:29 +02:00
Benjamin Kramer	9484ddbfa1	[bazel] Port `628fbbef81`	2022-07-26 15:36:15 +02:00
Dmitri Gribenko	ed33d0878f	[bazel] Run autoformatter on BUILD.bazel	2022-07-26 13:12:36 +02:00
Benjamin Kramer	bf759e3b10	[bazel] Port `7a5cb15ea6`	2022-07-26 12:53:38 +02:00
Weverything	de43f93a82	[bazel] Add new rule for `c60b897d22`	2022-07-25 20:29:01 -07:00
Alex Zinenko	333ee218ce	[mlir] Transform dialect: separate dependent and generated dialects In the Transform dialect extensions, provide the separate mechanism to declare dependent dialects (the dialects the transform IR depends on) and the generated dialects (the dialects the payload IR may be transformed into). This allows the Transform dialect clients that are only constructing the transform IR to avoid loading the dialects relevant for the payload IR along with the Transform dialect itself, thus decreasing the build/link time. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130289	2022-07-25 09:59:53 +00:00
Benjamin Kramer	66e66117ba	[bazel] Add missing dependencies after `535b507ba5`	2022-07-23 13:25:23 +02:00
Tue Ly	d883a4ad02	[libc] Implement sinf function that is correctly rounded to all rounding modes. Implement sinf function that is correctly rounded to all rounding modes. - We use a simple range reduction for `pi/16 < \|x\|` : Let `k = round(x / pi)` and `y = (x/pi) - k`. So `k` is an integer and `-0.5 <= y <= 0.5`. Then ``` sin(x) = sin(ypi + kpi) = (-1)^(k & 1) * sin(ypi) ~ (-1)^(k & 1) y * P(y^2) ``` where `yP(y^2)` is a degree-15 minimax polynomial generated by Sollya with: ``` > P = fpminimax(sin(xpi)/x, [\|0, 2, 4, 6, 8, 10, 12, 14\|], [\|D...\|], [0, 0.5]); ``` - Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master) on Ryzen 1700: Before this patch (not correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.892 System LIBC reciprocal throughput : 25.559 LIBC reciprocal throughput : 29.381 ``` After this patch (correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.896 System LIBC reciprocal throughput : 25.740 LIBC reciprocal throughput : 27.872 LIBC reciprocal throughput : 20.012 (with `-msse4.2` flag) LIBC reciprocal throughput : 14.244 (with `-mfma` flag) ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D123154	2022-07-22 10:07:31 -04:00
Augie Fackler	a4ee8a31ce	[bazel] add headers now required after `17e4c217b6`	2022-07-21 15:39:29 -04:00
Nicolas Vasilache	1f77f01c65	[mlir][Linalg] Add a Transform dialect NavigationOp op to match a list of ops or an interface. This operation is a NavigationOp that simplifies the writing of transform IR. Since there is no way of refering to an interface by name, the current implementation uses an EnumAttr and depends on the interfaces it supports. In the future, it would be worthwhile to remove this dependence and generalize. Differential Revision: https://reviews.llvm.org/D130267	2022-07-21 07:11:42 -07:00
Benjamin Kramer	439668871a	[bazel] Also add -lrt to OrcTargetProcess for `1b1f1c7786`	2022-07-20 11:28:47 +02:00
Benjamin Kramer	24c88c90a8	[bazel] Add -lrt on non-darwin/non-windows for `1b1f1c7786` For shm_open in orc jit.	2022-07-20 11:24:13 +02:00
Sriraman Tallam	16cccc66b8	Bazel BUILD file for BOLT. Differential Revision: https://reviews.llvm.org/D129899	2022-07-19 16:03:52 -07:00
Cole Kissane	e939bf67e3	[llvm] add zstd to `llvm::compression` namespace - add zstd to `llvm::compression` namespace - add a CMake option `LLVM_ENABLE_ZSTD` with behavior mirroring that of `LLVM_ENABLE_ZLIB` - add tests for zstd to `llvm/unittests/Support/CompressionTest.cpp` - debian users should install libzstd when using `LLVM_ENABLE_ZSTD=FORCE_ON` from source due to this bug https://bugs.launchpad.net/ubuntu/+source/libzstd/+bug/1941956 Reviewed By: leonardchan, MaskRay Differential Revision: https://reviews.llvm.org/D128465	2022-07-19 10:54:36 -07:00
Benjamin Kramer	b9ad55c6d4	[bazel] Fix the build after `18b92c66fe`	2022-07-19 17:34:39 +02:00
Benjamin Kramer	9235fafd6e	[bazel] Remove libraries that don't build anymore after `5e83a5b475` I don't know who uses these python extensions, probably nobody.	2022-07-19 17:13:23 +02:00
Aart Bik	28ebb0b61d	[mlir][sparse] migrate sparse rewriting to sparse transformations pass The rules in the linalg file were very specific to sparse tensors so will find a better home under sparse tensor dialect than linalg dialect. Also moved some rewriting from sparsification into this new "pre-rewriting" file. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D129910	2022-07-18 09:29:22 -07:00
Alex Zinenko	e0fc33eba5	[mlir] Fix Bazel for `5e83a5b475` Export the __init__.py from _mlir_libs.	2022-07-18 15:35:23 +02:00
Stella Laurenzo	5e83a5b475	[mlir] Overhaul C/Python registration APIs to properly scope registration/loading activities. Since the very first commits, the Python and C MLIR APIs have had mis-placed registration/load functionality for dialects, extensions, etc. This was done pragmatically in order to get bootstrapped and then just grew in. Downstreams largely bypass and do their own thing by providing various APIs to register things they need. Meanwhile, the C++ APIs have stabilized around this and it would make sense to follow suit. The thing we have observed in canonical usage by downstreams is that each downstream tends to have native entry points that configure its installation to its preferences with one-stop APIs. This patch leans in to this approach with `RegisterEverything.h` and `mlir._mlir_libs._mlirRegisterEverything` being the one-stop entry points for the "upstream packages". The `_mlir_libs.__init__.py` now allows customization of the environment and Context by adding "initialization modules" to the `_mlir_libs` package. If present, `_mlirRegisterEverything` is treated as such a module. Others can be added by downstreams by adding a `_site_initialize_{i}.py` module, where '{i}' is a number starting with zero. The number will be incremented and corresponding module loaded until one is not found. Initialization modules can: * Perform load time customization to the global environment (i.e. registering passes, hooks, etc). * Define a `register_dialects(registry: DialectRegistry)` function that can extend the `DialectRegistry` that will be used to bootstrap the `Context`. * Define a `context_init_hook(context: Context)` function that will be added to a list of callbacks which will be invoked after dialect registration during `Context` initialization. Note that the `MLIRPythonExtension.RegisterEverything` is not included by default when building a downstream (its corresponding behavior was prior). For downstreams which need the default MLIR initialization to take place, they must add this back in to their Python CMake build just like they add their own components (i.e. to `add_mlir_python_common_capi_library` and `add_mlir_python_modules`). It is perfectly valid to not do this, in which case, only the things explicitly depended on and initialized by downstreams will be built/packaged. If the downstream has not been set up for this, it is recommended to simply add this back for the time being and pay the build time/package size cost. CMake changes: * `MLIRCAPIRegistration` -> `MLIRCAPIRegisterEverything` (renamed to signify what it does and force an evaluation: a number of places were incidentally linking this very expensive target) * `MLIRPythonSoure.Passes` removed (without replacement: just drop) * `MLIRPythonExtension.AllPassesRegistration` removed (without replacement: just drop) * `MLIRPythonExtension.Conversions` removed (without replacement: just drop) * `MLIRPythonExtension.Transforms` removed (without replacement: just drop) Header changes: * `mlir-c/Registration.h` is deleted. Dialect registration functionality is now in `IR.h`. Registration of upstream features are in `mlir-c/RegisterEverything.h`. When updating MLIR and a couple of downstreams, I found that proper usage was commingled so required making a choice vs just blind S&R. Python APIs removed: * mlir.transforms and mlir.conversions (previously only had an __init__.py which indirectly triggered `mlirRegisterTransformsPasses()` and `mlirRegisterConversionPasses()` respectively). Downstream impact: Remove these imports if present (they now happen as part of default initialization). * mlir._mlir_libs._all_passes_registration, mlir._mlir_libs._mlirTransforms, mlir._mlir_libs._mlirConversions. Downstream impact: None expected (these were internally used). C-APIs changed: * mlirRegisterAllDialects(MlirContext) now takes an MlirDialectRegistry instead. It also used to trigger loading of all dialects, which was already marked with a TODO to remove -- it no longer does, and for direct use, dialects must be explicitly loaded. Downstream impact: Direct C-API users must ensure that needed dialects are loaded or call `mlirContextLoadAllAvailableDialects(MlirContext)` to emulate the prior behavior. Also see the `ir.c` test case (e.g. ` mlirContextGetOrLoadDialect(ctx, mlirStringRefCreateFromCString("func"));`). * mlirDialectHandle* APIs were moved from Registration.h (which now is restricted to just global/upstream registration) to IR.h, arguably where it should have been. Downstream impact: include correct header (likely already doing so). C-APIs added: * mlirContextLoadAllAvailableDialects(MlirContext): Corresponds to C++ API with the same purpose. Python APIs added: * mlir.ir.DialectRegistry: Mapping for an MlirDialectRegistry. * mlir.ir.Context.append_dialect_registry(MlirDialectRegistry) * mlir.ir.Context.load_all_available_dialects() * mlir._mlir_libs._mlirAllRegistration: New native extension that exposes a `register_dialects(MlirDialectRegistry)` entry point and performs all upstream pass/conversion/transforms registration on init. In this first step, we eagerly load this as part of the __init__.py and use it to monkey patch the Context to emulate prior behavior. * Type caster and capsule support for MlirDialectRegistry This should make it possible to build downstream Python dialects that only depend on a subset of MLIR. See: https://github.com/llvm/llvm-project/issues/56037 Here is an example PR, minimally adapting IREE to these changes: https://github.com/iree-org/iree/pull/9638/files In this situation, IREE is opting to not link everything, since it is already configuring the Context to its liking. For projects that would just like to not think about it and pull in everything, add `MLIRPythonExtension.RegisterEverything` to the list of Python sources getting built, and the old behavior will continue. Reviewed By: mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D128593	2022-07-16 17:27:50 -07:00
Tue Ly	0f782b84cb	[libc] Add nearest integer instructions to fputil. Add round to nearest integer instructions to fputil. This will be used in sinf implementation https://reviews.llvm.org/D123154 Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D129776	2022-07-14 13:20:35 -04:00
Amara Emerson	6e6be5f950	Revert "[llvm] add zstd to llvm::compression namespace" This reverts commit `d449c60076`. Breaks macOS builds with this: llvm/lib/Support/Compression.cpp:24:10: fatal error: 'zstd.h' file not found	2022-07-14 01:23:20 -07:00
Cole Kissane	d449c60076	[llvm] add zstd to llvm::compression namespace - add `FindZSTD.cmake` - add zstd to `llvm::compression` namespace - add a CMake option `LLVM_ENABLE_ZSTD` with behavior mirroring that of `LLVM_ENABLE_ZLIB` - add tests for zstd to `llvm/unittests/Support/CompressionTest.cpp` Reviewed By: leonardchan, MaskRay Differential Revision: https://reviews.llvm.org/D128465	2022-07-13 19:58:42 -07:00
Cole Kissane	5ecb161c64	Revert "[llvm] add zstd to `llvm::compression` namespace" This reverts commit `cef07169ec`.	2022-07-13 19:48:29 -07:00
Cole Kissane	cef07169ec	[llvm] add zstd to `llvm::compression` namespace - add `FindZSTD.cmake` - add zstd to `llvm::compression` namespace - add a CMake option `LLVM_ENABLE_ZSTD` with behavior mirroring that of `LLVM_ENABLE_ZLIB` - add tests for zstd to `llvm/unittests/Support/CompressionTest.cpp` Reviewed By: leonardchan, MaskRay Differential Revision: https://reviews.llvm.org/D128465	2022-07-13 19:06:27 -07:00
Jorge Gorbe Moya	d6071fa52d	[bazel] add missing gmock dependency to //clang/unittests:format_tests	2022-07-12 18:13:42 -07:00
Krzysztof Drewniak	d6ef3d20b4	[mlir] Remove VectorToROCDL Between issues such as https://github.com/llvm/llvm-project/issues/56323, the fact that this lowering (unlike the code in amdgpu-to-rocdl) does not correctly set up bounds checks (and thus will cause page faults on reads that might need to be padded instead), and that fixing these problems would, essentially, involve replicating amdgpu-to-rocdl, remove --vector-to-rocdl for being broken. In addition, the lowering does not support many aspects of transfer_{read,write}, like supervectors, and may not work correctly in their presence. We (the MLIR-based convolution generator at AMD) do not use this conversion pass, nor are we aware of any other clients. Migration strategies: - Use VectorToLLVM - If buffer ops are particularly needed in your application, use amdgpu.raw_buffer_{load,store} A VectorToAMDGPU pass may be introduced in the future. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D129308	2022-07-12 15:21:22 +00:00
Alex Zinenko	3963b4d0dc	[mlir] Transform op for multitile size generation Introduce a structured transform op that emits IR computing the multi-tile sizes with requested parameters (target size and divisor) for the given structured op. The sizes may fold to arithmetic constant operations when the shape is constant. These operations may then be used to call the existing tiling transformation with a single non-zero dynamic size (i.e. perform strip-mining) for each of the dimensions separately, thus achieving multi-size tiling with optional loop interchange. A separate test exercises the entire script. Depends On D129217 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D129287	2022-07-12 12:36:28 +00:00
Alex Zinenko	4e4a4c0576	[mlir] Allow Tile transform op to take dynamic sizes Extend the definition of the Tile structured transform op to enable it accepting handles to operations that produce tile sizes at runtime. This is useful by itself and prepares for more advanced tiling strategies. Note that the changes are relevant only to the transform dialect, the tiling transformation itself already supports dynamic sizes. Depends On D129216 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D129217	2022-07-12 12:21:54 +00:00
Leonard Chan	474c873148	Revert "[llvm] cmake config groundwork to have ZSTD in LLVM" This reverts commit `f07caf20b9` which seems to break upstream https://lab.llvm.org/buildbot/#/builders/109/builds/42253.	2022-07-08 13:48:05 -07:00
Cole Kissane	f07caf20b9	[llvm] cmake config groundwork to have ZSTD in LLVM - added `FindZSTD.cmake` - added a CMake option `LLVM_ENABLE_ZSTD` with behavior mirroring that of `LLVM_ENABLE_ZLIB` - likewise added have_zstd to compiler-rt/test/lit.common.cfg.py, clang-tools-extra/clangd/test/lit.cfg.py, and several lit.site.cfg.py.in files mirroring have_zlib behavior Reviewed By: leonardchan, MaskRay Differential Revision: https://reviews.llvm.org/D128465	2022-07-08 11:46:52 -07:00
Jacques Pienaar	e60cc52b79	[mlir][bzl] Update for `1a92dbcfa8` and `cab44c515c`	2022-07-07 17:36:28 -07:00
Adrian Kuegel	f066a0cd21	[llvm][Debuginfod][Bazel] Match dependencies in CMakeLists.txt. Also update llvm-config.h and llvm-config.h.cmake to match `484b1aa611` Differential Revision: https://reviews.llvm.org/D129252	2022-07-07 09:25:52 +02:00
NAKAMURA Takumi	71c9757474	[Bazel] Fixup to llvmorg-15-init-15618-ge0b520865026, s/dxil/dx/	2022-07-07 07:03:16 +09:00
Adrian Kuegel	3decc2f04d	[mlir][Bazel] Fix Bazel build after `a2158374ba`	2022-07-06 08:47:48 +02:00
Christian Sigg	3e01af093f	[mlir] Add InferIntRangeInterface to gpu.launch Infers block/grid dimensions/indices or ranges of such dimensions/indices. Reviewed By: krzysz00 Differential Revision: https://reviews.llvm.org/D129036	2022-07-05 07:14:54 +02:00
Nicolas Vasilache	7fbf55c927	[mlir][Tensor] Move ParallelInsertSlice to the tensor dialect This is moslty NFC and will allow tensor.parallel_insert_slice to gain rank-reducing semantics by reusing the vast majority of the tensor.insert_slice impl. Depends on D128857 Differential Revision: https://reviews.llvm.org/D128920	2022-07-04 01:53:12 -07:00
NAKAMURA Takumi	1ecfc12b0c	[Bazel] Make `builtin_headers_gen` as subset of CMake's `clang-resource-headers` At the moment, two files are not installed by CMake. - `lib/Headers/openmp_wrappers/time.h` - `lib/Headers/ppc_wrappers/nmmintrin.h` `builtin_headers_gen` is available as the source of rules_pkg. The difference of the layout of installed headers makes cache hit harder.	2022-07-03 15:46:38 +09:00
Arthur Eubanks	bcd153485e	[bazel] Fix invalid characters	2022-07-01 13:47:56 -07:00
Arthur Eubanks	5a65c5180e	[bazel] Port `43dc3190`, adding rules to generate dxil intrinsics	2022-07-01 13:38:43 -07:00

1 2 3 4 5 ...

556 Commits