llvm-project

Commit Graph

Author	SHA1	Message	Date
Dmitry Preobrazhensky	6b79610fd5	[AMDGPU][MC][GFX11][NFC] Correct VOPD parsing Differential Revision: https://reviews.llvm.org/D133492	2022-09-09 13:03:29 +03:00
Simon Pilgrim	55b78e28d8	[CostModel][X86] Add missing i8 throughput cost	2022-09-09 10:58:51 +01:00
Nicolas Vasilache	27cc31b64c	[mlir][vector] NFC - Clean up vector patterns and propagate benefit through populate functions Differential Revision: https://reviews.llvm.org/D133559	2022-09-09 02:45:22 -07:00
Serge Pavlov	55e1441f7b	Revert "[Clang] Use virtual FS in processing config files" This reverts commit `9424497e43`. Some buildbots failed, reverted for investigation.	2022-09-09 16:43:15 +07:00
Brad Smith	9b4c3c2c5b	[mlir] Bump building CRunnerUtils from C++11 to C++17 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D133553	2022-09-09 05:34:22 -04:00
Serge Pavlov	9424497e43	[Clang] Use virtual FS in processing config files Clang has support of virtual file system for the purpose of testing, but treatment of config files did not use it. This change enables VFS in it as well. Differential Revision: https://reviews.llvm.org/D132867	2022-09-09 16:28:51 +07:00
Graham Hunter	1f639d1bd2	[NFC][LV] Convert masked call tests to use update script	2022-09-09 10:07:39 +01:00
Djordje Todorovic	df868edee5	"Recommit "[AggressiveInstCombine] Lower Table Based CTTZ"" This reverts commit `053841c562`. We faced a use-after-free after pushing the D113291, since the foldSqrt() has a call to eraseFromParent(). The function should be at the end of the main loop that folds the patterns. This patch fixes that.	2022-09-09 10:29:39 +02:00
serge-sans-paille	6f2ed8fd3f	[OpenMP] Install ompt-multiplex.h alongside omp.h The default install direction may not be in the compiler search path. Differential Revision: https://reviews.llvm.org/D133420	2022-09-09 09:42:08 +02:00
Vitaly Buka	7dc0734567	[msan] Insert simplification passes after instrumentation This resolves TODO from D96406. InstCombine issue is fixed with D133394. Save 4.5% of .text on CTMark.	2022-09-09 00:33:04 -07:00
Benjamin Kramer	f055d9c549	[bazel] Port `7fa1d743d0`	2022-09-09 09:31:13 +02:00
Alvin Wong	a3a8bd00c8	[clang][MinGW] Add `-mguard=cf` and `-mguard=cf-nochecks` This option can be used to enable Control Flow Guard checks and generation of address-taken function table. They are equivalent to `/guard:cf` and `/guard:cf,nochecks` in clang-cl. Passing this flag to the Clang driver will also pass `--guard-cf` to the MinGW linker. This feature is disabled by default. The option `-mguard=none` is also available to explicitly disable this feature. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D132810	2022-09-09 09:55:40 +03:00
Alvin Wong	bf7c5f1fae	[LLD][MinGW] Add --[no-]guard-cf and --[no-]guard-longjmp These will be LLD-specific options to support Control Flow Guard for the MinGW target. They are disabled by default, but enabling `--guard-cf` will also enable `--guard-longjmp` unless `--no-guard-longjmp` is also specified. These options maps to `-guard:cf,[no]longjmp`. Note that these features require the `_load_config_used` symbol to contain the load config directory and be filled with the required symbols. While current versions of mingw-w64 do not supply this symbol, the user can provide their own version of it. Reviewed By: MaskRay, rnk Differential Revision: https://reviews.llvm.org/D132808	2022-09-09 09:55:40 +03:00
Thomas Raoux	06413618ea	[mlir][vector] Don't duplicate transfer_read during vector distribution Only apply the pattern if the transfer_read can be distributed for all its uses. Differential Revision: https://reviews.llvm.org/D133538	2022-09-09 06:35:40 +00:00
gonglingqin	da8c9521ee	[LoongArch] Add codegen support for frint According to the revised description in `LoongArch Reference Manual v1.02`, frint.[s/d] does not judge whether floating-point inexact exceptions are allowed indicated by FCSR, i.e. always executes roundToIntegralExact(x). What's more, the manual also specifically defines that frint.s/d is only necessary to be defined in LA64. So ISD::FRINT is legal for LA64. Differential Revision: https://reviews.llvm.org/D133337	2022-09-09 14:25:34 +08:00
Craig Topper	aa83bdd198	[DAGCombiner][X86] Fold (sub (subcarry X, 0, Carry), Y) -> (subcarry X, Y, Carry) Fixes PR57576. Differential Revision: https://reviews.llvm.org/D133471	2022-09-08 22:56:46 -07:00
Jakub Kuderski	864236d1c1	[mlir][arith] Support wide integer constant emulation Reviewed By: antiagainst, Mogball Differential Revision: https://reviews.llvm.org/D133136	2022-09-09 00:04:06 -04:00
Christopher Bate	f4a478cd01	[mlir][Tensor] Add rewrites to extract slices through `tensor.collape_shape` This change adds a set of utilities to replace the result of a `tensor.collapse_shape -> tensor.extract_slice` chain with the equivalent result formed by aggregating slices of the `tensor.collapse_shape` source. In general, it is not possible to commute `extract_slice` and `collapse_shape` if linearized dimensions are sliced. The i-th dimension of the `tensor.collapse_shape` result is a "linearized sliced dimension" if: 1) Reassociation indices of tensor.collapse_shape in the i'th position is greater than size 1 (multiple dimensions of the input are collapsed) 2) The i-th dimension is sliced by `tensor.extract_slice`. We can work around this by stitching together the result of `tensor.extract_slice` by iterating over any linearized sliced dimensions. This is equivalent to "tiling" the linearized-and-sliced dimensions of the `tensor.collapse_shape` operation in order to manifest the result tile (the result of the `tensor.extract_slice`). The user of the utilities must provide the mechanism to create the tiling (e.g. a loop). In the tests, it is demonstrated how to apply the utilities using either `scf.for` or `scf.foreach_thread`. The below example illustrates the pattern using `scf.for`: ``` %0 = linalg.generic ... -> tensor<3x7x11x10xf32> %1 = tensor.collapse_shape %0 [[0, 1, 2], [3]] : ... to tensor<341x10xf32> %2 = tensor.extract_slice %1 [13, 0] [10, 10] [2, 1] : .... tensor<10x10xf32> ``` We can construct %2 by generating the following IR: ``` %dest = linalg.init_tensor() : tensor<10x10xf32> %2 = scf.for %iv = %c0 to %c10 step %c1 iter_args(%arg0) -> tensor<10x10xf32> { // Step 1: Map this output idx (%iv) to a multi-index for the input (%3): %linear_index = affine.apply affine_map<(d0)[]->(d0*2 + 11)>(%iv) %3:3 = arith.delinearize_index %iv into (3, 7, 11) // Step 2: Extract the slice from the input %4 = tensor.extract_slice %0 [%3#0, %3#1, %3#2, 0] [1, 1, 1, 10] [1, 1, 1, 1] : tensor<3x7x11x10xf32> to tensor<1x1x1x10xf32> %5 = tensor.collapse_shape %4 [[0, 1, 2], [3]] : tensor<1x1x1x10xf32> into tensor<1x10xf32> // Step 3: Insert the slice into the destination %6 = tensor.insert_slice %5 into %arg0 [%iv, 0] [1, 10] [1, 1] : tensor<1x10xf32> into tensor<10x10xf32> scf.yield %6 : tensor<10x10xf32> } ``` The pattern was discussed in the RFC here: https://discourse.llvm.org/t/rfc-tensor-extracting-slices-from-tensor-collapse-shape/64034 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D129699	2022-09-08 21:58:21 -06:00
Jakub Kuderski	7fa1d743d0	Reland "[mlir][arith] Add wide integer emulation pass" This reverts commit `45b5e8abe5`. Relands https://reviews.llvm.org/D133135 after fixing shared libs builds.	2022-09-08 23:30:47 -04:00
Sheng	88bdc4687d	[NFC][M68k] Correct debug message.	2022-09-09 10:58:37 +08:00
Phoebe Wang	64fb629a06	[LLD] Imply "longjmp" in `/guard:cf` This is MSVC's behaviour. LLD was matching it before D99078. Let's go back this way. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D132901	2022-09-09 10:38:18 +08:00
Vitaly Buka	e261b03396	[sanitizers] Add experimental flag to insert sanitizers earlier	2022-09-08 19:05:17 -07:00
jacquesguan	9d0b90e933	[mlir][Math] Add TruncOp. This patch adds TruncOp for Math, it returns the operand rounded to the nearest integer not larger in magnitude than the operand. And this patch also adds the correspond llvm intrinsic op. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D133342	2022-09-09 10:01:28 +08:00
Alex Bradbury	51ae462447	[RISCV] Add the GlobalMerge pass (disabled by default) Split out from D129178, this just adds the GlobalMerge tests (other than global-merge-minsize.ll which is testing a specific configuration of the pass when it's enabled) and exposes `-riscv-enable-global-merge` and //doesn't enable it by default//. Note that the comment "// FIXME: Unify control over GlobalMerge." is copied from the Arm and AArch64 backends, which expose the same flag. Presumably the author is imagining some later refactoring that provides a target-independent flag. Reviewed By: craig.topper, reames, hiraditya Differential Revision: https://reviews.llvm.org/D130481	2022-09-08 18:40:38 -07:00
Fangrui Song	f9b5924975	[AArch64] Fix -Wunused-variable. NFC	2022-09-08 18:27:16 -07:00
Kai Sasaki	5bb621056b	[mlir][math] Canonicalization for math.floor op Support constant folding for math.floor op as well as math.ceil. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D133398	2022-09-09 10:21:48 +09:00
zhongyunde	b6655333c2	[Peephole] rewrite INSERT_SUBREG to SUBREG_TO_REG if upper bits zero Restrict the 32-bit form of an instruction of integer as too many test cases will be clobber as the register number updated. From %reg = INSERT_SUBREG %reg, %subreg, subidx To %reg:subidx = SUBREG_TO_REG 0, %subreg, subidx Try to prefix the redundant mov instruction at D132325 as the SUBREG_TO_REG should not generate code. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D132939	2022-09-09 09:00:54 +08:00
Peiming Liu	180bf5f940	[mlir][sparse] fix a bug in sparse2sparse reshape. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D133521	2022-09-09 00:32:00 +00:00
Aart Bik	f76dcede3f	[mlir][sparse] rename lex_insert into insert This change goes not impact any semantics yet, but it is in preparation for implementing the unordered and not-unique properties. Changing lex_insert to insert is a first step. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D133531	2022-09-08 17:26:35 -07:00
Fabian Parzefall	579a5a47a9	[BOLT] Add test checking LP trampolines in multi-split This adds a test to verify that when splitting all blocks, landing pad trampolines are inserted in all blocks. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D132426	2022-09-08 17:10:38 -07:00
Fabian Parzefall	3ac46f377a	[BOLT] Emit LSDA call sites for all fragments For exception handling, LSDA call sites have to be emitted for each fragment individually. With this patch, call sites and respective LSDA symbols are generated and associated with each fragment of their function, such that they can be used by the emitter. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D132052	2022-09-08 17:10:29 -07:00
Fabian Parzefall	ae2b4da166	[BOLT] Fragment all blocks (not just outlineable blocks) To enable split strategies that require view of the entire CFG (e.g. to estimate cost of path from entry block), with this patch, all blocks of a function are passed to `SplitStrategy::fragment`. Because this might move non-outlineable blocks into a split fragment, these blocks are moved back into the main fragment after fragmenting. This also gives strategies the option to specify whether empty fragments should be kept or removed. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D132423	2022-09-08 17:10:13 -07:00
Fangrui Song	d8c09b7bbc	Revert D111509 "[clang] use getCommonSugar in an assortment of places" This reverts commit `d42122cd5d`. `clang++ gcc/libstdc++-v3/src/c++98/complex_io.cc` (all language modes) crashes. Also see https://reviews.llvm.org/D111509#3777980	2022-09-08 17:09:18 -07:00
Stanley Winata	8ef94dde56	[mlir][linalg] add conv_1d_ncw_fcw Reviewed By: hanchung, antiagainst Differential Revision: https://reviews.llvm.org/D133465	2022-09-08 19:48:45 -04:00
Philip Reames	c6d304aff3	[docs][RISCV] Add links to respective notes	2022-09-08 16:35:49 -07:00
Fangrui Song	1f6a86b309	[cc1as] Support --compress-debug-sections=zstd `clang -gz=zstd -c a.s` generates ELFCOMPRESS_ZSTD compressed debug info sections if compression decreases size.	2022-09-08 16:31:45 -07:00
Philip Reames	3757ef2e66	Fix a warning caught by llvm-sphinx-docs	2022-09-08 16:22:50 -07:00
Joe Loser	1b3a78d1d5	[clang] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Leave the few call sites that use a locally defined `array_lengthof` that are meant to test previous bugs with NTTPs in clang analyzer and SemaTemplate. Differential Revision: https://reviews.llvm.org/D133520	2022-09-08 17:20:25 -06:00
Fangrui Song	4d4ca6c9d0	[Driver] Rename Z to Zlib	2022-09-08 16:14:31 -07:00
Fangrui Song	781dea021a	[Support] Rename DebugCompressionType::Z to Zlib "Z" was so named when we had both gABI ELFCOMPRESS_ZLIB and the legacy .zdebug support. Now we have just one zlib format, we should use the more descriptive name.	2022-09-08 16:11:29 -07:00
Philip Reames	580f725d48	[docs] Fix a rebase error in `194900f`	2022-09-08 16:04:03 -07:00
Philip Reames	194900fc24	[docs][RISCV] Document status of scalar crypto extensions This is based on a somewhat subjective review of the in-tree support, and where I thought further work was needed before I'd consider these "done". See the review for some discussion around what is left in terms of pattern matching. Differential Revision: https://reviews.llvm.org/D133373	2022-09-08 15:55:10 -07:00
Philip Reames	44e74d8623	[docs][RISCV] Document status of assorted extensions Sorry, there's no real theme to this. It's simply the rest of the extensions we implement which don't need individual discussion. Differential Revision: https://reviews.llvm.org/D133529	2022-09-08 15:55:10 -07:00
Amir Ayupov	31abde43d6	[BOLT] Restrict ICP for functions with unknown control flow ICP has two modes: jump table promotion and indirect call promotion. The selection is based on whether an instruction has a jump table or not. An instruction with unknown control flow doesn't have a jump table and will fall under indirect call promotion policy which might be incorrect/unsafe (if an instruction is not a tail call, i.e. has local jump targets). Prevent ICP for functions containing instructions with unknown control flow. Follow-up to https://reviews.llvm.org/D128870. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D132882	2022-09-08 15:50:40 -07:00
Amir Ayupov	873942e178	[BOLT] Change reorder-blocks deprecated option warning output Revert to using `BOLT-WARNING` Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D132778	2022-09-08 15:48:41 -07:00
Christopher Bate	fb98a623fb	[mlir] NFC - move declaration of `Range` to StaticValueUtils.h `ViewLikeInterface.h` defines a struct `Range`, and it also depends on `StaticValueUtils.h/cpp`. This change moves the definition of `Range` to `StaticValueUtils.h`, which should not violate any existing dependency requirement. This is done to support adding utilities using `Range` under `Dialect/Utils` without creating a circular dependency on `ViewLikeInterface`. It is possible that some targets depend on `ViewLikeInterface` just for the `Range` struct. These can be later updated to depend on the `MLIRDialectUtils` target. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133523	2022-09-08 16:29:56 -06:00
Nicolas Vasilache	b7d47ed1da	[mlir][memref] Add support for 0-D transfer / subview fold. The 0-d case simply forwards the indexing from the source memref and works out of the box. Differential Revision: https://reviews.llvm.org/D133536	2022-09-08 15:25:05 -07:00
rdzhabarov	fba411aa47	Revert "Fix bazel deps.". Original change has been reverted. This reverts commit `b3999fd261`.	2022-09-08 22:11:44 +00:00
Louis Dionne	d529e8110b	[libc++] Fix compilation error on platforms that don't implement std::tm Instead of mentioning tm directly in the definition of __convert_to_tm, take it as a template argument. As a fly-by also fix incorrect Lit feature (should have been no-localization instead of libcpp-has-no-localization). Differential Revision: https://reviews.llvm.org/D133490	2022-09-08 18:10:53 -04:00
Fangrui Song	48203b327d	Revert D130308 "[clang] extend getCommonSugaredType to merge sugar nodes" This reverts commit `16e5d6d7f9`. There are multiple complaints on the review. In addition, it may cause spurious ``` error: invalid operands to binary expression ('SinkPrinter' and 'char[cluster_name_length]') note: candidate template ignored: substitution failure: variably modified type 'char *' cannot be used as a template argument SinkPrinter operator<<(const SinkPrinter &s, T) { ``` for some C++ code	2022-09-08 15:09:39 -07:00

1 2 3 4 5 ...

435583 Commits All Branches Search

435583 Commits

All Branches