llvm-project

Commit Graph

Author	SHA1	Message	Date
Louis Dionne	b1fb3d75c9	[libc++] Implement C++20's P0476R2: std::bit_cast Thanks to Arthur O'Dwyer for fixing up some of the tests. Differential Revision: https://reviews.llvm.org/D75960	2021-09-09 11:05:54 -04:00
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass also had to be run last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Hansang Bae	3976035d68	[OpenMP] Fix line truncation in omp_lib.h Fixed code that exceeds 72-column. Differential Revision: https://reviews.llvm.org/D109469	2021-09-09 09:33:45 -05:00
Uday Bondhugula	524eafa5b2	[MLIR] Avoid double space print on llvm global op Fix extra space print for llvm global op when the 'unnamed_addr' attribute was empty. This led to two spaces being printed in the custom form between non-whitespace chars. A round trip would add an extra space to a typical spaced form. NFC. Differential Revision: https://reviews.llvm.org/D109502	2021-09-09 19:52:38 +05:30
Sam Clegg	44177e5fb2	[WebAssembly] Add explicit TLS symbol flag As before we maintain backwards compat with older object files by also inferring the TLS flag based on the name of the segment. This change was split out from https://reviews.llvm.org/D108877. Differential Revision: https://reviews.llvm.org/D109426	2021-09-09 10:03:30 -04:00
Louis Dionne	3765d284c4	[libc++] Provide a way to trigger rebuild of Docker images in the CI	2021-09-09 09:59:44 -04:00
Louis Dionne	d61ec93ff2	[libc++] Move additional build bots to the from-scratch config Once all the bots are passing with from-scratch configs, we can attempt to make the from-scratch config the default configuration. Differential Revision: https://reviews.llvm.org/D103417	2021-09-09 09:14:43 -04:00
Sanjay Patel	97a4e7b7ff	[InstCombine] remove a buggy set of zext-icmp transforms The motivating case is an infinite loop shown with a reduced test from: https://llvm.org/PR51762 To solve this, I'm proposing we delete the most obviously broken part of this code. The bug example shows a fundamental problem: we ask computeKnownBits if a transform will be profitable, alter the code by creating new instructions, then rely on computeKnownBits to return the same answer to actually eliminate instructions. But there's no guarantee that the results will be the same between the 1st and 2nd calls. In the infinite loop example, we get different answers, so we add instructions that conflict with some other transform, and we're stuck. There's at least one other problem visible in the test diff for `@zext_or_masked_bit_test_uses`: the code doesn't check uses properly, so we can end up with extra instructions created. Last, it's not clear if this set of transforms actually improves analysis or codegen. I spot-checked a few targets and don't see a clear win: https://godbolt.org/z/x87EWovso If we do see a regression from this change, codegen seems like the right place to add a cmp -> bit-hack fold. If this is too big of a step, we could limit the computeKnownBits calls by not passing a context instruction and/or limiting the recursion. I checked that those would stop the infinite loop for PR51762, but that won't guarantee that some other example does not fall into the same loop. Differential Revision: https://reviews.llvm.org/D109440	2021-09-09 08:49:39 -04:00
Corentin Jabot	7fc743ff84	Mark P0692R1 as implemented; NFC P0692R1 was implemented in https://reviews.llvm.org/D92024 but the status page was not updated.	2021-09-09 08:45:47 -04:00
Louis Dionne	8660b89c0c	[libc++] Clean up the no-unicode CI job It was added after we changed the way the CI jobs are run, in particular how they are pinned down to Linux instances only. As a result, the job would sometimes run on Mac machines, which we're trying to keep only for jobs that absolutely need it due to capacity concerns.	2021-09-09 08:39:30 -04:00
Florian Mayer	039fd9af45	[NFC] [hwasan] move prints closer together. this makes the code slightly more readable. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D109442	2021-09-09 13:39:11 +01:00
Martin Storsjö	a3870e8ab1	Reapply [runtimes] Set more paths when building runtimes standalone These paths are needed when building with per-target runtime directories. (It's possible to fix this by manually setting these when invoking cmake, but one isn't supposed to need to do that.) Also set LLVM_TOOLS_BINARY_DIR while touching this area (as it's also unset in this case) even if it isn't specifically needed by the per-target runtime configuration. Fixed since previous attempt: Don't check if the runtimes directory is the root of the CMake invocation; when the main LLVM CMake build builds runtimes, it does invoke a sub-CMake with this directory as the root too, just as if manually invoking CMake at the runtimes directory. Instead check whether LLVM_TOOLS_BINARY_DIR was set and whether find_package(LLVM) succeeded or not. Differential Revision: https://reviews.llvm.org/D107895	2021-09-09 15:30:42 +03:00
Louis Dionne	312ad74aea	[libc++] Implement P1951, default arguments for pair's forwarding constructor Differential Revision: https://reviews.llvm.org/D109066	2021-09-09 08:28:22 -04:00
Nico Weber	7484206cfd	[gn build] Make lldb build on Windows Differential Revision: https://reviews.llvm.org/D109478	2021-09-09 08:13:50 -04:00
Florian Mayer	6e12c73316	[NFC] [stack-safety] add placeholder addRange. This is in preparation for D108457.	2021-09-09 13:13:18 +01:00
Raphael Isemann	cda1450f1c	[lldb][NFC] Add some tests for function-local classes and document some bugs This feature doesn't seem to have any dedicated test. Instead some random tests (e.g. the bitfield tests) are declaring function-local classes for some reason. This adds a dedicated test so we can clean up those other tests. Also add FIXME's for some basic stuff that doesn't work. The first FIXME is a good beginner bug which just requires prepending the function name (in case we decide to fix it instead of documenting this behaviour). The second FIXME is caused by LLDB searching for definitions by name (which also seems to miss the function name so there is a conflict with the outer type). Some more things that should be tested (and might not work): * Local classes with member functions with local classes. * Classes in different functions with same name. * Classes with the same name in different TUs with internal linkage functions of the same name. * Empty classes are parsed by the DWARF parser in a fast path, so that requires dedicated tests. * Repeat some of the tested logic for C.	2021-09-09 14:12:02 +02:00
Cullen Rhodes	6c8ff4032e	[OptParser] NFC: Remove unused template arg 'name' from bool opt Identified in D109359. Reviewed By: jansvoboda11 Differential Revision: https://reviews.llvm.org/D109489	2021-09-09 12:04:40 +00:00
Florian Mayer	d261d4cf55	[stack-safety] [NFC] do not terminate print with blank line.	2021-09-09 12:31:09 +01:00
LLVM GN Syncbot	9bb803c7a6	[gn build] Port `c58c7a6ea0`	2021-09-09 11:25:54 +00:00
Marco Gartmann	c58c7a6ea0	[clang-tidy] cppcoreguidelines-virtual-base-class-destructor: a new check Finds base classes and structs whose destructor is neither public and virtual nor protected and non-virtual. A base class's destructor should be specified in one of these ways to prevent undefined behaviour. Fixes are available for user-declared and implicit destructors that are either public and non-virtual or protected and virtual. This check implements C.35 [1] from the CppCoreGuidelines. Reviewed By: aaron.ballman, njames93 Differential Revision: http://reviews.llvm.org/D102325 [1]: http://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#Rc-dtor-virtual	2021-09-09 13:23:38 +02:00
Florian Mayer	08b4dd8b24	[NFC] [stack-safety] remove unused return value.	2021-09-09 12:19:47 +01:00
Simon Pilgrim	c31a202233	[X86][AVX] Add missing X86ISD::VBROADCAST(v2f64 -> v4f64) isel pattern for AVX1 targets As discussed on the ticket, I'm intending to add additional 128->256 patterns when we have test coverage, but this addresses a known crash. Differential Revision: https://reviews.llvm.org/D109434	2021-09-09 12:16:23 +01:00
Muhammad Omair Javaid	8901f8beea	AArch64 SVE restore SVE registers after expression This patch fixes register save/restore on expression call to also include SVE registers. This will fix expression calls like: re re p1 <Register Value P1 before expression> p <var-name or function call> re re p1 <Register Value P1 after expression> In above example register P1 should remain the same before and after the expression evaluation. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D108739	2021-09-09 16:06:48 +05:00
Alex Zinenko	1ce752b741	[mlir] support reductions in SCF to OpenMP conversion OpenMP reductions need a neutral element, so we match some known reduction kinds (integer add/mul/or/and/xor, float add/mul, integer and float min/max) to define the neutral element and the atomic version when possible to express using atomicrmw (everything except float mul). The SCF-to-OpenMP pass becomes a module pass because it now needs to introduce new symbols for reduction declarations in the module. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D107549	2021-09-09 13:04:27 +02:00
Bradley Smith	8089f9ed5a	[AArch64][SVE] Add missing patterns for unpredicated subr intrinsics Differential Revision: https://reviews.llvm.org/D109369	2021-09-09 10:28:37 +00:00
Simon Pilgrim	55d9396278	[X86] Move _mm256_set_m128* intrinsics before _mm256_loadu2_m128* intrinsics. NFC. This is necessary for PR51796 where we'll update _mm256_loadu2_m128* to use _mm256_set_m128*	2021-09-09 11:23:50 +01:00
Alfonso Sánchez-Beato	b33fd31772	[yaml2obj][COFF] Allow variable number of directories Allow variable number of directories, as allowed by the specification. NumberOfRvaAndSize will default to 16 if not specified, as in the past. Reviewed by: jhenderson Differential Revision: https://reviews.llvm.org/D108825	2021-09-09 11:16:56 +01:00
Sjoerd Meijer	ecff9e3da5	[FuncSpec] Fixed minor formatting issues. NFC.	2021-09-09 10:36:54 +01:00
Roman Lebedev	909cba9699	[SimplifyCFG] performBranchToCommonDestFolding(): require block-closed SSA form for bonus instructions (PR51125) I can't seem to wrap my head around the proper fix here, we should be fine without this requirement, iff we can form this form, but the naive attempt (https://reviews.llvm.org/D106317) has failed. So just to unblock the release, put up a restriction. Fixes https://bugs.llvm.org/show_bug.cgi?id=51125	2021-09-09 12:28:09 +03:00
Jun Ma	8ba2adcf9e	Recommit "Revert "[CVP] processSwitch: Remove default case when switch cover all possible values."" Differential Revision: https://reviews.llvm.org/D106056	2021-09-09 16:53:33 +08:00
Michał Górny	d1280f6967	[lldb] [test] Add tests for coredumps with multiple threads Differential Revision: https://reviews.llvm.org/D101157	2021-09-09 09:59:52 +02:00
Cullen Rhodes	9d4896f50e	[SelectionDAG] NFC: Remove unused template args Identified in D109359.	2021-09-09 07:29:29 +00:00
Jean Perier	d892d7323e	[flang] Fix common block size extension mistake in D109156 https://reviews.llvm.org/D109156 did not properly update the case where the equivalence symbol appearing in the common statement is the "base symbol of an equivalence group" (this was the only case that previously worked ok, and the patch broke it). Fix this and add a test that actually uses this code path. Differential Revision: https://reviews.llvm.org/D109439	2021-09-09 09:12:12 +02:00
Cullen Rhodes	d42f76fd36	[AArch64][SVE] NFC: Remove unused template args For sve_fp_3op_p_zds_zx we have zero patterns downstream but the intrinsic args can be added again if/when the patterns are implemented. Identified in D109359. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D109429	2021-09-09 07:10:57 +00:00
Cullen Rhodes	5b848a35d2	[AArch64][SVE] NFC: Use stepvector directly in index multiclasses Also fixes a couple of warnings identified in D109359: SVEInstrFormats.td:5099:59: warning: unused template argument: sve_int_index_ri::step_vector SVEInstrFormats.td:5133:59: warning: unused template argument: sve_int_index_rr::step_vector Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D109422	2021-09-09 07:10:57 +00:00
Alexander Pivovarov	4bc8dbe0ca	[RISCV] Add SiFive cores E and S series Add SiFive cores E20, E21, E24, E34, S21, S54 and S76 Differential Revision: https://reviews.llvm.org/D109260	2021-09-08 23:59:04 -07:00
Yvan Roux	261cbe98c3	[RISCV] Fix Machine Outliner jump table handling. Don't outline machine instructions which are using jump table indexes since they are materialized as local labels (like the already handled case of constant pools). Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D109436	2021-09-09 07:32:30 +02:00
Pushpinder Singh	12dcbf913c	[AMDGPU][OpenMP] Use complex definitions from complex_cmath.h Following nvptx approach, this patch uses complex function definitions from complex_cmath.h. With this patch, ovo passes 23/34 complex mathematical test cases. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D109344	2021-09-09 10:55:17 +05:30
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Tom Stellard	9ee64c3746	scudo: Only add no-omit-frame-pointer flags when the compiler supports them Reviewed By: cryptoad Differential Revision: https://reviews.llvm.org/D109196	2021-09-08 21:10:40 -07:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Leonard Chan	9da62d3ed9	[polly] Fix "no member named 'getIndexExpressionsFromGEP'" As of 741fabc222f226d34d806056b804244b012853b, polly builders are failing from this error. The signature is slightly different and accepts a ScalarEvolution reference instead. This should fix the polly builders.	2021-09-08 20:04:56 -07:00
Peter Collingbourne	883e93cb28	gn build: Add support for building lldb-server on Android. The cross-compiled lldb-server targets are added to the lldb deps if Android cross compilation is enabled. Differential Revision: https://reviews.llvm.org/D109464	2021-09-08 19:33:51 -07:00
Peter Collingbourne	9449f441fc	gn build: Add support for building LLDB on Linux. On Linux, LLDB depends on lldb-server at runtime (on Mac, the dependency on a debug server presumably comes via the system debugserver), so I added it to deps. Differential Revision: https://reviews.llvm.org/D109463	2021-09-08 19:33:51 -07:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Leonard Chan	d96e0c5388	Revert "[runtimes] Set more paths when building runtimes standalone" This reverts commit `407e07aa67`. Reverting since this seems to break OpenMP builds and our clang builders. See thread on https://reviews.llvm.org/D107895.	2021-09-08 18:31:10 -07:00
Chris Lattner	9e46dd965a	[APInt.h] Reduce the APInt header file interface a bit. NFC This moves one mid-size function out of line, inlines the trivial tcAnd/tcOr/tcXor/tcComplement methods into their only caller, and moves the magic/umagic functions into SelectionDAG since they are implementation details of its algorithm. This also removes the unit tests for magic, but these are already tested in the divide lowering logic for various targets. This also upgrades some C style comments to C++. Differential Revision: https://reviews.llvm.org/D109476	2021-09-08 18:17:07 -07:00
Jessica Paquette	22a64d4a14	[MachineOutliner][AArch64] Ensure LR is live-in when inserting reg-save calls Similar to other code which handles creating the function frame. If LR isn't live-in to the block that we're inserting the call into, we'll get a MachineVerifier error.	2021-09-08 17:44:27 -07:00
Amara Emerson	eae44c8a86	[GlobalISel] Implement merging of stores of truncates. This is a port of a combine which matches a pattern where a wide type scalar value is stored by several narrow stores. It folds it into a single store or a BSWAP and a store if the target supports it. Assuming little endian target: i8 p = ... i32 val = ... p[0] = (val >> 0) & 0xFF; p[1] = (val >> 8) & 0xFF; p[2] = (val >> 16) & 0xFF; p[3] = (val >> 24) & 0xFF; => ((i32)p) = val; On CTMark AArch64 -Os this results in a good amount of savings: Program before after diff SPASS 412792 412788 -0.0% kc 432528 432512 -0.0% lencod 430112 430096 -0.0% consumer-typeset 419156 419128 -0.0% bullet 475840 475752 -0.0% tramp3d-v4 367760 367628 -0.0% clamscan 383388 383204 -0.0% pairlocalalign 249764 249476 -0.1% 7zip-benchmark 570100 568860 -0.2% sqlite3 287628 286920 -0.2% Geomean difference -0.1% Differential Revision: https://reviews.llvm.org/D109419	2021-09-08 17:06:33 -07:00
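
The store-merging pattern described in eae44c8a86 can be checked outside the compiler with a small model. The sketch below is not LLVM code: it uses a Python bytearray as stand-in memory, and the helper names (store_narrow, store_narrow_reversed, store_wide_le, bswap32, stores_agree) are hypothetical. It assumes a little-endian target, as the commit message does. Treating reversed byte order as the case that needs a BSWAP before the wide store is an interpretation of the message, not something taken from the patch itself.

```python
import struct


def store_narrow(buf: bytearray, val: int) -> None:
    """Four separate byte stores, lowest byte first (pattern from the message)."""
    buf[0] = (val >> 0) & 0xFF
    buf[1] = (val >> 8) & 0xFF
    buf[2] = (val >> 16) & 0xFF
    buf[3] = (val >> 24) & 0xFF


def store_narrow_reversed(buf: bytearray, val: int) -> None:
    """Four separate byte stores, highest byte first (assumed BSWAP case)."""
    buf[0] = (val >> 24) & 0xFF
    buf[1] = (val >> 16) & 0xFF
    buf[2] = (val >> 8) & 0xFF
    buf[3] = (val >> 0) & 0xFF


def store_wide_le(buf: bytearray, val: int) -> None:
    """One 32-bit store on a little-endian target."""
    struct.pack_into("<I", buf, 0, val)


def bswap32(val: int) -> int:
    """Reverse the byte order of a 32-bit value."""
    return int.from_bytes(val.to_bytes(4, "little"), "big")


def stores_agree(val: int) -> bool:
    """Check that each narrow-store pattern matches its single wide store."""
    narrow, wide = bytearray(4), bytearray(4)
    store_narrow(narrow, val)
    store_wide_le(wide, val)
    same_order = narrow == wide

    store_narrow_reversed(narrow, val)
    store_wide_le(wide, bswap32(val))
    swapped_order = narrow == wide
    return same_order and swapped_order


if __name__ == "__main__":
    for value in (0x00000000, 0x11223344, 0xDEADBEEF, 0xFFFFFFFF):
        print(hex(value), stores_agree(value))
```

The model only shows that the memory contents are identical in both forms. Whether the real combine fires also depends on conditions this sketch ignores, such as the target supporting the wide store.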

398568 Commits All Branches Search
