llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Browne	d81723c99b	[DFSan] Optimize code for writing to shadow. Move SetShadow to namespace. Writing zeros to shadow (including checking for existing zero) is now ~2x faster on one example. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D110733	2021-09-30 12:42:21 -07:00
LLVM GN Syncbot	0337e228c1	[gn build] Port `050edef853`	2021-09-30 19:35:12 +00:00
Maksim Panchenko	050edef853	[MC] Make MCDwarfLineStr class public Add MCDwarfLineStr class to the public API. Note that MCDwarfLineTableHeader::Emit(), takes MCDwarfLineStr as an Optional<> parameter making it impossible to use the API if the class is not publicly defined. Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D109412	2021-09-30 12:31:59 -07:00
Albion Fung	4195ed9959	[PowerPC] Improved codegen related to xscvdpsxws/xscvdpuxws This patch removes the uneccessary mf/mtvsr generated in conjunction with xscvdpsxws/xscvdpuxws. Differential revision: https://reviews.llvm.org/D109902	2021-09-30 14:31:00 -05:00
Amara Emerson	80f4bb5c61	[GlobalISel] Extend G_SELECT of known condition combine to vectors. Adds a new utility function: isConstantOrConstantSplatVector(). Differential Revision: https://reviews.llvm.org/D110786	2021-09-30 12:16:44 -07:00
Peter Klausler	8256867508	[flang] Fold FINDLOC() Fold the transformational intrinsic function FINDLOC() for all combinations of optional arguments and data types. Differential Revision: https://reviews.llvm.org/D110757	2021-09-30 12:08:10 -07:00
Sanjay Patel	3fcb00df5d	[InstCombine] restrict shift-trunc-shift fold to opposite direction shifts This is NFCI because the pattern with 2 left-shifts should get folded independently by smaller folds. The motivation is to refine this block to avoid infinite loops seen with D110170.	2021-09-30 15:06:13 -04:00
Sanjay Patel	66c069d7d6	[InstCombine] add tests for shift-trunc-shift; NFC	2021-09-30 15:06:13 -04:00
Nico Weber	e31899c708	Reland "[clang-cl] Accept `#pragma warning(disable : N)` for some N" This reverts commit `0cd9d8a48b` and adds the changes described in https://reviews.llvm.org/D110668#3034461.	2021-09-30 15:03:23 -04:00
Nikita Popov	b989211d7d	[BasicAA] Move more extension logic into ExtendedValue (NFC) Add methods to appropriately extend KnownBits/ConstantRange there, same as with APInt. Also clean up the known bits handling by actually doing that extension rather than checking ZExtBits. This doesn't matter now, but becomes relevant once truncation is involved.	2021-09-30 20:45:12 +02:00
wren romano	218954865e	[mlir][sparse] Correcting a few typos Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110773	2021-09-30 11:42:46 -07:00
Arthur Eubanks	76902079e4	[clang] Don't modify OptRemark if the argument is not relevant A followup to D110201. For example, we'd set OptimizationRemarkMissed's Regex to '.*' when encountering -Rpass. Normally this doesn't actually affect remarks we emit because in clang::ProcessWarningOptions() we'll separately look at all -R arguments and turn on/off corresponding diagnostic groups. However, this is reproducible with -round-trip-args. Reviewed By: JamesNagurne Differential Revision: https://reviews.llvm.org/D110673	2021-09-30 11:36:30 -07:00
peter klausler	691814f9cc	[flang] Fix test regression from SQRT folding The algorithm used to fold SQRT has some holes that led to test failures; debug and add more tests. Differential Revision: https://reviews.llvm.org/D110744	2021-09-30 11:35:44 -07:00
Nico Weber	8dfbe9b0ae	[clang] Make crash reproducer work with clang-cl When clang crashes, it writes a standalone source file and shell script to reproduce the crash. The Driver used to set `Mode = CPPMode` in generateCompilationDiagnostics() to force preprocessing mode. This has the side effect of making IsCLMode() return false, which in turn meant Clang::AddClangCLArgs() didn't get called when creating the standalone source file, which meant the stand-alone file was preprocessed with the gcc driver's defaults In particular, exceptions default to on with the gcc driver, but to off with the cl driver. The .sh script did use the original command line, so in the reproducer for a clang-cl crash, the standalone source file could contain exception-using code after preprocessing that the compiler invocation in the shell script would then complain about. This patch removes the `Mode = CPPMode;` line and instead additionally checks for `CCGenDiagnostics` in most places that check `CCCIsCPP(). This also matches the strategy Clang::ConstructJob() uses to add -frewrite-includes for creating the standalone source file for a crash report. Fixes PR52007. Differential Revision: https://reviews.llvm.org/D110783	2021-09-30 14:33:14 -04:00
Zequan Wu	dbaa408336	[clang] do not emit note for bad conversion when destination type qualifiers are not compatibly include source type qualifiers llvm.org/PR52014 Differential Revision: https://reviews.llvm.org/D110780	2021-09-30 11:24:18 -07:00
Nico Weber	fa32fd3bf7	[clang] Remove duplication in types::getCompilationPhases() Call Driver::getFinalPhase() instead of duplicating it. https://reviews.llvm.org/D65993 added the duplication, then `02e35832c3` maded it more obviously a copy of getFinalPhase(). The only difference is that getCompilationPhases() used to use LastPhase / IfsMerge where getFinalPhase() used Link. Adapt getFinalPhase() to return IfsMerge when needed. No intentional behavior change. Differential Revision: https://reviews.llvm.org/D110770	2021-09-30 14:17:14 -04:00
Louis Dionne	6714e1ce3b	[libc++abi][NFCI] Consistently group new_handler, unexpected_handler and terminate_handler Previously, the definitions of __cxa_terminate_handler and __cxa_unexpected_handler (and their set_xxx_handler functions) were grouped together, but the definition of __cxa_new_handler wasn't. This commit simply moves those to the same file to treat all handlers consistently.	2021-09-30 14:15:30 -04:00
Haowei Wu	138dc27186	Revert "[libc++][libc++abi] Add tests for vendor-specific properties" This reverts commit `9892d1644f`, which causes clang test failures in libcxx tests.	2021-09-30 11:03:59 -07:00
peter klausler	faa1842875	[flang] Front-end and runtime support for CALL EXIT and ABORT Support the extension intrinsic subroutines EXIT([status]) and ABORT() in the intrinsic table and runtime support library. Lowering remains to be done. Differential Revision: https://reviews.llvm.org/D110741	2021-09-30 11:01:22 -07:00
Lei Zhang	cb2e651800	[mlir][linalg] Fix incorrect bound calculation for tiling conv For convolution, the input window dimension's access affine map is of the form `(d0 * s0 + d1)`, where `d0`/`d1` is the output/ filter window dimension, and `s0` is the stride. When tiling, https://reviews.llvm.org/D109267 changed how the way dimensions are acquired. Instead of directly querying using `.dim` ops on the original convolution op, we now get it by applying the access affine map to the loop upper bounds. This is fine for dimensions having single-dimension affine maps, like matmul, but not for convolution input. It will cause incorrect compuation and out of bound. A concrete example, say we have 1x225x225x3 (NHWC) input, 3x3x3x32 (HWCF) filter, and 1x112x112x3 (NHWC) output with stride 2, (112 2 + 3) would be 227, which is different from the correct input window dimension size 225. Instead, we should first calculate the max indices for each loop, and apply the affine map to them, and then plus one to get the dimension size. Note this makes no difference for matmul-like ops given they will have `d0 - 1 + 1` effectively. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110849	2021-09-30 13:50:57 -04:00
Jonas Devlieghere	b505ed9d31	[lldb] Remove support for replaying the test suite from a reproducer This patch removes the infrastructure to replay the test suite from a reproducer, as well as the modifications made to the individual tests.	2021-09-30 10:47:19 -07:00
Stanislav Mekhanoshin	244aa7f735	[AMDGPU] move hasAGPRs/hasVGPRs into header It is now very simple and can go right into the header allowing optimizer to combine callers, such as isVGPRClass and similar. It does not need anything from the TRI itself anymore, so make it static class member along with the callers. Differential Revision: https://reviews.llvm.org/D110762	2021-09-30 10:02:02 -07:00
Arthur Eubanks	d372e92baf	Manually create unique_ptr in various pass adaptors This avoids creating tons of make_unique template instantiations. And we only create a unique_ptr of the actual pass concept type, rather than creating a unique_ptr of the pass model subclass then casting it to the pass concept type. This reduces the work spent compiling PassBuilder.cpp from 83M -> 73M instructions according to perf stat. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D110784	2021-09-30 09:55:06 -07:00
Nikita Popov	ea02f9caff	[BasicAA] Use ExtendedValue in VariableGEPIndex (NFC) Use the ExtendedValue structure which is used for LinearExpression in VariableGEPIndex as well.	2021-09-30 18:48:51 +02:00
Arthur Eubanks	fc7604a2c0	Instantiate fewer templates in PassManager::addPass() We create many instantiations of PassManager::addPass() in PassBuilder.cpp. vector::emplace_back() and make_unique() are both templated and would have many instantiations based on the number of times we instantiate addPass(). Now we directly construct the unique_ptr with the type as the actual unique_ptr type in the vector we are adding it to, so we only have one unique_ptr constructor instantiation across all addPass() instantiations and only the non-templated push_back(). This makes PassBuilder.cpp slightly faster to build. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D110775	2021-09-30 09:47:59 -07:00
Craig Topper	765348298c	[CostModel] Update default cost model for sadd/ssub overflow to match TargetLowering The expansion for these was updated in https://reviews.llvm.org/D47927 but the cost model was not adjusted. I believe the cost model was also incorrect for the old expansion. The expansion prior to D47927 used 3 icmps using LHS, RHS, and Result to calculate theirs signs. Then 2 icmps to compare the signs. Followed by an And. The previous cost model was using 3 icmps and 2 selects. Digging back through git blame, those 2 selects in the cost model used to be 2 icmps, but were changed in https://reviews.llvm.org/D90681 Differential Revision: https://reviews.llvm.org/D110739	2021-09-30 09:41:14 -07:00
Stella Laurenzo	267bb194f3	[mlir] Remove old "tc" linalg ods generator. * This could have been removed some time ago as it only had one op left in it, which is redundant with the new approach. * `matmul_i8_i8_i32` (the remaining op) can be trivially replaced by `matmul`, which natively supports mixed precision. Differential Revision: https://reviews.llvm.org/D110792	2021-09-30 16:30:06 +00:00
Adrian Prantl	9232ca4712	Improve the effectiveness of BDCE's debug info salvaging This patch improves the effectiveness of BDCE's debug info salvaging by processing the instructions in reverse order and delaying dropAllReferences until after debug info salvaging. This allows salvaging of entire chains of deleted instructions! Previously we would remove all references from an instruction, which would make it impossible to use that instruction to salvage a later instruction in the instruction stream, because its operands were already removed. This reapplies the previous patch with a fix for a use-after-free. Differential Revision: https://reviews.llvm.org/D110568	2021-09-30 09:28:49 -07:00
Louis Dionne	981b12fe89	[libc++][NFC] Remove stray whitespace	2021-09-30 12:18:47 -04:00
Eric Schweitz	7362cc5ef5	[fir][NFC] Rename lenParams to typeparams in builders Fine tuning after couple of upstreamed patches. Rename the lenParams varaiables in buidlers to typeparams. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110821 Co-authored-by: Valentin Clement <clementval@gmail.com>	2021-09-30 17:56:32 +02:00
Kazu Hirata	f631173d80	[llvm] Migrate from arg_operands to args (NFC) Note that arg_operands is considered a legacy name. See llvm/include/llvm/IR/InstrTypes.h for details.	2021-09-30 08:51:21 -07:00
Daniil Fukalov	cf362ff4ca	[NFC][AMDGPU] Improve cost model tests coverage.	2021-09-30 18:13:17 +03:00
Florian Mayer	9c8d8fc98f	[NFC] [hwasan] Remove unneeded member. Reviewed By: hctim Differential Revision: https://reviews.llvm.org/D110791	2021-09-30 16:11:17 +01:00
Sanjay Patel	97948620b1	[x86] add test for 3 fcmps and logic; NFC This is a more complex pattern than we handled with the initial patch for PR51245: D110342 / `09e71c367a` We could extend the logic matching to allow a setcc as one operand and an extract of vector setcc (or even an arbitrary bool?) as the other.	2021-09-30 11:02:59 -04:00
Kiran Chandramohan	4cab4f6b1c	[fir] Update fir.alloca op Add pinned attributes and speicifc builders. The pinned attribute helps mark those allocas in OpenMP regions that should not be hoisted out by an alloca hoisting pass. This patch is part of the upstreaming effort from fir-dev branch. Coming from PR: https://github.com/flang-compiler/f18-llvm-project/pull/1065 Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110815 Co-authored-by: Valentin Clement <clementval@gmail.com>	2021-09-30 17:02:32 +02:00
Louis Dionne	c15bbdeaff	[libc++] Add a testing configuration specific to Apple's libc++ Apple's libc++ has a few differences with the LLVM libc++, and it is necessary to use a custom configuration file to test it properly. Differential Revision: https://reviews.llvm.org/D110777	2021-09-30 10:51:13 -04:00
Valentin Clement	0781461959	[fir][NFC] Removed unused declaration from td file Remove unused code from FIROps.td file after latest patches. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110814	2021-09-30 16:46:13 +02:00
Kirill Bobyrev	38ec2292f0	[clangd] Update comment after `dea48079b9`	2021-09-30 16:45:10 +02:00
Anna Thomas	6f2d01376d	[LoopPredication] Remove unused variable After rG452714f8f8037ff37f9358317651d1652e231db2, the Function `F` retrieved in LoopPredication is not used. Remove this unused variable to stop some buildbots (ASAN, clang-ppc) from failing.	2021-09-30 10:40:47 -04:00
Roman Lebedev	3bd02ec977	[NFC][X86][Codegen] Add test coverage for interleaved i64 load/store stride=2	2021-09-30 17:31:18 +03:00
Roman Lebedev	6be397eb35	[NFC][X86][LV] Add costmodel test coverage for interleaved i64/f64 load/store stride=2	2021-09-30 17:31:18 +03:00
Jakub Kuderski	7bb47a046f	[NFC] Improve file-level documentation for Sequence.h Add usage samples. This was extracted from a bigger patch: https://reviews.llvm.org/D107378. Reviewed By: aaron.ballman, gchatelet Differential Revision: https://reviews.llvm.org/D110760	2021-09-30 10:28:38 -04:00
Valentin Clement	a6fc555202	[fir][NFC] Move parser/printer for fir.global_len Move parser and printer to the .cpp file. Follow up to https://reviews.llvm.org/D110626. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D110828	2021-09-30 16:27:30 +02:00
Anna Thomas	452714f8f8	[BPI] Keep BPI available in loop passes through LoopStandardAnalysisResults This is analogous to D86156 (which preserves "lossy" BFI in loop passes). Lossy means that the analysis preserved may not be up to date with regards to new blocks that are added in loop passes, but BPI will not contain stale pointers to basic blocks that are deleted by the loop passes. This is achieved through BasicBlockCallbackVH in BPI, which calls eraseBlock that updates the data structures in BPI whenever a basic block is deleted. This patch does not have any changes in the upstream pipeline, since none of the loop passes in the pipeline use BPI currently. However, since BPI wasn't previously preserved in loop passes, the loop predication pass was invoking BPI on the entire function every time it ran in an LPM. This caused massive compile time in our downstream LPM invocation which contained loop predication. See updated test with an invocation of a loop-pipeline containing loop predication and -debug-pass turned ON. Reviewed-By: asbirlea, modimo Differential Revision: https://reviews.llvm.org/D110438	2021-09-30 10:27:05 -04:00
Valentin Clement	afb30fcdf6	[fir][NFC] Move fir.shape verifier to cpp file Move verifier code to the .cpp file. Follow up to https://reviews.llvm.org/D110626. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D110826	2021-09-30 16:26:44 +02:00
Valentin Clement	ccc0f62d1b	[fir][NFC] Update and move fir.freemem and fir.store Move fir.freemem and fir.store to the Memory SSA operations sections. Move parser, printer and verifier of fir.store to the .cpp file. This patch is part of the upstreaming effort from fir-dev branch. Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D110816	2021-09-30 16:25:50 +02:00
Alex Zinenko	93a6b49d38	[mlir][python] provide bindings for ops from the sparse_tensor dialect Previously, the dialect was exposed for linking and pass management purposes, but we did not generate op classes for it. Generate them. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110819	2021-09-30 15:53:16 +02:00
David Green	f9aa8623fe	[ARM] Add more MVE intrinsics to sink splats to This adds a few more unpredicated intrinsics to sink splats to, in order to create more qr instruction variants. Notably this includes saddsat/uaddsat but also some of the unpredicated mve intrinsics. Differential Revision: https://reviews.llvm.org/D110333	2021-09-30 14:41:23 +01:00
Brock Wyma	bafd8b1add	[CodeView] Recognize Fortran95 as Fortran instead of MASM Map Fortran95 sources to Fortran so the CodeView language is not emitted as MASM. Differential Revision: https://reviews.llvm.org/D110330	2021-09-30 09:27:05 -04:00
Guillaume Chatelet	adc18ad6ac	[libc] move benchmark function registration to a different file	2021-09-30 13:11:33 +00:00

1 2 3 4 5 ...

400494 Commits All Branches Search

400494 Commits

All Branches