llvm-project

Commit Graph

Author	SHA1	Message	Date
Benoit Jacob	728b982bb2	ThreadPool: grow the pool only as needed On my 96-core cloudtop 'machine', it seems unnecessary to always start 96 threads upfront... particularly as the ThreadPool is created even with -mlir-disable-threading. Things like the resuling spew in GDB and the obfuscated output of `(gdb) info threads` are my motivation here, but it probably also doesn't hurt for at least some efficiency metrics to avoid creating many threads upfront. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D115019	2021-12-03 21:40:36 +00:00
Arthur Eubanks	93a20ecee4	[DebugInfo] Check DIEnumerator bit width when comparing for equality As mentioned in D106585, this causes non-determinism, which can also be shown by this test case being flaky without this patch. We were using the APSInt's bit width for hashing, but not for checking for equality. APInt::isSameValue() does not check bit width. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D115054	2021-12-03 13:40:22 -08:00
Amy Kwan	97eb3bb80f	[test-release.sh] Do not run chrpath on AIX. Upon testing the use of test-release.sh on AIX, the script initially fails because chrpath is not present on AIX. This patch adds checks for AIX and allows the script to continue running to completion. Differential Revision: https://reviews.llvm.org/D115046	2021-12-03 15:36:29 -06:00
Vitaly Buka	98bb198693	[sanitizer] Add Lempel–Ziv–Welch encoder/decoder It's very simple, fast and efficient for the stack depot compression if used on entire pointers. Reviewed By: morehouse, kstoimenov Differential Revision: https://reviews.llvm.org/D114918	2021-12-03 13:11:40 -08:00
Vitaly Buka	5f1d1854eb	[NFC][sanitizer] Iterator adaptors for Leb128 encoding It's similar to back_insert_iterator Needed for D114924 Reviewed By: morehouse, kstoimenov Differential Revision: https://reviews.llvm.org/D114980	2021-12-03 12:51:55 -08:00
Vitaly Buka	6318001209	[sanitizer] Support IsRssLimitExceeded in all sanitizers Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D115000	2021-12-03 12:45:44 -08:00
Choongwoo Han	46282fad06	[Sanitizer] Use CreateDirectoryA for report dirs Using `_mkdir` of CRT in Asan Init leads to launch failure and hanging in Windows. You can trigger it by calling: > set ASAN_OPTIONS=log_path=a/a/a > .\asan_program.exe And their crash dump shows the following stack trace: ``` _guard_dispatch_icall_nop() __acrt_get_utf8_acp_compatibility_codepage() _mkdir(const char * path) ``` I guess there could be a cfg guard in CRT, which may lead to calling uninitialized cfg guard function address. Also, `_mkdir` supports UTF-8 encoding of the path and calls _wmkdir, but that's not necessary for this case since other file apis in sanitizer_win.cpp assumes only ANSI code case, so it makes sense to use CreateDirectoryA matching other file api calls in the same file. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D114760	2021-12-03 12:34:05 -08:00
Florian Hahn	31413c4555	[Passes] Adjust SLPVectorizer placement in test. SLPVectorizer runs after the extra vector passes.	2021-12-03 20:27:09 +00:00
Florian Hahn	5da920bf3a	[Passes] Improve opt-pipeline-vector-passes.ll test. Add -NOT lines to ensure that no extra passes are run if -extra-vectorizer-passes is not specified. Also add a loop that actually gets vectorized in preparation for D115052.	2021-12-03 20:15:59 +00:00
Peter Collingbourne	0a14674f27	CodeGen: Strip exception specifications from function types in CFI type names. With C++17 the exception specification has been made part of the function type, and therefore part of mangled type names. However, it's valid to convert function pointers with an exception specification to function pointers with the same argument and return types but without an exception specification, which means that e.g. a function of type "void () noexcept" can be called through a pointer of type "void ()". We must therefore consider the two types to be compatible for CFI purposes. We can do this by stripping the exception specification before mangling the type name, which is what this patch does. Differential Revision: https://reviews.llvm.org/D115015	2021-12-03 14:50:52 -05:00
Hans Wennborg	c361ab0612	[msan] Don't block SIGSYS in ScopedBlockSignals Seccomp-BPF-sandboxed processes rely on being able to process SIGSYS signals. Differential revision: https://reviews.llvm.org/D115057	2021-12-03 20:41:08 +01:00
Leonard Chan	f178a05f22	[libunwind] Fix unwind_leaffunction test It's possible for this test not to pass if the libc used does not provide unwind info for raise. We can replace it with __builtin_cast, which can lead to a SIGTRAP on x86_64 and a SIGILL on aarch64. Using this alternative, a nop is needed before the __builtin_cast. This is because libunwind incorrectly decrements pc, which can cause pc to jump into the previous function and use the incorrect FDE. Differential Revision: https://reviews.llvm.org/D114818	2021-12-03 11:21:20 -08:00
Choongwoo Han	181c4ba467	[CFG] Handle calls with funclet bundle When Control Flow Guard Check is inserted, funclet bundle was not checked. Therefore, it didn't generate code correctly when a target function has "funclet" bundle. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D114914	2021-12-03 10:51:10 -08:00
Mitch Phillips	572a0721a0	[HWASan] Try 'google' prefixed apex directories in symbolizer. Google-signed apexes appear on Android build servers' symbol files as being under /apex/com.google.android.<foo>/. In reality, the apexes are always installed as /apex/com.android.<foo>/ (note the lack of 'google'). In order for local symbolization under hwasan_symbolize to work correctly, we also try the 'google' directory. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D114919	2021-12-03 10:35:03 -08:00
Stanislav Mekhanoshin	e1d6306815	[AMDGPU] Fixed incomplete definitions in twoaddr-fma.mir. NFC.	2021-12-03 10:18:03 -08:00
Stanislav Mekhanoshin	3b17cb1506	[AMDGPU] Kill def when folding immediate in two-addr pass Two-address pass works right before RA and if an immediate was folded into an instruction there is nothing to remove the dead def. We end up with something like: v_mov_b32_e32 v14, 0xc1700000 v_mov_b32_e32 v14, 0x41200000 v_fmaak_f32 v51, s67, v19, 0xc1700000 v_fmaak_f32 v38, v51, v19, 0x4120000 The patch kills the dead move instruction right in the folding. Differential Revision: https://reviews.llvm.org/D114999	2021-12-03 09:37:49 -08:00
Simon Pilgrim	ebf5271918	[DAG] PromoteIntRes_FunnelShift - rename shift Amount variable to Amt to prevent line overflow. NFC.	2021-12-03 17:24:45 +00:00
Philip Reames	7b54de5fef	[funcattrs] Fix a bug in recently introduced writeonly argument inference This fixes a bug in `740057d`. There's two ways to describe the issue: * One caller hasn't yet proven nocapture on the argument. Given that, the inference routine is responsible for bailing out on a potential capture. * Even if we know the argument is nocapture, the access inference needs to traverse the exact set of users the capture tracking would (or exit conservatively). Even if capture tracking can prove a store is non-capturing (e.g. to a local alloc which doesn't escape), we still need to track the copy of the pointer to see if it's later reloaded and accessed again. Note that all the test changes except the newly added ones appear to be false negatives. That is, cases where we could prove writeonly, but the current code isn't strong enough. That's why I didn't spot this originally.	2021-12-03 08:57:15 -08:00
Simon Pilgrim	74cc0fa1db	[IR][AutoUpgrade] Merge x86 mask load intrinsic upgrades. NFC. Helps appease MSVC which is complaining about "fatal error C1061: compiler limit: blocks nested too deeply" - we already do the same thing for avx512.mask.store intrinsics. This is only a stopgap solution until another else-if case needs adding - we really need to refactor this chain of ifs properly.	2021-12-03 16:53:59 +00:00
Muhammad Omair Javaid	80792368bb	[LLDB] XFAIL on Arm/Linux minidebuginfo-set-and-hit-breakpoint.test minidebuginfo-set-and-hit-breakpoint.test is failing on Arm/Linux most probably due to an ill formed binary after removal of certain sections from executable. I am marking it as XFAIL for further investigation.	2021-12-03 21:52:21 +05:00
David Green	08035000cd	[ARM] Separate ARM autoupgrade code into a separate function Try to appease the microsoft compiler which is apparently running out of if statements. Separate the new ARM code into a separate function to keep it simpler.	2021-12-03 16:45:26 +00:00
David Green	11f67f5a2c	[ARM] Replace if's with a switch, NFC I'm not having a lot of luck with the microosft compiler recently. Maybe this will help it with its errors: llvm\lib\IR\AutoUpgrade.cpp(3726): fatal error C1061: compiler limit: blocks nested too deeply If not, it's a good code cleanup anyway.	2021-12-03 16:16:30 +00:00
Guillaume Chatelet	b902b314ff	[libc] Fix invalid include for SqrtLongDouble.h	2021-12-03 16:13:59 +00:00
Nico Weber	1217b4b46f	[gn build] Build with Fission on non-mac non-win when using lld In release+sym builds (-O2 -g), reduces time to link `clang` from 2.3s to 1.3s (-42%). In debug builds (-g), reduces time to link `clang` from 5.4s to 4.5s (-17.4%). See the phab review for full `ministat` numbers. In the CMake build this is opt-in via LLVM_USE_SPLIT_DWARF. Since the GN build is targeted at developers, enabling it by default seems like a better default setting here. (If it turns out to cause problems, we can add an opt-out.) Time to load the binary into gdb and to set a breakpoint is unchanged. Time from `run` to hitting a breakpoint in `main` feel a bit faster (~4s -> ~2s), but I dind't do a careful statistical anlysis for this. Differential Revision: https://reviews.llvm.org/D115040	2021-12-03 11:07:52 -05:00
Florian Hahn	ead3979a92	[MemoryLocation] Move DSE intrinsic handling to MemoryLocation. (NFC) Suggested in D114872.	2021-12-03 16:00:39 +00:00
Guillaume Chatelet	71405d90f0	[libc] Select FPUtils implementations via code instead of build We want to simplify the build system and rely on code to do the implementation selection. This is in preparation of adding a Bazel configuration (D114712). Differential Revision: https://reviews.llvm.org/D115034	2021-12-03 15:48:41 +00:00
Balázs Kéri	1cefe91d40	[clang-tidy][docs][NFC] Improve documentation of bugprone-unhandled-exception-at-new Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D114602	2021-12-03 16:53:08 +01:00
Stephen Tozer	98a021fcbf	[DebugInfo] Attempt to preserve more information during tail duplication Prior to this patch, tail duplication handled debug info poorly - specifically, debug instructions would be dropped instead of being set undef, potentially extending the lifetimes of prior debug values that should be killed. The pass was also very aggressive with dropping debug info, dropping debug info even when the SSA value it referred to was still present. This patch attempts to handle debug info more carefully, checking to see whether each affected debug value can still be live, setting it undef if not. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D106875	2021-12-03 15:30:05 +00:00
David Green	ab0c5cea0b	[ARM] Use v2i1 for MVE and CDE intrinsics This adjusts all the MVE and CDE intrinsics now that v2i1 is a legal type, to use a <2 x i1> as opposed to emulating the predicate with a <4 x i1>. The v4i1 workarounds have been removed leaving the natural v2i1 types, notably in vctp64 which now generates a v2i1 type. AutoUpgrade code has been added to upgrade old IR, which needs to convert the old v4i1 to a v2i1 be converting it back and forth to an integer with arm.mve.v2i and arm.mve.i2v intrinsics. These should be optimized away in the final assembly. Differential Revision: https://reviews.llvm.org/D114455	2021-12-03 15:27:58 +00:00
Tue Ly	dbed678f4b	[libc] Fix bugs with negative and mixed normal/denormal inputs in hypot implementation. Fix a bug with negative and mixed normal/denormal inputs in hypot implementation. Differential Revision: https://reviews.llvm.org/D114726	2021-12-03 10:14:04 -05:00
Nemanja Ivanovic	d6c0ef7887	[PowerPC] Handle base load with reservation mnemonic The Power ISA defined l[bhwdq]arx as both base and extended mnemonics. The base mnemonic takes the EH bit as an operand and the extended mnemonic omits it, making it implicitly zero. The existing implementation only handles the base mnemonic when EH is 1 and internally produces a different instruction. There are historical reasons for this. This patch simply removes the limitation introduced by this implementation that disallows the base mnemonic with EH = 0 in the ASM parser. This resolves an issue that prevented some files in the Linux kernel from being built with -fintegrated-as. Also fix a crash if the value is not an integer immediate.	2021-12-03 09:13:02 -06:00
Anna Thomas	72750f0012	[TrivialDeadness] Introduce API separating two different usages The earlier usage of wouldInstructionBeTriviallyDead is based on the assumption that the use_count of that instruction being checked will be zero. This patch separates the API into two different ones: 1. The strictly conservative one where the instruction is trivially dead iff the uses are dead. 2. The slightly relaxed form, where an instruction is dead along paths where it is not used. The second form can be used in identifying instructions that are valid to sink down to uses (D109917). Reviewed-By: reames Differential Revision: https://reviews.llvm.org/D114647	2021-12-03 10:09:52 -05:00
Andy Yankovsky	0495301293	[lldb-vscode] Report supportsModulesRequest=true The adapter does support `Modules` request, implemented in `39239f9`. Reviewed By: wallace Differential Revision: https://reviews.llvm.org/D115033	2021-12-03 16:07:48 +01:00
Alexey Bataev	f6279562da	[OPENMP]Fix PR52117: Crash caused by target region inside of task construct. Need to do the analysis of the captured expressions in the clauses. Previously the compiler ignored them and it may lead to a compiler crash trying to get the address of the mapped variables. Differential Revision: https://reviews.llvm.org/D114546	2021-12-03 07:01:00 -08:00
Mehrnoosh Heidarpour	54dc03b97b	[InstSimplify] Add test case for logic 'or' fold; NFC	2021-12-03 09:29:43 -05:00
Matthias Springer	e359a1e548	[mlir][linalg][bufferize][NFC] Map only tensors in BufferizationState BufferizationState had map/lookup overloads for non-tensor values. This was necessary for IREE. There is now a better way to do this, so these overloads can be removed. Differential Revision: https://reviews.llvm.org/D114929	2021-12-03 23:07:09 +09:00
David Green	255ad73424	[ARM] Make MVE v2i1 predicates legal MVE can treat v16i1, v8i1, v4i1 and v2i1 as different views onto the same 16bit VPR.P0 register, with v2i1 holding two 8 bit values for the two halves. This was never treated as a legal type in llvm in the past as there are not many 64bit instructions and no 64bit compares. There are a few instructions that could use it though, notably a VSELECT (as it can handle any size using the underlying v16i8 VPSEL), AND/OR/XOR for similar reasons, some gathers/scatter and long multiplies and VCTP64 instructions. This patch goes through and makes v2i1 a legal type, handling all the cases that fall out of that. It also makes VSELECT legal for v2i64 as a side benefit. A lot of the codegen changes as a result - usually in way that is a little better or a little worse, but still expensive. Costs can change a little too in the process, again in a way that expensive things remain expensive. A lot of the tests that changed are mainly to ensure correctness - the code can hopefully be improved in the future where it comes up in practice. The intrinsics currently remain using the v4i1 they previously did to emulate a v2i1. This will be changed in a followup patch but this one was already large enough. Differential Revision: https://reviews.llvm.org/D114449	2021-12-03 14:05:41 +00:00
Jay Foad	b670dcb81b	[AMDGPU] Add some more GFX10 test coverage	2021-12-03 14:03:31 +00:00
Valentin Clement	d59a0f58f4	[fir] Add fir character builder This patch adds the FIR builder to generate the numeric intrinsic runtime call. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D114900 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: mleair <leairmark@gmail.com>	2021-12-03 14:58:17 +01:00
Valentin Clement	c32421c925	[fir] Add fir derived type runtime builder This patch adds the builder to generate derived type runtime API calls. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D114472 Co-authored-by: Peter Klausler <pklausler@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-12-03 14:51:59 +01:00
Jay Foad	b29b6f92af	[AMDGPU] Add some more GFX10 GlobalISel test coverage	2021-12-03 13:40:27 +00:00
Matthias Springer	ed8c63115e	[mlir][linalg][bufferize][NFC] Provide default implementation of getAliasingOpOperand This simplifies op interface implementations. Differential Revision: https://reviews.llvm.org/D115025	2021-12-03 22:36:22 +09:00
Jay Foad	d133a21b71	[SelectionDAG] Add newline to a debug message	2021-12-03 13:33:32 +00:00
Florian Hahn	af86aa7980	[MemoryLocation] Use None instead of {}. (NFC)	2021-12-03 13:19:00 +00:00
Guillaume Chatelet	cca8e1e415	[libc][NFC] Fix typo in CMakeLists documentation	2021-12-03 13:52:09 +01:00
Adrian Kuegel	04d083b19e	[mlir][NFC] Use const reference for loop variables.	2021-12-03 13:07:54 +01:00
Simon Pilgrim	e85667a2fb	[PowerPC] Add non-constant fcopysign f128 test coverage As discussed on D114589 as the constant case gets affected by SimplifyDemandedBits a lot - the non-constant case currently falls back to copysignl libcalls	2021-12-03 12:04:06 +00:00
Petar Avramovic	0b34ffe4a6	AMDGPU/GlobalISel: Add clamp combine Add clamp combine. Source is fminnum(fmaxnum(Val, 0.0), 1.0) or fmaxnum(fminnum(Val, 1.0), 0.0) or fmed3 intrinsic with 0.0 and 1.0 as two out of three operands. Differential Revision: https://reviews.llvm.org/D90052	2021-12-03 12:49:39 +01:00
Petar Avramovic	ec54867d75	AMDGPU/GlobalISel: Add floating point med3 combine Add floating point version of med3 combine. Source is fminnum(fmaxnum(Val, K0), K1) or fmaxnum(fminnum(Val, K1), K0) where K0 and K1 are constants and K0 <= K1. Differential Revision: https://reviews.llvm.org/D90051	2021-12-03 12:49:39 +01:00
Petar Avramovic	ab01f4d264	AMDGPU/GlobalISel: Do not fcanonicalize const splat padded with undef Recognize constant splat padded with undef in isCanonicalized. Fcanonicalize will be removed by RemoveFcanonicalize in post-legalizer combiner. We will treat undef as value that will result in a splat in clamp combine after regbankselect. Differential Revision: https://reviews.llvm.org/D104408	2021-12-03 12:49:38 +01:00

1 2 3 4 5 ...

406412 Commits All Branches Search

406412 Commits

All Branches