llvm-project

Commit Graph

Author	SHA1	Message	Date
Tee KOBAYASHI	eeb4266f81	[Sparc] Create an error when `__builtin_longjmp` is used Support for builtin setjmp/longjmp was removed by https://reviews.llvm.org/D51487. An error should be created when compiling C code using __builtin_setjmp or __builtin_longjmp. Reviewed By: dcederman Differential Revision: https://reviews.llvm.org/D108901	2021-12-03 23:41:50 -05:00
Chia-hung Duan	b8c6b15283	[mlir] Support collecting logs from notifyMatchFailure(). Let the user registers their own handler to processing the matching failure information. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D110896	2021-12-04 04:35:24 +00:00
Mehdi Amini	4022152b35	Use LLVM_ATTRIBUTE_UNUSED to silent warning for static function used in assert only (NFC)	2021-12-04 04:23:21 +00:00
Mehdi Amini	e846971811	Split the locking of the queue and the threads vector in the ThreadPool implementation This allows to release the QueueLock early and create Thread independently of the queue processing. Differential Revision: https://reviews.llvm.org/D115078	2021-12-04 04:10:24 +00:00
Matthias Springer	5fa0b3561a	[mlir][linalg][bufferize] Implement equivalence analysis Instead of checking buffer equivalence during bufferization, gather buffer equivalence information right after the analysis. This is in preparation of decoupling bufferization from BufferizationAliasInfo. This change also fixes equivalence analysis for scf.if op results, which was not fully implemented. scf.if op results are equivalent to their corresponding yield values if both yield values are equivalent. Differential Revision: https://reviews.llvm.org/D114774	2021-12-04 11:52:04 +09:00
Mehdi Amini	b28f317c81	Fix build for ThreadPool when using -DLLVM_ENABLE_THREADS=OFF Differential Revision: https://reviews.llvm.org/D115019	2021-12-04 02:23:20 +00:00
Uday Bondhugula	2108ed0671	[MLIR] Fix affine.for unroll for multi-result upper bound maps Fix affine.for unroll for multi-result upper bound maps: these can't be unrolled/unroll-and-jammed in cases where the trip count isn't known to be a multiple of the unroll factor. Fix and clean up repeated/unnecessary checks/comments at helper callees. Also, fix clang-tidy variable naming warnings and redundant includes. Differential Revision: https://reviews.llvm.org/D114662	2021-12-04 07:20:26 +05:30
Matthias Springer	9e42f2aa0b	[mlir][linalg][bufferize][NFC] Add inPlaceAnalysis overload Differential Revision: https://reviews.llvm.org/D114773	2021-12-04 10:41:57 +09:00
River Riddle	7169996159	[mlir] Allow shape dimensions larger than 2^32 Internally we use int64_t to hold shapes, but for some reason the parser was limiting shapes to unsigned. This change updates the parser to properly handle int64_t shape dimensions. Differential Revision: https://reviews.llvm.org/D115086	2021-12-04 01:29:50 +00:00
Uday Bondhugula	ecf458507e	[MLIR] Improve error message on missing getArgument() override on pass Improve error message while registering a pass with a missing getArgument() override. Differential Revision: https://reviews.llvm.org/D114744	2021-12-04 06:54:52 +05:30
Uday Bondhugula	d20249fde6	[MLIR] NFC. Rename test cases in test/mlir-cpu-runner per convention Test case files at most places in MLIR uses hyphens and not underscores. A counter-pattern was somehow started to use underscores in some places. Rename test cases in test/mlir-cpu-runner to use hyphens so that it's consistent at least within its directory. Differential Revision: https://reviews.llvm.org/D114672	2021-12-04 06:53:39 +05:30
Philip Reames	1a25d0bfbb	[LICM] Remove profile driven restriction on hoisting This reverts change 2c391a5/D87551. As noted in the llvm-dev thread "LICM as canonical form" sent earlier today, introducing this was a major design change made without sufficient cause. A profile driven LICM is not an unreasonable design, it simply is not what we have. Switching to such a model requires a lot more work than just this patch, and broad aggeement that is the right direction for the optimizer as a whole. Worth noting is that all the tests included in the reverted changed are probably handled if we allow running unconstrained LICM, and later run LoopSink. As such, we have no public examples which motivate a profit based hoisting approach.	2021-12-03 17:19:25 -08:00
Matthias Springer	6db200736c	[mlir][linalg][bufferize][NFC] Use same OpBuilder throughout bufferization Also set insertion point right before calling `bufferize`. No need to put an InsertionGuard anymore. Differential Revision: https://reviews.llvm.org/D114928	2021-12-04 09:57:26 +09:00
Mehdi Amini	48fb79effb	Improve error message when declarativeAssembly contains invalid literals Differential Revision: https://reviews.llvm.org/D115085	2021-12-04 00:27:32 +00:00
Kevin Athey	6c2be3015e	[NFC][sanitizer] Add test for command line flag for enable-noundef-analysis. A simple unit test to demonstrate the flags working correctly. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D114485	2021-12-03 15:44:36 -08:00
wren romano	4748cc6931	[mlir][sparse] Adding a stress test Addresses https://bugs.llvm.org/show_bug.cgi?id=52410 Depends on D114192 Reviewed By: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D114118	2021-12-03 14:59:39 -08:00
Jordan Rupprecht	fddedcaeb8	[NFC] const-ify some methods on CommandReturnObject	2021-12-03 14:54:03 -08:00
Nico Weber	6c11b872c4	[gn build] (semiautomatically) port `98bb198693`	2021-12-03 17:48:27 -05:00
Fangrui Song	9bd6f6f6d5	[ELF][test] Fix typo in aarch64-cortex-a53-843419-recognize.s	2021-12-03 14:38:56 -08:00
natashaknk	e2d8b60742	Revert "[mlir][tosa] Add tosa.conv2d as fully_connected canonicalization" This reverts commit `13bdb7ab4a`. The commit introduced/uncovered an unintended bug in models containing Conv2D. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115079	2021-12-03 14:35:48 -08:00
Nick Desaulniers	9f95bc7dc1	[clang][ARM] relax -mtp=cp15 for non-thumb cases Building -march=armv6k Linux kernels with -mtp=cp15 fails to compile: error: hardware TLS register is not supported for the arm sub-architecture @ardb found docs for ARM1176JZF-S (ARMv6K) that reference hard thread pointer. Relax our ARMv6 check for cases where we're targeting ARM via -marm (vs Thumb1 via -mthumb). This more closely matches the KConfig requirements for where we plan to use these (ie. ARMv6K, ARMv7 (arm or thumb2)). As @peter.smith mentions: on armv5 we can write the instruction to read/write to CP15 C13 with the ThreadID opcode. However on no armv5 implementation will the CP15 C13 have a Thread ID register. The GCC intent seems to be whether the instruction is encodable rather than check what the CPU supports. Link: https://github.com/ClangBuiltLinux/linux/issues/1502 Link: https://developer.arm.com/documentation/ddi0301/h/system-control-coprocessor/system-control-processor-registers/c13--thread-and-process-id-registers Reviewed By: ardb, peter.smith Differential Revision: https://reviews.llvm.org/D114116	2021-12-03 14:00:00 -08:00
Benoit Jacob	728b982bb2	ThreadPool: grow the pool only as needed On my 96-core cloudtop 'machine', it seems unnecessary to always start 96 threads upfront... particularly as the ThreadPool is created even with -mlir-disable-threading. Things like the resuling spew in GDB and the obfuscated output of `(gdb) info threads` are my motivation here, but it probably also doesn't hurt for at least some efficiency metrics to avoid creating many threads upfront. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D115019	2021-12-03 21:40:36 +00:00
Arthur Eubanks	93a20ecee4	[DebugInfo] Check DIEnumerator bit width when comparing for equality As mentioned in D106585, this causes non-determinism, which can also be shown by this test case being flaky without this patch. We were using the APSInt's bit width for hashing, but not for checking for equality. APInt::isSameValue() does not check bit width. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D115054	2021-12-03 13:40:22 -08:00
Amy Kwan	97eb3bb80f	[test-release.sh] Do not run chrpath on AIX. Upon testing the use of test-release.sh on AIX, the script initially fails because chrpath is not present on AIX. This patch adds checks for AIX and allows the script to continue running to completion. Differential Revision: https://reviews.llvm.org/D115046	2021-12-03 15:36:29 -06:00
Vitaly Buka	98bb198693	[sanitizer] Add Lempel–Ziv–Welch encoder/decoder It's very simple, fast and efficient for the stack depot compression if used on entire pointers. Reviewed By: morehouse, kstoimenov Differential Revision: https://reviews.llvm.org/D114918	2021-12-03 13:11:40 -08:00
Vitaly Buka	5f1d1854eb	[NFC][sanitizer] Iterator adaptors for Leb128 encoding It's similar to back_insert_iterator Needed for D114924 Reviewed By: morehouse, kstoimenov Differential Revision: https://reviews.llvm.org/D114980	2021-12-03 12:51:55 -08:00
Vitaly Buka	6318001209	[sanitizer] Support IsRssLimitExceeded in all sanitizers Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D115000	2021-12-03 12:45:44 -08:00
Choongwoo Han	46282fad06	[Sanitizer] Use CreateDirectoryA for report dirs Using `_mkdir` of CRT in Asan Init leads to launch failure and hanging in Windows. You can trigger it by calling: > set ASAN_OPTIONS=log_path=a/a/a > .\asan_program.exe And their crash dump shows the following stack trace: ``` _guard_dispatch_icall_nop() __acrt_get_utf8_acp_compatibility_codepage() _mkdir(const char * path) ``` I guess there could be a cfg guard in CRT, which may lead to calling uninitialized cfg guard function address. Also, `_mkdir` supports UTF-8 encoding of the path and calls _wmkdir, but that's not necessary for this case since other file apis in sanitizer_win.cpp assumes only ANSI code case, so it makes sense to use CreateDirectoryA matching other file api calls in the same file. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D114760	2021-12-03 12:34:05 -08:00
Florian Hahn	31413c4555	[Passes] Adjust SLPVectorizer placement in test. SLPVectorizer runs after the extra vector passes.	2021-12-03 20:27:09 +00:00
Florian Hahn	5da920bf3a	[Passes] Improve opt-pipeline-vector-passes.ll test. Add -NOT lines to ensure that no extra passes are run if -extra-vectorizer-passes is not specified. Also add a loop that actually gets vectorized in preparation for D115052.	2021-12-03 20:15:59 +00:00
Peter Collingbourne	0a14674f27	CodeGen: Strip exception specifications from function types in CFI type names. With C++17 the exception specification has been made part of the function type, and therefore part of mangled type names. However, it's valid to convert function pointers with an exception specification to function pointers with the same argument and return types but without an exception specification, which means that e.g. a function of type "void () noexcept" can be called through a pointer of type "void ()". We must therefore consider the two types to be compatible for CFI purposes. We can do this by stripping the exception specification before mangling the type name, which is what this patch does. Differential Revision: https://reviews.llvm.org/D115015	2021-12-03 14:50:52 -05:00
Hans Wennborg	c361ab0612	[msan] Don't block SIGSYS in ScopedBlockSignals Seccomp-BPF-sandboxed processes rely on being able to process SIGSYS signals. Differential revision: https://reviews.llvm.org/D115057	2021-12-03 20:41:08 +01:00
Leonard Chan	f178a05f22	[libunwind] Fix unwind_leaffunction test It's possible for this test not to pass if the libc used does not provide unwind info for raise. We can replace it with __builtin_cast, which can lead to a SIGTRAP on x86_64 and a SIGILL on aarch64. Using this alternative, a nop is needed before the __builtin_cast. This is because libunwind incorrectly decrements pc, which can cause pc to jump into the previous function and use the incorrect FDE. Differential Revision: https://reviews.llvm.org/D114818	2021-12-03 11:21:20 -08:00
Choongwoo Han	181c4ba467	[CFG] Handle calls with funclet bundle When Control Flow Guard Check is inserted, funclet bundle was not checked. Therefore, it didn't generate code correctly when a target function has "funclet" bundle. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D114914	2021-12-03 10:51:10 -08:00
Mitch Phillips	572a0721a0	[HWASan] Try 'google' prefixed apex directories in symbolizer. Google-signed apexes appear on Android build servers' symbol files as being under /apex/com.google.android.<foo>/. In reality, the apexes are always installed as /apex/com.android.<foo>/ (note the lack of 'google'). In order for local symbolization under hwasan_symbolize to work correctly, we also try the 'google' directory. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D114919	2021-12-03 10:35:03 -08:00
Stanislav Mekhanoshin	e1d6306815	[AMDGPU] Fixed incomplete definitions in twoaddr-fma.mir. NFC.	2021-12-03 10:18:03 -08:00
Stanislav Mekhanoshin	3b17cb1506	[AMDGPU] Kill def when folding immediate in two-addr pass Two-address pass works right before RA and if an immediate was folded into an instruction there is nothing to remove the dead def. We end up with something like: v_mov_b32_e32 v14, 0xc1700000 v_mov_b32_e32 v14, 0x41200000 v_fmaak_f32 v51, s67, v19, 0xc1700000 v_fmaak_f32 v38, v51, v19, 0x4120000 The patch kills the dead move instruction right in the folding. Differential Revision: https://reviews.llvm.org/D114999	2021-12-03 09:37:49 -08:00
Simon Pilgrim	ebf5271918	[DAG] PromoteIntRes_FunnelShift - rename shift Amount variable to Amt to prevent line overflow. NFC.	2021-12-03 17:24:45 +00:00
Philip Reames	7b54de5fef	[funcattrs] Fix a bug in recently introduced writeonly argument inference This fixes a bug in `740057d`. There's two ways to describe the issue: * One caller hasn't yet proven nocapture on the argument. Given that, the inference routine is responsible for bailing out on a potential capture. * Even if we know the argument is nocapture, the access inference needs to traverse the exact set of users the capture tracking would (or exit conservatively). Even if capture tracking can prove a store is non-capturing (e.g. to a local alloc which doesn't escape), we still need to track the copy of the pointer to see if it's later reloaded and accessed again. Note that all the test changes except the newly added ones appear to be false negatives. That is, cases where we could prove writeonly, but the current code isn't strong enough. That's why I didn't spot this originally.	2021-12-03 08:57:15 -08:00
Simon Pilgrim	74cc0fa1db	[IR][AutoUpgrade] Merge x86 mask load intrinsic upgrades. NFC. Helps appease MSVC which is complaining about "fatal error C1061: compiler limit: blocks nested too deeply" - we already do the same thing for avx512.mask.store intrinsics. This is only a stopgap solution until another else-if case needs adding - we really need to refactor this chain of ifs properly.	2021-12-03 16:53:59 +00:00
Muhammad Omair Javaid	80792368bb	[LLDB] XFAIL on Arm/Linux minidebuginfo-set-and-hit-breakpoint.test minidebuginfo-set-and-hit-breakpoint.test is failing on Arm/Linux most probably due to an ill formed binary after removal of certain sections from executable. I am marking it as XFAIL for further investigation.	2021-12-03 21:52:21 +05:00
David Green	08035000cd	[ARM] Separate ARM autoupgrade code into a separate function Try to appease the microsoft compiler which is apparently running out of if statements. Separate the new ARM code into a separate function to keep it simpler.	2021-12-03 16:45:26 +00:00
David Green	11f67f5a2c	[ARM] Replace if's with a switch, NFC I'm not having a lot of luck with the microosft compiler recently. Maybe this will help it with its errors: llvm\lib\IR\AutoUpgrade.cpp(3726): fatal error C1061: compiler limit: blocks nested too deeply If not, it's a good code cleanup anyway.	2021-12-03 16:16:30 +00:00
Guillaume Chatelet	b902b314ff	[libc] Fix invalid include for SqrtLongDouble.h	2021-12-03 16:13:59 +00:00
Nico Weber	1217b4b46f	[gn build] Build with Fission on non-mac non-win when using lld In release+sym builds (-O2 -g), reduces time to link `clang` from 2.3s to 1.3s (-42%). In debug builds (-g), reduces time to link `clang` from 5.4s to 4.5s (-17.4%). See the phab review for full `ministat` numbers. In the CMake build this is opt-in via LLVM_USE_SPLIT_DWARF. Since the GN build is targeted at developers, enabling it by default seems like a better default setting here. (If it turns out to cause problems, we can add an opt-out.) Time to load the binary into gdb and to set a breakpoint is unchanged. Time from `run` to hitting a breakpoint in `main` feel a bit faster (~4s -> ~2s), but I dind't do a careful statistical anlysis for this. Differential Revision: https://reviews.llvm.org/D115040	2021-12-03 11:07:52 -05:00
Florian Hahn	ead3979a92	[MemoryLocation] Move DSE intrinsic handling to MemoryLocation. (NFC) Suggested in D114872.	2021-12-03 16:00:39 +00:00
Guillaume Chatelet	71405d90f0	[libc] Select FPUtils implementations via code instead of build We want to simplify the build system and rely on code to do the implementation selection. This is in preparation of adding a Bazel configuration (D114712). Differential Revision: https://reviews.llvm.org/D115034	2021-12-03 15:48:41 +00:00
Balázs Kéri	1cefe91d40	[clang-tidy][docs][NFC] Improve documentation of bugprone-unhandled-exception-at-new Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D114602	2021-12-03 16:53:08 +01:00
Stephen Tozer	98a021fcbf	[DebugInfo] Attempt to preserve more information during tail duplication Prior to this patch, tail duplication handled debug info poorly - specifically, debug instructions would be dropped instead of being set undef, potentially extending the lifetimes of prior debug values that should be killed. The pass was also very aggressive with dropping debug info, dropping debug info even when the SSA value it referred to was still present. This patch attempts to handle debug info more carefully, checking to see whether each affected debug value can still be live, setting it undef if not. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D106875	2021-12-03 15:30:05 +00:00
David Green	ab0c5cea0b	[ARM] Use v2i1 for MVE and CDE intrinsics This adjusts all the MVE and CDE intrinsics now that v2i1 is a legal type, to use a <2 x i1> as opposed to emulating the predicate with a <4 x i1>. The v4i1 workarounds have been removed leaving the natural v2i1 types, notably in vctp64 which now generates a v2i1 type. AutoUpgrade code has been added to upgrade old IR, which needs to convert the old v4i1 to a v2i1 be converting it back and forth to an integer with arm.mve.v2i and arm.mve.i2v intrinsics. These should be optimized away in the final assembly. Differential Revision: https://reviews.llvm.org/D114455	2021-12-03 15:27:58 +00:00

1 2 3 4 5 ...

406433 Commits All Branches Search

406433 Commits

All Branches