llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Richardson	fc0051011e	[InstCombine][ConstantFold] Baseline tests for ptrtoint(gep null, x) Differential Revision: https://reviews.llvm.org/D110244	2021-09-28 17:57:36 +01:00
Alex Richardson	a18181931f	[NFC][clang] Add a CHECK lines to tests checking offsetof-like expressions I am looking at constant-folding changes that could affect these tests, so check that it emits the expected global value instead of just checking that it doesn't crash.	2021-09-28 17:57:36 +01:00
Alex Richardson	90179f2323	[NFC] Add a comment to member-function-pointer-calls.cpp Looking at this test I did not see why MinGW was using a different command line until I looked at the git history. Add a comment explaining what this RUN line is actually testing. Also add two more RUN lines to show that indirectly passed member pointers don't inhibit the optimization.	2021-09-28 17:57:36 +01:00
Alex Richardson	d5631d49d0	Drop REQUIRES: arm-registered-target from an IR-only test This works just fine even if the Arm backend is not built.	2021-09-28 17:57:36 +01:00
Alex Richardson	a28177035b	[UpdateTestChecks][NFC] Drop a python2 workaround	2021-09-28 17:57:36 +01:00
Alex Richardson	3c51b9e270	Fix incorrect GEP bitwidth in areNonOverlapSameBaseLoadAndStore() When using a datalayout that has pointer index width != pointer size this code triggers an assertion in Value::stripAndAccumulateConstantOffsets(). I encountered this this while compiling FreeBSD for CHERI-RISC-V. Also update LoadsTest.cpp to use a DataLayout with index width != pointer width to ensure this case is tested. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D110406	2021-09-28 17:57:36 +01:00
Alex Richardson	547e5e4ae6	[update_llc_test_checks.py] Fix MIPS ASM regex for functions with EH On MIPS, functions with exception handling code emits an additional temporary label at the start of the function (due to UseAssignmentForEHBegin): _Z8do_catchv: # @_Z8do_catchv .Ltmp3: .set .Lfunc_begin0, .Ltmp3 .cfi_startproc .cfi_personality 128, DW.ref.__gxx_personality_v0 .cfi_lsda 0, .Lexception0 .frame $c11,48,$c17 .mask 0x00000000,0 .fmask 0x00000000,0 .set noreorder .set nomacro .set noat # %bb.0: # %entry The `[^:]*` regex was terminating the search after .Ltmp<N>: and therefore not detecting functions with exception handling. Reviewed By: atanasyan, MaskRay Differential Revision: https://reviews.llvm.org/D100027	2021-09-28 17:57:36 +01:00
Alex Richardson	ee3109b044	[update_llc_test_checks] Baseline test for D100027 Show that we fail to generate CHECK lines for MIPS64 functions with EH. Differential Revision: https://reviews.llvm.org/D110408	2021-09-28 17:57:36 +01:00
Arthur O'Dwyer	a7d084a18d	[libc++] [compare] Rip out more vestiges of _equality. NFCI. There's really no reason to even have two different enums here, but definitely* we shouldn't have three, and they don't need so many synonymous enumerator values. Differential Revision: https://reviews.llvm.org/D110516	2021-09-28 12:23:41 -04:00
Roman Lebedev	b6b7860954	[X86][Costmodel] Load/store i16 Stride=6 VF=16 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For this tuple, measuring becomes problematic since there's a lot of spilling going on, but apparently all these memory ops do not affect worst-case estimate at all here. For load we have: https://godbolt.org/z/5qGb9odP6 - for intels `Block RThroughput: <=106.0`; for ryzens, `Block RThroughput: <=34.8` So pick cost of `106`. For store we have: https://godbolt.org/z/KrWcv4Ph7 - for intels `Block RThroughput: =58.0`; for ryzens, `Block RThroughput: <=20.5` So pick cost of `58`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110593	2021-09-28 19:15:08 +03:00
Roman Lebedev	24e42f7d28	[X86][Costmodel] Load/store i16 Stride=6 VF=8 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/3Tc5s897j - for intels `Block RThroughput: =39.0`; for ryzens, `Block RThroughput: <=13.5` So pick cost of `39`. For store we have: https://godbolt.org/z/fo1h9E67e - for intels `Block RThroughput: =21.0`; for ryzens, `Block RThroughput: <=12.0` So pick cost of `21`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110592	2021-09-28 19:15:07 +03:00
Roman Lebedev	b3011bcc78	[X86][Costmodel] Load/store i16 Stride=6 VF=4 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/1Wcaf9c7T - for intels `Block RThroughput: =9.0`; for ryzens, `Block RThroughput: <=4.5` So pick cost of `9`. For store we have: https://godbolt.org/z/1Wcaf9c7T - for intels `Block RThroughput: =15.0`; for ryzens, `Block RThroughput: <=6.0` So pick cost of `15`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110591	2021-09-28 19:15:01 +03:00
Roman Lebedev	aa93c55889	[X86][Costmodel] Load/store i16 Stride=6 VF=2 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/bhscej4WM - for intels `Block RThroughput: =13.0`; for ryzens, `Block RThroughput: <=7.0` So pick cost of `13`. For store we have: https://godbolt.org/z/Yf4Pfnxbq - for intels `Block RThroughput: =10.0`; for ryzens, `Block RThroughput: <=3.5` So pick cost of `10`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110590	2021-09-28 19:14:56 +03:00
wlei	ce40843a3f	[llvm-profgen][CSSPGO] On-demand function size computation for preinliner Similar to https://reviews.llvm.org/D110465, we can compute function size on-demand for the functions that's hit by samples. Here we leverage the raw range samples' address to compute a set of sample hit function. Then `BinarySizeContextTracker` just works on those function range for the size. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D110466	2021-09-28 09:09:38 -07:00
wlei	091c16f76b	[llvm-profgen] On-demand symbolization Previously we do symbolization for all the functions and actually we only need the symbols that's hit by the samples. This can significantly speed up the time for large size binary. Optimization for per-inliner will come along with next patch. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110465	2021-09-28 09:09:25 -07:00
Quinn Pham	70391b3468	[PowerPC] FP compare and test XL compat builtins. This patch is in a series of patches to provide builtins for compatability with the XL compiler. This patch adds builtins for compare exponent and test data class operations on floating point values. Reviewed By: #powerpc, lei Differential Revision: https://reviews.llvm.org/D109437	2021-09-28 11:01:51 -05:00
Kazu Hirata	9e4f1f9265	[SystemZ] Remove redundant declaration SystemZMnemonicSpellCheck (NFC) Note that SystemZMnemonicSpellCheck is defined in SystemZGenAsmMatcher.inc, which SystemZAsmParser.cpp includes. Identified with readability-redundant-declaration.	2021-09-28 08:38:05 -07:00
Roman Lebedev	c93da7d9cf	Revert "[CMake] Enable LLVM_ENABLE_PER_TARGET_RUNTIME_DIR by default on Linux" See original review https://reviews.llvm.org/D107799 This reverts commit `f9dbca68d4`.	2021-09-28 18:23:43 +03:00
Dmitry Vyukov	ccc83ac7c5	tsan: print a meaningful frame for stack races Depends on D110631. Differential Revision: https://reviews.llvm.org/D110632	2021-09-28 17:08:00 +02:00
Dmitry Vyukov	ade5023c54	tsan: fix tls_race3 test on darwin Darwin also needs to use __tsan_tls_initialization to pass the test. Differential Revision: https://reviews.llvm.org/D110631	2021-09-28 17:07:51 +02:00
Valentin Clement	b38c04ab7f	[fir][NFC] Rename operand of EmboxOp Rename `lenParams` to `typeparams` to be in sync with fir-dev. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110628 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-09-28 17:00:55 +02:00
Andrew Turner	993ada05f5	[lldb] [unittests] Fix building the FreeBSD arm64 Register Context test Differential Revision: https://reviews.llvm.org/D110545	2021-09-28 10:51:06 -04:00
Dmitry Vyukov	f3932ae1a0	tsan: fix cur_thread alignment Commit `354ded67b3` ("tsan: align ThreadState to cache line") did an incomplete thing. It marked ThreadState as cache line aligned, but the thread local ThreadState instance is declared as an aligned char array with hard-coded 64-byte alignment. On PowerPC cache line size is 128 bytes, so the hard-coded 64-byte alignment is not enough. Use cache line alignment consistently. Differential Revision: https://reviews.llvm.org/D110629	2021-09-28 16:49:44 +02:00
Michał Górny	86cd2369b6	[lldb] [DynamicRegisterInfo] Refactor SetRegisterInfo() Move the "slice" and "composite" handling into separate methods to avoid if/else hell. Use more LLVM types whenever possible. Replace printf()s with llvm::Error combined with LLDB logging. Differential Revision: https://reviews.llvm.org/D110619	2021-09-28 16:47:58 +02:00
David Green	fdd8c10959	[ARM] Delay reverting WLS in arm-block-placement As we have to split blocks, we may be left in an invalid loop state after a WLS is reverted to a DLS. Instead remember the WLS that could not be fixed and revert them after finishing processing all other loops. Differential Revision: https://reviews.llvm.org/D110567	2021-09-28 15:38:29 +01:00
Erich Keane	1cd3ae0198	Fix missing return from `9324cc2ca9` No idea how my local machine missed this, but I saw no warning for it, it seems to have been lost in some level of translating this back for upstreaming.	2021-09-28 07:30:06 -07:00
Jingu Kang	73a196a11c	Recommit "[AArch64] Split bitmask immediate of bitwise AND operation" This reverts the revert commit `f85d8a5bed` with bug fixes. Original message: MOVi32imm + ANDWrr ==> ANDWri + ANDWri MOVi64imm + ANDXrr ==> ANDXri + ANDXri The mov pseudo instruction could be expanded to multiple mov instructions later. In this case, try to split the constant operand of mov instruction into two bitmask immediates. It makes only two AND instructions intead of multiple mov + and instructions. Added a peephole optimization pass on MIR level to implement it. Differential Revision: https://reviews.llvm.org/D109963	2021-09-28 15:26:29 +01:00
Alexey Bataev	f701505c45	[SLP]Improve vectorization of phi nodes by trying wider vectors. Try to improve vectorization of the PHI nodes by trying to vectorize similar instructions at the size of the widest possible vectors, then aggregating with compatible type PHIs and trying to vectoriza again and only if this failed, try smaller sizes of the vector factors for compatible PHI nodes. This restores performance of several benchmarks after tuning of the fp/int conversion instructions costs. Differential Revision: https://reviews.llvm.org/D108740	2021-09-28 07:20:36 -07:00
Sjoerd Meijer	0ea77502e2	[LoopFlatten] Updating Phi nodes after IV widening In rG6a076fa9539e, a problem with updating the old/narrow phi nodes after IV widening was introduced. If after widening of the IV the transformation is not applied, the narrow phi node was incorrectly modified, which should only happen if flattening happens. This can be seen in the added test widen-iv2.ll, which incorrectly had 1 incoming value, but should have its original 2 incoming values, which is now restored. Differential Revision: https://reviews.llvm.org/D110234	2021-09-28 15:09:20 +01:00
serge-sans-paille	bd379915de	Refine the constraint for isInlineBuiltinDeclaration Require it to be always_inline, to more closely match how _FORITFY_SOURCE behaves. This avoids generation of `.inline` suffixed functions - these should always be inlined.	2021-09-28 16:07:33 +02:00
Sanjay Patel	1f8bead678	[InstCombine] reduce code for swapped predicate; NFC	2021-09-28 10:00:35 -04:00
Sanjay Patel	1a1aed8da8	[InstCombine] add tests for icmp-gep; NFC We need more coverage for commuted and (un)signed preds to verify that things behave as expected here. Currently, we do not transform signed preds or non-inbounds geps.	2021-09-28 10:00:35 -04:00
David Green	2c53215e99	[ARM] Skip debug info in recomputeVPTBlockMask The ARMLowOverheadLoops pass recalculates VPT block masks when it converts VCMP's inside VPT blocks into VPT's. The function to do so doesn't seem to handle debug info though, leading to invalid block creation or asserts at compile time. Make sure the function skips any debug info between the MVE instructions it inspects. Differential Revision: https://reviews.llvm.org/D110564	2021-09-28 14:58:13 +01:00
Erich Keane	9324cc2ca9	Change __builtin_sycl_unique_stable_name to just use an Itanium mangling After significant problems in our downstream with the previous implementation, the SYCL standard has opted to make using macros/etc to change kernel-naming-lambdas in any way UB (even passively). As a result, we are able to just emit the itanium mangling. However, this DOES require a little work in the CXXABI, as the microsoft and itanium mangler use different numbering schemes for lambdas. This patch adds a pair of mangling contexts that use the normal 'itanium' mangling strategy to fill in the "DeviceManglingNumber" used previously by CUDA. Differential Revision: https://reviews.llvm.org/D110281	2021-09-28 06:41:03 -07:00
Sanjay Patel	72d991c42e	[InstCombine] add/move tests for icmp with gep operand(s); NFC	2021-09-28 09:40:52 -04:00
Bjorn Pettersson	460efc1fb8	[Analysis] Be defensive when matching size_t in lib call signatures When TargetLibraryInfoImpl::isValidProtoForLibFunc is checking function signatures to detect lib calls it may check that a parameter or return value matches with the "size_t" type. For this to work it has to derive the IR type matching with "size_t". Depending on if a DataLayout is provided or not, this has been done in two different way. Either a more strict check being based on IntPtrType (which is given by the DataLayout) or a more relaxed check assuming that any integer type matches with "size_t". Given that the stricter approach exist it seems like we do not want to trigger rewrites etc if we aren't sure that a function calls actually match with the library function. Therefore it was questioned why we actually have the more relaxed approach when not being able to derive an IR type for "size_t". This patch will take a more defensive approach, requiring that a DataLayout is passed to isValidProtoForLibFunc. Differential Revision: https://reviews.llvm.org/D110584	2021-09-28 15:29:37 +02:00
Bjorn Pettersson	1f5ea14bca	[Analysis] Add FIXME:s related to size_t type checks Differential Revision: https://reviews.llvm.org/D110583	2021-09-28 15:29:37 +02:00
hyeongyu kim	86bf234d0b	[IR] Change the default value of InstertElement to poison (1/4) This patch is for fixing potential insertElement-related bugs like D93818. ``` V = UndefValue::get(VecTy); for(...) V = Builder.CreateInsertElementy(V, Elt, Idx); => V = PoisonValue::get(VecTy); for(...) V = Builder.CreateInsertElementy(V, Elt, Idx); ``` Like above, this patch changes the placeholder V to poison. The patch will be separated into several commits. Reviewed By: aqjune Differential Revision: https://reviews.llvm.org/D110311	2021-09-28 22:29:16 +09:00
Alexey Bataev	8bacfb9bed	[SLP]No need to schedule/check parent for extract{element/value} instruction. The instruction extractelement/extractvalue are not required to be scheduled since they only depend on the source vector/aggregate (with constant indices), smae applies to the parent basic block checks. Improves compile time and saves scheduling budget. Differential Revision: https://reviews.llvm.org/D108703	2021-09-28 06:13:55 -07:00
Erich Keane	45e75d9382	Update the message for template-template param keyword for C++17 C++17 permits using 'typename' or 'class' for a template template parameter, but the error message in the parser only refers to 'class'. This patch, in C++17 or newer modes, adds "or 'template'" to the diagnostic.	2021-09-28 06:02:29 -07:00
Pavel Labath	7866dbb261	[lldb/test] Remove a check from TestLoadAfterAttach The two module retrieval methods (qXfer:libraries-svr4 and manual list traversal) differ in how the handle the manually-added-but-not-yet-loaded modules. The svr4 path will remove it, while the manual one will keep in the list. It's likely the two paths need ought to be synchronized, but right now, this distinction is not relevant for the test.	2021-09-28 14:47:46 +02:00
Max Kazantsev	00be84f910	Recommit "[Test] Add more tests with cycled phis"	2021-09-28 19:36:47 +07:00
Diana Picus	37089bae29	Reland "[flang] GET_COMMAND_ARGUMENT runtime implementation" Recommit https://reviews.llvm.org/D109813 and https://reviews.llvm.org/D109814. This implements the second and final entry point for GET_COMMAND_ARGUMENT, handling the VALUE, STATUS and ERRMSG parameters. It has a small fix in that we're now using memcpy instead of strncpy (which was a bad idea to begin with, since we're not actually interested in a string copy).	2021-09-28 12:34:07 +00:00
Max Kazantsev	a91145f75a	Revert "[Test] Add more tests with cycled phis" This reverts commit `7128a545b3`. Need to regenerate tests after rebase.	2021-09-28 19:32:26 +07:00
Jingu Kang	f85d8a5bed	Revert "[AArch64] Split bitmask immediate of bitwise AND operation" This reverts commit `864b206796`. Reverting due to error on buildbots.	2021-09-28 13:28:09 +01:00
Frederic Cambus	5b125a49ba	[CMake] Add detection for the mold linker in AddLLVM.cmake. mold says it is compatible with GNU ld and gold linkers: ``` $ mold -v mold 0.9.5 (compatible with GNU ld and GNU gold) ``` And thus it currently gets detected as Gold. With the following diff, CMake now correctly reports the linker name, and mold keeps being identified as Gold internally for now. Reviewed By: ldionne, MaskRay Differential Revision: https://reviews.llvm.org/D110035	2021-09-28 17:46:52 +05:30
Pavel Labath	9413ead7bc	[lldb/test] Add ability to specify environment when spawning processes We only had that ability for regular debugger launches. This meant that it was not possible to use the normal dlopen patterns in attach tests. This fixes that.	2021-09-28 14:13:50 +02:00
Pavel Labath	156cb4cc64	[lldb] Remove non-stop mode code We added some support for this mode back in 2015, but the feature was never productionized. It is completely untested, and there are known major structural lldb issues that need to be resolved before this feature can really be supported. It also complicates making further changes to stop reply packet handling, which is what I am about to do. Differential Revision: https://reviews.llvm.org/D110553	2021-09-28 14:13:50 +02:00
Diana Picus	6359a4cdbf	Revert "[flang] GET_COMMAND_ARGUMENT(VALUE) runtime implementation" This reverts commit `0446f1299f` and `df6302311f`. There's a warning on flang-aarch64-latest-gcc related to strncpy using the result of strlen as a bound. I'll recommit with a fix.	2021-09-28 12:07:27 +00:00
Max Kazantsev	7128a545b3	[Test] Add more tests with cycled phis	2021-09-28 19:04:12 +07:00

1 2 3 4 5 ...

400236 Commits All Branches Search

400236 Commits

All Branches