llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	7de2add829	[InstSimplify] add tests for logic-of-cmps with not op; NFC One variant of this is shown in: https://llvm.org/PR47858	2020-10-25 11:13:30 -04:00
Melanie Blower	576d436c82	Correct LIT test failure detected on buildbot after mibintc committed rG2e204e23911b: [clang] Enable support for #pragma STDC FENV_ACCESS D87528	2020-10-25 08:10:34 -07:00
Florian Hahn	968aa6b917	[SLP] Add AArch64 tests with vectorizable compare/select patterns. This patch adds an additional set of tests that can be vectorized efficiently on AArch64, using CMxx & BFI.	2020-10-25 15:08:30 +00:00
Simon Pilgrim	d64ea0f189	Remove superfluous whitespace around if(). NFC.	2020-10-25 14:38:16 +00:00
Melanie Blower	2e204e2391	[clang] Enable support for #pragma STDC FENV_ACCESS Reviewers: rjmccall, rsmith, sepavloff Differential Revision: https://reviews.llvm.org/D87528	2020-10-25 06:46:25 -07:00
Simon Pilgrim	3052e474ec	[InstCombine] matchBSwapOrBitReversem - recognise or(fshl(),fshl()) bswap patterns. I'm not certain InstCombinerImpl::matchBSwapOrBitReverse needs to filter the or(op0(),op1()) ops - there are just too many cases that recognizeBSwapOrBitReverseIdiom/collectBitParts handle now (and quickly).	2020-10-25 10:17:45 +00:00
Simon Pilgrim	5e9f172295	[InstCombine] Add test for or(fshl(),fshl()) bswap pattern. Currently InstCombinerImpl::matchBSwapOrBitReverse won't match starting from funnel shifts.	2020-10-25 10:07:19 +00:00
Richard Smith	f81f09ba89	[c++20] For P0732R2: Support string literal operator templates.	2020-10-25 00:34:15 -07:00
Craig Topper	a222d832d5	[X86] Use TargetConstant for FPDiff with X86::TC_RETURN. It's required to be a constant and can never be in a register so make it explicit.	2020-10-25 00:29:11 -07:00
Martin Storsjö	1c8371692d	Revert "[CostModel] remove cost-kind predicate for vector reduction costs" This reverts commit `22d10b8ab4`. This broke compilation e.g. like this: $ cat synth.c a; float b; c() { for (;;) { float d = -b a++; d -= --b * a++; d -= --b * a; d -= --b * a; e(d); } } $ clang -target x86_64-linux-gnu -c -O2 -ffast-math synth.c clang: ../include/llvm/Support/Casting.h:104: static bool llvm::isa_impl _cl<To, const From>::doit(const From*) [with To = llvm::PointerType; Fr om = llvm::Type]: Assertion `Val && "isa<> used on a null pointer"' fail ed.	2020-10-25 08:47:54 +02:00
Teresa Johnson	13c62ce99a	[MemProf] Temporarily disable part of test Disable the part of this test that started failing only on the llvm-avr-linux bot after `5c20d7db9f`. Unfortunately, "XFAIL: avr" does not work. Still in the process of trying to figure out how to debug.	2020-10-24 23:07:34 -07:00
Richard Smith	7b3515880c	For P0732R2, P1907R1: ensure that template parameter objects don't refer to disallowed objects or have non-constant destruction.	2020-10-24 22:11:43 -07:00
Nathan Ridge	aaa8b44d19	[clangd] Add a TestWorkspace utility TestWorkspace allows easily writing tests involving multiple files that can have inclusion relationships between them. BackgroundIndexTest.RelationsMultiFile is refactored to use TestWorkspace, and moved to FileIndexTest as it no longer depends on BackgroundIndex. Differential Revision: https://reviews.llvm.org/D89297	2020-10-24 20:15:17 -04:00
Arthur Eubanks	c039e83a2c	Fix typo SSC -> SCC	2020-10-24 16:26:48 -07:00
Fangrui Song	398b81067c	[ELF] Don't crash on R_X86_64_GOTPCRELX for test/binop instructions While MC did not produce R_X86_64_GOTPCRELX for test/binop instructions (movl/adcl/addl/andl/...) before the previous commit, this code path has been exercised by -fno-integrated-as for GNU as since 2016: -no-pie relaxing may incorrectly access loc[-3] and produce a corrupted instruction. Simply handle test/binop R_X86_64_GOTPCRELX like R_X86_64_GOTPCREL.	2020-10-24 15:14:17 -07:00
Fangrui Song	f04d92af94	[X86] Produce R_X86_64_GOTPCRELX for test/binop instructions (MOV32rm/TEST32rm/...) when -Wa,-mrelax-relocations=yes is enabled We have been producing R_X86_64_REX_GOTPCRELX (MOV64rm/TEST64rm/...) and R_X86_64_GOTPCRELX for CALL64m/JMP64m without the REX prefix since 2016 (to be consistent with GNU as), but not for MOV32rm/TEST32rm/...	2020-10-24 15:14:17 -07:00
Drew Fisher	1e09dbb6a9	[asan] Fix stack-use-after-free checks on non-main thread on Fuchsia While some platforms call `AsanThread::Init()` from the context of the thread being started, others (like Fuchsia) call `AsanThread::Init()` from the context of the thread spawning a child. Since `AsyncSignalSafeLazyInitFakeStack` writes to a thread-local, we need to avoid calling it from the spawning thread on Fuchsia. Skipping the call here on Fuchsia is fine; it'll get called from the new thread lazily on first attempted access. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D89607	2020-10-24 14:29:32 -07:00
Drew Fisher	29480c6c74	[asan][fuchsia] set current thread before reading thread state When enabling stack use-after-free detection, we discovered that we read the thread ID on the main thread while it is still set to 2^24-1. This patch moves our call to AsanThread::Init() out of CreateAsanThread, so that we can call SetCurrentThread first on the main thread. Reviewed By: mcgrathr Differential Revision: https://reviews.llvm.org/D89606	2020-10-24 14:23:09 -07:00
Fangrui Song	d5adadb3a5	[AArch64][GlobalISel] Fix -Wunused-variable. NFC	2020-10-24 12:47:11 -07:00
Nico Weber	6f9d84bb26	Revert "hwasan: Disable operator {new,delete} interceptors when interceptors are disabled." This reverts commit `fa66bcf4bc`. Seems to break tests, see https://reviews.llvm.org/D89827#2351930	2020-10-24 15:04:22 -04:00
Sanjay Patel	22d10b8ab4	[CostModel] remove cost-kind predicate for vector reduction costs This is similar in spirit to `01ea93d85d` (memcpy) except that here the underlying caller assumptions were created for vectorizer use (throughput) rather than other passes. That meant targets could have an enormous throughput cost with no corresponding size, latency, or blended cost increase. The ARM costs show a small difference between throughput and size because there's an underlying difference in cmp/sel costs that is also predicated on cost-kind. Paraphrasing from the previous commits: This may not make sense for some callers, but at least now the costs will be consistently wrong instead of mysteriously wrong. Targets should provide better overrides if the current modeling is not accurate.	2020-10-24 13:20:17 -04:00
Benjamin Kramer	39a0d6889d	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00
Benjamin Kramer	bd2cf96c09	[X86] Add a stub for znver3 based on the little public information there is in AMD's manuals No scheduling, no autodetection. Just enough so -march=znver3 works.	2020-10-24 19:01:22 +02:00
Benjamin Kramer	b8d2b6f6cf	Unbreak the clang-interpreter example after `0aec49c853`	2020-10-24 19:01:21 +02:00
dfukalov	9068c20965	[AMDGPU][CostModel] Refine cost model for half- and quarter-rate instructions. 1. Throughput and codesize costs estimations was separated and updated. 2. Updated fdiv cost estimation for different cases. 3. Added scalarization processing for types that are treated as !isSimple() to improve codesize estimation in getArithmeticInstrCost() and getArithmeticInstrCost(). The code was borrowed from TCK_RecipThroughput path of base implementation. Next step is unify scalarization part in base class that is currently works for TCK_RecipThroughput path only. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D89973	2020-10-24 19:53:08 +03:00
David Green	92205bf122	[ARM] Remove some dead code. NFC	2020-10-24 17:22:49 +01:00
Andrzej Warzynski	cbb7f1420b	[flang][tests] Fix Python bug in the lit config Without this change LIT tests for Flang fail with: ``` TypeError: append() takes exactly one argument (2 given) ```	2020-10-24 17:04:25 +01:00
Stefan Gränitz	66abe650ff	Reapply "[jitlink][ELF] Add zero-fill blocks for symbols in section SHN_COMMON" Root cause of the test failure was fixed with: [JITLink][ELF] PCRel32GOTLoad edge offset can be smaller three This reverts commit `10b1a61baf`.	2020-10-24 16:58:06 +02:00
Stefan Gränitz	b6ef40891c	[JITLink][ELF] PCRel32GOTLoad edge offset can be smaller three Offset is 2 for MOVL instruction in test ELF_x86-64_common. This should fix the test failures. Differential Revision: https://reviews.llvm.org/D89795	2020-10-24 16:57:48 +02:00
Caroline Concatto	4c5906cffd	[Flang][Driver] Add infrastructure for basic frontend actions and file I/O This patch introduces the dependencies required to read and manage input files provided by the command line option. It also adds the infrastructure to create and write to output files. The output is sent to either stdout or a file (specified with the `-o` flag). Separately, in order to be able to test the code for file I/O, it adds infrastructure to create frontend actions. As a basic testable example, it adds the `InputOutputTest` FrontendAction. The sole purpose of this action is to read a file from the command line and print it either to stdout or the output file. This action is run by using the `-test-io` flag also introduced in this patch (available for `flang-new` and `flang-new -fc1`). With this patch: ``` flang-new -test-io input-file.f90 ``` will read input-file.f90 and print it in the output file. The `InputOutputTest` frontend action has been introduced primarily to facilitate testing. It is hidden from users (i.e. it's only displayed with `--help-hidden`). Currently Clang doesn’t have an equivalent action. `-test-io` is used to trigger the InputOutputTest action in the Flang frontend driver. This patch makes sure that “flang-new” forwards it to “flang-new -fc1" by creating a preprocessor job. However, in Flang.cpp, `-test-io` is passed to “flang-new -fc1” without `-E`. This way we make sure that the preprocessor is _not_ run in the frontend driver. This is the desired behaviour: `-test-io` should only read the input file and print it to the output stream. co-authored-by: Andrzej Warzynski <andrzej.warzynski@arm.com> Differential Revision: https://reviews.llvm.org/D87989	2020-10-24 14:58:32 +01:00
TaWeiTu	65a36bbc3d	[NPM] Port -loop-versioning-licm to NPM Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D89371	2020-10-24 21:51:18 +08:00
Stefan Gränitz	10b1a61baf	Revert "[jitlink][ELF] Add zero-fill blocks for symbols in section SHN_COMMON" This reverts commit `e9955b0843`. Cannot reproduce the buildbot failures yet. Reverting in the meantime.	2020-10-24 15:43:06 +02:00
TaWeiTu	060a4fccf1	[LoopVersioning] Form dedicated exits for versioned loop to preserve simplify form The exit blocks of the versioned and non-versioned loops are not dedicated and thus the two loops are not in simplify form. Insert dummy exit blocks after loop versioning with `formDedicatedExits()` to preserve the simplify form for subsequence passes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D89569	2020-10-24 21:40:46 +08:00
Stefan Gränitz	e9955b0843	[jitlink][ELF] Add zero-fill blocks for symbols in section SHN_COMMON Symbols with special section index SHN_COMMON (0xfff2) haven't been handled so far and caused an invalid section error. This is a more or less straightforward use of the code commented out at the end of the function. I checked with the ELF spec, that the symbol value gives the alignment. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D89795	2020-10-24 14:54:38 +02:00
Stefan Gränitz	138b9f1928	[JITLink][ELF] PCRel32GOTLoad relocations are resolved like regular PCRel32 ones The difference is that the former are indirect and go to the GOT while the latter go to the target directly. This info can be used to relax indirect ones that don't need the GOT (because the target is in range). We check for this optimization beforehand. For formal correctness and to avoid confusion, we should only change the relocation kind if we actually apply the relaxation.	2020-10-24 14:54:38 +02:00
Simon Pilgrim	b481e00bf4	Fix some signed/unsigned comparison gcc warnings from D87930	2020-10-24 12:51:51 +01:00
Simon Pilgrim	310f62b4ff	[InstCombine] narrowFunnelShift - fold trunc/zext or(shl(a,x),lshr(b,sub(bw,x))) -> fshl(a,b,x) (PR35155) As discussed on PR35155, this extends narrowFunnelShift (recently renamed from narrowRotate) to support basic funnel shift patterns. Unlike matchFunnelShift we don't include the computeKnownBits limitation as extracting the pattern from the zext/trunc layers should be a indicator of reasonable funnel shift codegen, in D89139 we demonstrated how to efficiently promote funnel shifts to wider types. Differential Revision: https://reviews.llvm.org/D89542	2020-10-24 12:42:43 +01:00
Simon Pilgrim	ce356e1546	[DAG] Add BuildVectorSDNode::getRepeatedSequence helper to recognise multi-element splat patterns Replace the X86 specific isSplatZeroExtended helper with a generic BuildVectorSDNode method. I've just used this to simplify the X86ISD::BROADCASTM lowering so far (and remove isSplatZeroExtended), but we should be able to use this in more places to lower to complex broadcast patterns. Differential Revision: https://reviews.llvm.org/D87930	2020-10-24 12:23:09 +01:00
Simon Pilgrim	62b17a7697	[LegalizeTypes] Legalize vector rotate operations Lower vector rotate operations as long as the legalization occurs outside of LegalizeVectorOps. This fixes https://bugs.llvm.org/show_bug.cgi?id=47320 Patch By: @rsanthir.quic (Ryan Santhirarajan) Differential Revision: https://reviews.llvm.org/D89497	2020-10-24 11:30:32 +01:00
Nikita Popov	1a7a9efec3	[BasicAA] Avoid duplicate cache lookup (NFCI) Rather than performing the cache lookup with both possible orders for the locations, use the same canonicalization as the other AliasCache lookups in BasicAA.	2020-10-24 10:19:02 +02:00
Nikita Popov	d09c592142	[BasicAA] Fix caching in the presence of phi cycles Any time we insert a block into VisitedPhiBBs, previously cached values may no longer be valid for the recursive alias queries. As such, perform them using an empty AAQueryInfo. Note that if we recurse to the same phi, the block will already be inserted, so we reuse the old AAQueryInfo, and thus still protect against infinite recursion. This problem can appear with with an without BatchAA, but is more likely to occur with BatchAA, as more values are cached. Differential Revision: https://reviews.llvm.org/D90066	2020-10-24 09:58:02 +02:00
Jonas Paulsson	7c026a83ee	[SystemZ] Define MaxInstLength to have the value of 6. This value had the default value of 4 which caused branch relaxation to fail. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D90065	2020-10-24 09:19:34 +02:00
Michał Górny	d96cb52830	[lldb] [Process/NetBSD] Use XStateRegSet for all FPU registers Unify the x86 regset API to use XStateRegSet for all FPU registers, therefore eliminating the legacy API based on FPRegSet. This makes the code a little bit simpler but most notably, it provides future compatibility for register caching. Since the NetBSD kernel takes care of providing compatibility with pre-XSAVE processors, PT_{G,S}ETXSTATE can be used on systems supporting only FXSAVE or even plain FSAVE (and unlike PT_{G,S}ETXMMREGS, it clearly indicates that XMM registers are not supported). Differential Revision: https://reviews.llvm.org/D90034	2020-10-24 09:17:53 +02:00
Martin Storsjö	84ce6b9991	[lldb] Fix building with GCC 7. NFC.	2020-10-24 09:33:01 +03:00
Tony	bf6518a806	[AMDGPU] Cleanup AMDGPUUsage.rst - Layout and typo improvements. - Add memory spaces section. - reStructure syntax fixes. Differential Revision: https://reviews.llvm.org/D90002	2020-10-24 06:21:27 +00:00
Michael Kruse	d590c85430	[flang] Fix pimpl idiom for IntrinsicProcTable. The class IntrinsicProcTable uses the pimpl idiom and manages its own pointer-to-implementation. However, it violates the rule-of-five and does not implement a move-constructor or assignment-operator. Due to differences between compilers in implementation copy elision, these may or may not be used. Due to the missing user implementation for resource handling, using the results in runtime errors. Fix my using `std::unique_ptr` instead of custom resource management. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D88794	2020-10-24 00:28:05 -05:00
Med Ismail Bennani	64c4dac60e	[llvm/DebugInfo] Emit DW_OP_implicit_value when tuning for LLDB This patch enables emitting DWARF `DW_OP_implicit_value` opcode when tuning debug information for LLDB (`-debugger-tune=lldb`). This will also propagate to Darwin platforms, since they use LLDB tuning as a default. rdar://67406059 Differential Revision: https://reviews.llvm.org/D90001 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2020-10-24 06:45:33 +02:00
Vitaly Buka	21d64c32ec	[NFC][UBSAN] Refine CHECK pattern in test As-is it was failed by unrelated linker warning with filename in the output.	2020-10-23 21:11:03 -07:00
Peter Collingbourne	fa66bcf4bc	hwasan: Disable operator {new,delete} interceptors when interceptors are disabled. Differential Revision: https://reviews.llvm.org/D89827	2020-10-23 21:03:47 -07:00
Michael Kruse	0b671a44ad	[flang][msvc] Fix lambda capture ambiguity. NFC. Patch D88695 introduces a new local variable inside a lambda with the same name as a variable outside of it. In some of the if constexpr regions, msvc prioritizes the outer declaration and emits the error. ``` C:\Users\meinersbur\src\llvm-project\flang\lib\Evaluate\fold-implementation.h(1200): error C3493: 'context' cannot be implicitly captured because no default capture mode has been specified ``` This is fixed by giving the inner variable a different name. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D89367	2020-10-23 22:58:40 -05:00

1 2 3 4 5 ...

370084 Commits All Branches Search

370084 Commits

All Branches