llvm-project

Commit Graph

Author	SHA1	Message	Date
zero9178	7f9d5d6e44	[Driver][Windows] Support per-target runtimes dir layout for profile instr generate When targeting a MSVC triple, --dependant-libs with the name of the clang runtime library for profiling is added to the command line args. In it's current implementations clang_rt.profile-<ARCH> is chosen as the name. When building a distribution using LLVM_ENABLE_PER_TARGET_RUNTIME_DIR this fails, due to the runtime file names not having an architecture suffix in the filename. This patch refactors getCompilerRT and getCompilerRTBasename to always consider per-target runtime directories. getCompilerRTBasename now simply returns the filename component of the path found by getCompilerRT Differential Revision: https://reviews.llvm.org/D96638	2021-02-23 22:35:19 +01:00
Jorge Gorbe Moya	979ca1c05f	Defer the decision whether to use the CU or TU index until after reading the unit header. In DWARF v4 compile units go in .debug_info and type units go in .debug_types. However, in v5 both kinds of units are in .debug_info. Therefore we can't decide whether to use the CU or TU index just by looking at which section we're reading from. We have to wait until we have read the unit type from the header. Differential Revision: https://reviews.llvm.org/D96194	2021-02-23 13:26:11 -08:00
Aart Bik	17fa919847	[mlir][sparse] incorporate vector index into address computation When computing dense address, a vectorized index must be accounted for properly. This bug was formerly undetected because we get 0 * prev + i in most cases, which folds away the scalar part. Now it works for all cases. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97317	2021-02-23 13:25:51 -08:00
Eric Schweitz	6740694742	[flang][fir][NFC] remove dead code Removes unused function from FatalError.h. Differential revision: https://reviews.llvm.org/D97328	2021-02-23 13:21:15 -08:00
Matthew Voss	6da7d31416	[llvm-profdata] Emit Error when Invalid MemOpSize Section is Created by llvm-profdata Under certain (currently unknown) conditions, llvm-profdata is outputting profiles that have two consecutive entries in the MemOPSize section for the value 0. This causes the PGOMemOPSizeOpt pass to output an invalid switch instruction with two cases for 0. As mentioned, we’re not quite sure what’s causing this to happen, but this patch prevents llvm-profdata from outputting a profile that has this problem and gives an error with a request for a reproducible. Differential Revision: https://reviews.llvm.org/D92074	2021-02-23 12:51:54 -08:00
David Green	f51b3de4e8	[AArch64] Introduce UDOT/SDOT DAG nodes This is used to lower UDOT/SDOT instructions, as opposed to relying on the intrinsic. Subsequent optimizations will be able to optimize them more cleanly based on these nodes.	2021-02-23 20:31:01 +00:00
Lang Hames	479db97a34	Revert "[docs][ORC] Fix section title and reference." This reverts commit `6e1affe71c`, which caused an error on the Sphinx doc bot.	2021-02-24 07:27:39 +11:00
Craig Topper	5e233ff144	[RISCV] Use a different constant in one of the smulo test cases to avoid converting the mul to an add.	2021-02-23 12:17:49 -08:00
Jessica Paquette	ef1f7f1d7d	Recommit "[AArch64][GlobalISel] Match G_SHUFFLE_VECTOR -> insert elt + extract elt" Attempted fix for the added test failing. https://lab.llvm.org/buildbot/#/builders/104/builds/2355/steps/5/logs/stdio I can't reproduce the failure anywhere, so I'm going to guess that passing a std::function as MatchInfo is sketchy in this context. Switch it to a std::tuple and hope for the best.	2021-02-23 11:55:16 -08:00
Amara Emerson	939b5ce734	[AArch64][GlobalISel] Lower G_USUBSAT and G_UADDSAT for scalars. We have some missing optimization counterparts to LowerXALUO, but it's a start.	2021-02-23 11:54:52 -08:00
Florian Hahn	fd03e359dd	[AArch64] Regenerate check lines for neon-compare-instructions.ll. Auto-generate tests so they can be updated more easily, e.g. for D97303.	2021-02-23 19:39:25 +00:00
Andrei Elovikov	3605b873f6	[NFC][VPlan] Use VPUser to store block's predicate Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D96529	2021-02-23 11:08:27 -08:00
Florian Hahn	de40423c85	[LV] Ensure fixNonInductionPHIs uses a valid insertion point. In some cases, Builder's insertion point may be invalidated before using it in VPTransformState::get. Make sure the insertion point is up-to-date. This should fix various sanitizer errors, like https://lab.llvm.org/buildbot/#/builders/5/builds/4933/steps/9/logs/stdio	2021-02-23 18:51:05 +00:00
Nathan James	2af5275f72	[clang-tidy] Add cppcoreguidelines-prefer-member-initializer to ReleaseNotes Following a discussion about the current state of this check on the 12.X branch, it was decided to purge the check as it wasn't in a fit to release state, see https://llvm.org/PR49318. This check has since had some of those issues addressed and should be good for the next release cycle now, pending any more bug reports about it. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D97275	2021-02-23 18:29:22 +00:00
Simon Pilgrim	1020d16156	[InstSimplify] Handle nsw shl -> poison patterns Pulled out from D90479 - this recognises invalid nsw shl patterns with signbit changes that result in poison. Differential Revision: https://reviews.llvm.org/D97305	2021-02-23 18:26:56 +00:00
Stanislav Mekhanoshin	d1b92c91af	[AMDGPU] Set threshold for regbanks reassign pass This is to limit compile time. I did experiments with some inputs and found that compile time keeps reasonable for this pass if we have less than 100000 virtual registers and then starts to explode somewhere between 100000 and 150000. Differential Revision: https://reviews.llvm.org/D97218	2021-02-23 10:22:31 -08:00
Andrzej Warzynski	5e54bef4d2	[flang][test] Share all driver test dirs between `f18` and `flang-new` Originally, when we added the new driver, we created dedicated test directories for `flang-new`. This way we separated the tests for the `throwaway` and the new driver. As we are increasing test coverage and starting to share tests between the two drivers, it makes sense to share all directories and instead rely on: ``` ! REQUIRES: new-flang-driver ``` to mark tests as exclusively for the new driver. Differential Revision: https://reviews.llvm.org/D97207	2021-02-23 18:21:32 +00:00
Shilei Tian	f6c2984a09	[OpenMP][NVPTX] Fixed a compilation error in deviceRTLs caused by unsupported feature in release verion of LLVM `ptx71` is not supported in release version of LLVM yet. As a result, the support of CUDA 11.2 and CUDA 11.1 caused a compilation error as mentioned in D97004. Since the support in D97004 is just a WA for releease, and we'll not use it in the near future, using `ptx70` for CUDA 11 is feasible. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97195	2021-02-23 13:20:21 -05:00
Adam Straw	af8adea155	make Affine parallel and yield ops MemRefsNormalizable Affine parallel ops may contain and yield results from MemRefsNormalizable ops in the loop body. Thus, both affine.parallel and affine.yield should have the MemRefsNormalizable trait. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96821	2021-02-23 10:16:47 -08:00
Simon Pilgrim	18b9fc48f1	[InstructionSimplify] SimplifyShift - rename shift amount KnownBits. NFCI. As suggested on D97305.	2021-02-23 18:12:59 +00:00
Duncan P. N. Exon Smith	64d8c7818d	Revert "Module: Use FileEntryRef and DirectoryEntryRef in Umbrella, Header, and DirectoryName, NFC" This (mostly) reverts `32c501dd88`. Hit a case where this causes a behaviour change, perhaps the same root cause that triggered the revert of `a40db5502b` in `7799ef7121`. (The API changes in DirectoryEntry.h have NOT been reverted as a number of subsequent commits depend on those.) https://reviews.llvm.org/D90497#2582166	2021-02-23 09:57:28 -08:00
Craig Topper	eb165090bb	[LegalizeIntegerTypes] Improve ExpandIntRes_SADDSUBO codegen on targets without SADDO/SSUBO. This code creates 3 setccs that need to be expanded. It was creating a sign bit test as setge X, 0 which is non-canonical. Canonical would be setgt X, -1. This misses the special case in IntegerExpandSetCCOperands for sign bit tests that assumes canonical form. If we don't hit this special case we end up with a multipart setcc instead of just checking the sign of the high part. To fix this I've reversed the polarity of all of the setccs to setlt X, 0 which is canonical. The rest of the logic should still work. This seems to produce better code on RISCV which lacks a setgt instruction. This probably still isn't the best code sequence we could use here. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D97181	2021-02-23 09:40:32 -08:00
Nick Desaulniers	1e204ac789	[THUMB2] add .w suffixes for ldr/str (immediate) T4 The Linux kernel when built with CONFIG_THUMB2_KERNEL makes use of these instructions with immediate operands and wide encodings. These are the T4 variants of the follow sections from the Arm ARM. F5.1.72 LDR (immediate) F5.1.229 STR (immediate) I wasn't able to represent these simple aliases using t2InstAlias due to the Constraints on the non-suffixed existing instructions, which results in some manual parsing logic needing to be added. F1.2 Standard assembler syntax fields describes the use of the .w (wide) vs .n (narrow) encoding suffix. Link: https://bugs.llvm.org/show_bug.cgi?id=49118 Link: https://github.com/ClangBuiltLinux/linux/issues/1296 Reported-by: Stefan Agner <stefan@agner.ch> Reported-by: Arnd Bergmann <arnd@kernel.org> Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D96632	2021-02-23 09:25:40 -08:00
Emily Shi	956c90d347	[darwin] use new crash reporter api Add support for the new crash reporter api if the headers are available. Falls back to the old API if they are not available. This change was based on [[ `0164d546d2/llvm/lib/Support/PrettyStackTrace.cpp (L111)` \| /llvm/lib/Support/PrettyStackTrace.cpp ]] There is a lit for this behavior here: https://reviews.llvm.org/D96737 but is not included in this diff because it is potentially flaky. rdar://69767688 Reviewed By: delcypher, yln Commited by Dan Liew on behalf of Emily Shi. Differential Revision: https://reviews.llvm.org/D96830	2021-02-23 09:23:23 -08:00
Emily Shi	b6099fa515	[darwin][asan] add test for application specific information in crash logs Added a lit test that finds its corresponding crash log and checks to make sure it has asn output under `Application Specific Information`. This required adding two python commands: - `get_pid_from_output`: takes the output from the asan instrumentation and parses out the process ID - `print_crashreport_for_pid`: takes in the pid of the process and the file name of the binary that was run and prints the contents of the corresponding crash log. This test was added in preparation for changing the integration with crash reporter from the old api to the new api, which is implemented in a subsequent commit. rdar://69767688 Reviewed By: delcypher Commited by Dan Liew on behalf of Emily Shi. Differential Revision: https://reviews.llvm.org/D96737	2021-02-23 09:22:11 -08:00
Jay Foad	a6be26710b	[GlobalISel] Make more use of replaceSingleDefInstWithReg. NFC.	2021-02-23 17:08:34 +00:00
Dave Lee	0ac42fd26d	[lldb] Add deref support and tests to shared_ptr synthetic Add `frame variable` dereference suppport to libc++ `std::shared_ptr`. This change allows for commands like `v *thing_sp` and `v thing_sp->m_id`. These commands now work the same way they do with raw pointers. This is done by adding an unaccounted for child member named `$$dereference$$`. Also, add API tests for `std::shared_ptr`, previously there were none. Differential Revision: https://reviews.llvm.org/D97165	2021-02-23 09:03:46 -08:00
Florian Hahn	437f0bbcd5	Revert "[LV] Allow tryToCreateWidenRecipe to return a VPValue, use for blends." This reverts commit `4efa097eb4`, because some the compilers used for some bots do not support automatic conversions to PointerUnion.	2021-02-23 16:57:21 +00:00
Florian Hahn	4efa097eb4	[LV] Allow tryToCreateWidenRecipe to return a VPValue, use for blends. Generalize the return value of tryToCreateWidenRecipe to return either a newly create recipe or an existing VPValue. Use this to avoid creating unnecessary VPBlendRecipes. Fixes PR44800.	2021-02-23 16:52:03 +00:00
Nicolai Hähnle	52bc2e7577	[AMDGPU][SelectionDAG] Don't combine uniform multiplies to MUL_[UI]24 Prefer to keep uniform (non-divergent) multiplies on the scalar ALU when possible. This significantly improves some game cases by eliminating v_readfirstlane instructions when the result feeds into a scalar operation, like the address calculation for a scalar load or store. Since isDivergent is only an approximation of whether a value is in SGPRs, it can potentially regress some situations where a uniform value ends up in a VGPR. These should be rare in real code, although the test changes do contain a number of examples. Most of the test changes are just using s_mul instead of v_mul/mad which is generally better for both register pressure and latency (at least on GFX10 where sgpr pressure doesn't affect occupancy and vector ALU instructions have significantly longer latency than scalar ALU). Some R600 tests now use MULLO_INT instead of MUL_UINT24. GlobalISel appears to handle more scenarios in the desirable way, although it can also be thrown off and fails to select the 24-bit multiplies in some cases. Alternative solution considered and rejected was to allow selecting MUL_[UI]24 to S_MUL_I32. I've rejected this because the definition of those SD operations works is don't-care on the most significant 8 bits, and this fact is used in some combines via SimplifyDemandedBits. Based on a patch by Nicolai Hähnle. Differential Revision: https://reviews.llvm.org/D97063	2021-02-23 15:39:19 +00:00
Juneyoung Lee	19c2e12947	[JumpThreading] Update computeValueKnownInPredecessors to recognize logical and/or patterns This allows JumpThreading's computeValueKnownInPredecessors to recognize select form of and/or patterns as well.	2021-02-24 00:06:10 +09:00
Jay Foad	64831fb089	[AMDGPU] Rename a prefix for sanity. NFC.	2021-02-23 14:53:27 +00:00
Nate Chandler	01b4890e47	Add @llvm.coro.async.size.replace intrinsic. The new intrinsic replaces the size in one specified AsyncFunctionPointer with the size in another. This ability is necessary for functions which merely forward to async functions such as those defined for partial applications. Reviewed By: aschwaighofer Differential Revision: https://reviews.llvm.org/D97229	2021-02-23 06:43:52 -08:00
Jessica Clarke	22215e4923	[Driver][NFC] Add explicit break to final case	2021-02-23 14:17:15 +00:00
Martin Storsjö	f97ea0d5b3	[libcxx] [test] Define _CRT_STDIO_ISO_WIDE_SPECIFIERS while building tests This matches how libc++ itself is built. This avoids errors due to mismatch if linking libc++ statically. Differential Revision: https://reviews.llvm.org/D97169	2021-02-23 15:57:30 +02:00
Nathan James	e96f9cca3b	[clang-tidy] Remove IncludeInserter from MoveConstructorInit check. This check registers an IncludeInserter, however the check itself doesn't actually emit any fixes or includes, so the inserter is redundant. From what I can tell the fixes were removed in D26453(rL290051) but the inserter was left in, probably an oversight. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D97243	2021-02-23 13:48:07 +00:00
Joe Ellis	1b1b30cf0f	[clang][SVE] Don't warn on vector to sizeless builtin implicit conversion This commit prevents warnings from -Wconversion when a clang vector type is implicitly converted to a sizeless builtin type -- for example, when implicitly converting a fixed-predicate to a scalable predicate. The code below: 1 #include <arm_sve.h> 2 3 #define N __ARM_FEATURE_SVE_BITS 4 #define FIXED_ATTR __attribute__((arm_sve_vector_bits (N))) 5 typedef svbool_t fixed_svbool_t FIXED_ATTR; 6 7 inline fixed_svbool_t foo(fixed_svbool_t p) { 8 return svnot_z(svptrue_b64(), p); 9 } would previously raise this warning: warning: implicit conversion turns vector to scalar: \ 'fixed_svbool_t' (vector of 8 'unsigned char' values) to 'svbool_t' \ (aka '__SVBool_t') [-Wconversion] Note that many cases of these implicit conversions were already permitted because many functions inside arm_sve.h are spawned via preprocessor macros, and the call to isInSystemMacro would cover us in this case. This commit fixes the remaining cases. Differential Revision: https://reviews.llvm.org/D97053	2021-02-23 13:40:58 +00:00
Balázs Kéri	2c54b29337	[clang-tidy] Extending bugprone-signal-handler with POSIX functions. An option is added to the check to select wich set of functions is defined as asynchronous-safe functions. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D90851	2021-02-23 14:48:00 +01:00
Michał Górny	6c06b0aa5a	[lldb] [test] Un-XFAIL TestBuiltinTrap on FreeBSD/aarch64	2021-02-23 14:35:34 +01:00
Michał Górny	2f75363a9e	[lldb] [test] Un-XFAIL a test that no longer fail on FreeBSD	2021-02-23 14:35:34 +01:00
Simon Pilgrim	2315410f57	[X86] Cleanup overflow test check prefixes. NFCI. Tidy up the check prefixes to improve reuse.	2021-02-23 13:34:06 +00:00
Jay Foad	fdaa2d0259	[AMDGPU] Use divergent addresses for vector loads Change some test cases to use divergent addresses for vector loads, which should be the common case in real world code. Using uniform addresses causes poor instruction selection for the surrounding code which has to be fixed up post-register-allocation, and this causes a lot of testsuite churn for a forthcoming patch to stop selecting 24-bit vector multiply instructions for uniform multiplies. This shows up some problems in the idot tests where we fail to select v_dot instructions because the patterns only match MUL_[UI]24 ISD nodes, but the DAG contains i16 mul nodes instead. Differential Revision: https://reviews.llvm.org/D97062	2021-02-23 13:33:15 +00:00
Sjoerd Meijer	e1c3bf6afe	[ARM] do not consider sp as deprecated for ldm/stm Early versions of the ARMv7 reference manuals considered the sp register as a deprecated register for ldm/stm familiy of instructions. However, later versions such as ARM DDI 0406C.d added a note to the Appendix: D9.3 Use of the SP as a general-purpose register Most ARM instructions, unlike Thumb instructions, provide exactly the same access to the SP as to R0-R12. This means that it is possible to use the SP as a general-purpose register. Earlier issues of this manual deprecated the use of SP in an ARM instruction, in any way that is deprecated, not permitted, or not possible in the corresponding Thumb instruction. However, user feedback indicates a number of cases where these instructions are useful. Therefore, ARM no longer deprecates these instruction uses. Also Armv8 manuals no longer consider SP as deprecated register for ldm/ stm A32 instructions. Furthermore, GNU as also does not print a deprecated warning when using SP with those instructions. Drop deprecation warning for pop/ldm/push/stm instructions. Patch by: Stefan Agner. Differential Revision: https://reviews.llvm.org/D82692	2021-02-23 13:26:18 +00:00
David Green	dd2dbf7ee2	[TTI] Change getOperandsScalarizationOverhead to take Type args As a followup to D95291, getOperandsScalarizationOverhead was still using a VF as a vector factor if the arguments were scalar, and would assert on certain matrix intrinsics with differently sized vector arguments. This patch removes the VF arg, instead passing the Types through directly. This should allow it to more accurately compute the cost without having to guess at which operands will be vectorized, something difficult with more complex intrinsics. This adjusts one SVE test as it is now calling the wrong intrinsic vs veccall. Without invalid InstructCosts the cost of the scalarized intrinsic is too low. This should get fixed when the cost of scalarization is accounted for with scalable types. Differential Revision: https://reviews.llvm.org/D96287	2021-02-23 13:04:59 +00:00
David Green	bd4b61efbd	[CostModel] Remove VF from IntrinsicCostAttributes getIntrinsicInstrCost takes a IntrinsicCostAttributes holding various parameters of the intrinsic being costed. It can either be called with a scalar intrinsic (RetTy==Scalar, VF==1), with a vector instruction (RetTy==Vector, VF==1) or from the vectorizer with a scalar type and vector width (RetTy==Scalar, VF>1). A RetTy==Vector, VF>1 is considered an error. Both of the vector modes are expected to be treated the same, but because this is confusing many backends end up getting it wrong. Instead of trying work with those two values separately this removes the VF parameter, widening the RetTy/ArgTys by VF used called from the vectorizer. This keeps things simpler, but does require some other modifications to keep things consistent. Most backends look like this will be an improvement (or were not using getIntrinsicInstrCost). AMDGPU needed the most changes to keep the code from `c230965ccf` working. ARM removed the fix in `dfac521da1`, webassembly happens to get a fixup for an SLP cost issue and both X86 and AArch64 seem to now be using better costs from the vectorizer. Differential Revision: https://reviews.llvm.org/D95291	2021-02-23 13:03:26 +00:00
Nathan James	5bf710b2a5	[clang-tidy] Update checks list.	2021-02-23 13:01:16 +00:00
Timm Bäder	64d06ed9c9	[clang][parse][NFC] Remove dead ProhibitAttributes() call GNU-style attribute in enum bodies are allowed (and used by several tests), and this call to ProhibitAttributes() was dead code. Differential Revision: https://reviews.llvm.org/D97271	2021-02-23 13:54:35 +01:00
Florian Schmaus	6c78711f10	[clang-tidy] Install run-clang-tidy.py in bin/ as run-clang-tidy The run-clang-tidy.py helper script is supposed to be used by the user, hence it should be placed in the user's PATH. Some distributions, like Gentoo [1], won't have it in PATH unless it is installed in bin/. Furthermore, installed scripts in PATH usually do not carry a filename extension, since there is no need to know that this is a Python script. For example Debian and Ubuntu already install this script as 'run-clang-tidy' [2] and hence build systems like Meson also look for this name first [3]. Hence we install run-clang-tidy.py as run-clang-tidy, as suggested by Sylvestre Ledru [4]. 1: https://bugs.gentoo.org/753380 2: `60aefb1417/debian/clang-tidy-X.Y.links.in (L2)` 3: `b6dc4d5e5c/mesonbuild/scripts/clangtidy.py (L44)` 4: https://reviews.llvm.org/D90972#2380640 Reviewed By: sylvestre.ledru, JonasToth Differential Revision: https://reviews.llvm.org/D90972	2021-02-23 12:38:27 +00:00
Matteo Favaro	633e090528	[DSE] Allow ptrs defined in the entry block in IsGuaranteedLoopInvariant. The IsGuaranteedLoopInvariant function is making sure to check if the incoming pointer is guaranteed to be loop invariant, therefore I think the case where the pointer is defined in the entry block of a function automatically guarantees the pointer to be loop invariant, as the entry block of a function cannot have predecessors or be part of a loop. I implemented this small patch and tested it using ninja check-llvm-unit and ninja check-llvm. I added a contained test file that shows the problem and used opt -O3 -debug on it to make sure the case is not currently handled (in fact the debug log is showing that the DSE pass is bailing out when testing if the killer store is able to clobber the dead store). Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D96979	2021-02-23 12:00:44 +00:00
Hsiangkai Wang	53c4c2b9f7	[RISCV] vle1.v/vse1.v should be unmasked instructions. vle1.v/vse1.v should be unmasked instructions. The vm encoding is 1 for unmasked instructions. Differential Revision: https://reviews.llvm.org/D97237	2021-02-23 19:59:22 +08:00

1 2 3 4 5 ...

380815 Commits All Branches Search

380815 Commits

All Branches