llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	f3c74b72f4	[DSE] Make DSEState non-copyable (NFC) As it contains a self-reference, the default copy/move ctors would not be safe. Move the DSEState::get() method into the ctor to make sure no move occurs here even without NRVO. This is a speculative fix for test failures on llvm-clang-x86_64-expensive-checks-win.	2021-09-26 17:54:38 +02:00
Jon Chesterfield	8cf93a35d4	[libomptarget][amdgpu] Destruct HSA queues Store queues in unique_ptr so they are destroyed when the global DeviceInfo is. Currently they leak which raises an assert in debug builds of hsa. Reviewed By: pdhaliwal Differential Revision: https://reviews.llvm.org/D109511	2021-09-26 15:34:21 +01:00
Sanjay Patel	6063e6b499	[InstCombine] move add after min/max intrinsic This is another regression noted with the proposal to canonicalize to the min/max intrinsics in D98152. Here are Alive2 attempts to show correctness without specifying exact constants: https://alive2.llvm.org/ce/z/bvfCwh (smax) https://alive2.llvm.org/ce/z/of7eqy (smin) https://alive2.llvm.org/ce/z/2Xtxoh (umax) https://alive2.llvm.org/ce/z/Rm4Ad8 (umin) (if you comment out the assume and/or no-wrap, you should see failures) The different output for the umin test is due to a fold added with `c4fc2cb5b2` : // umin(x, 1) == zext(x != 0) We probably want to adjust that, so it applies more generally (umax --> sext or patterns where we can fold to select-of-constants). Some folds that were ok when starting with cmp+select may increase instruction count for the equivalent intrinsic, so we have to decide if it's worth altering a min/max. Differential Revision: https://reviews.llvm.org/D110038	2021-09-26 09:49:10 -04:00
Simon Pilgrim	3538ee763d	[CostModel][X86] Improve AVX1/AVX2 v16i32->v16i16/v16i8 truncation costs (PR51972) Based off worst case btver2 (AVX1) and haswell (AVX2) llvm-mca reports	2021-09-26 13:43:46 +01:00
Michael Kruse	9820dd970c	[Polly] Support for InlineAsm. Inline assembly was not handled at all and treated like a llvm::Value. In particular, it tried to create a pointer it which is not allowed. Fix by handling like a llvm::Constant such that it is just reused when required, instead of trying to marshall it in memory. Fixes llvm.org/PR51960	2021-09-26 03:26:43 -05:00
Michael Kruse	d5c87162db	[Polly] Use VirtualUse to determine references. VirtualUse ensures consistency over different source of values with Polly. In particular, this enables its use of instructions moved between Statement. Before the patch, the code wrongly assumed that the BB's instructions are also the ScopStmt's instructions. Reference are determined for OpenMP outlining and GPGPU kernel extraction. GPGPU CodeGen had some problems. For one, it generated GPU kernel parameters for constants. Second, it emitted GPU-side invariant loads which have already been loaded by the host. This has been partially fixed, it still generates a store for the invariant load result, but using the value that the host has already written. WARNING: I did not test the generated PollyACC code on an actual GPU. The improved consistency will be made use of in the next patch.	2021-09-26 03:26:43 -05:00
Michael Kruse	1cea25eec9	[Polly] Remove isConstCall. The function was intended to catch OpenMP functions such as get_thread_id(). If matched, the call would be considered synthesizable. There were a few problems with this: * get_thread_id() is not 'const' in the sense of have the gcc manual defines it: "do not examine any values except their arguments". get_thread_id() reads OpenCL runtime libreary global state. What was inteded was probably 'speculable'. * isConstCall was implemented using mayReadOrWriteMemory(). 'const' is stricter than that, mayReadOrWriteMemory is e.g. true for malloc(), since it may only read/write addresses that are considered inaccessible fro the application. However, malloc is certainly not speculable. * Values that are isConstCall were not handled consistently throughout Polly. In particular, it was not considered for referenced values (OpenMP outlining and PollyACC). Fix by removing special handling for isConstCall entirely.	2021-09-26 03:26:43 -05:00
Alexandre Rames	e21b0ba8c9	[ADT] Add trailing comma on TYPED_TEST_SUITE This avoids a -pedantic warning: warning: ISO C++11 requires at least one argument for the "..." in a variadic macro See also https://github.com/google/googletest/issues/2271 Reviewed By: arames, bkramer Differential Revision: https://reviews.llvm.org/D110283	2021-09-25 18:04:16 -07:00
Mehdi Amini	c3aed0d395	MLIR can't support -Bsymbolic link option, fail at CMake time with a helpful message instead of broken runtime Differential Revision: https://reviews.llvm.org/D110483	2021-09-26 00:36:31 +00:00
Lang Hames	320832cc9b	[ORC] Wait for handleDisconnect to complete in SimpleRemoteEPC::disconnect. Disconnect should block until handleDisconnect completes, otherwise we might destroy the SimpleRemoteEPC instance while it's still in use. Thanks to Dave Blaikie for helping me track this down.	2021-09-26 10:19:26 +10:00
Lang Hames	0371049277	[ORC] Fix uninitialized variable. Spotted by Dave Blaikie. Thanks Dave!	2021-09-26 10:19:25 +10:00
Fangrui Song	db6a00daa0	[ELF] Remove unneeded binding parameter from addOptionalRegular. NFC __rela_iplt_start uses spurious STB_WEAK, but it doesn't matter because STV_HIDDEN overrides the binding.	2021-09-25 15:47:27 -07:00
Fangrui Song	d23fd8ae89	[ELF] Replace noneRel = R__NONE with static constexpr. NFC All architectures define R__NONE to 0.	2021-09-25 15:16:44 -07:00
Fangrui Song	40cd4db442	[ELF] Default gotBaseSymInGotPlt to false (NFC for most architectures) Most architectures use .got instead of .got.plt, so switching the default can minimize customization. This fixes an issue for SPARC V9 which uses .got . AVR, AMDGPU, and MSP430 don't seem to use _GLOBAL_OFFSET_TABLE_.	2021-09-25 15:06:09 -07:00
Nikita Popov	ba664d9066	[AA] Move earliest escape tracking from DSE to AA This is a followup to D109844 (and alternative to D109907), which integrates the new "earliest escape" tracking into AliasAnalysis. This is done by replacing the pre-existing context-free capture cache in AAQueryInfo with a replaceable (virtual) object with two implementations: The SimpleCaptureInfo implements the previous behavior (check whether object is captured at all), while EarliestEscapeInfo implements the new behavior from DSE. This combines the "earliest escape" analysis with the full power of BasicAA: It subsumes the call handling from D109907, considers a wider range of escape sources, and works with AA recursion. The compile-time cost is slightly higher than with D109907. Differential Revision: https://reviews.llvm.org/D110368	2021-09-25 22:40:41 +02:00
Nikita Popov	327bbbb10b	[DSE] Make capture check more precise It is sufficient that the object has not been captured before the load that produces the pointer we're loading. A capture after that can not affect the already loaded pointer. This is small part of D110368 applied separately.	2021-09-25 22:23:19 +02:00
Nikita Popov	1c3859f31d	[BasicAA] Don't consider Argument as escape source (NFCI) The case of an Argument and an identified function local is already handled earlier, because we don't care about captures in that case. As such, we don't need to additionally consider the combination of an Argument with a non-escaping identified function local. This ensures that isEscapeSource() only returns true for instructions, which is necessary for D110368.	2021-09-25 22:08:15 +02:00
Lang Hames	897eb579c6	[ORC-RT] ExecutorAddrDiff ergonomic improvements; contains and overlaps methods Renames StartAddress and EndAddress members to Start and End. Adds contains and overlap methods. Adds a constructor from an address and size. These changes are counterparts to LLVM commits `ef391df2b6`, `c0d889995e`, and `37f1b7a3f3`.	2021-09-25 12:01:42 -07:00
Fangrui Song	a892c0e49e	[ELF][test] Improve test coverage	2021-09-25 11:57:54 -07:00
Lang Hames	99951a5684	Revert "[ORC] Introduce EPCGenericRTDyldMemoryManager." This reverts commit `bef55a2b47` while I investigate failures on some bots. Also reverts "[lli] Add ChildTarget dependence on OrcTargetProcess library." (`7a219d801b`) which was a fallow-up to `bef55a2b47`.	2021-09-25 11:19:14 -07:00
Lang Hames	7a219d801b	[lli] Add ChildTarget dependence on OrcTargetProcess library. ChildTarget depends on OrcTargetProcess after `bef55a2b47`.	2021-09-25 10:51:29 -07:00
Lang Hames	bef55a2b47	[ORC] Introduce EPCGenericRTDyldMemoryManager. EPCGenericRTDyldMemoryMnaager is an EPC-based implementation of the RuntimeDyld::MemoryManager interface. It enables remote-JITing via EPC (backed by a SimpleExecutorMemoryManager instance on the executor side) for RuntimeDyld clients. The lli and lli-child-target tools are updated to use SimpleRemoteEPC and SimpleRemoteEPCServer (rather than OrcRemoteTargetClient/Server), and EPCGenericRTDyldMemoryManager for MCJIT tests. By enabling remote-JITing for MCJIT and RuntimeDyld-based ORC clients, EPCGenericRTDyldMemoryManager allows us to deprecate older remote-JITing support, including OrcTargetClient/Server, OrcRPCExecutorProcessControl, and the Orc RPC system itself. These will be removed in future patches.	2021-09-25 10:42:10 -07:00
Simon Pilgrim	18c8ed5416	[DAG] ReduceLoadOpStoreWidth - replace getABITypeAlign with allowsMemoryAccess (PR45116) One of the cases identified in PR45116 - we don't need to limit store narrowing to ABI alignment, we can use allowsMemoryAccess - which tests using getABITypeAlign, but also checks if a target permits (fast) misaligned memory access by checking allowsMisalignedMemoryAccesses as a fallback.	2021-09-25 18:35:57 +01:00
mydeveloperday	c2ec5dd209	[clang-format] Left/Right alignment fixer can cause false positive replacements when they don't actually change anything Earlier during the development of {D69764} I felt it was no longer necessary to ensure we were not trying to change code which didn't need to change and we felt this could be removed, however I'd like to bring this back for now as I am seeing some false positives in terms of the "replacements" What I see is the generation of a replacement which is a "No Op" on the original code, I think this comes about because of the merging of replacements: ``` static const a; -> const static a; -> static const a; ``` The replacements don't really merge, in such a way as to identify when we have gone back to the original Also remove the Penalty as I'm not using it (and it became marked as set and no used, I'd rather get rid of it if it means nothing) I think we need to do this step for now, as many people use the --output-replacements-xml to identify that the file "needs a clang-format" The same can be seen with the -n or --dry-run option as this uses the replacements to drive the error/warning output. Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D110392	2021-09-25 17:35:41 +01:00
Simon Pilgrim	8c83bd3bd4	[CostModel][X86] Adjust vXi32 multiply costs if it can be performed using PMADDWD Update the costs to match the codegen from combineMulToPMADDWD - not only can we use PMADDWD is its zero-extended, but also if its a constant or sign-extended from a vXi16 (which can be replaced with a zero-extension).	2021-09-25 16:28:48 +01:00
Simon Pilgrim	eb7c78c2c5	[X86][SSE] combineMulToPMADDWD - mask off upper bits of sign-extended vXi32 constants If we are multiplying by a sign-extended vXi32 constant, then we can mask off the upper 16 bits to allow folding to PMADDWD and make use of its implicit sign-extension from i16	2021-09-25 15:50:45 +01:00
Simon Pilgrim	2a4fa0c27c	[X86][SSE] combineMulToPMADDWD - enable sext(v8i16) -> zext(v8i16) fold on sub-128 bit vectors	2021-09-25 15:50:45 +01:00
Kazu Hirata	44c401bdc3	[Mips] Remove redundant declarations (NFC) Note that identical declarations immediately precede what's being removed in this patch. Identified with readability-redundant-declaration.	2021-09-25 07:41:11 -07:00
Simon Pilgrim	f5a26ccae2	[X86][SSE] combineMulToPMADDWD - enable sext(v8i16) -> zext(v8i16) fold on pre-SSE41 targets We already do this on SSE41 targets where we have sext/zext instructions, now that combineShiftToPMULH handles SSE2 targets, we can enable this here as well.	2021-09-25 14:35:31 +01:00
Simon Pilgrim	4c72b10f0a	[X86] X86FastISel::fastMaterializeConstant - break if-else chain to fix llvm-else-after-return warning. NFCI All previous if-else cases return	2021-09-25 14:31:14 +01:00
Simon Pilgrim	a25f25c3b7	[X86] combineShiftToPMULH - relax from ISA from SSE41 to SSE2 With improved shuffle combines (in particular canonicalizeShuffleWithBinOps), we can now usefully perform this on any SSE2+ target. We should be able to remove this entirely and just use DAGCombiner's combineShiftToMULH if we can someday get it to support illegal (pre-widened) types.	2021-09-25 14:08:03 +01:00
Michał Górny	3a6ba36751	[lldb] Convert misc. StringConvert uses Replace misc. StringConvert uses with llvm::to_integer() and llvm::to_float(), except for cases where further refactoring is planned. The purpose of this change is to eliminate the StringConvert API that is duplicate to LLVM, and less correct in behavior at the same time. Differential Revision: https://reviews.llvm.org/D110447	2021-09-25 14:19:19 +02:00
Valentin Clement	2c28e3f008	[fir] Add desc to fir.array_load op and update operand name This patch is part of the upstreaming effort from fir-dev branch. Add a description for the fir.array_load opeartion and rename lenParams to typeparams. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110393 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-09-25 14:10:35 +02:00
Simon Pilgrim	5a14edd8ed	[InstCombine] Ensure shifts are in range for (X << C1) / C2 -> X fold. We can get here before out of range shift amounts have been handled - limit to BW-2 for sdiv and BW-1 for udiv Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=38078	2021-09-25 12:57:43 +01:00
Markus Böck	0b61f43b60	[CMake] Consistently use the LibXml2::LibXml2 target instead of LIBXML2_LIBRARIES Linking against the LibXml2::LibXml2 target has the advantage of not only importing the library, but also adding the include path as well as any definitions the library requires. In case of a static build of libxml2, eg. a define is set on Windows to remove any DLL imports and export. LLVM already makes use of the target, but c-index-test and lldb were still linking against the library only. The workaround for Mac OS-X that I removed seems to have also been made redundant since https://reviews.llvm.org/D84563 I believe Differential Revision: https://reviews.llvm.org/D109975	2021-09-25 13:13:11 +02:00
Simon Pilgrim	ee267b1c7c	[IR] DIBuilder::createEnumerator - pass APSInt by const reference Avoid unnecessary copy by value.	2021-09-25 11:58:06 +01:00
Simon Pilgrim	6bd5b1b1ce	[DAG] combineShiftToMULH - move getValueType() inside assert. NFCI. Avoids an unnecessary (void).	2021-09-25 11:56:35 +01:00
Kunwar Shaanjeet Singh Grover	0f78ece169	[MLIR] Add functionality to remove redundant local variables This patch adds functionality to FlatAffineConstraints to remove local variables using equalities. This helps in keeping output representation of FlatAffineConstraints smaller. This patch is part of a series of patches aimed at generalizing affine dependence analysis. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D110056	2021-09-25 16:10:43 +05:30
David Green	883758ed48	[ARM] Fix Arm block placement creating branches after jump tables. Given: - A jump table - Which jumps to the next block - The next block ends in a WLS - Where the WLS conditionally jumps to block earlier in the program. The Arm block placement pass would attempt to move the block containing the WLS earlier, as the WLS instruction can only branch forward. In doing so it would add a branch from the jumptable block to the WLS block, thinking it previously fell-through. This in itself would be fine, if a little inefficient, but the constant island pass expects all instructions after a jump-table branch to have been removed by analyzeBranch. So it gets confused and can assign the same labels to multiple jump table blocks. I've changed the condition to the same as used in analyzeBranch.	2021-09-25 11:32:25 +01:00
Dmitry Vyukov	b02938439d	tsan: uninline RacyStacks::operator== It's only used during race reporting. There is no point in polluting the main header file with it. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D110470	2021-09-25 12:08:51 +02:00
Simon Pilgrim	993f3c61b3	[TTI] getUserCost - Ensure a vector insert/extract index is in unsigned 32-bit range Otherwise fallback to the generic 'unknown index' path Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=29050	2021-09-25 10:50:54 +01:00
Jim Lin	ed687c0211	[RISCV] Fix incorrect operand type of inst alias for InstR4 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D110381	2021-09-25 11:25:12 +08:00
Matheus Izvekov	37adc4f957	[clang] set templates as invalid when any of the parameters are invalid See PR51872 for the original repro. This fixes a crash when converting a templated constructor into a deduction guide, in case any of the template parameters were invalid. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D110460	2021-09-25 04:04:47 +02:00
Amara Emerson	f4cfda03d6	[AArch64][AMDGPU] Re-generate some tests with CHECK-NEXT to prepare for a patch.	2021-09-24 18:26:08 -07:00
Petr Hosek	d893692024	[CMake] Pass through CMAKE_READELF to subbuilds This matches handling of other CMake variables. Differential Revision: https://reviews.llvm.org/D110463	2021-09-24 18:20:30 -07:00
Jason Molenda	a2e1d68fa9	Add pragma to make it easier to find "image list" impl I couldn't find it; make this easier for next time.	2021-09-24 17:13:03 -07:00
David Blaikie	5cb210862b	DebugInfo: Use the signedness of the underlying enum when encoding enum non-type-template-parameters This improves the accuracy of the debug info and improves round tripping through -gsimple-template-names.	2021-09-24 17:02:55 -07:00
River Riddle	ef764eeeb9	[mlir:ElementsAttr] Avoid crash on empty contiguous ranges We currently, incorrectly, assume that a range always has at least one element when building a contiguous range. This commit adds a proper empty check to avoid crashing. Differential Revision: https://reviews.llvm.org/D110457	2021-09-24 23:48:51 +00:00
modimo	ce6ed64a69	[llvm-profdata] Extend support of --topn to sample profiles Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110449	2021-09-24 16:42:46 -07:00
Nico Weber	4a45f95e76	[llvm] Remove LLVM_CHECK_ENABLED_PROJECTS again This reverts commit `55f0b33708` and follow-up reverts commit `e9ea03c62c`. LLVM_EXTERNAL_PROJECTS is sufficient, see https://reviews.llvm.org/D110016	2021-09-24 18:49:55 -04:00

... 3 4 5 6 7 ...

400160 Commits All Branches Search

400160 Commits

All Branches