llvm-project

Commit Graph

Author	SHA1	Message	Date
Zequan Wu	cab48e2f0e	[CodeGen] don't emit addrsig symbol if it's used only by metadata Value only used by metadata can be removed from .addrsig table. This solves the undefined symbol error when enabling addrsig table on COFF LTO. Differential Revision: https://reviews.llvm.org/D101512	2021-04-29 15:39:30 -07:00
Rob Suderman	be01b091af	[mlir][tosa] Remove constant-0 dim expr values from TOSA lowerings Constant-0 dim expr values should be avoided for linalg as it can prevent fusion. This includes adding support for rank-0 reshapes. Differential Revision: https://reviews.llvm.org/D101418	2021-04-29 15:06:03 -07:00
jasonliu	7049fbf960	[XCOFF] Handle the case when personality routine is an alias Summary: Personality routine could be an alias to another personality routine. Fix the situation when we compile the file that contains the personality routine and the file also have functions that need to refer to the personality routine. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D101401	2021-04-29 22:03:30 +00:00
Alex Lorenz	6b938d2ead	Recommit "[clang][driver] Use the provided arch name for a Darwin target triple This ensures that the Darwin driver uses a consistent target triple representation when the triple is printed out to the user. This reverts the revert commit `ab0df6c034`. Differential Revision: https://reviews.llvm.org/D100807	2021-04-29 15:00:40 -07:00
zoecarver	3aaac01aab	[libcxx][ranges] Fix tests for stdlib types that conform to sized_sentinel_for. Differential Revision: https://reviews.llvm.org/D101371	2021-04-29 14:44:16 -07:00
Amara Emerson	fa2340574c	[GlobalISel][Legalizer] Bump up a smallvector size that was found to be too small. NFC.	2021-04-29 14:41:34 -07:00
Vladimir Vereschaka	74d9a76ad3	[CMake] Stop using c++ subdirectory for libc++ on Win to ARM Linux cross builds. NFC Updated the cross Win-x-ARM Linux toolchain CMake cache file in accordance with the following change: https://reviews.llvm.org/D100869 Stop using c++ subdirectory for libc++ library	2021-04-29 14:23:33 -07:00
Lang Hames	aaf026d9da	[ORC] JITDylib::addDependencies should be run under the session lock.	2021-04-29 14:09:40 -07:00
Martin Storsjö	5bf2ef9d86	Revert "[llvm-readobj] [ARMWinEH] Fix handling of relocations and symbol offsets" This reverts commit `3778924088`. The added test fails on at least one buildbot, by printing a reversed combination, printing "func3_xdata +0x18 (0x8)" while it's supposed to be "func3_xdata +0x8 (0x18)", see e.g. https://lab.llvm.org/buildbot/#/builders/107/builds/7269. Currently no idea how that could happen, but reverting until it can be figured out.	2021-04-30 00:06:16 +03:00
Amara Emerson	96ec6d91e4	[AArch64][GlobalISel] Simplify out of range rotate amount. Differential Revision: https://reviews.llvm.org/D101005	2021-04-29 14:05:58 -07:00
Mehdi Amini	086e0f05bf	Revert "[mlir][sparse] migrate sparse operations into new sparse tensor dialect" This reverts commit `a6d92a9711`. The build with -DBUILD_SHARED_LIBS=ON is broken.	2021-04-29 20:59:41 +00:00
Martin Storsjö	3778924088	[llvm-readobj] [ARMWinEH] Fix handling of relocations and symbol offsets When looking up data referenced from pdata/xdata structures, the referenced data can be found in two different ways: - For an unrelocated object file, it's located via a relocation - For a relocated, linked image, the data is referenced with an (image relative) absolute address For the latter case, the absolute address can optionally be described with a symbol. For the case of an object file, there's two offsets involved; one immediate offset encoded in the data location that is modified by the relocation, and a section offset in the symbol. Previously, for the ExceptionRecord field, we printed the offset from the symbol (only) but used the immediate offset ignoring the symbol's address (using only the symbol's section) for printing the exception data. Add a helper method for doing the lookup and address calculation, for simplifying the calling code and making all the cases consistent. This addresses an existing FIXME comment, fixing printing of the exception data for cases where relocations point at individual symbols in the xdata section (which is what MSVC generates) instead of all relocations pointing at the start of the xdata section (which is what LLVM generates). This also fixes printing of the function name for packed entries in linked images. Differential Revision: https://reviews.llvm.org/D100305	2021-04-29 23:35:10 +03:00
Martin Storsjö	2b01a417d7	[LLD] [COFF] Fix the mingw --export-all-symbols behaviour with comdat symbols When looking for the "all" symbols that are supposed to be exported, we can't look at the live flag - the symbols we mark as to be exported will become GC roots even if they aren't yet marked as live. With this in place, building an LLVM library with BUILD_SHARED_LIBS produces the same set of symbols exported regardless of whether the --gc-sections flag is specified, both with and without being built with -ffunction-sections. Differential Revision: https://reviews.llvm.org/D101522	2021-04-29 23:35:10 +03:00
Arnamoy Bhattacharyya	8f5a2a5836	[flang][OpenMP][FIX] Fix the worksharing nesting check with inclusion of more constructs to cover combined constructs.	2021-04-29 16:17:32 -04:00
Philip Reames	a047837b90	Revert "Generalize getInvertibleOperand recurrence handling slightly" This reverts commit `0c01b37eeb` while a problem reported is investigated.	2021-04-29 13:06:26 -07:00
Benjamin Kramer	b389c80963	[mlir] Fix lowering of multi-dimensional vector log1p to LLVM This was using the untransformed operand, leading to invalid IR. Differential Revision: https://reviews.llvm.org/D101531	2021-04-29 21:53:52 +02:00
Jay Foad	16d707e656	[AMDGPU] Fix v_swap_b32 formation on physical registers As explained in the comments, matchSwap matches: // mov t, x // mov x, y // mov y, t and turns it into: // mov t, x (t is potentially dead and move eliminated) // v_swap_b32 x, y On physical registers we don't have full use-def chains so the check for T being live-out was not working properly with subregs/superregs. Differential Revision: https://reviews.llvm.org/D101546	2021-04-29 20:53:40 +01:00
Alexey Bataev	12c51f2358	[COST] Improve shuffle kind detection if shuffle mask is provided. Added an extra analysis for better choosing of shuffle kind in getShuffleCost functions for better cost estimation if mask was provided. Differential Revision: https://reviews.llvm.org/D100865	2021-04-29 12:48:00 -07:00
Alexey Bataev	6e859f3cd4	Revert "[COST] Improve shuffle kind detection if shuffle mask is provided." This reverts commit `9239932221` to fix a compiler crash on mask checks.	2021-04-29 12:40:33 -07:00
Jez Ng	07884152ec	[lld-macho] Remove stray file	2021-04-29 15:33:23 -04:00
Sriraman Tallam	a64411916c	Basic block sections for functions with implicit-section-name attribute Functions can have section names set via #pragma or section attributes, basic block sections should be correctly named for such functions. With #pragma, the expectation is that all functions in that file are placed in the same section in the final binary. Basic block sections should be correctly named with the unique flag set so that the final binary has all the basic blocks of the function in that named section. This patch fixes the bug by calling getExplictSectionGlobal when implicit-section-name attribute is set to make sure the function's basic blocks get the correct section name. Differential Revision: https://reviews.llvm.org/D101311	2021-04-29 12:29:34 -07:00
Jez Ng	d9c8ffa958	[lld-macho][nfc] Clean up header.s test I don't think it's super worthwhile to test the dylib headers outputs of all the different archs when x86_64 is the only one that has interesting behavior. Motivated by my upcoming addition of arm32...	2021-04-29 15:11:23 -04:00
Jez Ng	7e115da5df	[lld-macho] Make everything PIE by default Modern versions of macOS (>= 10.7) and in general all modern Mach-O target archs want PIEs by default. ld64 defaults to PIE for iOS >= 4.3, as well as for all versions of watchOS and simulators. Basically all the platforms LLD is likely to target want PIE. So instead of cluttering LLD's code with legacy version checks, I think it's simpler to just default to PIE for everything. Note that `-no_pie` still works, so users can still opt out of it. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D101513	2021-04-29 15:11:23 -04:00
Aart Bik	a6d92a9711	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101488	2021-04-29 12:09:10 -07:00
Sanjay Patel	0f8b6686ac	[InstCombine] narrow popcount with zext operand https://llvm.org/PR50141	2021-04-29 15:07:16 -04:00
Sanjay Patel	b142e9d1c5	[InstCombine] add tests for popcount with zext operand; NFC PR50141	2021-04-29 15:07:16 -04:00
Tim Northover	c1b7460b5b	Revert "RegAlloc: do not consider liveins to EH-pad successors as liveout." Some liveins can come from this block (e.g. any SSA value except the call), it's only the ones that produce `landingpad` values that can't and I didn't think it through properly.	2021-04-29 20:00:07 +01:00
Petar Avramovic	c34900e133	AMDGPU/GlobalISel: Fix selection of image intrinsics with unused return When atomic image intrinsic return value is unused, register class for destination of a sub-register copy of return value ends up not being set. This copy then hits 'Register class not set' assert later. If return value has uses, register class is determined by use instruction. Fix is to not create sub-register copy when image intrinsic destination has no uses because it would be deleted by dead-mi-elimination later anyway. Differential Revision: https://reviews.llvm.org/D101448	2021-04-29 20:56:03 +02:00
Dan Liew	2d42b2ee7b	[ASan] Rename `-fsanitize-address-destructor-kind=` to drop the `-kind` suffix. Renaming the option is based on discussions in https://reviews.llvm.org/D101122. It is normally not a good idea to rename driver flags but this flag is new enough and obscure enough that it is very unlikely to have adopters. While we're here also drop the `<kind>` metavar. It's not necessary and is actually inconsistent with the documentation in `clang/docs/ClangCommandLineReference.rst`. Differential Revision: https://reviews.llvm.org/D101491	2021-04-29 11:55:42 -07:00
Tim Northover	438a63e13b	RegAlloc: do not consider liveins to EH-pad successors as liveout. These registers get defined by the runtime, not the block being allocated, and treating them as preassigned in RegAllocFast adds extra pressure, sometimes enough to make the function unallocatable.	2021-04-29 19:34:49 +01:00
Victor Huang	ae3377c553	[AIX][TLS] Add ASM portion changes to support TLSGD relocations to XCOFF objects - Add new variantKinds for the symbol's variable offset and region handle - Print the proper relocation specifier @gd in the asm streamer when emitting the TC Entry for the variable offset for the symbol - Fix the switch section failure between the TC Entry of variable offset and region handle - Put .__tls_get_addr symbol in the ProgramCodeSects with XTY_ER property Reviewed by: sfertile Differential Revision: https://reviews.llvm.org/D100956	2021-04-29 13:18:59 -05:00
Roman Lebedev	cc63203908	[SimplifyCFG] Common code sinking: fix application of profitability check The profitability check is: we don't want to create more than a single PHI per instruction sunk. We need to create the PHI unless we'll sink all of its would-be incoming values. But there is a caveat there. This profitability check doesn't converge on the first iteration! If we first decide that we want to sink 10 instructions, but then determine that the 5th one is unprofitable to sink, that may result in us not sinking some instructions, which in turn makes some other instruction that we had determined to be profitable to sink become unprofitable. So we need to iterate until we converge, as in determine that all leftover instructions are profitable to sink. But the direct approach of just re-iterating seems dumb, because in the worst case we'd find that the last instruction is unprofitable, which would result in revisiting instructions many, many times. Instead, I think we can get away with just two passes - forward and backward. However, then it isn't obvious what is the most performant way to update InstructionsToSink.	2021-04-29 21:11:40 +03:00
Sam Clegg	a6f406480a	[lld][WebAssembly] Add `--export-if-defined` Unlike the existing `--export` option this will not cause errors or warnings if the specified symbol is not defined. See: https://github.com/emscripten-core/emscripten/issues/13736 Differential Revision: https://reviews.llvm.org/D99887	2021-04-29 10:58:45 -07:00
Mark de Wever	9393060f90	[libc++] Fixes std::to_chars for bases != 10. While working on D70631, Microsoft's unit tests discovered an issue. Our `std::to_chars` implementation for bases != 10 uses the range `[first,last)` as temporary buffer. This violates the contract for to_chars: [charconv.to.chars]/1 http://eel.is/c++draft/charconv#to.chars-1 `to_chars_result to_chars(char* first, char* last, see below value, int base = 10);` "If the member ec of the return value is such that the value is equal to the value of a value-initialized errc, the conversion was successful and the member ptr is the one-past-the-end pointer of the characters written." Our implementation modifies the range `[member ptr, last)`, which causes Microsoft's test to fail. Their test verifies the buffer `[member ptr, last)` is unchanged. (The test is only done when the conversion is successful.) While looking at the code I noticed the performance for bases != 10 also is suboptimal. This is tracked in D97705. This patch fixes the issue and adds a benchmark. This benchmark will be used as baseline for D97705. Reviewed By: #libc, Quuxplusone, zoecarver Differential Revision: https://reviews.llvm.org/D100722	2021-04-29 19:56:28 +02:00
Petr Hosek	ba631240ae	[CMake] Set correct CXX_FLAGS for relative-vtables variants We override CXX_FLAGS to enable relative vtables, but doing so overwrites generic Fuchsia CXX_FLAGS leading to a build failure on Windows. Differential Revision: https://reviews.llvm.org/D101551	2021-04-29 10:34:37 -07:00
Raphael Isemann	a76df78470	[lldb] Make the NSSet formatter faster and less prone to infinite recursion Right now to get the `NSSet *` pointer value we first dereference it and then take the address of the result. Besides being inefficient, this can potentially cause an infinite recursion if the `pointer` value we get is a pointer of a type that the TypeSystem can't dereference. If the pointer is for example some form of `void *` that the dynamic type resolution can't resolve to an actual type, then the `Dereference` call goes back to asking the formatters how to reference it. If the NSSet formatter then checks if it's an NSSet variation under the hood then we just end up in infinite recursion. In practice this seems to happen with some form of Builtin.RawPointer we get from a NSDictionary in Swift. FWIW, no other formatter is doing the same deref->addressOf as here and there doesn't seem to be any specific reason to do so in the git history (it's just part of the initial formatter commit) Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D101537	2021-04-29 19:13:43 +02:00
LLVM GN Syncbot	5fbea82692	[gn build] Port `df323ba445`	2021-04-29 16:59:58 +00:00
Benjamin Kramer	df323ba445	Revert "[X86] Support AMX fast register allocation" This reverts commit `3b8ec86fd5`. Revert "[X86] Refine AMX fast register allocation" This reverts commit `c3f95e9197`. This pass breaks using LLVM in a multi-threaded environment by introducing global state.	2021-04-29 18:56:33 +02:00
Vitaly Buka	ea7618684c	Revert "[scudo] Use require_constant_initialization" This reverts commit `7ad4dee3e7`.	2021-04-29 09:55:54 -07:00
Sanjay Patel	1089158c5a	[ConstantFolding] propagate poison through vector reduction intrinsics	2021-04-29 12:54:20 -04:00
Martin Storsjö	203096adfc	[libcxx] [test] Include more libraries that normally are linked automatically As the libcxx tests link with -nostdlib, libraries that normally are added by default by the compiler driver have to be added manually. The "oldnames" library is automatically added when driving linking with clang-cl. When linking with the plain clang driver, as the libcxx tests do, the clang driver does the same (but only since Clang 12.0). But when linking with -nostdlib, like the libcxx tests do, the driver defaults aren't added at all, and we need to specify the defaults manually. This allows removing a TODO from the Windows CI setup; it turns out that upgrading to Clang 12.0 didn't help here as expected, sorry about that mixup. Differential Revision: https://reviews.llvm.org/D101434	2021-04-29 19:54:07 +03:00
Vitaly Buka	7ad4dee3e7	[scudo] Use require_constant_initialization The attribute guarantees safe static initialization of globals. Differential Revision: https://reviews.llvm.org/D101514	2021-04-29 09:47:59 -07:00
Craig Topper	dcdda2bdf2	[RISCV] Teach DAG combine to fold (and (select_cc lhs, rhs, cc, -1, c), x) -> (select_cc lhs, rhs, cc, x, (and, x, c)) Similar for or/xor with 0 in place of -1. This is the canonical form produced by InstCombine for something like `c ? x & y : x;` Since we have to use control flow to expand select we'll usually end up with a mv in a basic block. By folding this we may be able to pull the and/or/xor into the block instead and avoid a mv instruction. The code here is based on code from ARM that uses this to create predicated instructions. I'm doing it on SELECT_CC so it happens late, but we could do it on select earlier which is what ARM does. I'm not sure if we lose any combine opportunities if we do it earlier. I left out add and sub because this can separate sext.w from the add/sub. It also made a conditional i64 addition/subtraction on RV32 worse. I guess both of those would be fixed by doing this earlier on select. The select-binop-identity.ll test has not been committed yet, but I made the diff show the changes to it. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D101485	2021-04-29 09:43:51 -07:00
Craig Topper	60216adef1	[RISCV] Add test cases for D101485. NFC	2021-04-29 09:43:51 -07:00
Alexey Bataev	9239932221	[COST] Improve shuffle kind detection if shuffle mask is provided. Added an extra analysis for better choosing of shuffle kind in getShuffleCost functions for better cost estimation if mask was provided. Differential Revision: https://reviews.llvm.org/D100865	2021-04-29 09:42:56 -07:00
Fangrui Song	47a686d5cb	[unittest] Fix Frontend/OpenMPIRBuilderTest.cpp -Wsign-compare after D89671	2021-04-29 09:37:58 -07:00
Fangrui Song	b7c6697813	[DebugInfo] Add tests that we emit .eh_frame instead of .debug_frame Add tests which can catch the issue in `0ce723cb22` (If any function needs CFISection::EH, the module should use CFISection::EH). Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D101339	2021-04-29 09:35:48 -07:00
Sanjay Patel	678018138d	[ConstProp] add tests for vector reductions of poison; NFC	2021-04-29 12:20:59 -04:00
Sanjay Patel	71597d40e8	[ConstantFolding] refactor helper for vector reductions; NFC We should handle other cases (undef/poison), so reduce the duplication of repeated switches.	2021-04-29 12:09:22 -04:00
Sanjay Patel	f4b1272d3d	[ADT] fix typo in code block comment; NFC	2021-04-29 12:09:22 -04:00

387037 Commits

All Branches