llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	cdca1785d3	[CostModel][X86] Adjust uitofp(vXi64) SSE/AVX legalized costs based on llvm-mca reports. Update v4i64 -> v4f32/v4f64 uitofp costs based on the worst case costs from the script in D103695. Fixes a few regressions before we start adding AVX costs for legalized types.	2021-07-02 13:09:00 +01:00
Alexey Bataev	28ac873bcb	[SLP]Fix gathering of the scalars by not ignoring UndefValues. The compiler should not ignore UndefValue when gathering the scalars, otherwise the resulting code may be less defined than the original one. Also, grouped scalars to insert them at first to reduce the analysis in further passes. Differential Revision: https://reviews.llvm.org/D105275	2021-07-02 04:46:48 -07:00
Alexandru Octavian Butiu	e90c6f5596	[MachineCopyPropagation] Fix differences in code gen when compiling with -g Fixes bugs [[ https://bugs.llvm.org/show_bug.cgi?id=50580 \| 50580 ]] and [[ https://bugs.llvm.org/show_bug.cgi?id=49446 \| 49446 ]] When compiling with -g "DBG_VALUE <reg>" instructions are added in the MIR, if such a instruction is inserted between instructions that use <reg> then MachineCopyPropagation invalidates <reg> , this causes some copies to not be propagated and causes differences in code generation (ex bugs 50580 and 49446 ). DBG_VALUE instructions should be ignored since they don't actually modify the register. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D104394	2021-07-02 19:27:06 +08:00
Alex Richardson	c142c06c19	Place the BlockAddress type in the address space of the containing function While this should not matter for most architectures (where the program address space is 0), it is important for CHERI (and therefore Arm Morello). We use address space 200 for all of our code pointers and without this change we assert in the SelectionDAG handling of BlockAddress nodes. It is also useful for AVR: previously programs targeting AVR that attempt to read their own machine code via a pointer to a label would instead read from RAM using a pointer relative to the the start of program flash. Reviewed By: dylanmckay, theraven Differential Revision: https://reviews.llvm.org/D48803	2021-07-02 12:17:55 +01:00
Adrian Kuegel	791ddb79f1	Add LogOp to Complex dialect. Differential Revision: https://reviews.llvm.org/D105337	2021-07-02 13:15:47 +02:00
Sven van Haastregt	b77b2201dc	[NFC] Fix typo in comment Reported-by: Marco Cali <marco.cali@arm.com>	2021-07-02 11:39:17 +01:00
Florian Hahn	1a248233a5	[AArch64] Use custom lowering for fp16 vector copysign. The custom copysign lowering already supports fp16. Use it. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D105277	2021-07-02 11:15:30 +01:00
Roman Lebedev	48db080383	[NFC][SimplifyCFG] Autogenerate checklines in trapping-load-unreachable.ll test	2021-07-02 12:59:14 +03:00
Michał Górny	4d2503cd54	[lldb] [test] Add missing category to test_detach_current	2021-07-02 11:44:41 +02:00
Florian Hahn	7655061cc6	[Matrix] Hoist address computation before multiply to enable fusion. If the store address does not dominate the matrix multiply, try to hoist address computation instructions without side-effects and/or memory reads before the multiply, to allow fusion. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D105193	2021-07-02 09:52:11 +01:00
Roman Lebedev	c2c0d3ea89	Revert "[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR" This reverts commit `4facbf213c`. ``` ****************** FAIL: LLVM :: CodeGen/WebAssembly/funcref-call.ll (44466 of 44468) **************** TEST 'LLVM :: CodeGen/WebAssembly/funcref-call.ll' FAILED ****************** Script: -- : 'RUN: at line 1'; /builddirs/llvm-project/build-Clang12/bin/llc < /repositories/llvm-project/llvm/test/CodeGen/WebAssembly/funcref-call.ll --mtriple=wasm32-unknown-unknown -asm-verbose=false -mattr=+reference-types \| /builddirs/llvm-project/build-Clang12/bin/FileCheck /repositories/llvm-project/llvm/test/CodeGen/WebAssembly/funcref-call.ll -- Exit Code: 2 Command Output (stderr): -- llc: /repositories/llvm-project/llvm/include/llvm/Support/LowLevelTypeImpl.h:44: static llvm::LLT llvm::LLT::scalar(unsigned int): Assertion `SizeInBits > 0 && "invalid scalar size"' failed. ```	2021-07-02 11:49:51 +03:00
Michał Górny	b7c140335b	[lldb] [gdb-remote server] Support selecting process via Hg Support using the extended thread-id syntax with Hg packet to select a subprocess. This makes it possible to start providing support for running some of the debugger packets against another subprocesses. Differential Revision: https://reviews.llvm.org/D100261	2021-07-02 10:23:11 +02:00
Balázs Kéri	a27a17f883	[clang][AST] Add support for BindingDecl to ASTImporter. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D102492	2021-07-02 10:14:50 +02:00
Sam McCall	26e1553a10	[clangd] CMake: express -Iclangd/ at top level and inherit For files directly under clangd/, -Iclang-tools-extra/clangd (and the equivalent for generated files) are not required, as CMake/the compiler puts these directories on the include path by default. However this means each subdirectory needs to include_directories(.. ${CMAKE_CURRENT_BINARY_DIR}/..) etc, and this proved annoying and error-prone to maintain and debug. Since include_directories is inherited by subdirectories, we just configure this explicitly at the top level instead.	2021-07-02 09:52:36 +02:00
Paulo Matos	4facbf213c	[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR Reland of `31859f896`. This change implements new DAG notes GLOBAL_GET/GLOBAL_SET, and lowering methods for load and stores of reference types from IR globals. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set. Differential Revision: https://reviews.llvm.org/D104797	2021-07-02 09:46:28 +02:00
Tobias Gysi	6944f7da25	[mlir][linalg][python] Introduce python integration test folder. Introduce an integration test folder in the test/python subfolder and move the opsrun.py test into the newly created folder. The test verifies named operations end-to-end using both the yaml and the python path. Differential Revision: https://reviews.llvm.org/D105276	2021-07-02 07:20:34 +00:00
Tobias Gysi	3b95400f78	[mlir][linalg][python] Add max operation in OpDSL Add the max operation to the OpDSL and introduce a max pooling operation to test the implementation. As MLIR has no builtin max operation, the max function is lowered to a compare and select pair. Differential Revision: https://reviews.llvm.org/D105203	2021-07-02 07:12:37 +00:00
Sam McCall	0c53f602d5	[clangd] Add some more missing include dirs for completeness	2021-07-02 09:04:53 +02:00
Martin Storsjö	ce211c505b	[LLD] [COFF] Fix up missing stdcall decorations in MinGW mode If linking directly against a DLL without an import library, the DLL export symbols might not contain stdcall decorations. If we have an undefined symbol with decoration, and we happen to have a matching undecorated symbol (which either is lazy and can be loaded, or already defined), then alias it against that instead. This matches what's done in reverse, when we have a def file declaring to export a symbol without decoration, but we only have a defined decorated symbol. In that case we do a fuzzy match (SymbolTable::findMangle). This case is more straightforward; if we have a decorated undefined symbol, just strip the decoration and look for the corresponding undecorated symbol name. Add warnings and options for either silencing the warning or disabling the whole feature, corresponding to how ld.bfd does it. (This feature works for any symbol decoration mismatch, not only when linking against a DLL directly; ld.bfd also tolerates it anywhere, and also fixes up mismatches in the other direction, like SymbolTable::findMangle, for any symbol, not only exports. But in practice, at least for lld, it would primarily end up used for linking against DLLs.) Differential Revision: https://reviews.llvm.org/D104532	2021-07-02 09:49:14 +03:00
Martin Storsjö	c09e5e50b1	[LLD] [MinGW] Allow linking to DLLs directly As the COFF linker is capable of linking directly against a DLL now (after D104530, as long as it is running in mingw mode), don't error out here but successfully load libraries specified with "-l" from DLLs if that's what ld.bfd would have matched. Differential Revision: https://reviews.llvm.org/D104531	2021-07-02 09:49:13 +03:00
Martin Storsjö	a9ff1ce1b9	[LLD] [COFF] Support linking directly against DLLs in MinGW mode GNU ld.bfd supports linking directly against DLLs without using an import library, and some projects have picked up on this habit. (There's no one single unsurmountable issue with using import libraries, but this is a regularly surfacing missing feature.) As long as one is linking by name (instead of by ordinal), the DLL export table contains most of the information needed. (One can inspect what section a symbol points at, to see if it's a function or data symbol. The practical implementation of this loops over all sections for each symbol, but as long as they're not very many, that should hopefully be tolerable performance wise.) One exception where the information in the DLL isn't entirely enough is on i386 with stdcall functions; depending on how they're done, the exported function name can be a plain undecorated name, while the import library would contain the full decorated symbol name. This issue is addressed separately in a different patch. This is implemented mimicing the structure of a regular import library, with one InputFile corresponding to the static archive that just adds lazy symbols, which then are fetched when they are needed. When such a symbol is fetched, we synthesize a coff_import_header structure in memory and create a regular ImportFile out of it. The implementation could be even smaller by just creating ImportFiles for every symbol available immediately, but that would have the drawback of actually ending up importing all symbols unless running with GC enabled (and mingw mode defaults to having it disabled for historical reasons). Differential Revision: https://reviews.llvm.org/D104530	2021-07-02 09:49:13 +03:00
Sam McCall	86c5afa6e6	[clangd] Fix XPC build due to missing include path (Tentative, untested as I don't have a mac)	2021-07-02 08:47:46 +02:00
Douglas Yung	f737d9794a	Relax newly added opcode check to check only for a number instead of a specific opcode.	2021-07-01 23:09:25 -07:00
Vitaly Buka	07a1f3513e	[scudo] Fix test on aarch64 without MTE	2021-07-01 21:40:04 -07:00
Evgeniy Brevnov	9568811cb8	[NFC][DSE]Change 'do-while' to 'for' loop to simplify code structure With 'for' loop there is is a single place where 'Current' is adjusted. It helps to avoid copy paste and makes a bit easy to understand overall loop controll flow. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D101044	2021-07-02 10:00:47 +07:00
Fangrui Song	5efffac71a	[llvm-symbolizer] Move setGroupedShortOptions and don't ignore case setGroupedShortOptions in the ctor seems more popular.	2021-07-01 19:43:49 -07:00
Lang Hames	425b908301	[ORC] Rename SPSTargetAddress to SPSExecutorAddress. Also removes SPSTagTargetAddress, which was accidentally introduced at some point (and never used).	2021-07-02 12:40:14 +10:00
Craig Topper	066524ea54	[ScalarizeMaskedMemIntrin][SelectionDAGBuilder] Use the element type to calculate alignment for gather/scatter when alignment operand is 0. Previously we used the vector type, but we're loading/storing invididual elements so I think only element alignment should matter. Noticed while looking at the code for something else so I don't have a test case. Differential Revision: https://reviews.llvm.org/D105220	2021-07-01 19:08:47 -07:00
Richard Smith	9ab5f76117	Support for merging UsingPackDecls across modules. Fixes a false-positive error if the same std::variant<...> type is instantiated across multiple modules.	2021-07-01 18:43:49 -07:00
Jez Ng	f6b6e72143	[lld-macho] Factor out common InputSection members We have been creating many ConcatInputSections with identical values due to .subsections_via_symbols. This diff factors out the identical values into a Shared struct, to reduce memory consumption and make copying cheaper. I also changed `callSiteCount` from a uint32_t to a 31-bit field to save an extra word. All in all, this takes InputSection from 120 to 72 bytes (and ConcatInputSection from 160 to 112 bytes), i.e. 30% size reduction in ConcatInputSection. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.14 4.24 4.18 4.183 0.027548999 + 20 4.04 4.11 4.075 4.0775 0.018027756 Difference at 95.0% confidence -0.1055 +/- 0.0149005 -2.52211% +/- 0.356215% (Student's t, pooled s = 0.0232803) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105305	2021-07-01 21:22:39 -04:00
Jez Ng	08715e6c47	[lld-macho][nfc] Remove unnecessary vertical spacing This makes NonLazyPointerSectionBase's style more in line with the rest of the classes in its file.	2021-07-01 21:22:38 -04:00
Jez Ng	ac2dd06b91	[lld-macho] Deduplicate CFStrings `__cfstring` is a special literal section, so instead of breaking it up at symbol boundaries, we break it up at fixed-width boundaries (since each literal is the same size). Symbols can only occur at one of those boundaries, so this is strictly more powerful than `.subsections_via_symbols`. With that in place, we then run the section through ICF. This change is about perf-neutral when linking chromium_framework. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D105045	2021-07-01 21:22:38 -04:00
Jez Ng	3a11528d97	[lld-macho] Move ICF earlier to avoid emitting redundant binds This is a pretty big refactoring diff, so here are the motivations: Previously, ICF ran after scanRelocations(), where we emitting bind/rebase opcodes etc. So we had a bunch of redundant leftovers after ICF. Having ICF run before Writer seems like a better design, and is what LLD-ELF does, so this diff refactors it accordingly. However, ICF had two dependencies on things occurring in Writer: 1) it needs literals to be deduplicated beforehand and 2) it needs to know which functions have unwind info, which was being handled by `UnwindInfoSection::prepareRelocations()`. In order to do literal deduplication earlier, we need to add literal input sections to their corresponding output sections. So instead of putting all input sections into the big `inputSections` vector, and then filtering them by type later on, I've changed things so that literal sections get added directly to their output sections during the 'gather' phase. Likewise for compact unwind sections -- they get added directly to the UnwindInfoSection now. This latter change is not strictly necessary, but makes it easier for ICF to determine which functions have unwind info. Adding literal sections directly to their output sections means that we can no longer determine `inputOrder` from iterating over `inputSections`. Instead, we store that order explicitly on InputSection. Bloating the size of InputSection for this purpose would be unfortunate -- but LLD-ELF has already solved this problem: it reuses `outSecOff` to store this order value. One downside of this refactor is that we now make an additional pass over the unwind info relocations to figure out which functions have unwind info, since want to know that before `processRelocations()`. I've made sure to run that extra loop only if ICF is enabled, so there should be no overhead in non-optimizing runs of the linker. The upside of all this is that the `inputSections` vector now contains only ConcatInputSections that are destined for ConcatOutputSections, so we can clean up a bunch of code that just existed to filter out other elements from that vector. I will test for the lack of redundant binds/rebases in the upcoming cfstring deduplication diff. While binds/rebases can also happen in the regular `.text` section, they're more common in `.data` sections, so it seems more natural to test it that way. This change is perf-neutral when linking chromium_framework. Reviewed By: oontvoo Differential Revision: https://reviews.llvm.org/D105044	2021-07-01 21:22:38 -04:00
Matthias Springer	e895a670f8	[mlir] Move BufferizeDimOp to Tensor/Transforms/Bufferize.cpp Differential Revision: https://reviews.llvm.org/D105256	2021-07-02 10:05:59 +09:00
Rob Suderman	06ac83fcac	[mlir][tosa] Update Bazel files for TOSA pass changes There were some missing bazel dependencies for the Tosa dialect. Added these deps. Reviewed By: GMNGeoffrey Differential Revision: https://reviews.llvm.org/D105326	2021-07-01 17:45:00 -07:00
Rob Suderman	6aaaeacd3d	[mlir][tosa] Include TosaDialect as include in tosa PassDetail.h Tosa's PassDetail.h may be used in non-TOSA transforms. Include TosaDialect to avoid transient dependency. Differential Revision: https://reviews.llvm.org/D105324	2021-07-01 17:09:05 -07:00
Matt Arsenault	32a73198fc	Mips/GlobalISel: Use accurate memory LLTs	2021-07-01 20:08:14 -04:00
Akira Hatanaka	76dd98ec75	Precommit test cases in https://reviews.llvm.org/D104953	2021-07-01 17:03:07 -07:00
Rob Suderman	65eb4028ad	[mlir][tosa] Added missing includes on PassDetails.h Includes were missing in the PassDetails.h that cause downstream failures on TOSA passes. Differential Revision: https://reviews.llvm.org/D105323	2021-07-01 16:42:47 -07:00
Jessica Paquette	e59f02216f	[GlobalISel] Translate <1 x N> getelementptrs to scalar G_PTR_ADDs In `IRTranslator::translateGetElementPtr`, when we run into a vector gep with some scalar operands, we try to normalize those operands using `buildSplatVector`. This is fine except for when the getelementptr has a <1 x N> type. In that case it is treated as a scalar. If we run into one of these then every call to ``` // With VectorWidth = 1 LLT::fixed_vector(VectorWidth, PtrTy) ``` will assert. Here's an example (equivalent to the added testcase): https://godbolt.org/z/hGsTnMYdW To get around this, this patch adds a variable, `WantSplatVector`, which is true when our vector type ought to actually be represented using a vector. When it's false, we'll translate as a scalar. This checks if `VectorWidth > 1`. This fixes this bug: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=35496 Differential Revision: https://reviews.llvm.org/D105316	2021-07-01 16:38:47 -07:00
Rob Suderman	8dea784b3e	[mlir][tosa] Add tosa shape inference with InferReturnTypeComponent Added InferReturnTypeComponents for NAry operations, reshape, and reverse. With the additional tosa-infer-shapes pass, we can infer/propagate shapes across a set of TOSA operations. Current version does not modify the FuncOp type by inserting an unrealized conversion cast prior to any new non-matchin returns. Differential Revision: https://reviews.llvm.org/D105312	2021-07-01 16:04:26 -07:00
Eli Friedman	0176ac9503	[AArch64] Optimize SVE bitcasts of unpacked types. Target-independent code only knows how to spill to the stack; instead, use AArch64ISD::REINTERPRET_CAST. Differential Revision: https://reviews.llvm.org/D104573	2021-07-01 15:35:48 -07:00
LLVM GN Syncbot	430bfc4f3b	[gn build] Port `33a7b4d9d8`	2021-07-01 22:26:09 +00:00
Petr Hosek	33a7b4d9d8	[InstrProfiling] Use external weak reference for bias variable We need the compiler generated variable to override the weak symbol of the same name inside the profile runtime, but using LinkOnceODRLinkage results in weak symbol being emitted which leads to an issue where the linker might choose either of the weak symbols potentially disabling the runtime counter relocation. This change replaces the use of weak definition inside the runtime with an external weak reference to address the issue. We also place the compiler generated symbol inside a COMDAT group so dead definition can be garbage collected by the linker. Differential Revision: https://reviews.llvm.org/D105176	2021-07-01 15:25:31 -07:00
Jon Roelofs	14d64be6e5	[GISel] Print better error messages for missing Combiner Observer calls Differential revision: https://reviews.llvm.org/D105290	2021-07-01 15:18:18 -07:00
Mehdi Amini	6bbbd7b499	Update MLIRContext to allow injecting an external ThreadPool (NFC) The context can be created with threading disabled, to avoid creating a thread pool that may be destroyed when injecting another one later. Differential Revision: https://reviews.llvm.org/D105302	2021-07-01 22:17:47 +00:00
Arthur O'Dwyer	64a0241d64	[libc++] IWYU <__utility/pair.h> in <__functional/hash.h>. NFCI. This was the only thing preventing any one of our detail headers from compiling on its own.	2021-07-01 18:12:30 -04:00
Hansang Bae	f1b9ce2736	[OpenMP] Fix a few issues with hidden helper task This patch includes the following changes to address a few issues when using hidden helper task. - Assertion is triggered when there are inadvertent calls to hidden helper functions on non-Linux OS - Added deinit code in __kmp_internal_end_library function to fix random shutdown crashes - Moved task data access into the lock-guarded region in __kmp_push_task Differential Revision: https://reviews.llvm.org/D105308	2021-07-01 17:10:32 -05:00
zoecarver	edc1f0c12c	[libcxx][ranges] Implement indirectly_swappable. Differential Revision: https://reviews.llvm.org/D105304	2021-07-01 15:08:23 -07:00
Sanjay Patel	9eb613b2de	[InstSimplify] do not propagate poison from select arm to icmp user This is the cause of the miscompile in: https://llvm.org/PR50944 The problem has likely existed for some time, but it was made visible with: `5af8bacc94` ( D104661 ) handleOtherCmpSelSimplifications() assumed it can convert select of constants to bool logic ops, but that does not work with poison. We had a very similar construct in InstCombine, so the fix here mimics the fix there. The bug is in instsimplify, but I'm not sure how to reproduce it outside of instcombine. The reason this is visible in instcombine is because we have a hack (FIXME) to bypass simplification of a select when it has an icmp user: `955f125899/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp (L2632)` So we get to an unusual case where we are trying to simplify an instruction that has an operand that would have already simplified if we had processed it in normal order. Differential Revision: https://reviews.llvm.org/D105298	2021-07-01 17:40:07 -04:00

... 7 8 9 10 11 ...

393080 Commits All Branches Search

393080 Commits

All Branches