llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Bataev	9d826fdb28	[X86][NFC]Add run lines for AVX512VL for masked gather test, NFC.	2021-07-08 11:30:31 -07:00
Matheus Izvekov	5a1c50410c	[clang] fix constexpr code generation for user conversions. When building the member call to a user conversion function during an implicit cast, the expression was not being checked for immediate invocation, so we were never adding the ConstantExpr node to AST. This would cause the call to the user conversion operator to be emitted even if it was constantexpr evaluated, and this would even trip an assert when said user conversion was declared consteval: `Assertion failed: !cast<FunctionDecl>(GD.getDecl())->isConsteval() && "consteval function should never be emitted", file clang\lib\CodeGen\CodeGenModule.cpp, line 3530` Fixes PR48855. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D105446	2021-07-08 20:23:19 +02:00
Stanislav Mekhanoshin	de5582be26	[AMDGPU] Fix more indention in llc-pipeline test. NFC.	2021-07-08 11:20:00 -07:00
Louis Dionne	8ea2b951c6	[libc++][ci] Install Clang 11, Clang 12 and Clang ToT in the Docker image The compiler support policy mentions that we support Clang 11 and 12, so we should test those. We already test on Clang 12, but I'll add testers for Clang 11 once the new Docker image is in use on all the builders.	2021-07-08 14:14:17 -04:00
Nicolas Vasilache	4747e1b83b	[mlir][Linalg] Fix tensor.extract_slice(linalg.init_tensor) canonicalization for rank-reducing extract Differential Revision: https://reviews.llvm.org/D105636	2021-07-08 18:13:51 +00:00
Michael Liao	8c7ff9da90	[Metadata] Decorate methods with 'const'. NFC. - Minor coding style fix.	2021-07-08 14:11:14 -04:00
Stanislav Mekhanoshin	9dae86ce56	[AMDGPU] Fix indention in llc-pipeline test. NFC.	2021-07-08 11:08:25 -07:00
Matt Arsenault	43f25e61ce	Mips/GlobalISel: Remove custom splitToValueTypes	2021-07-08 13:39:06 -04:00
Matt Arsenault	9b057f647d	GlobalISel: Track original argument index in ArgInfo SelectionDAG's equivalents in ISD::InputArg/OutputArg track the original argument index. Mips relies on this, and its currently reinventing its own parallel CallLowering infrastructure which tracks these indexes on the side. Add this to help move towards deleting the custom mips handling.	2021-07-08 13:39:02 -04:00
Matt Arsenault	2f9504aa41	Mips/GlobalISel: Use correct callee calling convention This was using the convention from the calling function.	2021-07-08 13:38:57 -04:00
Fangrui Song	d833543dd5	[LangRef] Fix typo about SHF_LINK_ORDER	2021-07-08 10:29:43 -07:00
Leonard Chan	a11aea68a4	[compiler-rt][hwasan] Define fuchsia implementations of required hwasan functions This contains all the definitions required by hwasan for the fuchsia implementation and can be landed independently from the remaining parts of D91466. Differential Revision: https://reviews.llvm.org/D103936	2021-07-08 10:24:27 -07:00
Eli Friedman	5b350183cd	[ScalarEvolution] Fix overflow in computeBECount. There are two issues with the current implementation of computeBECount: 1. It doesn't account for the possibility that adding "Stride - 1" to Delta might overflow. For almost all loops, it doesn't, but it's not actually proven anywhere. 2. It doesn't account for the possibility that Stride is zero. If Delta is zero, the backedge is never taken; the value of Stride isn't relevant. To handle this, we have to make sure that the expression returned by computeBECount evaluates to zero. To deal with this, add two new checks: 1. Use a variety of tricks to try to prove that the addition doesn't overflow. If the proof is impossible, use an alternate sequence which never overflows. 2. Use umax(Stride, 1) to handle the possibility that Stride is zero. Differential Revision: https://reviews.llvm.org/D105216	2021-07-08 10:09:55 -07:00
Simon Pilgrim	769e782793	Fix MSVC "truncation from 'int' to 'bool'" warning. NFCI.	2021-07-08 18:08:24 +01:00
Simon Pilgrim	8ef67fa9d2	[CostModel][X86] Account for older SSE targets with slow fp->int conversions Both the conversion cost and the xmm->gpr transfer cost tend to be a lot higher on early SSE targets	2021-07-08 18:08:24 +01:00
Fangrui Song	c34b0ab589	[LangRef] Clarify !associated Notably, a global variable with the metadata should generally not be referenced by a function function. E.g. -fstack-size-section usage is fine, but -fsanitize-coverage= used to have a linker GC problem (fixed by D97430). Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D104933	2021-07-08 10:07:10 -07:00
Aart Bik	8cf60e61e7	[mlir][sparse] updated setter/getter comments For the getters, it is bad practice to keep the reference around for too long, as explained in the new comment Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D105599	2021-07-08 09:36:22 -07:00
Stanislav Mekhanoshin	74a5760d35	[AMDGPU] Set LoopInfo as preserved by SIAnnotateControlFlow The pass does not change loops, it just adds calls. Differential Revision: https://reviews.llvm.org/D105583	2021-07-08 09:34:43 -07:00
Louis Dionne	d2a8d362c5	[libc++][ci] Stop testing on GCC previous, since we don't support it anymore This is the first of a few commits that update the CI to match the recently officialized compiler support policy. I'm staging those changes to try and keep the CI green at all times, accounting how builders refresh their Docker image.	2021-07-08 12:33:26 -04:00
Nikita Popov	83a87b831a	[IR] Restore vector support for deprecated CreateGEP methods As pointed out in post-commit review on rG8e22539067d9, it's necessary to call getScalarType() to support GEPs with a vector base. Dropping that call was an oversight on my side.	2021-07-08 18:15:56 +02:00
Louis Dionne	a276f45180	[libc++][docs] Update documentation to reflect libc++'s compiler support policy In https://lists.llvm.org/pipermail/llvm-dev/2021-March/148881.html, we discussed updating the compiler support policy for libc++ to match more closely what we do actually support. This commit enshrines that policy decision in libc++'s documentation. Differential Revision: https://reviews.llvm.org/D105563	2021-07-08 11:56:50 -04:00
Louis Dionne	6afd6e96ce	[libc++] Workaround failures with modules on Clang ToT	2021-07-08 11:50:54 -04:00
Jeremy Morse	63cc251eb9	[DebugInfo][InstrRef][4/4] Support DBG_INSTR_REF through all backend passes This is a cleanup patch -- we're now able to support all flavours of variable location in instruction referencing mode. This patch updates various tests for debug instructions to be broader: numerous code paths try to ignore debug isntructions, and they now have to ignore the additional DBG_PHI and DBG_INSTR_REFs that we can generate. A small amount of rework happens for LiveDebugVariables: as we don't need to track live intervals through regalloc any more, we can get away with unlinking debug instructions before regalloc, then re-inserting them after. Note that this isn't (yet) true of DBG_VALUE_LISTs, they still have to go through live interval tracking. In SelectionDAG, add a helper lambda that emits half-formed DBG_INSTR_REFs for arguments in instr-ref mode, DBG_VALUE otherwise. This is one of the final locations where DBG_VALUEs are emitted for vreg arguments. X86InstrInfo now un-sets the debug instr number on SUB instructions that get mutated into CMP instructions. As the instruction no longer computes a subtraction, we can't use it for variable locations. Differential Revision: https://reviews.llvm.org/D88898	2021-07-08 16:42:24 +01:00
LLVM GN Syncbot	87e41cc4b6	[gn build] Port `321c2ea91c`	2021-07-08 15:35:54 +00:00
Mark de Wever	321c2ea91c	[libc++][NFC] Move monostate to its own header. The format library uses `std::monostate`, but not a `std::variant`. Moving `std::monostate` to its own header allows the format library to reduce the amount of included code. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D105582	2021-07-08 17:35:26 +02:00
Mark de Wever	4947ecf4e9	[libc++] Guard testing implementation details. The unit tests test some implementation details. As @Quuxplusone pointed out in D96664 this should only be tested when the tests use libc++. This addresses the issue for code already in main. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D105568	2021-07-08 17:34:58 +02:00
Tim Northover	48c68a630e	Recommit: Support: add llvm::thread class that supports specifying stack size. This adds a new llvm::thread class with the same interface as std::thread except there is an extra constructor that allows us to set the new thread's stack size. On Darwin even the default size is boosted to 8MB to match the main thread. It also switches all users of the older C-style `llvm_execute_on_thread` API family over to `llvm::thread` followed by either a `detach` or `join` call and removes the old API. Moved definition of DefaultStackSize into the .cpp file to hopefully fix the build on some (GCC-6?) machines.	2021-07-08 16:22:26 +01:00
Alexey Bataev	b5113bff46	[Instcombine]Transform reduction+(sext/zext(<n x i1>) to <n x im>) to [-]zext/trunc(ctpop(bitcast <n x i1> to in)) to im. Some of the SPEC tests end up with reduction+(sext/zext(<n x i1>) to <n x im>) pattern, which can be transformed to [-]zext/trunc(ctpop(bitcast <n x i1> to in)) to im. Also, reduction+(<n x i1>) can be transformed to ctpop(bitcast <n x i1> to in) & 1 != 0. Differential Revision: https://reviews.llvm.org/D105587	2021-07-08 07:56:41 -07:00
Michael Liao	4e5d9c8803	[Internalize] Preserve variables externally initialized. - ``externally_initialized`` variables would be initialized or modified elsewhere. Particularly, CUDA or HIP may have host code to initialize or modify ``externally_initialized`` device variables, which may not be explicitly referenced on the device side but may still be used through the host side interfaces. Not preserving them triggers the elimination of them in the GlobalDCE and breaks the user code. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D105135	2021-07-08 10:48:19 -04:00
Markus Böck	3e6d2cbf26	[mlir] Fully qualify types and expressions in Interfaces This patch adds full qualification to the types and function calls used inside of (Static)InterfaceMethod in OpInterfaces. Without this patch using many of these interfaces in a downstream project yields compiler errors as the types and default implementations are mostly copied verbatim. Without then putting using namespace mlir; in the header file of the implementations of those interfaces, compilation is impossible. Using fully qualified lookup fixes this issue. Differential Revision: https://reviews.llvm.org/D105619	2021-07-08 16:44:16 +02:00
Alexey Bataev	9320d4b695	[Instcombine][NFC]Add a test for reduce+([sext/zext](<n x i1)) case, NFC.	2021-07-08 07:38:11 -07:00
Michael Liao	cc92833f8a	[amdgpu] Remove the GlobalDCE pass prior to the internalization pass. - In [D98783](https://reviews.llvm.org/D98783), an extra GlobalDCE pass is inserted before the internalization pass to ensure a global variable without users could be internalized even if there are dead users. Instead of inserting a dedicated optimization pass, the dead user checking, i.e. 'use_empty()', should be preceeded with constant dead user removal to ensure an accurate result. Differential Revision: https://reviews.llvm.org/D105590	2021-07-08 10:25:58 -04:00
Tim Northover	2bf5e8d953	Revert "Support: add llvm::thread class that supports specifying stack size." It's causing build failures because DefaultStackSize isn't defined everywhere it should be and I need time to investigate.	2021-07-08 14:59:47 +01:00
Tim Northover	727e1c9be3	Support: add llvm::thread class that supports specifying stack size. This adds a new llvm::thread class with the same interface as std::thread except there is an extra constructor that allows us to set the new thread's stack size. On Darwin even the default size is boosted to 8MB to match the main thread. It also switches all users of the older C-style `llvm_execute_on_thread` API family over to `llvm::thread` followed by either a `detach` or `join` call and removes the old API.	2021-07-08 14:51:53 +01:00
xndcn	7445f1e4dc	[NFC] Mark Expected<T>::assertIsChecked() as const Some const methods of Expected<T> invoke assertIsChecked(), so we should mark it as const too. Differential Revision: https://reviews.llvm.org/D105292	2021-07-08 21:30:23 +08:00
Chia-hung Duan	ba913b8da5	[mlir-reduce] Fix the memory leak and recycle unused modules. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105416	2021-07-08 20:03:47 +08:00
Bradley Smith	026bb84bcd	[AArch64][SVE] Add ISel patterns for floating point compare with zero instructions Additionally, lower the floating point compare SVE intrinsics to SETCC_MERGE_ZERO ISD nodes to avoid duplicating ISel patterns. Differential Revision: https://reviews.llvm.org/D105486	2021-07-08 10:46:12 +00:00
Max Kazantsev	767eb9f9d5	[Test] Add loop deletion switch tests Patch by Dmitry Makogon! Differential Revision: https://reviews.llvm.org/D105543	2021-07-08 17:28:08 +07:00
Nicolas Vasilache	31f80393bc	Revert "[mlir][MemRef] Fix DimOp folding of OffsetSizeAndStrideInterface." This reverts commit `6c0fd4db79`. This simple implementation is unfortunately not extensible and needs to be reverted. The extensible way should be to extend https://reviews.llvm.org/D104321.	2021-07-08 10:09:00 +00:00
Moritz Sichert	d58c7a9238	[IR] Added operator delete to subclasses of User to avoid UB Several subclasses of User override operator new without also overriding operator delete. This means that delete expressions fall back to using operator delete of the base class, which would be User. However, this is only allowed if the base class has a virtual destructor which is not the case for User, so this is UB. See also [expr.delete] (3) for the exact wording. This is actually detected in some cases by GCC 11's -Wmismatched-new-delete now which is how I found this error. Differential Revision: https://reviews.llvm.org/D103143	2021-07-08 11:59:22 +02:00
Martin Storsjö	715ca752ac	[libcxx] [test] Fix spurious failures in the thread detach test on Windows Make sure that the detached thread has started up before exiting the process. If the detached thread hasn't started up at all, and the main thread exits, global data structures in the process are torn down, which then can cause crashes when the thread starts up late after required mutexes have been destroyed. (In particular, the mutex used internally in _Init_thread_header, which is used in the initialization of __thread_local_data()::__p, can cause crashes if the main thread already has finished and progressed far with destruction.) Differential Revision: https://reviews.llvm.org/D105592	2021-07-08 12:36:03 +03:00
Tobias Gysi	abfa950d86	[mlir][linalg][python] Add exp and log to the OpDSL. Introduce the exp and log function in OpDSL. Add the soft plus operator to test the emitted IR in Python and C++. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105420	2021-07-08 08:48:23 +00:00
Tobias Gysi	84354b2ab2	[mlir][linalg] Remove GenericOpBase. Remove the GenericOpBase class formerly used to factor out common logic shared be GenericOp and IndexedGenericOp. After removing IndexedGenericOp, the base class is not used anymore. Differential Revision: https://reviews.llvm.org/D105307	2021-07-08 08:35:28 +00:00
Nicolas Vasilache	6c0fd4db79	[mlir][MemRef] Fix DimOp folding of OffsetSizeAndStrideInterface. This addresses the issue reported in https://llvm.discourse.group/t/rank-reducing-memref-subview-offsetsizeandstrideopinterface-interface-issues/3805 Differential Revision: https://reviews.llvm.org/D105558	2021-07-08 08:30:24 +00:00
Alex Zinenko	684dfe8adb	[mlir] factor out ConvertToLLVMPattern This class and classes that extend it are general utilities for any dialect that is being converted into the LLVM dialect. They are in no way specific to Standard-to-LLVM conversion and should not make their users depend on it. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105542	2021-07-08 10:27:05 +02:00
Sebastian Neubauer	9ced1e44ad	[AMDGPU] Fix typo	2021-07-08 10:07:33 +02:00
Mikael Holmen	21fd875952	[lld/mac] Fix warning about unused variable [NFC] Change "dyn_cast" to "isa" to get rid of the unused variable "bitcodeFile". gcc warned with lld/MachO/Driver.cpp:531:17: warning: unused variable 'bitcodeFile' [-Wunused-variable] 531 \| if (auto *bitcodeFile = dyn_cast<BitcodeFile>(file)) { \| ^~~~~~~~~~~	2021-07-08 09:46:30 +02:00
Tobias Gysi	511af1b1ad	[mlir][linalg] Tighter StructuredOp Verification. Verify the number of results matches exactly the number of output tensors. Simplify the FillOp verification since part of it got redundant. Differential Revision: https://reviews.llvm.org/D105427	2021-07-08 06:53:36 +00:00
Lang Hames	d7afd11e3d	[ORC] Introduce ExecutorAddress type, fix broken LLDB bot. ExecutorAddressRange depended on JITTargetAddress, but JITTargetAddress is defined in ExecutionEngine, which OrcShared should not depend on. This seems like as good a time as any to introduce a new ExecutorAddress type to eventually replace JITTargetAddress. For now it's just another uint64_t alias, but it will soon be changed to a class type to provide greater type safety.	2021-07-08 16:31:59 +10:00
Lang Hames	963378bd82	[ORC] Improve computeLocalDeps / computeNamedSymbolDependencies performance. The computeNamedSymbolDependencies and computeLocalDeps methods on ObjectLinkingLayerJITLinkContext are responsible for computing, for each symbol in the current MaterializationResponsibility, the set of non-locally-scoped symbols that are depended on. To calculate this we have to consider the effect of chains of dependence through locally scoped symbols in the LinkGraph. E.g. .text .globl foo foo: callq bar ## foo depneds on external 'bar' movq Ltmp1(%rip), %rcx ## foo depends on locally scoped 'Ltmp1' addl (%rcx), %eax retq .data Ltmp1: .quad x ## Ltmp1 depends on external 'x' In this example symbol 'foo' depends directly on 'bar', and indirectly on 'x' via 'Ltmp1', which is locally scoped. Performance of the existing implementations appears to have been mediocre: Based on flame graphs posted by @drmeister (in #jit on the LLVM discord server) the computeLocalDeps function was taking up a substantial amount of time when starting up Clasp (https://github.com/clasp-developers/clasp). This commit attempts to address the performance problems in three ways: 1. Using jitlink::Blocks instead of jitlink::Symbols as the nodes of the dependencies-introduced-by-locally-scoped-symbols graph. Using either Blocks or Symbols as nodes provides the same information, but since there may be more than one locally scoped symbol per block the block-based version of the dependence graph should always be a subgraph of the Symbol-based version, and so faster to operate on. 2. Improved worklist management. The older version of computeLocalDeps used a fixed worklist containing all nodes, and iterated over this list propagating dependencies until no further changes were required. The worklist was not sorted into a useful order before the loop started. The new version uses a variable work-stack, visiting nodes in DFS order and only adding nodes when there is meaningful work to do on them. Compared to the old version the new version avoids revisiting nodes which haven't changed, and I suspect it converges more quickly (due to the DFS ordering). 3. Laziness and caching. Mappings of... jitlink::Symbol* -> Interned Name (as SymbolStringPtr) jitlink::Block* -> Immediate dependencies (as SymbolNameSet) jitlink::Block* -> Transitive dependencies (as SymbolNameSet) are all built lazily and cached while running computeNamedSymbolDependencies. According to @drmeister these changes reduced Clasp startup time in his test setup (averaged over a handful of starts) from 4.8 to 2.8 seconds (with ORC/JITLink linking ~11,000 object files in that time), which seems like enough to justify switching to the new algorithm in the absence of any other perf numbers.	2021-07-08 16:31:59 +10:00

1 2 3 4 5 ...

393042 Commits All Branches Search

393042 Commits

All Branches