llvm-project

Commit Graph

Author	SHA1	Message	Date
Arthur Eubanks	772bdef6af	[docs] Add page on opaque pointer types Reviewed By: dblaikie, dexonsmith Differential Revision: https://reviews.llvm.org/D102292	2021-05-13 15:10:27 -07:00
Lang Hames	71a0609a2b	[clang-repl] Temporarily disable the execute.cpp test on ppc64. This test is failing on some builders (see [1]) with the following error: error: Added modules have incompatible data layouts: e-m:e-i64:64-n32:64-S128-v256:256:256-v512:512:512 (module) vs E-m:a-i64:64-n32:64-S128-v256:256:256-v512:512:512 (jit) The JIT layout is correct, but some IR module added to the JIT is using a little-endian layout instead. This commit disables the test on ppc64 until we can investigate further and fix the bug. [1] https://lab.llvm.org/staging/#/builders/126/builds/371	2021-05-13 14:39:12 -07:00
Nikita Popov	425781bce0	[CaptureTracking] Use isIdentifiedFunctionLocal() (NFC) These conditions together exactly match isIdentifiedFunctionLocal(), and this is also what we logically want to check for here.	2021-05-13 23:06:42 +02:00
Nikita Popov	dce158c58d	[AA] Use isIdentifiedFunctionLocal() (NFC) This condition is equivalent to isIdentifiedFunctionLocal(), and this is also what we semantically want to check here.	2021-05-13 23:06:42 +02:00
Roman Lebedev	5fddc3312b	Revert "[X86][CostModel] X86TTIImpl::getMemoryOpCost(): rewrite vector handling again" As reported in post-commit feedback, this has issues with e.g. <16 x i1>: https://llvm.godbolt.org/z/jxPvdGEW4 This reverts commit `c02476f315`.	2021-05-14 00:03:36 +03:00
Roman Lebedev	6b95fd199d	Revert "[X86] X86TTIImpl::getInterleavedMemoryOpCostAVX2(): use getMemoryOpCost()" Depends on a commit that is about to be reverted. This reverts commit `69ed93a435`.	2021-05-14 00:03:36 +03:00
Roman Lebedev	aa0dcb3ba4	[X86] AMD Zen 3: same-reg SSE XMM XORPS is a 1-cycle(!) dep-breaking one-idiom While both the SOG and Agner insist that it is zero-cycle, i can not confirm that claim. While it clearly breaks the dependency, i can not come up with a snippet, or measurement approach, to end up with IPC bigger than 4, which, to me, means that it actually consumes execution resource of an FP unit for a cycle.	2021-05-14 00:03:36 +03:00
Roman Lebedev	6c4596793d	[NFC][X86][MCA] AMD Zen 3: add same-reg SSE XMM XORPS test	2021-05-14 00:03:36 +03:00
Rob Suderman	f97d970a49	[mlir][tosa] Add lowering to tosa.abs for integer cases Integer case requires decomposing to simple LLVM operatons. Differential Revision: https://reviews.llvm.org/D101809	2021-05-13 13:55:17 -07:00
Siva Chandra Reddy	b47539a14d	[libc] Enable fmaf and fma on x86_64. They require clang-11 or above for building and hence had to be disabled as the bots did not have clang-11 or higher. Bots have now been upgraded so we can enable these functions now.	2021-05-13 20:51:15 +00:00
Fangrui Song	4f05f4c8e6	[CMake][ELF] Link libLLVM.so and libclang-cpp.so with -Bsymbolic-functions llvm-dev message: https://lists.llvm.org/pipermail/llvm-dev/2021-May/150465.html In an ELF shared object, a default visibility defined symbol is preemptible by default. This creates some missed optimization opportunities. -Bsymbolic-functions is more aggressive than our current -fvisibility-inlines-hidden (present since 2012) as it applies to all function definitions. It can * avoid PLT for cross-TU function calls && reduce dynamic symbol lookup * reduce dynamic symbol lookup for taking function addresses and optimize out GOT/TOC on x86-64/ppc64 In a -DLLVM_TARGETS_TO_BUILD=X86 build, the number of JUMP_SLOT decreases from 12716 to 1628, and the number of GLOB_DAT decreases from 1918 to 1313 The built clang with `-DLLVM_LINK_LLVM_DYLIB=on -DCLANG_LINK_CLANG_DYLIB=on` is significantly faster. See the Linux kernel build result https://bugs.archlinux.org/task/70697 Note: the performance of -fno-semantic-interposition -Bsymbolic-functions libLLVM.so and libclang-cpp.so is close to a PIE binary linking against `libLLVM.a` and `libclang.a`. When the host compiler is Clang, -Bsymbolic-functions is the major contributor. On x86-64 (with GOTPCRELX) and ppc64 ELFv2, the GOT/TOC relocations can be optimized. Some implication: Interposing a subset of functions is no longer supported. (This is fragile on ELF and unsupported on Mach-O at all. For Mach-O we don't use `ld -interpose` or `-flat_namespace`) Compiling a program which takes the address of any LLVM function with `{gcc,clang} -fno-pic` and expects the address to equal to the address taken from libLLVM.so or libclang-cpp.so is unsupported. I am fairly confident that llvm-project shouldn't have different behaviors depending on such pointer equality (as we've been using -fvisibility-inlines-hidden which applies to inline functions for a long time), but if we accidentally do, users should be aware that they should not make assumption on pointer equality in `-fno-pic` mode. See more on https://maskray.me/blog/2021-05-09-fno-semantic-interposition Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D102090	2021-05-13 13:44:57 -07:00
Joseph Huber	8b57ed09bd	[OpenMP] Prevent Attributor from deleting functions in OpenMPOptCGSCC pass Summary: This patch prevents the Attributor instances made in the CGSCC pass from deleting functions. This prevents the attributor from changing the call graph while OpenMPOpt is working with it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102363	2021-05-13 16:35:23 -04:00
natashaknk	0831793ed9	[mlir][tosa] Add tosa.div integer lowering to linalg.generic. Lowering div elementwise op to the linalg dialect. Since tosa only supports integer division, that is the only version that is currently implemented. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D102430	2021-05-13 13:16:00 -07:00
Sean Silva	12874e93a1	[mlir][NFC] Add helper for common pattern of replaceAllUsesExcept This covers the extremely common case of replacing all uses of a Value with a new op that is itself a user of the original Value. This should also be a little bit more efficient than the `SmallPtrSet<Operation *, 1>{op}` idiom that was being used before. Differential Revision: https://reviews.llvm.org/D102373	2021-05-13 12:42:10 -07:00
Martin Storsjö	b42fb6811e	[llvm-nm] Support the -V option, print that the tool is compatible with GNU nm This unlocks some codepaths in libtool. Differential Revision: https://reviews.llvm.org/D102321	2021-05-13 22:36:25 +03:00
Siva Chandra Reddy	7deb5ef44f	[libc][NFC] Instead of erroring, skip math targets with missing implementations. Fixes Aarch64 bot.	2021-05-13 19:22:11 +00:00
Siva Chandra Reddy	861dc75906	[libc] Add x86_64 implementations of double precision cos, sin and tan. The implementations use the x86_64 FPU instructions. These instructions are extremely slow compared to a polynomial based software implementation. Also, their accuracy falls drastically once the input goes beyond 2PI. To improve both the speed and accuracy, we will be taking the following approach going forward: 1. As a follow up to this CL, we will implement a range reduction algorithm which will expand the accuracy to the entire double precision range. 2. After that, we will replace the HW instructions with a polynomial implementation to improve the run time. After step 2, the implementations will be accurate, performant and target architecture independent. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D102384	2021-05-13 19:02:00 +00:00
Arnamoy Bhattacharyya	b766576d38	[flang][OpenMP] Add semantic check for close nesting of `master` regions This patch implements the following semantic check: ``` A master region may not be closely nested inside a work-sharing, loop, atomic, task, or taskloop region. ``` Adds a test case and also modifies a couple of existing test cases to include the check. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D100228	2021-05-13 14:58:37 -04:00
Aaron En Ye Shi	a249ffa421	[HIP] Clean up llvm intrinsics using __asm Instead of using inline asm, use clang builtins for llvm intrinsics. Differential Revision: https://reviews.llvm.org/D102427	2021-05-13 18:55:51 +00:00
peter klausler	72abc19977	[flang] Support legacy extension OPEN(ACCESS='APPEND') It should of course be POSITION='APPEND' but Sun Fortran supported it on ACCESS=. Differential Revision: https://reviews.llvm.org/D102350	2021-05-13 11:51:20 -07:00
zoecarver	fe319a8848	[libcxx][docs] Add two locks: transform_view and take_view. Assign myself both of these views.	2021-05-13 11:49:20 -07:00
zoecarver	3ac9ff5577	[libcxx][docs] Update the One Ranges PRoposal Status with open revisions. 1. Moves the names into the names column. 2. Changes the names to reflect who's actually working on what. 3. Adds open revisions.	2021-05-13 11:49:20 -07:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Artem Dergachev	5ad2eeeada	[clang-tidy] bugprone-infinite-loop: React to ObjC ivars and messages. If the loop condition is a value of an instance variable, a property value, or a message result value, it's a good indication that the loop is not infinite and we have a really hard time proving the opposite so suppress the warning. Differential Revision: https://reviews.llvm.org/D102294	2021-05-13 11:25:02 -07:00
Artem Dergachev	46c6c08c94	[clang-tidy] bugprone-infinite-loop: forFunction() -> forCallable(). Take advantage of the new ASTMatcher added in D102213 to fix massive false negatives of the infinite loop checker on Objective-C. Differential Revision: https://reviews.llvm.org/D102214	2021-05-13 11:25:01 -07:00
Artem Dergachev	6a079dfdc9	[ASTMatchers] Add forCallable(), a generalization of forFunction(). The new matcher additionally covers blocks and Objective-C methods. This matcher actually makes sure that the statement truly belongs to that declaration's body. forFunction() incorrectly reported that a statement in a nested block belonged to the surrounding function. forFunction() is now deprecated due to the above footgun, in favor of forCallable(functionDecl()) when only functions need to be considered. Differential Revision: https://reviews.llvm.org/D102213	2021-05-13 11:25:00 -07:00
Artem Dergachev	dd98ea528c	[ASTMatchers] NFC: Fix formatting around forFunction(). Differential Revision: https://reviews.llvm.org/D102303	2021-05-13 11:25:00 -07:00
Roman Lebedev	0d8f91d2a9	[NFC] Delete two newly-added test cases Failing on bots in unobvious ways.	2021-05-13 21:23:01 +03:00
peter klausler	6829bd3ed0	[flang] (NFC) Expose internal idiom as utility API Add overloads to AsGenericExpr() in Evaluate/tools.h to take care of wrapping an untyped DataRef or bare Symbol in a typed Designator wrapped up in a generic Expr<SomeType>. Use the new overloads to replace a few instances of code that was calling TypedWrapper<>() with a dynamic type. This new tool will be useful in lowering to drive some code that works with typed expressions (viz., list-directed I/O list items) when starting with only a bare Symbol (viz., NAMELIST). Differential Revision: https://reviews.llvm.org/D102352	2021-05-13 11:19:37 -07:00
Roman Lebedev	ecc4e9e8f4	[NFC] Try to fix CodeGenCXX/thunk-wrong-return-type.cpp test	2021-05-13 21:17:31 +03:00
cynecx	8ec9fd4839	Support unwinding from inline assembly I've taken the following steps to add unwinding support from inline assembly: 1) Add a new `unwind` "attribute" (like `sideeffect`) to the asm syntax: ``` invoke void asm sideeffect unwind "call thrower", "~{dirflag},~{fpsr},~{flags}"() to label %exit unwind label %uexit ``` 2.) Add Bitcode writing/reading support + LLVM-IR parsing. 3.) Emit EHLabels around inline assembly lowering (SelectionDAGBuilder + GlobalISel) when `InlineAsm::canThrow` is enabled. 4.) Tweak InstCombineCalls/InlineFunction pass to not mark inline assembly "calls" as nounwind. 5.) Add clang support by introducing a new clobber: "unwind", which lower to the `canThrow` being enabled. 6.) Don't allow unwinding callbr. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D95745	2021-05-13 19:13:03 +01:00
Roman Lebedev	9d3eb7885d	[NFC] Try to fix CodeGenCXX/thunk-wrong-this.cpp test	2021-05-13 21:10:14 +03:00
Stefan Pintilie	54310fc176	[PowerPC] Add ROP Protection to prologue and epilogue Added hashst to the prologue and hashchk to the epilogue. The hash for the prologue and epilogue must always be stored as the first element in the local variable space on the stack. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D99377	2021-05-13 12:54:44 -05:00
peter klausler	50e0b2985e	[flang] Implement DOT_PRODUCT in the runtime API, implementation, and basic tests for the transformational reduction intrinsic function DOT_PRODUCT in the runtime support library. Differential Revision: https://reviews.llvm.org/D102351	2021-05-13 10:40:07 -07:00
Duncan P. N. Exon Smith	7c57a9bd7d	Modules: Simplify how DisableGeneratingGlobalModuleIndex is set, likely NFC DisableGeneratingGlobalModuleIndex was being set by CompilerInstance::findOrCompileModuleAndReadAST most of (but not all of) the times it returned `nullptr` as a "normal" failure. Pull that up to the caller, CompilerInstance::loadModule, to simplify the code. This resolves a number of FIXMEs added during the refactoring in `5cca622310`. The extra cases where this is set are all some version of a fatal error, and the only client of the field, shouldBuildGlobalModuleIndex, seems to be unreachable in that case. Even if there is some corner case where this has an effect, it seems like the right/consistent behaviour. Differential Revision: https://reviews.llvm.org/D101672	2021-05-13 10:39:40 -07:00
Roman Lebedev	16d0381841	Return "[CGCall] Annotate `this` argument with alignment" The original change was reverted because it was discovered that clang mishandles thunks, and they receive wrong attributes for their this/return types - the ones for the function they will call, not the ones they have. While i have tried to fix this in https://reviews.llvm.org/D100388 that patch has been up and stuck for a month now, with little signs of progress. So while it will be good to solve this for real, for now we can simply avoid introducing the bug, by not annotating this/return for thunks. This reverts commit `6270b3a1ea`, relanding `0aa0458f14`.	2021-05-13 20:33:14 +03:00
Roman Lebedev	a624cec56d	[Clang][Codegen] Do not annotate thunk's this/return types with align/deref/nonnull attrs As it was discovered in post-commit feedback for `0aa0458f14`, we handle thunks incorrectly, and end up annotating their this/return with attributes that are valid for their callees, not for thunks themselves. While it would be good to fix this properly, and keep annotating them on thunks, i've tried doing that in https://reviews.llvm.org/D100388 with little success, and the patch is stuck for a month now. So for now, as a stopgap measure, subj.	2021-05-13 20:33:08 +03:00
Roman Lebedev	70aa4623de	[NFC][Clang][Codegen] Add tests with wrong attributes on this/return of thunks From https://reviews.llvm.org/D100388	2021-05-13 20:32:40 +03:00
David Green	1011d4ed60	[ARM] Constrain CMPZ shift combine to a single use We currently prefer t2CMPrs over t2CMPri when the node contains a shift. This can introduce more nodes if the shift has multiple uses though, as value from the shift will be needed anyway, and in the case of a t2CMPri compared with zero will more readily be removed entirely. Differential Revision: https://reviews.llvm.org/D101688	2021-05-13 18:31:01 +01:00
Jonas Devlieghere	f93e9c12bf	[lldb] Fixup indirect symbols as they are signed. This fixes a bunch of test failures in Apple Silicon (arm64e).	2021-05-13 10:27:22 -07:00
Jonas Devlieghere	ce12b52de2	[lldb] Fixup more code addresses The Swift async task pointers are signed on arm64e and we need to fixup the addresses in the CFA and DWARF expressions.	2021-05-13 10:27:22 -07:00
Duncan P. N. Exon Smith	23e9146fba	Modules: Rename ModuleBuildFailed => DisableGeneratingGlobalModuleIndex, NFC Rename CompilerInstance's ModuleBuildFailed field to DisableGeneratingGlobalModuleIndex, which more precisely describes its role. Otherwise, it's hard to suss out how it's different from ModuleLoader::HadFatalFailure, and what sort of code simplifications are safe. Differential Revision: https://reviews.llvm.org/D101670	2021-05-13 10:22:40 -07:00
Weiwei Li	cd0eeb52ad	[mlir][spirv] Define spv.ImageQuerySize operation Support OpImageQuerySize in spirv dialect co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D102029	2021-05-13 13:17:08 -04:00
Valeriy Savchenko	45212dec01	[analyzer][solver] Prevent use of a null state rdar://77686137 Differential Revision: https://reviews.llvm.org/D102240	2021-05-13 20:16:29 +03:00
zoecarver	7f607ac6af	[pstl] Use logical operator for loop condition in tests Fix a probable typo in two PSTL tests that causes warnings with GCC. Patch by Jonathan Wakely (jwakely). Reviewed By: zoecarver Differential Revision: https://reviews.llvm.org/D102327	2021-05-13 10:11:40 -07:00
Duncan P. N. Exon Smith	7c2afd5899	Modules: Remove ModuleLoader::OtherUncachedFailure, NFC `5cca622310` refactored CompilerInstance::loadModule, splitting out findOrCompileModuleAndReadAST, but was careful to avoid making any functional changes. It added ModuleLoader::OtherUncachedFailure to facilitate this and left behind FIXMEs asking why certain failures weren't cached. After a closer look, I think we can just remove this and simplify the code. This changes the behaviour of the following (simplified) code from CompilerInstance::loadModule, causing a failure to be cached more often: ``` if (auto MaybeModule = MM.getCachedModuleLoad(Path[0].first)) return MaybeModule; if (ModuleName == getLangOpts().CurrentModule) return MM.cacheModuleLoad(PP.lookupModule(...)); ModuleLoadResult Result = findOrCompileModuleAndReadAST(...); if (Result.isNormal()) // This will be 'true' more often. return MM.cacheModuleLoad(..., Module); return Result; ``` `MM` here is a ModuleMap owned by the Preprocessor. Here are the cases where `findOrCompileModuleAndReadAST` starts returning a "normal" failed result: - Emitted `diag::err_module_not_found`, where there's no module map found. - Emitted `diag::err_module_build_disabled`, where implicitly building modules is disabled. - Emitted `diag::err_module_cycle`, which detects module cycles in the implicit modules build system. - Emitted `diag::err_module_not_built`, which avoids building a module in this CompilerInstance if another one tried and failed already. - `compileModuleAndReadAST()` was called and failed to build. The four errors are all fatal, and last item also reports a fatal error, so it this extra caching has no functionality change... but even if it did, it seems fine to cache these failed results within a ModuleMap instance (note that each CompilerInstance has its own Preprocessor and ModuleMap). Differential Revision: https://reviews.llvm.org/D101667	2021-05-13 10:10:46 -07:00
zoecarver	98e4fd0701	[libcxx][ranges] Fix `ranges::empty` when begin, end, and empty members are provided. Before this commit, we'd get a compilation error because the operator() overload was ambiguous. Differential Revision: https://reviews.llvm.org/D102263	2021-05-13 10:07:57 -07:00
Lei Huang	9469ff15b7	[PowerPC] Add clang option -m[no-]prefixed Add user-facing front end option to turn off power10 prefixed instructions. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D102191	2021-05-13 12:02:10 -05:00
Jon Chesterfield	10de217209	[libomptarget][amdgpu] Fix truncation error for partial wavefront [libomptarget][amdgpu] Fix truncation error for partial wavefront The partial barrier implementation involves one wavefront resetting and N-1 waiting. This change future proofs against launching with a number of threads that is not a multiple of the wavefront size. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102407	2021-05-13 17:31:57 +01:00
Jon Chesterfield	b049870d3b	[libomptarget][amdgpu] Convert an assert to print and offload_fail [libomptarget][amdgpu] Convert an assert to print and offload_fail The kernel launched is supposed to be present in the binary, but a not yet diagnosed bug means it is missing for some of the qmcpack test cases. Changing from assert to print and offload_fail should help diagnose that and similar bugs. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102378	2021-05-13 17:31:36 +01:00

... 2 3 4 5 6 ...

388475 Commits All Branches Search

388475 Commits

All Branches