llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Kleckner	b8000c0ce8	[Windows] Autolink with basenames and add libdir to libpath Prior to this change, for a few compiler-rt libraries such as ubsan and the profile library, Clang would embed "-defaultlib:path/to/rt-arch.lib" into the .drective section of every object compiled with -finstr-profile-generate or -fsanitize=ubsan as appropriate. These paths assume that the link step will run from the same working directory as the compile step. There is also evidence that sometimes the paths become absolute, such as when clang is run from a different drive letter from the current working directory. This is fragile, and I'd like to get away from having paths embedded in the object if possible. Long ago it was suggested that we use this for ASan, and apparently I felt the same way back then: https://reviews.llvm.org/D4428#56536 This is also consistent with how all other autolinking usage works for PS4, Mac, and Windows: they all use basenames, not paths. To keep things working for people using the standard GCC driver workflow, the driver now adds the resource directory to the linker library search path when it calls the linker. This is enough to make check-ubsan pass, and seems like a generally good thing. Users that invoke the linker directly (most clang-cl users) will have to add clang's resource library directory to their linker search path in their build system. I'm not sure where I can document this. Ideally I'd also do it in the MSBuild files, but I can't figure out where they go. I'd like to start with this for now. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D65543	2020-04-28 11:36:21 -07:00
Francis Visoiu Mistrih	e770153865	[AArch64] Add support for -ffixed-x30 Add support for reserving LR in: * the driver through `-ffixed-x30` * cc1 through `-target-feature +reserve-x30` * the backend through `-mattr=+reserve-x30` * a subtarget feature `reserve-x30` the same way we're doing for the other registers.	2020-04-28 08:48:28 -07:00
Luke Geeson	740a1dd050	[ARM] Armv8.6-a Matrix Mul cmd line support This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Command line options to enable these features with +i8mm, +f32mm, or f64mm Note: +f32mm and +f64mm are optional and so are not enabled by default This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, DavidSpickett Reviewed By: DavidSpickett Subscribers: DavidSpickett, ostannard, kristof.beyls, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77875	2020-04-24 15:54:06 +01:00
Sergej Jaskiewicz	2899103108	[TimeProfiler] Emit clock synchronization point Time profiler emits relative timestamps for events (the number of microseconds passed since the start of the current process). This patch allows combining events from different processes while preserving their relative timing by emitting a new attribute "beginningOfTime". This attribute contains the system time that corresponds to the zero timestamp of the time profiler. This has at least two use cases: - Build systems can use this to merge time traces from multiple compiler invocations and generate statistics for the whole build. Tools like ClangBuildAnalyzer could also leverage this feature. - Compilers that use LLVM as their backend by invoking llc/opt in a child process. If such a compiler supports generating time traces of its own events, it could merge those events with LLVM-specific events received from llc/opt, and produce a more complete time trace. A proof-of-concept script that merges multiple logs that contain a synchronization point into one log: https://github.com/broadwaylamb/merge_trace_events Differential Revision: https://reviews.llvm.org/D78030	2020-04-23 01:09:31 +03:00
Sergej Jaskiewicz	a5bf02815d	[TimeProfiler] Emit real process ID and thread names Differential Revision: https://reviews.llvm.org/D78027	2020-04-23 00:12:51 +03:00
Justin Hibbits	4ca2cad947	[PowerPC] Add clang -msvr4-struct-return for 32-bit ELF Summary: Change the default ABI to be compatible with GCC. For 32-bit ELF targets other than Linux, Clang now returns small structs in registers r3/r4. This affects FreeBSD, NetBSD, OpenBSD. There is no change for 32-bit Linux, where Clang continues to return all structs in memory. Add clang options -maix-struct-return (to return structs in memory) and -msvr4-struct-return (to return structs in registers) to be compatible with gcc. These options are only for PPC32; reject them on PPC64 and other targets. The options are like -fpcc-struct-return and -freg-struct-return for X86_32, and use similar code. To actually return a struct in registers, coerce it to an integer of the same size. LLVM may optimize the code to remove unnecessary accesses to memory, and will return i32 in r3 or i64 in r3:r4. Fixes PR#40736 Patch by George Koehler! Reviewed By: jhibbits, nemanjai Differential Revision: https://reviews.llvm.org/D73290	2020-04-21 20:17:25 -05:00
Richard Smith	c8248dc3bb	Change deprecated -fsanitize-recover flag to apply to all sanitizers, not just UBSan. Summary: This flag has been deprecated, with an on-by-default warning encouraging users to explicitly specify whether they mean "all" or ubsan for 5 years (released in Clang 3.7). Change it to mean what we wanted and undeprecate it. Also make the argument to -fsanitize-trap optional, and likewise default it to 'all', and express the aliases for these flags in the .td file rather than in code. (Plus documentation updates for the above.) Reviewers: kcc Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77753	2020-04-17 22:37:30 -07:00
Lei Huang	10b60dde76	[PowerPC] Refactor ppcUserFeaturesCheck() Summary: This function keeps growing, refactor to use lambda. Reviewers: nemanjai, stefanp Subscribers: kbarton, shchenz, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78308	2020-04-17 15:19:46 -05:00
Matt Arsenault	3a61245050	clang/AMDGPU: Assume denormals are enabled for the default target. Since the default logic was based on having fast denormal/fma features, and the default target has no features, we assumed flushing by default. This fixes incorrectly assuming flushing in builds for "generic" IR libraries. The handling for no specified --cuda-gpu-arch in HIP is kind of broken. Somewhere else forces a default target of gfx803, which does not enable denormal handling by default. We don't see this default switching here, so you'll end up with a different denormal mode depending on whether you explicitly requested gfx803, or used it by default.	2020-04-15 09:17:26 -04:00
Sam Clegg	3ea1c62cba	[WebAssembly] Emit .llvmcmd and .llvmbc as custom sections Fixes: https://bugs.llvm.org/show_bug.cgi?id=45362 Differential Revision: https://reviews.llvm.org/D77115	2020-04-14 13:24:18 -07:00
Jon Roelofs	38b39c34ab	[clang] Add missing FileCheck colons	2020-04-14 12:32:48 -06:00
Scott Constable	539163affe	[X86] Add tests to clang Driver to ensure that SLH/Retpoline features are not enabled with LVI-CFI Differential Revision: https://reviews.llvm.org/D77427	2020-04-14 10:47:27 -07:00
Matt Arsenault	dc89a3efb4	HIP: Fix handling of denormal mode I didn't realize HIP was a distinct offloading kind, so the subtarget was looking for -march, which isn't correct for HIP. We also have the possibility of different denormal defaults in the case of multiple offload targets, so we need to thread the JobAction through the target hook.	2020-04-13 11:48:45 -07:00
Alexander Kornienko	8dda0f9199	Remove dependency between test files. There seems to be no good reason for rocm-device-libs.cl to depend on opencl.cl. Removed this dependency to unbreak the tests in our setup.	2020-04-13 06:19:09 +02:00
Matt Arsenault	1e93b3d8a7	Disable test on windows	2020-04-10 18:48:18 -04:00
Matt Arsenault	4593e4131a	AMDGPU: Teach toolchain to link rocm device libs Currently the library is separately linked, but this isn't correct to implement fast math flags correctly. Each module should get the version of the library appropriate for its combination of fast math and related flags, with the attributes propagated into its functions and internalized. HIP already maintains the list of libraries, but this is not used for OpenCL. Unfortunately, HIP uses a separate --hip-device-lib argument, despite both languages using the same bitcode library. Eventually these two searches need to be merged. An additional problem is there are 3 different locations the libraries are installed, depending on which build is used. This also needs to be consolidated (or at least the search logic needs to deal with this unnecessary complexity).	2020-04-10 13:37:32 -04:00
Simon Cook	dd1ee6dc07	[RISCV] Support experimental/unratified extensions This adds support for enabling experimental/unratified RISC-V ISA extensions in the -march string in the case where an explicit version number has been declared, and the -menable-experimental-extensions flag has been provided. This follows the design as discussed on the mailing lists in the following RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138364.html Since the RISC-V toolchain definition currently rejects any extension with an explicit version number, the parsing logic has been tweaked to support this, and to allow standard extensions to have their versions checked in future patches. The bitmanip 'b' extension has been added as a first use of this support, it should easily extend to other as yet unratified extensions (such as the vector 'v' extension). Differential Revision: https://reviews.llvm.org/D73891	2020-04-09 18:04:22 +01:00
Sid Manning	9bda29ab0f	[Hexagon] Default linker tests can fail if CLANG_DEFAULT_LINKER is used. These values are not always known since there is a configuration option to set the default linker, CLANG_DEFAULT_LINKER. Differential Revision: https://reviews.llvm.org/D77684	2020-04-09 08:36:50 -05:00
Shengchen Kan	792b10978d	[Driver][X86] Add -mpad-max-prefix-size Summary: The option `-mpad-max-prefix-size` performs some checking and delegate to MC option `-x86-pad-max-prefix-size`. This option is designed for eliminate NOPs when we need to align something by adding redundant prefixes to instructions, e.g. it can be used along with `-malign-branch`, `-malign-branch-boundary` to prefix padding branch. It has similar (but slightly different) effect as GAS's option `-malign-branch-prefix-size`, e.g. `-mpad-max-prefix-size` can also elminate NOPs emitted by align directive, so we use a different name here. I remove the option `-malign-branch-prefix-size` since is unimplemented and not needed. If we need to be compatible with GAS, we can make `-malign-branch-prefix-size` an alias for this option later. Reviewers: jyknight, reames, MaskRay, craig.topper, LuoYuanke Reviewed By: MaskRay, LuoYuanke Subscribers: annita.zhang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77628	2020-04-09 19:34:12 +08:00
Pratyai Mazumder	ced398fdc8	[SanitizerCoverage] Add -fsanitize-coverage=inline-bool-flag Reviewers: kcc, vitalybuka Reviewed By: vitalybuka Subscribers: cfe-commits, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77637	2020-04-09 02:40:55 -07:00
WangTianQing	a3dc949000	[X86] Add TSXLDTRK instructions. Summary: For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference Reviewers: craig.topper, RKSimon, LuoYuanke Reviewed By: craig.topper Subscribers: mgorny, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77205	2020-04-09 13:17:29 +08:00
Fangrui Song	969b91af73	[Driver] Default arm-linux-androideabi to -z max-page-size=4096 Similar to D55029. The requirement arises when discussing increasing default max-page-size for lld ARM (D77330). For the record, the default max-page-size on the 3 commonly used linkers: * GNU ld since 2014 (https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=7572ca8989ead4c3425a1500bc241eaaeffa2c89) defaults to 65536 * GNU gold remains 4096 * lld<=10 uses 4096. lld from 11 onwards will use 65536 (D77330) Reviewed By: srhines, thieta Differential Revision: https://reviews.llvm.org/D77746	2020-04-08 12:05:28 -07:00
Artem Belevich	d2e498b172	[CUDA] Improve testing of libdevice detection. Added new testcases for libdevice in CUDA-9+ and removed unused checks. Differential Revision: https://reviews.llvm.org/D77688	2020-04-08 11:19:45 -07:00
Francis Visoiu Mistrih	9e6670b03c	[Driver] Only pass LTO remark arguments if the driver asks for it Previous fix missed a check to willEmitRemarks, causing remarks to always be enabled for LTO.	2020-04-07 14:11:47 -07:00
Sid Manning	aed2fdb167	[Hexagon] Update paths for linux/musl Update the sysroot expectation to match other targets and breakout linux/musl toolchain tests into a new file. Differential Revision: https://reviews.llvm.org/D77440	2020-04-07 13:45:52 -05:00
Michael Liao	c97be2c377	[hip] Remove `hip_pinned_shadow`. Summary: - Use `device_builtin_surface` and `device_builtin_texture` for surface/texture reference support. So far, both the host and device use the same reference type, which could be revised later when interface/implementation is stablized. Reviewers: yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77583	2020-04-07 09:51:49 -04:00
Sid Manning	2c5d6dfda9	[Hexagon] Make lld be the default linker for linux/musl When the target is hexagon-unknown-linux-musl select lld as the default linker. Differential Revision: https://reviews.llvm.org/D77498	2020-04-06 12:59:07 -05:00
Hans Wennborg	f8e1fc20cb	Make clang/test/Driver/cl-options.cu pass in 32-bit builds	2020-04-06 16:04:43 +02:00
Nico Weber	e01ec11882	make `ccabe93298` more robust	2020-04-05 13:07:50 -04:00
Nico Weber	ccabe93298	clang: Make tests using symlinks more consistent. Instead of checking if each symlink exists before removing it, remove the whole temp dir housing the symlinks before recreating it. This is a bit shorter, conceptually simpler (in that the first and consecutive test runs have more similar behavior), it's what we're already doing in almost all places where we do it, and it works if the symlink exists but is a dead link (e.g. when it points into the build dir but the build dir is renamed). No intended behavior change.	2020-04-05 12:56:41 -04:00
Craig Topper	1d42c0db9a	Revert "[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets" This reverts commit `c74dd640fd`. Reverting to address coding standard issues raised in post-commit review.	2020-04-03 16:56:08 -07:00
Francis Visoiu Mistrih	ba8b3052b5	[Driver] Handle all optimization-record options for Darwin LTO clang with -flto does not handle -foptimization-record-path=<path> This dulicates the code from ToolChains/Clang.cpp with modifications to support everything in the same fashion.	2020-04-03 15:30:08 -07:00
Scott Constable	c74dd640fd	[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets Adds a new data structure, ImmutableGraph, and uses RDF to find LVI gadgets and add them to a MachineGadgetGraph. More specifically, a new X86 machine pass finds Load Value Injection (LVI) gadgets consisting of a load from memory (i.e., SOURCE), and any operation that may transmit the value loaded from memory over a covert channel, or use the value loaded from memory to determine a branch/call target (i.e., SINK). Also adds a new target feature to X86: +lvi-load-hardening The feature can be added via the clang CLI using -mlvi-hardening. Differential Revision: https://reviews.llvm.org/D75936	2020-04-03 13:02:04 -07:00
Scott Constable	5b519cf1fc	[X86] Add Indirect Thunk Support to X86 to mitigate Load Value Injection (LVI) This pass replaces each indirect call/jump with a direct call to a thunk that looks like: lfence jmpq *%r11 This ensures that if the value in register %r11 was loaded from memory, then the value in %r11 is (architecturally) correct prior to the jump. Also adds a new target feature to X86: +lvi-cfi ("cfi" meaning control-flow integrity) The feature can be added via clang CLI using -mlvi-cfi. This is an alternate implementation to https://reviews.llvm.org/D75934 That merges the thunk insertion functionality with the existing X86 retpoline code. Differential Revision: https://reviews.llvm.org/D76812	2020-04-03 00:34:39 -07:00
WangTianQing	d08fadd662	[X86] Add SERIALIZE instruction. Summary: For more details about this instruction, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference Reviewers: craig.topper, RKSimon, LuoYuanke Reviewed By: craig.topper Subscribers: mgorny, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77193	2020-04-02 16:19:23 +08:00
Matt Arsenault	4ea3650c21	HIP: Link correct denormal mode library This wasn't respecting the flush mode based on the default, and also wasn't correctly handling the explicit -fno-cuda-flush-denormals-to-zero overriding the mode.	2020-04-01 12:36:22 -04:00
Fangrui Song	531b3aff30	[Frontend] Replace CC1 option -masm-verbose with -fno-verbose-asm Most OS✕target enable -fverbose-asm, so it makes sense to flip the CC1 option to reduce common command lines.	2020-03-31 22:33:55 -07:00
Fangrui Song	d0d076fed9	[Driver] Flip the CC1 default of -fdiagnostics-show-option The driver enables -fdiagnostics-show-option by default, so flip the CC1 default to reduce the lengths of common CC1 command lines. This change also makes ParseDiagnosticArgs() consistently enable -fdiagnostics-show-option by default.	2020-03-31 21:59:27 -07:00
Fangrui Song	3341dc7339	[Driver] Don't pass -fobjc-rumtime= for non-ObjC input	2020-03-31 17:50:37 -07:00
Fangrui Song	4805901930	[Driver] Don't pass -fmessage-length=0 to CC1 -fmessage-length=0 is common (unless the environment variable COLUMNS is set and exported. This simplifies a common CC1 command line.	2020-03-31 17:12:08 -07:00
Matt Arsenault	c9d65a48af	HIP: Ensure new denormal mode attributes are set Apparently HIPToolChain does not subclass from AMDGPUToolChain, so this was not applying the new denormal attributes. I'm not sure why this doesn't subclass. Just copy the implementation for now.	2020-03-31 18:00:37 -04:00
Amara Emerson	7f1ea924c6	Add a new -fglobal-isel option and make -fexperimental-isel an alias for it. Since GlobalISel is maturing and is already on at -O0 for AArch64, it's not completely "experimental". Create a more appropriate driver flag and make the older option an alias for it. Differential Revision: https://reviews.llvm.org/D77103	2020-03-31 12:06:11 -07:00
Florian Hahn	7899a111ea	Revert "[Darwin] Respect -fno-unroll-loops during LTO." As per post-commit comment at https://reviews.llvm.org/D76916, this should better be done at the TU level. This reverts commit `9ce198d6ed`.	2020-03-30 15:20:30 +01:00
Alexandre Ganea	3ab3f3c5d5	After `09158252f7`, fix build when -DLLVM_ENABLE_THREADS=OFF Tested on Linux with Clang 9, and on Windows with Visual Studio 2019 16.5.1 with -DLLVM_ENABLE_THREADS=ON and OFF.	2020-03-28 13:54:58 -04:00
Florian Hahn	9ce198d6ed	[Darwin] Respect -fno-unroll-loops during LTO. Currently -fno-unroll-loops is ignored when doing LTO on Darwin. This patch adds a new -lto-no-unroll-loops option to the LTO code generator and forwards it to the linker if -fno-unroll-loops is passed. Reviewers: thegameg, steven_wu Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D76916	2020-03-27 22:19:03 +00:00
Douglas Yung	5db37f3bca	Make PS4 use -fno-use-init-array only as the ABI does not support .init_array. Reviewed by Paul Robinson	2020-03-26 15:45:40 -07:00
Matt Arsenault	40076c14fe	CUDA: Fix broken test run lines There was a misisng space between the -march and --cuda-gpu-arch arguments, so --cuda-gpu-arch wasn't actually being parsed. I'm not sure what the intent of the sm_10 run lines were, but they error as an unsupported architecture. Switch these to something else.	2020-03-26 12:19:34 -04:00
Ties Stuij	71ae267d1f	[PATCH] [ARM] ARMv8.6-a command-line + BFloat16 Asm Support Summary: This patch introduces command-line support for the Armv8.6-a architecture and assembly support for BFloat16. Details can be found https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a in addition to the GCC patch for the 8..6-a CLI: https://gcc.gnu.org/legacy-ml/gcc-patches/2019-11/msg02647.html In detail this patch - march options for armv8.6-a - BFloat16 assembly This is part of a patch series, starting with command-line and Bfloat16 assembly support. The subsequent patches will upstream intrinsics support for BFloat16, followed by Matrix Multiplication and the remaining Virtualization features of the armv8.6-a architecture. Based on work by: - labrinea - MarkMurrayARM - Luke Cheeseman - Javed Asbar - Mikhail Maltsev - Luke Geeson Reviewers: SjoerdMeijer, craig.topper, rjmccall, jfb, LukeGeeson Reviewed By: SjoerdMeijer Subscribers: stuij, kristof.beyls, hiraditya, dexonsmith, danielkiss, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D76062	2020-03-26 09:17:20 +00:00
Michael Liao	4f4e68799f	[test][clang][driver] Add required features. - to avoid false alarms on builds without that features.	2020-03-24 17:08:21 -04:00
Alexander Belyaev	10bd8422d0	[ARM][CMSE] Fix clang/test/Driver/save-temps.c test. Differential Revision: https://reviews.llvm.org/D76703	2020-03-24 15:24:14 +01:00

1 2 3 4 5 ...

4152 Commits