llvm-project

Commit Graph

Author	SHA1	Message	Date
Arthur Eubanks	0c509dbc7e	[NewPM] Add options to PrintPassInstrumentation To bring D99599's implementation in line with the existing PrintPassInstrumentation, and to fix a FIXME, add more customizability to PrintPassInstrumentation. Introduce three new options. The first takes over the existing "-debug-pass-manager-verbose" cl::opt. The second and third option are specific to -fdebug-pass-structure. They allow indentation, and also don't print analysis queries. To avoid more golden file tests than necessary, prune down the -fdebug-pass-structure tests. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D102196	2021-05-18 20:59:35 -07:00
Mike Rice	ff99fdf63f	[OpenMP] Stabilize OpenMP/parallel_for_codegen.cpp test (NFC) Revert recent commit to require x86-registered-target (`e4b790c5e3`). Remove -O1 from the run lines so they are less dependent on backend passes. Update the CHECK6 and CHECK10 lines with script. Differential Revision: https://reviews.llvm.org/D102720	2021-05-18 16:07:22 -07:00
Fangrui Song	2919222d80	[Driver] Delete -mimplicit-it= This is a GNU as and Clang cc1as option, not a GCC option. Users should specify `-Wa,-mimplicit-it=` instead. Note: mixing the -m option and the -Wa, option doesn't work `-Wa,-mimplicit-it=never -mimplicit-it=always` => `clang (LLVM option parsing): for the --arm-implicit-it option: may only occur zero or one times!` Reviewed By: nickdesaulniers, raj.khem Differential Revision: https://reviews.llvm.org/D102568	2021-05-18 10:57:24 -07:00
Aaron Ballman	ccbac06a07	Speculatively fix failing tests from `6381664580` This was causing some Mac-specific build failures: http://45.33.8.238/macm1/9739/step_7.txt http://45.33.8.238/mac/31615/step_7.txt As best I can tell with psychic debugging, the /Users/blah path to the source file is being treated as a macro undef with the clang-cl driver. This splits the filename off explicitly so hopefully the rest of the command line arguments will be read properly.	2021-05-18 12:44:58 -04:00
Sergey Dmitriev	8998a8aa97	[clang-offload-bundler] Add sections and set section flags using one llvm-objcopy invocation llvm-objcopy has been changed to support adding a section and updating section flags in one run (D90438), so we can now change clang-offload-bundler to run llvm-objcopy tool only once when creating fat object. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D102670	2021-05-18 08:44:41 -07:00
Aaron Ballman	6381664580	Introduce SYCL 2020 mode Currently, we have support for SYCL 1.2.1 (also known as SYCL 2017). This patch introduces the start of support for SYCL 2020 mode, which is the latest SYCL standard available at (https://www.khronos.org/registry/SYCL/specs/sycl-2020/html/sycl-2020.html). This sets the default SYCL to be 2020 in the driver, and introduces the notion of a "default" version (set to 2020) when cc1 is in SYCL mode but there was no explicit -sycl-std= specified on the command line.	2021-05-18 10:34:14 -04:00
Jinsong Ji	7d6449322e	[DebugInfo][test] Check specific func name to ignore codegen differences We use `CHECK-LABEL: define` to divide input stream into functions, this works well on most platforms. But there are cases that some platforms (eg: AIX) may have different codegen , especially for global constructor and descructors. On AIX, the codegen will have two more functions: __dtor_b, __finalize_b, which will fail the test. The fix is to use specific function name so that we can safely ignore those unrelated codegen differences. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102654	2021-05-18 14:03:27 +00:00
Alexey Bader	2ab513cd3e	[SYCL] Enable `opencl_global_[host,device]` attributes for SYCL Differential Revision: https://reviews.llvm.org/D100396	2021-05-18 10:27:35 +03:00
Ole Strohm	642d2f000b	[OpenCL] Fix initialization of __constant constructors without arguments This fixes the initialization of objects in the __constant address space that occurs when declaring the object. Fixes part of PR42566 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D102248	2021-05-18 10:59:53 +01:00
Ten Tzen	797ad70152	[Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 This patch is the Part-1 (FE Clang) implementation of HW Exception handling. This new feature adds the support of Hardware Exception for Microsoft Windows SEH (Structured Exception Handling). This is the first step of this project; only X86_64 target is enabled in this patch. Compiler options: For clang-cl.exe, the option is -EHa, the same as MSVC. For clang.exe, the extra option is -fasync-exceptions, plus -triple x86_64-windows -fexceptions and -fcxx-exceptions as usual. NOTE:: Without the -EHa or -fasync-exceptions, this patch is a NO-DIFF change. The rules for C code: For C-code, one way (MSVC approach) to achieve SEH -EHa semantic is to follow three rules: * First, no exception can move in or out of _try region., i.e., no "potential faulty instruction can be moved across _try boundary. * Second, the order of exceptions for instructions 'directly' under a _try must be preserved (not applied to those in callees). * Finally, global states (local/global/heap variables) that can be read outside of _try region must be updated in memory (not just in register) before the subsequent exception occurs. The impact to C++ code: Although SEH is a feature for C code, -EHa does have a profound effect on C++ side. When a C++ function (in the same compilation unit with option -EHa ) is called by a SEH C function, a hardware exception occurs in C++ code can also be handled properly by an upstream SEH _try-handler or a C++ catch(...). As such, when that happens in the middle of an object's life scope, the dtor must be invoked the same way as C++ Synchronous Exception during unwinding process. Design: A natural way to achieve the rules above in LLVM today is to allow an EH edge added on memory/computation instruction (previous iload/istore idea) so that exception path is modeled in Flow graph preciously. However, tracking every single memory instruction and potential faulty instruction can create many Invokes, complicate flow graph and possibly result in negative performance impact for downstream optimization and code generation. Making all optimizations be aware of the new semantic is also substantial. This design does not intend to model exception path at instruction level. Instead, the proposed design tracks and reports EH state at BLOCK-level to reduce the complexity of flow graph and minimize the performance-impact on CPP code under -EHa option. One key element of this design is the ability to compute State number at block-level. Our algorithm is based on the following rationales: A _try scope is always a SEME (Single Entry Multiple Exits) region as jumping into a _try is not allowed. The single entry must start with a seh_try_begin() invoke with a correct State number that is the initial state of the SEME. Through control-flow, state number is propagated into all blocks. Side exits marked by seh_try_end() will unwind to parent state based on existing SEHUnwindMap[]. Note side exits can ONLY jump into parent scopes (lower state number). Thus, when a block succeeds various states from its predecessors, the lowest State triumphs others. If some exits flow to unreachable, propagation on those paths terminate, not affecting remaining blocks. For CPP code, object lifetime region is usually a SEME as SEH _try. However there is one rare exception: jumping into a lifetime that has Dtor but has no Ctor is warned, but allowed: Warning: jump bypasses variable with a non-trivial destructor In that case, the region is actually a MEME (multiple entry multiple exits). Our solution is to inject a eha_scope_begin() invoke in the side entry block to ensure a correct State. Implementation: Part-1: Clang implementation described below. Two intrinsic are created to track CPP object scopes; eha_scope_begin() and eha_scope_end(). _scope_begin() is immediately added after ctor() is called and EHStack is pushed. So it must be an invoke, not a call. With that it's also guaranteed an EH-cleanup-pad is created regardless whether there exists a call in this scope. _scope_end is added before dtor(). These two intrinsics make the computation of Block-State possible in downstream code gen pass, even in the presence of ctor/dtor inlining. Two intrinsic, seh_try_begin() and seh_try_end(), are added for C-code to mark _try boundary and to prevent from exceptions being moved across _try boundary. All memory instructions inside a _try are considered as 'volatile' to assure 2nd and 3rd rules for C-code above. This is a little sub-optimized. But it's acceptable as the amount of code directly under _try is very small. Part-2 (will be in Part-2 patch): LLVM implementation described below. For both C++ & C-code, the state of each block is computed at the same place in BE (WinEHPreparing pass) where all other EH tables/maps are calculated. In addition to _scope_begin & _scope_end, the computation of block state also rely on the existing State tracking code (UnwindMap and InvokeStateMap). For both C++ & C-code, the state of each block with potential trap instruction is marked and reported in DAG Instruction Selection pass, the same place where the state for -EHsc (synchronous exceptions) is done. If the first instruction in a reported block scope can trap, a Nop is injected before this instruction. This nop is needed to accommodate LLVM Windows EH implementation, in which the address in IPToState table is offset by +1. (note the purpose of that is to ensure the return address of a call is in the same scope as the call address. The handler for catch(...) for -EHa must handle HW exception. So it is 'adjective' flag is reset (it cannot be IsStdDotDot (0x40) that only catches C++ exceptions). Suppress push/popTerminate() scope (from noexcept/noTHrow) so that HW exceptions can be passed through. Original llvm-dev [RFC] discussions can be found in these two threads below: https://lists.llvm.org/pipermail/llvm-dev/2020-March/140541.html https://lists.llvm.org/pipermail/llvm-dev/2020-April/141338.html Differential Revision: https://reviews.llvm.org/D80344/new/	2021-05-17 22:42:17 -07:00
Jinsong Ji	82b5281247	[Driver][test] Don't assume integrated-as The tests of fdebug-compilation-dir and -ffile-compilation-dir for `-x assembler` are assuming integrated-as. If the platform set the no-itegrated-as by default (eg: AIX for now), then this test will fail. Add the -integrated-as to aviod relying on the platform defaults. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D102647	2021-05-17 20:24:21 +00:00
Eli Friedman	698568b74c	[clang CodeGen] Don't crash on large atomic function parameter. I wouldn't recommend writing code like the testcase; a function parameter isn't atomic, so using an atomic type doesn't really make sense. But it's valid, so clang shouldn't crash on it. The code was assuming hasAggregateEvaluationKind(Ty) implies Ty is a RecordType, which isn't true. Just use isRecordType() instead. Differential Revision: https://reviews.llvm.org/D102015	2021-05-17 13:18:23 -07:00
Arthur Eubanks	3a0b6dc3e8	Revert "[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable" This reverts commit `14dfb3831c`. More false positives, see D100581.	2021-05-17 12:16:10 -07:00
Nick Desaulniers	0f41778919	[AArch64] Support customizing stack protector guard Follow up to D88631 but for aarch64; the Linux kernel uses the command line flags: 1. -mstack-protector-guard=sysreg 2. -mstack-protector-guard-reg=sp_el0 3. -mstack-protector-guard-offset=0 to use the system register sp_el0 for the stack canary, enabling the kernel to have a unique stack canary per task (like a thread, but not limited to userspace as the kernel can preempt itself). Address pr/47341 for aarch64. Fixes: https://github.com/ClangBuiltLinux/linux/issues/289 Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: xiangzhangllvm, DavidSpickett, dmgreen Differential Revision: https://reviews.llvm.org/D100919	2021-05-17 11:49:22 -07:00
Abbas Sabra	ebcf030efc	[analyzer] Engine: fix crash with SEH __leave keyword MSVC has a `try-except` statement. This statement could containt a `__leave` keyword, which is similar to `goto` to the end of the try block. The semantic of this keyword is not implemented. We should at least parse such code without crashing. https://docs.microsoft.com/en-us/cpp/cpp/try-except-statement?view=msvc-160 Patch By: AbbasSabra! Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D102280	2021-05-17 20:10:26 +02:00
Michael Benfield	14dfb3831c	[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable These are intended to mimic warnings available in gcc. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D100581	2021-05-17 11:02:26 -07:00
Yaxun (Sam) Liu	18cb17ce4c	[HIP] Fix spack detection Missing or duplicate spack package should not cause error, since users may only installed llvm/clang package, or users may installed duplicate HIP package but will use environment variable or compiler option to choose HIP path. The message about missing or duplicate spack package is informational, therefore should be emitted only when -v is specified. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102556	2021-05-17 13:24:05 -04:00
Steffen Larsen	f226e28a88	[Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX redux.sync instructions Adds NVPTX builtins and intrinsics for the CUDA PTX `redux.sync` instructions for `sm_80` architecture or newer. PTX ISA description of `redux.sync`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-redux-sync Authored-by: Steffen Larsen <steffen.larsen@codeplay.com> Differential Revision: https://reviews.llvm.org/D100124	2021-05-17 09:46:59 -07:00
Stuart Adams	02c2468864	[Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX cp.async instructions Adds NVPTX builtins and intrinsics for the CUDA PTX `cp.async` instructions for `sm_80` architecture or newer. PTX ISA description of `cp.async`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-asynchronous-copy https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-cp-async-mbarrier-arrive Authored-by: Stuart Adams <stuart.adams@codeplay.com> Co-Authored-by: Alexander Johnston <alexander@codeplay.com> Differential Revision: https://reviews.llvm.org/D100394	2021-05-17 09:46:59 -07:00
Xiangling Liao	e0921655b1	[AIX] Implement AIX special bitfield related alignment rules 1.[bool, char, short] bitfields have the same alignment as unsigned int 2.Adjust alignment on typedef field decls/honor align attribute 3.Fix alignment for scoped enum class 4.Long long bitfield has 4bytes alignment and StorageUnitSize under 32 bit compile mode Differential Revision: https://reviews.llvm.org/D87029	2021-05-17 11:30:29 -04:00
Sylvain Audi	6052a8a535	[clang] In DependencyCollector on Windows, ignore case and separators when discarding duplicate dependency file paths. This patch removes duplicates also encountered in the output of clang-scan-deps when one same header file is encountered with different casing and/or different separators ('/' vs '\'). The case of separators can appear when the same file is included externally by `#include <folder/file.h>` whereas a file from the same folder does `#include "file.h"` Under Windows, clang computes the paths using '/' from the include directive, the `\` from the -I options, and the concatenations use the native `\`, leading to internal paths containing a mix of both separators. Differential Revision: https://reviews.llvm.org/D102339	2021-05-17 10:32:52 -04:00
Irina Dobrescu	50511df32e	[AArch64] Lower bitreverse in ISel Adding lowering support for bitreverse. Previously, lowering bitreverse would expand it into a series of other instructions. This patch makes it so this produces a single rbit instruction instead. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D102397	2021-05-17 13:35:27 +01:00
Raphael Isemann	888ce70af2	[DebugInfo] Fix DWARF expressions for __block vars that are not on the heap `__block` variables used to be always stored on the head instead of stack. D51564 allowed `__block` variables to the stored on the stack like normal variablesif they not captured by any escaping block, but the debug-info generation code wasn't made aware of it so we still unconditionally emit DWARF expressions pointing to the heap. This patch makes CGDebugInfo use the `EscapingByref` introduced in D51564 that tracks whether the `__block` variable is actually on the heap. If it's stored on the stack instead we just use the debug info we would generate for normal variables instead. Reviewed By: ahatanak, aprantl Differential Revision: https://reviews.llvm.org/D99946	2021-05-17 14:32:07 +02:00
Ole Strohm	66b112d624	[OpenCL] Fix reinterpret_cast of vectors Fixes issues with vectors in reinterpret_cast in C++ for OpenCL and adds tests to make sure they both pass without errors and generate the correct code. Fixes: PR47977 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D101519	2021-05-17 12:42:21 +01:00
Jan Svoboda	ccd485e554	Revert "[clang][deps] Support inferred modules" This reverts commit `1d9e8e13` Something is off with handling Windows paths: http://45.33.8.238/win/38908/step_7.txt Most likely introduced in https://reviews.llvm.org/D102491	2021-05-17 13:27:14 +02:00
Anastasia Stulova	3549466ac0	[OpenCL] Drop pragma handling for extension types/decls. Drop non-conformant extension pragma implementation as it does not properly disable anything and therefore enabling non-disabled logic has no meaning. This simplifies clang code and user interface to the extension functionality. With this patch extension pragma 'begin'/'end' and 'enable'/'disable' are only accepted for backward compatibility and no longer have any default behavior. Differential Revision: https://reviews.llvm.org/D101043	2021-05-17 12:09:43 +01:00
Michael Spencer	1d9e8e13dd	[clang][deps] Support inferred modules This patch adds support for inferred modules to the dependency scanner. Effectively a cherry-pick of https://github.com/apple/llvm-project/pull/699 authored by @Bigcheese with libclang and other changes omitted. Contains following changes: 1. [Clang][ScanDeps] Ignore __inferred_module.map dependency. * This shows up with inferred modules, but it doesn't exist on disk, so don't report it as a dependency. 2. [Clang][ScanDeps] Use the module map a module was inferred from for inferred modules. Also includes a smoke test that uses clang-scan-deps output to perform an explicit build. There's no intention to duplicate whatever `test/Modules` contains, just to verify the produced command-line does "work" (with very loose definition of work). Split from D100934. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D102495	2021-05-17 11:41:54 +02:00
Michael Spencer	d3676d4b66	[clang][modules] Build inferred modules This patch enables explicitly building inferred modules. Effectively a cherry-pick of https://github.com/apple/llvm-project/pull/699 authored by @Bigcheese with libclang and dependency scanner changes omitted. Contains the following changes: 1. [Clang] Fix the header paths in clang::Module for inferred modules. * The UmbrellaAsWritten and NameAsWritten fields in clang::Module are a lie for framework modules. For those they actually are the path to the header or umbrella relative to the clang::Module::Directory. * The exception to this case is for inferred modules. Here it actually is the name as written, because we print out the module and read it back in when implicitly building modules. This causes a problem when explicitly building an inferred module, as we skip the printing out step. * In order to fix this issue this patch adds a new field for the path we want to use in getInputBufferForModule. It also makes NameAsWritten actually be the name written in the module map file (or that would be, in the case of an inferred module). 2. [Clang] Allow explicitly building an inferred module. * Building the actual module still fails, but make sure it fails for the right reason. Split from D100934. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D102491	2021-05-17 10:40:51 +02:00
Pan, Tao	976a3e5f61	[SelectionDAG] Make fast and linearize visible by clang -pre-RA-sched ScheduleDAGFast.cpp is compiled to object file, but the ScheduleDAGFast object file isn't linked into clang executable file as no symbol is referred by outside. Add calling to createXxx of ScheduleDAGFast.cpp, then the ScheduleDAGFast object file will be linked into clang executable file. The static RegisterScheduler will register scheduler fast and linearize at clang boot time. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D101601	2021-05-17 11:25:15 +08:00
Alessandro Decina	833e9b2ea7	[BPF] add support for 32 bit registers in inline asm Add "w" constraint type which allows selecting 32 bit registers. 32 bit registers were added in https://reviews.llvm.org/rGca31c3bb3ff149850b664838fbbc7d40ce571879. Differential Revision: https://reviews.llvm.org/D102118	2021-05-16 11:01:47 -07:00
Florian Hahn	803c52d0db	Recommit "[Clang,Driver] Add -fveclib=Darwin_libsystem_m support." Recommit D102489, with the test case requiring the AArch64 backend. This reverts the revert `59b419adc6`.	2021-05-16 18:49:53 +01:00
Danila Kutenin	d29f7f1a7b	[clang] Fix ternary operator in the second for loop statement Fix ternary operator in for loop argument, it was by mistake not set as CanBeForRangeDecl and led to incorrect codegen. It fixes https://bugs.llvm.org/show_bug.cgi?id=50038. I don't have commit rights. Danila Kutenin. kutdanila@yandex.ru Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D102502	2021-05-16 10:42:52 -07:00
Nikita Popov	656296b1c2	Reapply [CaptureTracking] Do not check domination Reapply after adjusting the synchronized.m test case, where the TODO is now resolved. The pointer is only captured on the exception handling path. ----- For the CapturesBefore tracker, it is sufficient to check that I can not reach BeforeHere. This does not necessarily require that BeforeHere dominates I, it can also occur if the capture happens on an entirely disjoint path. This change was previously accepted in D90688, but had to be reverted due to large compile-time impact in some cases: It increases the number of reachability queries that are performed. After recent changes, the compile-time impact is largely mitigated, so I'm reapplying this patch. The remaining compile-time impact is largely proportional to changes in code-size.	2021-05-16 15:46:31 +02:00
Saurabh Jha	ecb235d940	[Matrix] Implement static cast for matrix types This patch implements static casts for matrix types. This patch finishes all the work needed for https://bugs.llvm.org/show_bug.cgi?id=47141 Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D102125	2021-05-16 08:09:12 +01:00
Hubert Tong	9ae529d0db	[AIX][clang-repl][test] Mark unsupported pending XCOFF64 integrated-as This patch replaces the `powerpc64` token with the `system-aix` one in the UNSUPPORTED line of a test. The `powerpc64` token was originally added temporarily in `71a0609a2b`. If AIX uses integrated-as by default and it works both for 32-bit and 64-bit objects, then the issues encountered so far (see comments in D96033) would be mostly solved. As it is, marking the test as expected-to-fail (as opposed to unsupported) on AIX might cause more trouble in the form of 32-bit versus 64-bit differences. I am not aware of other situations where LIT tests are dependent on whether the LLVM build is 64-bit or 32-bit. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D102560	2021-05-15 22:41:45 -04:00
Pengxuan Zheng	c9b36a041f	Support GCC's -fstack-usage flag This patch adds support for GCC's -fstack-usage flag. With this flag, a stack usage file (i.e., .su file) is generated for each input source file. The format of the stack usage file is also similar to what is used by GCC. For each function defined in the source file, a line with the following information is produced in the .su file. <source_file>:<line_number>:<function_name> <size_in_byte> <static/dynamic> "Static" means that the function's frame size is static and the size info is an accurate reflection of the frame size. While "dynamic" means the function's frame size can only be determined at run-time because the function manipulates the stack dynamically (e.g., due to variable size objects). The size info only reflects the size of the fixed size frame objects in this case and therefore is not a reliable measure of the total frame size. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100509	2021-05-15 10:22:49 -07:00
Douglas Yung	59b419adc6	Revert "[Clang,Driver] Add -fveclib=Darwin_libsystem_m support." This reverts commit `187a14e1f3`. The test added in this commit is failing on several build bots: https://lab.llvm.org/buildbot/#/builders/139/builds/4059 https://lab.llvm.org/buildbot/#/builders/132/builds/5605	2021-05-14 22:39:12 -07:00
Florian Hahn	187a14e1f3	[Clang,Driver] Add -fveclib=Darwin_libsystem_m support. Support for Darwin's libsystem_m's vector functions has been added to LLVM in `93a9a8a8d9`. This patch adds support for -fveclib=Darwin_libsystem_m to Clang. Reviewed By: arphaman Differential Revision: https://reviews.llvm.org/D102489	2021-05-14 21:00:13 +01:00
Anastasia Stulova	769cc335e6	[OpenCL] Simplify use of C11 atomic types. Remove requirements on extension pragma in atomic types because it has not respected the spec wrt disabling types and hasn't been useful either. With this change, the developers can use atomic types from the extensions if they are supported without enabling the pragma just like the builtin functions This patch does not break backward compatibility since the extension pragma is still supported and it makes the behavior of the compiler less strict by accepting code without needless and inconsistent pragma statements. Differential Revision: https://reviews.llvm.org/D100976	2021-05-14 17:43:00 +01:00
Nemanja Ivanovic	7cd2833311	[PowerPC] Add vec_vupkhpx and vec_vupklpx for XL compatibility These are old names for these functions that XL still supports.	2021-05-14 08:02:00 -05:00
David Spickett	2db090a2eb	[llvm][AsmPrinter] Restore source location to register clobber warning Since `5de2d189e6` this particular warning hasn't had the location of the source file containing the inline assembly. Fix this by reporting via LLVMContext. Which means that we no longer have the "instantiated into assembly here" lines but they were going to point to the start of the inline asm string anyway. This message is already tested via IR in llvm. However we won't have the required location info there so I've added a C file test in clang to cover it. (though strictly, this is testing llvm code) Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D102244	2021-05-14 08:22:57 +00:00
Michael Kruse	83ff0ff463	[Clang][OpenMP] Allow unified_shared_memory for Pascal-generation GPUs. The Pascal architecture supports the page migration engine required for unified_shared_memory, as indicated by NVIDIA: * https://developer.nvidia.com/blog/unified-memory-cuda-beginners/ * https://developer.nvidia.com/blog/beyond-gpu-memory-limits-unified-memory-pascal/ * https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-requirements The limitation was introduced in D54493 which justified the cut-off by the requirement for unified addressing. However, Unified Virtual Addressing (UVA) is already available with sm20 (Fermi, Kepler, Maxwell): * https://docs.nvidia.com/cuda/gpudirect-rdma/index.html#basics-of-uva-cuda-memory-management Unified shared memory might even be possible with these, but with migration of entire allocations on kernel startup. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101595	2021-05-13 17:15:34 -05:00
Lang Hames	71a0609a2b	[clang-repl] Temporarily disable the execute.cpp test on ppc64. This test is failing on some builders (see [1]) with the following error: error: Added modules have incompatible data layouts: e-m:e-i64:64-n32:64-S128-v256:256:256-v512:512:512 (module) vs E-m:a-i64:64-n32:64-S128-v256:256:256-v512:512:512 (jit) The JIT layout is correct, but some IR module added to the JIT is using a little-endian layout instead. This commit disables the test on ppc64 until we can investigate further and fix the bug. [1] https://lab.llvm.org/staging/#/builders/126/builds/371	2021-05-13 14:39:12 -07:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Roman Lebedev	0d8f91d2a9	[NFC] Delete two newly-added test cases Failing on bots in unobvious ways.	2021-05-13 21:23:01 +03:00
Roman Lebedev	ecc4e9e8f4	[NFC] Try to fix CodeGenCXX/thunk-wrong-return-type.cpp test	2021-05-13 21:17:31 +03:00
cynecx	8ec9fd4839	Support unwinding from inline assembly I've taken the following steps to add unwinding support from inline assembly: 1) Add a new `unwind` "attribute" (like `sideeffect`) to the asm syntax: ``` invoke void asm sideeffect unwind "call thrower", "~{dirflag},~{fpsr},~{flags}"() to label %exit unwind label %uexit ``` 2.) Add Bitcode writing/reading support + LLVM-IR parsing. 3.) Emit EHLabels around inline assembly lowering (SelectionDAGBuilder + GlobalISel) when `InlineAsm::canThrow` is enabled. 4.) Tweak InstCombineCalls/InlineFunction pass to not mark inline assembly "calls" as nounwind. 5.) Add clang support by introducing a new clobber: "unwind", which lower to the `canThrow` being enabled. 6.) Don't allow unwinding callbr. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D95745	2021-05-13 19:13:03 +01:00
Roman Lebedev	9d3eb7885d	[NFC] Try to fix CodeGenCXX/thunk-wrong-this.cpp test	2021-05-13 21:10:14 +03:00
Roman Lebedev	16d0381841	Return "[CGCall] Annotate `this` argument with alignment" The original change was reverted because it was discovered that clang mishandles thunks, and they receive wrong attributes for their this/return types - the ones for the function they will call, not the ones they have. While i have tried to fix this in https://reviews.llvm.org/D100388 that patch has been up and stuck for a month now, with little signs of progress. So while it will be good to solve this for real, for now we can simply avoid introducing the bug, by not annotating this/return for thunks. This reverts commit `6270b3a1ea`, relanding `0aa0458f14`.	2021-05-13 20:33:14 +03:00
Roman Lebedev	a624cec56d	[Clang][Codegen] Do not annotate thunk's this/return types with align/deref/nonnull attrs As it was discovered in post-commit feedback for `0aa0458f14`, we handle thunks incorrectly, and end up annotating their this/return with attributes that are valid for their callees, not for thunks themselves. While it would be good to fix this properly, and keep annotating them on thunks, i've tried doing that in https://reviews.llvm.org/D100388 with little success, and the patch is stuck for a month now. So for now, as a stopgap measure, subj.	2021-05-13 20:33:08 +03:00

1 2 3 4 5 ...

43393 Commits