llvm-project

Commit Graph

Author	SHA1	Message	Date
Dave Airlie	1854db74c5	opencl-c.h: add 3.0 optional extension support for a few more bits These 3 are fairly simple, pipes, workgroups and subgroups. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105858	2021-08-07 09:25:00 +10:00
Justas Janickas	a5a2f05dcc	[C++4OpenCL] Introduces __remove_address_space utility This change provides a way to conveniently declare types that have address space qualifiers removed. Since OpenCL adds address spaces implicitly even when they are not specified in source, it is useful to allow deriving address space unqualified types. Fixes llvm.org/PR45326 Differential Revision: https://reviews.llvm.org/D106785	2021-08-06 10:40:22 +01:00
Jon Chesterfield	509854b69c	[clang] Replace asm with __asm__ in cuda header Asm is a gnu extension for C, so at present -fopenmp -std=c99 and similar fail to compile on nvptx, bug 51344 Changing to `__asm__` or `__asm` works for openmp, all three appear to work for cuda. Suggesting `__asm__` here as `__asm` is used by MSVC with different syntax, so this should make for better error diagnostics if the header is passed to a compiler other than clang. Reviewed By: tra, emankov Differential Revision: https://reviews.llvm.org/D107492	2021-08-05 18:46:57 +01:00
Dave Airlie	14cb67862a	[OpenCL] allow generic address and non-generic defs for CL3.0 This allows both sets of definitions to exist on CL 3.0 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D107318	2021-08-05 07:32:45 +10:00
Pushpinder Singh	f3eb5f900d	[AMDGPU][OpenMP] Wrap amdgcn declare variant inside ifdef This fixes the issue https://bugs.llvm.org/show_bug.cgi?id=51337 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D107468	2021-08-04 15:24:46 +00:00
Pushpinder Singh	713a5d12cd	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-08-02 14:38:52 +00:00
Hans Wennborg	12dc13b73c	prfchwintrin.h: Make _m_prefetchw take a pointer to volatile (PR49124) For some reason, Microsoft declares _m_prefetch to take a const void*, but _m_prefetchw to take a /volatile/ const void*. Do the same for compatibility. Differential revision: https://reviews.llvm.org/D106790	2021-08-02 15:16:04 +02:00
Jon Chesterfield	7f97ddaf8a	Revert "[OpenMP][AMDGCN] Initial math headers support" Broke nvptx compilation on files including <complex> This reverts commit `12da97ea10`.	2021-07-30 22:07:00 +01:00
Nemanja Ivanovic	9019b55b60	[PowerPC] Fix byte ordering of ld/st with length on BE The builtins vec_xl_len_r and vec_xst_len_r actually use the wrong side of the vector on big endian Power9 systems. We never spotted this before because there was no such thing as a big endian distro that supported Power9. Now we have AIX and the elements are in the wrong part of the vector. This just fixes it so the elements are loaded to and stored from the right side of the vector.	2021-07-30 14:37:24 -05:00
Pushpinder Singh	12da97ea10	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-07-30 14:52:41 +00:00
Dave Airlie	3c7d2f1b67	[OpenCL] opencl-c.h: add CL 3.0 non-generic address space atomics CL 2.0 introduced atomics and generic address space, so there was only one set of APIs for doing atomics. However, since CL 3.0 makes generic address space optional, there have to be new sets of atomic interfaces to handle those cases. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D106778	2021-07-30 14:46:47 +10:00
Thomas Lively	33786576fd	[WebAssembly] Codegen for extmul SIMD instructions Replace the clang builtins and LLVM intrinsics for the SIMD extmul instructions with normal codegen patterns. Differential Revision: https://reviews.llvm.org/D106724	2021-07-27 08:41:30 -07:00
Anastasia Stulova	e5f47eedeb	[OpenCL] NULL redefined as nullptr in C++ mode. Redefines NULL as nullptr instead of ((void*)0) in C++ for OpenCL. Such internal representation of NULL provides compatibility with C++11 and later language standards. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D105987	2021-07-27 16:33:50 +01:00
Nemanja Ivanovic	1c50a5da36	[PowerPC] Implement partial vector ld/st builtins for XL compatibility XL provides functions __vec_ldrmb/__vec_strmb for loading/storing a sequence of 1 to 16 bytes in big endian order, right justified in the vector register (regardless of target endianness). This is equivalent to vec_xl_len_r/vec_xst_len_r which are only available on Power9. This patch simply uses the Power9 functions when compiled for Power9, but provides a more general implementation for Power8. Differential revision: https://reviews.llvm.org/D106757	2021-07-26 13:19:52 -05:00
Qiu Chaofan	240dde9482	[PowerPC] Change altivec indexed load/store builtins argument type This patch changes the index argument of lvxl?/lve[bhw]x and stvxl?/stve[bhw]x builtins from int to long, because with int an extra extsw is always generated on 64-bit subtargets, which is incorrect. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D106530	2021-07-27 00:26:50 +08:00
Ulrich Weigand	8cd8120a7b	[SystemZ] Add support for new cpu architecture - arch14 This patch adds support for the next-generation arch14 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch14 as host processor. - Assembler/disassembler support for new instructions. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10304. Note: No currently available Z system supports the arch14 architecture. Once new systems become available, the official system name will be added as supported -march name.	2021-07-26 16:57:28 +02:00
Dave Airlie	9451403c5f	[OPENCL] opencl-c.h: add initial CL 3.0 conditionals for atomic operations. This adds the optional wrappers around things, however this isn't sufficient yet for CL 3.0 without generic address space, I've got one more additional patch to add all those APIs, but this is an easier to review precursor. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D106111	2021-07-26 11:06:33 +10:00
Thomas Lively	85157c0079	[WebAssembly] Codegen for pmin and pmax Replace the clang builtins and LLVM intrinsics for {f32x4,f64x2}.{pmin,pmax} with standard codegen patterns. Since wasm_simd128.h uses an integer vector as the standard single vector type, the IR for the pmin and pmax intrinsic functions contains bitcasts that would not be there otherwise. Add extra codegen patterns that can still select the pmin and pmax instructions in the presence of these bitcasts. Differential Revision: https://reviews.llvm.org/D106612	2021-07-23 14:49:21 -07:00
Anastasia Stulova	5c63bf3abd	[OpenCL] Add NULL to standards prior to v2.0. NULL was undefined in OpenCL prior to version 2.0. However, the language specification states that "macro names defined by the C99 specification but not currently supported by OpenCL are reserved for future use". Therefore, application developers cannot redefine NULL. The change is supposed to resolve inconsistency between language versions. Currently there is no apparent reason why NULL should be kept undefined. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D105988	2021-07-23 11:54:36 +01:00
Sven van Haastregt	989bedec7a	[OpenCL] Add cl_khr_integer_dot_product Add the builtins defined by Section 42 "Integer dot product" in the OpenCL Extension Specification. Differential Revision: https://reviews.llvm.org/D106434	2021-07-23 10:10:16 +01:00
namazso	91bc85b1eb	[MS] Preserve base register %esi around movs[bwl] fix for behavior reported in https://bugs.llvm.org/show_bug.cgi?id=51100 workaround for root cause https://bugs.llvm.org/show_bug.cgi?id=16830 similar to https://reviews.llvm.org/D101338 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D106210	2021-07-23 16:28:32 +08:00
Aaron En Ye Shi	9ce931bd71	[HIP] Fix no matching constructor for init of shared_ptr and malloc Allow standard header versions of malloc and free to be defined before introducing the device versions. Fixes: SWDEV-295901 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D106463	2021-07-22 14:32:41 +00:00
Thomas Lively	db7efcab7d	[WebAssembly] Remove clang builtins for extract_lane and replace_lane These builtins were added to capture the fact that the underlying Wasm instructions return i32s and implicitly sign or zero extend the extracted lanes in the case of the i8x16 and i16x8 variants. But we do sufficient optimizations during code gen that these low-level details do not need to be exposed to users. This commit replaces the use of the builtins in wasm_simd128.h with normal target-independent vector code. As a result, we can switch the relevant intrinsics to use functions rather than macros and can use more user-friendly return types rather than trying to precisely expose the underlying Wasm types. Note, however, that the generated LLVM IR is no different after this change. Differential Revision: https://reviews.llvm.org/D106500	2021-07-21 16:11:00 -07:00
Yaxun (Sam) Liu	db5f100fe4	[HIP] Remove workaround in __clang_hip_runtime_wrapper.h Remove the workaround for -fopenmp in __clang_hip_runtime_wrapper.h since it causes device functions in HIP wrapper headers to be disabled when compiling a HIP program with -fopenmp. Reviewed by: Aaron Enye Shi, Jon Chesterfield Differential Revision: https://reviews.llvm.org/D106070	2021-07-21 15:16:28 -04:00
Jon Chesterfield	d71062fbda	Revert "[OpenMP][AMDGCN] Initial math headers support" This reverts commit `968899ad9c`.	2021-07-21 17:35:40 +01:00
Pushpinder Singh	968899ad9c	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-07-21 16:15:39 +01:00
Sven van Haastregt	724f0e2abb	[OpenCL] Add cl_khr_extended_bit_ops Add the builtins defined by Section 40 "Extended Bit Operations" in the OpenCL Extension Specification. Differential Revision: https://reviews.llvm.org/D106267	2021-07-21 10:01:19 +01:00
Jon Chesterfield	3e649f8ef1	[openmp][nfc] Simplify macros guarding math complex headers The `__CUDA__` macro is already defined for openmp/nvptx and is not used by `__clang_cuda_complex_builtins.h`, so dropping that macro slightly simplifies nvptx and avoids defining it on amdgcn (where it is likely to be harmful). Also dropped a cplusplus test from a C++ header as compilation will have failed on cmath earlier if it was included from C. Reviewed By: jdoerfert, fodinabor Differential Revision: https://reviews.llvm.org/D105221	2021-07-18 23:30:35 +01:00
Stefan Pintilie	0bf4b81d57	[Clang] Add an empty builtins.h file. On PowerPC, some legacy compilers included a number of builtins in a builtins.h header file. While this header file is not required to hold builtins for clang, some legacy code does try to include this file, so this patch provides an empty version of it. Differential Revision: https://reviews.llvm.org/D106065	2021-07-16 12:50:04 -05:00
Dave Airlie	de79ba9f9a	[OpenCL] opencl-c.h: CL3.0 generic address space This is one of the easier pieces of adding CL3.0 support. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105526	2021-07-15 10:51:04 +10:00
Dave Airlie	090f007e34	[OpenCL][NFC] opencl-c.h: reorder atomic operations This just reorders the atomics, it doesn't change anything except their layout in the header. This is a prep patch for adding some conditionals around these for CL3.0 but that patch is much easier to review if all the atomic operations are grouped together like this. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105601	2021-07-15 10:48:44 +10:00
Thomas Lively	4a4229f70f	[WebAssembly] Codegen for v128.storeX_lane instructions Replace the experimental clang builtins and LLVM intrinsics for these instructions with normal codegen patterns. Resolves PR50435. Differential Revision: https://reviews.llvm.org/D106019	2021-07-14 16:15:25 -07:00
Thomas Lively	970e090010	[WebAssembly] Codegen for v128.loadX_lane instructions Replace the experimental clang builtin and LLVM intrinsics for these instructions with normal codegen patterns. Resolves PR50433. Differential Revision: https://reviews.llvm.org/D105950	2021-07-14 11:31:53 -07:00
Thomas Lively	cbabfc63b1	[WebAssembly] Custom combines for f32x4.demote_zero_f64x2 Replace the clang builtin function and LLVM intrinsic for f32x4.demote_zero_f64x2 with combines from normal SDNodes. Also add missing combines for i32x4.trunc_sat_zero_f64x2_{s,u}, which share the same pattern. Differential Revision: https://reviews.llvm.org/D105755	2021-07-12 10:32:18 -07:00
Bardia Mahjour	2071ce9d45	[Altivec] Use signed comparison for vec_all_* and vec_any_* interfaces We are currently being inconsistent in using signed vs unsigned comparisons for vec_all_* and vec_any_* interfaces that use vector bool types. For example we use signed comparison for vec_all_ge(vector signed char, vector bool char) but unsigned comparison for when the arguments are swapped. GCC and XL use signed comparison instead. This patch makes clang consistent with itself and with XL and GCC. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105666	2021-07-12 11:41:16 -04:00
Nemanja Ivanovic	84e429693f	[PowerPC] Fix rounding mode for vec_round in altivec.h The function is supposed to be the equivalent of rint() (as in round to nearest, ties to even) rather than round() (round to nearest, ties away from zero). In fact, the instruction we emit without VSX is vrfin which is correct. However, with VSX we emit xvrspi which is the equivalent of round() and therefore incorrect. Since there is no equivalent VSX instruction, simply use vrfin regardless of availability of VSX.	2021-07-12 06:11:27 -05:00
Nemanja Ivanovic	41ce5ec5f6	[PowerPC] Remove unnecessary 64-bit guards from altivec.h A number of functions in the header have guards for 64-bit only that were presumably added as some of the functions in the blocks use vector __int128 which is only available in 64-bit mode. A more appropriate guard (__SIZEOF_INT128__) has been added for those functions since, making the 64-bit guards redundant. This patch removes those guards as they inadvertently guard code that uses vector long long which does not actually require 64-bit mode.	2021-07-12 04:59:00 -05:00
Thomas Lively	e5220104d0	[WebAssembly] Custom combines for f64x2.promote_low_f32x4 Replace the clang builtin function and LLVM intrinsic previously used to select the f64x2.promote_low_f32x4 instruction with custom combines from standard SelectionDAG nodes. Implement the new combines to share code with the similar combines for f64x2.convert_low_i32x4_{s,u}. Resolves PR50232. Differential Revision: https://reviews.llvm.org/D105675	2021-07-09 18:59:29 -07:00
Aaron En Ye Shi	ccb10266f5	[HIP] Move std headers after device malloc/free Set the device malloc and free functions as weak, and move the std headers after device malloc/free to avoid issues with std malloc/free. Fixes: SWDEV-293590 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D105707	2021-07-09 21:20:16 +00:00
Joachim Meyer	5d689cf2a6	[NFC][CUDA] Fix order of round(f) definition in __clang_cuda_math.h for non-LP64. This broke ARM builds e.g.: https://lab.llvm.org/buildbot/#/builders/187/builds/212	2021-07-02 21:55:48 +02:00
Brian Cain	28b01c59c9	[hexagon] Add {hvx,}hexagon_{protos,circ_brev...} Add definitions for Hexagon, Hexagon circular/bit-reverse and HVX intrinsics.	2021-06-30 22:58:56 -05:00
Xiang1 Zhang	6d234a6908	[X86] Zero some outputs of Key Locker intrinsics in error case Reviewed By: WangPengfei Differential Revision: https://reviews.llvm.org/D104766	2021-06-29 13:35:40 +08:00
Nemanja Ivanovic	ef906573a1	[PowerPC] Fix vec_add for 64-bit on pre-Power7 subtargets The shift of the carry was actually incorrect.	2021-06-24 18:42:44 -05:00
Ethan Stewart	5dfdc1812d	[OpenMP][AMDGCN] Apply fix for isnan, isinf and isfinite for amdgcn. This fixes issues with various return types (bool/int) and was already in place for nvptx headers; it is adjusted here to work for amdgcn. This does not affect HIP, as the change is guarded with OPENMP_AMDGCN. Similar to D85879. Reviewed By: jdoerfert, JonChesterfield, yaxunl Differential Revision: https://reviews.llvm.org/D104677	2021-06-23 15:26:09 +01:00
Yaxun (Sam) Liu	186f2ac612	[HIP] Add support functions for C++ polymorphic types Add runtime functions to detect invalid calls to pure or deleted virtual functions. Patch by: Siu Chi Chan Reviewed by: Yaxun Liu Differential Revision: https://reviews.llvm.org/D104392	2021-06-21 11:41:07 -04:00
Bing1 Yu	56d5c46b49	[X86] Support __tile_stream_loadd intrinsic for new AMX interface Adding support for __tile_stream_loadd intrinsic. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D103784	2021-06-11 17:28:43 +08:00
Sven van Haastregt	d54e7b731e	[OpenCL] Add memory_scope_all_devices Add the `memory_scope_all_devices` enum value, which is restricted to OpenCL 3.0 or newer and the `__opencl_c_atomic_scope_all_devices` feature. Also guard `memory_scope_all_svm_devices` accordingly, which is already available in OpenCL 2.0. The `__opencl_c_atomic_scope_all_devices` feature is header-only, so set its define to 1 in `opencl-c-base.h`. This is done unconditionally at the moment, as the mechanism for disabling header-only options hasn't been decided yet. This patch only adds a negative test for now. Ideally adding a CL3.0 run line to atomic-ops.cl should suffice as a positive test, but we cannot do that yet until (at least) generic address spaces and program scope variables are supported in OpenCL 3.0 mode. Differential Revision: https://reviews.llvm.org/D103241	2021-06-08 11:51:12 +01:00
Stuart Brady	9b14670f3c	[OpenCL] Add const attribute to ctz() builtins Reviewed By: svenvh Differential Revision: https://reviews.llvm.org/D97725	2021-06-07 11:41:52 +01:00
Stuart Brady	86c24493ea	[OpenCL][NFC] Test commit: tidy up whitespace in comment	2021-06-04 14:44:12 +01:00
Qiu Chaofan	c0b3071833	[PowerPC] Fix x86 vector intrinsics wrapper compilation under C++ Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D103386	2021-06-01 01:19:12 +08:00

1862 Commits