llvm-project

Author	SHA1	Message	Date
Joseph Huber	9d3550c517	[OpenMP] Add AMDGPU calling convention to ctor / dtor functions This patch adds the necessary AMDGPU calling convention to the ctor / dtor kernels. These are fundamentally device kernels called by the host on image load. Without this calling convention information the AMDGPU plugin is unable to identify them. Depends on D122504 Fixes #54091 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122515	2022-03-25 22:44:20 -04:00
Joseph Huber	3c6d32ec6c	[OpenMP] Make Ctor / Dtor functions have external visibility The default construction of constructor functions by LLVM tends to make them have internal linkage. When we call a ctor / dtor function in the target region we are actually creating a kernel that is called at registration. Because the ctor is a kernel we need to make sure it's externally visible so we can actually call it. This prevented AMDGPU from correctly using constructors while NVPTX could use them simply because it ignored internal visibility. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D122504	2022-03-25 22:44:17 -04:00
Joseph Huber	b9f67d44ba	[OpenMP] Replace device kernel linkage with weak_odr Currently the device kernels all have weak linkage to prevent linkage errors on multiple definitions. However, this prevents some optimizations from adequately analyzing them because of the nature of weak linkage. This patch replaces the weak linkage with weak_odr linkage so we can statically assert that multiple declarations of the same kernel will have the same definition. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122443	2022-03-25 11:29:15 -04:00
Jennifer Yu	a6cdac48ff	Eliminate extra set of simd variant function attribute. Current clang generates extra set of simd variant function attribute with extra 'v' encoding. For example: _ZGVbN2v__Z5add_1Pf vs _ZGVbN2vv__Z5add_1Pf The problem is due to declaration of ParamAttrs following: llvm::SmallVector<ParamAttrTy, 8> ParamAttrs(ParamPositions.size()); where ParamPositions.size() is grown after following assignment: Pos = ParamPositions[PVD]; So the PVD is not find in ParamPositions. The problem is ParamPositions need to set for each FD decl. To fix this Move ParamPositions's init inside while loop for each FD. Differential Revision: https://reviews.llvm.org/D122338	2022-03-24 13:27:28 -07:00
Dávid Bolvanský	a683ba4ff5	[NFCI] Fix set-but-unused warning in CGOpenMPRuntime.cpp	2022-03-24 07:49:21 +01:00
Joseph Huber	0d16c23af1	[OpenMP] Do not create offloading entries for internal or hidden symbols Currently we create offloading entries to register device variables with the host. When we register a variable we will look up the symbol in the device image and map the device address to the host address. This is a problem when the symbol is declared with hidden visibility or internal linkage. This means the symbol is not accessible externally and we cannot get its address. We should still allow static variables to be declared on the device, but we should not create an offloading entry for them so they exist independently on the host and device. Fixes #54309 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122352	2022-03-23 18:27:16 -04:00
Nikita Popov	c070d5ceff	[CGOpenMPRuntime] Remove uses of deprecated Address constructor And as these are the last remaining uses, also remove the constructor itself.	2022-03-23 12:40:44 +01:00
Nikita Popov	8b62dd3cd6	Reapply [CodeGen] Avoid deprecated Address ctor in EmitLoadOfPointer() This requires some adjustment in caller code, because there was a confusion regarding the meaning of the PtrTy argument: This argument is the type of the pointer being loaded, not the addresses being loaded from. Reapply after fixing the specified pointer type for one call in `47eb4f7dcd`, where the used type is important for determining alignment.	2022-03-23 12:06:11 +01:00
Nikita Popov	47eb4f7dcd	[CGOpenMPRuntime] Specify correct type in EmitLoadOfPointerLValue() Perform a bitcast first, so we can specify the correct pointer type in EmitLoadOfPointerLValue(), rather than using a dummy void pointer.	2022-03-23 11:51:14 +01:00
Nikita Popov	ba2be802b0	[CGOpenMPRuntime] Reuse getDepobjElements() (NFC) There were two more places repeating this code, reuse the helper. This requires moving the static functions into the class.	2022-03-23 11:31:49 +01:00
Nikita Popov	27f6cee12d	Revert "[CodeGen] Avoid deprecated Address ctor in EmitLoadOfPointer()" This reverts commit `767ec883e3`. This results in some incorrect alignments which are not covered by existing tests.	2022-03-23 10:24:39 +01:00
Nikita Popov	cd6d9ae263	[CGOpenMPRuntime] Remove some uses of deprecated Address ctor	2022-03-22 16:29:35 +01:00
Nikita Popov	4f5640cad3	[CGOpenMPRuntime] Remove some uses of deprecated Address ctor	2022-03-22 15:35:45 +01:00
Nikita Popov	767ec883e3	[CodeGen] Avoid deprecated Address ctor in EmitLoadOfPointer() This requires some adjustment in caller code, because there was a confusion regarding the meaning of the PtrTy argument: This argument is the type of the pointer being loaded, not the addresses being loaded from.	2022-03-22 09:42:31 +01:00
Nikita Popov	b6f85d8539	[CodeGen][OpenMP] Use correct type in EmitLoadOfPointer() Rather than using a dummy void pointer type, we should specify the correct private type and perform the bitcast beforehand rather than afterwards. This way, the Address will have correct alignment information.	2022-03-21 12:08:05 +01:00
Nikita Popov	52cc65d474	[OpenMPRuntime] Specify correct pointer type Rather than specifying a dummy type in EmitLoadOfPointer() and then casting it to the correct one, we should instead specify the correct type and cast beforehand. Otherwise the computed alignment will be incorrect.	2022-03-18 14:25:51 +01:00
Johannes Doerfert	f02550bdd9	Reapply "[OpenMP][FIX] Allow device constructors for AMD GPU" This reverts commit `a597d6a780` and reapplies `07b1766461`. In AMD GPU device code the globals are in AS(1). Before, we crashed if the global was a structure. Now we simply cast away the AS before we generate the code to initialize the global. Differential Revision: https://reviews.llvm.org/D121837 Fixes: https://github.com/llvm/llvm-project/issues/54421	2022-03-17 12:53:47 -05:00
Nikita Popov	6c0af92612	[CodeGen] Avoid some pointer element type accesses	2022-03-17 16:36:14 +01:00
Johannes Doerfert	a597d6a780	Revert "[OpenMP][FIX] Allow device constructors for AMD GPU" This reverts commit `07b1766461` as it broke the buildbots: https://lab.llvm.org/buildbot#builders/193/builds/8594	2022-03-16 17:35:54 -05:00
Johannes Doerfert	07b1766461	[OpenMP][FIX] Allow device constructors for AMD GPU In AMD GPU device code the globals are in AS(1). Before, we crashed if the global was a structure. Now we simply cast away the AS before we generate the code to initialize the global. Differential Revision: https://reviews.llvm.org/D121837	2022-03-16 17:04:28 -05:00
David Blaikie	c0a6433f2b	Simplify OpenMP Lambda use * Use default ref capture for non-escaping lambdas (this makes maintenance easier by allowing new uses, removing uses, having conditional uses (such as in assertions) not require updates to an explicit capture list) * Simplify addPrivate API not to take a lambda, since it calls it unconditionally/immediately anyway - most callers are simply passing in a named value or short expression anyway and the lambda syntax just adds noise/overhead Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D121077	2022-03-07 18:23:20 +00:00
Akira Hatanaka	d112cc2756	[NFC][Clang][OpaquePtr] Remove the call to Address::deprecated in CreatePointerBitCastOrAddrSpaceCast Differential Revision: https://reviews.llvm.org/D120757	2022-03-02 08:58:00 -08:00
Alexey Bataev	d04d9220e1	[OPENMP]Fix PR50347: Mapping of global scope deep object fails. Changed the way we handle llvm::Constants in sizes arrays. ConstExprs and GlobalValues cannot be used as initializers, need to put them at the runtime, otherwise there might be compilation errors. Differential Revision: https://reviews.llvm.org/D105297	2022-02-25 10:54:24 -08:00
Alexey Bataev	ca6fa71b7e	Revert "[OPENMP]Fix PR50347: Mapping of global scope deep object fails." This reverts commit `638938117a`. Need to fix reported fail https://lab.llvm.org/buildbot/#/builders/193/builds/7496	2022-02-24 12:04:39 -08:00
Alexey Bataev	638938117a	[OPENMP]Fix PR50347: Mapping of global scope deep object fails. Changed the way we handle llvm::Constants in sizes arrays. ConstExprs and GlobalValues cannot be used as initializers, need to put them at the runtime, otherwise there might be compilation errors. Differential Revision: https://reviews.llvm.org/D105297	2022-02-24 11:49:14 -08:00
Joseph Huber	119d71cb73	[OpenMP][NFC] Address warnings and lint messages in CGOpenMPRuntime Summary: This patch addressed the warnings and linting messages for the CGOpenMPRuntime.cpp file. This was causing some -Werror builds to fail.	2022-02-23 18:07:25 -05:00
Reid Kleckner	1d1b089c5d	Fix more unused lambda capture warnings, NFC	2022-02-23 14:07:04 -08:00
Reid Kleckner	cd37594c03	Fix unused lambda capture warning, NFC	2022-02-23 14:01:01 -08:00
Joseph Huber	2b97b16f29	[OpenMP] Add option to make offloading mandatory Currently when we generate OpenMP offloading code we always make fallback code for the CPU. This is necessary for implementing features like conditional offloading and ensuring that unhandled pragmas don't result in missing symbols. However, this is problematic for a few cases. For offloading tests we can silently fail to the host without realizing that offloading failed. Additionally, this makes it impossible to provide interoperability to other offloading schemes like HIP or CUDA because those methods do not provide any such host fallback guarantee. This patch adds the `-fopenmp-offload-mandatory` flag to prevent generating the fallback symbol on the CPU and instead replaces the function with a dummy global and the failed branch with 'unreachable'. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120353	2022-02-23 16:45:36 -05:00
Nikita Popov	5065076698	[CodeGen] Rename deprecated Address constructor To make uses of the deprecated constructor easier to spot, and to ensure that no new uses are introduced, rename it to Address::deprecated(). While doing the rename, I've filled in element types in cases where it was relatively obvious, but we're still left with 135 calls to the deprecated constructor.	2022-02-17 11:26:42 +01:00
David Pagan	0a7cc078ac	Enable inoutset dependency-type in depend clause. Done in manner similar to mutexinoutset (see https://reviews.llvm.org/D57576) Runtime support already exists in LLVM OpenMP runtime (see https://reviews.llvm.org/D97085). The value used to identify an inoutset dependency type in the LLVM OpenMP runtime is 8. Some tests updated due to change in dependency type error messages that now include new dependency type. Also updated test/OpenMP/task_codegen.cpp to verify we emit the right code.	2022-02-08 08:35:36 -05:00
Nikita Popov	30d4a7e295	[IRBuilder] Require explicit element type in CreatePtrDiff() For opaque pointer compatibility, we cannot derive the element type from the pointer type.	2022-01-25 12:43:57 +01:00
Nikita Popov	caff8591ef	[OpenMP] Simplify pointer comparison Rather than checking ptrdiff(a, b) != 0, directly check a != b.	2022-01-25 12:38:37 +01:00
Nikita Popov	99adacbcb7	[clang] Remove some getPointerElementType() uses Same cases where the call can be removed in a straightforward way.	2022-01-25 12:09:06 +01:00
Nikita Popov	aa97bc116d	[NFC] Remove uses of PointerType::getElementType() Instead use either Type::getPointerElementType() or Type::getNonOpaquePointerElementType(). This is part of D117885, in preparation for deprecating the API.	2022-01-25 09:44:52 +01:00
Kazu Hirata	17d4bd3d78	[clang] Fix bugprone argument comments (NFC) Identified with bugprone-argument-comment.	2022-01-09 00:19:49 -08:00
David Pagan	7df2371bc6	Add codegen for allocate directive's 'align' clause	2022-01-05 12:40:58 -05:00
Johannes Doerfert	944aa0421c	Reapply "[OpenMP][NFCI] Embed the source location string size in the ident_t" This reverts commit `73ece231ee` and reapplies `7bfcdbcbf3` with mlir changes. Also reverts commit `423ba12971` and includes the unit test changes of `16da214004`.	2021-12-29 01:10:38 -06:00
Mehdi Amini	73ece231ee	Revert "[OpenMP][NFCI] Embed the source location string size in the ident_t" This reverts commit `7bfcdbcbf3`. Broke MLIR build	2021-12-29 06:57:36 +00:00
Johannes Doerfert	7bfcdbcbf3	[OpenMP][NFCI] Embed the source location string size in the ident_t One of the unused ident_t fields now holds the size of the string (=const char *) field so we have an easier time dealing with those in the future. Differential Revision: https://reviews.llvm.org/D113126	2021-12-28 23:53:29 -06:00
Kazu Hirata	31cfb3f4f6	[clang] Remove redundant calls to c_str() (NFC) Identified with readability-redundant-string-cstr.	2021-12-26 13:31:40 -08:00
Kazu Hirata	0542d15211	Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2021-12-26 09:39:26 -08:00
Kazu Hirata	76f0f1cc5c	Use {DenseSet,SetVector,SmallPtrSet}::contains (NFC)	2021-12-24 21:43:06 -08:00
Nikita Popov	dd903173c0	[OpenMP] Avoid creating null pointer lvalue (NFC) The reduction initialization code creates a "naturally aligned null pointer to void lvalue", which I found somewhat odd, even though it works out in the end because it is not actually used. It doesn't look like this code actually needs an LValue for anything though, and we can use an invalid Address to represent this case instead. Differential Revision: https://reviews.llvm.org/D116214	2021-12-24 09:01:56 +01:00
Nikita Popov	7977fd7cfc	[OpenMP] Remove no-op cast (NFC) This was casting the address to its own element type, which is a no-op.	2021-12-23 15:15:26 +01:00
Nikita Popov	2c7dc13146	[CGBuilder] Add CreateGEP() overload that accepts an Address Add an overload for an Address and a single non-constant offset. This makes it easier to preserve the element type and adjust the alignment appropriately.	2021-12-23 14:53:42 +01:00
Nikita Popov	09669e6c5f	[CodeGen] Avoid pointer element type access when creating LValue This required fixing two places that were passing the pointer type rather than the expected pointee type to the method.	2021-12-23 10:53:15 +01:00
Nikita Popov	34eb715f61	[CodeGen] Avoid more pointer element type accesses	2021-12-16 12:03:11 +01:00
Nikita Popov	d930c3155c	[CodeGen] Pass element type to EmitCheckedInBoundsGEP() Same as for other GEP creation methods.	2021-12-15 14:03:33 +01:00
Joseph Huber	bc9c4d7216	[OpenMP][FIX] Pass the num_threads value directly to parallel_51 The problem with the old scheme is that we would need to keep track of the "next region" and reset the num_threads value after it. The new RT doesn't do it and an assertion is triggered. The old RT doesn't do it either, I haven't tested it but I assume a num_threads clause might impact multiple parallel regions "accidentally". Further, in SPMD mode num_threads was simply ignored, for some reason beyond me. In any case, parallel_51 is designed to take the clause value directly, so let's do that instead. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D113623	2021-12-09 16:30:29 -05:00

682 Commits