llvm-project/clang/lib/CodeGen
Samuel Antao 4af1b7b693 [OpenMP] Update target directive codegen to use 4.5 implicit data mappings.
Summary:
This patch implements the 4.5 specification for the implicit data maps. OpenMP 4.5 specification changes the default way data is captured into a target region. All the non-aggregate kinds are passed by value by default. This required activating the capturing by value during SEMA for the target region. All the non-aggregate values that can be encoded in the size of a pointer are properly casted and forwarded to the runtime library. On top of fixing the previous weird behavior for mapping pointers in nested data regions (an explicit map was always required), this also improves performance as the number of allocations/transactions to the device per non-aggregate map are reduced from two to only one - instead of passing a reference and the value, only the value passed.

Explicit maps will be added later on once firstprivate, private, and map clauses' SEMA and parsing are available.

Reviewers: hfinkel, rjmccall, ABataev

Subscribers: cfe-commits, carlo.bertolli

Differential Revision: http://reviews.llvm.org/D14940

llvm-svn: 254521
2015-12-02 17:44:43 +00:00
..
ABIInfo.h Support __builtin_ms_va_list. 2015-09-17 20:55:33 +00:00
Address.h PR25368: Replace workaround for build failure with modules enabled with a fix 2015-11-05 21:16:22 +00:00
BackendUtil.cpp [ASan] Allow -fsanitize-recover=address. 2015-11-11 10:45:48 +00:00
CGAtomic.cpp Correct atomic libcall support for __atomic_*_fetch builtins. 2015-11-12 18:37:29 +00:00
CGBlocks.cpp Don't actually add the __unsafe_unretained qualifier in MRC; 2015-11-19 02:28:03 +00:00
CGBlocks.h Move BlockByrefHelpers back to CodeGenModule.h to placate MSVC. 2015-09-08 08:21:11 +00:00
CGBuilder.h Revert "Change memcpy/memset/memmove to have dest and source alignments." 2015-11-19 05:55:59 +00:00
CGBuiltin.cpp Move checkTargetFeatures to CodeGenFunction.cpp to make it 2015-11-12 00:44:07 +00:00
CGCUDANV.cpp Compute and preserve alignment more faithfully in IR-generation. 2015-09-08 08:05:57 +00:00
CGCUDARuntime.cpp Roll-back r250822. 2015-10-20 13:23:58 +00:00
CGCUDARuntime.h Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). 2015-06-22 23:07:51 +00:00
CGCXX.cpp [COFF] Don't try to emit weak aliases on COFF 2015-11-10 22:23:58 +00:00
CGCXXABI.cpp [MS ABI] Don't zero-initialize vbptrs in bases 2015-11-02 09:01:44 +00:00
CGCXXABI.h Fix use-after-free when a C++ thread_local variable gets replaced (because its 2015-12-01 01:10:48 +00:00
CGCall.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGCall.h Don't emit exceptional stackrestore cleanups around inalloca functions 2015-10-08 00:17:45 +00:00
CGClass.cpp Fix Clang-tidy modernize-use-nullptr warnings in source directories; other minor cleanups 2015-10-06 23:40:43 +00:00
CGCleanup.cpp CodeGen: Remove implicit ilist iterator conversions, NFC 2015-11-06 23:00:41 +00:00
CGCleanup.h [WinEH] Remove NewMSEH and enable its behavior by default 2015-10-08 01:13:52 +00:00
CGDebugInfo.cpp [DebugInfo] Look through type sugar on union types when casting 2015-11-20 17:41:12 +00:00
CGDebugInfo.h Put back doxygen comment accidentally dropped in r250418. 2015-10-15 16:46:25 +00:00
CGDecl.cpp Correctly handle type mismatches in the __weak copy/move-initialization 2015-11-16 22:11:41 +00:00
CGDeclCXX.cpp Fix use-after-free when a C++ thread_local variable gets replaced (because its 2015-12-01 01:10:48 +00:00
CGException.cpp CodeGen: Remove implicit ilist iterator conversions, NFC 2015-11-06 23:00:41 +00:00
CGExpr.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGExprAgg.cpp Atomics: support __c11_* calls on _Atomic struct types. 2015-11-09 19:56:35 +00:00
CGExprCXX.cpp [MS ABI] Tolerate invokes of __RTDynamicCast 2015-11-23 03:01:14 +00:00
CGExprComplex.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGExprConstant.cpp PR25368: Replace workaround for build failure with modules enabled with a fix 2015-11-05 21:16:22 +00:00
CGExprScalar.cpp CodeGen: Remove implicit ilist iterator conversions, NFC 2015-11-06 23:00:41 +00:00
CGLoopInfo.cpp Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll". 2015-08-10 17:29:39 +00:00
CGLoopInfo.h Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll". 2015-08-10 17:29:39 +00:00
CGObjC.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGObjCGNU.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGObjCMac.cpp Preserve exceptions information during calls code generation. 2015-11-23 22:04:44 +00:00
CGObjCRuntime.cpp Initialize @catch variables correctly in fragile-runtime ARC. 2015-10-30 00:56:02 +00:00
CGObjCRuntime.h Initialize @catch variables correctly in fragile-runtime ARC. 2015-10-30 00:56:02 +00:00
CGOpenCLRuntime.cpp Roll-back r250822. 2015-10-20 13:23:58 +00:00
CGOpenCLRuntime.h Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). 2015-06-22 23:07:51 +00:00
CGOpenMPRuntime.cpp [OpenMP] Update target directive codegen to use 4.5 implicit data mappings. 2015-12-02 17:44:43 +00:00
CGOpenMPRuntime.h Roll-back r250822. 2015-10-20 13:23:58 +00:00
CGRecordLayout.h Respect alignment of nested bitfields 2015-07-10 17:30:00 +00:00
CGRecordLayoutBuilder.cpp [MS ABI] Don't crash when inheriting from base with trailing empty array member 2015-10-22 18:04:22 +00:00
CGStmt.cpp [OPENMP 4.5] Parsing/sema analysis for 'taskloop' directive. 2015-12-01 04:18:41 +00:00
CGStmtOpenMP.cpp [OpenMP] Update target directive codegen to use 4.5 implicit data mappings. 2015-12-02 17:44:43 +00:00
CGVTT.cpp Remove and forbid raw_svector_ostream::flush() calls. 2015-08-13 18:12:56 +00:00
CGVTables.cpp CodeGen: Remove implicit ilist iterator conversions, NFC 2015-11-06 23:00:41 +00:00
CGVTables.h [CodeGen] Remove dead code. NFC. 2015-10-15 15:29:40 +00:00
CGValue.h Introduce __builtin_nontemporal_store and __builtin_nontemporal_load. 2015-09-08 23:52:33 +00:00
CMakeLists.txt [CMake] Fill up required libs, corresponding to r241653. 2015-07-08 02:06:21 +00:00
CodeGenABITypes.cpp Put back dead code that's used out-of-tree. 2015-10-20 07:50:21 +00:00
CodeGenAction.cpp Update for llvm api change. 2015-12-01 19:51:03 +00:00
CodeGenFunction.cpp When producing error messages for always_inline functions with the 2015-11-16 18:29:59 +00:00
CodeGenFunction.h [OpenMP] Update target directive codegen to use 4.5 implicit data mappings. 2015-12-02 17:44:43 +00:00
CodeGenModule.cpp Fix use-after-free when a C++ thread_local variable gets replaced (because its 2015-12-01 01:10:48 +00:00
CodeGenModule.h Fix use-after-free when a C++ thread_local variable gets replaced (because its 2015-12-01 01:10:48 +00:00
CodeGenPGO.cpp [PGO] Code cleanup [NFC] 2015-11-09 00:04:16 +00:00
CodeGenPGO.h [PGO] Removed unused code. [NFC] 2015-11-18 00:14:08 +00:00
CodeGenTBAA.cpp Roll-back r250822. 2015-10-20 13:23:58 +00:00
CodeGenTBAA.h Header guard canonicalization, clang part. 2014-08-13 16:25:19 +00:00
CodeGenTypeCache.h Compute and preserve alignment more faithfully in IR-generation. 2015-09-08 08:05:57 +00:00
CodeGenTypes.cpp [OpenCL] Add new types for OpenCL 2.0. 2015-09-15 11:18:52 +00:00
CodeGenTypes.h Remove superfluous private:, TypeCache is private by default. 2015-08-13 07:12:03 +00:00
CoverageMappingGen.cpp Use profile data template file for covmap func record (NFC) 2015-11-05 05:46:39 +00:00
CoverageMappingGen.h [cleanup] Re-sort *all* #include lines with llvm/utils/sort_includes.py 2015-01-14 11:29:14 +00:00
EHScopeStack.h [CodeGen] Remove dead code. NFC. 2015-10-15 15:29:40 +00:00
ItaniumCXXABI.cpp Fix use-after-free when a C++ thread_local variable gets replaced (because its 2015-12-01 01:10:48 +00:00
Makefile
MicrosoftCXXABI.cpp Remove superfluous StringRef casts, NFC. 2015-12-01 08:14:39 +00:00
ModuleBuilder.cpp [MSVC Compat] Enable ABI impacting non-conforming behavior independently of -fms-compatibility 2015-10-08 04:53:31 +00:00
ObjectFilePCHContainerOperations.cpp Module Debugging: Emit module debug info for types inside of Objective-C 2015-10-23 17:02:22 +00:00
README.txt
SanitizerMetadata.cpp [ASan] Initial support for Kernel AddressSanitizer 2015-06-19 12:19:07 +00:00
SanitizerMetadata.h Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. 2015-02-15 22:54:08 +00:00
TargetInfo.cpp Fixed default label in fully covered switch warning that was introduced in r254203. 2015-11-27 19:49:24 +00:00
TargetInfo.h Fix Clang-tidy modernize-use-nullptr warnings in headers and generated files; other minor cleanups. 2015-09-29 20:56:43 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//