llvm-project/clang/lib/CodeGen
Yaxun Liu a4005e13f7 [CUDA][HIP] Allow function-scope static const variable
CUDA 8.0 E.3.9.4 says: Within the body of a __device__ or __global__
function, only __shared__ variables or variables without any device
memory qualifiers may be declared with static storage class.

It is unclear how a function-scope non-const static variable
without device memory qualifier is implemented, therefore only static
const variable without device memory qualifier is allowed, which
can be emitted as a global variable in constant address space.

Currently clang only allows function-scope static variable with
__shared__ qualifier.

This patch also allows function-scope static const variable without
device memory qualifier and emits it as a global variable in constant
address space.

Differential Revision: https://reviews.llvm.org/D49931

llvm-svn: 338188
2018-07-28 03:05:25 +00:00
ABIInfo.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
Address.h
BackendUtil.cpp Re-land r337333, "Teach Clang to emit address-significance tables.", 2018-07-18 00:27:07 +00:00
CGAtomic.cpp Added atomic_fetch_min, max, umin, umax intrinsics to clang. 2018-05-13 07:45:58 +00:00
CGBlocks.cpp [CodeGen][ObjC] Make block copy/dispose helper functions exception-safe. 2018-07-26 16:51:21 +00:00
CGBlocks.h [CodeGen][ObjC] Make copying and disposing of a non-escaping block 2018-07-20 17:10:32 +00:00
CGBuilder.h CodeGen: specify alignment + inbounds for automatic variable initialization 2018-07-13 20:33:23 +00:00
CGBuiltin.cpp [NEON] Fix support for vrndi_f32(), vrndiq_f32() and vrndns_f32() intrinsics 2018-07-23 13:26:37 +00:00
CGCUDANV.cpp [HIP] Register/unregister device fat binary only once 2018-07-20 22:45:24 +00:00
CGCUDARuntime.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCUDARuntime.h
CGCXX.cpp IRgen: Mark aliases of ctors and dtors as unnamed_addr. 2018-06-18 20:58:54 +00:00
CGCXXABI.cpp [MS] Always use base dtors in place of complete/vbase dtors when possible 2018-03-16 19:40:50 +00:00
CGCXXABI.h [WebAssembly] Use Windows EH instructions for Wasm EH 2018-05-31 22:18:13 +00:00
CGCall.cpp [COFF, ARM64] Decide when to mark struct returns as SRet 2018-07-26 18:07:59 +00:00
CGCall.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGClass.cpp Implement CFI for indirect calls via a member function pointer. 2018-06-26 02:15:47 +00:00
CGCleanup.cpp Support lifetime-extension of conditional temporaries. 2018-07-23 22:56:45 +00:00
CGCleanup.h Replace LLVM_ALIGNAS with just alignas. 2018-07-17 22:24:11 +00:00
CGCoroutine.cpp [Coroutines] Less IR for noexcept await_resume 2018-06-23 18:57:26 +00:00
CGDebugInfo.cpp Revert "[DebugInfo] Generate debug information for labels. (Fix PR37395)" 2018-07-24 02:57:11 +00:00
CGDebugInfo.h Revert "[DebugInfo] Generate debug information for labels. (Fix PR37395)" 2018-07-24 02:57:11 +00:00
CGDecl.cpp [CodeGen][ObjC] Make block copy/dispose helper functions exception-safe. 2018-07-26 16:51:21 +00:00
CGDeclCXX.cpp Add a command line option 'fregister_global_dtors_with_atexit' to 2018-04-17 18:41:52 +00:00
CGException.cpp [CodeGen] Always use MSVC personality for windows-msvc targets 2018-06-08 00:41:01 +00:00
CGExpr.cpp [CodeGenCXX] Emit strip.invariant.group with -fstrict-vtable-pointers 2018-07-02 19:21:36 +00:00
CGExprAgg.cpp [Fixed Point Arithmetic] Fixed Point Precision Bits and Fixed Point Literals 2018-06-20 17:19:40 +00:00
CGExprCXX.cpp Rename invariant.group.barrier to launder.invariant.group 2018-05-03 11:03:01 +00:00
CGExprComplex.cpp Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGExprConstant.cpp [AST] Add a convenient getter from QualType to RecordDecl 2018-07-28 02:16:13 +00:00
CGExprScalar.cpp [CodeGenCXX] Emit strip.invariant.group with -fstrict-vtable-pointers 2018-07-02 19:21:36 +00:00
CGGPUBuiltin.cpp Recommit r326946 after reducing CallArgList memory footprint 2018-03-15 15:25:19 +00:00
CGLoopInfo.cpp [CodeGen] Pass objects that are expensive to copy by const ref. 2016-11-24 16:01:20 +00:00
CGLoopInfo.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGNonTrivialStruct.cpp Move the visitor classes that are used to traverse non-trivial C structs 2018-04-17 19:05:17 +00:00
CGObjC.cpp Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGObjCGNU.cpp [objc-gnustep2] Use unsigned char to avoid potential UB in isalnum. 2018-05-22 10:13:17 +00:00
CGObjCMac.cpp Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGObjCRuntime.cpp [CodeGen] Propagate may-alias'ness of lvalues with TBAA info 2017-10-31 11:05:34 +00:00
CGObjCRuntime.h Clean up CGObjCMac's APIs for deriving class references. NFC. 2016-11-30 23:54:50 +00:00
CGOpenCLRuntime.cpp [OpenCL] Add separate read_only and write_only pipe IR types 2018-04-27 10:37:04 +00:00
CGOpenCLRuntime.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGOpenMPRuntime.cpp [OPENMP] ThreadId in serialized parallel regions is 0. 2018-07-25 20:03:01 +00:00
CGOpenMPRuntime.h [OPENMP] Fix checks for declare target link entries. 2018-07-16 20:05:25 +00:00
CGOpenMPRuntimeNVPTX.cpp [OPENMP] ThreadId in serialized parallel regions is 0. 2018-07-25 20:03:01 +00:00
CGOpenMPRuntimeNVPTX.h [OPENMP, NVPTX] Fix globalization of the variables passed to orphaned 2018-06-21 20:26:33 +00:00
CGRecordLayout.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGRecordLayoutBuilder.cpp [AST] Add a convenient getter from QualType to RecordDecl 2018-07-28 02:16:13 +00:00
CGStmt.cpp Revert "[DebugInfo] Generate debug information for labels. (Fix PR37395)" 2018-07-24 02:57:11 +00:00
CGStmtOpenMP.cpp [OPENMP] DO not crash on combined constructs in declare target 2018-05-16 15:08:32 +00:00
CGVTT.cpp Recommit r324107 again. 2018-02-07 22:15:33 +00:00
CGVTables.cpp Implement CFI for indirect calls via a member function pointer. 2018-06-26 02:15:47 +00:00
CGVTables.h [MS] Emit vftable thunks for functions with incomplete prototypes 2018-04-02 20:20:33 +00:00
CGValue.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CMakeLists.txt Compile CodeGenModule.cpp with /bigobj. 2018-06-26 17:45:26 +00:00
CodeGenABITypes.cpp Include getting generated struct offsets in CodegenABITypes 2017-10-10 23:54:21 +00:00
CodeGenAction.cpp Change \t to spaces 2018-07-20 08:19:20 +00:00
CodeGenFunction.cpp Implement cpu_dispatch/cpu_specific Multiversioning 2018-07-20 14:13:28 +00:00
CodeGenFunction.h [CodeGen][ObjC] Make block copy/dispose helper functions exception-safe. 2018-07-26 16:51:21 +00:00
CodeGenModule.cpp [CUDA][HIP] Allow function-scope static const variable 2018-07-28 03:05:25 +00:00
CodeGenModule.h Implement cpu_dispatch/cpu_specific Multiversioning 2018-07-20 14:13:28 +00:00
CodeGenPGO.cpp Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CodeGenPGO.h Remove a dead field. NFC. 2017-04-24 20:54:36 +00:00
CodeGenTBAA.cpp [CodeGen] Fix generation of TBAA tags for may-alias accesses 2018-02-20 12:33:04 +00:00
CodeGenTBAA.h [CodeGen] Fix generation of TBAA tags for may-alias accesses 2018-02-20 12:33:04 +00:00
CodeGenTypeCache.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
CodeGenTypes.cpp [Fixed Point Arithmetic] Addition of the remaining fixed point types and their saturated equivalents 2018-06-14 14:53:51 +00:00
CodeGenTypes.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
ConstantEmitter.h [NFC] typo 2018-07-11 19:51:40 +00:00
ConstantInitBuilder.cpp Further fixes and improvements to the ConstantInitBuilder API. 2017-03-06 19:04:16 +00:00
CoverageMappingGen.cpp [Coverage] End deferred regions before labels, fixes PR35867 2018-06-01 00:37:13 +00:00
CoverageMappingGen.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
EHScopeStack.h Spelling mistakes in comments. NFCI. (PR27635) 2017-03-30 14:13:19 +00:00
ItaniumCXXABI.cpp Borrow visibility from __fundamental_type_info for generated fundamental type infos 2018-07-24 00:43:47 +00:00
MacroPPCallbacks.cpp Reland '[clang] Adding CharacteristicKind to PPCallbacks::InclusionDirective' 2018-05-10 19:05:36 +00:00
MacroPPCallbacks.h Reland '[clang] Adding CharacteristicKind to PPCallbacks::InclusionDirective' 2018-05-10 19:05:36 +00:00
MicrosoftCXXABI.cpp [ARM64] [Windows] Follow MS X86_64 C++ ABI when passing structs 2018-07-26 22:18:28 +00:00
ModuleBuilder.cpp D34444: Teach codegen to work in incremental processing mode. 2017-08-27 10:58:03 +00:00
ObjectFilePCHContainerOperations.cpp [clang] Update uses of DEBUG macro to LLVM_DEBUG. 2018-05-15 13:30:56 +00:00
README.txt
SanitizerMetadata.cpp hwasan: add -fsanitize=kernel-hwaddress flag 2018-04-13 18:05:21 +00:00
SanitizerMetadata.h
SwiftCallingConv.cpp Generalize the swiftcall API since being passed indirectly isn't 2018-04-07 20:16:47 +00:00
TargetInfo.cpp [RISCV] Add support for interrupt attribute 2018-07-26 17:37:45 +00:00
TargetInfo.h [CUDA][HIP] Set kernel calling convention before arrange function 2018-06-12 00:16:33 +00:00
VarBypassDetector.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
VarBypassDetector.h [CodeGen] Don't emit lifetime intrinsics for some local variables 2016-10-26 05:42:30 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates a zext/sext of x, which can easily be avoided.
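
As a rough sketch of why the extension is avoidable (the function names
below are hypothetical and not part of clang): the language requires x to be
promoted to int before the comparison, but because the constant 10 is
representable in a 16-bit type, a comparison carried out in the narrow type
yields the same result for every value of x.

```cpp
#include <cstdint>

// What the language prescribes: x is widened to int (a sext in IR for a
// signed type) and then compared against the int constant 10.
bool compare_promoted(std::int16_t x) {
  return static_cast<int>(x) == 10;
}

// Models the narrow comparison IRgen could emit instead (an i16 compare
// against a narrowed constant). In C++ source both operands are still
// promoted; this function only illustrates that the results agree.
bool compare_narrow(std::int16_t x) {
  return x == static_cast<std::int16_t>(10);
}
```

The two functions agree on all 65536 possible inputs, which is what makes
narrowing the constant a safe alternative to widening x.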

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned, then it is a lot shorter to just load the char
directly.
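
A minimal sketch of the two access strategies, assuming a hypothetical
layout in which a signed 8-bit field occupies bits [8, 16) of a 32-bit
storage unit stored little-endian in memory. The function names and the
layout are illustrative assumptions, not clang's actual bitfield lowering.

```cpp
#include <cstdint>
#include <cstring>

// General path: load the whole storage unit, shift, mask, sign-extend.
std::int32_t load_field_shift_mask(const unsigned char *storage) {
  std::uint32_t word = static_cast<std::uint32_t>(storage[0]) |
                       (static_cast<std::uint32_t>(storage[1]) << 8) |
                       (static_cast<std::uint32_t>(storage[2]) << 16) |
                       (static_cast<std::uint32_t>(storage[3]) << 24);
  std::int32_t value = static_cast<std::int32_t>((word >> 8) & 0xFFu);
  if (value & 0x80)
    value -= 0x100;  // manual sign extension from 8 bits
  return value;
}

// Shortcut: the field is byte-sized and byte-aligned, so a single signed
// byte load at offset 1 is enough.
std::int32_t load_field_direct(const unsigned char *storage) {
  std::int8_t byte;
  std::memcpy(&byte, storage + 1, sizeof(byte));
  return byte;
}
```

Both functions return the same value for any storage contents; the second
needs no shift or mask at all.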

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments
in the common situation where the argument is never written to and
never has its address taken. The idea would be to begin generating code
by using the argument directly and, if its address is taken or it is
stored to, then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about here is -O0 -g compile-time
performance, and in that scenario we currently need to emit the alloca
anyway to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.
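
To make the formal-argument cases above concrete, here is a small sketch
with hypothetical function names. The comments describe what the proposed
scheme would do, not what clang currently emits.

```cpp
// The argument is only read: it is never stored to and its address is never
// taken, so under the proposed scheme its incoming value could be used
// directly and no alloca would be needed.
int read_only_argument(int n) {
  return n * 2 + 1;
}

// The argument is stored to: the proposed scheme would create the alloca
// once the store is seen and patch up any code already generated.
int written_argument(int n) {
  n += 5;
  return n * 2;
}

// Helper that writes through a pointer (hypothetical, for illustration).
static void bump(int *p) { *p += 1; }

// The argument's address is taken: it needs a memory location, so an
// alloca is required here as well.
int address_taken_argument(int n) {
  bump(&n);
  return n;
}
```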

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead) down through code generation and assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!
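
A figure like the one above could be gathered with a count along these
lines. This is only a sketch over a hypothetical, simplified block model
(ToyBlock is not an LLVM type), and it does not reproduce how the 176.gcc
number was actually measured.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical, simplified model of a basic block.
struct ToyBlock {
  std::size_t non_terminator_count;   // instructions before the terminator
  bool ends_in_unconditional_branch;  // terminator is a plain jump
};

// A block is "jump-only" when it holds nothing but an unconditional branch.
bool is_jump_only(const ToyBlock &block) {
  return block.non_terminator_count == 0 &&
         block.ends_in_unconditional_branch;
}

// Fraction of blocks that consist solely of a direct branch.
double jump_only_fraction(const std::vector<ToyBlock> &blocks) {
  if (blocks.empty())
    return 0.0;
  std::size_t count = 0;
  for (const ToyBlock &block : blocks)
    if (is_jump_only(block))
      ++count;
  return static_cast<double>(count) / static_cast<double>(blocks.size());
}
```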

//===---------------------------------------------------------------------===//