llvm-project/clang/lib/CodeGen
Richard Smith fbe2369f1a Improve handling of instantiated thread_local variables in Itanium C++ ABI.
 * Do not initialize these variables when initializing the rest of the
   thread_locals in the TU; they have unordered initialization so they can be
   initialized by themselves.

   This fixes a rejects-valid bug: we would make the per-variable initializer
   function internal, but put it in a comdat keyed off the variable, resulting
   in link errors when the comdat is selected from a different TU (as the per
   TU TLS init function tries to call an init function that does not exist).

 * On Darwin, when we decide that we're not going to emit a thread wrapper
   function at all, demote its linkage to External. Fixes a verifier failure
   on explicit instantiation of a thread_local variable on Darwin.

llvm-svn: 291865
2017-01-13 00:43:31 +00:00
ABIInfo.h swiftcc: Add an api to query whether a target ABI stores swifterror in a register 2016-12-01 18:07:38 +00:00
Address.h Work around build failure due to GCC 4.8.1 bug. We don't completely understand 2016-02-02 23:11:49 +00:00
BackendUtil.cpp Revert r291774 which caused buildbot failure. 2017-01-12 16:56:18 +00:00
CGAtomic.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGBlocks.cpp Add the alloc_size attribute to clang, attempt 2. 2016-12-22 02:50:20 +00:00
CGBlocks.h [CodeGen][ObjC] Block captures should inherit the type of the captured 2016-09-16 00:02:06 +00:00
CGBuilder.h IRGen: Remove an unused overload of CreateAlignedLoad. 2016-12-05 00:02:18 +00:00
CGBuiltin.cpp [ARM] Use generic bitreverse intrinsic, rather than ARM specific rbit. 2017-01-10 18:55:11 +00:00
CGCUDABuiltin.cpp [CUDA] Align kernel launch args correctly when the LLVM type's alignment is different from the clang type's alignment. 2016-07-27 22:36:21 +00:00
CGCUDANV.cpp ConstantBuilder -> ConstantInitBuilder for clarity, and 2016-11-28 22:18:27 +00:00
CGCUDARuntime.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCUDARuntime.h [CUDA] Emit host-side 'shadows' for device-side global variables 2016-03-02 18:28:50 +00:00
CGCXX.cpp CodeGen: New vtable group representation: struct of vtable arrays. 2016-12-13 20:40:39 +00:00
CGCXXABI.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCXXABI.h Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCall.cpp Clean up redundant isa<T> before getAs<T>. NFC. 2017-01-06 19:10:48 +00:00
CGCall.h Name some anonymous structs to avoid using a (very common) extension. 2016-11-07 21:13:27 +00:00
CGClass.cpp Remove custom handling of array copies in lambda by-value array capture and 2016-12-14 00:03:17 +00:00
CGCleanup.cpp Retire llvm::alignOf in favor of C++11 alignof. 2016-10-20 14:27:22 +00:00
CGCleanup.h Use the correct ObjC EH personality 2017-01-08 22:58:07 +00:00
CGCoroutine.cpp [coroutines] Add allocation and deallocation substatements. 2016-10-27 16:28:31 +00:00
CGDebugInfo.cpp DebugInfo: Don't include size/alignment on class declarations 2016-12-27 22:05:35 +00:00
CGDebugInfo.h [DebugInfo] Added support for Checksum debug info feature. 2016-12-25 10:12:27 +00:00
CGDecl.cpp CGDecl: Skip static variable initializers in unreachable code 2017-01-10 17:43:01 +00:00
CGDeclCXX.cpp Improve handling of instantiated thread_local variables in Itanium C++ ABI. 2017-01-13 00:43:31 +00:00
CGException.cpp Use the correct ObjC EH personality 2017-01-08 22:58:07 +00:00
CGExpr.cpp [ubsan] Minimize size of data for type_mismatch (Redo of D19667) 2017-01-06 14:40:12 +00:00
CGExprAgg.cpp Fix problems in "[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand." 2016-12-23 14:55:49 +00:00
CGExprCXX.cpp Remove custom handling of array copies in lambda by-value array capture and 2016-12-14 00:03:17 +00:00
CGExprComplex.cpp Fix problems in "[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand." 2016-12-23 14:55:49 +00:00
CGExprConstant.cpp [CodeGen] Unique constant CompoundLiterals. 2016-12-28 07:27:40 +00:00
CGExprScalar.cpp Fix problems in "[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand." 2016-12-23 14:55:49 +00:00
CGLoopInfo.cpp [CodeGen] Pass objects that are expensive to copy by const ref. 2016-11-24 16:01:20 +00:00
CGLoopInfo.h [CodeGen] Pass objects that are expensive to copy by const ref. 2016-11-24 16:01:20 +00:00
CGObjC.cpp CodeGen: fix runtime function dll storage 2016-12-15 06:59:05 +00:00
CGObjCGNU.cpp Clean up CGObjCMac's APIs for deriving class references. NFC. 2016-11-30 23:54:50 +00:00
CGObjCMac.cpp Clean up CGObjCMac's APIs for deriving class references. NFC. 2016-11-30 23:54:50 +00:00
CGObjCRuntime.cpp CodeGen: ensure that the runtime calling convention matches 2016-10-13 19:45:08 +00:00
CGObjCRuntime.h Clean up CGObjCMac's APIs for deriving class references. NFC. 2016-11-30 23:54:50 +00:00
CGOpenCLRuntime.cpp [OpenCL] Augment pipe built-ins with pipe packet size and alignment. 2016-09-23 14:20:00 +00:00
CGOpenCLRuntime.h [OpenCL] Augment pipe built-ins with pipe packet size and alignment. 2016-09-23 14:20:00 +00:00
CGOpenMPRuntime.cpp [OpenMP] Basic support for a parallel directive in a target region on an NVPTX device 2017-01-10 15:42:51 +00:00
CGOpenMPRuntime.h [OpenMP] Basic support for a parallel directive in a target region on an NVPTX device 2017-01-10 15:42:51 +00:00
CGOpenMPRuntimeNVPTX.cpp [OpenMP] Basic support for a parallel directive in a target region on an NVPTX device 2017-01-10 15:42:51 +00:00
CGOpenMPRuntimeNVPTX.h [OpenMP] Basic support for a parallel directive in a target region on an NVPTX device 2017-01-10 15:42:51 +00:00
CGRecordLayout.h Make CodeGen headers self-contained. 2016-02-02 16:05:18 +00:00
CGRecordLayoutBuilder.cpp revert SVN r265702, r265640 2016-04-08 16:52:00 +00:00
CGStmt.cpp [OpenMP] Sema and parsing for 'target teams distribute simd’ pragma 2017-01-10 18:08:18 +00:00
CGStmtOpenMP.cpp [OpenMP] Sema and parsing for 'target teams distribute simd’ pragma 2017-01-10 18:08:18 +00:00
CGVTT.cpp CodeGen: Start using inrange annotations on vtable getelementptr. 2016-12-13 20:50:44 +00:00
CGVTables.cpp CodeGen: New vtable group representation: struct of vtable arrays. 2016-12-13 20:40:39 +00:00
CGVTables.h CodeGen: New vtable group representation: struct of vtable arrays. 2016-12-13 20:40:39 +00:00
CGValue.h [Sema] PR26444 fix crash when alignment value is >= 2**16 2016-03-02 06:48:47 +00:00
CMakeLists.txt clangCodeGen: Add LLVMPasses to libdeps. r290450 introduced it. 2016-12-24 01:55:12 +00:00
CodeGenABITypes.cpp Various improvements to the public IRGen interface. 2016-05-18 05:21:18 +00:00
CodeGenAction.cpp CodeGen: plumb header search down to the IAS 2017-01-05 16:02:32 +00:00
CodeGenFunction.cpp Add a cc1 option to force disabling lifetime-markers emission from clang 2017-01-06 23:18:09 +00:00
CodeGenFunction.h [OpenMP] Sema and parsing for 'target teams distribute simd’ pragma 2017-01-10 18:08:18 +00:00
CodeGenModule.cpp Module: Do not add any link flags when an implementation TU of a module imports 2017-01-11 18:47:38 +00:00
CodeGenModule.h [CodeGen] Unique constant CompoundLiterals. 2016-12-28 07:27:40 +00:00
CodeGenPGO.cpp [Coverage] Support for C++17 if initializers 2016-10-14 23:38:16 +00:00
CodeGenPGO.h [NFC] Header cleanup 2016-07-18 19:02:11 +00:00
CodeGenTBAA.cpp revert SVN r265702, r265640 2016-04-08 16:52:00 +00:00
CodeGenTBAA.h Make the remaining headers self-contained. 2016-02-02 14:24:21 +00:00
CodeGenTypeCache.h Re-commit [OpenCL] AMDGCN: Fix size_t type 2016-08-19 05:17:25 +00:00
CodeGenTypes.cpp Re-commit r289252 and r289285, and fix PR31374 2016-12-15 08:09:08 +00:00
CodeGenTypes.h Re-commit r289252 and r289285, and fix PR31374 2016-12-15 08:09:08 +00:00
ConstantBuilder.h Struct GEPs must use i32, not whatever size_t is. It should be safe 2016-12-01 23:51:30 +00:00
CoverageMappingGen.cpp Fix use-of-temporary with StringRef in code coverage 2016-11-07 17:28:04 +00:00
CoverageMappingGen.h [NFC] Header cleanup 2016-07-18 19:02:11 +00:00
EHScopeStack.h Retire llvm::alignOf in favor of C++11 alignof. 2016-10-20 14:27:22 +00:00
ItaniumCXXABI.cpp Improve handling of instantiated thread_local variables in Itanium C++ ABI. 2017-01-13 00:43:31 +00:00
MicrosoftCXXABI.cpp CodeGen: update comment about RTTI field 2017-01-01 19:16:02 +00:00
ModuleBuilder.cpp Introduce a type-safe enum for ForDefinition. 2016-11-30 23:25:13 +00:00
ObjectFilePCHContainerOperations.cpp CodeGen: plumb header search down to the IAS 2017-01-05 16:02:32 +00:00
README.txt
SanitizerMetadata.cpp Implement no_sanitize_address for global vars 2016-10-14 19:55:09 +00:00
SanitizerMetadata.h Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. 2015-02-15 22:54:08 +00:00
SwiftCallingConv.cpp swiftcc: Add an api to query whether a target ABI stores swifterror in a register 2016-12-01 18:07:38 +00:00
TargetInfo.cpp Correct Vectorcall Register passing and HVA Behavior 2017-01-05 00:20:51 +00:00
TargetInfo.h Re-commit r289252 and r289285, and fix PR31374 2016-12-15 08:09:08 +00:00
VarBypassDetector.cpp [CodeGen] Don't emit lifetime intrinsics for some local variables 2016-10-26 05:42:30 +00:00
VarBypassDetector.h [CodeGen] Don't emit lifetime intrinsics for some local variables 2016-10-26 05:42:30 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates a zext/sext of x which can easily be avoided.
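
A minimal sketch of why the extension is avoidable, assuming the constant
fits in the narrow type. The function names are hypothetical and this is
not clang code; it only checks that a 16-bit comparison gives the same
answer as the widened one for every short value.

```cpp
#include <climits>
#include <cstdint>

// What the straightforward lowering computes: widen x to int (a sext for
// a signed short), then compare at int width.
bool eq_widened(short x) { return static_cast<int>(x) == 10; }

// Models a compare at the narrow width: only the 16-bit patterns are
// compared. C++ itself still promotes both operands to int here, so this
// stands in for an i16 compare that IRgen could emit; it is an
// illustration, not something the source language can force.
bool eq_narrow(short x) {
  return static_cast<std::uint16_t>(x) == static_cast<std::uint16_t>(10);
}

// Exhaustively compares the two forms over the whole range of short.
bool narrow_matches_widened_for_all_shorts() {
  for (int v = SHRT_MIN; v <= SHRT_MAX; ++v) {
    short x = static_cast<short>(v);
    if (eq_widened(x) != eq_narrow(x))
      return false;
  }
  return true;
}
```

If the constant did not fit in the narrow type, the comparison could be
folded to false instead, so the narrowing only applies when the constant
is representable.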

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then it is a lot shorter to just load the char
directly.
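
A small sketch of the two access paths, under an assumed layout: a 32-bit
storage unit held as four bytes in little-endian order, with an 8-bit
field at bits 8..15. The struct and function names are hypothetical; real
bitfield layout is decided by the target ABI.

```cpp
#include <cstdint>

// Assumed storage unit: four bytes, least significant byte first.
struct Storage {
  std::uint8_t bytes[4];
};

// General path: load the whole unit, shift the field down, mask it off.
std::uint8_t read_field_shift_mask(const Storage &s) {
  std::uint32_t word = static_cast<std::uint32_t>(s.bytes[0]) |
                       (static_cast<std::uint32_t>(s.bytes[1]) << 8) |
                       (static_cast<std::uint32_t>(s.bytes[2]) << 16) |
                       (static_cast<std::uint32_t>(s.bytes[3]) << 24);
  return static_cast<std::uint8_t>((word >> 8) & 0xFFu);
}

// Shortcut: the field is exactly one byte wide and byte aligned, so a
// single byte load returns the same value with no shift or mask.
std::uint8_t read_field_byte_load(const Storage &s) { return s.bytes[1]; }
```

Both functions return the same value for any contents of the storage
unit; the second simply does less work.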

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments
for the common situation where the argument is never written to and
never has its address taken. The idea would be to begin generating code
by using the argument directly and, if its address is taken or it is
stored to, then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about here is -O0 -g compile-time
performance, and in that scenario we currently need to emit the alloca
anyway to produce proper debug info. So this is blocked on being able
to emit debug information which refers to an LLVM temporary, not an
alloca.
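
To make the distinction concrete, here are three hypothetical functions
(not from clang's test suite). Only the first has an argument that is
never stored to and never has its address taken, so it is the kind of
case where the incoming value could be used directly.

```cpp
// x is only read: no store to x and no &x anywhere in the body, so a
// stack slot for x is not needed for correctness.
int scale(int x) { return x * 3 + 1; }

// n is stored to (--n), so it needs a mutable home such as a stack slot.
int sum_to(int n) {
  int total = 0;
  while (n > 0) {
    total += n;
    --n;
  }
  return total;
}

// Helper that writes through a pointer.
static void bump(int *p) { *p += 1; }

// The address of x escapes to bump(), so x must live in memory.
int bumped(int x) {
  bump(&x);
  return x;
}
```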

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks which contain only
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead) down through code generation and assembly
time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!
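
A toy sketch of what counting and bypassing such blocks could look like.
The Block type below is a hypothetical stand-in, not LLVM's BasicBlock,
and the functions only illustrate the idea on a vector of blocks.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy CFG node (an assumption for illustration only).
struct Block {
  std::vector<std::string> body;   // non-terminator instructions
  bool unconditional;              // terminator is a plain direct branch
  std::vector<std::size_t> succs;  // indices of successor blocks
};

// A block that does nothing except branch to one successor.
bool is_jump_only(const Block &b) {
  return b.body.empty() && b.unconditional && b.succs.size() == 1;
}

// Number of blocks that are just direct branches.
std::size_t count_jump_only(const std::vector<Block> &cfg) {
  std::size_t n = 0;
  for (const Block &b : cfg)
    if (is_jump_only(b))
      ++n;
  return n;
}

// Follow a chain of jump-only blocks to the first block that does real
// work. The step bound keeps this finite on a cycle of empty blocks.
std::size_t final_target(const std::vector<Block> &cfg, std::size_t i) {
  std::size_t steps = 0;
  while (is_jump_only(cfg[i]) && steps++ < cfg.size())
    i = cfg[i].succs[0];
  return i;
}

// Retarget every edge so that it skips over jump-only blocks.
void thread_jumps(std::vector<Block> &cfg) {
  for (Block &b : cfg)
    for (std::size_t &s : b.succs)
      s = final_target(cfg, s);
}
```

This cleans up after the fact; the note above is about not creating such
blocks in IRgen in the first place, which would also save the allocation
cost.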

//===---------------------------------------------------------------------===//