llvm-project/clang/lib/CodeGen
Ulrich Weigand fa80642205 Allow targets to define minimum alignment for global variables
This patch adds a new common code feature that allows platform code to
request minimum alignment of global symbols.  The background for this is
that on SystemZ, the most efficient way to load addresses of global symbol
is the LOAD ADDRESS RELATIVE LONG (LARL) instruction.  This instruction
provides PC-relative addressing, but only to *even* addresses.  For this
reason, existing compilers will guarantee that global symbols are always
aligned to at least 2.  [ Since symbols would otherwise already use a
default alignment based on their type, this will usually only affect global
objects of character type or character arrays. ]  GCC also allows creating
symbols without that extra alignment by using explicit "aligned" attributes
(which then need to be used on both definition and each use of the symbol).

To enable support for this with Clang, this patch adds a
TargetInfo::MinGlobalAlign variable that provides a global minimum for the
alignment of every global object (unless overridden via explicit alignment
attribute), and adds code to respect this setting.  Within this patch, no
platform actually sets the value to anything but the default 1, resulting
in no change in behaviour on any existing target.

This version of the patch incorporates feedback from reviews by
Eric Christopher and John McCall.  Thanks to all reviewers!

Patch by Richard Sandiford.

llvm-svn: 181210
2013-05-06 16:23:57 +00:00
..
ABIInfo.h Standardize accesses to the TargetInfo in IR-gen. 2013-04-16 22:48:15 +00:00
BackendUtil.cpp Plumb through the -fsplit-stack option using the existing backend 2013-04-04 06:29:47 +00:00
CGAtomic.cpp Standardize accesses to the TargetInfo in IR-gen. 2013-04-16 22:48:15 +00:00
CGBlocks.cpp Correctly emit certain implicit references to 'self' even within 2013-05-03 07:33:41 +00:00
CGBlocks.h Remove useless 'llvm::' qualifier from names like StringRef and others that are 2013-01-12 19:30:44 +00:00
CGBuilder.h Rewrite #includes for llvm/Foo.h to llvm/IR/Foo.h as appropriate to 2013-01-02 11:45:17 +00:00
CGBuiltin.cpp AArch64: teach Clang about __clear_cache intrinsic 2013-05-04 07:15:13 +00:00
CGCUDANV.cpp Use the actual ABI-determined C calling convention for runtime 2013-02-28 19:01:20 +00:00
CGCUDARuntime.cpp Sort all of Clang's files under 'lib', and fix up the broken headers 2012-12-04 09:13:33 +00:00
CGCUDARuntime.h CUDA: IR generation support for device stubs 2011-10-06 18:51:56 +00:00
CGCXX.cpp Better support for constructors with -cxx-abi microsoft, partly fixes PR12784 2013-02-27 13:46:31 +00:00
CGCXXABI.cpp Implement CodeGen for C++11 thread_local, following the Itanium ABI specification as discussed on cxx-abi-dev. 2013-04-19 16:42:07 +00:00
CGCXXABI.h Implement CodeGen for C++11 thread_local, following the Itanium ABI specification as discussed on cxx-abi-dev. 2013-04-19 16:42:07 +00:00
CGCall.cpp Replace ArrayRef<T>() with None, now that we have an implicit ArrayRef constructor from None 2013-05-05 00:41:58 +00:00
CGCall.h Under ARC, when we're passing the address of a strong variable 2013-03-23 02:35:54 +00:00
CGClass.cpp Correctly emit certain implicit references to 'self' even within 2013-05-03 07:33:41 +00:00
CGCleanup.cpp Reapply r180982 with repaired logic and an additional testcase. 2013-05-03 20:11:48 +00:00
CGCleanup.h Documentation cleanup: 2012-06-15 22:10:14 +00:00
CGDebugInfo.cpp Revert 180817 because 180816 was reverted. 2013-04-30 22:45:09 +00:00
CGDebugInfo.h Revert "Revert "PR14606: Debug info for using directives/DW_TAG_imported_module"" 2013-04-22 06:13:21 +00:00
CGDecl.cpp Revert "Revert "PR14606: Debug info for using directives/DW_TAG_imported_module"" 2013-04-22 06:13:21 +00:00
CGDeclCXX.cpp Revert r180739 and r180748: they broke C++11 thread_local on non-Darwin systems and did not do the right thing on Darwin. 2013-04-30 21:34:13 +00:00
CGException.cpp Change hasAggregateLLVMType, which conflates complex and 2013-03-07 21:37:08 +00:00
CGExpr.cpp Correctly emit certain implicit references to 'self' even within 2013-05-03 07:33:41 +00:00
CGExprAgg.cpp C++1y: Allow aggregates to have default initializers. 2013-04-20 22:23:05 +00:00
CGExprCXX.cpp Tighten up the rules for precise lifetime and document 2013-03-13 03:10:54 +00:00
CGExprComplex.cpp C++1y: Allow aggregates to have default initializers. 2013-04-20 22:23:05 +00:00
CGExprConstant.cpp C++1y: Allow aggregates to have default initializers. 2013-04-20 22:23:05 +00:00
CGExprScalar.cpp C++1y: Allow aggregates to have default initializers. 2013-04-20 22:23:05 +00:00
CGObjC.cpp Correctly emit certain implicit references to 'self' even within 2013-05-03 07:33:41 +00:00
CGObjCGNU.cpp Use the actual ABI-determined C calling convention for runtime 2013-02-28 19:01:20 +00:00
CGObjCMac.cpp Use the ugly PRIx64 macro to make format string portable. 2013-04-22 16:10:38 +00:00
CGObjCRuntime.cpp Standardize accesses to the TargetInfo in IR-gen. 2013-04-16 22:48:15 +00:00
CGObjCRuntime.h Use the actual ABI-determined C calling convention for runtime 2013-02-28 19:01:20 +00:00
CGOpenCLRuntime.cpp Add OpenCL samplers as Clang builtin types and check sampler related restrictions. 2013-02-07 10:55:47 +00:00
CGOpenCLRuntime.h Rewrite #includes for llvm/Foo.h to llvm/IR/Foo.h as appropriate to 2013-01-02 11:45:17 +00:00
CGRTTI.cpp Don't treat a non-deduced 'auto' type as being type-dependent. Instead, there 2013-04-30 13:56:41 +00:00
CGRecordLayout.h Rewrite #includes for llvm/Foo.h to llvm/IR/Foo.h as appropriate to 2013-01-02 11:45:17 +00:00
CGRecordLayoutBuilder.cpp Standardize accesses to the TargetInfo in IR-gen. 2013-04-16 22:48:15 +00:00
CGStmt.cpp Reapply r180982 with repaired logic and an additional testcase. 2013-05-03 20:11:48 +00:00
CGVTT.cpp simplify a bunch of code to use the well-known LLVM IR types computed by CodeGenModule. 2012-02-07 00:39:47 +00:00
CGVTables.cpp Change hasAggregateLLVMType, which conflates complex and 2013-03-07 21:37:08 +00:00
CGVTables.h The standard ARM C++ ABI dictates that inline functions are 2013-01-25 22:31:03 +00:00
CGValue.h Initial support for struct-path aware TBAA. 2013-04-04 21:53:22 +00:00
CMakeLists.txt The IRReader header is now part of its own library. Update the include 2013-03-26 02:25:54 +00:00
CodeGenAction.cpp The IRReader header is now part of its own library. Update the include 2013-03-26 02:25:54 +00:00
CodeGenFunction.cpp Reapply r180982 with repaired logic and an additional testcase. 2013-05-03 20:11:48 +00:00
CodeGenFunction.h AArch64: teach Clang about __clear_cache intrinsic 2013-05-04 07:15:13 +00:00
CodeGenModule.cpp Allow targets to define minimum alignment for global variables 2013-05-06 16:23:57 +00:00
CodeGenModule.h Allow targets to define minimum alignment for global variables 2013-05-06 16:23:57 +00:00
CodeGenTBAA.cpp Struct-path aware TBAA: enable struct-path aware TBAA for classes. 2013-04-30 17:38:09 +00:00
CodeGenTBAA.h Struct-path aware TBAA: uniformize scalar tag and path tag. 2013-04-11 23:02:56 +00:00
CodeGenTypes.cpp Don't treat a non-deduced 'auto' type as being type-dependent. Instead, there 2013-04-30 13:56:41 +00:00
CodeGenTypes.h Standardize accesses to the TargetInfo in IR-gen. 2013-04-16 22:48:15 +00:00
ItaniumCXXABI.cpp Use the Itanium ABI for thread_local on Darwin. 2013-05-02 19:18:03 +00:00
Makefile
MicrosoftCXXABI.cpp [ms-cxxabi] Emit non-virtual member function pointers 2013-05-03 01:15:11 +00:00
ModuleBuilder.cpp Don't propagate around TargetOptions in IR-gen; we don't use it. 2013-04-16 22:48:20 +00:00
README.txt
TargetInfo.cpp Set SRet flags properly in '-cxx-abi microsoft'. 2013-04-17 12:54:10 +00:00
TargetInfo.h Fix the required args count for variadic blocks. 2012-12-07 07:03:17 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//