llvm-project/clang/lib/CodeGen
Sanjay Patel c81450e29b [Driver, CodeGen] rename options to disable an FP cast optimization
As suggested in the post-commit thread for rL331056, we should match these 
clang options with the established vocabulary of the corresponding sanitizer
option. Also, the use of 'strict' is well-known for these kinds of knobs, 
and we can improve the descriptive text in the docs.

So this intends to match the logic of D46135 but only change the words.
Matching LLVM commit to match this spelling of the attribute to follow shortly.

Differential Revision: https://reviews.llvm.org/D46236

llvm-svn: 331209
2018-04-30 18:19:03 +00:00
..
ABIInfo.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
Address.h
BackendUtil.cpp Fix build break due to content moving from Scalar.h to InstCombine.h in LLVM 2018-04-24 00:59:22 +00:00
CGAtomic.cpp [Atomics] warn about atomic accesses using libcalls 2018-04-23 08:16:24 +00:00
CGBlocks.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
CGBlocks.h [CodeGen][ObjC] Block captures should inherit the type of the captured 2016-09-16 00:02:06 +00:00
CGBuilder.h Change memcpy/memove/memset to have dest and source alignment attributes. 2018-01-28 17:27:45 +00:00
CGBuiltin.cpp [ARM,AArch64] Add intrinsics for dot product instructions 2018-04-27 14:03:32 +00:00
CGCUDANV.cpp [HIP] Add hip input kind and codegen for kernel launching 2018-04-25 01:10:37 +00:00
CGCUDARuntime.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCUDARuntime.h
CGCXX.cpp [MS] Always use base dtors in place of complete/vbase dtors when possible 2018-03-16 19:40:50 +00:00
CGCXXABI.cpp [MS] Always use base dtors in place of complete/vbase dtors when possible 2018-03-16 19:40:50 +00:00
CGCXXABI.h [MS] Always use base dtors in place of complete/vbase dtors when possible 2018-03-16 19:40:50 +00:00
CGCall.cpp [Driver, CodeGen] rename options to disable an FP cast optimization 2018-04-30 18:19:03 +00:00
CGCall.h Recommit r326946 after reducing CallArgList memory footprint 2018-03-15 15:25:19 +00:00
CGClass.cpp PR36992: do not store beyond the dsize of a class object unless we know 2018-04-05 20:52:58 +00:00
CGCleanup.cpp [CodeGen] Avoid destructing a callee-destructued struct type in a 2018-04-27 06:57:00 +00:00
CGCleanup.h Use the correct ObjC EH personality 2017-01-08 22:58:07 +00:00
CGCoroutine.cpp Remove redundant casts. NFC 2018-03-01 05:43:23 +00:00
CGDebugInfo.cpp [CodeGen] Reland r330442: Add an option to suppress output of llvm.ident 2018-04-23 10:08:46 +00:00
CGDebugInfo.h [CodeView] Initial support for emitting S_THUNK32 symbols for compiler... 2018-04-16 16:53:57 +00:00
CGDecl.cpp [CodeGen] Avoid destructing a callee-destructued struct type in a 2018-04-27 06:57:00 +00:00
CGDeclCXX.cpp Add a command line option 'fregister_global_dtors_with_atexit' to 2018-04-17 18:41:52 +00:00
CGException.cpp [MS] Don't escape MS C++ names with \01 2018-03-16 20:36:49 +00:00
CGExpr.cpp [CodeGen] Handle __func__ inside __finally 2018-04-11 18:17:35 +00:00
CGExprAgg.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
CGExprCXX.cpp PR36992: do not store beyond the dsize of a class object unless we know 2018-04-05 20:52:58 +00:00
CGExprComplex.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
CGExprConstant.cpp [CodeGen] Use the zero initializer instead of storing an all zero representation. 2018-02-09 22:10:09 +00:00
CGExprScalar.cpp Clean carriage returns from lib/ and include/. NFC. 2018-04-16 08:31:08 +00:00
CGGPUBuiltin.cpp Recommit r326946 after reducing CallArgList memory footprint 2018-03-15 15:25:19 +00:00
CGLoopInfo.cpp [CodeGen] Pass objects that are expensive to copy by const ref. 2016-11-24 16:01:20 +00:00
CGLoopInfo.h [CodeGen] Pass objects that are expensive to copy by const ref. 2016-11-24 16:01:20 +00:00
CGNonTrivialStruct.cpp Move the visitor classes that are used to traverse non-trivial C structs 2018-04-17 19:05:17 +00:00
CGObjC.cpp PR36992: do not store beyond the dsize of a class object unless we know 2018-04-05 20:52:58 +00:00
CGObjCGNU.cpp ObjCGNU: Fix empty v3 protocols being emitted two fields short 2018-04-12 06:46:15 +00:00
CGObjCMac.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
CGObjCRuntime.cpp [CodeGen] Propagate may-alias'ness of lvalues with TBAA info 2017-10-31 11:05:34 +00:00
CGObjCRuntime.h Clean up CGObjCMac's APIs for deriving class references. NFC. 2016-11-30 23:54:50 +00:00
CGOpenCLRuntime.cpp [OpenCL] Add separate read_only and write_only pipe IR types 2018-04-27 10:37:04 +00:00
CGOpenCLRuntime.h [OpenCL] Add separate read_only and write_only pipe IR types 2018-04-27 10:37:04 +00:00
CGOpenMPRuntime.cpp [OPENMP] Do not crash on codegen for CXX member functions. 2018-04-30 18:09:40 +00:00
CGOpenMPRuntime.h [OPENMP] General code improvements. 2018-04-16 17:59:34 +00:00
CGOpenMPRuntimeNVPTX.cpp [OPENMP] Do not cast captured by value variables with pointer types in 2018-04-23 17:33:41 +00:00
CGOpenMPRuntimeNVPTX.h [OPENMP] General code improvements. 2018-04-16 20:16:21 +00:00
CGRecordLayout.h
CGRecordLayoutBuilder.cpp [NFC] Fix terrible formatting of CGRecordLower constructor. 2018-04-12 20:46:31 +00:00
CGStmt.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
CGStmtOpenMP.cpp [OPENMP] Replace push_back by emplace_back, NFC. 2018-04-13 17:48:43 +00:00
CGVTT.cpp Recommit r324107 again. 2018-02-07 22:15:33 +00:00
CGVTables.cpp [MS] Fix unprototyped thunk emission for incomplete return types 2018-04-18 23:21:32 +00:00
CGVTables.h [MS] Emit vftable thunks for functions with incomplete prototypes 2018-04-02 20:20:33 +00:00
CGValue.h PR36992: do not store beyond the dsize of a class object unless we know 2018-04-05 20:52:58 +00:00
CMakeLists.txt Link to AggressiveInstCombine in a few places. Unbreaks build for me. 2018-04-24 08:40:44 +00:00
CodeGenABITypes.cpp Include getting generated struct offsets in CodegenABITypes 2017-10-10 23:54:21 +00:00
CodeGenAction.cpp Use special new Clang flag 'FrontendTimesIsEnabled' instead of 'llvm::TimePassesIsEnabled' inside -ftime-report feature. 2018-04-23 09:22:30 +00:00
CodeGenFunction.cpp [XRay] Add clang builtin for xray typed events. 2018-04-17 21:32:43 +00:00
CodeGenFunction.h [CodeGen] Avoid destructing a callee-destructued struct type in a 2018-04-27 06:57:00 +00:00
CodeGenModule.cpp Revert rC330794 and some dependent tiny bug fixes 2018-04-26 00:42:40 +00:00
CodeGenModule.h Add a command line option 'fregister_global_dtors_with_atexit' to 2018-04-17 18:41:52 +00:00
CodeGenPGO.cpp Mark all library options as hidden. 2017-12-01 00:53:10 +00:00
CodeGenPGO.h Remove a dead field. NFC. 2017-04-24 20:54:36 +00:00
CodeGenTBAA.cpp [CodeGen] Fix generation of TBAA tags for may-alias accesses 2018-02-20 12:33:04 +00:00
CodeGenTBAA.h [CodeGen] Fix generation of TBAA tags for may-alias accesses 2018-02-20 12:33:04 +00:00
CodeGenTypeCache.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
CodeGenTypes.cpp Remove redundant casts. NFC 2018-03-01 05:43:23 +00:00
CodeGenTypes.h [MS] Emit vftable thunks for functions with incomplete prototypes 2018-04-02 20:20:33 +00:00
ConstantEmitter.h Convert clang::LangAS to a strongly typed enum 2017-10-15 18:48:14 +00:00
ConstantInitBuilder.cpp Further fixes and improvements to the ConstantInitBuilder API. 2017-03-06 19:04:16 +00:00
CoverageMappingGen.cpp PR37189 Fix incorrect end source location and spelling for a split '>>' token. 2018-04-30 05:25:48 +00:00
CoverageMappingGen.h [Lexer] Report more precise skipped regions (PR34166) 2017-09-11 20:47:42 +00:00
EHScopeStack.h Spelling mistakes in comments. NFCI. (PR27635) 2017-03-30 14:13:19 +00:00
ItaniumCXXABI.cpp Add a command line option 'fregister_global_dtors_with_atexit' to 2018-04-17 18:41:52 +00:00
MacroPPCallbacks.cpp [NFC] Refactor the Preprocessor function that handles Macro definitions and rename Arguments to Parameters in Macro Definitions. 2017-07-17 17:18:43 +00:00
MacroPPCallbacks.h Fix API breaks 2017-04-26 20:58:21 +00:00
MicrosoftCXXABI.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
ModuleBuilder.cpp D34444: Teach codegen to work in incremental processing mode. 2017-08-27 10:58:03 +00:00
ObjectFilePCHContainerOperations.cpp -gmodules: Emit debug info for implicit module imports via #include. 2018-01-03 19:10:21 +00:00
README.txt
SanitizerMetadata.cpp hwasan: add -fsanitize=kernel-hwaddress flag 2018-04-13 18:05:21 +00:00
SanitizerMetadata.h
SwiftCallingConv.cpp Generalize the swiftcall API since being passed indirectly isn't 2018-04-07 20:16:47 +00:00
TargetInfo.cpp [CUDA] Set LLVM calling convention for CUDA kernel 2018-04-20 17:01:03 +00:00
TargetInfo.h [CUDA] Set LLVM calling convention for CUDA kernel 2018-04-20 17:01:03 +00:00
VarBypassDetector.cpp Fix typos in clang 2018-04-06 15:14:32 +00:00
VarBypassDetector.h [CodeGen] Don't emit lifetime intrinsics for some local variables 2016-10-26 05:42:30 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//