llvm-project/clang/lib/CodeGen
Lang Hames aa53b936ec Add -ffp-contract = { fast | on | off } command line option support.
This flag sets the 'fp-contract' mode, which controls the formation of fused
floating point operations. Available modes are:

- Fast: Form fused operations anywhere. 
- On: Form fused operations where allowed by FP_CONTRACT. This is the default
      mode.
- Off: Don't form fused operations (in future this may be relaxed to forming
       fused operations where it can be proved that the result won't be
       affected).

Currently clang doesn't support the FP_CONTRACT pragma, so the 'On' and 'Off'
modes are equivalent.

llvm-svn: 159794
2012-07-06 00:59:19 +00:00
..
ABIInfo.h Add field PaddingType to ABIArgInfo which specifies the type of padding that 2012-01-07 00:25:33 +00:00
BackendUtil.cpp Add -ffp-contract = { fast | on | off } command line option support. 2012-07-06 00:59:19 +00:00
CGBlocks.cpp block literal irgen: several improvements on naming block 2012-06-26 16:06:38 +00:00
CGBlocks.h Whenever explicitly activating or deactivating a cleanup, we 2011-11-10 10:43:54 +00:00
CGBuilder.h Update Clang to reflect the new home of IRBuilder.h as of r159421. 2012-06-29 12:39:08 +00:00
CGBuiltin.cpp Dead code eliminate the massive hexagon builtin intrinsic supporting code. 2012-06-28 20:08:55 +00:00
CGCUDANV.cpp CUDA: IR generation support for device stubs 2011-10-06 18:51:56 +00:00
CGCUDARuntime.cpp CUDA: IR generation support for kernel call expressions 2011-10-06 18:29:37 +00:00
CGCUDARuntime.h CUDA: IR generation support for device stubs 2011-10-06 18:51:56 +00:00
CGCXX.cpp Remove get(V)BaseClassOffsetInBits, the CharUnit functions should be used instead. 2012-07-04 18:45:14 +00:00
CGCXXABI.cpp Fix PR13234 - crash when trying to report an unsupported ABI feature 2012-06-29 01:14:21 +00:00
CGCXXABI.h IRGen: Factor v-table generation into the CGCXXABI object. 2012-06-23 23:44:00 +00:00
CGCall.cpp revert CodeGen support for the alloc_size attribute until we finish the design of a more generic metadata node 2012-06-22 18:01:38 +00:00
CGCall.h Whether an argument is required (in contrast with being an 2012-02-17 03:33:10 +00:00
CGClass.cpp Revert Decl's iterators back to pointer value_type rather than reference value_type 2012-06-06 20:45:41 +00:00
CGCleanup.cpp llvm::SwitchInst 2012-03-11 06:09:37 +00:00
CGCleanup.h Documentation cleanup: 2012-06-15 22:10:14 +00:00
CGDebugInfo.cpp Remove get(V)BaseClassOffsetInBits, the CharUnit functions should be used instead. 2012-07-04 18:45:14 +00:00
CGDebugInfo.h Drop the ASTContext.h include from Stmt.h and fix up transitive users. 2012-07-04 17:04:04 +00:00
CGDecl.cpp Add -ftls-model command-line flag. 2012-06-28 08:01:44 +00:00
CGDeclCXX.cpp [asan] add missing asan instrumentation in generated global init functions 2012-06-26 08:56:33 +00:00
CGException.cpp Rename the GCC Objective-C runtime to gcc from gnu-fragile and the GNUstep 2012-07-03 20:49:52 +00:00
CGExpr.cpp Silence warning in -Asserts build 2012-07-03 03:55:58 +00:00
CGExprAgg.cpp Significantly simplify CGExprAgg's logic about ignored results: 2012-07-02 23:58:38 +00:00
CGExprCXX.cpp Compare the canonical types and document why we give up on the covariant case. 2012-06-28 17:57:36 +00:00
CGExprComplex.cpp Unify naming of LangOptions variable/get function across the Clang stack (Lex to AST). 2012-03-11 07:00:24 +00:00
CGExprConstant.cpp Remove get(V)BaseClassOffsetInBits, the CharUnit functions should be used instead. 2012-07-04 18:45:14 +00:00
CGExprScalar.cpp Remove get(V)BaseClassOffsetInBits, the CharUnit functions should be used instead. 2012-07-04 18:45:14 +00:00
CGObjC.cpp Documentation cleanup: Escape @objc* in Doxygen comments. 2012-06-22 05:41:30 +00:00
CGObjCGNU.cpp Rename the GCC Objective-C runtime to gcc from gnu-fragile and the GNUstep 2012-07-03 20:49:52 +00:00
CGObjCMac.cpp Rename the GCC Objective-C runtime to gcc from gnu-fragile and the GNUstep 2012-07-03 20:49:52 +00:00
CGObjCRuntime.cpp Propagate lvalue alignment into bitfields. Per report on cfe-dev. 2012-06-27 21:19:48 +00:00
CGObjCRuntime.h Documentation cleanup: 2012-06-15 22:10:14 +00:00
CGOpenCLRuntime.cpp OpenCL: introduce support for function scope __local variables 2011-09-19 21:14:35 +00:00
CGOpenCLRuntime.h OpenCL: introduce support for function scope __local variables 2011-09-19 21:14:35 +00:00
CGRTTI.cpp Restructure how the driver communicates information about the 2012-06-20 06:18:46 +00:00
CGRecordLayout.h Propagate lvalue alignment into bitfields. Per report on cfe-dev. 2012-06-27 21:19:48 +00:00
CGRecordLayoutBuilder.cpp Propagate lvalue alignment into bitfields. Per report on cfe-dev. 2012-06-27 21:19:48 +00:00
CGStmt.cpp [ms-style asm] Change the fatal error to an extension warning. Apparently, this 2012-06-20 18:28:37 +00:00
CGVTT.cpp simplify a bunch of code to use the well-known LLVM IR types computed by CodeGenModule. 2012-02-07 00:39:47 +00:00
CGVTables.cpp Fix a use-after-free in thunk emission. EmitThunk may call RAUW on Init, invalidating the pointer. 2012-03-20 20:18:13 +00:00
CGVTables.h Move VTable builder to AST 2011-09-26 01:57:12 +00:00
CGValue.h Significantly simplify CGExprAgg's logic about ignored results: 2012-07-02 23:58:38 +00:00
CMakeLists.txt Implement AST classes for comments, a real parser for Doxygen comments and a 2012-07-06 00:28:32 +00:00
CodeGenAction.cpp Escape % in diagnostic message when compiling LLVM IR. 2012-03-16 22:31:42 +00:00
CodeGenFunction.cpp block literal irgen: several improvements on naming block 2012-06-26 16:06:38 +00:00
CodeGenFunction.h Significantly simplify CGExprAgg's logic about ignored results: 2012-07-02 23:58:38 +00:00
CodeGenModule.cpp Rename the GCC Objective-C runtime to gcc from gnu-fragile and the GNUstep 2012-07-03 20:49:52 +00:00
CodeGenModule.h Add -ftls-model command-line flag. 2012-06-28 08:01:44 +00:00
CodeGenTBAA.cpp Remove unused private member variables found by clang's new -Wunused-private-field. 2012-06-06 17:32:50 +00:00
CodeGenTBAA.h Remove unused private member variables found by clang's new -Wunused-private-field. 2012-06-06 17:32:50 +00:00
CodeGenTypes.cpp Fix some i1/i8 confusion within _Atomic(bool) in IR generation, both 2012-04-12 20:42:30 +00:00
CodeGenTypes.h Whether an argument is required (in contrast with being an 2012-02-17 03:33:10 +00:00
ItaniumCXXABI.cpp IRGen: Factor v-table generation into the CGCXXABI object. 2012-06-23 23:44:00 +00:00
Makefile BUILD_ARCHIVE is the default for libraries, no need to set it. 2010-07-18 00:14:47 +00:00
MicrosoftCXXABI.cpp IRGen: Factor v-table generation into the CGCXXABI object. 2012-06-23 23:44:00 +00:00
ModuleBuilder.cpp Unify naming of LangOptions variable/get function across the Clang stack (Lex to AST). 2012-03-11 07:00:24 +00:00
README.txt
TargetInfo.cpp Remove get(V)BaseClassOffsetInBits, the CharUnit functions should be used instead. 2012-07-04 18:45:14 +00:00
TargetInfo.h Whether an argument is required (in contrast with being an 2012-02-17 03:33:10 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//