llvm-project/clang/lib/CodeGen
Vedant Kumar 77dfca88b2 [CodeGen] Handle mixed-width ops in mixed-sign mul-with-overflow lowering
The special lowering for __builtin_mul_overflow introduced in r320902
fixed an ICE seen when passing mixed-sign operands to the builtin.

This patch extends the special lowering to cover mixed-width, mixed-sign
operands. In a few common scenarios, calls to muloti4 will no longer be
emitted.

This should address the latest comments in PR34920 and work around the
link failure seen in:

  https://bugzilla.redhat.com/show_bug.cgi?id=1657544

Testing:
- check-clang
- A/B output comparison with: https://gist.github.com/vedantk/3eb9c88f82e5c32f2e590555b4af5081

Differential Revision: https://reviews.llvm.org/D55843

llvm-svn: 349542
2018-12-18 21:05:03 +00:00
..
ABIInfo.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
Address.h
BackendUtil.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGAtomic.cpp Remove CodeGen dependencies on Sema. 2018-12-06 06:12:20 +00:00
CGBlocks.cpp Misc typos fixes in ./lib folder 2018-12-10 12:37:46 +00:00
CGBlocks.h Fix the type of 1<<31 integer constants. 2018-09-24 17:51:15 +00:00
CGBuilder.h Remove trailing space 2018-07-30 19:24:48 +00:00
CGBuiltin.cpp [CodeGen] Handle mixed-width ops in mixed-sign mul-with-overflow lowering 2018-12-18 21:05:03 +00:00
CGCUDANV.cpp [CUDA] Use all 64 bits of GUID in __nv_module_id 2018-10-05 18:39:58 +00:00
CGCUDARuntime.cpp Refactor call emission to package the function pointer together with 2016-10-26 23:46:34 +00:00
CGCUDARuntime.h
CGCXX.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGCXXABI.cpp Remove trailing space 2018-07-30 19:24:48 +00:00
CGCXXABI.h [WebAssembly] Use Windows EH instructions for Wasm EH 2018-05-31 22:18:13 +00:00
CGCall.cpp [OpenCL] Add generic AS to 'this' pointer 2018-12-13 10:15:27 +00:00
CGCall.h [NFC] Move storage of dispatch-version to GlobalDecl 2018-11-13 15:48:08 +00:00
CGClass.cpp [OpenCL] Add generic AS to 'this' pointer 2018-12-13 10:15:27 +00:00
CGCleanup.cpp [TI removal] Make `getTerminator()` return a generic `Instruction`. 2018-10-15 10:42:50 +00:00
CGCleanup.h Remove trailing space 2018-07-30 19:24:48 +00:00
CGCoroutine.cpp Port getLocStart -> getBeginLoc 2018-08-09 21:08:08 +00:00
CGDebugInfo.cpp Reinstate DW_AT_comp_dir support after D55519. 2018-12-13 17:53:29 +00:00
CGDebugInfo.h Remove CGDebugInfo::getOrCreateFile() and use TheCU->getFile() directly. 2018-12-11 16:58:46 +00:00
CGDecl.cpp Automatic variable initialization 2018-12-18 05:12:21 +00:00
CGDeclCXX.cpp [OpenCL] Add generic AS to 'this' pointer 2018-12-13 10:15:27 +00:00
CGException.cpp [NFC] Move storage of dispatch-version to GlobalDecl 2018-11-13 15:48:08 +00:00
CGExpr.cpp [OpenCL] Add generic AS to 'this' pointer 2018-12-13 10:15:27 +00:00
CGExprAgg.cpp Compound literals, enums, et al require const expr 2018-11-09 00:41:36 +00:00
CGExprCXX.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGExprComplex.cpp Compound literals, enums, et al require const expr 2018-11-09 00:41:36 +00:00
CGExprConstant.cpp Correct indentation. 2018-12-01 09:06:26 +00:00
CGExprScalar.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGGPUBuiltin.cpp Recommit r326946 after reducing CallArgList memory footprint 2018-03-15 15:25:19 +00:00
CGLoopInfo.cpp Move LoopHint.h from Sema to Parse 2018-11-28 04:36:31 +00:00
CGLoopInfo.h [UnrollAndJam] Add unroll_and_jam pragma handling 2018-08-01 14:36:12 +00:00
CGNonTrivialStruct.cpp [NFC] Move storage of dispatch-version to GlobalDecl 2018-11-13 15:48:08 +00:00
CGObjC.cpp Generate objc intrinsics instead of runtime calls as the ARC optimizer now works only on intrinsics 2018-12-18 20:33:00 +00:00
CGObjCGNU.cpp llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...) 2018-09-26 22:16:28 +00:00
CGObjCMac.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGObjCRuntime.cpp Fix clang -Wimplicit-fallthrough warnings across llvm, NFC 2018-11-01 19:54:45 +00:00
CGObjCRuntime.h [CodeGen] Merge identical block descriptor global variables. 2018-08-17 15:46:07 +00:00
CGOpenCLRuntime.cpp [OpenCL] Add support of cl_intel_device_side_avc_motion_estimation extension 2018-11-08 11:25:41 +00:00
CGOpenCLRuntime.h Revert r326937 "[OpenCL] Remove block invoke function from emitted block literal struct" 2018-10-02 13:02:24 +00:00
CGOpenMPRuntime.cpp [OPENMP][NVPTX]Mark __kmpc_barrier functions as convergent. 2018-12-04 15:03:25 +00:00
CGOpenMPRuntime.h Misc typos fixes in ./lib folder 2018-12-10 12:37:46 +00:00
CGOpenMPRuntimeNVPTX.cpp [OPENMP][NVPTX]Emit shared memory buffer for reduction as 128 bytes 2018-12-18 21:01:42 +00:00
CGOpenMPRuntimeNVPTX.h [OPENMP][NVPTX]Mark __kmpc_barrier functions as convergent. 2018-12-04 15:03:25 +00:00
CGRecordLayout.h Remove \brief commands from doxygen comments. 2018-05-09 01:00:01 +00:00
CGRecordLayoutBuilder.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGStmt.cpp Remove CodeGen dependencies on Sema. 2018-12-06 06:12:20 +00:00
CGStmtOpenMP.cpp Misc typos fixes in ./lib folder 2018-12-10 12:37:46 +00:00
CGVTT.cpp [CodeGen] Align rtti and vtable data 2018-09-12 14:09:06 +00:00
CGVTables.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CGVTables.h Remove trailing space 2018-07-30 19:24:48 +00:00
CGValue.h [OpenCL] Add generic AS to 'this' pointer 2018-12-13 10:15:27 +00:00
CMakeLists.txt [CodeGen] Fix -DBUILD_SHARED_LIBS=on build after rC348907 2018-12-12 06:07:33 +00:00
CodeGenABITypes.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CodeGenAction.cpp Reapply "Avoid emitting redundant or unusable directories in DIFile metadata entries."" 2018-12-06 18:44:50 +00:00
CodeGenFunction.cpp [NFC] Fix usage of Builder.insert(new Bitcast...)in CodeGenFunction 2018-12-18 16:22:21 +00:00
CodeGenFunction.h Remove unused Args parameter from EmitFunctionBody, NFC 2018-12-13 01:33:20 +00:00
CodeGenModule.cpp Implement -frecord-command-line (-frecord-gcc-switches) 2018-12-14 15:38:15 +00:00
CodeGenModule.h Generate objc intrinsics instead of runtime calls as the ARC optimizer now works only on intrinsics 2018-12-18 20:33:00 +00:00
CodeGenPGO.cpp [cxx2a] P0614R1: Support init-statements in range-based for loops. 2018-09-28 18:44:09 +00:00
CodeGenPGO.h Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CodeGenTBAA.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
CodeGenTBAA.h [CodeGen] Fix generation of TBAA tags for may-alias accesses 2018-02-20 12:33:04 +00:00
CodeGenTypeCache.h Delete BuiltinCC. NFC. 2018-03-20 22:02:57 +00:00
CodeGenTypes.cpp [OpenCL] Add support of cl_intel_device_side_avc_motion_estimation extension 2018-11-08 11:25:41 +00:00
CodeGenTypes.h Remove unnecessary include. 2018-12-04 04:53:18 +00:00
ConstantEmitter.h Specify constant context in constant emitter 2018-12-01 08:29:36 +00:00
ConstantInitBuilder.cpp Remove trailing space 2018-07-30 19:24:48 +00:00
CoverageMappingGen.cpp [Coverage] Do not visit artificial stmts in defaulted methods (PR39822) 2018-11-28 20:48:07 +00:00
CoverageMappingGen.h Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
EHScopeStack.h Spelling mistakes in comments. NFCI. (PR27635) 2017-03-30 14:13:19 +00:00
ItaniumCXXABI.cpp Don't speculatively emit VTTs for classes unless we are able to correctly emit references to all the functions they will (directly or indirectly) reference. 2018-11-27 19:33:49 +00:00
MacroPPCallbacks.cpp [CodeGen] Fix included headers. 2018-11-28 04:14:29 +00:00
MacroPPCallbacks.h Add header guards to some headers that are missing them 2018-09-03 16:26:36 +00:00
MicrosoftCXXABI.cpp [NFC] Move storage of dispatch-version to GlobalDecl 2018-11-13 15:48:08 +00:00
ModuleBuilder.cpp [darwin] parse the SDK settings from SDKSettings.json if it exists and 2018-12-17 19:19:15 +00:00
ObjectFilePCHContainerOperations.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
README.txt
SanitizerMetadata.cpp hwasan: add -fsanitize=kernel-hwaddress flag 2018-04-13 18:05:21 +00:00
SanitizerMetadata.h
SwiftCallingConv.cpp In swiftcall, don't merge FP/vector types within a chunk. 2018-10-29 20:32:36 +00:00
TargetInfo.cpp Move CodeGenOptions from Frontend to Basic 2018-12-11 03:18:39 +00:00
TargetInfo.h [CUDA][HIP] Set kernel calling convention before arrange function 2018-06-12 00:16:33 +00:00
VarBypassDetector.cpp Fix clang -Wimplicit-fallthrough warnings across llvm, NFC 2018-11-01 19:54:45 +00:00
VarBypassDetector.h [CodeGen] Don't emit lifetime intrinsics for some local variables 2016-10-26 05:42:30 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//