llvm-project/clang/lib/CodeGen
Joseph Huber d12502a3ab [OpenMP] Apply OpenMP assumptions to applicable call sites
This patch adds OpenMP assumption attributes to call sites in applicable
regions. Currently this applies the caller's assumption attributes to
any calls contained within it. So, if a call occurs inside an OpenMP
assumes region to a function outside that region, we will assume that
call respects the assumptions. This is primarily useful for inline
assembly calls used heavily in the OpenMP GPU device runtime, which
allows us to then make judgements about what the ASM will do.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D110655
2021-09-29 16:08:21 -04:00
ABIInfo.h [ABI][NFC] Fix the confusion of ByVal and ByRef argument names 2020-08-06 15:20:18 +03:00
Address.h
BackendUtil.cpp [CSSPGO] Set PseudoProbeInserter as a default pass. 2021-09-22 09:09:48 -07:00
CGAtomic.cpp [OpaquePtr] Remove uses of CreateConstGEP1_64() without element type 2021-07-17 16:43:20 +02:00
CGBlocks.cpp [clang][NFC] GetOrCreateLLVMGlobal takes LangAS 2021-08-23 14:55:58 +02:00
CGBlocks.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGBuilder.h [OpaquePtr] Remove uses of CreateGEP() without element type 2021-07-17 22:56:27 +02:00
CGBuiltin.cpp [PowerPC] swdiv builtins for XL compatibility 2021-09-29 11:31:07 -05:00
CGCUDANV.cpp [OpaquePtr] Remove uses of CreateConstGEP1_32() without element type 2021-07-17 18:32:36 +02:00
CGCUDARuntime.cpp
CGCUDARuntime.h [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
CGCXX.cpp [OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type 2021-07-17 17:07:46 +02:00
CGCXXABI.cpp Fix PR35902: incorrect alignment used for ubsan check. 2020-12-28 18:11:17 -05:00
CGCXXABI.h [clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate 2021-01-12 19:44:01 +00:00
CGCall.cpp [OpenMP] Apply OpenMP assumptions to applicable call sites 2021-09-29 16:08:21 -04:00
CGCall.h Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. 2020-12-22 19:54:29 -05:00
CGClass.cpp Fix vtbl field addr space 2021-09-16 10:57:31 -04:00
CGCleanup.cpp [Windows SEH]: Fix -O2 crash for Windows -EHa 2021-06-04 14:07:44 -07:00
CGCleanup.h [XCOFF][AIX] Generate LSDA data and compact unwind section on AIX 2020-12-02 18:42:44 +00:00
CGCoroutine.cpp Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass" 2021-04-18 17:22:28 -07:00
CGDebugInfo.cpp DebugInfo: Use sugared function type when emitting function declarations for call sites 2021-09-28 10:44:35 -07:00
CGDebugInfo.h DebugInfo: Use sugared function type when emitting function declarations for call sites 2021-09-28 10:44:35 -07:00
CGDecl.cpp [clang] NFC: change uses of `Expr->getValueKind` into `is?Value` 2021-07-28 03:09:31 +02:00
CGDeclCXX.cpp PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members. 2021-08-24 19:53:44 -07:00
CGException.cpp [WebAssembly] Warn on exception spec for Emscripten EH 2021-05-20 13:00:20 -07:00
CGExpr.cpp DebugInfo: Use sugared function type when emitting function declarations for call sites 2021-09-28 10:44:35 -07:00
CGExprAgg.cpp [OpaquePtr] Remove uses of CreateInBoundsGEP() without element type 2021-07-17 21:27:16 +02:00
CGExprCXX.cpp [clang] don't mark as Elidable CXXConstruct expressions used in NRVO 2021-09-21 21:41:20 +02:00
CGExprComplex.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprConstant.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprScalar.cpp [OpenCL] Fix as_type3 invalid store creation 2021-09-29 09:40:06 +01:00
CGGPUBuiltin.cpp
CGLoopInfo.cpp [Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. 2021-02-13 17:35:54 -06:00
CGLoopInfo.h [SVE] Add support to vectorize_width loop pragma for scalable vectors 2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp [CodeGen] Stop creating fake FunctionDecls when generating IR for 2021-06-29 14:22:33 -07:00
CGObjC.cpp Put code that avoids heapifying local blocks behind a flag 2021-09-14 14:06:05 -04:00
CGObjCGNU.cpp [OpaquePtr] Remove uses of CreateStructGEP() without element type 2021-07-17 18:48:21 +02:00
CGObjCMac.cpp [clang] NFC: Fix range-based for loop warnings related to decl lookup 2021-04-19 18:31:31 +02:00
CGObjCRuntime.cpp [OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) 2021-03-12 21:01:16 +01:00
CGObjCRuntime.h [clang] Implement objc_non_runtime_protocol to remove protocol metadata 2020-10-02 17:35:50 -04:00
CGOpenCLRuntime.cpp
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp [OpenMP] Introduce a new worksharing RTL function for distribute 2021-09-27 11:36:37 -04:00
CGOpenMPRuntime.h [OpenMP] Introduce a new worksharing RTL function for distribute 2021-09-27 11:36:37 -04:00
CGOpenMPRuntimeAMDGCN.cpp [openmp][nfc] Replace OMPGridValues array with struct 2021-08-19 13:25:42 +01:00
CGOpenMPRuntimeAMDGCN.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeGPU.cpp [OpenMP][Offloading] Use bitset to indicate execution mode instead of value 2021-09-22 11:40:52 -04:00
CGOpenMPRuntimeGPU.h [openmp][nfc] Replace OMPGridValues array with struct 2021-08-19 13:25:42 +01:00
CGOpenMPRuntimeNVPTX.cpp [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGRecordLayout.h [ARM] Follow AACPS standard for volatile bit-fields access width 2020-10-13 10:31:48 +01:00
CGRecordLayoutBuilder.cpp [CodeGen] Use getCharWidth() more consistently in CGRecordLowering. NFC 2021-01-22 21:12:17 +01:00
CGStmt.cpp [OpenMP] Apply OpenMP assumptions to applicable call sites 2021-09-29 16:08:21 -04:00
CGStmtOpenMP.cpp Revert "[OpenMP] Codegen aggregate for outlined function captures" 2021-09-21 13:20:39 -07:00
CGVTT.cpp [AMDGPU] Set the default globals address space to 1 2020-11-20 15:46:53 +00:00
CGVTables.cpp [OpenMP] Apply OpenMP assumptions to applicable call sites 2021-09-29 16:08:21 -04:00
CGVTables.h [clang] Frontend components for the relative vtables ABI (round 2) 2020-06-11 11:17:08 -07:00
CGValue.h [AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. 2020-10-13 13:26:56 +02:00
CMakeLists.txt Reland [clang] Rework dontcall attributes 2021-09-28 15:31:30 -07:00
CodeGenABITypes.cpp [CodeGen] Add public function to emit C++ destructor call. 2020-07-01 11:01:23 -07:00
CodeGenAction.cpp Reland [clang] Rework dontcall attributes 2021-09-28 15:31:30 -07:00
CodeGenFunction.cpp Simplify handling of builtin with inline redefinition 2021-09-28 21:00:47 +02:00
CodeGenFunction.h Revert "[OpenMP] Codegen aggregate for outlined function captures" 2021-09-21 13:20:39 -07:00
CodeGenModule.cpp Reland [clang] Rework dontcall attributes 2021-09-28 15:31:30 -07:00
CodeGenModule.h [OpenMP] Apply OpenMP assumptions to applicable call sites 2021-09-29 16:08:21 -04:00
CodeGenPGO.cpp [PGO] Don't reference functions unless value profiling is enabled 2021-05-20 11:09:24 -07:00
CodeGenPGO.h [PGO] Don't reference functions unless value profiling is enabled 2021-05-20 11:09:24 -07:00
CodeGenTBAA.cpp Reland Implement _ExtInt as an extended int type specifier. 2020-04-17 10:45:48 -07:00
CodeGenTBAA.h
CodeGenTypeCache.h Fix __attribute__((annotate("")) with non-zero globals AS 2021-08-26 10:09:40 +01:00
CodeGenTypes.cpp [Clang] Add __ibm128 type to represent ppc_fp128 2021-09-06 18:00:58 +08:00
CodeGenTypes.h CodeGenTypes::CGRecordLayouts: Use unique_ptr to simplify memory management 2020-04-28 22:31:16 -07:00
ConstantEmitter.h attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
ConstantInitBuilder.cpp Fix ConstantAggregateBuilderBase::getRelativeOffset 2020-06-15 12:23:20 -07:00
CoverageMappingGen.cpp Revert "Revert "[Coverage] Emit gap region between statements if first statements contains terminate statements."" 2021-03-04 11:52:43 -08:00
CoverageMappingGen.h [Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir} 2021-02-25 21:40:12 -08:00
EHScopeStack.h [Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 2021-05-17 22:42:17 -07:00
ItaniumCXXABI.cpp [Clang] Add __ibm128 type to represent ppc_fp128 2021-09-06 18:00:58 +08:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp TypeInfo records more information about align requirement 2021-08-28 19:47:48 -04:00
ModuleBuilder.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
ObjectFilePCHContainerOperations.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
PatternInit.cpp
PatternInit.h
README.txt Revert "This is a test commit" 2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp [clang][patch] Inclusive language, modify filename SanitizerBlacklist.h to NoSanitizeList.h 2021-02-22 15:11:37 -05:00
SanitizerMetadata.h [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SwiftCallingConv.cpp Teach the swift calling convention about _Atomic types 2020-08-31 07:07:25 -07:00
TargetInfo.cpp [X86] Always check the size of SourceTy before getting the next type 2021-09-20 23:34:19 +08:00
TargetInfo.h [Clang][AArch64] Inline assembly support for the ACLE type 'data512_t' 2021-07-31 09:51:28 +01:00
VarBypassDetector.cpp [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00
VarBypassDetector.h [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates a zext/sext of x, which can easily be avoided.
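
As a rough illustration of why the extension is avoidable, the sketch below
writes the comparison both ways in C. The function names are made up for this
example. The first form is what the source says (x is promoted to int, hence
the zext/sext); the second compares in the narrow type, which is the kind of
code IRgen could emit instead. The two agree for every value of x because the
constant 10 is representable in the narrow type.

```c
#include <limits.h>

/* What the source says: x is promoted to int, then compared. */
int eq10_promoted(short x) {
    return (int)x == 10;
}

/* Hypothetical narrowed form: compare the narrow bit patterns directly. */
int eq10_narrow(short x) {
    return (unsigned short)x == (unsigned short)10;
}

/* Counts the short values on which the two forms disagree. */
int count_mismatches(void) {
    int mismatches = 0;
    for (int v = SHRT_MIN; v <= SHRT_MAX; ++v) {
        short x = (short)v;
        if (eq10_promoted(x) != eq10_narrow(x))
            ++mismatches;
    }
    return mismatches;
}
```

Calling count_mismatches() is a cheap exhaustive check that the narrowing is
safe for this constant.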

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then it is a lot shorter to just load the char
directly.
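
A minimal sketch of the 8-bit case, under some assumptions: the bitfield is
modelled by hand as bits 8..15 of a 32-bit storage unit (real bitfield layout
is implementation-defined), the host uses two's complement, and all function
names are hypothetical. The first function is the generic load, shift, mask,
and sign-extend sequence; the last one reads the same field with a single
signed-byte load.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Generic path: take the 32-bit storage unit, shift, mask, sign-extend. */
int field_via_shift_mask(uint32_t storage) {
    int value = (int)((storage >> 8) & 0xFFu);
    return value >= 128 ? value - 256 : value;
}

/* Byte offset of bits 8..15 inside the 32-bit unit on this host. */
static size_t field_byte_offset(void) {
    const uint32_t probe = 1;
    unsigned char first;
    memcpy(&first, &probe, 1);
    return first == 1 ? 1 : 2; /* little-endian : big-endian */
}

/* Direct path: one signed-byte load, no shift or mask needed. */
int field_via_byte_load(const uint32_t *storage) {
    signed char byte;
    memcpy(&byte, (const unsigned char *)storage + field_byte_offset(), 1);
    return byte;
}
```

Both paths return the same value for any storage word; the direct one simply
lets the load instruction do the sign extension.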

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments
for the common situation where the argument is never written to and
never has its address taken. The idea would be to begin generating code
by using the argument directly and, if its address is taken or it is
stored to, then generate the alloca and patch up the existing code.
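
The three cases can be sketched in C as follows. The function names are
hypothetical, and the IR quoted in the first comment is only an approximation
of what clang emits at -O0 (value names and pointer syntax vary by version).
Only the first function is a candidate for skipping the alloca.

```c
/* x is only read. The per-argument sequence emitted at -O0, roughly
 *     %x.addr = alloca i32
 *     store i32 %x, i32* %x.addr
 *     %0 = load i32, i32* %x.addr
 * could be skipped and %x used directly. */
int read_only_arg(int x) {
    return x + 1;
}

/* x is stored to, so IRgen would have to fall back to the alloca
 * and patch up the code generated so far. */
int written_arg(int x) {
    if (x < 0)
        x = -x;
    return x;
}

/* Hypothetical helper that needs a pointer to its operand. */
static void bump(int *p) {
    *p += 1;
}

/* x has its address taken, so it must live in memory. */
int address_taken_arg(int x) {
    bump(&x);
    return x;
}
```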

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about here is -O0 -g compile time
performance, and in that scenario we currently need to emit the alloca
anyway in order to emit proper debug info. So this is blocked on
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead) down through code generation and assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!
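
To make "blocks which only contain jumps" concrete, here is a small sketch
over a toy control-flow graph. It is not Clang or LLVM code: the Block struct
and every function name are assumptions made for this example. It counts the
jump-only blocks (the statistic quoted above) and shows how edges could be
retargeted past them. Doing this after the fact still pays the creation cost,
which is why the note above prefers not generating such blocks at all.

```c
#include <stddef.h>

enum Terminator { TERM_JUMP, TERM_COND_BRANCH, TERM_RETURN };

struct Block {
    int body_insts;        /* instructions before the terminator */
    enum Terminator term;
    size_t succ[2];        /* succ[0] for TERM_JUMP, both for TERM_COND_BRANCH */
};

/* A block is jump-only if it does nothing but branch unconditionally. */
int is_jump_only(const struct Block *b) {
    return b->body_insts == 0 && b->term == TERM_JUMP;
}

size_t count_jump_only(const struct Block *blocks, size_t n) {
    size_t count = 0;
    for (size_t i = 0; i < n; ++i)
        count += (size_t)is_jump_only(&blocks[i]);
    return count;
}

/* Follows a chain of jump-only blocks to the first block that does work.
 * The hop limit keeps an empty infinite loop from spinning forever. */
size_t final_target(const struct Block *blocks, size_t n, size_t target) {
    for (size_t hops = 0; hops < n && is_jump_only(&blocks[target]); ++hops)
        target = blocks[target].succ[0];
    return target;
}

/* Retargets every edge so that it skips chains of jump-only blocks. */
void thread_jumps(struct Block *blocks, size_t n) {
    for (size_t i = 0; i < n; ++i) {
        size_t edges = blocks[i].term == TERM_JUMP ? 1
                     : blocks[i].term == TERM_COND_BRANCH ? 2 : 0;
        for (size_t e = 0; e < edges; ++e)
            blocks[i].succ[e] = final_target(blocks, n, blocks[i].succ[e]);
    }
}
```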

//===---------------------------------------------------------------------===//