llvm-project/clang/lib/CodeGen
skc7 16b781e6d1 [AMDGPU][clang] Fix __builtin_nontemporal_store() failure on AMDGPU
Reviewed By: yaxunl, sameerds

Differential Revision: https://reviews.llvm.org/D114849
2021-12-02 05:53:25 +00:00
..
ABIInfo.h [ABI][NFC] Fix the confusion of ByVal and ByRef argument names 2020-08-06 15:20:18 +03:00
Address.h
BackendUtil.cpp [sancov] add tracing for loads and store 2021-11-09 14:35:13 -08:00
CGAtomic.cpp [HIP] Add atomic load, atomic store and atomic cmpxchng_weak builtin support in HIP-clang 2021-11-29 12:07:13 -07:00
CGBlocks.cpp Reland [IR] Increase max alignment to 4GB 2021-10-06 13:29:23 -07:00
CGBlocks.h
CGBuilder.h [OpaquePtr] Remove uses of CreateGEP() without element type 2021-07-17 22:56:27 +02:00
CGBuiltin.cpp [AMDGPU][clang] Fix __builtin_nontemporal_store() failure on AMDGPU 2021-12-02 05:53:25 +00:00
CGCUDANV.cpp [CUDA][HIP] Allow comdat for kernels 2021-11-10 16:42:23 -05:00
CGCUDARuntime.cpp
CGCUDARuntime.h [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
CGCXX.cpp [OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type 2021-07-17 17:07:46 +02:00
CGCXXABI.cpp Fix PR35902: incorrect alignment used for ubsan check. 2020-12-28 18:11:17 -05:00
CGCXXABI.h [clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate 2021-01-12 19:44:01 +00:00
CGCall.cpp [OpenMP] Remove doing assumption propagation in the front end. 2021-11-09 17:39:24 -05:00
CGCall.h Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. 2020-12-22 19:54:29 -05:00
CGClass.cpp Don't update the vptr at the start of the destructor of a final class. 2021-10-08 19:59:42 -07:00
CGCleanup.cpp [Windows SEH]: Fix -O2 crash for Windows -EHa 2021-06-04 14:07:44 -07:00
CGCleanup.h [XCOFF][AIX] Generate LSDA data and compact unwind section on AIX 2020-12-02 18:42:44 +00:00
CGCoroutine.cpp Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass" 2021-04-18 17:22:28 -07:00
CGDebugInfo.cpp DebugInfo/Printing: Improve name of policy for including types for template arguments 2021-11-11 21:59:27 -08:00
CGDebugInfo.h Reland "[Attr] support btf_type_tag attribute" 2021-11-05 11:25:17 -07:00
CGDecl.cpp In spir functions, llvm.dbg.declare intrinsics created 2021-11-05 15:08:09 -07:00
CGDeclCXX.cpp PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members. 2021-08-24 19:53:44 -07:00
CGException.cpp Correct handling of the 'throw()' exception specifier in C++17. 2021-11-10 17:40:16 -05:00
CGExpr.cpp [CFE][Codegen] Make sure to maintain the contiguity of all the static allocas 2021-11-10 08:45:21 +05:30
CGExprAgg.cpp No longer crash when a consteval function returns a structure 2021-11-04 09:41:10 -04:00
CGExprCXX.cpp [clang] don't mark as Elidable CXXConstruct expressions used in NRVO 2021-09-21 21:41:20 +02:00
CGExprComplex.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprConstant.cpp PR52183: Don't emit code for a void-typed constant expression. 2021-10-14 20:55:51 -07:00
CGExprScalar.cpp PR52183: Don't emit code for a void-typed constant expression. 2021-10-14 20:55:51 -07:00
CGGPUBuiltin.cpp [OpenMP] Lower printf to __llvm_omp_vprintf 2021-11-10 15:30:56 +00:00
CGLoopInfo.cpp [Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. 2021-02-13 17:35:54 -06:00
CGLoopInfo.h [SVE] Add support to vectorize_width loop pragma for scalable vectors 2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp [CodeGen] Stop creating fake FunctionDecls when generating IR for 2021-06-29 14:22:33 -07:00
CGObjC.cpp [ObjC][ARC] Use operand bundle "clang.arc.attachedcall" on x86-64 2021-11-08 18:38:40 -08:00
CGObjCGNU.cpp Fix a variety of bugs with nil-receiver checks when targeting 2021-10-08 05:44:06 -04:00
CGObjCMac.cpp [clang][objc][codegen] Skip emitting ObjC category metadata when the 2021-11-12 16:21:21 -08:00
CGObjCRuntime.cpp [clang] Add range accessor for ObjCAtTryStmt catch_stmts and use it 2021-10-27 08:57:05 -04:00
CGObjCRuntime.h Fix a variety of bugs with nil-receiver checks when targeting 2021-10-08 05:44:06 -04:00
CGOpenCLRuntime.cpp
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp Revert "OpenMP: Start calling setTargetAttributes for generated kernels" 2021-11-29 15:47:10 -05:00
CGOpenMPRuntime.h [OpenMP] support depend clause for taskwait directive, by Deepak 2021-11-19 06:30:17 -08:00
CGOpenMPRuntimeGPU.cpp [clang][openmp][NFC] Remove arch-specific CGOpenMPRuntimeGPU files 2021-11-09 15:11:05 -05:00
CGOpenMPRuntimeGPU.h [clang][openmp][NFC] Remove arch-specific CGOpenMPRuntimeGPU files 2021-11-09 15:11:05 -05:00
CGRecordLayout.h [ARM] Follow AACPS standard for volatile bit-fields access width 2020-10-13 10:31:48 +01:00
CGRecordLayoutBuilder.cpp [CodeGen] Use getCharWidth() more consistently in CGRecordLowering. NFC 2021-01-22 21:12:17 +01:00
CGStmt.cpp [clang] Make -masm=intel affect inline asm style 2021-11-17 13:41:59 -05:00
CGStmtOpenMP.cpp [clang][OpenMP][DebugInfo] Debug support for private variables inside an OpenMP task construct 2021-11-25 19:55:22 +05:30
CGVTT.cpp [AMDGPU] Set the default globals address space to 1 2020-11-20 15:46:53 +00:00
CGVTables.cpp [OpenMP] Remove doing assumption propagation in the front end. 2021-11-09 17:39:24 -05:00
CGVTables.h
CGValue.h [AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. 2020-10-13 13:26:56 +02:00
CMakeLists.txt [clang][openmp][NFC] Remove arch-specific CGOpenMPRuntimeGPU files 2021-11-09 15:11:05 -05:00
CodeGenABITypes.cpp
CodeGenAction.cpp [clang] Add option to disable -clear-ast-before-backend 2021-10-19 20:51:48 -07:00
CodeGenFunction.cpp [CFE][Codegen] Make sure to maintain the contiguity of all the static allocas 2021-11-10 08:45:21 +05:30
CodeGenFunction.h [clang] Use isa instead of dyn_cast (NFC) 2021-11-14 09:32:40 -08:00
CodeGenModule.cpp [clang][ARM] PACBTI-M frontend support 2021-12-01 10:37:16 +00:00
CodeGenModule.h Reapply 'Implement target_clones multiversioning' 2021-11-29 06:30:01 -08:00
CodeGenPGO.cpp Implement if consteval (P1938) 2021-10-05 08:04:14 -04:00
CodeGenPGO.h [PGO] Don't reference functions unless value profiling is enabled 2021-05-20 11:09:24 -07:00
CodeGenTBAA.cpp
CodeGenTBAA.h
CodeGenTypeCache.h Fix __attribute__((annotate("")) with non-zero globals AS 2021-08-26 10:09:40 +01:00
CodeGenTypes.cpp [Clang] Add __ibm128 type to represent ppc_fp128 2021-09-06 18:00:58 +08:00
CodeGenTypes.h
ConstantEmitter.h
ConstantInitBuilder.cpp
CoverageMappingGen.cpp [clang] Use isa instead of dyn_cast (NFC) 2021-11-14 09:32:40 -08:00
CoverageMappingGen.h [Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir} 2021-02-25 21:40:12 -08:00
EHScopeStack.h [Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 2021-05-17 22:42:17 -07:00
ItaniumCXXABI.cpp PR51079: Treat thread_local variables with an incomplete class type as 2021-10-08 18:46:01 -07:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp [clang] Use llvm::{count,count_if,find_if,all_of,none_of} (NFC) 2021-10-25 09:14:45 -07:00
ModuleBuilder.cpp [clang-repl] Allow Interpreter::getSymbolAddress to take a mangled name. 2021-11-10 12:52:05 +00:00
ObjectFilePCHContainerOperations.cpp [WebAssembly] Emit clangast in custom section aligned by 4 bytes 2021-10-19 15:50:08 -07:00
PatternInit.cpp
PatternInit.h
README.txt Revert "This is a test commit" 2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp [clang][patch] Inclusive language, modify filename SanitizerBlacklist.h to NoSanitizeList.h 2021-02-22 15:11:37 -05:00
SanitizerMetadata.h
SwiftCallingConv.cpp Teach the swift calling convention about _Atomic types 2020-08-31 07:07:25 -07:00
TargetInfo.cpp [clang][ARM] PACBTI-M frontend support 2021-12-01 10:37:16 +00:00
TargetInfo.h [Clang][AArch64] Inline assembly support for the ACLE type 'data512_t' 2021-07-31 09:51:28 +01:00
VarBypassDetector.cpp [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00
VarBypassDetector.h Use {DenseSet,SmallPtrSet}::contains (NFC) 2021-10-29 20:26:07 -07:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//