llvm-project/clang/lib/CodeGen
Kerry McLaughlin af64948e2a [SVE][Inline-Asm] Add constraints for SVE ACLE types
Summary:
Adds the constraints described below to ensure that we
can tie variables of SVE ACLE types to operands in inline-asm:
 - y: SVE registers Z0-Z7
 - Upl: One of the low eight SVE predicate registers (P0-P7)
 - Upa: Full range of SVE predicate registers (P0-P15)

Reviewers: sdesmalen, huntergr, rovka, cameron.mcinally, efriedma, rengolin

Reviewed By: efriedma

Subscribers: miyuki, tschuett, rkruppe, psnobl, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D75690
2020-03-17 11:04:19 +00:00
..
ABIInfo.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
Address.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
BackendUtil.cpp [memtag] Plug in stack safety analysis. 2020-03-16 16:35:25 -07:00
CGAtomic.cpp [OPENMP50]Add codegen for acq_rel clause in atomic|flush directives. 2020-02-07 15:05:09 -05:00
CGBlocks.cpp [CodeGen] Emit destructor calls to destruct compound literals 2020-03-10 14:08:28 -07:00
CGBlocks.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGBuilder.h Reapply "[IRBuilder] Virtualize IRBuilder" 2020-02-17 19:04:11 +01:00
CGBuiltin.cpp Revert "[SVE] Auto-generate builtins and header for svld1." 2020-03-16 15:22:15 +00:00
CGCUDANV.cpp [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
CGCUDARuntime.cpp Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGCUDARuntime.h [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
CGCXX.cpp [Alignment][NFC] Use Align with CreateAlignedLoad 2020-01-27 10:58:36 +01:00
CGCXXABI.cpp Avoid Attr.h includes, CodeGen edition 2019-12-09 16:17:18 -08:00
CGCXXABI.h Improve code generation for thread_local variables: 2019-09-12 20:00:24 +00:00
CGCall.cpp [SVE][Inline-Asm] Add constraints for SVE ACLE types 2020-03-17 11:04:19 +00:00
CGCall.h Revert "[IRGen] Emit lifetime intrinsics around temporary aggregate argument allocas" 2020-02-12 12:26:46 -08:00
CGClass.cpp Don't assume a reference refers to at least sizeof(T) bytes. 2020-01-31 19:08:17 -08:00
CGCleanup.cpp [Alignment][Clang][NFC] Add CharUnits::getAsAlign 2019-10-03 13:00:29 +00:00
CGCleanup.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGCoroutine.cpp [Coroutines] Do not evaluate InitListExpr of a co_return 2020-03-16 12:42:44 +08:00
CGDebugInfo.cpp Revert "Reland "[DebugInfo] Enable the debug entry values feature by default"" 2020-03-13 15:37:44 -04:00
CGDebugInfo.h Avoid including Module.h from ExternalASTSource.h 2020-03-11 13:37:41 -07:00
CGDecl.cpp [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
CGDeclCXX.cpp Avoid Attr.h includes, CodeGen edition 2019-12-09 16:17:18 -08:00
CGException.cpp [clang] Add -fignore-exceptions 2020-02-12 09:56:18 +00:00
CGExpr.cpp [CodeGen] Emit destructor calls to destruct compound literals 2020-03-10 14:08:28 -07:00
CGExprAgg.cpp [CodeGen] Emit destructor calls to destruct compound literals 2020-03-10 14:08:28 -07:00
CGExprCXX.cpp Fix "pointer is null" static analyzer warnings. NFCI. 2020-01-11 16:02:23 +00:00
CGExprComplex.cpp Fix a crash when casting _Complex and ignoring the results. 2020-01-28 13:05:56 -05:00
CGExprConstant.cpp C++ DR2026: static storage duration variables are not zeroed before 2020-02-06 16:37:22 -08:00
CGExprScalar.cpp [CodeGen] Emit destructor calls to destruct compound literals 2020-03-10 14:08:28 -07:00
CGGPUBuiltin.cpp [Alignment][NFC] Use Align with CreateAlignedStore 2020-01-23 17:34:32 +01:00
CGLoopInfo.cpp Sink more Attr.h inline methods, NFC 2020-03-12 11:54:31 -07:00
CGLoopInfo.h Don't keep stale pointers to LoopInfos. 2019-08-19 13:37:41 +00:00
CGNonTrivialStruct.cpp Make llvm::StringRef to std::string conversions explicit. 2020-01-28 23:25:25 +01:00
CGObjC.cpp Relax the rules around objc_alloc and objc_alloc_init optimizations. 2020-01-14 19:48:33 -08:00
CGObjCGNU.cpp [NFC] Remove trailing space 2020-02-18 10:49:13 +08:00
CGObjCMac.cpp Replace getAs with castAs to fix null dereference static analyzer warning. 2020-03-12 18:52:58 +00:00
CGObjCRuntime.cpp [NFC] Remove trailing space 2020-02-18 10:49:13 +08:00
CGObjCRuntime.h Implement __attribute__((objc_direct)), __attribute__((objc_direct_members)) 2019-11-18 11:48:40 -08:00
CGOpenCLRuntime.cpp Fix "pointer is null" static analyzer warning. NFCI. 2020-01-08 17:19:08 +00:00
CGOpenCLRuntime.h [OpenCL] Simplify LLVM IR generated for OpenCL blocks 2019-02-21 11:02:10 +00:00
CGOpenMPRuntime.cpp [OPENMP]Fix PR45047: Do not copy firstprivates in tasks twice. 2020-03-13 18:04:16 -04:00
CGOpenMPRuntime.h [OPENMP50]Codegen for depobj dependency kind. 2020-03-09 17:46:06 -04:00
CGOpenMPRuntimeNVPTX.cpp [OPENMP50]Add basic support for depobj construct. 2020-03-02 13:10:32 -05:00
CGOpenMPRuntimeNVPTX.h [OPENMP50]Full handling of atomic_default_mem_order in requires 2020-02-11 15:42:34 -05:00
CGRecordLayout.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGRecordLayoutBuilder.cpp P0840R2: support for [[no_unique_address]] attribute 2019-06-20 20:44:45 +00:00
CGStmt.cpp [SVE][Inline-Asm] Add constraints for SVE ACLE types 2020-03-17 11:04:19 +00:00
CGStmtOpenMP.cpp [OPENMP]Fix PR45047: Do not copy firstprivates in tasks twice. 2020-03-13 18:04:16 -04:00
CGVTT.cpp Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGVTables.cpp Restore "[LTO/WPD] Enable aggressive WPD under LTO option" 2020-01-27 07:55:05 -08:00
CGVTables.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CGValue.h Move verification of Sema::MaximumAlignment to a .cpp file 2020-01-30 13:37:52 -08:00
CMakeLists.txt Generalize the pass registration mechanism used by Polly to any third-party tool 2020-01-02 16:45:31 +01:00
CodeGenABITypes.cpp Fix parameter name comments using clang-tidy. NFC. 2019-07-16 04:46:31 +00:00
CodeGenAction.cpp [remark][diagnostics] [codegen] Fix PR44896 2020-02-25 08:15:17 -08:00
CodeGenFunction.cpp [Clang][SVE] Parse builtin type string for scalable vectors 2020-03-15 14:34:52 +00:00
CodeGenFunction.h Revert "[SVE] Auto-generate builtins and header for svld1." 2020-03-16 15:22:15 +00:00
CodeGenModule.cpp Avoid including FileManager.h from SourceManager.h 2020-03-11 13:53:12 -07:00
CodeGenModule.h [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
CodeGenPGO.cpp [NFC] Fixes -Wrange-loop-analysis warnings 2020-01-01 20:01:37 +01:00
CodeGenPGO.h [CodeGenPGO] Fix shadow variable warning. NFC. 2020-03-02 15:06:34 +00:00
CodeGenTBAA.cpp Fix detection of __attribute__((may_alias)) to properly look through 2019-12-11 14:04:37 -08:00
CodeGenTBAA.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CodeGenTypeCache.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
CodeGenTypes.cpp [AArch64][SVE] CodeGen of ACLE Builtin Types 2020-02-19 12:10:47 +00:00
CodeGenTypes.h IRGen: Remove StructorType; thread GlobalDecl through more code. NFCI. 2019-03-22 23:05:10 +00:00
ConstantEmitter.h [CodeGen] Fix clang crash on aggregate initialization of array of labels 2019-11-28 00:59:25 +01:00
ConstantInitBuilder.cpp [Alignment][Clang][NFC] Add CharUnits::getAsAlign 2019-10-03 13:00:29 +00:00
CoverageMappingGen.cpp Avoid including FileManager.h from SourceManager.h 2020-03-11 13:53:12 -07:00
CoverageMappingGen.h Reland: [Coverage] Revise format to reduce binary size 2020-02-28 18:12:04 -08:00
EHScopeStack.h EHScopeStack::Cleanup has virtual functions so the destructor should be too. 2020-03-02 15:06:34 +00:00
ItaniumCXXABI.cpp [AIX][Frontend] C++ ABI customizations for AIX boilerplate 2020-02-24 10:26:51 -05:00
MacroPPCallbacks.cpp Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
MacroPPCallbacks.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
MicrosoftCXXABI.cpp [NFC] Let mangler accept GlobalDecl 2020-03-07 23:51:41 -05:00
ModuleBuilder.cpp reland "[DebugInfo] Support to emit debugInfo for extern variables" 2019-12-22 18:28:50 -08:00
ObjectFilePCHContainerOperations.cpp [Alignment][Clang][NFC] Add CharUnits::getAsAlign 2019-10-03 13:00:29 +00:00
PatternInit.cpp Avoid ASTContext.h -> TargetInfo.h dep 2020-02-27 14:35:00 -08:00
PatternInit.h Variable auto-init: also auto-init alloca 2019-04-12 00:11:27 +00:00
README.txt
SanitizerMetadata.cpp Avoid SourceManager.h include in RawCommentList.h, add missing incs 2020-02-27 13:49:40 -08:00
SanitizerMetadata.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
SwiftCallingConv.cpp Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
TargetInfo.cpp [HIP] Mark kernels with uniform-work-group-size=true 2020-03-13 06:56:56 -04:00
TargetInfo.h [OpenCL][PR41727] Prevent ICE on global dtors 2019-07-15 11:58:10 +00:00
VarBypassDetector.cpp Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00
VarBypassDetector.h Update the file headers across all of the LLVM projects in the monorepo 2019-01-19 08:50:56 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//