llvm-project/clang/lib/CodeGen
Christopher Tetreault 55448ab540 [AArch64] Adding Neon Polynomial vadd Intrinsics
This patch adds the following intrinsics:
            vadd_p8
            vadd_p16
            vadd_p64
            vaddq_p8
            vaddq_p16
            vaddq_p64
            vaddq_p128

Reviewed By: t.p.northover, DavidSpickett, ctetreau

Differential Revision: https://reviews.llvm.org/D96825
2021-02-19 14:48:12 -08:00
..
ABIInfo.h [ABI][NFC] Fix the confusion of ByVal and ByRef argument names 2020-08-06 15:20:18 +03:00
Address.h
BackendUtil.cpp [Msan, NewPM] Reduce size of msan binaries 2021-02-11 16:07:18 -08:00
CGAtomic.cpp [AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. 2020-10-13 13:26:56 +02:00
CGBlocks.cpp [NFC] Refine some uninitialized used variables. 2021-01-26 16:51:05 +08:00
CGBlocks.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGBuilder.h Reapply "[IRBuilder] Virtualize IRBuilder" 2020-02-17 19:04:11 +01:00
CGBuiltin.cpp [AArch64] Adding Neon Polynomial vadd Intrinsics 2021-02-19 14:48:12 -08:00
CGCUDANV.cpp [CUDA][HIP] Fix device variable linkage 2021-02-05 15:11:12 -05:00
CGCUDARuntime.cpp
CGCUDARuntime.h [NFC][CUDA] Refactor registering device variable 2021-02-03 14:29:51 -05:00
CGCXX.cpp [Alignment][NFC] Use Align with CreateAlignedLoad 2020-01-27 10:58:36 +01:00
CGCXXABI.cpp Fix PR35902: incorrect alignment used for ubsan check. 2020-12-28 18:11:17 -05:00
CGCXXABI.h [clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate 2021-01-12 19:44:01 +00:00
CGCall.cpp [clang] functions with the 'const' or 'pure' attribute must always return. 2021-02-18 17:29:46 +01:00
CGCall.h Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. 2020-12-22 19:54:29 -05:00
CGClass.cpp UBSAN: emit distinctive traps 2020-12-08 10:28:26 +00:00
CGCleanup.cpp [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGCleanup.h [XCOFF][AIX] Generate LSDA data and compact unwind section on AIX 2020-12-02 18:42:44 +00:00
CGCoroutine.cpp [Coroutines] Do not evaluate InitListExpr of a co_return 2020-03-16 12:42:44 +08:00
CGDebugInfo.cpp [Clang][RISCV] Define RISC-V V builtin types 2021-02-18 10:17:31 +08:00
CGDebugInfo.h CGDebugInfo: Delete unused parameters 2021-01-11 13:39:03 -08:00
CGDecl.cpp Revert "[NFC, Refactor] Modernize StorageClass from Specifiers.h to a scoped enum (II)" 2021-01-04 23:17:45 +01:00
CGDeclCXX.cpp [AIX][FE] Support constructor/destructor attribute 2020-11-19 09:24:01 -05:00
CGException.cpp [WebAssembly] Rename wasm_rethrow_in_catch intrinsic/builtin 2021-01-08 06:55:04 -08:00
CGExpr.cpp CGExpr - EmitMatrixSubscriptExpr - fix getAs<> null-dereference static analyzer warning. NFCI. 2021-01-05 17:08:11 +00:00
CGExprAgg.cpp [CodeGen] Emit destructor calls to destruct non-trivial C struct 2020-10-23 14:46:17 -07:00
CGExprCXX.cpp De-templatify EmitCallArgs argument type checking, NFCI 2020-12-09 11:08:00 -08:00
CGExprComplex.cpp [FPEnv] Use strictfp metadata in casting nodes 2020-11-06 11:56:12 -05:00
CGExprConstant.cpp [CGExpr] Use getCharWidth() more consistently in CCGExprConstant. NFC 2021-01-22 21:12:17 +01:00
CGExprScalar.cpp [Fixed Point] Add codegen for conversion between fixed-point and floating point. 2021-01-12 13:53:01 +01:00
CGGPUBuiltin.cpp [Alignment][NFC] Use Align with CreateAlignedStore 2020-01-23 17:34:32 +01:00
CGLoopInfo.cpp [Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. 2021-02-13 17:35:54 -06:00
CGLoopInfo.h [SVE] Add support to vectorize_width loop pragma for scalable vectors 2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp Revert "[NFC, Refactor] Modernize StorageClass from Specifiers.h to a scoped enum (II)" 2021-01-04 23:17:45 +01:00
CGObjC.cpp [ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of 2021-02-12 09:51:57 -08:00
CGObjCGNU.cpp [GNU ObjC] Fix a regression listing methods twice. 2020-12-01 09:50:18 +00:00
CGObjCMac.cpp Fix crash when emitting NullReturn guards for functions returning BOOL 2021-01-21 14:29:36 -08:00
CGObjCRuntime.cpp Fix a variety of minor issues with ObjC method mangling: 2020-09-29 19:51:53 -04:00
CGObjCRuntime.h [clang] Implement objc_non_runtime_protocol to remove protocol metadata 2020-10-02 17:35:50 -04:00
CGOpenCLRuntime.cpp Fix "pointer is null" static analyzer warning. NFCI. 2020-01-08 17:19:08 +00:00
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp [OPENMP50]Allow overlapping mapping in target constructs. 2021-02-16 14:42:08 -08:00
CGOpenMPRuntime.h [OpenMP] Add Passing in Original Declaration Names To Mapper API 2020-11-18 15:28:39 -05:00
CGOpenMPRuntimeAMDGCN.cpp [libomptarget][amdgpu] Call into deviceRTL instead of ockl 2021-01-04 16:48:47 +00:00
CGOpenMPRuntimeAMDGCN.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeGPU.cpp [AMDGPU] gfx90a support 2021-02-17 16:01:32 -08:00
CGOpenMPRuntimeGPU.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.cpp [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGRecordLayout.h [ARM] Follow AACPS standard for volatile bit-fields access width 2020-10-13 10:31:48 +01:00
CGRecordLayoutBuilder.cpp [CodeGen] Use getCharWidth() more consistently in CGRecordLowering. NFC 2021-01-22 21:12:17 +01:00
CGStmt.cpp [OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). 2021-02-16 09:45:07 -08:00
CGStmtOpenMP.cpp [OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). 2021-02-16 09:45:07 -08:00
CGVTT.cpp [AMDGPU] Set the default globals address space to 1 2020-11-20 15:46:53 +00:00
CGVTables.cpp [clang][RelativeVTablesABI] Use dso_local_equivalent rather than emitting stubs 2020-11-30 16:02:35 -08:00
CGVTables.h [clang] Frontend components for the relative vtables ABI (round 2) 2020-06-11 11:17:08 -07:00
CGValue.h [AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. 2020-10-13 13:26:56 +02:00
CMakeLists.txt Remove dependency on clangASTMatchers. 2020-09-10 22:17:48 -04:00
CodeGenABITypes.cpp [CodeGen] Add public function to emit C++ destructor call. 2020-07-01 11:01:23 -07:00
CodeGenAction.cpp Restore diagnostic handler after CodeGenAction::ExecuteAction 2021-02-15 10:33:00 +00:00
CodeGenFunction.cpp Support for instrumenting only selected files or functions 2021-01-26 17:13:34 -08:00
CodeGenFunction.h [OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). 2021-02-16 09:45:07 -08:00
CodeGenModule.cpp [DebugInfo] Keep the DWARF64 flag in the module metadata 2021-02-17 17:03:34 +07:00
CodeGenModule.h [ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of 2021-02-12 09:51:57 -08:00
CodeGenPGO.cpp Don't emit coverage mapping for excluded functions 2021-02-05 13:03:57 -08:00
CodeGenPGO.h [NFC] Make non-modifying members const. 2020-10-18 18:50:21 +02:00
CodeGenTBAA.cpp Reland Implement _ExtInt as an extended int type specifier. 2020-04-17 10:45:48 -07:00
CodeGenTBAA.h
CodeGenTypeCache.h [CGExpr] Use getCharWidth() more consistently in CCGExprConstant. NFC 2021-01-22 21:12:17 +01:00
CodeGenTypes.cpp [Clang][RISCV] Define RISC-V V builtin types 2021-02-18 10:17:31 +08:00
CodeGenTypes.h CodeGenTypes::CGRecordLayouts: Use unique_ptr to simplify memory management 2020-04-28 22:31:16 -07:00
ConstantEmitter.h attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
ConstantInitBuilder.cpp Fix ConstantAggregateBuilderBase::getRelativeOffset 2020-06-15 12:23:20 -07:00
CoverageMappingGen.cpp [Coverage] Store compilation dir separately in coverage mapping 2021-02-18 14:34:39 -08:00
CoverageMappingGen.h [Coverage] Store compilation dir separately in coverage mapping 2021-02-18 14:34:39 -08:00
EHScopeStack.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
ItaniumCXXABI.cpp [clang] Emit type metadata on available_externally vtables for WPD 2021-02-19 12:42:34 -08:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp [clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate 2021-01-12 19:44:01 +00:00
ModuleBuilder.cpp reland "[DebugInfo] Support to emit debugInfo for extern variables" 2019-12-22 18:28:50 -08:00
ObjectFilePCHContainerOperations.cpp [Clang] Add __STDCPP_THREADS__ to standard predefine macros 2020-11-22 16:05:53 -08:00
PatternInit.cpp Clean up usages of asserting vector getters in Type 2020-04-13 13:01:40 -07:00
PatternInit.h
README.txt Revert "This is a test commit" 2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SanitizerMetadata.h [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SwiftCallingConv.cpp Teach the swift calling convention about _Atomic types 2020-08-31 07:07:25 -07:00
TargetInfo.cpp [CFE, SystemZ] New target hook testFPKind() for checks of FP values. 2021-02-18 12:36:46 -06:00
TargetInfo.h [CFE, SystemZ] New target hook testFPKind() for checks of FP values. 2021-02-18 12:36:46 -06:00
VarBypassDetector.cpp
VarBypassDetector.h

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//