llvm-project/clang/lib/CodeGen
Ryan Santhirarajan 0395f9e70b [ARM] Neon Polynomial vadd Intrinsic fix
The Neon vadd intrinsics were added to the ARMSIMD intrinsic map,
however due to being defined under an AArch64 guard in arm_neon.td,
were not previously useable on ARM. This change rectifies that.

It is important to note that poly128 is not valid on ARM, thus it was
extracted out of the original arm_neon.td definition and separated
for the sake of AArch64.

Reviewed By: DavidSpickett

Differential Revision: https://reviews.llvm.org/D100772
2021-04-28 11:59:40 -07:00
..
ABIInfo.h [ABI][NFC] Fix the confusion of ByVal and ByRef argument names 2020-08-06 15:20:18 +03:00
Address.h
BackendUtil.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
CGAtomic.cpp Let clang atomic builtins fetch add/sub support floating point types 2021-04-06 15:44:00 -04:00
CGBlocks.cpp [clang] NFC: Fix range-based for loop warnings related to decl lookup 2021-04-19 18:31:31 +02:00
CGBlocks.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGBuilder.h [OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) 2021-03-12 21:01:16 +01:00
CGBuiltin.cpp [ARM] Neon Polynomial vadd Intrinsic fix 2021-04-28 11:59:40 -07:00
CGCUDANV.cpp [clang] NFC: Fix range-based for loop warnings related to decl lookup 2021-04-19 18:31:31 +02:00
CGCUDARuntime.cpp
CGCUDARuntime.h [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
CGCXX.cpp [OpaquePtrs] Remove some uses of type-less CreateLoad APIs (NFC) 2021-03-11 14:40:57 +01:00
CGCXXABI.cpp Fix PR35902: incorrect alignment used for ubsan check. 2020-12-28 18:11:17 -05:00
CGCXXABI.h [clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate 2021-01-12 19:44:01 +00:00
CGCall.cpp Reland "[Clang] Propagate guaranteed alignment for malloc and others" 2021-04-23 14:05:57 +02:00
CGCall.h Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. 2020-12-22 19:54:29 -05:00
CGClass.cpp Implemented [[clang::musttail]] attribute for guaranteed tail calls. 2021-04-15 17:12:21 -07:00
CGCleanup.cpp [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGCleanup.h [XCOFF][AIX] Generate LSDA data and compact unwind section on AIX 2020-12-02 18:42:44 +00:00
CGCoroutine.cpp Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass" 2021-04-18 17:22:28 -07:00
CGDebugInfo.cpp Reland [DebugInfo] Fix the mismatching between C++ language tags and Dwarf versions."" 2021-04-12 11:05:55 +00:00
CGDebugInfo.h [Clang][Coroutine][DebugInfo] In c++ coroutine, clang will emit different debug info variables for parameters and move-parameters. 2021-04-12 11:10:47 +08:00
CGDecl.cpp [SYCL] Implement SYCL address space attributes handling 2021-04-26 13:44:10 +03:00
CGDeclCXX.cpp [AIX] Support init priority attribute 2021-04-08 15:40:09 -04:00
CGException.cpp [CGBuilder] Remove type-less CreateAlignedLoad() APIs (NFC) 2021-03-11 10:41:23 +01:00
CGExpr.cpp BPF: emit debuginfo for Function of DeclRefExpr if requested 2021-04-26 16:53:25 -07:00
CGExprAgg.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprCXX.cpp Implemented [[clang::musttail]] attribute for guaranteed tail calls. 2021-04-15 17:12:21 -07:00
CGExprComplex.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprConstant.cpp [Matrix] Implement C-style explicit type conversions for matrix types. 2021-04-10 11:48:41 +01:00
CGExprScalar.cpp [Matrix] Support #pragma clang fp 2021-04-22 11:45:34 +01:00
CGGPUBuiltin.cpp [Alignment][NFC] Use Align with CreateAlignedStore 2020-01-23 17:34:32 +01:00
CGLoopInfo.cpp [Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. 2021-02-13 17:35:54 -06:00
CGLoopInfo.h [SVE] Add support to vectorize_width loop pragma for scalable vectors 2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp [OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) 2021-03-12 21:01:16 +01:00
CGObjC.cpp [CGBuilder] Remove type-less CreateAlignedLoad() APIs (NFC) 2021-03-11 10:41:23 +01:00
CGObjCGNU.cpp [clang] NFC: Fix range-based for loop warnings related to decl lookup 2021-04-19 18:31:31 +02:00
CGObjCMac.cpp [clang] NFC: Fix range-based for loop warnings related to decl lookup 2021-04-19 18:31:31 +02:00
CGObjCRuntime.cpp [OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) 2021-03-12 21:01:16 +01:00
CGObjCRuntime.h [clang] Implement objc_non_runtime_protocol to remove protocol metadata 2020-10-02 17:35:50 -04:00
CGOpenCLRuntime.cpp Fix "pointer is null" static analyzer warning. NFCI. 2020-01-08 17:19:08 +00:00
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp [OPENMP]Fix PR49698: OpenMP declare mapper causes segmentation fault. 2021-04-21 10:38:31 -07:00
CGOpenMPRuntime.h [OpenMP] Fix -Wdeprecated-copy 2021-04-23 10:49:19 -07:00
CGOpenMPRuntimeAMDGCN.cpp [libomptarget][amdgpu] Call into deviceRTL instead of ockl 2021-01-04 16:48:47 +00:00
CGOpenMPRuntimeAMDGCN.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeGPU.cpp [OpenMP] Simplify offloading parallel call codegen 2021-04-21 18:46:07 -07:00
CGOpenMPRuntimeGPU.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.cpp [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.h [OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 2020-08-03 05:38:39 +00:00
CGRecordLayout.h [ARM] Follow AACPS standard for volatile bit-fields access width 2020-10-13 10:31:48 +01:00
CGRecordLayoutBuilder.cpp [CodeGen] Use getCharWidth() more consistently in CGRecordLowering. NFC 2021-01-22 21:12:17 +01:00
CGStmt.cpp Implemented [[clang::musttail]] attribute for guaranteed tail calls. 2021-04-15 17:12:21 -07:00
CGStmtOpenMP.cpp [OPENMP]Fix PR49115: Incorrect results for scan directive. 2021-04-16 06:25:35 -07:00
CGVTT.cpp [AMDGPU] Set the default globals address space to 1 2020-11-20 15:46:53 +00:00
CGVTables.cpp [clang][RelativeVTablesABI] Use dso_local_equivalent rather than emitting stubs 2020-11-30 16:02:35 -08:00
CGVTables.h [clang] Frontend components for the relative vtables ABI (round 2) 2020-06-11 11:17:08 -07:00
CGValue.h [AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. 2020-10-13 13:26:56 +02:00
CMakeLists.txt Remove dependency on clangASTMatchers. 2020-09-10 22:17:48 -04:00
CodeGenABITypes.cpp [CodeGen] Add public function to emit C++ destructor call. 2020-07-01 11:01:23 -07:00
CodeGenAction.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
CodeGenFunction.cpp Normalize interaction with boolean attributes 2021-04-17 08:17:33 +02:00
CodeGenFunction.h Implemented [[clang::musttail]] attribute for guaranteed tail calls. 2021-04-15 17:12:21 -07:00
CodeGenModule.cpp [SYCL] Implement SYCL address space attributes handling 2021-04-26 13:44:10 +03:00
CodeGenModule.h [SYCL] Implement SYCL address space attributes handling 2021-04-26 13:44:10 +03:00
CodeGenPGO.cpp Don't emit coverage mapping for excluded functions 2021-02-05 13:03:57 -08:00
CodeGenPGO.h [NFC] Make non-modifying members const. 2020-10-18 18:50:21 +02:00
CodeGenTBAA.cpp Reland Implement _ExtInt as an extended int type specifier. 2020-04-17 10:45:48 -07:00
CodeGenTBAA.h
CodeGenTypeCache.h [CGExpr] Use getCharWidth() more consistently in CCGExprConstant. NFC 2021-01-22 21:12:17 +01:00
CodeGenTypes.cpp [Clang][RISCV] Define RISC-V V builtin types 2021-02-18 10:17:31 +08:00
CodeGenTypes.h CodeGenTypes::CGRecordLayouts: Use unique_ptr to simplify memory management 2020-04-28 22:31:16 -07:00
ConstantEmitter.h attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
ConstantInitBuilder.cpp Fix ConstantAggregateBuilderBase::getRelativeOffset 2020-06-15 12:23:20 -07:00
CoverageMappingGen.cpp Revert "Revert "[Coverage] Emit gap region between statements if first statements contains terminate statements."" 2021-03-04 11:52:43 -08:00
CoverageMappingGen.h [Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir} 2021-02-25 21:40:12 -08:00
EHScopeStack.h Implemented [[clang::musttail]] attribute for guaranteed tail calls. 2021-04-15 17:12:21 -07:00
ItaniumCXXABI.cpp Temporarily revert the code part of D100981 "Delete le32/le64 targets" 2021-04-22 10:18:44 -07:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp [OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) 2021-03-12 21:01:16 +01:00
ModuleBuilder.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
ObjectFilePCHContainerOperations.cpp [clang/Basic] Make TargetInfo.h not use DataLayout again 2021-04-27 22:26:10 -04:00
PatternInit.cpp Clean up usages of asserting vector getters in Type 2020-04-13 13:01:40 -07:00
PatternInit.h
README.txt Revert "This is a test commit" 2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp [clang][patch] Inclusive language, modify filename SanitizerBlacklist.h to NoSanitizeList.h 2021-02-22 15:11:37 -05:00
SanitizerMetadata.h [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SwiftCallingConv.cpp Teach the swift calling convention about _Atomic types 2020-08-31 07:07:25 -07:00
TargetInfo.cpp [SYCL] Implement SYCL address space attributes handling 2021-04-26 13:44:10 +03:00
TargetInfo.h [CFE, SystemZ] New target hook testFPKind() for checks of FP values. 2021-02-18 12:36:46 -06:00
VarBypassDetector.cpp [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00
VarBypassDetector.h [clang,NFC] Fix typos in file headers 2021-02-25 12:47:02 -08:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//