llvm-project

History

Michael Kruse 707ce34b06 [OpenMP][OpenMPIRBuilder] Implement loop unrolling. Add methods for loop unrolling to the OpenMPIRBuilder class and use them in Clang if `-fopenmp-enable-irbuilder` is enabled. The unrolling methods are: * `unrollLoopFull` * `unrollLoopPartial` * `unrollLoopHeuristic` `unrollLoopPartial` and `unrollLoopHeuristic` can use compiler heuristics to automatically determine the unroll factor. If possible, that is if no CanonicalLoopInfo is required to pass to another method, metadata for LLVM's LoopUnrollPass is added. Otherwise the unroll factor is determined using the same heurstics as user by LoopUnrollPass. Not requiring a CanonicalLoopInfo, especially with `unrollLoopHeuristic` allows greater flexibility. With full unrolling and partial unrolling with known unroll factor, instead of duplicating instructions by the OpenMPIRBuilder, the full unroll is still delegated to the LoopUnrollPass. In case of partial unrolling the loop is first tiled using the existing `tileLoops` methods, then the inner loop fully unrolled using the same mechanism. Reviewed By: jdoerfert, kiranchandramohan Differential Revision: https://reviews.llvm.org/D107764		2021-09-02 02:37:25 -05:00
..
ABIInfo.h	[ABI][NFC] Fix the confusion of ByVal and ByRef argument names	2020-08-06 15:20:18 +03:00
Address.h	…
BackendUtil.cpp	[NewPM] Make some sanitizer passes parameterized in the PassRegistry	2021-08-19 12:43:37 +02:00
CGAtomic.cpp	[OpaquePtr] Remove uses of CreateConstGEP1_64() without element type	2021-07-17 16:43:20 +02:00
CGBlocks.cpp	[clang][NFC] GetOrCreateLLVMGlobal takes LangAS	2021-08-23 14:55:58 +02:00
CGBlocks.h	[CodeGen] Simplify the way lifetime of block captures is extended	2020-06-11 16:06:22 -07:00
CGBuilder.h	[OpaquePtr] Remove uses of CreateGEP() without element type	2021-07-17 22:56:27 +02:00
CGBuiltin.cpp	[NFC][clang] Move IR-independent parts of target MV support to X86TargetParser.cpp	2021-08-30 09:48:48 -07:00
CGCUDANV.cpp	[OpaquePtr] Remove uses of CreateConstGEP1_32() without element type	2021-07-17 18:32:36 +02:00
CGCUDARuntime.cpp	…
CGCUDARuntime.h	[HIP] Emit kernel symbol	2021-03-01 16:31:40 -05:00
CGCXX.cpp	[OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type	2021-07-17 17:07:46 +02:00
CGCXXABI.cpp	Fix PR35902: incorrect alignment used for ubsan check.	2020-12-28 18:11:17 -05:00
CGCXXABI.h	[clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate	2021-01-12 19:44:01 +00:00
CGCall.cpp	[Clang] add support for error+warning fn attrs	2021-08-25 10:34:18 -07:00
CGCall.h	Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC.	2020-12-22 19:54:29 -05:00
CGClass.cpp	[OpaquePtr] Remove uses of CreateGEP() without element type	2021-07-17 22:56:27 +02:00
CGCleanup.cpp	[Windows SEH]: Fix -O2 crash for Windows -EHa	2021-06-04 14:07:44 -07:00
CGCleanup.h	[XCOFF][AIX] Generate LSDA data and compact unwind section on AIX	2020-12-02 18:42:44 +00:00
CGCoroutine.cpp	Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass"	2021-04-18 17:22:28 -07:00
CGDebugInfo.cpp	DebugInfo: Refactor/deduplicate various template argument list emission	2021-08-30 22:39:46 -07:00
CGDebugInfo.h	DebugInfo: Refactor/deduplicate various template argument list emission	2021-08-30 22:39:46 -07:00
CGDecl.cpp	[clang] NFC: change uses of `Expr->getValueKind` into `is?Value`	2021-07-28 03:09:31 +02:00
CGDeclCXX.cpp	PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members.	2021-08-24 19:53:44 -07:00
CGException.cpp	[WebAssembly] Warn on exception spec for Emscripten EH	2021-05-20 13:00:20 -07:00
CGExpr.cpp	[clang][CodeGen] GetDefaultAlignTempAlloca uses preferred alignment	2021-08-23 14:55:58 +02:00
CGExprAgg.cpp	[OpaquePtr] Remove uses of CreateInBoundsGEP() without element type	2021-07-17 21:27:16 +02:00
CGExprCXX.cpp	[NFC] More get/removeAttribute() cleanup	2021-08-17 21:05:41 -07:00
CGExprComplex.cpp	[Matrix] Implement C-style explicit type conversions for matrix types.	2021-04-10 11:48:41 +01:00
CGExprConstant.cpp	[Matrix] Implement C-style explicit type conversions for matrix types.	2021-04-10 11:48:41 +01:00
CGExprScalar.cpp	[OpenCL] Fix as_type(vec3) invalid store creation	2021-08-19 11:57:09 +01:00
CGGPUBuiltin.cpp	…
CGLoopInfo.cpp	[Clang] Ensure vector predication loop metadata is always emitted when pragma is specified.	2021-02-13 17:35:54 -06:00
CGLoopInfo.h	[SVE] Add support to vectorize_width loop pragma for scalable vectors	2021-01-08 11:37:27 +00:00
CGNonTrivialStruct.cpp	[CodeGen] Stop creating fake FunctionDecls when generating IR for	2021-06-29 14:22:33 -07:00
CGObjC.cpp	[clang][patch][FPEnv] Make initialization of C++ globals strictfp aware	2021-07-29 12:02:37 -04:00
CGObjCGNU.cpp	[OpaquePtr] Remove uses of CreateStructGEP() without element type	2021-07-17 18:48:21 +02:00
CGObjCMac.cpp	[clang] NFC: Fix range-based for loop warnings related to decl lookup	2021-04-19 18:31:31 +02:00
CGObjCRuntime.cpp	[OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC)	2021-03-12 21:01:16 +01:00
CGObjCRuntime.h	[clang] Implement objc_non_runtime_protocol to remove protocol metadata	2020-10-02 17:35:50 -04:00
CGOpenCLRuntime.cpp	…
CGOpenCLRuntime.h	…
CGOpenMPRuntime.cpp	[OpenMP][OpenACC] Implement `ompx_hold` map type modifier extension in Clang (1/2)	2021-08-31 16:13:49 -04:00
CGOpenMPRuntime.h	[OpenMP] Creating the `omp_target_num_teams` and `omp_target_thread_limit` attributes to outlined functions	2021-07-27 17:21:04 -04:00
CGOpenMPRuntimeAMDGCN.cpp	[openmp][nfc] Replace OMPGridValues array with struct	2021-08-19 13:25:42 +01:00
CGOpenMPRuntimeAMDGCN.h	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3	2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeGPU.cpp	[openmp][nfc] Refactor GridValues	2021-08-23 16:19:11 +01:00
CGOpenMPRuntimeGPU.h	[openmp][nfc] Replace OMPGridValues array with struct	2021-08-19 13:25:42 +01:00
CGOpenMPRuntimeNVPTX.cpp	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3	2020-08-03 05:38:39 +00:00
CGOpenMPRuntimeNVPTX.h	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3	2020-08-03 05:38:39 +00:00
CGRecordLayout.h	[ARM] Follow AACPS standard for volatile bit-fields access width	2020-10-13 10:31:48 +01:00
CGRecordLayoutBuilder.cpp	[CodeGen] Use getCharWidth() more consistently in CGRecordLowering. NFC	2021-01-22 21:12:17 +01:00
CGStmt.cpp	[NFC] More get/removeAttribute() cleanup	2021-08-17 21:05:41 -07:00
CGStmtOpenMP.cpp	[OpenMP][OpenMPIRBuilder] Implement loop unrolling.	2021-09-02 02:37:25 -05:00
CGVTT.cpp	[AMDGPU] Set the default globals address space to 1	2020-11-20 15:46:53 +00:00
CGVTables.cpp	[Clang][Codegen] Do not annotate thunk's this/return types with align/deref/nonnull attrs	2021-05-13 20:33:08 +03:00
CGVTables.h	[clang] Frontend components for the relative vtables ABI (round 2)	2020-06-11 11:17:08 -07:00
CGValue.h	[AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair.	2020-10-13 13:26:56 +02:00
CMakeLists.txt	Remove dependency on clangASTMatchers.	2020-09-10 22:17:48 -04:00
CodeGenABITypes.cpp	[CodeGen] Add public function to emit C++ destructor call.	2020-07-01 11:01:23 -07:00
CodeGenAction.cpp	[Clang] add support for error+warning fn attrs	2021-08-25 10:34:18 -07:00
CodeGenFunction.cpp	Ensure field-annotations on pointers properly match the AS of the field.	2021-09-01 06:12:24 -07:00
CodeGenFunction.h	[OpenMP][OpenMPIRBuilder] Implement loop unrolling.	2021-09-02 02:37:25 -05:00
CodeGenModule.cpp	[OpenCL] Defines helper function for kernel language compatible OpenCL version	2021-08-31 10:08:38 +01:00
CodeGenModule.h	[clang][NFC] GetOrCreateLLVMGlobal takes LangAS	2021-08-23 14:55:58 +02:00
CodeGenPGO.cpp	[PGO] Don't reference functions unless value profiling is enabled	2021-05-20 11:09:24 -07:00
CodeGenPGO.h	[PGO] Don't reference functions unless value profiling is enabled	2021-05-20 11:09:24 -07:00
CodeGenTBAA.cpp	Reland Implement _ExtInt as an extended int type specifier.	2020-04-17 10:45:48 -07:00
CodeGenTBAA.h	…
CodeGenTypeCache.h	Fix __attribute__((annotate("")) with non-zero globals AS	2021-08-26 10:09:40 +01:00
CodeGenTypes.cpp	[Clang][RISCV] Define RISC-V V builtin types	2021-02-18 10:17:31 +08:00
CodeGenTypes.h	CodeGenTypes::CGRecordLayouts: Use unique_ptr to simplify memory management	2020-04-28 22:31:16 -07:00
ConstantEmitter.h	attempt to fix failing buildbots after `3bab88b7ba`	2020-06-15 12:58:37 +02:00
ConstantInitBuilder.cpp	Fix ConstantAggregateBuilderBase::getRelativeOffset	2020-06-15 12:23:20 -07:00
CoverageMappingGen.cpp	Revert "Revert "[Coverage] Emit gap region between statements if first statements contains terminate statements.""	2021-03-04 11:52:43 -08:00
CoverageMappingGen.h	[Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir}	2021-02-25 21:40:12 -08:00
EHScopeStack.h	[Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1	2021-05-17 22:42:17 -07:00
ItaniumCXXABI.cpp	PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members.	2021-08-24 19:53:44 -07:00
MacroPPCallbacks.cpp	…
MacroPPCallbacks.h	…
MicrosoftCXXABI.cpp	TypeInfo records more information about align requirement	2021-08-28 19:47:48 -04:00
ModuleBuilder.cpp	[clang/Basic] Make TargetInfo.h not use DataLayout again	2021-04-27 22:26:10 -04:00
ObjectFilePCHContainerOperations.cpp	[clang/Basic] Make TargetInfo.h not use DataLayout again	2021-04-27 22:26:10 -04:00
PatternInit.cpp	Clean up usages of asserting vector getters in Type	2020-04-13 13:01:40 -07:00
PatternInit.h	…
README.txt	Revert "This is a test commit"	2020-12-23 13:04:37 -06:00
SanitizerMetadata.cpp	[clang][patch] Inclusive language, modify filename SanitizerBlacklist.h to NoSanitizeList.h	2021-02-22 15:11:37 -05:00
SanitizerMetadata.h	[Analysis/Transforms/Sanitizers] As part of using inclusive language	2020-06-20 00:42:26 -07:00
SwiftCallingConv.cpp	Teach the swift calling convention about _Atomic types	2020-08-31 07:07:25 -07:00
TargetInfo.cpp	TypeInfo records more information about align requirement	2021-08-28 19:47:48 -04:00
TargetInfo.h	[Clang][AArch64] Inline assembly support for the ACLE type 'data512_t'	2021-07-31 09:51:28 +01:00
VarBypassDetector.cpp	[clang,NFC] Fix typos in file headers	2021-02-25 12:47:02 -08:00
VarBypassDetector.h	[clang,NFC] Fix typos in file headers	2021-02-25 12:47:02 -08:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//