llvm-project

Commit Graph

Author	SHA1	Message	Date
Chengji Yao	e5a4cf6743	[mlir] Fix printer when it is a DenseElementsAttr of i1 A large DenseElementsAttr of i1could trigger a bug in printer/parser roundtrip. Ex. A DenseElementsAttr of i1 with 200 elements will print as Hex format of length 400 before the fix. However, when parsing the printed text, an error will be triggered. After fix, the printed length will be 50. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122925	2022-05-05 16:56:43 -07:00
Christopher Bate	22c6e7b277	[mlir][nvvm] Fix support for tf32 data type in mma.sync The NVVM dialect test coverage for all possible type/shape combinations in the `nvvm.mma.sync` op is mostly complete. However, there were tests missing for TF32 datatype support. This change adds tests for the one relevant shape/type combination. This uncovered a small bug in the op verifier, which this change also fixes. Differential Revision: https://reviews.llvm.org/D124975	2022-05-05 11:02:03 -06:00
Min-Yih Hsu	794c4218a6	[mlir][LLVMIR] Do not update instMap via assignments to entry references Inside processInstruction, we assign the translated mlir::Value to a reference previously taken from the corresponding entry in instMap. However, instMap (a DenseMap) might resize after the entry reference was taken, rendering the assignment useless since it's assigning to a dangling reference. Here is a (pseudo) snippet that shows the concept: ``` // inst has type llvm::Instruction * Value &v = instMap[inst]; ... // op is one of the operands of inst, has type llvm::Value * processValue(op); // instMap resizes inside processValue ... translatedValue = b.createOp<Foo>(...); // v is already a dangling reference at this point! // The following assignment is bogus. v = translatedValue; ``` Nevertheless, after we stop caching llvm::Constant into instMap, there is only one case that can cause processValue to resize instMap: If the operand is a llvm::ConstantExpr. In which case we will insert the derived llvm::Instruction into instMap. To trigger instMap to resize, which is a DenseMap, the threshold depends on the ratio between # of map entries and # of (hash) buckets. More specifically, it resizes if (# of map entries / # of buckets) >= 0.75. In this case # of map entries is equal to # of LLVM instructions, and # of buckets is the power-of-two upperbound of # of map entries. Thus, eventually in the attaching test case (test/Target/LLVMIR/Import/incorrect-instmap-assignment.ll), we picked 96 and 128 for the # of map entries and # of buckets, respectively. (We can't pick numbers that are too small since DenseMap used inlined storage for small number of entries). Therefore, the ConstantExpr in the said test case (i.e. a GEP) is the 96-th llvm::Value cached into the instMap, triggering the issue we're discussing here on its enclosing instruction (i.e. a load). This patch fixes this issue by calling `operator[]` everytime we need to update an entry. Differential Revision: https://reviews.llvm.org/D124627	2022-05-04 13:16:51 -07:00
Min-Yih Hsu	857eb4a152	[mlir][LLVMIR] Add support for translating Switch instruction Add support for translating llvm::SwitchInst. Differential Revision: https://reviews.llvm.org/D124628	2022-05-03 09:41:40 -07:00
Min-Yih Hsu	e927a336a5	[mlir][LLVMIR] Add support for translating FCmp & FP constants This patch add supports for translating FCmp and more kinds of FP constants in addition to 32 & 64-bit ones. However, we can't express ppc_fp128 constants right now because the semantics for its underlying APFloat is `S_PPCDoubleDouble` but mlir::FloatType doesn't support such semantics right now. Differential Revision: https://reviews.llvm.org/D124630	2022-05-02 16:22:35 -07:00
Marius Brehler	84fe39a45b	[mlir][emitc] Add a cast op This adds a cast operation that allows to perform an explicit type conversion. The cast op is emitted as a C-style cast. It can be applied to integer, float, index and EmitC types. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D123514	2022-04-28 15:50:59 +00:00
Marius Brehler	50d648b40e	[mlir][emitc] Replace !emitc.opaque pointers Replaces using !emitc.opaque pointers which using !emitc.ptr types.	2022-04-28 15:20:39 +00:00
Min-Yih Hsu	a75657d66a	[mlir][LLVMIR] Do not cache llvm::Constant into instMap Constants in MLIR are not globally unique, unlike that in LLVM IR. Therefore, reusing previous-translated constants might cause the user operations not being dominated by the constant (because the previous-translated ones can be placed in arbitrary place) This indeed misses some opportunities where we actually can reuse a previous-translated constants, but verbosity is not our first priority here. Differential Revision: https://reviews.llvm.org/D124404	2022-04-27 09:43:49 -07:00
Min-Yih Hsu	ea9bcb8b27	[mlir][LLVMIR] Do not cache Instruction generated on-the-fly More specifically, the llvm::Instruction generated by llvm::ConstantExpr::getAsInstruction. Such Instruction will be deleted right away, but it's possible that when getAsInstruction is called again, it will create a new Instruction that has the same address with the one we just deleted. Thus, we shouldn't keep it in the `instMap` to avoid a conflicting index that triggers an assertion in processInstruction. Differential Revision: https://reviews.llvm.org/D124402	2022-04-27 09:42:59 -07:00
Min-Yih Hsu	00fcf9e95a	[mlir][LLVMIR] Add support for importing struct-type ConstantAggregate(Zero) And move importer test files from `test/Target/LLVMIR` into `test/Target/LLVMIR/Import`. We simply translate struct-type ConstantAggregate(Zero) into a serious of `llvm.insertvalue` operations against a `llvm.undef` root. Note that this doesn't affect the original logics on translating vector/array-type ConstantAggregate values. Differential Revision: https://reviews.llvm.org/D124399	2022-04-27 09:42:26 -07:00
Michael Kruse	ff289feeba	[OpenMPIRBuilder] Remove ContinuationBB argument from Body callback. The callback is expected to create a branch to the ContinuationBB (sometimes called FiniBB in some lambdas) argument when finishing. This creates problems: 1. The InsertPoint used for CodeGenIP does not need to be the end of a block. If it is not, a naive callback will insert a branch instruction into the middle of the block. 2. The BasicBlock the CodeGenIP is pointing to may or may not have a terminator. There is an conflict where to branch to if the block already has a terminator. 3. Some API functions work only with block having a terminator. Some workarounds have been used to insert a temporary terminator that is removed again. 4. Some callbacks are sensitive to whether the BasicBlock has a terminator or not. This creates a callback ordering problem where different callback may have different behaviour depending on whether a previous callback created a terminator or not. The problem also exists for FinalizeCallbackTy where some callbacks do create branch to another "continue" block, but unlike BodyGenCallbackTy does not receive the target as argument. This is not addressed in this patch. With this patch, the callback receives an CodeGenIP into a BasicBlock where to insert instructions. If it has to insert control flow, it can split the block at that position as needed but otherwise no separate ContinuationBB is needed. In particular, a callback can be empty without breaking the emitted IR. If the caller needs the control flow to branch to a specific target, it can insert the branch instruction itself and pass an InsertPoint before the terminator to the callback. Certain frontends such as Clang may expect the current IRBuilder position to be at the end of a basic block. In this case its callbacks must split the block at CodeGenIP before setting the IRBuilder position such that the instructions after CodeGenIP are moved to another basic block and before returning create a new branch instruction to the split block. Some utility functions such as `splitBB` are supporting correct splitting of BasicBlocks, independent of whether they have a terminator or not, returning/setting the InsertPoint of an IRBuilder to the end of split predecessor block, and optionally omitting creating a branch to the split successor block to be added later. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D118409	2022-04-26 16:35:01 -05:00
River Riddle	a4936cb3e8	[mlir][NFC] Update textual references of `func` to `func.func` in Pass/Target tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
Shraiysh Vaishay	88bb2521b0	[mlir][OpenMP] Add checks and tests for hint clause and fix empty hint This patch handles empty hint value for critical and atomic constructs. This also adds checks and tests for hint clause on atomic constructs. Reviewed By: peixin, kiranchandramohan, NimishMishra Differential Revision: https://reviews.llvm.org/D123186	2022-04-21 07:31:03 +05:30
jacquesguan	590a38920f	[mlir][LLVMIR] Add vector predication type cast intrinsic ops. This patch adds vector predication type cast intrinsic ops. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123996	2022-04-20 02:11:14 +00:00
Michael Kruse	2d92ee97f1	Reapply "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `af0285122f`. The test "libomp::loop_dispatch.c" on builder openmp-gcc-x86_64-linux-debian fails from time-to-time. See #54969. This patch is unrelated.	2022-04-18 21:56:47 -05:00
Michael Kruse	af0285122f	Revert "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `9ec501da76`. It may have caused the openmp-gcc-x86_64-linux-debian buildbot to fail. https://lab.llvm.org/buildbot/#/builders/4/builds/20377	2022-04-18 14:38:31 -05:00
Michael Kruse	9ec501da76	[OpenMP] Refactor OMPScheduleType enum. The OMPScheduleType enum stores the constants from libomp's internal sched_type in kmp.h and are used by several kmp API functions. The enum values have an internal structure, namely each scheduling algorithm (e.g.) exists in four variants: unordered, orderend, normerge unordered, and nomerge ordered. This patch (basically a followup to D114940) splits the "ordered" and "nomerge" bits into separate flags, as was already done for the "monotonic" and "nonmonotonic", so we can apply bit flags operations on them. It also now contains all possible combinations according to kmp's sched_type. Deriving of the OMPScheduleType enum from clause parameters has been moved form MLIR's OpenMPToLLVMIRTranslation.cpp to OpenMPIRBuilder to make available for clang as well. Since the primary purpose of the flag is the binary interface to libomp, it has been made more private to LLVMFrontend. The primary interface for generating worksharing-loop using OpenMPIRBuilder code becomes `applyWorkshareLoop` which derives the OMPScheduleType automatically and calls the appropriate emitter function. While this is mostly a NFC refactor, it still applies the following functional changes: * The logic from OpenMPToLLVMIRTranslation to derive the OMPScheduleType also applies to clang. Most notably, it now applies the nonmonotonic flag for non-static schedules by default. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was previously not applied if the simd modifier was used. I assume this was a bug, since the effect was due to `loop.schedule_modifier()` returning `mlir::omp::ScheduleModifier::none` instead of `llvm::Optional::None`. * In OpenMPToLLVMIRTranslation, the nonmonotonic default flag was set even if ordered was specified, in breach to what the comment before citing the OpenMP specification says. I assume this was an oversight. The ordered flag with parameter was not considered in this patch. Changes will need to be made (e.g. adding/modifying function parameters) when support for it is added. The lengthy names of the enum values can be discussed, for the moment this is avoiding reusing previously existing enum value names such as `StaticChunked` to avoid confusion. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D123403	2022-04-18 14:03:17 -05:00
Alex Zinenko	6c5ae8e974	[mlir] Support opaque types in LLVM IR -> MLIR translation LLVM IR is moving towards adoption of opaque pointer types. These require extra information to be passed when constructing some operations, in particular GEP and Alloca. Adapt the builders of said operations and modify the translation code to handle both opaque and non-opaque pointers. This incidentally adds the translation for Alloca alignment and fixes the translation of struct-related GEP indices that must be constant. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123792	2022-04-15 17:51:31 +02:00
jacquesguan	3d79c52f31	[mlir][LLVMIR] Add more vector predication intrinsic ops. This revision adds vector predication select, merge and load/store intrinsic ops. Differential Revision: https://reviews.llvm.org/D123477	2022-04-15 02:13:42 +00:00
Alex Zinenko	2366a43b3c	[mlir] initial support for opaque pointers in the LLVM dialect LLVM IR has introduced and is moving forward with the concept of opaque pointers, i.e. pointer types that are not carrying around the pointee type. Instead, memory-related operations indicate the type of the data being accessed through the opaque pointer. Introduce the initial support for opaque pointers in the LLVM dialect: - `LLVMPointerType` to support omitting the element type; - alloca/load/store/gep to support opaque pointers in their operands and results; this requires alloca and gep to store the element type as an attribute; - memory-related intrinsics to support opaque pointers in their operands; - translation to LLVM IR for the ops above is no longer using methods deprecated in LLVM API due to the introduction of opaque pointers. Unlike LLVM IR, MLIR can afford to support both opaque and non-opaque pointers at the same time and simplify the transition. Translation to LLVM IR of MLIR that involves opaque pointers requires the LLVMContext to be configured to always use opaque pointers. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D123310	2022-04-14 13:23:29 +02:00
Shraiysh Vaishay	f0ba32d666	[mlir][LLVM-IR] Added support for global variable attributes This patch adds thread_local to llvm.mlir.global and adds translation for dso_local and addr_space to and from LLVM IR. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D123412	2022-04-13 08:31:01 +05:30
Peixin-Qiao	139c5b8a96	[MLIR][OpenMP] Add support for threadprivate directive This supports the threadprivate directive in OpenMP dialect following the OpenMP 5.1 [2.21.2] standard. Also lowering to LLVM IR using OpenMP IRBduiler. Reviewed By: kiranchandramohan, shraiysh, arnamoy10 Differential Revision: https://reviews.llvm.org/D123350	2022-04-12 10:15:15 +08:00
jacquesguan	5bd7b0efd0	[mlir][LLVMIR] Add more vector predication intrinsic ops. This revision adds float unary, ternary and float/integer reduction intrinsic ops. Differential Revision: https://reviews.llvm.org/D123189	2022-04-08 03:16:37 +00:00
jacquesguan	2420d42925	[mlir][LLVMIR] Add vector predication binary intrinsic ops. Differential Revision: https://reviews.llvm.org/D122971	2022-04-06 10:48:24 +08:00
Michael Kruse	c082ca16f1	[OpenMPIRBuilder] Detect and fix ambiguous InsertPoints for createSections. Follow-up on D117226 for createSections. Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D117835	2022-04-05 12:36:29 -05:00
Peixin-Qiao	3e7415a0ff	[OMPIRBuilder] Support ordered clause specified without parameter This patch supports ordered clause specified without parameter in worksharing-loop directive in the OpenMPIRBuilder and lowering MLIR to LLVM IR. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D114940	2022-04-01 16:17:29 +08:00
Christopher Bate	3be7c28917	[mlir][NVVM] Add support for nvvm mma.sync ops This patch adds MLIR NVVM support for the various NVPTX `mma.sync` operations. There are a number of possible data type, shape, and other attribute combinations supported by the operation, so a custom assebmly format is added and attributes are inferred where possible. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D122410	2022-03-25 17:28:05 +00:00
Shraiysh Vaishay	8722c12c12	[mlir][OpenMP][IRBuilder] Add support for nowait on single construct This patch adds the nowait parameter to `createSingle` in OpenMPIRBuilder and handling for IR generation from OpenMP Dialect. Also added tests for the same. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122371	2022-03-24 22:51:52 +05:30
Shraiysh Vaishay	3c0d470865	[mlir][OpenMP] omp.single translation to LLVM IR This patch adds translation from omp.single to LLVM IR. Depends on D122288 Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D122297	2022-03-24 10:07:30 +05:30
Shraiysh Vaishay	b244bba582	[mlir][OpenMP] Added assembly format for omp.wsloop and remove parseClauses This patch - adds assembly format for `omp.wsloop` operation - removes the `parseClauses` clauses as it is not required anymore This is expected to be the final patch in a series of patches for replacing parsers for clauses with `oilist`. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D121367	2022-03-23 10:02:02 +05:30
Shraiysh Vaishay	31486a9fc2	[mlir][OpenMP] Added translation from `omp.atomic.capture` to LLVM IR This patch adds translation from `omp.atomic.capture` to LLVM IR. Also added tests for the same. Depends on D121546 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D121554	2022-03-21 16:39:36 +05:30
Shraiysh Vaishay	423e3edc27	[OpenMP][IRBuilder] Fix emitAtomicUpdate conditions This patch fixes the condition for emitting atomic update using `atomicrmw` instruction or compare-exchange loop. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D121546	2022-03-21 16:19:43 +05:30
Eric Schweitz	1c083e6985	[mlir] Add support for the nest attribute to the LLVM-IR dialect conversion. Differential Revision: https://reviews.llvm.org/D121948	2022-03-18 08:15:03 -07:00
River Riddle	3655069234	[mlir] Move the Builtin FuncOp to the Func dialect This commit moves FuncOp out of the builtin dialect, and into the Func dialect. This move has been planned in some capacity from the moment we made FuncOp an operation (years ago). This commit handles the functional aspects of the move, but various aspects are left untouched to ease migration: func::FuncOp is re-exported into mlir to reduce the actual API churn, the assembly format still accepts the unqualified `func`. These temporary measures will remain for a little while to simplify migration before being removed. Differential Revision: https://reviews.llvm.org/D121266	2022-03-16 17:07:03 -07:00
Ian Bearman	7ecb7efc89	[MLIR] UnknownLoc on Inlinable Calls in LLVMIR Translation During MLIR translation to LLVMIR if an inlineable call has an UnkownLoc we get this error message: ``` inlinable function call in a function with debug info must have a !dbg location call void @callee() ``` There is code that checks for this case and strips debug information to avoid this situation. I'm expanding this code to handle the case where an debug location points at a UnknownLoc. For example, a NamedLoc whose child location is an UnknownLoc. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D121633	2022-03-15 14:48:50 -07:00
Arnamoy Bhattacharyya	0e9198c3e9	[MLIR][OpenMP] Add support for basic SIMD construct Patch adds a new operation for the SIMD construct. The op is designed to be very similar to the existing `wsloop` operation, so that the `CanonicalLoopInfo` of `OpenMPIRBuilder` can be used. Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D118065	2022-03-15 09:41:04 -04:00
Thomas Raoux	6d007e0278	[mlir][nvvm] Fix bug in ldmatrix intrinsic conversion The ldmatrix intrinsic trans option was inverted. Bug found by @christopherbate! Differential Revision: https://reviews.llvm.org/D121666	2022-03-15 05:04:09 +00:00
Thomas Raoux	2f33f11428	[mlir][NVVM] Add ldmatrix op to NVVM dialect Differential Revision: https://reviews.llvm.org/D121347	2022-03-10 20:37:17 +00:00
Shraiysh Vaishay	a1ac771247	[MLIR][OpenMP][NFC] Move the modifier testcases to correct directory This patch moves the testcases from `mlir/test/Target/LLVMIR/openmp-llvm-bad-schedule-modifier.mlir` to `mlir/test/Dialect/OpenMP/invalid.mlir` as they test the verifier (not the translation to LLVM IR). Reviewed By: NimishMishra Differential Revision: https://reviews.llvm.org/D120877	2022-03-10 21:32:00 +05:30
Shraiysh Vaishay	6dd54da5a5	[OpenMP][mlir] Lowering for omp.atomic.update This patch adds lowering from omp.atomic.update to LLVM IR. Whenever a special LLVM IR instruction is available for the operation, `atomicrmw` instruction is emitted, otherwise a compare-exchange loop based update is emitted. Depends on D119522 Reviewed By: ftynse, peixin Differential Revision: https://reviews.llvm.org/D119657	2022-03-10 18:28:51 +05:30
Lei Zhang	cfb9e474ae	[mlir][spirv] Define spv.VectorTimesScalar op Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D121247	2022-03-08 15:58:31 -05:00
River Riddle	47f175b09b	[mlir] Update FuncOp conversion passes to Pass/InterfacePass<FunctionOpInterface> These passes generally don't rely on any special aspects of FuncOp, and moving allows for these passes to be used in many more situations. The passes that obviously weren't relying on invariants guaranteed by a "function" were updated to be generic pass, the rest were updated to be FunctionOpinterface InterfacePasses. The test updates are NFC switching from implicit nesting (-pass -pass2) form to the -pass-pipeline form (generic passes do not implicitly nest as op-specific passes do). Differential Revision: https://reviews.llvm.org/D121190	2022-03-08 12:25:32 -08:00
River Riddle	5a7b919409	[mlir][NFC] Rename StandardToLLVM to FuncToLLVM The current StandardToLLVM conversion patterns only really handle the Func dialect. The pass itself adds patterns for Arithmetic/CFToLLVM, but those should be/will be split out in a followup. This commit focuses solely on being an NFC rename. Aside from the directory change, the pattern and pass creation API have been renamed: * populateStdToLLVMFuncOpConversionPattern -> populateFuncToLLVMFuncOpConversionPattern * populateStdToLLVMConversionPatterns -> populateFuncToLLVMConversionPatterns * createLowerToLLVMPass -> createConvertFuncToLLVMPass Differential Revision: https://reviews.llvm.org/D120778	2022-03-07 11:25:23 -08:00
William S. Moses	87ec6f41bb	[OpenMPIRBuilder] Allocate temporary at the correct block in a nested parallel The OpenMPIRBuilder has a bug. Specifically, suppose you have two nested openmp parallel regions (writing with MLIR for ease) ``` omp.parallel { %a = ... omp.parallel { use(%a) } } ``` As OpenMP only permits pointer-like inputs, the builder will wrap all of the inputs into a stack allocation, and then pass this allocation to the inner parallel. For example, we would want to get something like the following: ``` omp.parallel { %a = ... %tmp = alloc store %tmp[] = %a kmpc_fork(outlined, %tmp) } ``` However, in practice, this is not what currently occurs in the context of nested parallel regions. Specifically to the OpenMPIRBuilder, the entirety of the function (at the LLVM level) is currently inlined with blocks marking the corresponding start and end of each region. ``` entry: ... parallel1: %a = ... ... parallel2: use(%a) ... endparallel2: ... endparallel1: ... ``` When the allocation is inserted, it presently inserted into the parent of the entire function (e.g. entry) rather than the parent allocation scope to the function being outlined. If we were outlining parallel2, the corresponding alloca location would be parallel1. This causes a variety of bugs, including https://github.com/llvm/llvm-project/issues/54165 as one example. This PR allows the stack allocation to be created at the correct allocation block, and thus remedies such issues. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D121061	2022-03-06 18:34:25 -05:00
Peixin-Qiao	a5605c9a15	[MLIR] Fix afterIP for dynamic worksharing-loop after collaping loops The loopInfos gets invalidated after collapsing nested loops. Use the saved afterIP since the returned afterIP by applyDynamicWorkshareLoop may be not valid. Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D120294	2022-03-03 15:22:20 +08:00
William S. Moses	758ddba381	[MLIR] Use Datalayout defaults when importing LLVM LLVM defines several default datalayouts for integer and floating point types that are not being considered when importing into MLIR. This patch remedies this. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120832	2022-03-02 13:00:53 -05:00
Alex Zinenko	eb27da7dec	[mlir] Ignore index data layout in translation to LLVM It can be present, but is irrelevant for the translation.	2022-03-02 16:56:21 +01:00
Alex Zinenko	ea998709eb	[mlir] Add partial support for translating data layout Add support for translating data layout specifications for integer and float types between MLIR and LLVM IR. This is a first step towards removing the string-based LLVM dialect data layout attribute on modules. The latter is still available and will remain so until the first-class MLIR modeling can fully replace it. Depends On D120739 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D120740	2022-03-02 14:56:50 +01:00
Michael Kruse	a66f7769a3	[OpenMPIRBuilder] Implement static-chunked workshare-loop schedules. Add applyStaticChunkedWorkshareLoop method implementing static schedule when chunk-size is specified. Unlike a static schedule without chunk-size (where chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the threads. This patch includes the following related changes: * Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder. * Remove the chunk parameter from applyStaticWorkshareLoop, it is ignored by the runtime. Change the value for the value passed to the init function to 0, as without OpenMPIRBuilder. * Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar as used by both, applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop. * Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause. Differential Revision: https://reviews.llvm.org/D114413	2022-02-28 18:18:33 -06:00
Marius Brehler	1fa1251116	[mlir][emitc] Add a variable op This adds a variable op, emitted as C/C++ locale variable, which can be used if the `emitc.constant` op is not sufficient. As an example, the canonicalization pass would transform ```mlir %0 = "emitc.constant"() {value = 0 : i32} : () -> i32 %1 = "emitc.constant"() {value = 0 : i32} : () -> i32 %2 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> %3 = emitc.apply "&"(%1) : (i32) -> !emitc.ptr<i32> emitc.call "write"(%2, %3) : (!emitc.ptr<i32>, !emitc.ptr<i32>) -> () ``` into ```mlir %0 = "emitc.constant"() {value = 0 : i32} : () -> i32 %1 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> %2 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> emitc.call "write"(%1, %2) : (!emitc.ptr<i32>, !emitc.ptr<i32>) -> () ``` resulting in pointer aliasing, as %1 and %2 point to the same address. In such a case, the `emitc.variable` operation can be used instead. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D120098	2022-02-24 15:25:21 +00:00

1 2 3 4 5 ...

372 Commits