llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass must have been also run the last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Mehdi Amini	78accf9f35	Make LLVM Linkage a first class attribute instead of using an integer attribute This makes the IR more readable, in particular when this will be used on the builtin func outside of the LLVM dialect. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D109209	2021-09-03 21:21:46 +00:00
Alex Zinenko	f9be7a7afd	[mlir] speed up construction of LLVM IR constants when possible The translation to LLVM IR used to construct sequential constants by recurring down to individual elements, creating constant values for them, and wrapping them into aggregate constants in post-order. This is highly inefficient for large constants with known data such as DenseElementsAttr. Use LLVM's ConstantData for the innermost dimension instead. LLVM does seem to support data constants for nested sequential constants so the outer dimensions are still handled recursively. Nevertheless, this speeds up the translation of large constants with equal dimensions by up to 30x. Users are advised to rewrite large constants to use flat types before translating to LLVM IR if more efficiency in translation is necessary. This is not done automatically as the translation is not aware of the expectations of the overall compilation flow about type changes and indexing, in particular for global constants with external linkage. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109152	2021-09-02 23:07:30 +02:00
Kiran Chandramohan	711aa35759	[MLIR][OpenMP] Add support for declaring critical construct names Add an operation omp.critical.declare to declare names/symbols of critical sections. Named omp.critical operations should use symbols declared by omp.critical.declare. Having a declare operation ensures that the names of critical sections are global and unique. In the lowering flow to LLVM IR, the OpenMP IRBuilder creates unique names for critical sections. Reviewed By: ftynse, jeanPerier Differential Revision: https://reviews.llvm.org/D108713	2021-09-02 14:31:19 +00:00
Alex Zinenko	8647e4c3a0	[mlir] support translating OpenMP loops with reductions Use the recently introduced OpenMPIRBuilder facility to transate OpenMP workshare loops with reductions to LLVM IR calling OpenMP runtime. Most of the heavy lifting is done at the OpenMPIRBuilder. When other OpenMP dialect constructs grow support for reductions, the translation can be updated to operate on, e.g., an operation interface for all reduction containers instead of workshare loops specifically. Designing such a generic translation for the single operation that currently supports reductions is premature since we don't know how the reduction modeling itself will be generalized. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D107343	2021-09-02 15:38:20 +02:00
Tyler Augustine	d25e91d7f6	Support alias.scope and noalias metadata Introduces new Ops to represent 1. alias.scope metadata in LLVM, and 2. domains for these scopes. These correspond to the metadata described in https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata. Lists of scopes are modeled the same way as access groups - as an ArrayAttr on the Op (added in https://reviews.llvm.org/D97944). Lowering 'noalias' attributes on function parameters is already supported. However, lowering `noalias` metadata on individual Ops is not, which is added in this change. LLVM uses the same keyword for these, but this change introduces a separate attribute name 'noalias_scopes' to represent this distinct concept. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107870	2021-08-24 20:42:59 +02:00
Florian Hahn	f999312872	Recommit "[Matrix] Overload stride arg in matrix.columnwise.load/store." This reverts the revert `28c04794df`. The failing MLIR test that caused the revert should be fixed in this version. Also includes a PPC test fix previously in `1f87c7c478`.	2021-08-12 18:31:57 +01:00
Alex Zinenko	c4c1030976	[mlir] support collapsed loops in OpenMP-to-LLVM translation Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105706	2021-08-06 17:13:12 +02:00
Kiran Chandramohan	59989d68ba	[MLIR][OpenMP] Add support for critical construct This patch adds the critical construct to the OpenMP dialect. The implementation models the definition in 2.17.1 of the OpenMP 5 standard. A name and hint can be specified. The name is a global entity or has external linkage, it is modelled as a FlatSymbolRefAttr. Hint is modelled as an integer enum attribute. Also lowering to LLVM IR using the OpenMP IRBuilder. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D107135	2021-08-03 10:50:21 +01:00
Eli Friedman	6eb2ffbaeb	Fix a couple regression tests I missed updating in `2a284782`	2021-07-31 13:41:15 -07:00
Valentin Clement	fe7ca1a9fc	[mlir][openacc] Initial translation for DataOp to LLVM IR Add basic translation of acc.data to LLVM IR with runtime calls. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104301	2021-07-27 22:04:04 -04:00
Alex Zinenko	c282d55a38	[mlir] add support for reductions in OpenMP WsLoopOp Use a modeling similar to SCF ParallelOp to support arbitrary parallel reductions. The two main differences are: (1) reductions are named and declared beforehand similarly to functions using a special op that provides the neutral element, the reduction code and optionally the atomic reduction code; (2) reductions go through memory instead because this is closer to the OpenMP semantics. See https://llvm.discourse.group/t/rfc-openmp-reduction-support/3367. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D105358	2021-07-09 17:54:20 +02:00
Alex Zinenko	d4df3825bd	[mlir] don't drop undef initializers in translation to LLVM IR LLVM IR allows globals with external linkage to have initializers, including undef. The translation was incorrectly using undef as a indicator that the initializer should be ignored in translation, leading to the impossibility to create an external global with an explicit undef initializer. Fix this and use nullptr as a marker instead. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D105631	2021-07-09 17:52:43 +02:00
Sean Silva	da289a174f	[mlir] Allow conversion to LLVM for ElementsAttr's with size 0 The code seems to correctly handle the case that this assertion was guarding. Differential Revision: https://reviews.llvm.org/D105379	2021-07-07 11:15:20 -07:00
Felipe de Azevedo Piovezan	8ca04b0513	[mlir] Add support for LLVM's dso_local attr This patch brings support for setting runtime preemption specifiers of LLVM's GlobalValues. In LLVM semantics, if the `dso_local` attribute is not explicitly requested, then it is inferred based on linkage and visibility. We model this same behavior with a UnitAttribute: if it is present, then we explicitly request the GlobalValue to marked as `dso_local`, otherwise we rely on the GlobalValue itself to make this decision. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D104983	2021-06-29 15:00:48 +02:00
Benoit Jacob	20daedacca	2d Arm Neon sdot op, and lowering to the intrinsic. This adds Sdot2d op, which is similar to the usual Neon intrinsic except that it takes 2d vector operands, reflecting the structure of the arithmetic that it's performing: 4 separate 4-dimensional dot products, whence the vector<4x4xi8> shape. This also adds a new pass, arm-neon-2d-to-intr, lowering this new 2d op to the 1d intrinsic. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102504	2021-06-10 14:36:39 -07:00
Javier Setoain	96ca2d92b5	[mlir][ArmSVE] Add basic load/store operations ArmSVE-specific memory operations are needed to generate end-to-end code for as long as MLIR core doesn't support scalable vectors. This instructions will be eventually unnecessary, for now they're required for more complex testing. Differential Revision: https://reviews.llvm.org/D103535	2021-06-09 15:53:40 +01:00
Javier Setoain	f880bd261f	[mlir][ArmSVE] Add basic mask generation operations These `arm_sve.cmp` functions are needed to generate scalable vector masks as long as scalable vectors are not part of the standard types. Once in standard, these can be removed and `std.cmp` can be used instead. Differential Revision: https://reviews.llvm.org/D103473	2021-06-09 09:56:53 +01:00
Javier Setoain	57546f5b22	Revert "[mlir][ArmSVE] Add basic mask generation operations" This reverts commit `392af6a78b`	2021-06-08 10:02:19 +01:00
Javier Setoain	392af6a78b	[mlir][ArmSVE] Add basic mask generation operations These `arm_sve.cmp` functions are needed to generate scalable vector masks as long as scalable vectors are not part of the standard types. Once in standard, these can be removed and `std.cmp` can be used instead. Differential Revision: https://reviews.llvm.org/D103473	2021-06-08 08:56:31 +01:00
Valentin Clement	1005ef445d	[mlir][openacc] Translate UpdateOp to LLVM IR Add translation to LLVM IR for the UpdateOp with host and device operands. Translation is done with call using the runtime. This is done in a similar way as D101504 and D102381. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D102382	2021-05-26 11:42:15 -04:00
Valentin Clement	ab5ff154ab	[mlir][openacc] Translate ExitDataop to LLVM IR Translate ExitDataOp with delete and copyout operands to runtime call. This is done in a similar way as D101504. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D102381	2021-05-17 11:11:59 -04:00
Adrian Kuegel	5ef21506b9	Add support for complex constants to MLIR core. BEGIN_PUBLIC Add support for complex constants to MLIR core. END_PUBLIC Differential Revision: https://reviews.llvm.org/D101908	2021-05-17 09:12:39 +02:00
Benoit Jacob	e0a88db545	Fix some typos. Fix some typos Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102503	2021-05-14 21:34:09 +05:30
Valentin Clement	113b807017	[mlir][openacc] Add OpenACC translation to LLVM IR (enter_data op create/copyin) This patch begins to translate acc.enter_data operation to call to tgt runtime call. It currently only translate create/copyin operands of memref type. This acts as a basis to add support for FIR types in the Flang/OpenACC support. It follows more or less a similar path than clang with `omp target enter data map` directives. This patch is taking a different approach than D100678 and perform a translation to LLVM IR and make use of the OpenMPIRBuilder instead of doing a conversion to the LLVMIR dialect. OpenACC support in Flang will rely on the current OpenMP runtime where 1:1 lowering can be applied. Some extension will be added where features are not available yet. Big part of this code will be shared for other standalone data operations in the OpenACC dialect such as acc.exit_data and acc.update. It is likely that parts of the lowering can also be shared later with the ops for standalone data directives in the OpenMP dialect when they are introduced. This is an initial translation and it probably needs more work. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D101504	2021-05-12 13:41:14 -04:00
Dumitru Potop	9a0ea5994b	[mlir] Support alignment in LLVM dialect GlobalOp First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D101492	2021-05-12 09:07:20 +02:00
Uday Bondhugula	1c777ab459	[MLIR] Switch llvm.noalias to a unit attribute Switch llvm.noalias attribute from a boolean attribute to a unit attribute. Differential Revision: https://reviews.llvm.org/D102225	2021-05-11 15:41:09 +05:30
Mats Petersson	7280f4b279	[OpenMP][MLIR]Add support for guided, auto and runtime scheduling When using parallel loop construct, the OpenMP specification allows for guided, auto and runtime as scheduling variants (as well as static and dynamic which are already supported). This adds the translation from MLIR to LLVM-IR for these scheduling variants. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101435	2021-05-10 09:18:52 +00:00
Alex Zinenko	72d013dd73	[mlir] OpenMP-to-LLVM: properly set outer alloca insertion point Previously, the OpenMP to LLVM IR conversion was setting the alloca insertion point to the same position as the main compuation when converting OpenMP `parallel` operations. This is problematic if, for example, the `parallel` operation is placed inside a loop and would keep allocating on stack on each iteration leading to stack overflow. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D101307	2021-05-10 10:04:52 +02:00
Navdeep Kumar	875eb523c1	[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM dialect. Add following three ops to GPU dialect :- 1.) subgroup_mma_load_matrix 2.) subgroup_mma_store_matrix 3.) subgroup_mma_compute Add following three ops to NVVM dialect :- 1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride 2.) wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse, ThomasRaoux Differential Revision: https://reviews.llvm.org/D95330	2021-05-06 12:06:25 +05:30
Javier Setoain	95861216ac	[mlir][ArmSVE] Add masked arithmetic operations These instructions map to SVE-specific instrinsics that accept a predicate operand to support control flow in vector code. Differential Revision: https://reviews.llvm.org/D100982	2021-05-05 17:41:58 +01:00
Javier Setoain	001d601ac4	[mlir][ArmSVE] Add basic arithmetic operations While we figure out how to best add Standard support for scalable vectors, these instructions provide a workaround for basic arithmetic between scalable vectors. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100837	2021-05-05 09:50:18 +02:00
Ranjith Kumar H	b65472d66d	[MLIR] Add and propagate section attribute for LLVM_GlobalOp Add a section attribute to LLVM_GlobalOp, during module translation attribute value is propagated to llvm Reviewed By: sgrechanik, ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D100947	2021-04-28 04:15:49 +00:00
clementval	c46a88625d	[mlir][llvm] Add UnnamedAddr attribute to GlobalOp This patch add the UnnamedAddr attribute for the GlobalOp in the LLVM dialect. The attribute is also handled to and from LLVM IR. This is meant to be used in a follow up patch to lower OpenACC/OpenMP ops to call to kmp and tgt runtime calls (D100678). Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D100677	2021-04-19 21:45:14 -04:00
Javier Setoain	b739bada9d	[mlir][ArmSVE] Cleanup dialect registration ArmSVE dialect is behind the recent changes in how the Vector dialect interacts with backend vector dialects and the MLIR -> LLVM IR translation module. This patch cleans up ArmSVE initialization within Vector and removes the need for an LLVMArmSVE dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D100171	2021-04-16 15:56:51 +02:00
Emilio Cota	0b63e3222b	[mlir] X86Vector: Add AVX Rsqrt Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D99818	2021-04-13 08:43:48 -07:00
Emilio Cota	8508a63b88	[mlir] Rename AVX512 dialect to X86Vector We will soon be adding non-AVX512 operations to MLIR, such as AVX's rsqrt. In https://reviews.llvm.org/D99818 several possibilities were discussed, namely to (1) add non-AVX512 ops to the AVX512 dialect, (2) add more dialects (e.g. AVX dialect for AVX rsqrt), and (3) expand the scope of the AVX512 to include these SIMD x86 ops, thereby renaming the dialect to something more accurate such as X86Vector. Consensus was reached on option (3), which this patch implements. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D100119	2021-04-12 19:20:04 +02:00
Jean Perier	ffa455d4d4	[mlir] Translate global initializers after creating all LLVM IR globals In case an operation in a global initializer region refers to another global variable defined afterwards in the module of itself, translation to LLVM IR was currently crashing because it could not find the LLVM IR global when going through the initializer block. To solve this problem, split global conversion to LLVM IR into two passes. A first pass that creates LLVM IR global variables, and a second one that converts the initializer, if any, and adds it to the llvm global. Differential Revision: https://reviews.llvm.org/D99246	2021-03-25 09:53:58 +01:00
Ahmed Taei	f5963944d9	Add arm_neon.sdot operation Differential Revision: https://reviews.llvm.org/D98198	2021-03-17 08:24:58 -07:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Arpith C. Jacob	b4a516cc43	[mlir] Add LLVM loop codegen options to control software pipelining Support specifying the II and disabling pipelining. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98420	2021-03-11 16:46:44 +01:00
Alex Zinenko	a776942ba1	[mlir] squash LLVM_AVX512 dialect into AVX512 The dialect separation was introduced to demarkate ops operating in different type systems. This is no longer the case after the LLVM dialect has migrated to using built-in vector types, so the original reason for separation is no longer valid. Squash the two dialects into one. The code size decrease isn't quite large: the ops originally in LLVM_AVX512 are preserved because they match LLVM IR intrinsics specialized for vector element bitwidth. However, it is still conceptually beneficial to have only one dialect. I originally considered to use Tablegen multiclasses to define both the type-polymorphic op and its two intrinsic-related instantiations, but decided against it given both the complexity of the required Tablegen input and its dissimilarity with the rest of ODS-defined ops, both potentially resulting in very poor maintainability. Depends On D98327 Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98328	2021-03-10 13:07:26 +01:00
Mehdi Amini	8205c1a90a	Rework LLVM Dialect LoopOptions attribute Instead of storing an array of LoopOpt attributes, which were just wrapping std::pair<enum, int> anyway, we can have an attribute storing a sorted ArrayRef<std::pair<enum, int>> as a single unit. This improves here the textual format and the general API. Note that we're limiting the options to fit into an int64_t by design, but this isn't a new constraint. Building the LoopOptions attribute is likely worth a specific builder for efficient reason, that'll be the subject of a future patch. Differential Revision: https://reviews.llvm.org/D98105	2021-03-09 19:43:45 +00:00
Alex Zinenko	8184247f0b	[mlir] move LLVM target import header and tests Move Target/LLVMIR.h to target/LLVMIR/Import.h to better reflect the purpose of this file. Also move all LLVM IR target tests under the LLVMIR directory. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98178	2021-03-09 09:22:14 +01:00

44 Commits