llvm-project

Commit Graph

Author	SHA1	Message	Date
Aart Bik	9826fe5c9f	[VectorOps] Add lowering of vector.insert to LLVM IR For example, an insert %0 = vector.insert %arg0, %arg1[3 : i32] : f32 into vector<4xf32> becomes %0 = llvm.mlir.constant(3 : i32) : !llvm.i32 %1 = llvm.insertelement %arg0, %arg1[%0 : !llvm.i32] : !llvm<"<4 x float>"> A more elaborate example, inserting an element in a higher dimension vector %0 = vector.insert %arg0, %arg1[3 : i32, 7 : i32, 15 : i32] : f32 into vector<4x8x16xf32> becomes %0 = llvm.extractvalue %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> %1 = llvm.mlir.constant(15 : i32) : !llvm.i32 %2 = llvm.insertelement %arg0, %0[%1 : !llvm.i32] : !llvm<"<16 x float>"> %3 = llvm.insertvalue %2, %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> PiperOrigin-RevId: 284882443	2019-12-10 17:12:49 -08:00
Andy Davis	4d8ba88610	Add VectorOp transform pattern which splits vector TransferReadOps to target vector unroll size. PiperOrigin-RevId: 284880592	2019-12-10 17:02:51 -08:00
Uday Bondhugula	36a415bcc5	More affine expr simplifications for floordiv and mod Add one more simplification for floordiv and mod affine expressions. Examples: (2d0 + 1) floordiv 2 is simplified to d0 (8d0 + 4d1 + d2) floordiv 4 simplified to 4d0 + d1 + d2 floordiv 4. etc. Similarly, (4d1 + 1) mod 2 is simplified to 1, (2d0 + 8d1) mod 8 simplified to 2d0 mod 8. Change getLargestKnownDivisor to return int64_t to be consistent and to avoid casting at call sites (since the return value is used in expressions of int64_t/index type). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#202 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/202 from bondhugula:affine b13fcb2f1c00a39ca5434613a02408e085a80e77 PiperOrigin-RevId: 284866710	2019-12-10 16:00:53 -08:00
Nicolas Vasilache	995048d7b7	Fold TestLinalgTilePermutePatterns into TestLinalgTransformPatterns - NFC Centralize all patterns that test Linalg transforms in a single pass. PiperOrigin-RevId: 284835938	2019-12-10 13:26:15 -08:00
Jose Ignacio Gomez	b19fed5415	[Linalg] Add a Linalg iterator permutation transformation This patch closes issue tensorflow/mlir#272 We add a standalone iterator permutation transformation to Linalg. This transformation composes a permutation map with the maps in the "indexing_maps" attribute. It also permutes "iterator_types" accordingly. Change-Id: I7c1e693b8203aeecc595a7c012e738ca1100c857 Closes tensorflow/mlir#307 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/307 from tetuante:issue272 f7908d58792f4111119721885e247045104f1131 PiperOrigin-RevId: 284824102	2019-12-10 12:25:43 -08:00
Nicolas Vasilache	ad38e49806	Uniformize Vector transforms as patterns on the model of Linalg - NFC This reorganizes the vector transformations to be more easily testable as patterns and more easily composable into fused passes in the future. PiperOrigin-RevId: 284817474	2019-12-10 11:54:33 -08:00
Aart Bik	1fe65688d4	[VectorOps] Add a ShuffleOp to the VectorOps dialect For example %0 = vector.shuffle %x, %y [3 : i32, 2 : i32, 1 : i32, 0 : i32] : vector<2xf32>, vector<2xf32> yields a vector<4xf32> result with a permutation of the elements of %x and %y PiperOrigin-RevId: 284657191	2019-12-09 16:15:41 -08:00
Aart Bik	0e963b9c42	[VectorOps] Fix off-by-one error in insert/extract validation PiperOrigin-RevId: 284652653	2019-12-09 15:54:23 -08:00
Denis Khalikov	34265dad65	[spirv] Add CompositeConstruct operation. Closes tensorflow/mlir#308 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/308 from denis0x0D:sandbox/composite_construct 9ef7180f77f9374bcd05afc4f9e6c1d2d72d02b7 PiperOrigin-RevId: 284613617	2019-12-09 12:43:53 -08:00
Lei Zhang	2c7e8ed7c6	[spirv] Add spv.IAdd, spv.ISub, and spv.IMul folders The patterns to be folded away can be commonly generated during lowering to SPIR-V. PiperOrigin-RevId: 284604855	2019-12-09 11:59:10 -08:00
Jacques Pienaar	89cef725f4	ODS: Generate named accessors for raw attributes Currently named accessors are generated for attributes returning a consumer friendly type. But sometimes the attributes are used while transforming an existing op and then the returned type has to be converted back into an attribute or the raw `getAttr` needs to be used. Generate raw named accessor for attributes to reference the raw attributes without having to use the string interface for better compile time verification. This allows calling `blahAttr()` instead of `getAttr("blah")`. Raw here refers to returning the underlying storage attribute. PiperOrigin-RevId: 284583426	2019-12-09 10:29:34 -08:00
Mahesh Ravishankar	4a62019eb8	Add lowering for module with gpu.kernel_module attribute. The existing GPU to SPIR-V lowering created a spv.module for every function with gpu.kernel attribute. A better approach is to lower the module that the function lives in (which has the attribute gpu.kernel_module) to a spv.module operation. This better captures the host-device separation modeled by GPU dialect and simplifies the lowering as well. PiperOrigin-RevId: 284574688	2019-12-09 09:52:21 -08:00
Andy Davis	312ccb1c0f	Unify vector op unrolling transformation. Unifies vector op unrolling transformation, by using the same unrolling implementation for contraction and elementwise operations. Removes fakefork/join operations which are non longer needed now that we have the InsertStridedSlice operation. PiperOrigin-RevId: 284570784	2019-12-09 09:35:15 -08:00
Kazuaki Ishizaki	ae05cf27c6	Minor spelling tweaks Closes tensorflow/mlir#304 PiperOrigin-RevId: 284568358	2019-12-09 09:23:48 -08:00
Nicolas Vasilache	91c0074624	[StructuredOps][Linalg] Add a primitive pattern to rewrite the linalg.generic form of matmul to vector form. This CL uses the newly expanded matcher support to easily detect when a linalg.generic has a multiply-accumulate body. A linalg.generic with such a body is rewritten as a vector contraction. This CL additionally limits the rewrite to the case of matrix multiplication on contiguous and statically shaped memrefs for now. Before expanding further, we should harden the infrastructure for expressing custom ops with the structured ops abstraction. PiperOrigin-RevId: 284566659	2019-12-09 09:14:39 -08:00
Jacques Pienaar	70aeb4566e	Add RegionRange for when need to abstract over different region iteration Follows ValueRange in representing a generic abstraction over the different ways to represent a range of Regions. This wrapper is not as ValueRange and only considers the current cases of interest: MutableArrayRef<Region> and ArrayRef<std::unique_ptr<Region>> as occurs during op construction vs op region querying. Note: ArrayRef<std::unique_ptr<Region>> allows for unset regions, so this range returns a pointer to a Region instead of a Region. PiperOrigin-RevId: 284563229	2019-12-09 08:57:56 -08:00
Nicolas Vasilache	7b19bd5411	Post-submit cleanups in RecursiveMatchers This CL addresses leftover cleanups and adds a test mixing RecursiveMatchers and m_Constant that captures properly. PiperOrigin-RevId: 284551567	2019-12-09 07:47:35 -08:00
Nicolas Vasilache	ade58a268c	Add a layer of recursive matchers that compose. This CL adds support for building matchers recursively. The following matchers are provided: 1. `m_any()` can match any value 2. `m_val(Value *)` binds to a value and must match it 3. `RecursivePatternMatcher<OpType, Matchers...>` n-arity pattern that matches `OpType` and whose operands must be matched exactly by `Matchers...`. This allows building expression templates for patterns, declaratively, in a very natural fashion. For example pattern `p9` defined as follows: ``` auto mul_of_muladd = m_Op<MulFOp>(m_Op<MulFOp>(), m_Op<AddFOp>()); auto mul_of_anyadd = m_Op<MulFOp>(m_any(), m_Op<AddFOp>()); auto p9 = m_Op<MulFOp>(m_Op<MulFOp>( mul_of_muladd, m_Op<MulFOp>()), m_Op<MulFOp>(mul_of_anyadd, mul_of_anyadd)); ``` Successfully matches `%6` in: ``` %0 = addf %a, %b: f32 %1 = addf %a, %c: f32 // matched %2 = addf %c, %b: f32 %3 = mulf %a, %2: f32 // matched %4 = mulf %3, %1: f32 // matched %5 = mulf %4, %4: f32 // matched %6 = mulf %5, %5: f32 // matched ``` Note that 0-ary matchers can be used as leaves in place of n-ary matchers. This alleviates from passing explicit `m_any()` leaves. In the future, we may add extra patterns to specify that operands may be matched in any order. PiperOrigin-RevId: 284469446	2019-12-08 18:09:40 -08:00
River Riddle	d6ee6a0310	Update the builder API to take ValueRange instead of ArrayRef<Value *> This allows for users to provide operand_range and result_range in builder.create<> calls, instead of requiring an explicit copy into a separate data structure like SmallVector/std::vector. PiperOrigin-RevId: 284360710	2019-12-07 10:35:41 -08:00
River Riddle	8904e91035	Add a flag to the IRPrinter instrumentation to only print after a pass if there is a change to the IR. This adds an additional filtering mode for printing after a pass that checks to see if the pass actually changed the IR before printing it. This "change" detection is implemented using a SHA1 hash of the current operation and its children. PiperOrigin-RevId: 284291089	2019-12-06 17:05:05 -08:00
Jacques Pienaar	4add9edd72	Change inferReturnTypes to return LogicalResult and values Previously the error case was using a sentinel in the error case which was bad. Also make the one `build` invoke the other `build` to reuse verification there. And follow up on suggestion to use formatv which I missed during previous review. PiperOrigin-RevId: 284265762	2019-12-06 14:42:45 -08:00
Aart Bik	d37f27251f	[VecOps] Rename vector.[insert\|extract]element to just vector.[insert\|extract] Since these operations lower to [insert\|extract][element\|value] at LLVM dialect level, neither element nor value would correctly reflect the meaning. PiperOrigin-RevId: 284240727	2019-12-06 12:39:25 -08:00
Aart Bik	b36aaeafb1	[VectorOps] Add lowering of vector.broadcast to LLVM IR For example, a scalar broadcast %0 = vector.broadcast %x : f32 to vector<2xf32> return %0 : vector<2xf32> which expands scalar x into vector [x,x] by lowering to the following LLVM IR dialect to implement the duplication over the leading dimension. %0 = llvm.mlir.undef : !llvm<"<2 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.insertelement %x, %0[%1 : !llvm.i64] : !llvm<"<2 x float>"> %3 = llvm.shufflevector %2, %0 [0 : i32, 0 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> return %3 : vector<2xf32> In the trailing dimensions, the operand is simply "passed through", unless a more elaborate "stretch" is required. For example %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> return %0 : vector<4xf32> becomes %0 = llvm.mlir.undef : !llvm<"<4 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.extractelement %arg0[%1 : !llvm.i64] : !llvm<"<1 x float>"> %3 = llvm.mlir.constant(0 : index) : !llvm.i64 %4 = llvm.insertelement %2, %0[%3 : !llvm.i64] : !llvm<"<4 x float>"> %5 = llvm.shufflevector %4, %0 [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>"> llvm.return %5 : !llvm<"<4 x float>"> PiperOrigin-RevId: 284219926	2019-12-06 11:02:29 -08:00
Jacques Pienaar	398f04aa49	Generate builder for ops that use InferTypeOpInterface trait in ODS For ops with infer type op interface defined, generate version that calls the inferal method on build. This is intermediate step to removing special casing of SameOperandsAndResultType & FirstAttrDereivedResultType. After that would be generating the inference code, with the initial focus on shaped container types. In between I plan to refactor these a bit to reuse generated paths. The intention would not be to add the type inference trait in multiple places, but rather to take advantage of the current modelling in ODS where possible to emit it instead. Switch the `inferReturnTypes` method to be static. Skipping ops with regions here as I don't like the Region vs unique_ptr<Region> difference at the moment, and I want the infer return type trait to be useful for verification too. So instead, just skip it for now to avoid churn. PiperOrigin-RevId: 284217913	2019-12-06 10:53:06 -08:00
Alex Zinenko	e216a72ab8	Add conversions of GPU func with memory attributions to LLVM/NVVM GPU functions use memory attributions, a combination of Op attributes and region arguments, to specify function-wide buffers placed in workgroup or private memory spaces. Introduce a lowering pattern for GPU functions to be converted to LLVM functions taking into account memory attributions. Workgroup attributions get transformed into module-level globals with unique names derived from function names. Private attributions get converted into llvm.allocas inside the function body. In both cases, we inject at the beginning of the function the IR that obtains the raw pointer to the data and populates a MemRef descriptor based on the MemRef type of buffer, making attributions compose with the rest of the MemRef lowering and transparent for use with std.load and std.store. While using raw pointers instead of descriptors might have been more efficient, it is better implemented as a canonicalization or a separate transformation so that non-attribution memrefs could also benefit from it. PiperOrigin-RevId: 284208396	2019-12-06 10:08:43 -08:00
River Riddle	79047e1ab5	Use regex to fix failure when stats are disabled. It would be nice if we could detect if stats were enabled or not and use 'Requires', but this isn't possible to do at configure time. Fixes tensorflow/mlir#296 PiperOrigin-RevId: 284200271	2019-12-06 09:29:14 -08:00
Andy Davis	41f8e105fa	Unroll vector masks along with their associated vector arguments. Updates vector ContractionOp to use proper vector masks (produced by CreateMaskOp/ConstantMaskOp). Leverages the following canonicalizations in unrolling unit test: CreateMaskOp -> ConstantMaskOp, StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp Removes IndexTupleOp (no longer needed now that we have vector mask ops). Updates all unit tests. PiperOrigin-RevId: 284182168	2019-12-06 07:37:28 -08:00
Uday Bondhugula	3ade6a7d15	DimOp folding for alloc/view dynamic dimensions Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#253 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/253 from bondhugula:dimop a4b464f24ae63fd259114558d87e11b8ee4dae86 PiperOrigin-RevId: 284169689	2019-12-06 06:00:54 -08:00
Kazuaki Ishizaki	84a6182ddd	minor spelling tweaks Closes tensorflow/mlir#290 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/290 from kiszk:spelling_tweaks_201912 9d9afd16a723dd65754a04698b3976f150a6054a PiperOrigin-RevId: 284169681	2019-12-06 05:59:30 -08:00
Alex Zinenko	58adf99ed1	LLVM::AddressOfOp: properly take into account the address space The AddressOf operation in the LLVM dialect return a pointer to a global variable. The latter may be in a non-default address space as indicated by the "addr_space" attribute. Check that the address space of the pointer returned by AddressOfOp matches that of the referenced GlobalOp. Update the AddressOfOp builder to respect this constraint. PiperOrigin-RevId: 284138860	2019-12-06 01:09:13 -08:00
River Riddle	71999ff7f2	Add include path to the TestDialect to fix broken build. PiperOrigin-RevId: 284067891	2019-12-05 15:33:33 -08:00
Jose Ignacio Gomez	f60bbb6c3b	[Linalg] Add permutation information to tiling This patch closes issue tensorflow/mlir#271. It adds an optional permutation map to declarative tiling transformations. The map is expressed as a list of integers. Closes tensorflow/mlir#288 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/288 from tetuante:issue271 2df2938d6a1f01b3bc404ded08dea2dd1e10b588 PiperOrigin-RevId: 284064151	2019-12-05 15:14:59 -08:00
nmostafa	daff60cd68	Add UnrankedMemRef Type Closes tensorflow/mlir#261 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/261 from nmostafa:nmostafa/unranked 96b6e918f6ed64496f7573b2db33c0b02658ca45 PiperOrigin-RevId: 284037040	2019-12-05 13:13:20 -08:00
Denis Khalikov	e67acfa468	[spirv] Add CompositeInsertOp operation A CompositeInsertOp operation make a copy of a composite object, while modifying one part of it. Closes tensorflow/mlir#292 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/292 from denis0x0D:sandbox/composite_insert 2200962b9057bda53cd2f2866b461e2797196380 PiperOrigin-RevId: 284036551	2019-12-05 13:10:44 -08:00
River Riddle	33a64540ad	Add support for instance specific pass statistics. Statistics are a way to keep track of what the compiler is doing and how effective various optimizations are. It is useful to see what optimizations are contributing to making a particular program run faster. Pass-instance specific statistics take this even further as you can see the effect of placing a particular pass at specific places within the pass pipeline, e.g. they could help answer questions like "what happens if I run CSE again here". Statistics can be added to a pass by simply adding members of type 'Pass::Statistics'. This class takes as a constructor arguments: the parent pass pointer, a name, and a description. Statistics can be dumped by the pass manager in a similar manner to how pass timing information is dumped, i.e. via PassManager::enableStatistics programmatically; or -pass-statistics and -pass-statistics-display via the command line pass manager options. Below is an example: struct MyPass : public OperationPass<MyPass> { Statistic testStat{this, "testStat", "A test statistic"}; void runOnOperation() { ... ++testStat; ... } }; $ mlir-opt -pass-pipeline='func(my-pass,my-pass)' foo.mlir -pass-statistics Pipeline Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== 'func' Pipeline MyPass (S) 15 testStat - A test statistic MyPass (S) 6 testStat - A test statistic List Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== MyPass (S) 21 testStat - A test statistic PiperOrigin-RevId: 284022014	2019-12-05 11:53:28 -08:00
Mahesh Ravishankar	4d61a79db4	Allow specification of the workgroup size for GPUToSPIRV lowering. SPIR-V/Vulkan spec requires the workgroups size to be specified with the spv.ExecutionMode operation. This was hard-wired to be set to a particular value. It is now changed to be configurable by clients of the pass or of the patterns that implement the lowering from GPU to SPIRV. PiperOrigin-RevId: 284017482	2019-12-05 11:31:57 -08:00
Lei Zhang	037044b0ae	Add spv.AtomicCompareExchangeWeak PiperOrigin-RevId: 283997917	2019-12-05 10:06:24 -08:00
Lei Zhang	c0a9de29ad	[spirv] Fix nested loop (de)serialization For serialization, when we have nested ops, the inner loop will create multiple SPIR-V blocks. If the outer loop has block arguments (which corresponds to OpPhi instructions), we defer the handling of OpPhi's parent block handling until we serialized all blocks and then fix it up with the result <id>. These two cases happening together was generating invalid SPIR-V blob because we previously assume the parent block to be the block containing the terminator. That is not true anymore when the block contains structured control flow ops. If that happens, it should be fixed to use the structured control flow op's merge block. For deserialization, we record a map from header blocks to their corresponding merge and continue blocks during the initial deserialization and then use the info to construct spv.selection/spv.loop. The existing implementation will also fall apart when we have nested loops. If so, we clone all blocks for the outer loop, including the ones for the inner loop, to the spv.loop's region. So the map for header blocks' merge info need to be updated; otherwise we are operating on already deleted blocks. PiperOrigin-RevId: 283949230	2019-12-05 04:39:37 -08:00
Tres Popp	b8cd0c1486	Move ModuleManager functionality into mlir::SymbolTable. Note for broken code, the following transformations occurred: ModuleManager::insert(Block::iterator, Operation) - > SymbolTable::insert(Operation, Block::iterator) ModuleManager::lookupSymbol -> SymbolTable::lookup ModuleManager::getModule() -> SymbolTable::getOp() ModuleManager::getContext() -> SymbolTable::getOp()->getContext() ModuleManager::* -> SymbolTable::* PiperOrigin-RevId: 283944635	2019-12-05 03:56:46 -08:00
Nicolas Vasilache	b3f7cf80a7	Add a CL option to Standard to LLVM lowering to use alloca instead of malloc/free. In the future, a more configurable malloc and free interface should be used and exposed via extra parameters to the `createLowerToLLVMPass`. Until requirements are gathered, a simple CL flag allows generating code that runs successfully on hardware that cannot use the stdlib. PiperOrigin-RevId: 283833424	2019-12-04 14:16:00 -08:00
Andy Davis	d20d763241	Add canonicalization patterns for vector CreateMaskOp and StridedSliceOp to be used in the unroll vector op transformation. Adds a ConstantMaskOp to the vector ops dialect. Adds the following canonicalization patterns: CreateMaskOp -> ConstantMaskOp StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp PiperOrigin-RevId: 283816752	2019-12-04 13:00:43 -08:00
Nicolas Vasilache	edfaf925cf	Drop MaterializeVectorTransfers in favor of simpler declarative unrolling Now that we have unrolling as a declarative pattern, we can drop a full pass that has gone stale. In the future we may want to add specific unrolling patterns for VectorTransferReadOp. PiperOrigin-RevId: 283806880	2019-12-04 12:11:42 -08:00
Sean Silva	26484bc0b6	Print out large elementsattr's such that they are parseable. I found that when running crash reproducers, the elided elementsattr's would prevent parsing the IR repro. I found myself manually going and replacing the "..." with some valid IR. With this change, we now print elided attrs as `opaque<"", "0xDEADBEEF">` to clearly delineate them as being elided while still being parseable. PiperOrigin-RevId: 283781806	2019-12-04 10:19:54 -08:00
Scott Todd	bf45ff6aab	[spirv] Adding sqrt op in the GLSL extension. PiperOrigin-RevId: 283769736	2019-12-04 09:16:23 -08:00
Alex Zinenko	75175134d4	Loop coalescing: fix pointer chainsing in use-chain traversal In the replaceAllUsesExcept utility function called from loop coalescing the iteration over the use-chain is incorrect. The use list nodes (IROperands) have next/prev links, and bluntly resetting the use would make the loop to continue on uses of the value that was replaced instead of the original one. As a result, it could miss the existing uses and update the wrong ones. Make sure we increment the iterator before updating the use in the loop body. Reported-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#291. PiperOrigin-RevId: 283754195	2019-12-04 07:42:29 -08:00
Julian Gross	f7c6bc70a9	Added new FAbs, FCeil, Cos, Neg, Sign, Tanh operations. Closes tensorflow/mlir#251 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/251 from dfki-jugr:new_ops 0398997bf9953016898f873068e22916a062eb2b PiperOrigin-RevId: 283750699	2019-12-04 07:17:30 -08:00
Andy Davis	34e1f4aa51	Adds support for unrolling single-result vector operations with iterator type lists and indexing maps to a target vector size. Adds unit tests for unrolling the vector ContractionOp with different iteration orders. PiperOrigin-RevId: 283747503	2019-12-04 06:53:37 -08:00
Nicolas Vasilache	5c0c51a997	Refactor dependencies to expose Vector transformations as patterns - NFC This CL refactors some of the MLIR vector dependencies to allow decoupling VectorOps, vector analysis, vector transformations and vector conversions from each other. This makes the system more modular and allows extracting VectorToVector into VectorTransforms that do not depend on vector conversions. This refactoring exhibited a bunch of cyclic library dependencies that have been cleaned up. PiperOrigin-RevId: 283660308	2019-12-03 17:52:10 -08:00
Lei Zhang	50b2b26e70	[spirv] Add spv.GroupNonUniformBallot This CL also did the following cleanup: - Moved the test for spv.SubgroupBallotKHR to its own file - Wrapped generated canonicalization patterns in anonymous namespace - Updated header comments in SPVOps.td PiperOrigin-RevId: 283650091	2019-12-03 16:44:09 -08:00
Mahesh Ravishankar	c5ba37b6ae	Add a pass to legalize operations before lowering to SPIR-V. Not all StandardOps can be lowered to SPIR-V. For example, subview op implementation requires use of pointer bitcasts which is not valid according to SPIR-V spec (or at least is ambiguous about it). Such ops need to be removed/transformed before lowering to SPIR-V. The SPIRVLegalizationPass is added a place where such legalizations can be added. Current implementation folds the subview ops with load/stores so that the lowering itself does not have to convert a subview op. PiperOrigin-RevId: 283642981	2019-12-03 16:06:17 -08:00
Sean Silva	82f9f9d112	Make diagnostic a bit clearer. This prints out in case of any pass failure. Not just a crash. PiperOrigin-RevId: 283616719	2019-12-03 14:01:25 -08:00
Andy Davis	2c13fd9f17	Add CreateMaskOp to the VectorOps dialect. PiperOrigin-RevId: 283591888	2019-12-03 11:55:54 -08:00
Sean Silva	67515e8d7a	Verifier: Better error message in case of successor operand mismatch. In particular, print the successor number in the diagnostic. PiperOrigin-RevId: 283585084	2019-12-03 11:24:31 -08:00
Mahesh Ravishankar	353fb2bd38	Convert MemRefType to a linearized array in SPIR-V lowering. The SPIR-V lowering used nested !spv.arrays to represented multi-dimensional arrays, with the hope that in-conjunction with the layout annotations, the shape and layout of memref can be represented directly. It is unclear though how portable this representation will end up being. It will rely on driver compilers implementing complex index computations faithfully. A more portable approach is to use linearized arrays to represent memrefs and explicitly instantiate all the index computation in SPIR-V. This gives added benefit that we can further optimize the generated code in MLIR before generating the SPIR-V binary. PiperOrigin-RevId: 283571167	2019-12-03 10:21:16 -08:00
Alex Zinenko	993e79e9bd	Fix ViewOp to have at most one offset operand As described in the documentation, ViewOp is expected to take an optional dynamic offset followed by a list of dynamic sizes. However, the ViewOp parser did not include a check for the offset being a single value and accepeted a list of values instead. Furthermore, several tests have been exercising the wrong syntax of a ViewOp, passing multiple values to the dyanmic stride list, which was not caught by the parser. The trailing values could have been erronously interpreted as dynamic sizes. This is likely due to resyntaxing of the ViewOp, with the previous syntax taking the list of sizes before the offset. Update the tests to use the syntax with the offset preceding the sizes. Worse, the conversion of ViewOp to the LLVM dialect assumed the wrong order of operands with offset in the trailing position, and erronously relied on the permissive parsing that interpreted trailing dynamic offset values as leading dynamic sizes. Fix the lowering to use the correct order of operands. PiperOrigin-RevId: 283532506	2019-12-03 06:23:04 -08:00
Diego Caballero	330d1ff00e	AffineLoopFusion: Prevent fusion of multi-out-edge producer loops tensorflow/mlir#162 introduced a bug that incorrectly allowed fusion of producer loops with multiple outgoing edges. This commit fixes that problem. It also introduces a new flag to disable sibling loop fusion so that we can test producer-consumer fusion in isolation. Closes tensorflow/mlir#259 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/259 from dcaballe:dcaballe/fix_multi_out_edge_producer_fusion 578d5661705fd5c56c555832d5e0528df88c5282 PiperOrigin-RevId: 283531105	2019-12-03 06:09:50 -08:00
Stephan Herhut	2125c0e3a8	Extend conversion of SubViewOp to llvm to also support cases where size and stride are constant (i.e., there are no size and stride operands). We recently added canonicalization that rewrites constant size and stride operands to SubViewOp into static information in the type, so these patterns now occur during code generation. PiperOrigin-RevId: 283524688	2019-12-03 05:11:49 -08:00
Lei Zhang	1af9633d85	[spirv] Add spv.SubgroupBallotKHROp PiperOrigin-RevId: 283522284	2019-12-03 04:49:56 -08:00
Alex Zinenko	fdbb99cd62	Add linkage support to LLVMFuncOp A recent commit introduced the Linkage attribute to the LLVM dialect and used it in the Global Op. Also use it in LLVMFuncOp. As per LLVM Language Reference, if the linkage attribute is omitted, the function is assumed to have external linkage. PiperOrigin-RevId: 283493299	2019-12-03 00:26:44 -08:00
Aart Bik	3126004a5a	[VectorOps] Add legality rules to broadcast PiperOrigin-RevId: 283360101	2019-12-02 09:57:27 -08:00
Lei Zhang	b41162b3af	[ODS] Generate builders taking unwrapped value and defaults for attributes Existing builders generated by ODS require attributes to be passed in as mlir::Attribute or its subclasses. This is okay foraggregate- parameter builders, which is primarily to be used by programmatic C++ code generation; it is inconvenient for separate-parameter builders meant to be called in manually written C++ code because it requires developers to wrap raw values into mlir::Attribute by themselves. This CL extends to generate additional builder methods that take raw values for attributes and handles the wrapping in the builder implementation. Additionally, if an attribute appears late in the arguments list and has a default value, the default value is supplied in the declaration if possible. PiperOrigin-RevId: 283355919	2019-12-02 09:33:57 -08:00
Lei Zhang	4982eaf87c	[DRR] Introduce `$_` to ignore op argument match Right now op argument matching in DRR is position-based, meaning we need to specify N arguments for an op with N ODS-declared argument. This can be annoying when we don't want to capture all the arguments. `$_` is to remedy the situation. PiperOrigin-RevId: 283339992	2019-12-02 07:54:50 -08:00
Alexander Belyaev	9630fcbc52	Lower linalg.indexed_generic with libcall to LLVM. PiperOrigin-RevId: 283328994	2019-12-02 06:30:52 -08:00
Alex Zinenko	d5e627f84b	Introduce Linkage attribute to the LLVM dialect LLVM IR supports linkage on global objects such as global variables and functions. Introduce the Linkage attribute into the LLVM dialect, backed by an integer storage. Use this attribute on LLVM::GlobalOp and make it mandatory. Implement parsing/printing of the attribute and conversion to LLVM IR. See tensorflow/mlir#277. PiperOrigin-RevId: 283309328	2019-12-02 03:28:10 -08:00
Denis Khalikov	cd556f25de	[spirv] Check that operand of `spirv::CompositeExtractOp` is constant while folding. Closes tensorflow/mlir#281 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/281 from denis0x0D:sandbox/composite_ex_fold d02d73658bd1b9eaa515eb4e0aee34bc41d4252b PiperOrigin-RevId: 282971563	2019-11-28 13:27:56 -08:00
Jose Ignacio Gomez	0494ef60f7	[Linalg] Change attribute n_loop_types to iterator This addresses issue tensorflow/mlir#270. Linalg is updated to take the same form of iterator_types than vector contraction. Closes tensorflow/mlir#280 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/280 from tetuante:PRissue270 d26d88d090d3765d3b9884bfabdd023143f27287 PiperOrigin-RevId: 282905396	2019-11-28 01:59:55 -08:00
Lei Zhang	d4e4387fbf	[spirv] Add folders for spv.IAdd and spv.IMul Adding zero and multiplying one can be common when generating code for index calculation. This CL also sorted canonicalize.mlir to alphabetical order. PiperOrigin-RevId: 282828055	2019-11-27 13:46:52 -08:00
Aart Bik	9f89c34f4b	Fixed typo in Toy tutorial (second var e -> var f) PiperOrigin-RevId: 282810649	2019-11-27 11:58:45 -08:00
Nicolas Vasilache	1fa8c8070b	Implement Linalg to loops lowering as a pattern This CL rewrites the linalg ops to loops transformations as patterns that can be targeted directly from Tablegen. Reliance on OpFolder is removed and to cope with it we introduce local folding patterns that are applied greedily. PiperOrigin-RevId: 282765550	2019-11-27 07:32:13 -08:00
Aart Bik	e2232fbcee	[VectorOps] Refine BroadcastOp in VectorOps dialect Since second argument is always fully overwritten and shape is define in "to" clause, it is not needed. Also renamed "into" to "to" now that arg is dropped. PiperOrigin-RevId: 282686475	2019-11-26 19:52:38 -08:00
Aart Bik	cf97263cb8	[VectorOps] Add a BroadcastOp to the VectorOps dialect PiperOrigin-RevId: 282643305	2019-11-26 14:43:31 -08:00
Mahesh Ravishankar	03620fa70a	Misc changes to lowering to SPIR-V. These changes to SPIR-V lowering while adding support for lowering SUbViewOp, but are not directly related. - Change the lowering of MemRefType to !spv.ptr<!spv.struct<!spv.array<...>[offset]>, ..> This is consistent with the Vulkan spec. - To enable testing a simple pattern of lowering functions is added to ConvertStandardToSPIRVPass. This is just used to convert the type of the arguments of the function. The added function lowering itself is not meant to be the way functions are eventually lowered into SPIR-V dialect. PiperOrigin-RevId: 282589644	2019-11-26 10:11:34 -08:00
Nicolas Vasilache	9059cf392d	Automated rollback of commit `d60133f89b` PiperOrigin-RevId: 282574110	2019-11-26 08:47:48 -08:00
Nicolas Vasilache	109338085d	Relax restriction on affine_apply dim and symbol operands The affine_apply operation is currently "doubly" affine and conflates two things: 1. it applies an affine map to a list of values of type `index` that are defined as either dim or symbol 2. it restricts (and propagates constraints on) the provenance of dims and symbols to a small subset of ops for which more restrictive polyhedral constraints apply. Point 2. is related to the ability to form so-called static control parts and is related to dependence analysis and legality of transformations. Point 1. however is completely independent, the only local implication of dims and symbol for affine_apply is that dims compose while symbols concatenate as well as the structural constraint that dims may not be multiplied. The properties of composition and canonicalization in affine_apply are more generally useful. This CL relaxes the verifier on affine_apply so it can be used more generally. The relevant affine.for/if/load/store op verifiers already implement the dim and symbol checking. See this thread for the related discussion: https://groups.google.com/a/tensorflow.org/g/mlir/c/HkwCbV8D9N0/m/8srUNrX6CAAJ PiperOrigin-RevId: 282562517	2019-11-26 07:39:05 -08:00
Lei Zhang	13c6e419ca	Add support for AttrSizedOperandSegments/AttrSizedResultSegments Certain operations can have multiple variadic operands and their size relationship is not always known statically. For such cases, we need a per-op-instance specification to divide the operands into logical groups or segments. This can be modeled by attributes. This CL introduces C++ trait AttrSizedOperandSegments for operands and AttrSizedResultSegments for results. The C++ trait just guarantees such size attribute has the correct type (1D vector) and values (non-negative), etc. It serves as the basis for ODS sugaring that with ODS argument declarations we can further verify the number of elements match the number of ODS-declared operands and we can generate handy getter methods. PiperOrigin-RevId: 282467075	2019-11-25 17:26:50 -08:00
Nicolas Vasilache	174076a157	Use vector.InsertStridedSlice in Vector -> Vector unrolling This CL uses the recently added op to finish the implementation of Vector -> Vector unrolling by replacing the "fake join op" by a series of InsertStridedSliceOp. Test is updated accordingly PiperOrigin-RevId: 282451126	2019-11-25 15:56:37 -08:00
Nicolas Vasilache	36469f7d2a	Add a vector.InsertStridedSliceOp This new op is the counterpart of vector.StridedSliceOp and will be used for in the pattern rewrites for vector unrolling. PiperOrigin-RevId: 282447414	2019-11-25 15:37:13 -08:00
MLIR Team	1012c492f0	Allow LLVM::ExtractElementOp to have non-i32 indices. Also change the text format a bit, so that indices are braced by squares. PiperOrigin-RevId: 282437095	2019-11-25 14:44:52 -08:00
Ben Vanik	38d7870ee5	Make std.divis and std.diviu support ElementsAttr folding. PiperOrigin-RevId: 282434465	2019-11-25 14:31:43 -08:00
Andy Davis	8fc44a4d13	Update VectorContractionOp to take iterator types and index mapping attributes compatible with linalg ops. PiperOrigin-RevId: 282412311	2019-11-25 12:40:00 -08:00
Christian Sigg	d60133f89b	Changing directory shortcut for CPU/GPU runner utils. Moving cuda-runtime-wrappers.so into subdirectory to match libmlir_runner_utils.so. Provide parent directory when running test and load .so from subdirectory. PiperOrigin-RevId: 282410749	2019-11-25 12:30:54 -08:00
Lei Zhang	9b6e6cef68	De-duplicate EnumAttr overrides by defining defaults EnumAttr should provide meaningful defaults so concrete instances do not need to duplicate the fields. PiperOrigin-RevId: 282398431	2019-11-25 11:29:55 -08:00
Mahesh Ravishankar	bd485afda0	Introduce attributes that specify the final ABI for a spirv::ModuleOp. To simplify the lowering into SPIR-V, while still respecting the ABI requirements of SPIR-V/Vulkan, split the process into two 1) While lowering a function to SPIR-V (when the function is an entry point function), allow specifying attributes on arguments and function itself that describe the ABI of the function. 2) Add a pass that materializes the ABI described in the function. Two attributes are needed. 1) Attribute on arguments of the entry point function that describe the descriptor_set, binding, storage class, etc, of the spv.globalVariable this argument will be replaced by 2) Attribute on function that specifies workgroup size, etc. (for now only workgroup size). Add the pass -spirv-lower-abi-attrs to materialize the ABI described by the attributes. This change makes the SPIRVBasicTypeConverter class unnecessary and is removed, further simplifying the SPIR-V lowering path. PiperOrigin-RevId: 282387587	2019-11-25 11:19:56 -08:00
Mahesh Ravishankar	1ea231bd39	Allow memref_cast from static strides to dynamic strides. Memref_cast supports cast from static shape to dynamic shape memrefs. The same should be true for strides as well, i.e a memref with static strides can be casted to a memref with dynamic strides. PiperOrigin-RevId: 282381862	2019-11-25 11:08:56 -08:00
Nicolas Vasilache	01145544aa	Add vector.insertelement op This is the counterpart of vector.extractelement op and has the same limitations at the moment (static I64IntegerArrayAttr to express position). This restriction will be filterd in the future. LLVM lowering will be added in a subsequent commit. PiperOrigin-RevId: 282365760	2019-11-25 08:47:15 -08:00
Alex Zinenko	bf4692dc49	Introduce gpu.func Introduce a new function-like operation to the GPU dialect to provide a placeholder for the execution semantic description and to add support for GPU memory hierarchy. This aligns with the overall goal of the dialect to expose the common abstraction layer for GPU devices, in particular by providing an MLIR unit of semantics (i.e. an operation) for memory modeling. This proposal has been discussed in the mailing list: https://groups.google.com/a/tensorflow.org/d/msg/mlir/RfXNP7Hklsc/MBNN7KhjAgAJ As decided, the "convergence" aspect of the execution model will be factored out into a new discussion and therefore is not included in this commit. This commit only introduces the operation but does not hook it up with the remaining flow. The intention is to develop the new flow while keeping the old flow operational and do the switch in a simple, separately reversible commit. PiperOrigin-RevId: 282357599	2019-11-25 08:10:37 -08:00
Ben Vanik	d2284f1f0b	Support folding of StandardOps with DenseElementsAttr. PiperOrigin-RevId: 282270243	2019-11-24 19:23:38 -08:00
Lei Zhang	aaafeac89b	[spirv] NFC: rename test files and sort tests inside PiperOrigin-RevId: 282132339	2019-11-23 06:58:38 -08:00
Uday Bondhugula	6a101671b0	Make isValidSymbol more powerful The check in isValidSymbol, as far as a DimOp result went, checked if the dim op was on a top-level memref. However, any alloc'ed, view, or subview memref would be fine as long as the corresponding dimension of that memref is either a static one or was in turn created using a valid symbol in the case of dynamic dimensions. Reported-by: Jose Gomez Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#252 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/252 from bondhugula:symbol 7b57dc394df9375e651f497231c6e4525a32a662 PiperOrigin-RevId: 282097114	2019-11-22 22:09:31 -08:00
River Riddle	b8ee563449	NFC: Remove unnecessarily guarded tablegen includes. Support for including a file multiple times was added in tablegen, removing the need for these extra guards. This is because we already insert c/c++ style header guards within each of the specific .td files. PiperOrigin-RevId: 282076728	2019-11-22 18:01:57 -08:00
Denis Khalikov	a5cda4763f	[spirv] Add a canonicalizer for `spirv::LogicalNotOp`. Add a canonicalizer for `spirv::LogicalNotOp`. Converts: * spv.LogicalNot(spv.IEqual(...)) -> spv.INotEqual(...) * spv.LogicalNot(spv.INotEqual(...)) -> spv.IEqual(...) * spv.LogicalNot(spv.LogicalEqual(...)) -> spv.LogicalNotEqual(...) * spv.LogicalNot(spv.LogicalNotEqual(...)) -> spv.LogicalEqual(...) Also moved the test for spv.IMul to arithemtic tests. Closes tensorflow/mlir#256 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/256 from denis0x0D:sandbox/canon_logical_not 76ab5787b2c777f948c8978db061d99e76453d44 PiperOrigin-RevId: 282012356	2019-11-22 12:25:52 -08:00
Mahesh Ravishankar	6db8530c26	Add more canonicalizations for SubViewOp. Depending on which of the offsets, sizes, or strides are constant, the subview op can be canonicalized in different ways. Add such canonicalizations, which generalize the existing approach of canonicalizing subview op only if all of offsets, sizes and shapes are constants. PiperOrigin-RevId: 282010703	2019-11-22 12:14:18 -08:00
River Riddle	c35378003c	Add support for using the ODS result names as the Asm result names for multi-result operations. This changes changes the OpDefinitionsGen to automatically add the OpAsmOpInterface for operations with multiple result groups using the provided ODS names. We currently just limit the generation to multi-result ops as most single result operations don't have an interesting name(result/output/etc.). An example is shown below: // The following operation: def MyOp : ... { let results = (outs AnyType:$first, Variadic<AnyType>:$middle, AnyType); } // May now be printed as: %first, %middle:2, %0 = "my.op" ... PiperOrigin-RevId: 281834156	2019-11-21 14:55:46 -08:00
Christian Sigg	d7c17195a4	Change CUDA tests to use print_memref. Swap dimensions in all-reduce-op test. PiperOrigin-RevId: 281791744	2019-11-21 11:26:36 -08:00
Nicolas Vasilache	2c4985816f	Split Linalg declarative patterns from specific test patterns - NFC This will make it easier to scale out test patterns and build specific passes that do not interfere with independent testing. PiperOrigin-RevId: 281736335	2019-11-21 06:40:17 -08:00
Alex Zinenko	b5af3784a6	Don't force newline before function attributes Due to legacy reasons, a newline character followed by two spaces was always inserted before the attributes of the function Op in pretty form. This breaks formatting when functions are nested in some other operations. Don't print the newline and just put the attributes on the same line, which is also more consistent with module Op. Line breaking aware of indentation can be introduced separately into the parser if deemed useful. PiperOrigin-RevId: 281721793	2019-11-21 05:08:19 -08:00
MLIR Team	75379a684f	Correctly parse empty affine maps. Previously the test case crashes / produces an error. PiperOrigin-RevId: 281630540	2019-11-20 18:30:15 -08:00
River Riddle	fafb708b9a	Merge DCE and unreachable block elimination into a new utility 'simplifyRegions'. This moves the different canonicalizations of regions into one place and invokes them in the fixed-point iteration of the canonicalizer. PiperOrigin-RevId: 281617072	2019-11-20 15:53:19 -08:00
Andy Davis	d6a70b31be	Add VectorContractionOp to the VectorOps dialect. PiperOrigin-RevId: 281605471	2019-11-20 14:53:57 -08:00
Mahesh Ravishankar	1145cebdab	Verify subview op result has dynamic shape, when sizes are specified. If the sizes are specified as arguments to the subview op, then the shape must be dynamic as well. PiperOrigin-RevId: 281591608	2019-11-20 14:16:05 -08:00
Sean Silva	e4f83c6c26	Add multi-level DCE pass. This is a simple multi-level DCE pass that operates pretty generically on the IR. Its key feature compared to the existing peephole dead op folding that happens during canonicalization is being able to delete recursively dead cycles of the use-def graph, including block arguments. PiperOrigin-RevId: 281568202	2019-11-20 12:55:10 -08:00
Mahesh Ravishankar	19212105dd	Changes to SubViewOp to make it more amenable to canonicalization. The current SubViewOp specification allows for either all offsets, shape and stride to be dynamic or all of them to be static. There are opportunities for more fine-grained canonicalization based on which of these are static. For example, if the sizes are static, the result memref is of static shape. The specification of SubViewOp is modified to allow on or more of offsets, shapes and strides to be statically specified. The verification is updated to ensure that the result type of the subview op is consistent with which of these are static and which are dynamic. PiperOrigin-RevId: 281560457	2019-11-20 12:32:51 -08:00
Nicolas Vasilache	fa14d4f6ab	Implement unrolling of vector ops to finer-grained vector ops as a pattern. This CL uses the pattern rewrite infrastructure to implement a simple VectorOps -> VectorOps legalization strategy to unroll coarse-grained vector operations into finer grained ones. The transformation is written using local pattern rewrites to allow composition with other rewrites. It proceeds by iteratively introducing fake cast ops and cleaning canonicalizing or lowering them away where appropriate. This is an example of writing transformations as compositions of local pattern rewrites that should enable us to make them significantly more declarative. PiperOrigin-RevId: 281555100	2019-11-20 11:49:36 -08:00
River Riddle	eb418559ef	Add a new OpAsmOpInterface to allow for ops to directly hook into the AsmPrinter. This interface provides more fine-grained hooks into the AsmPrinter than the dialect interface, allowing for operations to define the asm name to use for results directly on the operations themselves. The hook is also expanded to enable defining named result "groups". Get a special name to use when printing the results of this operation. The given callback is invoked with a specific result value that starts a result "pack", and the name to give this result pack. To signal that a result pack should use the default naming scheme, a None can be passed in instead of the name. For example, if you have an operation that has four results and you want to split these into three distinct groups you could do the following: setNameFn(getResult(0), "first_result"); setNameFn(getResult(1), "middle_results"); setNameFn(getResult(3), ""); // use the default numbering. This would print the operation as follows: %first_result, %middle_results:2, %0 = "my.op" ... PiperOrigin-RevId: 281546873	2019-11-20 10:45:45 -08:00
Nicolas Vasilache	3c055957de	Add StridedMemRef<>::operator[] - NFC This operator is used for internal debugging purposes. PiperOrigin-RevId: 281544152	2019-11-20 10:17:13 -08:00
Alexander Belyaev	e50261657f	Fix 'the the' typo. PiperOrigin-RevId: 281501234	2019-11-20 05:38:14 -08:00
Stephan Herhut	abb626686d	Extend kernel outlining to also consider dim worth inlining. PiperOrigin-RevId: 281483447	2019-11-20 02:59:35 -08:00
Christian Sigg	f868adafee	Make type and rank explicit in mcuMemHostRegister function. Fix registered size of indirect MemRefType kernel arguments. PiperOrigin-RevId: 281362940	2019-11-19 13:13:02 -08:00
Nicolas Vasilache	ee95f6f259	Add VectorOps.StridedSliceOp The `vector.strided_slice` takes an n-D vector, k-D `offsets` integer array attribute, a k-D `sizes` integer array attribute, a k-D `strides` integer array attribute and extracts the n-D subvector at the proper offset. Returns an n-D vector where the first k-D dimensions match the `sizes` attribute. The returned subvector contains the elements starting at offset `offsets` and ending at `offsets + sizes`. Example: ``` %1 = vector.strided_slice %0 {offsets : [0, 2], sizes : [2, 4], strides : [1, 1]}: vector<4x8x16xf32> // returns a vector<2x4x16xf32> ``` This op will be useful for progressive lowering within the VectorOp dialect. PiperOrigin-RevId: 281352749	2019-11-19 12:22:34 -08:00
Nicolas Vasilache	3732ba4def	Fix pretty printer corner case in mlir_runner_utils.cpp. In the particular case where the size of a memref dimension is 1, double printing would happen because printLast was called unconditionally. This CL fixes the print and updates an incorrect test that should have caught this in the first place. PiperOrigin-RevId: 281345142	2019-11-19 11:52:27 -08:00
Diego Caballero	dd5a7cb488	Add getRemappedValue to ConversionPatternRewriter This method is needed for N->1 conversion patterns to retrieve remapped Values used in the original N operations. Closes tensorflow/mlir#237 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/237 from dcaballe:dcaballe/getRemappedValue 1f64fadcf2b203f7b336ff0c5838b116ae3625db PiperOrigin-RevId: 281321881	2019-11-19 11:09:39 -08:00
Alex Zinenko	8961d8e32f	Change conversion CLI flag from -lower-to-llvm to -convert-std-to-llvm The command-line flag name `lower-to-llvm` for the pass performing dialect conversion from the Standard dialect to the LLVM dialect is misleading and inconsistent with most of the conversion passses. It leads the user to believe that there are no restrictions on what can be converted, while in fact only a subset of the Standard dialect can be converted (with operations from other dialects converted by separate passes). Use `convert-std-to-llvm` that better reflects what the pass does and is consistent with most other conversions. PiperOrigin-RevId: 281238797	2019-11-19 00:34:51 -08:00
Hanhan Wang	c614c92fdc	Support SPIR-V constant op to take DenseElementsAttr as input. Iterates each element to build the array. This includes a little refactor to combine bool/int/float into a function, since they are similar. The only difference is calling different function in the end. PiperOrigin-RevId: 281210288	2019-11-18 20:02:05 -08:00
Alexander Belyaev	8c6a5233d5	Lower linalg.indexed_generic to loops. PiperOrigin-RevId: 281169885	2019-11-18 16:55:15 -08:00
Andy Davis	a6a287335d	Fix SubViewOp stride calculation in constant folding. Adds unit tests for subview offset and stride argument constant folding. PiperOrigin-RevId: 281161041	2019-11-18 15:01:08 -08:00
River Riddle	9873a29817	Add a parseAttribute<AttrType> overload for the non-type case. The variant that accepts a type will check that the parsed attribute is a valid instance of AttrType. The non-type variant would silently fail in this case, leading to garbage attribute values. PiperOrigin-RevId: 281136528	2019-11-18 13:11:36 -08:00
Denis Khalikov	6c77e59bfd	[spirv] Add a canonicalizer for BitcastOp. Convert chained `spirv::BitcastOp` operations into one `spirv::BitcastOp` operation. Closes tensorflow/mlir#238 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/238 from denis0x0D:sandbox/canon_bitcast 4352ed4f81b959ec92f849c599e733b62a99c010 PiperOrigin-RevId: 281129234	2019-11-18 12:37:00 -08:00
Andy Davis	68a8da4a93	Fix Affine Loop Fusion test case reported on github. This CL utilizies the more robust fusion feasibility analysis being built out in LoopFusionUtils, which will eventually be used to replace the current affine loop fusion pass. PiperOrigin-RevId: 281112340	2019-11-18 11:20:37 -08:00
Stephan Herhut	f0f3b71d67	Implement folding of pattern dim(subview(_)[...][s1, ..., sn][...], i) -> si. PiperOrigin-RevId: 281042016	2019-11-18 04:31:33 -08:00
Alex Zinenko	b8dc3fd812	Rename CLI flags -lower-gpu-ops-to--ops to -convert-gpu-to- This makes the flags consistent with the naming scheme used elsewhere in the codebase for dialect conversions. PiperOrigin-RevId: 281027517	2019-11-18 02:43:10 -08:00
Denis Khalikov	68e48ba111	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpBitFieldInsert * OpBitFieldSExtract * OpBitFieldUExtract Closes tensorflow/mlir#233 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/233 from denis0x0D:sandbox/bit_field_ops e7fd85b00d72d483d7992dc42b9cc4d673903455 PiperOrigin-RevId: 280691816	2019-11-15 11:03:19 -08:00
Lei Zhang	a0986bf43d	NFC: Convert CmpIPredicate in StandardOps to use EnumAttr This turns several hand-written functions to auto-generated ones. PiperOrigin-RevId: 280684326	2019-11-15 10:17:31 -08:00
Lei Zhang	88843ae37c	Use aggregate-parameter builder for ops having autogen type-deduction builder Thus far DRR always invokes the separate-parameter builder (i.e., requiring a separate parameter for each result-type/operand/attribute) for creating ops, no matter whether we can auto-generate a builder with type-deduction ability or not. This CL changes the path for ops that we can auto-generate type-deduction builders, i.e., with SameOperandsAndResultType/FirstAttrDerivedResultType traits. Now they are going through a aggregate-parameter builder (i.e., requiring one parameter for all result-types/operands/attributes). attributes.) It is expected this approach will be more friendly for future shape inference function autogen and calling those autogen'd shape inference function without excessive packing and repacking operand/attribute lists. Also, it would enable better support for creating ops with optional attributes because we are not required to provide an Attribute() as placeholder for an optional attribute anymore. PiperOrigin-RevId: 280654800	2019-11-15 07:33:54 -08:00
Stephan Herhut	57bafc674e	Mark std.view as no-sideeffect. The same reasoning as for std.subview applies. PiperOrigin-RevId: 280639308	2019-11-15 05:28:31 -08:00
Stephan Herhut	9c7bceb4fe	Mark std.subview as no-sideeffect. In essence, std.subview is just an abstract indexing transformation (somewhat akin to a gep in llvm) and by itself has no effect. From a practical perspective this helps, as it allows to remove dead subview operations. PiperOrigin-RevId: 280630046	2019-11-15 04:00:31 -08:00
Nicolas Vasilache	0b271b7dfe	Refactor the LowerVectorTransfers pass to use the RewritePattern infra - NFC This is step 1/n in refactoring infrastructure along the Vector dialect to make it ready for retargetability and composable progressive lowering. PiperOrigin-RevId: 280529784	2019-11-14 15:40:07 -08:00
Andy Davis	a4669cd3b4	Adds canonicalizer to SubViewOp which folds constants from base memref and operands into the subview result memref type. Changes SubViewOp to support zero operands case, when offset, strides and sizes are all constant. PiperOrigin-RevId: 280485075	2019-11-14 12:23:04 -08:00
Lei Zhang	796ca609eb	[ODS] Fix operation argument population to avoid crash The `Operator` class keeps an `arguments` field, which contains pointers to `operands` and `attributes` elements. Thus it must be populated after `operands` and `attributes` are finalized so to have stable pointers. SmallVector may re-allocate when still having new elements added, which will invalidate pointers. PiperOrigin-RevId: 280466896	2019-11-14 11:03:29 -08:00
Alex Zinenko	bf5916e7a4	Use MemRefDescriptor in Vector-to-LLVM convresion Following up on the consolidation of MemRef descriptor conversion, update Vector-to-LLVM conversion to use the helper class that abstracts away the implementation details of the MemRef descriptor. This also makes the types of the attributes in emitted llvm.insert/extractelement operations consistently i64 instead of a mix of index and i64. PiperOrigin-RevId: 280441451	2019-11-14 09:05:42 -08:00
Nicolas Vasilache	f2b6ae9991	Move VectorOps to Tablegen - (almost) NFC This CL moves VectorOps to Tablegen and cleans up the implementation. This is almost NFC but 2 changes occur: 1. an interface change occurs in the padding value specification in vector_transfer_read: the value becomes non-optional. As a shortcut we currently use %f0 for all paddings. This should become an OpInterface for vectorization in the future. 2. the return type of vector.type_cast is trivial and simplified to `memref<vector<...>>` Relevant roundtrip and invalid tests that used to sit in core are moved to the vector dialect. The op documentation is moved to the .td file. PiperOrigin-RevId: 280430869	2019-11-14 08:15:23 -08:00
Jacques Pienaar	d1c99e10d0	Do not emit aliases when printing local form Expand local scope printing to skip printing aliases as aliases are printed out at the top of a module and may not be part of the output generated by local scope print. PiperOrigin-RevId: 280278617	2019-11-13 14:21:49 -08:00
Nicolas Vasilache	0bd6390b54	Deprecate linalg.subview in favor of std.subview This CL uses the now standard std.subview in linalg. Two shortcuts are currently taken to allow this port: 1. the type resulting from a view is currently degraded to fully dynamic to pass the SubViewOp verifier. 2. indexing into SubViewOp may access out of bounds since lowering to LLVM does not currently enforce it by construction. These will be fixed in subsequent commits after discussions. PiperOrigin-RevId: 280250129	2019-11-13 12:10:09 -08:00
Sean Silva	486f2122cd	Add FuncOp::eraseArgument This is a quite complex operation that users are likely to attempt to write themselves and get wrong (citation: users=me). Ideally, we could pull this into FunctionLike, but for now, the FunctionType rewriting makes it FuncOp specific. We would need some hook for rewriting the function type (which for LLVM's func op, would need to rewrite the underlying LLVM type). PiperOrigin-RevId: 280234164	2019-11-13 10:59:55 -08:00
River Riddle	d985c74883	NFC: Refactor block signature conversion to not erase the original arguments. This refactors the implementation of block signature(type) conversion to not insert fake cast operations to perform the type conversion, but to instead create a new block containing the proper signature. This has the benefit of enabling the use of pre-computed analyses that rely on mapping values. It also leads to a much cleaner implementation overall. The major user facing change is that applySignatureConversion will now replace the entry block of the region, meaning that blocks generally shouldn't be cached over calls to applySignatureConversion. PiperOrigin-RevId: 280226936	2019-11-13 10:27:53 -08:00
River Riddle	6df8369941	Rename the current parseSymbolName to parseOptionalSymbolName The current implementation silently fails if the '@' identifier isn't present, making it similar to the 'optional' parse methods. This change renames the current implementation to 'Optional' and adds a new 'parseSymbolName' that emits an error. PiperOrigin-RevId: 280214610	2019-11-13 09:32:20 -08:00
Hanhan Wang	85d7fb3324	Make VariableOp instructions be in the first block in the function. Since VariableOp is serialized during processBlock, we add two more fields, `functionHeader` and `functionBody`, to collect instructions for a function. After all the blocks have been processed, we append them to the `functions`. Also, fix a bug in processGlobalVariableOp. The global variables should be encoded into `typesGlobalValues`. PiperOrigin-RevId: 280105366	2019-11-12 18:59:15 -08:00
Mahesh Ravishankar	2be53603e9	Add operations needed to support lowering of AffineExpr to SPIR-V. Lowering of CmpIOp, DivISOp, RemISOp, SubIOp and SelectOp to SPIR-V dialect enables the lowering of operations generated by AffineExpr -> StandardOps conversion into the SPIR-V dialect. PiperOrigin-RevId: 280039204	2019-11-12 13:20:06 -08:00
Lei Zhang	b259c26eb0	Add support for OpPhi in loop header block During deserialization, the loop header block will be moved into the spv.loop's region. If the loop header block has block arguments, we need to make sure it is correctly carried over to the block where the new spv.loop resides. During serialization, we need to make sure block arguments from the spv.loop's entry block are not silently dropped. PiperOrigin-RevId: 280021777	2019-11-12 12:00:28 -08:00
River Riddle	626e1fd95e	Add an option to print an operation if a diagnostic is emitted on it It is often helpful to inspect the operation that the error/warning/remark/etc. originated from, especially in the context of debugging or in the case of a verifier failure. This change adds an option 'mlir-print-op-on-diagnostic' that attaches the operation as a note to any diagnostic that is emitted on it via Operation::emit(Error\|Warning\|Remark). In the case of an error, the operation is printed in the generic form. PiperOrigin-RevId: 280021438	2019-11-12 11:59:19 -08:00
Mahesh Ravishankar	104af84f4c	Add Conversion to lower loop::ForOp to spirv::LoopOp. loop::ForOp can be lowered to the structured control flow represented by spirv::LoopOp by making the continue block of the spirv::LoopOp the loop latch and the merge block the exit block. The resulting spirv::LoopOp has a single back edge from the continue to header block, and a single exit from header to merge. PiperOrigin-RevId: 280015614	2019-11-12 11:33:27 -08:00
Nicolas Vasilache	51de3f688e	Add LLVM lowering of std.subview A followup CL will replace usage of linalg.subview by std.subview. PiperOrigin-RevId: 279961981	2019-11-12 07:23:18 -08:00
Andy Davis	82d2c43eca	Adds affine.min operation which returns the minimum value from a multi-result affine map. This operation is useful for things like computing the dynamic value of affine loop bounds, and is trivial to constant fold. PiperOrigin-RevId: 279959714	2019-11-12 07:08:49 -08:00
Nicolas Vasilache	f51a155337	Add support for alignment attribute in std.alloc. This CL adds an extra pointer to the memref descriptor to allow specifying alignment. In a previous implementation, we used 2 types: `linalg.buffer` and `view` where the buffer type was the unit of allocation/deallocation/alignment and `view` was the unit of indexing. After multiple discussions it was decided to use a single type, which conflates both, so the memref descriptor now needs to carry both pointers. This is consistent with the [RFC-Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ). PiperOrigin-RevId: 279959463	2019-11-12 07:06:54 -08:00
River Riddle	9b9c647cef	Add support for nested symbol references. This change allows for adding additional nested references to a SymbolRefAttr to allow for further resolving a symbol if that symbol also defines a SymbolTable. If a referenced symbol also defines a symbol table, a nested reference can be used to refer to a symbol within that table. Nested references are printed after the main reference in the following form: symbol-ref-attribute ::= symbol-ref-id (`::` symbol-ref-id)* Example: module @reference { func @nested_reference() } my_reference_op @reference::@nested_reference Given that SymbolRefAttr is now more general, the existing functionality centered around a single reference is moved to a derived class FlatSymbolRefAttr. Followup commits will add support to lookups, rauw, etc. for scoped references. PiperOrigin-RevId: 279860501	2019-11-11 18:18:31 -08:00
Andy Davis	5cf6e0ce7f	Adds std.subview operation which takes dynamic offsets, sizes and strides and returns a memref type which represents sub/reduced-size view of its memref argument. This operation is a companion operation to the std.view operation added as proposed in "Updates to the MLIR MemRefType" RFC. PiperOrigin-RevId: 279766410	2019-11-11 10:33:27 -08:00
Stephan Herhut	e04d4bf865	Also consider index constants when folding integer arithmetics with constants. PiperOrigin-RevId: 279698088	2019-11-11 02:34:21 -08:00
MLIR Team	9fbf52e330	Look for SymbolRefAttr in KernelOutlining instead of hard-coding CallOp This code should be exercised using the existing kernel outlining unit test, but let me know if I should add a dedicated unit test using a fake call instruction as well. PiperOrigin-RevId: 279436321	2019-11-08 19:13:13 -08:00
Denis Khalikov	4697d657b7	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpShiftLeftLogical * OpShiftRightArithmetic * OpShiftRightLogical * OpBitCount * OpBitReverse * OpNot Also moved the definition of spv.BitwiseAnd to follow the lexicographical order. Closes tensorflow/mlir#215 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/215 from denis0x0D:sandbox/bit_ops d9b0852b689ac6c4879a9740b1740a2357f44d24 PiperOrigin-RevId: 279350470	2019-11-08 11:17:05 -08:00
Alex Zinenko	09e8e7107a	mlir-translate: support -verify-diagnostics MLIR translation tools can emit diagnostics and we want to be able to check if it is indeed the case in tests. Reuse the source manager error handlers provided for mlir-opt to support the verification in mlir-translate. This requires us to change the signature of the functions that are registered to translate sources to MLIR: it now takes a source manager instead of a memory buffer. PiperOrigin-RevId: 279132972	2019-11-07 11:42:46 -08:00
Uday Bondhugula	eb47d5ee66	Fix asm printer for affine expr - fixes tensorflow/mlir#201 Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#204 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/204 from bondhugula:printfix 3f8a5b65391f45598258b2735fecaa409fbde848 PiperOrigin-RevId: 279115720	2019-11-07 10:27:27 -08:00
Andy Davis	8f00b4494d	Swap operand order in std.view operation so that offset appears before dynamic sizes in the operand list. PiperOrigin-RevId: 279114236	2019-11-07 10:20:23 -08:00
River Riddle	6b4e30b7c8	Add Ch-7 of the toy tutorial detailing how to define new types. This chapter adds a new composite type to Toy, and shows the process of adding a new type to the IR, adding and updating operations to use it, and constant folding operations producing it. PiperOrigin-RevId: 279107885	2019-11-07 09:54:04 -08:00
Andy Davis	5fbdb67b0a	Add canonicalizer for ViewOp which folds constants into the ViewOp memref shape and layout map strides and offset. PiperOrigin-RevId: 279088023	2019-11-07 08:05:03 -08:00
Jacques Pienaar	7af61f6bcd	Add compatible query method to infer type interface A return type that differs from the inferred return type need not indicate that an operation is invalid (e.g., tensor<*xf32> vs tensor<10xf32>) but they should be compatible for the operation to be considered valid. Add method to query if inferred type is compatible with return type. Also add InferTypeOpIntefaceDefault trait that considers equality and compatibility as the same. Currently an op has to opt in to using it explicitly. PiperOrigin-RevId: 279085639	2019-11-07 07:51:45 -08:00
Nicolas Vasilache	72040bf7c8	Update Linalg to use std.view Now that a view op has graduated to the std dialect, we can update Linalg to use it and remove ops that have become obsolete. As a byproduct, the linalg buffer and associated ops can also disappear. PiperOrigin-RevId: 279073591	2019-11-07 06:33:10 -08:00
Alexander Belyaev	eee9cbdeb7	Add IndexedGenericOp to Linalg. PiperOrigin-RevId: 279013404	2019-11-06 22:36:25 -08:00
Nicolas Vasilache	ffebc8ce1d	Drop spurious test file PiperOrigin-RevId: 278959717	2019-11-06 16:00:57 -08:00
Nicolas Vasilache	7f6c6084b5	Add lowering of std.view to LLVM This CL ports the lowering of linalg.view to the newly introduced std.view. Differences in implementation relate to std.view having slightly different semantics: 1. a static or dynamic offset can be specified. 2. the size of the (contiguous) shape is passed instead of a range. 3. static size and stride information is extracted from the memref type rather than the range. Besides these differences, lowering behaves the same. A future CL will update Linalg to use this unified infrastructure. PiperOrigin-RevId: 278948853	2019-11-06 15:06:16 -08:00
Andy Davis	b5654d1311	Add ViewOp verification for dynamic strides, and address some comments from previous change. PiperOrigin-RevId: 278903187	2019-11-06 11:25:54 -08:00
Andy Davis	c38dca7f4b	Add ViewOp to the StandardOps dialect, which casts a 1D/i8 element type memref type to an N-D memref type. Proposed in RFC: https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ Supports creating the N-D memref type with dynamic sizes and at a dynamic offset within the 1D base memref. This change contains op definition/parsing/printing and tests. Follow up changes will handle constant shape/layout map folding and llvm lowering. PiperOrigin-RevId: 278869990	2019-11-06 08:54:12 -08:00
Eric Schweitz	0d545921ea	Add support for the LLVM FNeg instruction Closes tensorflow/mlir#216 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/216 from schweitzpgi:llvmir-fneg-op f9b5f185845d671b745ab6fc213d5d9aff044b34 PiperOrigin-RevId: 278795325	2019-11-06 00:02:10 -08:00
James Molloy	250a11ae0f	[llvm] Allow GlobalOp to take a region for complex initializers This allows GlobalOp to either take a value attribute (for simple constants) or a region that can contain IR instructions (that must be constant-foldable) to create a ConstantExpr initializer. Example: // A complex initializer is constructed with an initializer region. llvm.mlir.global constant @int_gep() : !llvm<"i32"> { %0 = llvm.mlir.addressof @g2 : !llvm<"i32"> %1 = llvm.mlir.constant(2 : i32) : !llvm.i32 %2 = llvm.getelementptr %0[%1] : (!llvm<"i32">, !llvm.i32) -> !llvm<"i32"> llvm.return %2 : !llvm<"i32*"> } PiperOrigin-RevId: 278717836	2019-11-05 15:11:01 -08:00
James Molloy	6b534ecbcb	[llvm] Add initial import of LLVM modules to mlir-translate This adds an importer from LLVM IR or bitcode to the LLVM dialect. The importer is registered with mlir-translate. Known issues exposed by this patch but not yet fixed: * Globals' initializers are attributes, which makes it impossible to represent a ConstantExpr. This will be fixed in a followup. * icmp returns i32 rather than i1. * select and a couple of other instructions aren't implemented. * llvm.cond_br takes its successors in a weird order. The testing here is known to be non-exhaustive. I'd appreciate feedback on where this functionality should live. It looks like the translator from MLIR to LLVM lives in Target/, but the SPIR-V deserializer lives in Dialect/ which is why I've put this here too. PiperOrigin-RevId: 278711683	2019-11-05 14:41:38 -08:00
River Riddle	2366561a39	Add a PatternRewriter hook to merge blocks, and use it to support for folding branches. A pattern rewriter hook, mergeBlock, is added that allows for merging the operations of one block into the end of another. This is used to support a canonicalization pattern for branch operations that folds the branch when the successor has a single predecessor(the branch block). Example: ^bb0: %c0_i32 = constant 0 : i32 br ^bb1(%c0_i32 : i32) ^bb1(%x : i32): return %x : i32 becomes: ^bb0: %c0_i32 = constant 0 : i32 return %c0_i32 : i32 PiperOrigin-RevId: 278677825	2019-11-05 11:57:38 -08:00
MLIR Team	1f43d0d000	[NVVM] Add mma.sync operation. PiperOrigin-RevId: 278440547	2019-11-04 12:36:37 -08:00
River Riddle	e4a912eb5a	Update the SPV dialect type parser to use the methods on DialectAsmParser directly. This simplifies the implementation quite a bit, and removes the need for explicit string munging. One change is made to some of the enum elements of SPV_DimAttr to ensure that they are proper identifiers; The string form is now prefixed with 'Dim'. PiperOrigin-RevId: 278027132	2019-11-01 16:55:25 -07:00
River Riddle	68cfc89a0d	Refactor LinalgDialect::parseType to use the DialectAsmParser methods directly. This simplifies the implementation, and removes the need to do explicit string manipulation. A utility method 'parseDimensionList' is added to the DialectAsmParser to simplify defining types and attributes that contain shapes. PiperOrigin-RevId: 278020604	2019-11-01 16:14:10 -07:00
River Riddle	e94a8bfca8	Refactor QuantOps TypeParser to use the DialectAsmParser methods directly. This greatly simplifies the implementation and removes custom parser functionality. The necessary methods are added to the DialectAsmParser. PiperOrigin-RevId: 278015983	2019-11-01 15:47:03 -07:00
Lei Zhang	f143fbfa77	Add ReferToOp attribute constraint for SymbolRefAttr This constraint can be used to limit a SymbolRefAttr to point to a specific kind of op in the closest parent with a symbol table. PiperOrigin-RevId: 278001364	2019-11-01 14:26:36 -07:00
Nicolas Vasilache	e20a2aa9f2	Delete spurious file PiperOrigin-RevId: 277967079	2019-11-01 11:28:15 -07:00
Mahesh Ravishankar	9cbbd8f4df	Support lowering of imperfectly nested loops into GPU dialect. The current lowering of loops to GPU only supports lowering of loop nests where the loops mapped to workgroups and workitems are perfectly nested. Here a new lowering is added to handle lowering of imperfectly nested loop body with the following properties 1) The loops partitioned to workgroups are perfectly nested. 2) The loop body of the inner most loop partitioned to workgroups can contain one or more loop nests that are to be partitioned across workitems. Each individual loops nests partitioned to workitems should also be perfectly nested. 3) The number of workgroups and workitems are not deduced from the loop bounds but are passed in by the caller of the lowering as values. 4) For statements within the perfectly nested loop nest partitioned across workgroups that are not loops, it is valid to have all threads execute that statement. This is NOT verified. PiperOrigin-RevId: 277958868	2019-11-01 10:52:06 -07:00
Nicolas Vasilache	bd94a10c02	Add Linalg pattern for producer-consumer fusion This CL adds a simple pattern for specifying producer-consumer fusion on Linalg operations. Implementing such an extension reveals some interesting properties. Since Linalg operates on a buffer abstraction, the output buffers are specified as in/out parameters to the ops. As a consequence, there are no SSA use-def chains and one cannot specify complex dag input patterns with the current infrastructure. Instead this CL uses constraints based on the existing linalg dependence analysis to focus the pattern and refine patterns based on the type of op that last wrote in a buffer. This is a very local property and is less powerful than the generic dag specification based on SSA use-def chains. This will be generalized in the future. PiperOrigin-RevId: 277931503	2019-11-01 08:30:38 -07:00
James Molloy	96531e2f87	[mlir][llvm] Add missing cast ops Also adds a builder method for fcmp, identical to that for icmp. PiperOrigin-RevId: 277923158	2019-11-01 07:32:09 -07:00
Lei Zhang	7432234f3c	NFC: Use #ifndef in various .td files instead of #ifdef and #else Upstream LLVM gained support for #ifndef with https://reviews.llvm.org/D61888 This is changed mechanically via the following command: find . -name ".td" -exec sed -i -e ':a' -e 'N' -e '$!ba' -e 's/#ifdef $[A-Z_]$\n#else/#ifndef \1/g' {} \; PiperOrigin-RevId: 277789427	2019-10-31 13:29:50 -07:00
Mehdi Amini	ce9477934a	Add a test for lowering GPU ops that cover cases where the symbol table isn't held by a ModuleOp (NFC) PiperOrigin-RevId: 277752004	2019-10-31 10:35:15 -07:00
Mehdi Amini	07b4ce7409	Add a test.symbol_scope operation that has the SymbolTable Traits to the Test dialect PiperOrigin-RevId: 277741687	2019-10-31 09:49:42 -07:00
Denis Khalikov	d423d4a338	[spirv] Add cast operations This CL added op definitions for a few cast operations: * OpConvertFToU * OpConvertFToS * OpConvertSToF * OpConvertUToF * OpUConvert * OpSConvert * OpFConvert Also moved the definition of spv.Bitcast to the new file. Closes tensorflow/mlir#208 and tensorflow/mlir#174 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/208 from denis0x0D:sandbox/cast_ops 79bc9b37398aafddee6cf6beb301807988fe67f9 PiperOrigin-RevId: 277587891	2019-10-30 14:53:04 -07:00
Lei Zhang	d024b68e6b	Use `not` to invert return code in expected to fail tests Windows does not like the RUN command of `(... \|\| true) \| ...`. PiperOrigin-RevId: 277587031	2019-10-30 14:38:18 -07:00
River Riddle	a32f0dcb5d	Add support to GreedyPatternRewriter for erasing unreachable blocks. Rewrite patterns may make modifications to the CFG, including dropping edges between blocks. This change adds a simple unreachable block elimination run at the end of each iteration to ensure that the CFG remains valid. PiperOrigin-RevId: 277545805	2019-10-30 11:19:24 -07:00
Lei Zhang	cb40e36d3b	Fix segfault when no symbol is given to an constraint operand This fixed the segfault when we see the following pattern: Pat<(...), (...), [(... 1, 2, 3), ...]> PiperOrigin-RevId: 277544300	2019-10-30 11:12:57 -07:00
Nicolas Vasilache	05a5a41416	Add basic support for declarative Linalg transformations Linalg ops provide a good anchor for pattern matching/rewriting transformations. This CL adds a simple example of how multi-level tiling may be specified by attaching a simple StringAttr to ops as they are transformed so we can easily specify partial lowering to control transformation application. This is a first stab at taking advantage of higher-level information contained in Linalg ops and will evolve in the future. PiperOrigin-RevId: 277497958	2019-10-30 07:12:33 -07:00
Lei Zhang	80213ba5f0	[spirv] Fix gen_spirv_dialect.py and add spv.Unreachable This CL fixed gen_spirv_dialect.py to support nested delimiters when chunking existing ODS entries in .td files and to allow ops without correspondence in the spec. This is needed to pull in the definition of OpUnreachable. PiperOrigin-RevId: 277486465	2019-10-30 05:41:18 -07:00
Lei Zhang	ca2538e9a7	[spirv] Support OpPhi using block arguments This CL adds another control flow instruction in SPIR-V: OpPhi. It is modelled as block arguments to be idiomatic with MLIR. See the rationale.md doc for "Block Arguments vs PHI nodes". Serialization and deserialization is updated to convert between block arguments and SPIR-V OpPhi instructions. PiperOrigin-RevId: 277161545	2019-10-28 15:58:42 -07:00
Sean Silva	66ec24d833	Parse locations in parseGenericOperation For ops that recursively re-enter the parser to parse an operation (such as ops with a "wraps" pretty form), this ensures that the wrapped op will parse its location, which can then be used for the locations of the wrapping op and any other implicit ops. PiperOrigin-RevId: 277152636	2019-10-28 15:11:26 -07:00
River Riddle	2f4d0c085a	Add support for marking an operation as recursively legal. In some cases, it may be desirable to mark entire regions of operations as legal. This provides an additional granularity of context to the concept of "legal". The `ConversionTarget` supports marking operations, that were previously added as `Legal` or `Dynamic`, as `recursively` legal. Recursive legality means that if an operation instance is legal, either statically or dynamically, all of the operations nested within are also considered legal. An operation can be marked via `markOpRecursivelyLegal<>`: ```c++ ConversionTarget &target = ...; /// The operation must first be marked as `Legal` or `Dynamic`. target.addLegalOp<MyOp>(...); target.addDynamicallyLegalOp<MySecondOp>(...); /// Mark the operation as always recursively legal. target.markOpRecursivelyLegal<MyOp>(); /// Mark optionally with a callback to allow selective marking. target.markOpRecursivelyLegal<MyOp, MySecondOp>([](Operation *op) { ... }); /// Mark optionally with a callback to allow selective marking. target.markOpRecursivelyLegal<MyOp>([](MyOp op) { ... }); ``` PiperOrigin-RevId: 277086382	2019-10-28 10:04:34 -07:00
Alexander Belyaev	780a108d31	Fix include guards and add tests for OpToFuncCallLowering. PiperOrigin-RevId: 276859463	2019-10-26 08:21:36 -07:00
Smit Hinsu	cde337cfde	Define AnyRankedTensor Type in TableGen PiperOrigin-RevId: 276714649	2019-10-25 10:31:56 -07:00
River Riddle	b69e8ee049	Add support for parsing multiple result name groups. This allows for parsing things like: %name_1, %name_2:5, %name_3:2 = "my.op" ... This is useful for operations that have groups of variadic result values. The total number of results is expected to match the number of results defined by the operation. PiperOrigin-RevId: 276703280	2019-10-25 09:34:02 -07:00
Denis Khalikov	dd2e444325	[spirv] AccessChainOp canonicalization. Combine chained `spirv::AccessChainOp` operations into one `spirv::AccessChainOp` operation. Closes tensorflow/mlir#198 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/198 from denis0x0D:sandbox/canon_access_chain 0cb87955a85511071143d62637ff939d0dabc2bd PiperOrigin-RevId: 276609345	2019-10-24 18:41:34 -07:00
River Riddle	2b61b7979e	Convert the Canonicalize and CSE passes to generic Operation Passes. This allows for them to be used on other non-function, or even other function-like, operations. The algorithms are already generic, so this is simply changing the derived pass type. The majority of this change is just ensuring that the nesting of these passes remains the same, as the pass manager won't auto-nest them anymore. PiperOrigin-RevId: 276573038	2019-10-24 15:01:09 -07:00
River Riddle	ef43b56538	Add support for replacing all uses of a symbol. This requires reconstructing the attribute dictionary of each operation containing a use. PiperOrigin-RevId: 276520544	2019-10-24 10:47:27 -07:00
River Riddle	21ee4e987f	Add @below and @above directives to verify-diagnostics. This simplifies defining expected-* directives when there are multiple that apply to the next or previous line. @below applies the directive to the next non-designator line, i.e. the next line that does not contain an expected-* designator. @above applies to the previous non designator line. Examples: // Expect an error on the next line that does not contain a designator. // expected-remark@below {{remark on function below}} // expected-remark@below {{another remark on function below}} func @bar(%a : f32) // Expect an error on the previous line that does not contain a designator. func @baz(%a : f32) // expected-remark@above {{remark on function above}} // expected-remark@above {{another remark on function above}} PiperOrigin-RevId: 276369085	2019-10-23 15:56:29 -07:00
Uday Bondhugula	ad6925f479	Update loop.for verifier message fix: nonnegative -> positive Closes tensorflow/mlir#206 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/206 from bondhugula:bondhugula-patch-1 9a47ca7dfd230180a9df33e9a64b33d02252d30a PiperOrigin-RevId: 276060885	2019-10-22 07:34:56 -07:00
Lei Zhang	020f9eb68c	[DRR] Allow interleaved operands and attributes Previously DRR assumes attributes to appear after operands. This was the previous requirements on ODS, but that has changed some time ago. Fix DRR to also support interleaved operands and attributes. PiperOrigin-RevId: 275983485	2019-10-21 20:48:17 -07:00
Lei Zhang	d9fe892e42	[spirv] Allow block arguments on spv.Branch(Conditional) We will use block arguments as the way to model SPIR-V OpPhi in the SPIR-V dialect. This CL also adds a few useful helper methods to both ops to get the block arguments. Also added tests for branch weight (de)serialization. PiperOrigin-RevId: 275960797	2019-10-21 17:32:00 -07:00
Alex Zinenko	5f867d26b4	Use LLVM_Type instead of AnyType in the definition of LLVM_CallOp The type constraint had to be relaxed due to the order of lowering passes in the examples, that since has been fixed. The relaxed version was still used by the CUDA lowering for launch sizes of `index` type. This is not necessary since the GPU dialect does not restrict the type of the launch size operands. Use an LLVM type instead and restore the check in the LLVM_CallOp definition. PiperOrigin-RevId: 275920109	2019-10-21 14:12:19 -07:00
River Riddle	4514cdd5eb	Cleanup and rewrite Ch-4.md. This change rewrites Ch-4.md to introduced interfaces in a detailed step-by-step manner, adds examples, and fixes some errors. PiperOrigin-RevId: 275887017	2019-10-21 11:32:39 -07:00
River Riddle	941a1c4332	NFC: Fix remaining usages of MulOp as matrix multiplication. MulOp now represents an element-wise multiplication instead of a matrix multiplication. PiperOrigin-RevId: 275886774	2019-10-21 11:31:32 -07:00
River Riddle	03d7be2aca	NFC: Elide the value of a UnitAttr within nested attribute dictionaries. This matches the behavior of the top level attribute dictionary. PiperOrigin-RevId: 275879828	2019-10-21 11:02:07 -07:00
River Riddle	9ac459e871	Add a Symbol trait to simplify defining operations that represent symbols. This trait provides accessors for the name, symbol use list methods, verification, with more to be added. PiperOrigin-RevId: 275864554	2019-10-21 09:58:59 -07:00
River Riddle	1bdfc9e74d	NFC: Fix typo : Retur -> Return PiperOrigin-RevId: 275745931	2019-10-20 15:13:20 -07:00
Kazuaki Ishizaki	f28c5aca17	Fix minor spelling tweaks (NFC) Closes tensorflow/mlir#175 PiperOrigin-RevId: 275726876	2019-10-20 09:44:36 -07:00
Kazuaki Ishizaki	8bfedb3ca5	Fix minor spelling tweaks (NFC) Closes tensorflow/mlir#177 PiperOrigin-RevId: 275692653	2019-10-20 00:11:34 -07:00
Christian Sigg	c3e56cd12c	Get active source lane predicate from shuffle instruction. nvvm.shfl.sync.bfly optionally returns a predicate whether source lane was active. Support for this was added to clang in https://reviews.llvm.org/D68892. Add an optional 'pred' unit attribute to the instruction to return this predicate. Specify this attribute in the partial warp reduction so we don't need to manually compute the predicate. PiperOrigin-RevId: 275616564	2019-10-19 01:53:25 -07:00
Sean Silva	9c9a7e9268	Add support for function result attributes. This allows dialect-specific attributes to be attached to func results. (or more specifically, FunctionLike ops). For example: ``` func @f() -> (i32 {my_dialect.some_attr = 3}) ``` This attaches my_dialect.some_attr with value 3 to the first result of func @f. Another more complex example: ``` func @g() -> (i32, f32 {my_dialect.some_attr = "foo", other_dialect.some_other_attr = [1,2,3]}, i1) ``` Here, the second result has two attributes attached. PiperOrigin-RevId: 275564165	2019-10-18 16:03:28 -07:00
Nicolas Vasilache	9e7e297da3	Lower vector transfer ops to loop.for operations. This allows mixing linalg operations with vector transfer operations (with additional modifications to affine ops) and is a step towards solving tensorflow/mlir#189. PiperOrigin-RevId: 275543361	2019-10-18 14:10:10 -07:00
Nicolas Vasilache	2823b68580	Implement lowering of VectorTypeCastOp to LLVM A VectorTypeCastOp can only be used to lower between statically sized contiguous memrefs of scalar and matching vector type. The sizes and strides are thus fully static and easy to determine. A relevant test is added. This is a step towards solving tensorflow/mlir#189. PiperOrigin-RevId: 275538981	2019-10-18 14:00:06 -07:00
Nicolas Vasilache	151e7e61e8	Automated rollback of commit `575405f4d6` PiperOrigin-RevId: 275461067	2019-10-18 06:45:06 -07:00
Stephan Herhut	3622e1833f	Use StrEnumAttr for gpu.allreduce op instead of StringAttr to better encode constraints. PiperOrigin-RevId: 275448372	2019-10-18 04:44:48 -07:00
Christian Sigg	fe0ee32da5	Add gpu.barrier op to synchronize invocations of a local workgroup. Adding gen table for rewrite patterns from GPU to NVVM dialect. Copy missing op documentation from GPUOps.td to GPU.md. PiperOrigin-RevId: 275419588	2019-10-18 00:30:44 -07:00
River Riddle	2acc220f17	NFC: Remove trivial builder get methods. These don't add any value, and some are even more restrictive than the respective static 'get' method. PiperOrigin-RevId: 275391240	2019-10-17 20:08:34 -07:00
River Riddle	575405f4d6	Automated rollback of commit `b65c8bb5d6` PiperOrigin-RevId: 275370861	2019-10-17 17:11:39 -07:00
Nicolas Vasilache	b65c8bb5d6	Add EDSC support for loop.for operations This CL adds support for loop.for operations in EDSC and adds a test. This will be used in a followup commit to implement lowering of vector_transfer ops so that it works more generally and is not subject to affine constraints. PiperOrigin-RevId: 275349796	2019-10-17 15:18:34 -07:00
River Riddle	dae0ae6879	NFC: Delete the Linalg tutorial. This part of the tutorial is now covered by a new flow in Toy. This also removes a point of confusion as there is also a proper Linalg dialect. PiperOrigin-RevId: 275338933	2019-10-17 14:27:37 -07:00
River Riddle	0372eb413f	Add Ch.6 of the Toy tutorial. This chapters introduces the notion of a full conversion, and adds support for lowering down to the LLVM dialect, LLVM IR, and thus code generation. PiperOrigin-RevId: 275337786	2019-10-17 14:22:13 -07:00
Nicolas Vasilache	5b03e692f6	Decouple Linalg promotion from Linalg tiling - NFC This CL creates a new Linalg promotion pass that operates on SubViewOp and decouples it from Linalg tiling. This is mostly moving code around. PiperOrigin-RevId: 275329213	2019-10-17 13:41:17 -07:00
Denis Khalikov	a560505d1a	[spirv] Add a canonicalization pattern for spv.selection. Add a canonicalization pattern for spv.selection operation. Convert spv.selection operation to spv.Select based on simple pattern. Closes tensorflow/mlir#183 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/183 from denis0x0D:sandbox/canon_select 43d04d923272dd60b9da39f70bdbc51a5168db62 PiperOrigin-RevId: 275312748	2019-10-17 12:36:47 -07:00
Lei Zhang	057dc41bf6	Allow '_' when pretty printing dialect symbols '_' is used frequently enough as the separator of words in symbols. We should allow it in dialect symbols when considering pretty printing. Also updated LangRef.md regarding pretty form. PiperOrigin-RevId: 275312494	2019-10-17 12:24:18 -07:00
Nicolas Vasilache	10039d04e2	Rename LoopNestBuilder to AffineLoopNestBuilder - NFC PiperOrigin-RevId: 275310747	2019-10-17 12:13:59 -07:00
Lei Zhang	23d21af65c	[DRR] Allow capturing and referencing no-result ops Previously when we bind a symbol to an op in DRR, it means to capture the op's result(s) and later references will be expanded to result(s). This means for ops without result, we are replacing the symbol with nothing. This CL treats non-result op capturing and referencing as a special case to mean the op itself. PiperOrigin-RevId: 275269702	2019-10-17 09:02:31 -07:00
Lei Zhang	603117b2d6	Fix RewriterGen to support using NativeCodeCall as auxiliary pattern NativeCodeCall is handled differently than normal op creation in RewriterGen (because its flexibility). It will only be materialized to output stream if it is used. But when using it for auxiliary patterns, we still want the side effect even if it is not replacing matched root op's results. PiperOrigin-RevId: 275265467	2019-10-17 08:39:59 -07:00
River Riddle	bdc250c5a7	Fix invalid transpose in example and add proper verification. The transpose in the example had the same result type as its input, which is incorrect. PiperOrigin-RevId: 275186568	2019-10-16 22:37:00 -07:00
Lei Zhang	0e3efb32c6	[spirv] Implement inliner interface We just need to implement a few interface hooks to DialectInlinerInterface and CallOpInterface to gain the benefits of an inliner. :) Right now only supports some trivial cases: * Inlining single block with spv.Return/spv.ReturnValue * Inlining multi block with spv.Return * Inlining spv.selection/spv.loop without return ops More advanced cases will require block argument and Phi support. PiperOrigin-RevId: 275151132	2019-10-16 17:46:19 -07:00
River Riddle	1ba9bb0507	Add Ch.5 of the toy tutorial. This chapter adds a partial lowering of toy operations, all but PrintOp, to a combination of the Affine and Std dialects. This chapter focuses on introducing the conversion framework, the benefits of partial lowering, and how easily dialects may co-exist in the IR. PiperOrigin-RevId: 275150649	2019-10-16 17:45:09 -07:00
River Riddle	7045471913	Add support for inlining toy call operations. The GenericCallOp needed to have the CallOpInterface to be picked up by the inliner. This also adds a CastOp to perform shape casts that are generated during inlining. The casts generated by the inliner will be folded away after shape inference. PiperOrigin-RevId: 275150438	2019-10-16 17:32:57 -07:00
Rob Suderman	a245023c1c	Add ComplexType to TableGen with Tensor support Create a ComplexType for table gen references. Include an AnyComplex type to check whether the resulting tensor can be complex. Expand tensors to allow complex types. PiperOrigin-RevId: 275144804	2019-10-16 16:59:08 -07:00
Sana Damani	3940b90d84	Update Chapter 4 of the Toy tutorial This Chapter now introduces and makes use of the Interface concept in MLIR to demonstrate ShapeInference. END_PUBLIC Closes tensorflow/mlir#191 PiperOrigin-RevId: 275085151	2019-10-16 12:19:39 -07:00
Jacques Pienaar	e88dbc8c95	Update comments in ast.toy PiperOrigin-RevId: 275084969	2019-10-16 12:08:24 -07:00
Mahesh Ravishankar	54a8473470	Makes spv.module generated by GPU->SPIRV conversion spec compliant Makes the spv.module generated by the GPU to SPIR-V conversion SPIR-V spec compliant (validated using spirv-val from Vulkan tools). 1) Separate out the VulkanLayoutUtils from DecorateSPIRVCompositeTypeLayoutPass to make it reusable within the Type converter in SPIR-V lowering infrastructure. This is used to compute the layout of the !spv.struct used in global variable type description. 2) Set the capabilities of the spv.module to Shader (needed for use of Logical Memory Model, and the extensions to SPV_KHR_storage_buffer_storage_class for use of Storage Buffer) PiperOrigin-RevId: 275081486	2019-10-16 11:53:07 -07:00
Christian Sigg	d2f0f847af	Support custom accumulator provided as region to gpu.all_reduce. In addition to specifying the type of accumulation through the 'op' attribute, the accumulation can now also be specified as arbitrary code region. Adds a gpu.yield op to specify the result of the accumulation. Also support more types (integers) and accumulations (mul). PiperOrigin-RevId: 275065447	2019-10-16 10:43:44 -07:00
River Riddle	dfe09cc621	Add support for PatternRewriter::eraseOp. This hook is useful when an operation is known to be dead, and no replacement values make sense. PiperOrigin-RevId: 275052756	2019-10-16 09:50:57 -07:00
Stephan Herhut	b843cc5d5a	Implement simple loop-invariant-code-motion based on dialect interfaces. PiperOrigin-RevId: 275004258	2019-10-16 04:28:38 -07:00
Hanhan Wang	950979745a	Add support for OpBitwiseOr, OpBitwiseXor, and OpBitwiseAnd in SPIR-V dialect. PiperOrigin-RevId: 274935374	2019-10-15 18:42:40 -07:00
Lei Zhang	e03e151983	[spirv] Add support for SpecId decoration on spv.specConstant The SpecId decoration is the handle for providing external specialization. Similar to descriptor set and binding on global variables, we directly bake it into assembly parsing and printing. PiperOrigin-RevId: 274893879	2019-10-15 14:53:30 -07:00
Sana Damani	cd45b0c8d9	Update Chapter 3 to demonstrate pattern match and rewrite optimizations This is using Table-driven Declarative Rewrite Rules (DRR), the previous version of the tutorial only showed the C++ patterns. Closes tensorflow/mlir#187 PiperOrigin-RevId: 274852321	2019-10-15 11:40:44 -07:00
Nicolas Vasilache	31c5a41a30	Consistent use of int in mlir_runner_utils.cpp This should fix the OSS build by only using int in template types. PiperOrigin-RevId: 274843584	2019-10-15 11:04:45 -07:00
Nicolas Vasilache	abf5c60af9	Add conversion for splat of vectors of 2+D This CL adds a missing lowering for splat of multi-dimensional vectors. Additional support is also added to the runtime utils library to allow printing memrefs with such vectors. PiperOrigin-RevId: 274794723	2019-10-15 06:53:08 -07:00
River Riddle	300112e135	Merge Ch3 of the Toy tutorial into chapter 2. This effectively rewrites Ch.2 to introduce dialects, operations, and registration instead of deferring to Ch.3. This allows for introducing the best practices up front(using ODS, registering operations, etc.), and limits the opaque API to the chapter document instead of the code. PiperOrigin-RevId: 274724289	2019-10-14 21:13:45 -07:00
River Riddle	96de7091bc	Allowing replacing non-root operations in DialectConversion. When dealing with regions, or other patterns that need to generate temporary operations, it is useful to be able to replace other operations than the root op being matched. Before this PR, these operations would still be considered for legalization meaning that the conversion would either fail, erroneously need to mark these ops as legal, or add unnecessary patterns. PiperOrigin-RevId: 274598513	2019-10-14 10:01:59 -07:00
Mehdi Amini	24c392f21c	Use single quotes to wrap '@HOST_LDFLAGS@' in LIT config file ldflags can contain double-quoted paths, so must use single quotes here. PiperOrigin-RevId: 274581983	2019-10-14 09:05:34 -07:00
Nicolas Vasilache	5c5d83afb4	Fix linalg.subview behavior in (partially) static cases. When the implementation of the strided memref [RFC](https://groups.google.com/a/tensorflow.org/forum/#!msg/mlir/MaL8m2nXuio/1scRqZa6AQAJ) landed, linalg started using this type instead of the now retired !linalg.view. As static and partially static cases appear, the stride information needs to be maintained properly. In particular, the result type of the subview op was generally incorrect. This CL fixes the issue by computing a return type that: 1. always has dynamic sizes, which is generally the only correct way to construct a subview in the absence of data padding and/or code versioning. 2. has the same strides as the base strided memref. Point 1. above can be further refined but will needs further analysis and canonicalization to optimize the particular case where: 1. The base memref has static size along a given dimension. 2. The subview size can be statically derived (e.g. after canonicalization). 3. And the subview size is an even divisor of the base memref. This 3rd constraint is well-known in the case of tiled layouts that don't assume implicit padding: the boundary tile may be only partial and has size given by `problem_size % tile_size`. Tests are updated as appropriate. PiperOrigin-RevId: 274578624	2019-10-14 08:43:53 -07:00
Nicolas Vasilache	c2285b619d	Add lowering of VectorOps dialect to LLVM to the Linalg LLVM lowering pass This fixes an omission that prevents Linalg to lower generic ops regions operating on ops in the VectorOps dialect. To achieve this we simply need to `populateVectorToLLVMConversionPatterns` in the conversion. Relevant tests are added. PiperOrigin-RevId: 274577325	2019-10-14 08:43:26 -07:00
Eric Schweitz	a3d084848d	Add LLVM IR dialect hooks for FP128 and X86_FP80 types Closes tensorflow/mlir#184 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/184 from schweitzpgi:more-float-types ca27d00510a86ffc9c79c65fb3a0193b5ea097a0 PiperOrigin-RevId: 274288813	2019-10-11 18:35:33 -07:00
Alex Zinenko	8c2ea32072	Emit LLVM IR equivalent of sizeof when lowering alloc operations Originally, the lowering of `alloc` operations has been computing the number of bytes to allocate when lowering based on the properties of MLIR type. This does not take into account type legalization that happens when compiling LLVM IR down to target assembly. This legalization can widen the type, potentially leading to out-of-bounds accesses to `alloc`ed data due to mismatches between address computation that takes the widening into account and allocation that does not. Use the LLVM IR's equivalent of `sizeof` to compute the number of bytes to be allocated: %0 = getelementptr %type* null, %indexType 0 %1 = ptrtoint %type* %0 to %indexType adapted from http://nondot.org/sabre/LLVMNotes/SizeOf-OffsetOf-VariableSizedStructs.txt PiperOrigin-RevId: 274159900	2019-10-11 06:33:26 -07:00
Alex Zinenko	71b82bcbf6	LLVM Dialect: introduce llvm.mlir.null operation Similarly to `llvm.mlir.undef`, this auxiliary operation creates an SSA value that corresponds to `null` in LLVM IR. This operation is necessary to model sizeof(<...>) behavior when allocating memory. PiperOrigin-RevId: 274158760	2019-10-11 06:32:24 -07:00
Alexander Belyaev	00d2a37e32	Add unary ops and ExpOp to Standard Dialect. PiperOrigin-RevId: 274152154	2019-10-11 05:13:55 -07:00
Alex Zinenko	304e44a6b0	LLVM conversion: harden a test to check for LLVM funcs rather than any funcs This test was not updated in the original commit that switched to using LLVM functions since it wasn't broken by that change. FileCheck was able to match the `func` part of `llvm.func` to the expected pattern and continue as usual. Make sure the `llvm.` dialect prefix is included in the expected output. PiperOrigin-RevId: 274127281	2019-10-11 01:36:38 -07:00
River Riddle	7a7dcc171d	Add support for generating reproducers on pass crash and failure. This cl adds support for generating a .mlir file containing a reproducer for crashes and failures that happen during pass execution. The reproducer contains a comment detailing the configuration of the pass manager(e.g. the textual description of the pass pipeline that the pass manager was executing), along with the original input module. Example Output: // configuration: -pass-pipeline='func(cse, canonicalize), inline' // note: verifyPasses=false module { ... } PiperOrigin-RevId: 274088134	2019-10-10 19:36:54 -07:00
Alex Zinenko	08a2ce8a14	Standard-to-LLVM conversion: check that operands have LLVM types In Standard to LLVM dialect conversion, the binary op conversion pattern implicitly assumed some operands were of LLVM IR dialect type. This is not necessarily true, for example if the Ops that produce those operands did not match the existing convresion patterns. Check if all operands are of LLVM IR dialect type and if not, fail to patch the binary op pattern. Closes tensorflow/mlir#168 PiperOrigin-RevId: 274063207	2019-10-10 17:19:57 -07:00
Mahesh Ravishankar	28d7f9c052	Add lowering of constant ops to SPIR-V. The lowering is specified as a pattern and is done only if the result is a SPIR-V scalar type or vector type. Handling ConstantOp with index return type needs special handling since SPIR-V dialect does not have index types. Based on the bitwidth of the attribute value, either i32 or i64 is chosen. Other constant lowerings are left as a TODO. PiperOrigin-RevId: 274056805	2019-10-10 17:19:57 -07:00
Geoffrey Martin-Noble	736f80d0dd	Add trait for specified shapes matching PiperOrigin-RevId: 274046434	2019-10-10 17:19:57 -07:00
River Riddle	6b1cc3c6ea	Add support for canonicalizing callable regions during inlining. This will allow for inlining newly devirtualized calls, as well as give a more accurate cost model(when we have one). Currently canonicalization will only run for nodes that have no child edges, as the child nodes may be erased during canonicalization. We can support this in the future, but it requires more intricate deletion tracking. PiperOrigin-RevId: 274011386	2019-10-10 17:06:33 -07:00
River Riddle	438dc176b1	Remove the need to convert operations in regions of operations that have been replaced. When an operation with regions gets replaced, we currently require that all of the remaining nested operations are still converted even though they are going to be replaced when the rewrite is finished. This cl adds a tracking for a minimal set of operations that are known to be "dead". This allows for ignoring the legalization of operations that are won't survive after conversion. PiperOrigin-RevId: 274009003	2019-10-10 17:06:25 -07:00
Geoffrey Martin-Noble	cc145706aa	NFC: Cleanup of type checking tests 1. Rename test ops referencing operand to index from 0 consistent with how we index elsewhere. 2. Don't limit type checking that functions for all shaped types to only tensors. 3. Don't limit (element) type checking functions and add tests for scalars. 4. Remove SSA values that don't do anything. PiperOrigin-RevId: 273917608	2019-10-10 02:31:53 -07:00
Alex Zinenko	5e7959a353	Use llvm.func to define functions with wrapped LLVM IR function type This function-like operation allows one to define functions that have wrapped LLVM IR function type, in particular variadic functions. The operation was added in parallel to the existing lowering flow, this commit only switches the flow to use it. Using a custom function type makes the LLVM IR dialect type system more consistent and avoids complex conversion rules for functions that previously had to use the built-in function type instead of a wrapped LLVM IR dialect type and perform conversions during the analysis. PiperOrigin-RevId: 273910855	2019-10-10 01:34:06 -07:00
Parker Schuh	309b4556d0	Add test for fix to tablegen for custom folders for ops that return a single variadic result. Add missing test for single line fix to `void OpEmitter::genFolderDecls()` entitled "Fold away reduction over 0 dimensions." PiperOrigin-RevId: 273880337	2019-10-09 20:44:30 -07:00
MLIR Team	ae6946ec11	Add ::printAsTextualPipeline to Pass and OpPassManager. Allow printing out pipelines in a format that is as close as possible to the textual pass pipeline format. Individual passes can override the print function in order to format any options that may have been used to construct that pass. PiperOrigin-RevId: 273813627	2019-10-09 13:49:17 -07:00
Mahesh Ravishankar	e2ed25bc43	Make SPIR-V lowering infrastructure follow Vulkan SPIR-V validation. The lowering infrastructure needs to be enhanced to lower into a spv.Module that is consistent with the SPIR-V spec. The following changes are needed 1) The Vulkan/SPIR-V validation rules dictates entry functions to have signature of void(void). This requires changes to the function signature conversion infrastructure within the dialect conversion framework. When an argument is dropped from the original function signature, a function can be specified that when invoked will return the value to use as a replacement for the argument from the original function. 2) Some changes to the type converter to make the converted type consistent with the Vulkan/SPIR-V validation rules, a) Add support for converting dynamically shaped tensors to spv.rtarray type. b) Make the global variable of type !spv.ptr<!spv.struct<...>> 3) Generate the entry point operation for the kernel functions and automatically compute all the interface variables needed PiperOrigin-RevId: 273784229	2019-10-09 11:25:58 -07:00
Nicolas Vasilache	171637d4f0	Fix Windows linkage error This CL fixes bad macro names usage in mlir_runner_utils.h. The macro mlir_runner_utils_EXPORTS now matches what is defined in CMakeLists.txt. PiperOrigin-RevId: 273773931	2019-10-09 10:38:31 -07:00
Diego Caballero	3451055614	Add support for some multi-store cases in affine fusion This PR is a stepping stone towards supporting generic multi-store source loop nests in affine loop fusion. It extends the algorithm to support fusion of multi-store loop nests that: 1. have only one store that writes to a function-local live out, and 2. the remaining stores are involved in loop nest self dependences or no dependences within the function. Closes tensorflow/mlir#162 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/162 from dcaballe:dcaballe/multi-output-fusion 7fb7dec6fe8b45f5ce176f018bfe37b256420c45 PiperOrigin-RevId: 273773907	2019-10-09 10:37:30 -07:00
Alexander Belyaev	c9c24ca23b	Add exp operation to LLVMOPs.td. PiperOrigin-RevId: 273718958	2019-10-09 04:19:43 -07:00
River Riddle	395ce4b41b	NFC: Fully qualify use of std::string. PiperOrigin-RevId: 273668957	2019-10-08 21:16:20 -07:00
Smit Hinsu	85b46314c0	Allow dynamic but ranked types in ops with SameOperandsAndResultShape and SameOperandsAndResultType traits Currently SameOperandsAndResultShape trait allows operands to have tensor<*xf32> and tensor<2xf32> but doesn't allow tensor<?xf32> and tensor<10xf32>. Also, use the updated shape compatibility helper function in TensorCastOp::areCastCompatible method. PiperOrigin-RevId: 273658336	2019-10-08 19:37:11 -07:00
River Riddle	b3a6ae8363	Update the symbol utility methods to handle the case of unknown operations. This enhances the symbol table utility methods to handle the case where an unknown operation may define a symbol table. When walking symbols, we now collect all symbol uses before allowing the user to iterate. This prevents the user from assuming that all symbols are actually known before performing a transformation. PiperOrigin-RevId: 273651963	2019-10-08 18:38:37 -07:00
MLIR Team	7446151236	Add Instance Specific Pass Options. This allows individual passes to define options structs and for these options to be parsed per instance of the pass while building the pass pipeline from the command line provided textual specification. The user can specify these per-instance pipeline options like so: ``` struct MyPassOptions : public PassOptions<MyPassOptions> { Option<int> exampleOption{this, "flag-name", llvm:🆑:desc("...")}; List<int> exampleListOption{this, "list-flag-name", llvm:🆑:desc("...")}; }; static PassRegistration<MyPass, MyPassOptions> pass("my-pass", "description"); ``` PiperOrigin-RevId: 273650140	2019-10-08 18:23:43 -07:00
River Riddle	71c7962201	Add support for parsing/printing non bare-identifier SymbolRefs. The restriction that symbols can only have identifier names is arbitrary, and artificially limits the names that a symbol may have. This change adds support for parsing and printing symbols that don't fit in the 'bare-identifier' grammar by printing the reference in quotes, e.g. @"0_my_reference" can now be used as a symbol name. PiperOrigin-RevId: 273644768	2019-10-08 17:45:07 -07:00
Deven Desai	956a831130	[ROCm] Fix the return type for the device function calls from i32 to i64. This is matching what the runtime library is expecting. Closes tensorflow/mlir#171 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/171 from deven-amd:deven-rocdl-device-func-i64 80762629a8c34e844ebdc542b34dd783990db9db PiperOrigin-RevId: 273640767	2019-10-08 17:41:42 -07:00
Denis Khalikov	d21ba951de	[spirv] Add a pass to decorate the composite types with layout info. Add a pass to decorate the composite types used by composite objects in the StorageBuffer, PhysicalStorageBuffer, Uniform, and PushConstant storage classes with layout information. Closes tensorflow/mlir#156 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/156 from denis0x0D:sandbox/layout_info_decoration 7c50840fd38ca169a2da7ce9886b52b50c868b84 PiperOrigin-RevId: 273634140	2019-10-08 16:54:11 -07:00
River Riddle	49b29dd186	Add a PatternRewriter hook for cloning a region into another. This is similar to the `inlineRegionBefore` hook, except the original blocks are unchanged. The region to be cloned must not have been modified during the conversion process at the point of cloning, i.e. it must belong an operation that has yet to be converted, or the operation that is currently being converted. PiperOrigin-RevId: 273622533	2019-10-08 15:45:08 -07:00
Uday Bondhugula	6136f33d59	unroll and jam: fix order of jammed bodies - bodies would earlier appear in the order (i, i+3, i+2, i+1) instead of (i, i+1, i+2, i+3) for example for factor 4. - clean up hardcoded test cases Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#170 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/170 from bondhugula:ujam b66b405b2b1894a03b376952e32a9d0292042665 PiperOrigin-RevId: 273613131	2019-10-08 15:13:11 -07:00
River Riddle	ac91e67375	Add support for walking the uses of a symbol. MLIR uses symbol references to model references to many global entities, such as functions/variables/etc. Before this change, there is no way to actually reason about the uses of such entities. This change provides a walker for symbol references(via SymbolTable::walkSymbolUses), as well as 'use_empty' support(via SymbolTable::symbol_use_empty). It also resolves some deficiencies in the LangRef definition of SymbolRefAttr, namely the restrictions on where a SymbolRefAttr can be stored, ArrayAttr and DictionaryAttr, and the relationship with operations containing the SymbolTable trait. PiperOrigin-RevId: 273549331	2019-10-08 10:21:59 -07:00
Alex Zinenko	11d12670da	GPUToCUDA: attach CUBIN to the nested module rather than to the function Originally, we were attaching attributes containing CUBIN blobs to the kernel function called by `gpu.launch_func`. This kernel is now contained in a nested module that is used as a compilation unit. Attach compiled CUBIN blobs to the module rather than to the function since we were compiling the module. This also avoids duplication of the attribute on multiple kernels within the same module. PiperOrigin-RevId: 273497303	2019-10-08 05:11:26 -07:00
Alex Zinenko	52e082b6ed	GPUToCUDA: emit addressof directly instead of wrapping it into a getter function Originally, the CUBIN getter function was introduced as a mechanism to circumvent the absence of globals in the LLVM dialect. It would allocate memory and populate it with the CUBIN data. LLVM dialect now supports globals and they are already used to store CUBIN data, making the getter function a trivial address computation of a global. Emit the address computation directly at the place of `gpu.launch_func` instead of putting it in a function and calling it. This simplifies the conversion flow and prepares it for using the DialectConversion infrastructure. PiperOrigin-RevId: 273496221	2019-10-08 05:03:42 -07:00
Alex Zinenko	16af5924cb	Fuse GenerateCubinAccessors pass into LaunchFunctToCuda Now that the accessor function is a trivial getter of the global variable, it makes less sense to have the getter generation as a separate pass. Move the getter generation into the lowering of `gpu.launch_func` to CUDA calls. This change is mostly code motion, but the process can be simplified further by generating the addressof inplace instead of using a call. This is will be done in a follow-up. PiperOrigin-RevId: 273492517	2019-10-08 04:35:33 -07:00
Alex Zinenko	90d65d32d6	Use named modules for gpu.launch_func The kernel function called by gpu.launch_func is now placed into an isolated nested module during the outlining stage to simplify separate compilation. Until recently, modules did not have names and could not be referenced. This limitation was circumvented by introducing a stub kernel at the same name at the same nesting level as the module containing the actual kernel. This relation is only effective in one direction: from actual kernel function to its launch_func "caller". Leverage the recently introduced symbol name attributes on modules to refer to a specific nested module from `gpu.launch_func`. This removes the implicit connection between the identically named stub and kernel functions. It also enables support for `gpu.launch_func`s to call different kernels located in the same module. PiperOrigin-RevId: 273491891	2019-10-08 04:30:32 -07:00
River Riddle	a8a73f0640	Add a flag to the AsmPrinter for eliding large ElementsAttrs. Some modules may have extremely large ElementsAttrs, which makes debugging involving IR dumping extremely slow and painful. This change adds a flag that will elide ElementsAttrs with a "large"(as defined by the user) number of elements by printing "..." instead of the element data. PiperOrigin-RevId: 273413100	2019-10-07 17:19:20 -07:00
Lei Zhang	5a1108c9a6	[spirv] Disable a crashing spv.loop test PiperOrigin-RevId: 273379318	2019-10-07 14:40:49 -07:00
Mahesh Ravishankar	9e9c3a009a	Update UndefOp (de)serialization to generate OpUndef at module level. The SPIR-V spec recommends all OpUndef instructions be generated at module level. For the SPIR-V dialect its better for UndefOp to produce an SSA value for use with other instructions. If UndefOp is to be used at module level, it cannot produce an SSA value (use of this SSA value within FuncOp would need implicit capture). To satisfy needs of the SPIR-V spec while making it simpler to represent UndefOp in the SPIR-V dialect, the serialization is updated to create OpUndef instruction at module scope. PiperOrigin-RevId: 273355526	2019-10-07 12:56:38 -07:00
Lei Zhang	ebf584b813	[spirv] Fix function entry block erase after moving to spv.selection The structured selection/loop's entry block does not have arguments. If the function's header block is also part of the structured control flow, we cannot just simply erase it because it may contain arguments matching the function signature and used by the cloned blocks. Instead, turn it into a block only containing a spv.Branch op. Also, we can directly emit instructions for the spv.selection header block to the block containing the spv.selection op. This eliminates unnecessary branches in the SPIR-V blob. Added a test for nested spv.loop. PiperOrigin-RevId: 273351424	2019-10-07 12:37:13 -07:00
Uday Bondhugula	89e7a76a1c	fix simplify-affine-structures bug Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#157 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/157 from bondhugula:quickfix bd1fcd79825fc0bd5b4a3e688153fa0993ab703d PiperOrigin-RevId: 273316498	2019-10-07 10:04:50 -07:00
Nicolas Vasilache	3b4f133fb7	Start a minimal mlir_utils runtime library for testing debugging purposes Now that MLIR has a standardized StridedMemRef descriptor, it becomes very easy to interact with external library functions and build utilities directly in C++. This CL introduces basic printing support in a libmlir_utils.so. Unit tests are rewritten using this feature and also to improve coverage. For now, C mandates that we have a unique function for each MemRef element type and rank. In a future a simple unranked descriptor can be introduced to only require uniqu'ing by element type. PiperOrigin-RevId: 273304741	2019-10-07 09:06:55 -07:00
Nicolas Vasilache	9f98bcda47	Support AllocOp terminal in Linalg::AliasAnalysis. Now that linalg.view and strided memrefs are unified, there is no reason to disallow AllocOp in alias analysis. This CLs adds support for AllocOp which allows writing shorter tests that do not require explicitly creating a view for each operation. PiperOrigin-RevId: 273303060	2019-10-07 09:01:18 -07:00
Nicolas Vasilache	c07a604f87	Fix CMake build after adding TestOpaqueLoc.cpp PiperOrigin-RevId: 273296399	2019-10-07 08:25:53 -07:00
MLIR Team	da984166df	Add OpaqueLoc to MLIR locations. See RFC: https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/xE2IzfhE3Wg. Opaque location stores two pointers, one of them points to some data structure that is external to MLIR, and the other one is unique for each type and represents type id of that data structure. OpaqueLoc also stores an optional location that can be used if the first one is not suitable. OpaqueLoc is managed similar to FileLineColLoc. It is passed around by MLIR transformations and can be used in compound locations like CallSiteLoc. PiperOrigin-RevId: 273266510	2019-10-07 05:05:42 -07:00
Christian Sigg	7c765d97f9	Support reduction of partial warps. gpu.all_reduce now supports block sizes that are not multiple of 32. PiperOrigin-RevId: 273255204	2019-10-07 03:31:00 -07:00
Geoffrey Martin-Noble	18db4ce493	Allow element type traits to operate on scalars This allows confirming that a scalar argument has the same element type as a shaped one. It's easy to validate a type is shaped on its own if that's desirable, so this shouldn't make that use case harder. This matches the behavior of other traits that operate on element type (e.g. AllElementTypesMatch). Also this makes the code simpler because now we just use getElementTypeOrSelf. Verified that all uses in core already check the type is shaped in another way. PiperOrigin-RevId: 273068507	2019-10-05 10:06:06 -07:00
Geoffrey Martin-Noble	8b9b72cee8	NFC: Cleanup test ops and traits tests 1. Rename a few ops to make it clear they operate on element types. 2. Remove unused and generic operand and result ODS names (e.g. $res, $arg, $input). These are just clutter and don't make the op definitions any clearer. 3. Give test cases with duplicate names clearer names. 4. Add missing test case for no operands in SameOperandAndResultElementType. PiperOrigin-RevId: 273067933	2019-10-05 10:00:57 -07:00
Lei Zhang	c020480fc6	[spirv] Allow return ops to be in control flow ops Use `getParentOfType<FunctionOp>()` instead of `cast<FuncOp>(getParentOp())` to avoid crash when return ops are used inside spv.selection/spv.loop. PiperOrigin-RevId: 273006041	2019-10-04 20:08:52 -07:00
Mehdi Amini	58e2ead314	Add missing dependency on the TypeInferOpInterface from the Test dialect This is fixing a build failure, usually non-deterministic because of parallelism in the build, but could be reliably reproduced: ninja projects/mlir/test/lib/TestDialect/CMakeFiles/MLIRTestDialect.dir/TestPatterns.cpp.o PiperOrigin-RevId: 272998436	2019-10-04 18:40:47 -07:00
Mahesh Ravishankar	3f8bde40cb	Add spv.Undef op to support OpUndef instruction in SPIR-V. Adding support for OpUndef instruction. Updating the dialect generation script to fix a few bugs in the instruction spec generation. PiperOrigin-RevId: 272975685	2019-10-04 16:00:22 -07:00
Nicolas Vasilache	754ea72794	Replace constexpr MemRefType::kDynamicStrideOrOffset by a MemRefType:;getDynamicStrideOrOffset() method - NFC This fixes global ODR-use issues, some of which manifest in Parser.cpp. Fixes tensorflow/mlir#167. PiperOrigin-RevId: 272886347	2019-10-04 08:58:09 -07:00
Nicolas Vasilache	516f6a3477	Add missing Linalg lowerings to allow roundtrip.mlir to lower to LLVM Certain lowering patterns were reported as [missing](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/dkdmHa77sSQ). This CL adds them and allows Linalg/roundtrip.mlir and Linalg/loops.mlir to lower to LLVM directly. Those 2 tests are updated to additionally check that the direct lowering to LLVM does not crash. The following points, left as TODOs still need to be addressed for correct end-to-end execution: 1. the lowering for ConvOp needs to pass attributes such as strides and dilations; the external library call needs to support it. 2. the lowering for GenericOp needs to support lowering to loops as a DialectConversion pattern. This is blocked on the DialectConversion infrastructure accepting an OperationFolder. PiperOrigin-RevId: 272878131	2019-10-04 08:07:54 -07:00
River Riddle	5830f71a45	Add support for inlining calls with different arg/result types from the callable. Some dialects have implicit conversions inherent in their modeling, meaning that a call may have a different type that the type that the callable expects. To support this, a hook is added to the dialect interface that allows for materializing conversion operations during inlining when there is a mismatch. A hook is also added to the callable interface to allow for introspecting the expected result types. PiperOrigin-RevId: 272814379	2019-10-03 23:10:51 -07:00
River Riddle	a20d96e436	Update the Inliner pass to work on SCCs of the CallGraph. This allows for the inliner to work on arbitrary call operations. The updated inliner will also work bottom-up through the callgraph enabling support for multiple levels of inlining. PiperOrigin-RevId: 272813876	2019-10-03 23:05:21 -07:00
Feng Liu	8c95223e3c	Add `axis` attribute to the quant.stats op The first dim length of the axisStats attribute should equals to the slice size of the input argument when splitted by the axis dimension. PiperOrigin-RevId: 272798042	2019-10-03 20:29:08 -07:00
MLIR Team	0dfa7fc908	Add fpext and fptrunc to the Standard dialect and includes conversion to LLVM PiperOrigin-RevId: 272768027	2019-10-03 16:37:24 -07:00
Christian Sigg	496f4590a1	Generalize parse/printBinaryOp to parse/printOneResultOp. PiperOrigin-RevId: 272722539	2019-10-03 13:00:12 -07:00
Nicolas Vasilache	218f0e611a	Add syntactic sugar for strided memref parsing. This CL implements the last remaining bit of the [strided memref proposal](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio). The syntax is a bit more explicit than what was originally proposed and resembles: `memref<?x?xf32, offset: 0 strides: [?, 1]>` Nonnegative strides and offsets are currently supported. Future extensions will include negative strides. This also gives a concrete example of syntactic sugar for the ([RFC] Proposed Changes to MemRef and Tensor MLIR Types)[https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/-wKHANzDNTg]. The underlying implementation still uses AffineMap layout. PiperOrigin-RevId: 272717437	2019-10-03 12:34:36 -07:00
Alex Zinenko	8633b6bc8e	Give modules a name Modules are now Ops and, as such, can be nested. They do not produce an SSA value so there is no possibility to refer to them in the IR. Introduce support for symbol names attached to the module Op so that it can be referred to using SymbolRefAttrs. The name is optional, for example the implicit top-level module does not have a name. PiperOrigin-RevId: 272671600	2019-10-03 08:56:38 -07:00
Alex Zinenko	e0d78eac23	NFC: rename Conversion/ControlFlowToCFG to Conversion/LoopToStandard This makes the name of the conversion pass more consistent with the naming scheme, since it actually converts from the Loop dialect to the Standard dialect rather than working with arbitrary control flow operations. PiperOrigin-RevId: 272612112	2019-10-03 01:35:03 -07:00

... 4 5 6 7 8 ...

1715 Commits