llvm-project

Commit Graph

Author	SHA1	Message	Date
Uday Bondhugula	a63f6e0bf9	Replace spurious SmallVector constructions with ValueRange Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#305 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/305 from bondhugula:value_range 21d1fae73f549e3c8e72b60876eff1b864cea39c PiperOrigin-RevId: 284541027	2019-12-09 06:26:33 -08:00
Nicolas Vasilache	ade58a268c	Add a layer of recursive matchers that compose. This CL adds support for building matchers recursively. The following matchers are provided: 1. `m_any()` can match any value 2. `m_val(Value *)` binds to a value and must match it 3. `RecursivePatternMatcher<OpType, Matchers...>` n-arity pattern that matches `OpType` and whose operands must be matched exactly by `Matchers...`. This allows building expression templates for patterns, declaratively, in a very natural fashion. For example pattern `p9` defined as follows: ``` auto mul_of_muladd = m_Op<MulFOp>(m_Op<MulFOp>(), m_Op<AddFOp>()); auto mul_of_anyadd = m_Op<MulFOp>(m_any(), m_Op<AddFOp>()); auto p9 = m_Op<MulFOp>(m_Op<MulFOp>( mul_of_muladd, m_Op<MulFOp>()), m_Op<MulFOp>(mul_of_anyadd, mul_of_anyadd)); ``` Successfully matches `%6` in: ``` %0 = addf %a, %b: f32 %1 = addf %a, %c: f32 // matched %2 = addf %c, %b: f32 %3 = mulf %a, %2: f32 // matched %4 = mulf %3, %1: f32 // matched %5 = mulf %4, %4: f32 // matched %6 = mulf %5, %5: f32 // matched ``` Note that 0-ary matchers can be used as leaves in place of n-ary matchers. This alleviates from passing explicit `m_any()` leaves. In the future, we may add extra patterns to specify that operands may be matched in any order. PiperOrigin-RevId: 284469446	2019-12-08 18:09:40 -08:00
Lei Zhang	9a4c2df480	NFC: Expose constFoldBinaryOp via a header This allows other dialects to reuse the logic to support constant folding binary operations and reduces code duplication. PiperOrigin-RevId: 284428721	2019-12-08 06:25:54 -08:00
River Riddle	d6ee6a0310	Update the builder API to take ValueRange instead of ArrayRef<Value *> This allows for users to provide operand_range and result_range in builder.create<> calls, instead of requiring an explicit copy into a separate data structure like SmallVector/std::vector. PiperOrigin-RevId: 284360710	2019-12-07 10:35:41 -08:00
River Riddle	9d1a0c72b4	Add a new ValueRange class. This class represents a generic abstraction over the different ways to represent a range of Values: ArrayRef<Value >, operand_range, result_range. This class will allow for removing the many instances of explicit SmallVector<Value , N> construction. It has the same memory cost as ArrayRef, and only suffers cost from indexing(if+elsing the different underlying representations). This change only updates a few of the existing usages, with more to be changed in followups; e.g. 'build' API. PiperOrigin-RevId: 284307996	2019-12-06 20:07:23 -08:00
Nicolas Vasilache	d27bc1db6a	Improve Linalg documentation following the Structured Ops presentation. PiperOrigin-RevId: 284291653	2019-12-06 17:09:16 -08:00
River Riddle	8904e91035	Add a flag to the IRPrinter instrumentation to only print after a pass if there is a change to the IR. This adds an additional filtering mode for printing after a pass that checks to see if the pass actually changed the IR before printing it. This "change" detection is implemented using a SHA1 hash of the current operation and its children. PiperOrigin-RevId: 284291089	2019-12-06 17:05:05 -08:00
Uday Bondhugula	ca23bd78d4	NFC - update doc, comments, vim syntax file - for the symbol rules, the code was updated but the doc wasn't. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#284 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/284 from bondhugula:doc 9aad8b8a715559f7ce61265f3da3f8a3c11b45ea PiperOrigin-RevId: 284283712	2019-12-06 16:17:06 -08:00
nmostafa	fcc215e399	Fix langref code snippet - NFC Closes tensorflow/mlir#294 PiperOrigin-RevId: 284281172	2019-12-06 16:03:51 -08:00
Mahesh Ravishankar	6500b7e0c0	NFC: Separate implementation and definition in ConvertStandardToSPIRV.cpp PiperOrigin-RevId: 284274326	2019-12-06 15:26:17 -08:00
Jacques Pienaar	4add9edd72	Change inferReturnTypes to return LogicalResult and values Previously the error case was using a sentinel in the error case which was bad. Also make the one `build` invoke the other `build` to reuse verification there. And follow up on suggestion to use formatv which I missed during previous review. PiperOrigin-RevId: 284265762	2019-12-06 14:42:45 -08:00
Alex Zinenko	e96150eb46	Replace custom getBody method with an ODS-generated in gpu::LaunchOp PiperOrigin-RevId: 284262981	2019-12-06 14:29:25 -08:00
Mahesh Ravishankar	883f555726	During serialization do a walk of ops in module to find spv.module. During lowering, spv.module might be within other modules (for example gpu kernel module). Walk the module op to find spirv module to serialize. PiperOrigin-RevId: 284262550	2019-12-06 14:27:03 -08:00
Alex Zinenko	3230267d0d	Move GPU::LaunchOp to ODS. NFC. Move the definition of the GPU launch opreation from hand-rolled C++ code to ODS framework. This only does the moves, a follow-up is necessary to clean up users of custom functions that could be auto-generated by ODS. PiperOrigin-RevId: 284261856	2019-12-06 14:23:37 -08:00
Alex Zinenko	6e0a2e4e2f	Use named traits in the ODS definition of LLVMFuncOp The "FunctionLike" and "IsIsolatedFromAbove" op traits are now defined as named records in base ODS file. Use those instead of NativeOpTrait referring to the C++ class name in the ODS definition of LLVMFuncOp. NFC. PiperOrigin-RevId: 284260891	2019-12-06 14:18:37 -08:00
Aart Bik	d37f27251f	[VecOps] Rename vector.[insert\|extract]element to just vector.[insert\|extract] Since these operations lower to [insert\|extract][element\|value] at LLVM dialect level, neither element nor value would correctly reflect the meaning. PiperOrigin-RevId: 284240727	2019-12-06 12:39:25 -08:00
Alex Zinenko	be3ed14658	LLVM::GlobalOp: take address space as builder argument Accept the address space of the global as a builder argument when constructing an LLVM::GlobalOp instance. This decreases the reliance of LLVM::GlobalOp users on the internal name of the attribute used for this purpose. Update several uses of the address space in GPU to NVVM conversion. PiperOrigin-RevId: 284233254	2019-12-06 12:01:46 -08:00
Alex Zinenko	ccc767d63b	Move GPU::FuncOp definition to ODS - NFC Move the definition of the GPU function opreation from hand-rolled C++ code to ODS framework. This only does the moves, a follow-up is necessary to clean up users of custom functions that could be auto-generated by ODS. PiperOrigin-RevId: 284233245	2019-12-06 12:00:32 -08:00
MLIR Team	9ef9e23682	Provide a way to get the type of a ValueHandle. PiperOrigin-RevId: 284221337	2019-12-06 11:07:10 -08:00
Aart Bik	b36aaeafb1	[VectorOps] Add lowering of vector.broadcast to LLVM IR For example, a scalar broadcast %0 = vector.broadcast %x : f32 to vector<2xf32> return %0 : vector<2xf32> which expands scalar x into vector [x,x] by lowering to the following LLVM IR dialect to implement the duplication over the leading dimension. %0 = llvm.mlir.undef : !llvm<"<2 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.insertelement %x, %0[%1 : !llvm.i64] : !llvm<"<2 x float>"> %3 = llvm.shufflevector %2, %0 [0 : i32, 0 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> return %3 : vector<2xf32> In the trailing dimensions, the operand is simply "passed through", unless a more elaborate "stretch" is required. For example %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> return %0 : vector<4xf32> becomes %0 = llvm.mlir.undef : !llvm<"<4 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.extractelement %arg0[%1 : !llvm.i64] : !llvm<"<1 x float>"> %3 = llvm.mlir.constant(0 : index) : !llvm.i64 %4 = llvm.insertelement %2, %0[%3 : !llvm.i64] : !llvm<"<4 x float>"> %5 = llvm.shufflevector %4, %0 [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>"> llvm.return %5 : !llvm<"<4 x float>"> PiperOrigin-RevId: 284219926	2019-12-06 11:02:29 -08:00
Jacques Pienaar	398f04aa49	Generate builder for ops that use InferTypeOpInterface trait in ODS For ops with infer type op interface defined, generate version that calls the inferal method on build. This is intermediate step to removing special casing of SameOperandsAndResultType & FirstAttrDereivedResultType. After that would be generating the inference code, with the initial focus on shaped container types. In between I plan to refactor these a bit to reuse generated paths. The intention would not be to add the type inference trait in multiple places, but rather to take advantage of the current modelling in ODS where possible to emit it instead. Switch the `inferReturnTypes` method to be static. Skipping ops with regions here as I don't like the Region vs unique_ptr<Region> difference at the moment, and I want the infer return type trait to be useful for verification too. So instead, just skip it for now to avoid churn. PiperOrigin-RevId: 284217913	2019-12-06 10:53:06 -08:00
Alex Zinenko	e216a72ab8	Add conversions of GPU func with memory attributions to LLVM/NVVM GPU functions use memory attributions, a combination of Op attributes and region arguments, to specify function-wide buffers placed in workgroup or private memory spaces. Introduce a lowering pattern for GPU functions to be converted to LLVM functions taking into account memory attributions. Workgroup attributions get transformed into module-level globals with unique names derived from function names. Private attributions get converted into llvm.allocas inside the function body. In both cases, we inject at the beginning of the function the IR that obtains the raw pointer to the data and populates a MemRef descriptor based on the MemRef type of buffer, making attributions compose with the rest of the MemRef lowering and transparent for use with std.load and std.store. While using raw pointers instead of descriptors might have been more efficient, it is better implemented as a canonicalization or a separate transformation so that non-attribution memrefs could also benefit from it. PiperOrigin-RevId: 284208396	2019-12-06 10:08:43 -08:00
Alexandre E. Eichenberger	3c69ca1e69	fix examples in comments Closes tensorflow/mlir#301 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/301 from AlexandreEichenberger:vect-doc-update 7e5418a9101a4bdad2357882fe660b02bba8bd01 PiperOrigin-RevId: 284202462	2019-12-06 09:40:50 -08:00
River Riddle	79047e1ab5	Use regex to fix failure when stats are disabled. It would be nice if we could detect if stats were enabled or not and use 'Requires', but this isn't possible to do at configure time. Fixes tensorflow/mlir#296 PiperOrigin-RevId: 284200271	2019-12-06 09:29:14 -08:00
Andy Davis	41f8e105fa	Unroll vector masks along with their associated vector arguments. Updates vector ContractionOp to use proper vector masks (produced by CreateMaskOp/ConstantMaskOp). Leverages the following canonicalizations in unrolling unit test: CreateMaskOp -> ConstantMaskOp, StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp Removes IndexTupleOp (no longer needed now that we have vector mask ops). Updates all unit tests. PiperOrigin-RevId: 284182168	2019-12-06 07:37:28 -08:00
Denis Khalikov	9ca53130f3	[spirv] Reorder `erase` and `emplace` to avoid "invalid iterator access". The iterator should be erased before adding a new entry into blockMergeInfo to avoid iterator invalidation. Closes tensorflow/mlir#299 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/299 from denis0x0D:sandbox/reoder_erase 983be565809aa0aadfc7e92962e4d4b282f63c66 PiperOrigin-RevId: 284173235	2019-12-06 06:26:56 -08:00
Uday Bondhugula	3ade6a7d15	DimOp folding for alloc/view dynamic dimensions Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#253 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/253 from bondhugula:dimop a4b464f24ae63fd259114558d87e11b8ee4dae86 PiperOrigin-RevId: 284169689	2019-12-06 06:00:54 -08:00
Kazuaki Ishizaki	84a6182ddd	minor spelling tweaks Closes tensorflow/mlir#290 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/290 from kiszk:spelling_tweaks_201912 9d9afd16a723dd65754a04698b3976f150a6054a PiperOrigin-RevId: 284169681	2019-12-06 05:59:30 -08:00
Alex Zinenko	58adf99ed1	LLVM::AddressOfOp: properly take into account the address space The AddressOf operation in the LLVM dialect return a pointer to a global variable. The latter may be in a non-default address space as indicated by the "addr_space" attribute. Check that the address space of the pointer returned by AddressOfOp matches that of the referenced GlobalOp. Update the AddressOfOp builder to respect this constraint. PiperOrigin-RevId: 284138860	2019-12-06 01:09:13 -08:00
River Riddle	12e57cf6c0	NFC: Add documentation for `-mlir-print-op-on-diagnostic` and `-mlir-print-stacktrace-on-diagnostic`. This change adds proper documentation in Diagnostics.md, allowing for users to more easily find them. PiperOrigin-RevId: 284092336	2019-12-05 17:47:03 -08:00
River Riddle	71999ff7f2	Add include path to the TestDialect to fix broken build. PiperOrigin-RevId: 284067891	2019-12-05 15:33:33 -08:00
Jose Ignacio Gomez	f60bbb6c3b	[Linalg] Add permutation information to tiling This patch closes issue tensorflow/mlir#271. It adds an optional permutation map to declarative tiling transformations. The map is expressed as a list of integers. Closes tensorflow/mlir#288 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/288 from tetuante:issue271 2df2938d6a1f01b3bc404ded08dea2dd1e10b588 PiperOrigin-RevId: 284064151	2019-12-05 15:14:59 -08:00
River Riddle	da53000fb4	Refactor the IRPrinting instrumentation to take a derivable config. This allows for more interesting behavior from users, e.g. enabling the ability to dump the IR to a separate file for each pass invocation. PiperOrigin-RevId: 284059447	2019-12-05 14:53:01 -08:00
nmostafa	daff60cd68	Add UnrankedMemRef Type Closes tensorflow/mlir#261 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/261 from nmostafa:nmostafa/unranked 96b6e918f6ed64496f7573b2db33c0b02658ca45 PiperOrigin-RevId: 284037040	2019-12-05 13:13:20 -08:00
Denis Khalikov	e67acfa468	[spirv] Add CompositeInsertOp operation A CompositeInsertOp operation make a copy of a composite object, while modifying one part of it. Closes tensorflow/mlir#292 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/292 from denis0x0D:sandbox/composite_insert 2200962b9057bda53cd2f2866b461e2797196380 PiperOrigin-RevId: 284036551	2019-12-05 13:10:44 -08:00
River Riddle	33a64540ad	Add support for instance specific pass statistics. Statistics are a way to keep track of what the compiler is doing and how effective various optimizations are. It is useful to see what optimizations are contributing to making a particular program run faster. Pass-instance specific statistics take this even further as you can see the effect of placing a particular pass at specific places within the pass pipeline, e.g. they could help answer questions like "what happens if I run CSE again here". Statistics can be added to a pass by simply adding members of type 'Pass::Statistics'. This class takes as a constructor arguments: the parent pass pointer, a name, and a description. Statistics can be dumped by the pass manager in a similar manner to how pass timing information is dumped, i.e. via PassManager::enableStatistics programmatically; or -pass-statistics and -pass-statistics-display via the command line pass manager options. Below is an example: struct MyPass : public OperationPass<MyPass> { Statistic testStat{this, "testStat", "A test statistic"}; void runOnOperation() { ... ++testStat; ... } }; $ mlir-opt -pass-pipeline='func(my-pass,my-pass)' foo.mlir -pass-statistics Pipeline Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== 'func' Pipeline MyPass (S) 15 testStat - A test statistic MyPass (S) 6 testStat - A test statistic List Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== MyPass (S) 21 testStat - A test statistic PiperOrigin-RevId: 284022014	2019-12-05 11:53:28 -08:00
Mahesh Ravishankar	4d61a79db4	Allow specification of the workgroup size for GPUToSPIRV lowering. SPIR-V/Vulkan spec requires the workgroups size to be specified with the spv.ExecutionMode operation. This was hard-wired to be set to a particular value. It is now changed to be configurable by clients of the pass or of the patterns that implement the lowering from GPU to SPIRV. PiperOrigin-RevId: 284017482	2019-12-05 11:31:57 -08:00
Lei Zhang	037044b0ae	Add spv.AtomicCompareExchangeWeak PiperOrigin-RevId: 283997917	2019-12-05 10:06:24 -08:00
River Riddle	780f0c043a	Add a flag to dump the current stack trace when emitting a diagnostic. It is often desirable to know where within the program that a diagnostic was emitted, without reverting to assert/unreachable which crash the program. This change adds a flag `mlir-print-stacktrace-on-diagnostic` that attaches the current stack trace as a note to every diagnostic that gets emitted. PiperOrigin-RevId: 283996373	2019-12-05 10:00:25 -08:00
Lei Zhang	c0a9de29ad	[spirv] Fix nested loop (de)serialization For serialization, when we have nested ops, the inner loop will create multiple SPIR-V blocks. If the outer loop has block arguments (which corresponds to OpPhi instructions), we defer the handling of OpPhi's parent block handling until we serialized all blocks and then fix it up with the result <id>. These two cases happening together was generating invalid SPIR-V blob because we previously assume the parent block to be the block containing the terminator. That is not true anymore when the block contains structured control flow ops. If that happens, it should be fixed to use the structured control flow op's merge block. For deserialization, we record a map from header blocks to their corresponding merge and continue blocks during the initial deserialization and then use the info to construct spv.selection/spv.loop. The existing implementation will also fall apart when we have nested loops. If so, we clone all blocks for the outer loop, including the ones for the inner loop, to the spv.loop's region. So the map for header blocks' merge info need to be updated; otherwise we are operating on already deleted blocks. PiperOrigin-RevId: 283949230	2019-12-05 04:39:37 -08:00
Mehdi Amini	b14ee5a9a1	Fix MLIR Build after LLVM upstream JIT changes (getMainJITDylib removed) The getMainJITDylib() method was removed in `4fc68b9b7f`, replace it by creating a JITDylib on the fly. PiperOrigin-RevId: 283948595	2019-12-05 04:32:46 -08:00
Tres Popp	b8cd0c1486	Move ModuleManager functionality into mlir::SymbolTable. Note for broken code, the following transformations occurred: ModuleManager::insert(Block::iterator, Operation) - > SymbolTable::insert(Operation, Block::iterator) ModuleManager::lookupSymbol -> SymbolTable::lookup ModuleManager::getModule() -> SymbolTable::getOp() ModuleManager::getContext() -> SymbolTable::getOp()->getContext() ModuleManager::* -> SymbolTable::* PiperOrigin-RevId: 283944635	2019-12-05 03:56:46 -08:00
Lei Zhang	b60799b71b	Add MLIRIR as a dependency to LLVM and related dialects Fixes tensorflow/mlir#289 PiperOrigin-RevId: 283914472	2019-12-04 23:45:35 -08:00
River Riddle	d9da8b647a	Optimize operation ordering to support non-congruent indices. This change adds support for non-congruent indices in the operation ordering within a basic block. This effect of this is that insertions are less likely to cause an invalidation of the ordering within a block. This has a big effect on modules that have very large basic blocks. PiperOrigin-RevId: 283858136	2019-12-04 16:10:13 -08:00
River Riddle	2c930f8d9d	Add emitOptional(Error\|Warning\|Remark) functions to simplify emission with an optional location. In some situations a diagnostic may optionally be emitted by the presence of a location, e.g. attribute and type verification. These situations currently require extra 'if(loc) emitError(...); return failure()' wrappers that make verification clunky. These new overloads take an optional location and a list of arguments to the diagnostic, and return a LogicalResult. We take the arguments directly and return LogicalResult instead of returning InFlightDiagnostic because we cannot create a valid diagnostic with a null location. This creates an awkward situation where a user may try to treat the, potentially null, diagnostic as a valid one and encounter crashes when attaching notes/etc. Below is an example of how these methods simplify some existing usages: Before: if (loc) emitError(*loc, "this is my diagnostic with argument: ") << 5; return failure(); After: return emitOptionalError(loc, "this is my diagnostic with argument: ", 5); PiperOrigin-RevId: 283853599	2019-12-04 15:49:42 -08:00
Nicolas Vasilache	b3f7cf80a7	Add a CL option to Standard to LLVM lowering to use alloca instead of malloc/free. In the future, a more configurable malloc and free interface should be used and exposed via extra parameters to the `createLowerToLLVMPass`. Until requirements are gathered, a simple CL flag allows generating code that runs successfully on hardware that cannot use the stdlib. PiperOrigin-RevId: 283833424	2019-12-04 14:16:00 -08:00
Andy Davis	d20d763241	Add canonicalization patterns for vector CreateMaskOp and StridedSliceOp to be used in the unroll vector op transformation. Adds a ConstantMaskOp to the vector ops dialect. Adds the following canonicalization patterns: CreateMaskOp -> ConstantMaskOp StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp PiperOrigin-RevId: 283816752	2019-12-04 13:00:43 -08:00
River Riddle	6f895bec7d	[CSE] NFC: Hash the attribute dictionary pointer instead of the list of attributes. PiperOrigin-RevId: 283810829	2019-12-04 12:32:08 -08:00
Nicolas Vasilache	edfaf925cf	Drop MaterializeVectorTransfers in favor of simpler declarative unrolling Now that we have unrolling as a declarative pattern, we can drop a full pass that has gone stale. In the future we may want to add specific unrolling patterns for VectorTransferReadOp. PiperOrigin-RevId: 283806880	2019-12-04 12:11:42 -08:00
River Riddle	31b3e2248b	NFC: Fix mismatches between LangRef.md and actual parser implementation. PiperOrigin-RevId: 283805832	2019-12-04 12:06:24 -08:00

1 2 3 4 5 ...

3216 Commits All Branches Search

3216 Commits

All Branches