llvm-project

Commit Graph

Author	SHA1	Message	Date
Jerry Wu	ad7c49bef7	[mlir][linalg] Fix padding size calculation for Conv2d ops. This patch fixed the padding size calculation for Conv2d ops when the stride > 1. It contains the changes below: - Use addBound to add constraint for AffineApplyOp in getUpperBoundForIndex. So the result value can be mapped and retrieved later. - Fixed the bound from AffineMinOp by adding as a closed bound. Originally the bound was added as an open upper bound, which results in the incorrect bounds when we multiply the values. For example: ``` %0 = affine.min affine_map<()[s0] -> (4, -s0 + 11)>()[iv0] %1 = affine.apply affine_map<()[s0] -> (s0 * 2)>()[%0] If we add the affine.min as an open bound, addBound will internally transform it into the close bound "%0 <= 3". The following sliceBounds will derive the bound of %1 as "%1 <= 6" and return the open bound "%1 < 7", while the correct bound should be "%1 <= 8". ``` - In addition to addBound, I also changed sliceBounds to support returning closed upper bound, since for the size computation, we usually care about the closed bounds. - Change the getUpperBoundForIndex to favor constant bounds when required. The sliceBounds will return a tighter but non-constant bounds, which can't be used for padding. The constantRequired option requires getUpperBoundForIndex to get the constant bounds when possible. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124821	2022-05-09 08:45:37 -07:00
gysit	d26c42af57	[mlir][linalg] Control dimensions to pad. This revision supports padding only a subset of the iteration dimensions via an additional padding-dimensions parameter. This control allows us to pad an operation in multiple steps. For example, one may want to pad only the output dimensions of a producer matmul fused into a consumer loop nest, before tiling and padding its reduction dimension. Depends On D122309 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D122560	2022-03-28 14:39:57 +00:00
gysit	58d0da885e	[mlir][linalg] Use arrays to pass padding options. Pass the padding options using arrays instead of lambdas. In particular pass the padding value as string and use the argument parser to create the padding value. Arrays are a more natural choice that matches the current use cases and avoids converting arrays to lambdas. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D122309	2022-03-28 13:49:05 +00:00
gysit	b1b57f8104	[mlir][linalg] Support padding LinalgOps in use-def chain. Previously, only LinalgOps whose operands are defined by an ExtractSliceOp could be padded. The revision supports walking a use-def chain of LinalgOps to find an ExtractSliceOp. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D122116	2022-03-24 10:44:44 +00:00
gysit	1d259f9f02	[mlir][affine] Add affine.min / affine.max canonicalization. The revision introduces a affine.min and affine.max canonicalization pattern that orders the result expressions. It flattens the result expressions to arrays of dimension and symbol coefficients plus one constant coefficient and rearranges them in lexicographic order. Without the pattern, CSE will not eliminate two affine.min / affine.max operation if the results are ordered differently. For example, the operations ``` %1 = affine.min affine_map<(d0) -> (8, -d0 + 27)>(%arg4) %2 = affine.min affine_map<(d0) -> (-d0 + 27, 8)>(%arg4) ``` doe not CSE. After applying the pattern, the two operations are equivalent ``` %1 = affine.min affine_map<(d0) -> (8, -d0 + 27)>(%arg4) %2 = affine.min affine_map<(d0) -> (8, -d0 + 27)>(%arg4) ``` which enables CSE. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D121819	2022-03-22 07:17:19 +00:00
gysit	7294be2b8e	[mlir][linalg] Replace linalg.fill by OpDSL variant. The revision removes the linalg.fill operation and renames the OpDSL generated linalg.fill_tensor operation to replace it. After the change, all named structured operations are defined via OpDSL and there are no handwritten operations left. A side-effect of the change is that the pretty printed form changes from: ``` %1 = linalg.fill(%cst, %0) : f32, tensor<?x?xf32> -> tensor<?x?xf32> ``` changes to ``` %1 = linalg.fill ins(%cst : f32) outs(%0 : tensor<?x?xf32>) -> tensor<?x?xf32> ``` Additionally, the builder signature now takes input and output value ranges as it is the case for all other OpDSL operations: ``` rewriter.create<linalg::FillOp>(loc, val, output) ``` changes to ``` rewriter.create<linalg::FillOp>(loc, ValueRange{val}, ValueRange{output}) ``` All other changes remain minimal. In particular, the canonicalization patterns are the same and the `value()`, `output()`, and `result()` methods are now implemented by the FillOpInterface. Depends On D120726 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120728	2022-03-14 10:51:08 +00:00
Alexander Belyaev	fd0c6f5391	[mlir] Move linalg::PadTensorOp to tensor::PadOp. RFC: https://llvm.discourse.group/t/rfc-move-linalg-padtensorop-to-tensor-padop/5785 Differential Revision: https://reviews.llvm.org/D117892	2022-01-21 20:02:39 +01:00
River Riddle	d75c3e8396	[mlir] Don't print `// no predecessors` on entry blocks Entry blocks can never have predecessors, so this is unnecessary. Fixes #53287 Differential Revision: https://reviews.llvm.org/D117713	2022-01-19 15:57:58 -08:00
gysit	db7a2e9176	[mlir][linalg] Only compose PadTensorOps if no ExtractSliceOp is rank-reducing. Do not compose pad tensor operations if the extract slice of the outer pad tensor operation is rank reducing. The inner extract slice op cannot be rank-reducing since it source type must match the desired type of the padding. Depends On D115359 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115428	2021-12-13 13:01:30 +00:00
gysit	f895e95138	[mlir][linalg] Make padding work for rank-reducing slice ops. Adapt the computation of a static bounding box to take rank-reducing slice operations into account by filtering out reduced size one dimensions. The revision is needed to make padding work for decomposed convolution operations. The decomposition introduces rank reducing extract slice operations that previously let padding fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115336	2021-12-13 12:34:20 +00:00
gysit	7f7103cd06	[mlir][linalg] Use top down traversal for padding. Pad the operation using a top down traversal. The top down traversal unlocks folding opportunities and dim op canonicalizations due to the introduced extract slice operation after the padded operation. Depends On D114585 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114689	2021-11-30 15:30:45 +00:00
Tobias Gysi	b6e7b1be73	[mlir][linalg] Simplify padding test (NFC). The padding tests previously contained the tile loops. This revision removes the tile loops since padding itself does not consider the loops. Instead the induction variables are passed in as function arguments which promotes them to symbols in the affine expressions. Note that the pad-and-hoist.mlir test still exercises padding in the context of the full loop nest. Depends On D114175 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114227	2021-11-24 19:21:50 +00:00
Tobias Gysi	86f186efea	[mlir][linalg] Add makeComposedPadHighOp. Add the makeComposedPadHighOp method which creates a new PadTensorOp if necessary. If the source to pad is actually the result of a sequence of padded LinalgOps, the method checks if padding is needed or if we can use the padded result of the padded LinalgOp sequence directly. Example: ``` %0 = tensor.extract_slice %arg0 [%iv0, %iv1] [%sz0, %sz1] %1 = linalg.pad_tensor %0 low[0, 0] high[...] { linalg.yield %cst } %2 = linalg.matmul ins(...) outs(%1) %3 = tensor.extract_slice %2 [0, 0] [%sz0, %sz1] ``` when padding %3 return %2 instead of introducing ``` %4 = linalg.pad_tensor %3 low[0, 0] high[...] { linalg.yield %cst } ``` Depends On D114161 Reviewed By: nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D114175	2021-11-24 19:18:59 +00:00
Tobias Gysi	a4fd8cb76f	[mlir][linalg] Update failure conditions for padOperandToSmallestStaticBoundingBox. Change the failure condition of padOperandToSmallestStaticBoundingBox to never fail if the operand is already statically sized. In particular: - if the padding value computation fails -> return failure if the operand shape is dynamic and success if it is static. - if there is no extract slice op -> return failure if the operand shape is dynamic and success if it is static. The latter change prevents padding from failure if the output operand passed by iteration argument is statically sized since in this case the extract / insert slice pairs are removed by canonicalization. Depends On D114153 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114161	2021-11-24 19:10:50 +00:00
Tobias Gysi	0c7c532643	[mli][linalg] Use CodegenStrategy to test padding (NFC). Use CodegenStrategy instead of a separate test pass to test padding. Depends On D113409 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113410	2021-11-10 15:00:06 +00:00
Tobias Gysi	b86b2309ce	[mlir][linalg] Use AffineApplyOp to compute padding width (NFC). Use AffineApplyOp instead of SubIOp to compute the padding width when creating a pad tensor operation. Depends On D113382 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113404	2021-11-10 14:53:52 +00:00
Tobias Gysi	0609eb1b32	[mlir][linalg] Remove padding from tiling options. Remove the padding options from the tiling options since padding is now implemented by a separate pattern/pass introduced in https://reviews.llvm.org/D112412. The revsion remove the tile-and-pad-tensors.mlir and replaces it with the pad.mlir that tests padding in isolation (without tiling). Similarly, hoist-padding.mlir is replaced by pad-and-hoist.mlir introduced in https://reviews.llvm.org/D112713. Depends On D112838 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113382	2021-11-10 13:33:28 +00:00

17 Commits