# Linalg OpDSL

**_Warning: Linalg's OpDSL is currently being [deprecated](https://discourse.llvm.org/t/how-to-add-custom-linalg-named-ops-using-opdsl/83200/2),
with its operations slowly [being moved](https://github.com/llvm/llvm-project/pull/115319) into TableGen's ODS format.
Please refer to the [MLIR Restructuring discussion](https://discourse.llvm.org/t/rfc-mlir-project-charter-and-restructuring/82896)
for more in-depth information._**

A Python-based DSL for authoring Linalg op definitions and generating
`linalg.generic` IR based on them for samples.

The Linalg OpDSL is a high-level DSL for constructing structured op definitions
in a way that can be exported to built-in, named structured ops via
[YAML-based definitions](_index.md/#yaml-gen) or used interactively to emit
corresponding `linalg.generic` IR for the composition.

## Basic usage

The tool is bundled with the MLIR Python bindings. To use it from the CMake
build tree, MLIR must be built with Python bindings enabled
(`-DMLIR_ENABLE_BINDINGS_PYTHON=ON`). Then add the `python_packages/mlir_core`
directory in the build tree to your `PYTHONPATH` environment variable (e.g. `export
PYTHONPATH=$PWD/build/tools/mlir/python_packages/mlir_core`). Optionally, use an
installed MLIR package, if available, to avoid building.

```shell
# Dump the `core_named_ops.py` module as YAML.
python -m mlir.dialects.linalg.opdsl.dump_oplib .ops.core_named_ops
```

Alternatively, run the `$PWD/build/bin/update_core_linalg_named_ops.sh` script,
which is available after building the `mlir-linalg-ods-yaml-gen` target. The tool
is meant for use during both development and runtime, but not as a build tool of
the core compiler: in order to export static named op definitions to be built as
part of the compiler, the corresponding Linalg dialect YAML file must be updated
and reviewed. TODO: Develop a script to automate op updates to these files.

## Language Guide

The language presented here is loosely inspired by the
[Tensor Comprehensions](https://arxiv.org/pdf/1802.04730.pdf) work, adapted to
represent linalg structured ops.

This tool is new and rapidly evolving. For language examples, refer to the
built-in ops in the `mlir.dialects.linalg.opdsl.ops` package
(`mlir/python/mlir/dialects/linalg/opdsl/ops` in the repository).

Using a matmul as an example, we will decompose the language:

```python
# The DSL constructs used below (TV, S, D, TensorDef, TypeFn, ...) come from
# the OpDSL language module.
from mlir.dialects.linalg.opdsl.lang import *

T1 = TV.T1
T2 = TV.T2

@linalg_structured_op
def matmul(A=TensorDef(T1, S.M, S.K),
           B=TensorDef(T2, S.K, S.N),
           C=TensorDef(U, S.M, S.N, output=True)):
  """Performs a matrix multiplication of two 2D inputs.

  Numeric casting is performed on the operands to the inner multiply, promoting
  them to the same data type as the accumulator/output.
  """
  domain(D.m, D.n, D.k)
  defines(Canonicalizer)
  implements(ContractionOpInterface)
  C[D.m, D.n] += TypeFn.cast_signed(
      U, A[D.m, D.k]) * TypeFn.cast_signed(U, B[D.k, D.n])
```

Here we have a simple type polymorphic contraction that takes arguments `A` and
`B` and outputs `C`. Each is bound to a `TensorDef`, which specifies:

*   The symbolic element type (`T1`, `T2`, `U` above).
*   Symbolic shape expressions with symbols that are bound globally for the op
    (note that in this simple example, the shape expressions are just symbol
    references, but they are permitted to be a constrained set of affine
    expressions).
*   Usage (`output=True`).

The docstring will be transferred to the op definition verbatim.

An explicit iteration domain dimension order can be declared for the op via
`domain(D.d0[, D.d1...])`.

Special identifying op interfaces can be declared for the op via
`implements(interface1[, interface2...])`.

Extra method definitions can be declared for the op via
`defines(definition1[, definition2...])`.

## Parameters

Structured operations take two types of runtime parameters, namely scalars and
tensors. While scalars are inputs only, a tensor may be marked as an output.
Assignment expressions index the tensor parameters to access the individual
elements, while scalars can be accessed directly.

The following example demonstrates the use of the two parameter types:

```python
@linalg_structured_op
def copy_and_scale(val=ScalarDef(T),
                   I=TensorDef(T, S.M, S.K),
                   O=TensorDef(T, S.M, S.K, output=True)):
  """Scale the input by the scalar value and store the result"""
  O[D.m, D.n] = I[D.m, D.n] * val
```

The operation scales the elements of the input tensor `I` by the value `val`
and writes the result to the output tensor `O`. The scalar `val` is bound to a
`ScalarDef`, which specifies the type of the scalar operand. The tensors are
bound to a `TensorDef` as demonstrated by the matmul example. All parameters
appear in the parameter list of the operation:

```python
copy_and_scale(val, in_tensor, outs=[out_tensor])
```

## Index Attributes

Index attributes are compile-time constant parameters only accessible in index
expressions. They can be used to parameterize the access pattern of a structured
operation, for example, by setting its strides. They cannot take part in the
actual computation.

The following example demonstrates the use of index attributes:

```python
@linalg_structured_op
def strided_copy(I=TensorDef(T, S.IH, S.IW),
                 O=TensorDef(T, S.OH, S.OW, output=True),
                 strides=IndexAttrDef(S.SH, S.SW, default=[1, 1])):
  """Copy a subset of the input tensor elements to the output tensor"""
  O[D.oh, D.ow] = I[D.oh * S.SH, D.ow * S.SW]
```

The operation implements a strided copy from the input tensor `I` to the output
tensor `O`. The `strides` attribute is bound to an `IndexAttrDef`. It defines
the symbols `S.SH` and `S.SW`, which are used to index the input tensor `I`.
When instantiating the operation, the attribute is set using a named argument:

```python
strided_copy(in_tensor, outs=[out_tensor], strides=[1, 2])
```

The `strides` vector elements substitute the symbols `S.SH` and `S.SW` in the
index expressions of the operation instance. If no strides are provided, the
`default` vector elements are used instead.
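
The effect of substituting the stride symbols can be modeled in plain Python.
The following is only a sketch of the access pattern, not OpDSL code: the
helper name `strided_copy_reference` is hypothetical, nested lists stand in for
tensors, and the output shape is passed explicitly rather than taken from an
output tensor.

```python
def strided_copy_reference(inp, out_shape, strides=(1, 1)):
    """Pure-Python model of O[oh, ow] = I[oh * SH, ow * SW]."""
    sh, sw = strides  # values substituted for S.SH and S.SW
    out_h, out_w = out_shape
    return [[inp[oh * sh][ow * sw] for ow in range(out_w)]
            for oh in range(out_h)]


# A 4x4 input whose elements encode their own (row, column) position.
inp = [[10 * r + c for c in range(4)] for r in range(4)]

# Without strides, the default [1, 1] is used.
default_copy = strided_copy_reference(inp, (2, 2))
# With strides=[1, 2], every second column is read.
strided = strided_copy_reference(inp, (2, 2), strides=(1, 2))
```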

Index attributes are currently limited to integer vectors and only accessible in
index expressions. An operation may have multiple attributes, all of which are
placed at the end of the parameter list after the output tensors.

## Shape-Only Tensors

Structured operations derive the iteration space given the sizes of the input
and output tensors. Certain operations need shape-only tensors that are not
accessed and exist purely for the sake of specifying the iteration domain. An
example is the pooling operation that takes a shape-only tensor to define the
iteration space of the reduction. As shape-only tensors have no uses, the
`TensorDef` takes an additional optional `index_dims` parameter to map the shape
to index dimensions.

The following example demonstrates the index dimension annotation:

```python
@linalg_structured_op
def pooling_poly(
    I=TensorDef(T1, S.N, S.H, S.W, S.C),
    K=TensorDef(T2, S.KH, S.KW, index_dims=[D.kh, D.kw]),
    O=TensorDef(U, S.N, S.OH, S.OW, S.C, output=True),
    strides=IndexAttrDef(S.SH, S.SW, default=[1, 1]),
    dilations=IndexAttrDef(S.DH, S.DW, default=[1, 1])):
  O[D.n, D.oh, D.ow, D.c] += TypeFn.cast_signed(U,
          I[D.n, D.oh * S.SH + D.kh * S.DH, D.ow * S.SW + D.kw * S.DW, D.c])
```

The pooling operation does not access the shape-only tensor `K`. Instead, the
shapes `S.KH` and `S.KW` specify the iteration domain for the reduction
dimensions `D.kh` and `D.kw`.
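
The role of the shape-only tensor can be illustrated with a plain Python loop
nest. This is a simplified sketch rather than the generated IR: it drops the
batch and channel dimensions, uses nested lists for tensors, and the name
`pooling_sum_reference` is hypothetical. Only the shape of `K` is passed in,
since its elements are never read.

```python
def pooling_sum_reference(inp, kernel_shape, out_shape,
                          strides=(1, 1), dilations=(1, 1)):
    """Model of O[oh, ow] += I[oh * SH + kh * DH, ow * SW + kw * DW]."""
    sh, sw = strides
    dh, dw = dilations
    kh_size, kw_size = kernel_shape  # S.KH and S.KW bound the reduction
    out_h, out_w = out_shape
    out = [[0] * out_w for _ in range(out_h)]
    for oh in range(out_h):
        for ow in range(out_w):
            for kh in range(kh_size):      # reduction dimension D.kh
                for kw in range(kw_size):  # reduction dimension D.kw
                    out[oh][ow] += inp[oh * sh + kh * dh][ow * sw + kw * dw]
    return out


inp = [[4 * r + c for c in range(4)] for r in range(4)]
# A 2x2 window moved with stride 2 covers the input in four disjoint tiles.
pooled = pooling_sum_reference(inp, (2, 2), (2, 2), strides=(2, 2))
```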

## Assignments

The bulk of the language consists of assignment expressions of the form above.
The iteration dimension order is determined lexically based on the order
encountered in the expression (following operator precedence if math operators
are used). TODO: Introduce a directive to fix the dimension bindings.

Reduction dimensions are inferred to be any dimensions on the RHS that are not
on the LHS.
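
The inference rule for reduction dimensions can be written down in a few lines
of Python. This is a model of the rule as stated above, not the OpDSL
implementation; the function name `reduction_dims` and the use of plain strings
for dimensions are assumptions made for illustration.

```python
def reduction_dims(lhs_dims, rhs_dims):
    """Return the RHS dimensions that do not appear on the LHS, in RHS order."""
    seen = set(lhs_dims)
    result = []
    for dim in rhs_dims:
        if dim not in seen:
            seen.add(dim)  # report each reduction dimension only once
            result.append(dim)
    return result


# C[m, n] += A[m, k] * B[k, n]: `k` appears only on the RHS.
matmul_reductions = reduction_dims(["m", "n"], ["m", "k", "k", "n"])
# O[m, n] = I[m, n] * val: every dimension is parallel.
pointwise_reductions = reduction_dims(["m", "n"], ["m", "n"])
```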

A number of unary and binary arithmetic functions are supported:

*   `BinaryFn.add(a, b)` (also via overloading the binary `+` operator)
*   `BinaryFn.mul(a, b)` (also via overloading the binary `*` operator)
*   `BinaryFn.max_signed(a, b)`
*   `BinaryFn.min_signed(a, b)`
*   `BinaryFn.sub(a, b)` (also via overloading the binary `-` operator)
*   `BinaryFn.max_unsigned(a, b)`
*   `BinaryFn.min_unsigned(a, b)`
*   `UnaryFn.exp(a)`
*   `UnaryFn.log(a)`

As the integer types are signless, signedness is implemented by different
functions that treat integers as signed or unsigned values.

A subset of the arithmetic functions is supported in reductions. These
reduction functions can appear as the outermost function on the RHS:

*   `ReduceFn.add` (also overloading the inplace `+=` on a LHS)
*   `ReduceFn.mul`
*   `ReduceFn.max_signed`
*   `ReduceFn.min_signed`
*   `ReduceFn.max_unsigned`
*   `ReduceFn.min_unsigned`

As the integer types are signless, signedness is implemented by different
functions that treat integers as signed or unsigned values.

Additionally, type conversion functions cast an operand to a target type:

*   `TypeFn.cast_signed(TypeVar, operand)`
*   `TypeFn.cast_unsigned(TypeVar, operand)`

As the integer types are signless, signedness is implemented by different
functions that treat integers as signed (`TypeFn.cast_signed`) or unsigned
(`TypeFn.cast_unsigned`) values.

There are also special forms:

*   `const(value)` returns a constant value.
*   `index(dim)` returns the iteration index in the given dimension `dim`.
226f925cSTobias Gysi
24357fecSgysit## Function Attributes
24357fecSgysit
24357fecSgysitFunction attributes are compile-time constant function parameters. They can be
24357fecSgysitused to parameterize the computation performed by a structured operation, for
24357fecSgysitexample, to support signed and unsigned computations.
24357fecSgysit
24357fecSgysitThe following example demonstrates the use of function attributes:
24357fecSgysit
24357fecSgysit```python
24357fecSgysit@linalg_structured_op
24357fecSgysitdef elemwise_binary(
24357fecSgysit    lhs=TensorDef(T1),
24357fecSgysit    rhs=TensorDef(T2),
24357fecSgysit    O=TensorDef(U, output=True),
24357fecSgysit    fun=BinaryFnAttrDef(default=BinaryFn.add),
e9085d0dSgysit    cast=TypeFnAttrDef(default=TypeFn.cast_signed)):
24357fecSgysit  O[None] = fun(cast(U, lhs[None]), cast(U, rhs[None]))
24357fecSgysit```
24357fecSgysit
24357fecSgysitThe `fun` and `cast` function attributes by default are aliases for their
e9085d0dSgysitdefault values `BinaryFn.add` and `TypeFn.cast_signed`, respectively. When
24357fecSgysitinstantiating the operation, the function attributes may be set to other
24357fecSgysitfunctions using optional named arguments:
24357fecSgysit
24357fecSgysit```python
24357fecSgysitelemwise_binary(lhs, rhs, outs=[out_tensor],
24357fecSgysit                fun=BinaryFn.mul, cast=TypeFn.cast_unsigned)
24357fecSgysit```
24357fecSgysit
24357fecSgysitIn the example, the `fun` and `cast` arguments adapt the body of the operation
24357fecSgysitto implement multiplication and unsigned casts instead of addition and signed
24357fecSgysitcasts.
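
As a loose analogy, the same pattern can be expressed with ordinary Python
callables and default arguments. The sketch below is not OpDSL code: the name
`elemwise_binary_reference` is hypothetical, flat lists stand in for tensors,
and `operator.add`, `operator.mul`, `int`, and `float` merely stand in for the
`BinaryFn` and `TypeFn` functions.

```python
import operator


def elemwise_binary_reference(lhs, rhs, fun=operator.add, cast=int):
    """Apply `fun` elementwise after casting both operands with `cast`."""
    return [fun(cast(a), cast(b)) for a, b in zip(lhs, rhs)]


# The defaults are used when no function arguments are given.
added = elemwise_binary_reference([1, 2, 3], [4, 5, 6])
# Named arguments swap in a different computation and conversion.
multiplied = elemwise_binary_reference([1, 2, 3], [4, 5, 6],
                                       fun=operator.mul, cast=float)
```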

OpDSL supports unary, binary, and type conversion function attributes. An
operation can take multiple attributes of different kinds placed at the end of
the parameter list.

## Types

All types in assignment expressions are late bound based on actual input and
output types of constructed ops. The exceptions are predefined types such as
`I32`, `I64`, `F32`, and `F64`. These hardwired types enable intermediate
computations with a type that is independent of the input and output types. For
example, parts of floating point computation may require double precision
arithmetic despite all inputs and outputs being single precision values.
Assignment expressions with no `TypeFn.cast_signed` calls will generally require
uniform types throughout and will fail to verify if violated. The presence of a
`TypeFn.cast_signed` or `TypeFn.cast_unsigned` allows for a limited form of
numeric type conversion between element types that can be derived from inputs
and outputs (and in the future, attributes). `TypeFn.cast_signed` calls with a
`TypeVar` first argument are emitted as `type_fn` primitives in the YAML
definition.

Casting will perform `int<->float` and `index->int` type conversions and will
perform any necessary extension or truncation within the type family. The
integer types themselves are signless and signedness is implemented by
functions/operations. The `TypeFn.cast_signed` function treats all integers as
signed, while `TypeFn.cast_unsigned` treats them as unsigned.

The following examples illustrate the lowering of signed and unsigned functions:

*   cast_signed(I32 -> I64) -> `arith.ExtSIOp`
*   cast_signed(F32 -> I32) -> `arith.FPToSIOp`
*   cast_unsigned(I32 -> I64) -> `arith.ExtUIOp`
*   cast_unsigned(F32 -> I32) -> `arith.FPToUIOp`
*   max_signed -> `arith.MaxSIOp`
*   max_unsigned -> `arith.MaxUIOp`

Not all functions are applicable for all numeric types, and on mismatch, op
verification will fail.
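
Why signless integers need separate signed and unsigned functions can be shown
with a small bit-level model in plain Python. The helper names
`extend_signed` and `extend_unsigned` are hypothetical and only mimic how the
same bit pattern is interpreted under sign extension and zero extension; they
are not part of OpDSL or the `arith` dialect.

```python
def extend_signed(bits, width):
    """Interpret the low `width` bits as two's complement (sign extension)."""
    bits &= (1 << width) - 1
    sign_bit = 1 << (width - 1)
    return bits - (1 << width) if bits & sign_bit else bits


def extend_unsigned(bits, width):
    """Interpret the low `width` bits as an unsigned value (zero extension)."""
    return bits & ((1 << width) - 1)


pattern = 0xFF  # an 8-bit signless integer with all bits set
signed_value = extend_signed(pattern, 8)
unsigned_value = extend_unsigned(pattern, 8)

# The same two bit patterns compare differently under each interpretation.
max_signed = max(extend_signed(0xFF, 8), extend_signed(0x01, 8))
max_unsigned = max(extend_unsigned(0xFF, 8), extend_unsigned(0x01, 8))
```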

## Pointwise Computations

Pointwise computations are expressible in a rank polymorphic form that supports
operands of arbitrary rank (all of which must have the same rank) with a
single operation definition.

An example of a rank polymorphic operation is `fill`:

```python
@linalg_structured_op
def fill(value=ScalarDef(T1),
         O=TensorDef(U, output=True)):
  O[None] = TypeFn.cast_signed(U, value)
```

The operation sets the elements of the output tensor `O` to `value`. All
operands are either scalars or rank zero tensors that are accessed using the
index `None`. The operation thus performs a scalar computation that trivially
extends to a multi-dimensional pointwise computation. As a result, we may use
`fill` with output tensors of arbitrary rank:

```python
tensor_2d = tensor.EmptyOp([4, 8], f32)
tensor_3d = tensor.EmptyOp([4, 8, 16], f32)
fill(value, outs=[tensor_2d])
fill(value, outs=[tensor_3d])
```
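
The way a scalar definition extends to any rank can be mimicked in plain
Python with a recursive helper. This is only an illustrative sketch: the name
`fill_reference` is hypothetical and nested lists stand in for tensors of
different ranks.

```python
def fill_reference(value, out):
    """Return a copy of `out` with every element replaced by `value`."""
    if isinstance(out, list):
        # Recurse through one more dimension; the scalar rule is unchanged.
        return [fill_reference(value, item) for item in out]
    return value


# The same definition handles a 2-D and a 3-D output.
filled_2d = fill_reference(1.0, [[0.0] * 3 for _ in range(2)])
filled_3d = fill_reference(1.0, [[[0.0] * 2 for _ in range(2)]
                                 for _ in range(2)])
```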