# Chapter 4: Enabling Generic Transformation with Interfaces

[TOC]

## Background: Grappling with an Extensible IR

Through dialects, MLIR allows for the representation of many different levels of
abstraction; the Toy dialect that we have previously defined is one such
example. Though these different dialects may represent different abstractions,
there is often a set of common transformations and analyses that we would like
to perform. The problem that arises is that naively implementing each
transformation for each dialect leads to large amounts of code duplication, as
the internal algorithms are generally very similar, if not the same. We would
like to provide the ability for transformations to opaquely hook into dialects
like Toy to get the information they need.

MLIR provides a set of always-available hooks for certain core transformations,
as seen in the [previous chapter](Ch-3.md), where we registered some
canonicalizations via a hook on our operations (`getCanonicalizationPatterns`).
However, these types of hooks don't really scale well. Therefore, a more generic
solution was designed, in the form of [interfaces](../../Interfaces.md), to make
the MLIR infrastructure as extensible as the representation. Interfaces provide
a generic mechanism for dialects and operations to provide information to a
transformation or analysis.

## Shape Inference: Preparing for Code Generation

Our Toy IR currently operates on generic tensors, meaning that we don't know the
shape of tensors other than during the initialization of constants. This
complicates optimizations, as well as code generation. Fortunately, we can
simply propagate the shapes through the computation until they are all known.
The issue is how to handle calls to user-defined generic functions: every call
site could deduce different shapes. One possibility would be to perform symbolic
inference based on the argument types, but this would be hard to generalize if
we were to introduce more control flow in the language. Another approach would
be function specialization, where every call site with new argument shapes
duplicates the called function and specializes it. The approach we take for Toy
is to inline all of the function calls, then perform intraprocedural shape
propagation.

### Inlining

Here we could write an inlining algorithm specifically designed for the Toy
dialect, but that can become quite complicated depending on the level of
complexity that we want. Disregarding cost modeling, the pure structural
transformation is already complex to implement from scratch. Thankfully, MLIR
provides a generic inliner algorithm that dialects can plug into. All we need to
do in Toy is to provide the [interfaces](../../Interfaces.md) for the inliner to
hook into.

The first thing we need to do is to define the constraints on inlining
operations in the Toy dialect. This information is provided through a
[dialect interface](../../Interfaces.md/#dialect-interfaces). This is essentially
a class containing a set of virtual hooks which the dialect can override.
In this case, the interface is `DialectInlinerInterface`.

```c++
/// This class defines the interface for handling inlining with Toy operations.
/// We simply inherit from the base interface class and override
/// the necessary methods.
struct ToyInlinerInterface : public DialectInlinerInterface {
  using DialectInlinerInterface::DialectInlinerInterface;

  /// This hook checks to see if the given callable operation is legal to inline
  /// into the given call. For Toy this hook can simply return true, as the Toy
  /// Call operation is always inlinable.
  bool isLegalToInline(Operation *call, Operation *callable,
                       bool wouldBeCloned) const final {
    return true;
  }

  /// This hook checks to see if the given operation is legal to inline into the
  /// given region. For Toy this hook can simply return true, as all Toy
  /// operations are inlinable.
  bool isLegalToInline(Operation *, Region *, bool,
                       IRMapping &) const final {
    return true;
  }

  /// This hook checks if the given 'src' region can be inlined into the 'dest'
  /// region. The regions here are the bodies of the callable functions. For
  /// Toy, any function can be inlined, so we simply return true.
  bool isLegalToInline(Region *dest, Region *src, bool wouldBeCloned,
                       IRMapping &valueMapping) const final {
    return true;
  }

  /// This hook is called when a terminator operation has been inlined. The only
  /// terminator that we have in the Toy dialect is the return
  /// operation (toy.return). We handle the return by replacing the values
  /// previously returned by the call operation with the operands of the
  /// return.
  void handleTerminator(Operation *op,
                        MutableArrayRef<Value> valuesToRepl) const final {
    // Only "toy.return" needs to be handled here.
    auto returnOp = cast<ReturnOp>(op);

    // Replace the values directly with the return operands.
    assert(returnOp.getNumOperands() == valuesToRepl.size());
    for (const auto &it : llvm::enumerate(returnOp.getOperands()))
      valuesToRepl[it.index()].replaceAllUsesWith(it.value());
  }
};
```

Additionally, the inliner will only discard unused function definitions that
have private visibility. We therefore also set the visibility of every function
except `main` to private in the MLIR generator.

```c++
/// Emit a new function and add it to the MLIR module.
mlir::toy::FuncOp mlirGen(FunctionAST &funcAST) {
  ...
  // If this function isn't main, then set the visibility to private.
  if (funcAST.getProto()->getName() != "main")
    function.setPrivate();

  return function;
}
```

We then register our dialect interface directly on the Toy dialect, similarly to
how we did for operations.

```c++
void ToyDialect::initialize() {
  addInterfaces<ToyInlinerInterface>();
}
```

Next, we need to provide a way for the inliner to know that `toy.generic_call`
represents a call, and `toy.func` represents a function. MLIR provides
[operation interfaces](../../Interfaces.md/#attributeoperationtype-interfaces) that can be used
to mark an operation as being "call-like" or "callable-like". Unlike dialect interfaces,
operation interfaces provide a more refined granularity of information that is specific
and core to a single operation. The interfaces that we will be adding here are
the `CallOpInterface` and `CallableOpInterface`.

To add these interfaces, we just need to include their definitions in our
operation specification file (`Ops.td`):

```tablegen
include "mlir/Interfaces/CallInterfaces.td"
```

and add them to the traits lists of `FuncOp` and `GenericCallOp`:

```tablegen
def FuncOp : Toy_Op<"func",
    [DeclareOpInterfaceMethods<CallableOpInterface>]> {
  ...
}

def GenericCallOp : Toy_Op<"generic_call",
    [DeclareOpInterfaceMethods<CallOpInterface>]> {
  ...
}
```

In the above, we also use the `DeclareOpInterfaceMethods` directive to
auto-declare all of the interface methods in the class declarations of `FuncOp`
and `GenericCallOp`. This means that we just need to provide a definition:

```c++
/// Returns the region on the function operation that is callable.
Region *FuncOp::getCallableRegion() { return &getBody(); }

// ....

/// Return the callee of the generic call operation, this is required by the
/// call interface.
CallInterfaceCallable GenericCallOp::getCallableForCallee() {
  return getAttrOfType<SymbolRefAttr>("callee");
}

/// Set the callee for the generic call operation, this is required by the call
/// interface.
void GenericCallOp::setCalleeFromCallable(CallInterfaceCallable callee) {
  (*this)->setAttr("callee", callee.get<SymbolRefAttr>());
}

/// Get the argument operands to the called function, this is required by the
/// call interface.
Operation::operand_range GenericCallOp::getArgOperands() { return inputs(); }
```
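
With these interface methods in place, a generic transformation can reason about
any "call-like" operation purely through `CallOpInterface`, without knowing
anything about `toy.generic_call` itself. As a rough illustration (the helper
below is hypothetical and not part of the tutorial code, and assumes
`mlir/Interfaces/CallInterfaces.h` is included):

```c++
/// Hypothetical helper: inspect an arbitrary operation and, if it is
/// "call-like", retrieve its callee through the interface.
static void inspectCall(mlir::Operation *op) {
  if (auto call = llvm::dyn_cast<mlir::CallOpInterface>(op)) {
    // The callee is either a symbol reference or an SSA value; for
    // toy.generic_call it is the "callee" symbol returned above.
    mlir::CallInterfaceCallable callee = call.getCallableForCallee();
    (void)callee;
  }
}
```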

Now that the inliner has been informed about the Toy dialect, we can add the
inliner pass to the pass manager for Toy:

```c++
  pm.addPass(mlir::createInlinerPass());
```

Now let's look at a working example:

```mlir
toy.func @multiply_transpose(%arg0: tensor<*xf64>, %arg1: tensor<*xf64>) -> tensor<*xf64> {
  %0 = toy.transpose(%arg0 : tensor<*xf64>) to tensor<*xf64>
  %1 = toy.transpose(%arg1 : tensor<*xf64>) to tensor<*xf64>
  %2 = toy.mul %0, %1 : tensor<*xf64>
  toy.return %2 : tensor<*xf64>
}
toy.func @main() {
  %0 = toy.constant dense<[[1.000000e+00, 2.000000e+00, 3.000000e+00], [4.000000e+00, 5.000000e+00, 6.000000e+00]]> : tensor<2x3xf64>
  %1 = toy.reshape(%0 : tensor<2x3xf64>) to tensor<2x3xf64>
  %2 = toy.constant dense<[1.000000e+00, 2.000000e+00, 3.000000e+00, 4.000000e+00, 5.000000e+00, 6.000000e+00]> : tensor<6xf64>
  %3 = toy.reshape(%2 : tensor<6xf64>) to tensor<2x3xf64>
  %4 = toy.generic_call @multiply_transpose(%1, %3) : (tensor<2x3xf64>, tensor<2x3xf64>) -> tensor<*xf64>
  %5 = toy.generic_call @multiply_transpose(%3, %1) : (tensor<2x3xf64>, tensor<2x3xf64>) -> tensor<*xf64>
  toy.print %5 : tensor<*xf64>
  toy.return
}
```

We have two calls to `multiply_transpose` that we would like to inline into
`main`, but if we look at the output, nothing has changed. We are missing one
last subtle piece: there is a hidden type conversion on the edge of the call. If
we look at the above, the operands to the `generic_call` are of type
`tensor<2x3xf64>`, while the inputs to the function expect `tensor<*xf64>`. To
resolve this difference, the inliner expects an explicit cast operation to be
inserted. For this, we need to add a new operation to the Toy dialect,
`ToyCastOp` (`toy.cast`), to represent casts between two different shapes.

```tablegen
def CastOp : Toy_Op<"cast", [
    DeclareOpInterfaceMethods<CastOpInterface>,
    Pure,
    SameOperandsAndResultShape]
  > {
  let summary = "shape cast operation";
  let description = [{
    The "cast" operation converts a tensor from one type to an equivalent type
    without changing any data elements. The source and destination types
    must both be tensor types with the same element type. If both are ranked,
    then shape is required to match. The operation is invalid if converting
    to a mismatching constant dimension.
  }];

  let arguments = (ins F64Tensor:$input);
  let results = (outs F64Tensor:$output);
  let assemblyFormat = "$input attr-dict `:` type($input) `to` type($output)";
}
```

Note that the definition of this cast operation adds a `CastOpInterface` to the
traits list. This interface provides several utilities for cast-like operations,
such as folding identity casts and verification. We hook into this interface by
providing a definition for the `areCastCompatible` method:

```c++
/// Returns true if the given set of input and result types are compatible with
/// this cast operation. This is required by the `CastOpInterface` to verify
/// this operation and provide other additional utilities.
bool CastOp::areCastCompatible(TypeRange inputs, TypeRange outputs) {
  if (inputs.size() != 1 || outputs.size() != 1)
    return false;
  // The inputs must be Tensors with the same element type.
  TensorType input = inputs.front().dyn_cast<TensorType>();
  TensorType output = outputs.front().dyn_cast<TensorType>();
  if (!input || !output || input.getElementType() != output.getElementType())
    return false;
  // The shape is required to match if both types are ranked.
  return !input.hasRank() || !output.hasRank() || input == output;
}
```

With a proper cast operation, we can now override the necessary hook on the
`ToyInlinerInterface` to insert it for us when necessary:

```c++
struct ToyInlinerInterface : public DialectInlinerInterface {
  ...

  /// Attempts to materialize a conversion for a type mismatch between a call
  /// from this dialect, and a callable region. This method should generate an
  /// operation that takes 'input' as the only operand, and produces a single
  /// result of 'resultType'. If a conversion can not be generated, nullptr
  /// should be returned.
  Operation *materializeCallConversion(OpBuilder &builder, Value input,
                                       Type resultType,
                                       Location conversionLoc) const final {
    return builder.create<CastOp>(conversionLoc, resultType, input);
  }
};
```

If we run the working example through the pipeline again, we get the expected
result:

```mlir
toy.func @main() {
  %0 = toy.constant dense<[[1.000000e+00, 2.000000e+00, 3.000000e+00], [4.000000e+00, 5.000000e+00, 6.000000e+00]]> : tensor<2x3xf64>
  %1 = toy.constant dense<[[1.000000e+00, 2.000000e+00, 3.000000e+00], [4.000000e+00, 5.000000e+00, 6.000000e+00]]> : tensor<2x3xf64>
  %2 = toy.cast %1 : tensor<2x3xf64> to tensor<*xf64>
  %3 = toy.cast %0 : tensor<2x3xf64> to tensor<*xf64>
  %4 = toy.transpose(%2 : tensor<*xf64>) to tensor<*xf64>
  %5 = toy.transpose(%3 : tensor<*xf64>) to tensor<*xf64>
  %6 = toy.mul %4, %5 : tensor<*xf64>
  toy.print %6 : tensor<*xf64>
  toy.return
}
```

NOTE: The generic inliner will also perform simplifications, so the output may
be a bit cleaner than expected.

### Intraprocedural Shape Inference

Now that we have inlined all of the functions, we are left with a main function
containing a mix of static and dynamically shaped operations. We can now write a
simple shape inference pass to propagate shapes intraprocedurally (within a
single function). We could write this as a pass that directly encodes the
constraints of the operations within the Toy dialect, but this seems like a good
candidate for a transformation that could be written generically. As a good rule
of thumb, it is best to express a transformation as generically as possible,
such that it can be extended to other dialects in the future. There is no
telling how many other dialects may have similar needs or encounter the same
problems.

For shape inference, if we break down the problem to its core, we really just
want operations to tell us the expected outputs given a set of statically known
inputs. (We can definitely get more complex than that, but for our needs we can
keep it simple.) Given that this property is core to a specific operation, we
can define an operation interface that can be specified on operations that need
to have their result shapes inferred.

Similarly to operations, we can also
[define operation interfaces](../../Interfaces.md/#attributeoperationtype-interfaces) using
the operation definition specification (ODS) framework.

The interface is defined by inheriting from `OpInterface`, which takes the name
to be given to the generated C++ interface class as a template argument. For our
purposes, we will simply name the generated class `ShapeInference`. We also
provide a description for the interface.

```tablegen
def ShapeInferenceOpInterface : OpInterface<"ShapeInference"> {
  let description = [{
    Interface to access a registered method to infer the return types for an
    operation that can be used during type inference.
  }];
}
```

Next, we define the interface methods that the operations will need to provide.
An interface method consists of: a description; a C++ return type in string
form; a method name in string form; and a few optional components, depending on
the need. See the
[ODS documentation](../../Interfaces.md/#attributeoperationtype-interfaces) for more
information.

```tablegen
def ShapeInferenceOpInterface : OpInterface<"ShapeInference"> {
  ...

  let methods = [
    InterfaceMethod<"Infer and set the output shape for the current operation.",
                    "void", "inferShapes">
  ];
}
```

Now that the interface is defined, we can add it to the necessary Toy operations
in a similar way to how we added the `CallOpInterface` to the `GenericCallOp`:

```tablegen
def MulOp : Toy_Op<"mul",
    [..., DeclareOpInterfaceMethods<ShapeInferenceOpInterface>]> {
  ...
}
```

Each of these operations will then need to provide a definition for the
`inferShapes()` method. As an example, for the mul op, the result shape is
inferred as the shape of the inputs.

```c++
/// Infer the output shape of the MulOp, this is required by the shape inference
/// interface.
void MulOp::inferShapes() { getResult().setType(getLhs().getType()); }
```
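
For an operation whose result shape is not simply the shape of an operand, the
hook computes the new type explicitly. As a sketch, closely following the
transpose op in the Toy example (the exact casting helpers may differ between
MLIR versions):

```c++
/// Infer the output shape of the TransposeOp: the result type is the operand
/// tensor type with its dimensions reversed.
void TransposeOp::inferShapes() {
  auto arrayTy = llvm::cast<RankedTensorType>(getOperand().getType());
  SmallVector<int64_t, 2> dims(llvm::reverse(arrayTy.getShape()));
  getResult().setType(RankedTensorType::get(dims, arrayTy.getElementType()));
}
```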

At this point, each of the necessary Toy operations provides a mechanism by
which to infer its output shapes. The `ShapeInferencePass` will operate on
functions: it will run on each function in isolation. MLIR also supports general
[OperationPasses](../../PassManagement.md/#operation-pass) that run on any
isolated operation, but here our module only contains functions, so there is no
need to generalize to all operations.

Implementing such a pass is done by creating a class inheriting from
`mlir::OperationPass<FuncOp>` and overriding the `runOnOperation()` method.

```c++
class ShapeInferencePass
    : public mlir::PassWrapper<ShapeInferencePass, OperationPass<FuncOp>> {
  void runOnOperation() override {
    FuncOp function = getOperation();
    ...
  }
};
```

While we're at it, let's also create a helper method for instantiating the pass:

```c++
std::unique_ptr<mlir::Pass> mlir::toy::createShapeInferencePass() {
  return std::make_unique<ShapeInferencePass>();
}
```

The shape inference algorithm operates as follows (a condensed sketch of the
resulting worklist loop follows the list):

1.  Build a worklist containing all the operations that return a dynamically
    shaped tensor: these are the operations that need shape inference.
2.  Iterate on the worklist:
    -   find an operation to process: the next ready operation in the worklist
        has all of its arguments non-generic,
    -   if no operation is found, break out of the loop,
    -   remove the operation from the worklist,
    -   infer the shape of its output from the argument types.
3.  If the worklist is empty, the algorithm succeeded.
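
A condensed sketch of this worklist loop, closely following the pass in the Toy
example (the predicates `returnsDynamicShape` and `allOperandsInferred` are
small static helpers over the result and operand types, defined alongside the
pass):

```c++
void runOnOperation() override {
  FuncOp function = getOperation();

  // Populate the worklist with the operations that need shape inference:
  // these are operations that return a dynamically shaped tensor.
  llvm::SmallPtrSet<mlir::Operation *, 16> opWorklist;
  function.walk([&](mlir::Operation *op) {
    if (returnsDynamicShape(op))
      opWorklist.insert(op);
  });

  // Iterate on the operations in the worklist until all operations have been
  // inferred or no change happened (fix point).
  while (!opWorklist.empty()) {
    // Find the next operation ready for inference, that is an operation with
    // all of its operands already resolved (non-generic).
    auto nextop = llvm::find_if(opWorklist, allOperandsInferred);
    if (nextop == opWorklist.end())
      break;

    Operation *op = *nextop;
    opWorklist.erase(op);

    // ... infer the shape of 'op' here, as shown in the snippet below.
  }

  // If the worklist isn't empty, some operations could not be inferred.
  if (!opWorklist.empty()) {
    function.emitError("Shape inference failed, ")
        << opWorklist.size() << " operations couldn't be inferred\n";
    signalPassFailure();
  }
}
```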

When processing an operation as described above, we query whether it registers
the `ShapeInference` interface, using this code snippet:

```c++
  // Ask the operation to infer its output shapes.
  LLVM_DEBUG(llvm::dbgs() << "Inferring shape for: " << *op << "\n");

  /// We check if an operation has a particular interface by casting.
  if (ShapeInference shapeOp = dyn_cast<ShapeInference>(op)) {
    shapeOp.inferShapes();
  } else {
    op->emitError("unable to infer shape of operation without shape "
                  "inference interface");
    return signalPassFailure();
  }
```

We can then add our pass to the pass manager:

```c++
  pm.addPass(mlir::createShapeInferencePass());
```

If we rerun our original example, we now get the following:

```mlir
toy.func @main() {
  %0 = toy.constant dense<[[1.000000e+00, 2.000000e+00, 3.000000e+00], [4.000000e+00, 5.000000e+00, 6.000000e+00]]> : tensor<2x3xf64>
  %1 = toy.transpose(%0 : tensor<2x3xf64>) to tensor<3x2xf64>
  %2 = toy.mul %1, %1 : tensor<3x2xf64>
  toy.print %2 : tensor<3x2xf64>
  toy.return
}
```

You can build `toyc-ch4` and try it yourself:
`toyc-ch4 test/Examples/Toy/Ch4/codegen.toy -emit=mlir -opt`.

In the [next chapter](Ch-5.md), we will start the process of code generation by
targeting a lower-level dialect for optimizing some of the more compute-heavy
Toy operations.