buffer-deallocation.mlir - OpenGrok history log for /llvm-project/mlir/test/Dialect/SCF/buffer-deallocation.mlir

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# 10056c82	20-Dec-2023	Matthias Springer <me@m-sp.org>	[mlir][SCF] `scf.parallel`: Make reductions part of the terminator (#75314) This commit makes reductions part of the terminator. Instead of `scf.yield`, `scf.reduce` now terminates the body of `scf [mlir][SCF] `scf.parallel`: Make reductions part of the terminator (#75314) This commit makes reductions part of the terminator. Instead of `scf.yield`, `scf.reduce` now terminates the body of `scf.parallel` ops. `scf.reduce` may contain an arbitrary number of reductions, with one region per reduction. Example: ```mlir %init = arith.constant 0.0 : f32 %r:2 = scf.parallel (%iv) = (%lb) to (%ub) step (%step) init (%init, %init) -> f32, f32 { %elem_to_reduce1 = load %buffer1[%iv] : memref<100xf32> %elem_to_reduce2 = load %buffer2[%iv] : memref<100xf32> scf.reduce(%elem_to_reduce1, %elem_to_reduce2 : f32, f32) { ^bb0(%lhs : f32, %rhs: f32): %res = arith.addf %lhs, %rhs : f32 scf.reduce.return %res : f32 }, { ^bb0(%lhs : f32, %rhs: f32): %res = arith.mulf %lhs, %rhs : f32 scf.reduce.return %res : f32 } } ``` `scf.reduce` operations can no longer be interleaved with other ops in the body of `scf.parallel`. This simplifies the op and makes it possible to assign the `RecursiveMemoryEffects` trait to `scf.reduce`. (This was not possible before because the op was not a terminator, causing the op to be DCE'd.) show more ...
Revision tags: llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2
# ba727ac2	20-Sep-2023	Martin Erhart <merhart@google.com>	[mlir][bufferization][scf] Implement BufferDeallocationOpInterface for scf.reduce.return (#66886) This is necessary to run the new buffer deallocation pipeline as part of the sparse compiler pipeli [mlir][bufferization][scf] Implement BufferDeallocationOpInterface for scf.reduce.return (#66886) This is necessary to run the new buffer deallocation pipeline as part of the sparse compiler pipeline. show more ...
Revision tags: llvmorg-17.0.1, llvmorg-17.0.0
# 66aa9a25	14-Sep-2023	Martin Erhart <merhart@google.com>	[mlir][bufferization] Implement BufferDeallocationopInterface for scf.forall.in_parallel (#66351) The scf.forall.in_parallel terminator operation has a nested graph region with the NoTerminator trai [mlir][bufferization] Implement BufferDeallocationopInterface for scf.forall.in_parallel (#66351) The scf.forall.in_parallel terminator operation has a nested graph region with the NoTerminator trait. Such regions are not supported by the default implementations. Therefore, this commit adds a specialized implementation for this operation which only covers the case where the nested region is empty. This is because after bufferization, ops like tensor.parallel_insert_slice were already converted to memref operations residing int the scf.forall only and the nested region of scf.forall.in_parallel ends up empty. show more ...
# 1356e853	12-Sep-2023	Martin Erhart <merhart@google.com>	[mlir][bufferization] Implement BufferDeallocationopInterface for scf.forall.in_parallel The scf.forall.in_parallel terminator operation has a nested graph region with the NoTerminator trait. Such r [mlir][bufferization] Implement BufferDeallocationopInterface for scf.forall.in_parallel The scf.forall.in_parallel terminator operation has a nested graph region with the NoTerminator trait. Such regions are not supported by the default implementations. Therefore, this commit adds a specialized implementation for this operation which only covers the case where the nested region is empty. This is because after bufferization, ops like tensor.parallel_insert_slice were already converted to memref operations residing int the scf.forall only and the nested region of scf.forall.in_parallel ends up empty. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158979 show more ...