History log of /llvm-project/mlir/test/Examples/NVGPU/Ch2.py (Results 1 – 2 of 2)
Revision    Date    Author    Comments
# f8ff9094 24-Jun-2024 Guray Ozen <guray.ozen@gmail.com>

[mlir][gpu] Add py binding for AsyncTokenType (#96466)

This PR adds a Python binding for `AsyncTokenType`.
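
A minimal sketch of the new binding in use (my own example, not from the commit; it assumes the standard upstream Python packaging under `mlir.ir` / `mlir.dialects.gpu` and the usual parameterless `Type.get()` pattern):

```python
# Minimal sketch (assumption: AsyncTokenType follows the usual
# `Type.get()` pattern of the MLIR Python bindings).
from mlir.ir import Context
from mlir.dialects import gpu

with Context() as ctx:
    # Make sure the gpu dialect is loaded before constructing its types.
    ctx.load_all_available_dialects()
    token_ty = gpu.AsyncTokenType.get()
    print(token_ty)  # expected to print: !gpu.async.token
```

Within such a context, the type can then serve as the result type of async gpu operations built from Python.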


Revision tags: llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5
# 4d330820 24-Apr-2024 Guray Ozen <guray.ozen@gmail.com>

[mlir][nvgpu] NVGPU Tutorials (#87065)

I presented a tutorial at EuroLLVM 2024 ([Zero to Hero: Programming Nvidia
Hopper Tensor Core with MLIR's NVGPU
Dialect](https://llvm.swoogo.com/2024eurollvm/session/2086997/zero-to-hero-programming-nvidia-hopper-tensor-core-with-mlir's-nvgpu-dialect)).
For that, I implemented the tutorial code in Python. The focus is the nvgpu
dialect and how to use its advanced features. I thought it might be
useful to upstream this.

The tutorial chapters are as follows:
- **Ch0.py:** Hello World
- **Ch1.py:** 2D Saxpy
- **Ch2.py:** 2D Saxpy using TMA
- **Ch3.py:** GEMM 128x128x64 using Tensor Core and TMA
- **Ch4.py:** Multistage performant GEMM using Tensor Core and TMA
- **Ch5.py:** Warp Specialized GEMM using Tensor Core and TMA

I might implement one more chapter:

- **Ch6.py:** Warp Specialized Persistent ping-pong GEMM

This PR also introduces the `nvdsl` class, which makes IR building in the
tutorial easier; a rough sketch of the style follows.
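
For a feel of what `nvdsl` provides, here is an illustrative sketch, not verbatim tutorial code: the decorator names follow tools/nvdsl.py in this directory, but the exact parameters, the problem size, and the kernel body are my assumptions.

```python
# Rough sketch only: NVDSL.mlir_func / NVDSL.mlir_gpu_launch are the
# tutorial's helpers (tools/nvdsl.py); details here are illustrative
# assumptions, not the exact interface.
from tools.nvdsl import *  # only importable from within this examples dir
from mlir.dialects import arith, gpu, memref

M, N = 256, 32  # hypothetical problem size


@NVDSL.mlir_func
def saxpy(x, y, alpha):
    # The decorated Python body is traced to build MLIR IR rather than
    # executed eagerly.
    @NVDSL.mlir_gpu_launch(grid=(M, 1, 1), block=(N, 1, 1))
    def saxpy_kernel():
        bidx = gpu.block_id(gpu.Dimension.x)
        tidx = gpu.thread_id(gpu.Dimension.x)
        x_val = memref.load(x, [bidx, tidx])
        y_val = memref.load(y, [bidx, tidx])
        # y[bidx, tidx] = x[bidx, tidx] * alpha + y[bidx, tidx]
        res = arith.addf(arith.mulf(x_val, alpha), y_val)
        memref.store(res, y, [bidx, tidx])

    saxpy_kernel()
```

The design point, per the commit message, is that the decorators hide the boilerplate of building functions and gpu.launch regions, so each chapter can focus on the nvgpu features themselves.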
