Examples of using the CUTLASS Python interface

  • 00_basic_gemm

    Shows how to declare, configure, compile, and run a CUTLASS GEMM using the Python interface

  • 01_epilogue

    Shows how to fuse elementwise activation functions to GEMMs via the Python interface

  • 02_pytorch_extension_grouped_gemm

    Shows how to declare, compile, and run a grouped GEMM operation via the Python interface, and how to export the emitted kernel as a PyTorch CUDA extension.
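
For orientation, the three examples above can be read against a small reference sketch of what each one computes. The code below is plain Python and makes no CUTLASS calls. It assumes the usual GEMM form `D = activation(alpha * (A @ B) + beta * C)`, and it treats a grouped GEMM as a list of independent GEMMs whose sizes may differ. The names `gemm`, `relu`, and `grouped_gemm` are hypothetical helpers for illustration, not part of the CUTLASS Python interface.

```python
# Reference semantics only: plain Python, no CUTLASS calls.
# All function names here are illustrative, not CUTLASS APIs.

def gemm(A, B, C, alpha=1.0, beta=0.0, activation=None):
    """Return activation(alpha * (A @ B) + beta * C) for row-major nested lists."""
    m, k, n = len(A), len(B), len(B[0])
    assert all(len(row) == k for row in A), "inner dimensions must match"
    D = []
    for i in range(m):
        row = []
        for j in range(n):
            acc = sum(A[i][p] * B[p][j] for p in range(k))
            val = alpha * acc + beta * C[i][j]
            # The epilogue is the elementwise step applied to each output value.
            row.append(activation(val) if activation else val)
        D.append(row)
    return D


def relu(x):
    return max(x, 0.0)


def grouped_gemm(As, Bs, Cs, **kwargs):
    """Run independent GEMMs, one per (A, B, C) triple; sizes may differ."""
    return [gemm(A, B, C, **kwargs) for A, B, C in zip(As, Bs, Cs)]


A = [[1.0, -2.0], [3.0, 4.0]]
B = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 0.0], [0.0, 0.0]]

plain = gemm(A, B, C)                    # as in 00_basic_gemm
fused = gemm(A, B, C, activation=relu)   # as in 01_epilogue
groups = grouped_gemm([A, [[2.0]]], [B, [[3.0]]], [C, [[1.0]]], beta=1.0)
```

A result from the CUTLASS Python interface can be checked against a reference like this on small inputs, allowing for floating-point tolerance when using reduced-precision types.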