Skip to content

Latest commit

 

History

History
13 lines (7 loc) · 580 Bytes

0013_Sparse_Matmul.md

File metadata and controls

13 lines (7 loc) · 580 Bytes

Block-Sparse Matrix Multiplication

Link: https://github.com/microsoft/DeepSpeed/blob/master/deepspeed/ops/sparse_attention/matmul.py

Author: Microsoft Deepspeed, Philippe Tillet (adapted from his repo)

Tags: Sparsity, Matmul

Description: A collection of Triton kernels for block-sparse matrix multiplication. Has code for sdd (sparse = dense x dense), dsd (dense = sparse x dense), and dds (dense = dense x sparse) type matrix multiplications.

Triton Version: Triton v2.1.0+

Id in triton index: 0013