Robuta

https://www.graphcore.ai/posts/accelerating-block-sparse-matrix-multiplication-on-ipus-with-popsparse Accelerating Block Sparse Matrix Multiplication on IPUs with PopSparse Introducing PopSparse, a library that enables fast sparse operations on the Graphcore IPU. block sparseacceleratingipus https://antmicro.com/blog/2024/11/llm-optimizations-for-ampere-based-gpus/ Antmicro ยท LLM optimizations for sparse matrix processing on Jetson Orin and other Ampere GPUs sparse matrixantmicrollm