Robuta

https://kindxiaoming.github.io/blog/2026/sparse-attention-2/ Sparse attention 2 -- Unattention head, branching dynamics | Ziming Liu A simple, whitespace theme for academics. Based on [*folio](https://github.com/bogoli/-folio) design. sparse attentionheadbranchingdynamicsliu https://papers.neurips.cc/paper_files/paper/2025/hash/00a0ebcad584c59dbc439c2af8793638-Abstract-Conference.html SALS: Sparse Attention in Latent Space for KV Cache Compression sparse attentionlatent spacekv cachesalscompression https://deepwiki.com/deep-spin/adasplash/3.2-sparse-attention-without-block-masking Sparse Attention without Block Masking | deep-spin/adasplash | DeepWiki This document covers the `adasplashnoblockmask` implementation, which provides memory-efficient sparse attention computation without using block masking. This... sparse attentionwithoutblockmaskingdeep https://repositum.tuwien.at/handle/20.500.12708/113065 reposiTUm: Sparse graph attention networks as efficient ionic liquid potentials graph attention networkssparseefficientionicliquid https://www.neuronpedia.org/blog/interp-explorer Interp Explorer, Circuit Tracing w/ Attention, and Weight-Sparse Transformers | The Residual Stream Plus Library Updates and Awesome Community Projects https://aisengtech.com/2025/10/16/Top-AI-&-ML-Research-Updates-LLM-Pruning-and-Sparse-Attention-Breakthroughs/ Top AI & ML Research Updates: LLM Pruning and Sparse Attention Breakthroughs (Oct 16, 2025) - AI... https://www.deepspeed.ai/tutorials/sparse-attention/ DeepSpeed Sparse Attention - DeepSpeed In this tutorial we describe how to use DeepSpeed Sparse Attention (SA) and its building-block kernels. The easiest way to use SA is through DeepSpeed... deepspeedsparseattention