https://kindxiaoming.github.io/blog/2026/sparse-attention-2/
Sparse attention 2 -- Unattention head, branching dynamics | Ziming Liu
A simple, whitespace theme for academics. Based on [*folio](https://github.com/bogoli/-folio) design.
sparse attentionheadbranchingdynamicsliu
https://papers.neurips.cc/paper_files/paper/2025/hash/00a0ebcad584c59dbc439c2af8793638-Abstract-Conference.html
SALS: Sparse Attention in Latent Space for KV Cache Compression
sparse attentionlatent spacekv cachesalscompression
https://deepwiki.com/deep-spin/adasplash/3.2-sparse-attention-without-block-masking
Sparse Attention without Block Masking | deep-spin/adasplash | DeepWiki
This document covers the `adasplashnoblockmask` implementation, which provides memory-efficient sparse attention computation without using block masking. This...
sparse attentionwithoutblockmaskingdeep
https://repositum.tuwien.at/handle/20.500.12708/113065
reposiTUm: Sparse graph attention networks as efficient ionic liquid potentials
graph attention networkssparseefficientionicliquid
https://www.neuronpedia.org/blog/interp-explorer
Interp Explorer, Circuit Tracing w/ Attention, and Weight-Sparse Transformers | The Residual Stream
Plus Library Updates and Awesome Community Projects
https://aisengtech.com/2025/10/16/Top-AI-&-ML-Research-Updates-LLM-Pruning-and-Sparse-Attention-Breakthroughs/
Top AI & ML Research Updates: LLM Pruning and Sparse Attention Breakthroughs (Oct 16, 2025) - AI...
https://www.deepspeed.ai/tutorials/sparse-attention/
DeepSpeed Sparse Attention - DeepSpeed
In this tutorial we describe how to use DeepSpeed Sparse Attention (SA) and its building-block kernels. The easiest way to use SA is through DeepSpeed...
deepspeedsparseattention