Sponsor of the Day:
Jerkmate
https://sebastianraschka.com/llm-architecture-gallery/kv-cache-calculations/
KV Cache / Token (bf16) | Sebastian Raschka, PhD
Apr 3, 2026 - How the LLM Architecture Gallery computes KV cache growth per generated token.
sebastian raschka phdkv cachetokenbf16
https://sebastianraschka.com/llm-architecture-gallery/gqa/
Grouped-Query Attention (GQA) | Sebastian Raschka, PhD
Mar 21, 2026 - A gallery-local explainer for Grouped-Query Attention, based on the architecture comparison articles and LLMs-from-scratch.
sebastian raschka phdgroupedqueryattention
https://www.manning.com/books/build-a-large-language-model-from-scratch
Build a Large Language Model (From Scratch) - Sebastian Raschka
How to implement LLM attention mechanisms and GPT-style transformers.
large language modelsebastian raschkabuildscratch
https://sebastianraschka.com/
Sebastian Raschka, LLM Research Engineer | Sebastian Raschka, PhD
Apr 22, 2026 - Homepage of Sebastian Raschka with recent AI and LLM articles, books, talks, research links, and project resources.
sebastian raschkallm researchengineerphd
https://sebastianraschka.com/llm-architecture-gallery/swa/
Sliding Window Attention (SWA) | Sebastian Raschka, PhD
Mar 14, 2026 - A gallery-local explainer for Sliding Window Attention, based on the architecture comparison articles and LLMs-from-scratch.
sebastian raschka phdsliding windowattentionswa
https://sebastianraschka.com/llm-architecture-gallery/aa-intelligence-index/
Artificial Analysis Intelligence Index | Sebastian Raschka, PhD
Mar 27, 2026 - What the AA Intelligence Index total score and profile fields mean in the LLM Architecture Gallery.
sebastian raschka phdartificial analysisintelligence index
https://sebastianraschka.com/teaching/
Courses | Sebastian Raschka, PhD
Apr 6, 2026 - Course materials and video resources from Sebastian Raschka on machine learning, deep learning, and large language models.
sebastian raschka phdcourses
https://www.manning.com/livevideo/master-and-build-large-language-models
Master and Build Large Language Models - Sebastian Raschka and Abhinav Kimothi
The best way to understand LLMs is to build one yourself. This course gives you that power. In this engaging liveVideo, veteran AI researcher Sebastian Raschka...
large language modelssebastian raschkamasterbuildabhinav
https://2026.pycon.de/keynote-sebastian-raschka/
Sebastian Raschka Keynote — LLMs in 2026 — From Architecture to Production | PyCon DE & PyData 2026
Sebastian Raschka walks through building LLMs with Python — from architecture design to scaling — and reflects on Python's evolving role in the AI stack.
pycon de pydatasebastian raschkakeynotellms2026
https://sebastianraschka.com/llms-from-scratch/ch04/09_rmsnorm/
RMSNorm | Sebastian Raschka, PhD
Mar 14, 2026 - A concept guide to RMSNorm based on recent LLM architecture articles and the local LLMs-from-scratch materials.
sebastian raschka phd
https://sebastianraschka.com/llm-architecture-gallery/qk-norm/
QK-Norm | Sebastian Raschka, PhD
Mar 14, 2026 - A gallery-local explainer for QK-Norm, based on the architecture comparison articles and the local Qwen and OLMo notebooks.
sebastian raschka phdqknorm