https://en.wikipedia.org/wiki/Mechanistic_interpretability
Mechanistic interpretability - Wikipedia
https://novaknown.com/tag/mechanistic-interpretability/
mechanistic interpretability Archives - Novaknown
The study of how neural networks process information internally by identifying the circuits and components responsible for their outputs.
https://news.ysimulator.run/item/8703
TinyInterp - Local app for mechanistic interpretability of transformer models | Hacker News...
https://open2interp.substack.com/p/applying-network-motif-analysis-to
Borrowing a tool from systems biology for mechanistic interpretability
TL;DR: I have been analyzing attribution graphs manually and found it to be tedious and hard to scale.