Robuta

https://en.wikipedia.org/wiki/Mechanistic_interpretability
Mechanistic interpretability - Wikipedia

https://novaknown.com/tag/mechanistic-interpretability/
mechanistic interpretability Archives - Novaknown
The study of how neural networks process information internally by identifying the circuits and components responsible for their outputs.

https://news.ysimulator.run/item/8703
TinyInterp - Local app for mechanistic interpretability of transformer models | Hacker News...

https://open2interp.substack.com/p/applying-network-motif-analysis-to
Borrowing a tool from systems biology for mechanistic interpretability
TL;DR: I have been analyzing attribution graphs manually and found it to be tedious and hard to scale.