https://observablehq.com/@sorami/sizes-of-large-language-models
Inspired by the figure in the DistilBERT paper (Sanh et al., 2019), "DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter".
large language models, sizes, sorami, hisamoto, observable