Robuta

https://github.com/jingyaogong/minimind GitHub - jingyaogong/minimind: 🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in... 🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h! - jingyaogong/minimind from scratchgithubjingyaogongminimindparameter