Robuta

https://slideslive.com/39004357/reinforcement-learning-from-human-feedback-a-tutorial-
nathan lambert, reinforcement learning, human feedback, dmitry
https://substack.com/@natolambert
ML researcher making sense of AI research, products, and the uncertain technological future. PhD from Berkeley AI. Experience at Meta, DeepMind, HuggingFace.
nathan lambert, substack
https://www.retortai.com/
Distilling the major events and challenges in the world of artificial intelligence and machine learning, from Thomas Krendl Gilbert and Nathan Lambert. Click...
nathan lambert, retortai, substack