Robuta

https://openreview.net/forum?id=Fm0nDMKBwC&referrer=%5Bthe%20profile%20of%20Hermann%20Kumbong%5D(%2Fprofile%3Fid%3D~Hermann_Kumbong1)
Fine-tuning large language models (LLMs) is increasingly costly as models scale to hundreds of billions of parameters, and even parameter-efficient fine-tuning...
fine tuningaccurateefficientlorallms
https://aclanthology.org/2024.emnlp-main.444/
Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig. Proceedings of the 2024 Conference on Empirical Methods in...
fine tuning llmsnew knowledgeencouragehallucinationsacl
https://unsloth.ai/
Open source fine-tuning & reinforcment learning (RL) for gpt-oss, Llama 4, DeepSeek-R1, Gemma, and Qwen3 LLMs! Beginner friendly.
open sourcefine tuningunslothairl
https://www.coursera.org/learn/generative-ai-advanced-fine-tuning-for-llms?specialization=generative-ai-engineering-with-llms&authMode=signup
Offered by IBM. "Fine-tuning large language models (LLMs) is essential for aligning them with specific business needs, improving accuracy, ... Enroll for free.
generative aifine tuningadvancellmscoursera
https://www.thoughtworks.com/en-gb/insights/decoder/f/fine-tuning-llms
What is fine-tuning? How can it help businesses do more with large language models?
fine tuning llmsunited kingdomthoughtworks
https://huggingface.co/papers/2410.18210
Join the discussion on this paper page
papertowardsunderstandingfragilitymultilingual
https://www.allaboutai.com/ai-agents/fine-tune-llms/
Nov 27, 2025 - I share my top techniques for fine-tuning LLMs, along with best practices, a step-by-step guide, and how it compares to RAG and prompt engineering.
fine tuning llmstop techniquesbest practices
https://www.deeplearning.ai/alpha/short-courses/intro-to-federated-learning-c2/
fine tuningprivate datafederatedllmsdeeplearning
https://www.deeplearning.ai/courses/fine-tuning-and-reinforcement-learning-for-llms-intro-to-post-training/
Apply fine-tuning and reinforcement learning techniques to shape model behavior, improve reasoning, and make LLMs safer and more reliable.
fine tuningreinforcement learningllmsintropost
https://openreview.net/forum?id=pebVFFVs2R
This paper introduces TempSamp-R1, a new reinforcement fine-tuning framework designed to improve the effectiveness of adapting multimodal large language models...
fine tuningeffectivetemporalsamplingreinforcement
https://predibase.com/blog/7-things-you-need-to-know-about-fine-tuning-llms
Discover 7 essential tips for fine-tuning LLMs effectively—from choosing the right base model to avoiding common performance pitfalls.
fine tuning llmsthingsknow
https://openreview.net/forum?id=VT4Ovqg0BW&referrer=%5Bthe%20profile%20of%20Ruijia%20Niu%5D(%2Fprofile%3Fid%3D~Ruijia_Niu1)
From common-sense reasoning to domain-specific tasks, parameter-efficient fine tuning (PEFT) methods for large language models (LLMs) have showcased...
uncertainty quantificationfine tuningfunctionallevelcalibrated
https://research.google/blog/fine-tuning-llms-with-user-level-differential-privacy/
fine tuning llmsuser leveldifferential privacy