https://neurips.cc/virtual/2024/poster/97595
NeurIPS Poster EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes
reinforcement learningneuripsposterbenchmarkdynamic
Sponsored https://fantasy.ai/
Create, Chat, and Connect with Your Perfect AI Companion - Fantasy.ai
Upgrade your Fantasy with a next-level AI Companion Platform. Create, Chat, and Connect. Your Fantasy, your Way!
https://www.tue.nl/en/news-and-events/news-overview/10-04-2026-improving-reinforcement-learning-with-transferable-skills-and-flexible-decision-making
Improving reinforcement learning with transferable skills and flexible decision-making
PhD researcher Yucheng Yang investigated how reinforcement learning systems can be made more adaptable to new tasks and changing objectives.
reinforcement learningtransferable skillsdecision makingimprovingflexible
https://www.manning.com/livevideo/reinforcement-learning-in-motion
Reinforcement Learning in Motion - Phil Tabor
We all learn by interacting with the world around us, constantly experimenting and interpreting the results. Reinforcement learning is a machine learning...
reinforcement learningmotionphiltabor
https://towardsdatascience.com/revisiting-benchmarking-of-tabular-reinforcement-learning-methods/
Revisiting Benchmarking of Tabular Reinforcement Learning Methods | Towards Data Science
Jul 1, 2025 - Introducing a modular framework and improving model performance.
reinforcement learningdata sciencebenchmarkingtabularmethods
https://yewr.github.io/rlfp/
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own
reinforcement learningfoundationletembodiedagent
https://huggingface.co/papers/2507.19457
Paper page - GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Join the discussion on this paper page
reinforcement learningpaperreflectivepromptevolution
https://research.atspotify.com/2023/07/automatic-music-playlist-generation-via-simulation-based-reinforcement-learning
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning | Spotify Research
Reinforcement learning (RL) is an established tool for sequential decision making. In this work, we apply RL to solve an automatic music playlist generation...
music playlistreinforcement learningautomaticgenerationvia
https://gpuopen.com/learn/announcing-amd-schola-v2-nextgen-rl-unreal-engine/
Announcing AMD Schola v2: Next-generation reinforcement learning for Unreal Engine - AMD GPUOpen
AMD Schola v2 is a major update to the open-source reinforcement learning plugin for Unreal® Engine 5, offering significant improvements in capabilities,...
next generationreinforcement learningunreal engineannouncingamd
https://www.manning.com/books/reinforcement-learning-from-human-feedback
Reinforcement Learning from Human Feedback - Nathan Lambert
The authoritative guide for Reinforcement learning from human feedback, alignment, and post-training LLMs. Aligning AI models to human preferences helps them...
reinforcement learninghumanfeedbacknathanlambert
https://www.ibm.com/think/topics/reinforcement-learning
What is reinforcement learning? | IBM
Mar 2, 2026 - In reinforcement learning, an agent learns to make decisions by interacting with an environment. It is used in robotics and other decision-making settings.
what isreinforcement learningibm
https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide
Reinforcement Learning (RL) Guide | Unsloth Documentation
Learn all about Reinforcement Learning (RL) and how to train your own DeepSeek-R1 reasoning model with Unsloth using GRPO. A complete guide from beginner to...
reinforcement learningrlguideunslothdocumentation
https://www.cwi.nl/en/events/research-semester-programmes/control-theory-and-reinforcement/
Control Theory and Reinforcement Learning: Connections and Challenges
control theoryreinforcement learningconnectionschallenges
https://www.manning.com/catalog/data-science/deep-learning/deep-reinforcement-learning
Deep Reinforcement Learning books | Manning
Learn more about Deep Reinforcement Learning through expert-written books, eBooks, and practical guides for tech professionals.
deep reinforcement learningbooksmanning
https://towardsdatascience.com/monte-carlo-methods-for-solving-reinforcement-learning-problems-ff8389d46a3e/
Monte Carlo Methods for Solving Reinforcement Learning Problems | Towards Data Science
monte carloreinforcement learningdata sciencemethodsproblems
https://link.springer.com/article/10.1038/s44320-026-00206-9?error=cookies_not_supported&code=cc600ee2-b8ed-444a-b11e-1243bc664d50
SyntheMol-RL: a flexible reinforcement learning framework for designing easily synthesizable...
Apr 23, 2026 - The rise of antibiotic-resistant pathogens such as Staphylococcus aureus has created an urgent need for new antibiotics. Generative artificial intelligence
reinforcement learningrlflexibleframeworkdesigning
https://www.deeplearning.ai/courses/fine-tuning-and-reinforcement-learning-for-llms-intro-to-post-training/
Fine-tuning and Reinforcement Learning for LLMs: Intro to Post-Training - DeepLearning.AI
Feb 2, 2026 - Apply fine-tuning and reinforcement learning techniques to shape model behavior, improve reasoning, and make LLMs safer and more reliable.
fine tuningreinforcement learningllmsintropost
https://www.manning.com/books/applied-reinforcement-learning
Applied Reinforcement Learning - Hadi Aghazadeh
Optimize business processes, people, and resources using AI and the power of reinforcement learning. Whether you’re finding the best delivery route,...
reinforcement learningapplied
https://farama.org/
The Farama Foundation | Maintaining The World’s Open Source Reinforcement Learning Tools
Maintaining The World’s Open Source Reinforcement Learning Tools
open sourcereinforcement learningfoundationmaintainingtools
https://www.udacity.com/blog/2025/12/reinforcement-learning-explained-algorithms-examples-and-ai-use-cases.html
Reinforcement Learning Explained: Algorithms, Examples, and AI Use Cases | Udacity
Dec 10, 2025 - Introduction Imagine training a dog to sit. You don’t give it a complete list of instructions; instead, you reward it with a treat every time it performs the...
ai use casesreinforcement learningexplainedalgorithmsexamples
https://www.amazon.jobs/en/jobs/10401674/applied-scientist-ii-reinforcement-learning?cmpid=bsp-amazon-science
Applied Scientist II, Reinforcement Learning - Job ID: 10401674 | Amazon.jobs
Explore corporate jobs and career programs at Amazon, from full-time roles to internships. Join our global teams and create a better future for our customers.
reinforcement learningamazon jobsappliedscientistii
https://towardsdatascience.com/benchmarking-tabular-reinforcement-learning-algorithms/
Benchmarking Tabular Reinforcement Learning Algorithms | Towards Data Science
May 6, 2025 - Comparing all methods from Part I of Sutton’s book on gridworld environments
reinforcement learningdata sciencebenchmarkingtabularalgorithms
https://arxiv.org/abs/2407.15168
[2407.15168] Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
Abstract page for arXiv paper 2407.15168: Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
deep reinforcement learningbackdoorsneuralactivationspace
https://arxiv.org/abs/2507.19457
[2507.19457] GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Abstract page for arXiv paper 2507.19457: GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
reinforcement learningreflectivepromptevolution
Sponsored https://www.fanvue.com/
Fanvue
The creator subscription platform for the future. Sign up before the end of the month and take home 85%.
https://www.intechopen.com:443/online-first/1219884
Deep Reinforcement Learning for Robot Navigation: Concepts, Current Trends, Challenges, and Future...
deep reinforcement learningcurrent trendsrobotnavigationconcepts
https://rlhfbook.com/
Reinforcement Learning from Human Feedback
The Reinforcement Learning from Human Feedback Book
reinforcement learninghumanfeedback
https://www.codecademy.com/learn/learn-reinforcement-learning-with-gymnasium
Learn Reinforcement Learning with Gymnasium | Codecademy
Learn reinforcement learning fundamentals and build learning agents with Gymnasium in this hands-on Python course.
reinforcement learninggymnasiumcodecademy
https://arxiv.org/abs/2604.21030
[2604.21030] A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control...
Abstract page for arXiv paper 2604.21030: A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control Integration for Linear Systems
systematic reviewreinforcement learningtaxonomymodelpredictive
https://arxiv.org/html/2604.21030v1
A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control Integration for...
systematic reviewreinforcement learningtaxonomymodelpredictive
https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893
Deep Reinforcement Learning Online Course | Udacity
deep reinforcement learningonline courseudacity
https://arxiv.org/abs/2211.10530
[2211.10530] Provable Defense against Backdoor Policies in Reinforcement Learning
Abstract page for arXiv paper 2211.10530: Provable Defense against Backdoor Policies in Reinforcement Learning
reinforcement learningdefensebackdoorpolicies
https://www.alexirpan.com/2018/02/14/rl-hard.html
Deep Reinforcement Learning Doesn't Work Yet
June 24, 2018 note: If you want to cite an example from the post, pleasecite the paper which that example came from. If you want to cite thepost as a whole, ...
deep reinforcement learningworkyet
https://towardsdatascience.com/introduction-to-reinforcement-learning-and-solving-the-multi-armed-bandit-problem-e4ae74904e77/
Introduction to Reinforcement Learning and Solving the Multi-armed Bandit Problem | Towards Data...
reinforcement learningintroductionmultiarmedbandit
Sponsored https://www.naughtycharm.com/
NaughtyCharm
https://arxiv.org/abs/2410.23214
[2410.23214] Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
Abstract page for arXiv paper 2410.23214: Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
reinforcement learninggroundingtryingllmsenhanced
https://arxiv.org/abs/2202.05839
[2202.05839] Abstraction for Deep Reinforcement Learning
Abstract page for arXiv paper 2202.05839: Abstraction for Deep Reinforcement Learning
deep reinforcement learningabstraction
https://blog.tensorflow.org/2023/10/simulated-spotify-listening-experiences-reinforcement-learning-tensorflow-tf-agents.html
Simulated Spotify Listening Experiences for Reinforcement Learning with TensorFlow and TF-Agents —...
Spotify shares how they use TensorFlow and Reinforcement Learning to train models offline, translating results to large scale, online performance.
reinforcement learningsimulatedspotifylisteningexperiences
https://www.geeksforgeeks.org/machine-learning/what-is-reinforcement-learning/
Reinforcement Learning - GeeksforGeeks
Your All-in-One Learning Portal: GeeksforGeeks is a comprehensive educational platform that empowers learners across domains-spanning computer science and...
reinforcement learning
https://towardsdatascience.com/introduction-to-approximate-solution-methods-for-reinforcement-learning-2/
Introduction to Approximate Solution Methods for Reinforcement Learning | Towards Data Science
Learn about function approximation and the different choices for approximation functions
reinforcement learningdata scienceintroductionsolutionmethods
https://pwnagotchi.ai/
Pwnagotchi - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
deep reinforcement learningwifi
https://deepsense.ai/case-studies/reinforcement-learning-speeds-up-autonomous-driving/
Autonomous Driving R&D with Reinforcement Learning: Faster, Smarter Navigation
Oct 9, 2025 - Learn how deepsense.ai used deep RL to optimize AI models for real-world driving—cutting training time while boosting policy performance.
autonomous drivingreinforcement learningfastersmarternavigation
https://www.infoq.com/articles/agent-reinforcement-learning-apache-spark/
Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache...
Jan 30, 2026 - This article introduces a reinforcement learning (RL) approach grounded in Apache Spark that enables distributed computing systems to learn optimal...
big datareinforcement learningautonomousoptimizationmulti
https://huggingface.co/docs/trl/index
TRL - Transformers Reinforcement Learning · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
reinforcement learninghugging facetrltransformers
https://towardsdatascience.com/tag/reinforcement-learning/
Reinforcement Learning | Towards Data Science
Read articles about Reinforcement Learning in Towards Data Science - the world’s leading publication for data science, data analytics, data engineering,...
reinforcement learningdata science
https://speakerdeck.com/shunk031/the-landscape-of-agentic-reinforcement-learning-for-llms-a-survey
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey - Speaker Deck
Apr 3, 2026 - 大規模言語モデル(LLM)に強化学習を組み合わせた「Agentic RL」は,自律的な意思決定や動的な環境適応能力により,人工知能の新たなフロンティアを切り開いています。本資料では,この急速に進化するAgentic RLの全体像を,最新の包括的サーベイ論文「Agentic Reinforcement L…
reinforcement learningspeaker decklandscapeagenticllms
https://heise-academy.de/Videokurse/deep-learning-teil-4-deep-reinforcement-learning
Deep Learning – Teil 4: Deep Reinforcement Learning | heise academy
Mit Deep Reinforcement Learning (DRL) können KI-Agenten eigenständig Strategien entwickeln, um komplexe Prozesse in simulierten Umgebungen zu automatisieren....
deep learningheise academyreinforcement
Sponsored https://seasonedflirt.com/
SeasonedFlirt
Less algorithms. More humans.
https://ilyakuzovkin.com/
on Neuroscience, AI, Machine Learning, Reinforcement Learning, Robotics, Brain-Computer Interfaces,...
Apr 16, 2026 - Hi and Welcome!This page serves as my intro and a hub for occasional writings and slides on the topics of neuroscience, AI, and pretty much...
machine learningcomputer interfacesneuroscienceaireinforcement
https://www.academia.edu/165659755/Learning_Skills_in_Reinforcement_Learning_Using_Relative_Novelty
(PDF) Learning Skills in Reinforcement Learning Using Relative Novelty
We present a method for automatically creating a set of useful temporally-extended actions, or skills, in reinforcement learning. Our method identifies states...
learning skillspdfreinforcementusingrelative