https://neurips.cc/virtual/2024/poster/97595
NeurIPS Poster EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes
reinforcement learningneuripsposterbenchmarkdynamic
https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893
Deep Reinforcement Learning Online Course | Udacity
deep reinforcement learningonline courseudacity
https://www.alexirpan.com/2018/02/14/rl-hard.html
Deep Reinforcement Learning Doesn't Work Yet
June 24, 2018 note: If you want to cite an example from the post, pleasecite the paper which that example came from. If you want to cite thepost as a whole, ...
deep reinforcement learningworkyet
https://www.intechopen.com:443/online-first/1219884
Deep Reinforcement Learning for Robot Navigation: Concepts, Current Trends, Challenges, and Future...
deep reinforcement learningcurrent trendsrobotnavigationconcepts
https://unsloth.ai/docs/models/gpt-oss-how-to-run-and-fine-tune/gpt-oss-reinforcement-learning
gpt-oss Reinforcement Learning | Unsloth Documentation
reinforcement learninggptossunslothdocumentation
https://link.springer.com/article/10.1038/s44320-026-00206-9?error=cookies_not_supported&code=cc600ee2-b8ed-444a-b11e-1243bc664d50
SyntheMol-RL: a flexible reinforcement learning framework for designing easily synthesizable...
Apr 23, 2026 - The rise of antibiotic-resistant pathogens such as Staphylococcus aureus has created an urgent need for new antibiotics. Generative artificial intelligence
reinforcement learningrlflexibleframeworkdesigning
https://www.udacity.com/blog/2025/12/reinforcement-learning-explained-algorithms-examples-and-ai-use-cases.html
Reinforcement Learning Explained: Algorithms, Examples, and AI Use Cases | Udacity
Dec 10, 2025 - Introduction Imagine training a dog to sit. You don’t give it a complete list of instructions; instead, you reward it with a treat every time it performs the...
ai use casesreinforcement learningexplainedalgorithmsexamples
https://www.amazon.jobs/en/jobs/10401674/applied-scientist-ii-reinforcement-learning?cmpid=bsp-amazon-science
Applied Scientist II, Reinforcement Learning - Job ID: 10401674 | Amazon.jobs
Explore corporate jobs and career programs at Amazon, from full-time roles to internships. Join our global teams and create a better future for our customers.
reinforcement learningamazon jobsappliedscientistii
https://www.infoq.com/articles/agent-reinforcement-learning-apache-spark/
Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache...
Jan 30, 2026 - This article introduces a reinforcement learning (RL) approach grounded in Apache Spark that enables distributed computing systems to learn optimal...
big datareinforcement learningautonomousoptimizationmulti
https://farama.org/
The Farama Foundation | Maintaining The World’s Open Source Reinforcement Learning Tools
Maintaining The World’s Open Source Reinforcement Learning Tools
open sourcereinforcement learningfoundationmaintainingtools
https://arxiv.org/abs/2604.21030
[2604.21030] A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control...
Abstract page for arXiv paper 2604.21030: A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control Integration for Linear Systems
systematic reviewreinforcement learningtaxonomymodelpredictive
https://arxiv.org/abs/2507.19457
[2507.19457] GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Abstract page for arXiv paper 2507.19457: GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
reinforcement learningreflectivepromptevolution
https://research.atspotify.com/2023/07/automatic-music-playlist-generation-via-simulation-based-reinforcement-learning
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning | Spotify Research
Reinforcement learning (RL) is an established tool for sequential decision making. In this work, we apply RL to solve an automatic music playlist generation...
music playlistreinforcement learningautomaticgenerationvia
https://www.tue.nl/en/news-and-events/news-overview/10-04-2026-improving-reinforcement-learning-with-transferable-skills-and-flexible-decision-making
Improving reinforcement learning with transferable skills and flexible decision-making
PhD researcher Yucheng Yang investigated how reinforcement learning systems can be made more adaptable to new tasks and changing objectives.
reinforcement learningtransferable skillsdecision makingimprovingflexible
https://www.semanticscholar.org/search?q=Stratified+GRPO%3A+Handling+Structural+Heterogeneity+in+Reinforcement+Learning+of+LLM+Search+Agents.
Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents....
An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease.
reinforcement learningsearch agentshandlingstructuralllm
https://www.codecademy.com/learn/learn-reinforcement-learning-with-gymnasium
Learn Reinforcement Learning with Gymnasium | Codecademy
Learn reinforcement learning fundamentals and build learning agents with Gymnasium in this hands-on Python course.
reinforcement learninggymnasiumcodecademy
https://rlhfbook.com/
Reinforcement Learning from Human Feedback
The Reinforcement Learning from Human Feedback Book
reinforcement learninghumanfeedback
https://arxiv.org/abs/2407.15168
[2407.15168] Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
Abstract page for arXiv paper 2407.15168: Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
deep reinforcement learningbackdoorsneuralactivationspace
https://arxiv.org/abs/2410.23214
[2410.23214] Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
Abstract page for arXiv paper 2410.23214: Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
reinforcement learninggroundingtryingllmsenhanced
https://www.semanticscholar.org/search?q=Task-agnostic+Exploration+in+Reinforcement+Learning.
Task-agnostic Exploration in Reinforcement Learning. | Semantic Scholar
An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease.
reinforcement learningsemantic scholartaskagnosticexploration
https://www.semanticscholar.org/search?q=Efficient+Design+Space+Exploration+for+the+BOOM+Using+SAC-Based+Reinforcement+Learning.
Efficient Design Space Exploration for the BOOM Using SAC-Based Reinforcement Learning. | Semantic...
An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease.
space explorationreinforcement learningefficientdesignboom
https://towardsdatascience.com/benchmarking-tabular-reinforcement-learning-algorithms/
Benchmarking Tabular Reinforcement Learning Algorithms | Towards Data Science
May 6, 2025 - Comparing all methods from Part I of Sutton’s book on gridworld environments
reinforcement learningdata sciencebenchmarkingtabularalgorithms
https://www.fast.ai/posts/2017-07-28-killer-robots.html
fast.ai - Thoughts on OpenAI, reinforcement learning, and killer robots
reinforcement learningkiller robotsfastaithoughts
https://docs.ray.io/en/latest/cluster/kubernetes/examples/verl-post-training.html
Reinforcement Learning with Human Feedback (RLHF) for LLMs with verl on KubeRay — Ray 2.55.1
reinforcement learninghumanfeedbackrlhfllms
https://www.manning.com/preview/reinforcement-learning-from-human-feedback/chapter-1
Reinforcement Learning from Human Feedback
reinforcement learninghumanfeedback
https://ivyzhang.me/rl
Reinforcement Learning
reinforcement learning
https://www.deeplearning.ai/courses/fine-tuning-and-reinforcement-learning-for-llms-intro-to-post-training/
Fine-tuning and Reinforcement Learning for LLMs: Intro to Post-Training - DeepLearning.AI
Feb 2, 2026 - Apply fine-tuning and reinforcement learning techniques to shape model behavior, improve reasoning, and make LLMs safer and more reliable.
fine tuningreinforcement learningllmsintropost
https://deepsense.ai/case-studies/reinforcement-learning-speeds-up-autonomous-driving/
Autonomous Driving R&D with Reinforcement Learning: Faster, Smarter Navigation
Oct 9, 2025 - Learn how deepsense.ai used deep RL to optimize AI models for real-world driving—cutting training time while boosting policy performance.
autonomous drivingreinforcement learningfastersmarternavigation
https://www.geeksforgeeks.org/machine-learning/what-is-reinforcement-learning/
Reinforcement Learning - GeeksforGeeks
Your All-in-One Learning Portal: GeeksforGeeks is a comprehensive educational platform that empowers learners across domains-spanning computer science and...
reinforcement learning
https://www.cwi.nl/en/events/research-semester-programmes/control-theory-and-reinforcement/
Control Theory and Reinforcement Learning: Connections and Challenges
control theoryreinforcement learningconnectionschallenges
https://yewr.github.io/rlfp/
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own
reinforcement learningfoundationletembodiedagent
https://slideslive.com/38922702/contributed-talk-adversarial-policies-attacking-deep-reinforcement-learning
Adam Gleave · Contributed talk: Adversarial Policies: Attacking Deep Reinforcement Learning ·...
In recent years, the use of deep neural networks as function approximators has enabled researchers to extend reinforcement learning techniques to solve...
deep reinforcement learningadamcontributedtalkpolicies
https://www.ibm.com/think/topics/reinforcement-learning
What is reinforcement learning? | IBM
Mar 2, 2026 - In reinforcement learning, an agent learns to make decisions by interacting with an environment. It is used in robotics and other decision-making settings.
what isreinforcement learningibm
https://towardsdatascience.com/revisiting-benchmarking-of-tabular-reinforcement-learning-methods/
Revisiting Benchmarking of Tabular Reinforcement Learning Methods | Towards Data Science
Jul 1, 2025 - Introducing a modular framework and improving model performance.
reinforcement learningdata sciencebenchmarkingtabularmethods
https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide
Reinforcement Learning (RL) Guide | Unsloth Documentation
Learn all about Reinforcement Learning (RL) and how to train your own DeepSeek-R1 reasoning model with Unsloth using GRPO. A complete guide from beginner to...
reinforcement learningrlguideunslothdocumentation
https://huggingface.co/docs/trl/index
TRL - Transformers Reinforcement Learning · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
reinforcement learninghugging facetrltransformers
https://dblp.org/rec/conf/aaai/BaiZM0W24.html
dblp: Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning.
Apr 30, 2026 - Bibliographic details on Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning.
reinforcement learningdblpautomatedriscmicroarchitecture
https://towardsdatascience.com/introduction-to-reinforcement-learning-and-solving-the-multi-armed-bandit-problem-e4ae74904e77/
Introduction to Reinforcement Learning and Solving the Multi-armed Bandit Problem | Towards Data...
reinforcement learningintroductionmultiarmedbandit
https://pwnagotchi.ai/
Pwnagotchi - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
deep reinforcement learningwifi
https://towardsdatascience.com/monte-carlo-methods-for-solving-reinforcement-learning-problems-ff8389d46a3e/
Monte Carlo Methods for Solving Reinforcement Learning Problems | Towards Data Science
monte carloreinforcement learningdata sciencemethodsproblems
https://towardsdatascience.com/introduction-to-approximate-solution-methods-for-reinforcement-learning-2/
Introduction to Approximate Solution Methods for Reinforcement Learning | Towards Data Science
Learn about function approximation and the different choices for approximation functions
reinforcement learningdata scienceintroductionsolutionmethods
https://blog.tensorflow.org/2023/10/simulated-spotify-listening-experiences-reinforcement-learning-tensorflow-tf-agents.html
Simulated Spotify Listening Experiences for Reinforcement Learning with TensorFlow and TF-Agents —...
Spotify shares how they use TensorFlow and Reinforcement Learning to train models offline, translating results to large scale, online performance.
reinforcement learningsimulatedspotifylisteningexperiences
https://arxiv.org/abs/2211.10530
[2211.10530] Provable Defense against Backdoor Policies in Reinforcement Learning
Abstract page for arXiv paper 2211.10530: Provable Defense against Backdoor Policies in Reinforcement Learning
reinforcement learningdefensebackdoorpolicies
https://dblp.org/rec/journals/tvlsi/ChengZZLGCXY25.html
dblp: Efficient Design Space Exploration for the BOOM Using SAC-Based Reinforcement Learning.
Apr 30, 2026 - Bibliographic details on Efficient Design Space Exploration for the BOOM Using SAC-Based Reinforcement Learning.
space explorationreinforcement learningdblpefficientdesign
https://gpuopen.com/learn/announcing-amd-schola-v2-nextgen-rl-unreal-engine/
Announcing AMD Schola v2: Next-generation reinforcement learning for Unreal Engine - AMD GPUOpen
AMD Schola v2 is a major update to the open-source reinforcement learning plugin for Unreal® Engine 5, offering significant improvements in capabilities,...
next generationreinforcement learningunreal engineannouncingamd
https://towardsdatascience.com/tag/reinforcement-learning/
Reinforcement Learning | Towards Data Science
Read articles about Reinforcement Learning in Towards Data Science - the world’s leading publication for data science, data analytics, data engineering,...
reinforcement learningdata science
https://www.manning.com/books/reinforcement-learning-from-human-feedback
Reinforcement Learning from Human Feedback - Nathan Lambert
The authoritative guide for Reinforcement learning from human feedback, alignment, and post-training LLMs. Aligning AI models to human preferences helps them...
reinforcement learninghumanfeedbacknathanlambert
https://instadeep.com/2025/11/breaking-the-performance-ceiling-in-reinforcement-learning/
Breaking the Performance Ceiling in Reinforcement Learning | InstaDeep - Decision-Making AI For The...
Feb 27, 2026 - Just 30 seconds of inference-time search is more effective than days of training for complex RL problems.
reinforcement learningdecision makingbreakingperformanceceiling
https://dblp.org/rec/journals/corr/abs-2006-09497.html
dblp: Task-agnostic Exploration in Reinforcement Learning.
May 1, 2026 - Bibliographic details on Task-agnostic Exploration in Reinforcement Learning.
reinforcement learningdblptaskagnosticexploration
https://huggingface.co/papers/2507.19457
Paper page - GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Join the discussion on this paper page
reinforcement learningpaperreflectivepromptevolution
https://arxiv.org/html/2604.21030v1
A Systematic Review and Taxonomy of Reinforcement Learning-Model Predictive Control Integration for...
systematic reviewreinforcement learningtaxonomymodelpredictive
https://www.manning.com/livevideo/reinforcement-learning-in-motion
Reinforcement Learning in Motion - Phil Tabor
We all learn by interacting with the world around us, constantly experimenting and interpreting the results. Reinforcement learning is a machine learning...
reinforcement learningmotionphiltabor
https://www.manning.com/books/applied-reinforcement-learning
Applied Reinforcement Learning - Hadi Aghazadeh
Optimize business processes, people, and resources using AI and the power of reinforcement learning. Whether you’re finding the best delivery route,...
reinforcement learningapplied
https://arxiv.org/abs/2202.05839
[2202.05839] Abstraction for Deep Reinforcement Learning
Abstract page for arXiv paper 2202.05839: Abstraction for Deep Reinforcement Learning
deep reinforcement learningabstraction
https://speakerdeck.com/shunk031/the-landscape-of-agentic-reinforcement-learning-for-llms-a-survey
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey - Speaker Deck
Apr 3, 2026 - 大規模言語モデル(LLM)に強化学習を組み合わせた「Agentic RL」は,自律的な意思決定や動的な環境適応能力により,人工知能の新たなフロンティアを切り開いています。本資料では,この急速に進化するAgentic RLの全体像を,最新の包括的サーベイ論文「Agentic Reinforcement L…
reinforcement learningspeaker decklandscapeagenticllms
https://www.manning.com/catalog/data-science/deep-learning/deep-reinforcement-learning
Deep Reinforcement Learning books | Manning
Learn more about Deep Reinforcement Learning through expert-written books, eBooks, and practical guides for tech professionals.
deep reinforcement learningbooksmanning
https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide/vision-reinforcement-learning-vlm-rl
Vision Reinforcement Learning (VLM RL) | Unsloth Documentation
Train Vision/multimodal models via GRPO and RL with Unsloth!
reinforcement learningvisionvlmrlunsloth
https://www.cssmayo.com/tag/learning-reinforcement/
Learning Reinforcement Archives : Cssmayo
learningreinforcementarchives
https://ilyakuzovkin.com/
on Neuroscience, AI, Machine Learning, Reinforcement Learning, Robotics, Brain-Computer Interfaces,...
Apr 16, 2026 - Hi and Welcome!This page serves as my intro and a hub for occasional writings and slides on the topics of neuroscience, AI, and pretty much...
machine learningcomputer interfacesneuroscienceaireinforcement
https://heise-academy.de/Videokurse/deep-learning-teil-4-deep-reinforcement-learning
Deep Learning – Teil 4: Deep Reinforcement Learning | heise academy
Mit Deep Reinforcement Learning (DRL) können KI-Agenten eigenständig Strategien entwickeln, um komplexe Prozesse in simulierten Umgebungen zu automatisieren....
deep learningheise academyreinforcement