Robuta

https://www.sml-group.cc/blog/2023-optimal-transport-reward/
With the advent of large datasets, offline reinforcement learning (ORL) is a promising framework for learning good decision-making policies without interacting...
optimal transportimitation learningofflinesustainabilitymachine
https://openreview.net/forum?id=4iEoOIQ7nL&referrer=%5Bthe%20profile%20of%20Igor%20Krawczuk%5D(%2Fprofile%3Fid%3D~Igor_Krawczuk1)
A theoretically grounded new algorithm for imitation learning.
imitation learningproximalpoint
https://openreview.net/forum?id=CiEOW1CdKc&referrer=%5Bthe%20profile%20of%20Kai%20Yan%5D(%2Fprofile%3Fid%3D~Kai_Yan1)
Imitation Learning (IL) enables agents to mimic expert behavior by learning from demonstrations. However, traditional IL methods require large amounts of...
imitation learninglatentwassersteinadversarial
https://deepai.org/publication/random-expert-distillation-imitation-learning-via-expert-policy-support-estimation
05/16/19 - We consider the problem of imitation learning from a finite set of expert trajectories, without access to reinforcement signals. T...
imitation learningrandomexpertdistillationvia
https://danieltakeshi.github.io/2019/04/30/il-and-rl/
Imitation Learning (IL) and Reinforcement Learning (RL) are often introduced assimilar, but separate problems. Imitation learning involves a supervisor thatp...
imitation learningcombiningreinforcementusing
https://www.mdpi.com/2072-4292/15/17/4147
In unmanned systems, remote sensing is an approach that collects and analyzes data such as visual images, infrared thermal images, and LiDAR sensor data from a...
imitation learningimage augmentationtransformer modelusingenhanced
https://openreview.net/forum?id=8OPDJOpfZV&referrer=%5Bthe%20profile%20of%20Michael%20Muehlebach%5D(%2Fprofile%3Fid%3D~Michael_Muehlebach1)
We propose Constraint-Aware Diffusion Guidance (CoDiG), a constraint-aware imitation learning framework based on conditional diffusion models. Unlike...
imitation learningconstraintawarediffusionguidance
https://openreview.net/forum?id=5JKuc3vJNU&referrer=%5Bthe%20profile%20of%20Luca%20Viano%5D(%2Fprofile%3Fid%3D~Luca_Viano1)
This paper provides the first expert sample complexity characterization for learning a Nash equilibrium from expert data in Markov Games. We show that a new...
learningequilibriadataefficientmulti
https://gwern.net/doc/www/old.reddit.com/52b1364dcfce9965e94dbba4277a74045e9627d1.html
Most deep learning methods attempt to learn artificial neural networks from scratch, using architectures or neurons or approaches often only very...
middle wayimitation learningwbedrl