https://openreview.net/forum?id=4iEoOIQ7nL&referrer=%5Bthe%20profile%20of%20Igor%20Krawczuk%5D(%2Fprofile%3Fid%3D~Igor_Krawczuk1)
A theoretically grounded new algorithm for imitation learning.
imitation learningproximalpoint
https://openreview.net/forum?id=CiEOW1CdKc&referrer=%5Bthe%20profile%20of%20Kai%20Yan%5D(%2Fprofile%3Fid%3D~Kai_Yan1)
Imitation Learning (IL) enables agents to mimic expert behavior by learning from demonstrations. However, traditional IL methods require large amounts of...
imitation learninglatentwassersteinadversarial
https://deepai.org/publication/random-expert-distillation-imitation-learning-via-expert-policy-support-estimation
05/16/19 - We consider the problem of imitation learning from a finite set of expert trajectories, without access to reinforcement signals. T...
imitation learningrandomexpertdistillationvia
https://danieltakeshi.github.io/2019/04/30/il-and-rl/
Imitation Learning (IL) and Reinforcement Learning (RL) are often introduced assimilar, but separate problems. Imitation learning involves a supervisor thatp...
imitation learningcombiningreinforcementusing
https://openreview.net/forum?id=8OPDJOpfZV&referrer=%5Bthe%20profile%20of%20Michael%20Muehlebach%5D(%2Fprofile%3Fid%3D~Michael_Muehlebach1)
We propose Constraint-Aware Diffusion Guidance (CoDiG), a constraint-aware imitation learning framework based on conditional diffusion models. Unlike...
imitation learningconstraintawarediffusionguidance
https://openreview.net/forum?id=5JKuc3vJNU&referrer=%5Bthe%20profile%20of%20Luca%20Viano%5D(%2Fprofile%3Fid%3D~Luca_Viano1)
This paper provides the first expert sample complexity characterization for learning a Nash equilibrium from expert data in Markov Games. We show that a new...
learningequilibriadataefficientmulti
https://gwern.net/doc/www/old.reddit.com/52b1364dcfce9965e94dbba4277a74045e9627d1.html
Most deep learning methods attempt to learn artificial neural networks from scratch, using architectures or neurons or approaches often only very...
middle wayimitation learningwbedrl