Robuta

Sponsor of the Day: Jerkmate
https://www.gleave.me/publication/2022-11-imitation/ imitation: Clean Imitation Learning Implementations | Adam Gleave Dec 6, 2022 - imitation provides open-source implementations of imitation and reward learning algorithms in PyTorch. We include three inverse reinforcement learning (IRL)... adam gleaveimitationcleanlearningimplementations https://www.gleave.me/ Adam Gleave Adam Gleave is the CEO of FAR.AI, an alignment research non-profit. His research interests include adversarial robustness and value learning. adam gleave https://slideslive.com/38922702/contributed-talk-adversarial-policies-attacking-deep-reinforcement-learning Adam Gleave · Contributed talk: Adversarial Policies: Attacking Deep Reinforcement Learning ·... In recent years, the use of deep neural networks as function approximators has enabled researchers to extend reinforcement learning techniques to solve... deep reinforcement learningadam gleavecontributedtalkadversarial