Robuta

https://openreview.net/forum?id=S0V9MX9jKe&referrer=%5Bthe%20profile%20of%20Qiaomin%20Xie%5D(%2Fprofile%3Fid%3D~Qiaomin_Xie1)
In reinforcement learning, offline value function learning is the procedure of using an offline dataset to estimate the expected discounted return from each...
value functions, stable, offline, learning, bisimulation
https://www.rug.nl/news/2011/07/26_vinjamoor
PhD ceremony: Mr. H.G. Vinjamoor, 16.15 hrs, Doopsgezinde kerk, Oude Boteringestraat 33, Groningen
linear systems, news articles, bisimulation, university