https://openreview.net/forum?id=S0V9MX9jKe&referrer=%5Bthe%20profile%20of%20Qiaomin%20Xie%5D(%2Fprofile%3Fid%3D~Qiaomin_Xie1)
In reinforcement learning, offline value function learning is the procedure of using an offline dataset to estimate the expected discounted return from each...
value function, stable, offline, learning, bisimulation