Robuta

https://www.amazon.science/publications/learning-two-step-hybrid-policy-for-graph-based-interpretable-reinforcement-learning
Learning two-step hybrid policy for graph-based interpretable reinforcement learning - Amazon...
We present a two-step hybrid reinforcement learning (RL) policy that is designed to generate interpretable and robust hierarchical policies on the RL problem...