Reinforcement Learning

Learning to Search from Demonstration Sequences

We study the problem of learning to search from demonstration sequences. Online search algorithms, such as Monte Carlo Tree Search …

Dixant Mittal, Liwei Kang, Wee Sun Lee

ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search

A tree-based online search algorithm iteratively simulates trajectories and updates action-values of a set of states stored in a tree …

Dixant Mittal, Siddharth Aravindan, Wee Sun Lee