Academic
Academic
Home
Publications
Experience
Education
Projects
Contact
Resume
Light
Dark
Automatic
Reinforcement Learning
Learning to Search from Demonstration Sequences
We study the problem of learning to search from demonstration sequences. Online search algorithms, such as Monte Carlo Tree Search …
Dixant Mittal
,
Liwei Kang
,
Wee Sun Lee
PDF
Cite
ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search
A tree-based online search algorithm iteratively simulates trajectories and updates action-values of a set of states stored in a tree …
Dixant Mittal
,
Siddharth Aravindan
,
Wee Sun Lee
PDF
Cite
Cite
×