Report an accessibility problem
Engineering  |  FURI

Sahil Badyal

Hometown: Kathua, Jammu and Kashmir, India | Graduation Date: Fall 2021
Computer science

Efficient Policy Iteration Architecture for Learning Rollout Policy in POMDP

Research Theme: Data
MORE: Spring 2020

The research project considers an infinite horizon discounted dynamic programming problem with finite state and control space under partial observability. These problems are hard due to the curse of dimensionality. The work uses a policy iteration algorithm for learning a rollout policy with multi-step lookahead, truncated rollout, and terminal cost function approximation while exploiting distributed computation. The future work aims to use aggregation to further reduce the state space and complexity of the problem and also explore the efficacy of different neural network architectures as approximators. These methods have been applied in simulation to a class of search and rescue problems.

FURI Symposium Totals

TotalProjects

0

FacultyMentors

0

OnlineSymposia

0

ResearchFocus Areas

0

FURIStudents

0

MOREStudents

0

KEENStudents

0

GCSPStudents

0