Abstract
We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, which handle uncertainty through a minimax approach, can be solved using dynamic programming for small- to medium-sized problems. However, due to the "curse of dimensionality", MDPs that model real-life problems are typically prohibitively large for such approaches. In this work we employ a reinforcement learning approach to tackle this planning problem: we develop a robust approximate dynamic programming method, based on a projected fixed point equation, to approximately solve large-scale robust MDPs. We show that the proposed method provably succeeds under certain technical conditions, and demonstrate its effectiveness through simulation of an option pricing problem. To the best of our knowledge, this is the first attempt to scale up the robust MDP paradigm.
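For intuition, the sketch below illustrates the projected fixed point idea the abstract refers to on a tiny tabular problem: apply a robust Bellman operator (max over actions, min over transition models in an uncertainty set) and project the result back onto the span of a feature matrix. This is a minimal illustration under assumed simplifications, not the paper's algorithm (which is sampling-based and comes with specific convergence conditions); the L1 uncertainty set, the unweighted projection, and the names `worst_case_expectation`, `robust_bellman`, and `projected_robust_vi` are all illustrative choices, not taken from the paper.

```python
import numpy as np

def worst_case_expectation(p0, v, eps):
    """Minimize p @ v over the probability simplex intersected with the
    L1 ball ||p - p0||_1 <= eps (standard greedy solution): move up to
    eps/2 mass from the highest-value states to the lowest-value state."""
    p = p0.copy()
    budget = eps / 2.0
    i_min = np.argmin(v)
    for i in np.argsort(v)[::-1]:
        if i == i_min or budget <= 0:
            continue
        move = min(p[i], budget)
        p[i] -= move
        p[i_min] += move
        budget -= move
    return p @ v

def robust_bellman(v, P, R, gamma, eps):
    """Robust Bellman operator: maximize over actions while an adversary
    picks the worst transition model around the nominal P[a, s, :]."""
    nA, nS, _ = P.shape
    q = np.empty((nA, nS))
    for a in range(nA):
        for s in range(nS):
            q[a, s] = R[a, s] + gamma * worst_case_expectation(P[a, s], v, eps)
    return q.max(axis=0)

def projected_robust_vi(Phi, P, R, gamma=0.95, eps=0.1, iters=200):
    """Projected robust value iteration: apply the robust Bellman operator
    to the current approximation Phi @ w, then least-squares-project the
    result back onto span(Phi). Convergence is not guaranteed in general;
    the paper establishes conditions under which such schemes succeed."""
    w = np.zeros(Phi.shape[1])
    proj = np.linalg.pinv(Phi)  # unweighted least-squares projection
    for _ in range(iters):
        w = proj @ robust_bellman(Phi @ w, P, R, gamma, eps)
    return w

# Toy 3-state, 2-action example with a random nominal model.
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(3), size=(2, 3))   # P[a, s, :] nominal transitions
R = rng.standard_normal((2, 3))              # R[a, s] rewards
Phi = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]])  # linear features
print(projected_robust_vi(Phi, P, R))
```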
| Original language | English |
|---|---|
| Pages (from-to) | 1401-1415 |
| Number of pages | 15 |
| Journal | Proceedings of Machine Learning Research |
| Volume | 32 |
| State | Published - 2014 |
| Event | 31st International Conference on Machine Learning, ICML 2014, Beijing, China, 21 Jun 2014 → 26 Jun 2014 |
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Software