Blackwell Optimality in Markov Decision Processes with Partial Observation
Dinah Rosenberg, Eilon Solan and Nicolas Vieille
The Annals of Statistics 30 (2002).
A Blackwell ε-optimal strategy in a Markov Decision Process is a strategy that is ε-optimal for every discount factor sufficiently close to 1. We prove the existence of Blackwell ε-optimal strategies in finite Markov Decision Processes with partial observation. Extensions to more general cases are also provided.
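
The definition stated in the abstract can be written out as follows. This is only a sketch under assumed notation, none of which comes from the abstract itself: \lambda is the discount factor, r_n is the stage payoff at stage n, \gamma_\lambda(\sigma) is the normalized discounted payoff of a strategy \sigma, and v_\lambda is the discounted value. Dependence on the initial distribution is suppressed.

```latex
% Assumed notation (not from the abstract): \lambda, r_n, \gamma_\lambda, v_\lambda, \lambda_0.
\[
  \gamma_\lambda(\sigma)
    = (1-\lambda)\,\mathbb{E}_\sigma\!\Big[\sum_{n \ge 1} \lambda^{\,n-1} r_n\Big],
  \qquad
  v_\lambda = \sup_{\sigma} \gamma_\lambda(\sigma).
\]
% \sigma is Blackwell \varepsilon-optimal if a single threshold \lambda_0 works
% for all discount factors close enough to 1:
\[
  \exists\, \lambda_0 \in (0,1) \ \text{such that} \quad
  \gamma_\lambda(\sigma) \ \ge\ v_\lambda - \varepsilon
  \quad \text{for every } \lambda \in [\lambda_0, 1).
\]
```

The point of the definition is that one strategy \sigma must work for all such \lambda simultaneously, rather than a different \varepsilon-optimal strategy being chosen for each discount factor.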