Trading Performance for Stability in Markov Decision Processes