Hado van Hasselt, research scientist, discusses Markov decision processes and dynamic programming as part of the Advanced Deep Learning & Reinforcement Learning Lectures.
Reinforcement Learning 3: Markov Decision Processes and Dynamic Programming