Reinforcement Learning: Dynamic Programming and Monte Carlo — Part 2

Introducing two simple iterative techniques to solve the Markov Decision Process

Tan Pengshi Alvin
Published in Towards AI
12 min read · Sep 1, 2023

Image by Wil Stewart on Unsplash

In the previous article, Part 1, we formulated the Markov Decision Process (MDP) as a paradigm for solving any Reinforcement Learning (RL) problem. However, the overarching framework discussed did not mention a…

Data Scientist, AI and Software Engineer @SG. Shares simple code, technical Data Science concepts, and ideas. LinkedIn: https://www.linkedin.com/in/tanpengshi/