Reinforcement Learning: Dynamic Programming and Monte Carlo — Part 2

Introducing two simple iterative techniques to solve the Markov Decision Process

Published in

Towards AI

12 min readSep 1, 2023

In the previous article — Part 1 — we have formulated the Markov Decision Process (MDP) as a paradigm to solve any Reinforcement Learning (RL) problem. However, the overarching framework discussed did not mention a…

Reinforcement Learning: Dynamic Programming and Monte Carlo — Part 2

Introducing two simple iterative techniques to solve the Markov Decision Process

Written by Tan Pengshi Alvin