How DeepMind Keeps Reinforcement Learning Agents from Getting “Too Clever”

Specification gaming is a challenge in reinforcement learning.

Jesus Rodriguez
Published in Towards AI
6 min read · Jun 29, 2022

Image Credit: DeepMind

I recently started an AI-focused educational newsletter that already has over 125,000 subscribers. TheSequence is a no-BS (meaning no hype, no news, etc.) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep…

CEO of IntoTheBlock, President of Faktory, President of NeuralFabric and founder of The Sequence, Lecturer at Columbia University, Wharton, Angel Investor...