How Deep Mind Controls Reinforcement Learning Agents from Getting “Too Clever”

Specification gaming is a challenge in reinforcement learning.

Published in

Towards AI

6 min readJun 29, 2022

I recently started an AI-focused educational newsletter, that already has over 125,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep…

How Deep Mind Controls Reinforcement Learning Agents from Getting “Too Clever”

Specification gaming is a challenge in reinforcement learning.

Written by Jesus Rodriguez