Learning by messing up: A beginner’s tour of Reinforcement Learning

From agents and rewards all the way to the Markov property and your first Gym environment, written like the notes you wish someone had…