Gridworld

Learning objectives: define a gridworld MDP (grid cells as states; actions up, down, left, right; transitions; terminal states); understand how hitting the boundary keeps the agent in place (or wraps, depending on the design); use gridworld as the running example for policy evaluation and policy iteration.

What is Gridworld? Gridworld is a simple MDP used throughout RL teaching and research. The environment is a grid of cells (e.g. 4×4 or 5×5), and the state is the agent’s position \((i, j)\). Actions are typically up, down, left, and right. Transitions: taking an action moves the agent one cell in that direction; if the move would go off the grid, the agent either stays in place (usually receiving the same step reward) or the world wraps around, depending on the specification. ...
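The transition rule described above can be sketched in Python. This is a minimal illustration, not the article's implementation: a 4×4 grid with stay-in-place boundaries, a −1 step reward, and two example terminal corners (all of these choices are assumptions).

```python
# Minimal 4x4 gridworld step function: states are (row, col) pairs;
# moves that would leave the grid keep the agent in place.
N = 4  # grid side length (assumed)
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
TERMINALS = {(0, 0), (N - 1, N - 1)}  # example terminal corners (assumed)

def step(state, action):
    """Return (next_state, reward, done) for one move."""
    if state in TERMINALS:
        return state, 0.0, True            # terminal states absorb
    di, dj = ACTIONS[action]
    ni, nj = state[0] + di, state[1] + dj
    if not (0 <= ni < N and 0 <= nj < N):
        ni, nj = state                     # off-grid move: stay in place
    reward = -1.0                          # same step reward either way
    return (ni, nj), reward, (ni, nj) in TERMINALS
```

A wrapping variant would instead compute `ni % N, nj % N`; the article notes both designs are used, depending on the specification.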

March 10, 2026 · 2 min · 356 words · codefrydev

Anaconda Environment Setup

Learning objectives: create a dedicated conda environment for the curriculum; install Python and key packages in that environment; activate and use the environment for running exercises.

Why use a conda environment? A conda environment isolates the curriculum’s Python and packages from your system or other projects. You can pin a specific Python version and install NumPy, PyTorch, Gym, etc. without affecting other work, and if something breaks, you can simply recreate the environment. ...
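The workflow above can be sketched with standard conda commands (the environment name rl-curriculum and the Python version are illustrative choices, not prescribed by the article):

```shell
# Create a dedicated environment with a pinned Python version
conda create -n rl-curriculum python=3.10

# Activate it before working on exercises
conda activate rl-curriculum

# Install the curriculum's core packages into the environment
conda install numpy matplotlib

# If something breaks, remove the environment and recreate it
conda deactivate
conda remove -n rl-curriculum --all
```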

March 10, 2026 · 2 min · 237 words · codefrydev

RL Framework

This page covers the core RL framework you need for the preliminary assessment: the four main components, the Markov property, exploration vs. exploitation, and the discount factor. Back to Preliminary.

Why this matters for RL: every RL problem is defined by who acts (the agent), what it interacts with (the environment), what it observes (state), what it can do (actions), and what feedback it gets (reward). The Markov property and the discount factor shape how we define value functions and algorithms, and exploration vs. exploitation is the central tension in learning from experience. ...
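The discount factor mentioned above determines how future rewards are weighted in the return \(G_t = r_t + \gamma r_{t+1} + \gamma^2 r_{t+2} + \dots\). A small sketch of that computation (function name and default \(\gamma\) are illustrative):

```python
# Discounted return: G = r_0 + gamma*r_1 + gamma^2*r_2 + ...
def discounted_return(rewards, gamma=0.9):
    """Fold rewards from the end using G_t = r_t + gamma * G_{t+1}."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g

print(discounted_return([1.0, 1.0, 1.0], gamma=0.5))  # 1 + 0.5 + 0.25 = 1.75
```

With \(\gamma\) close to 0 the agent is myopic; with \(\gamma\) close to 1 it weighs distant rewards almost as heavily as immediate ones.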

March 10, 2026 · 6 min · 1198 words · codefrydev

Setting Up Your Environment

Learning objectives: know what software you need (Python, libraries, an optional IDE); perform a pre-installation check so you are ready for the curriculum.

Pre-Installation Check. Before diving into the curriculum, ensure you have: Python, version 3.8 or higher (3.9–3.11 recommended); check with python3 --version or python --version. pip, so you can install packages; check with pip --version or pip3 --version. Optional but recommended: a virtual environment (venv or conda) so curriculum dependencies do not conflict with other projects; see Anaconda Setup for conda. Libraries used in the curriculum: NumPy, Matplotlib, and (for later volumes) PyTorch or TensorFlow, plus Gym/Gymnasium. See Installing Libraries for how to install them.

What you need. For Volumes 1–2 (foundations, tabular methods): Python, NumPy, Matplotlib. You can implement gridworld, bandits, Monte Carlo, and TD in plain Python + NumPy; plotting helps for learning curves. For Volume 3+ (function approximation, deep RL): PyTorch or TensorFlow, plus Gym or Gymnasium for environments (CartPole, MountainCar, etc.). Editor or IDE: any text editor or IDE (VS Code, PyCharm, etc.) works. Jupyter is optional; see the FAQ on “Proof that using Jupyter Notebook is the same as not using it” (you can use scripts or notebooks; both are fine).

After setup. Once your environment is ready, take the Preliminary assessment to see whether you are ready for the curriculum, or follow the Learning path from Phase 0 if you are new to programming.
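The pre-installation check can also be run as a short script. This is a sketch under assumed defaults (the minimum version and package list mirror the requirements above; the function names are illustrative):

```python
# Quick pre-installation check (run inside your environment):
# verifies the Python version and reports which packages are importable.
import importlib.util
import sys

def check_python(min_version=(3, 8)):
    """True if the running interpreter meets the minimum version."""
    return sys.version_info[:2] >= min_version

def check_packages(names=("numpy", "matplotlib")):
    """Map each package name to whether it can be imported."""
    return {name: importlib.util.find_spec(name) is not None for name in names}

if __name__ == "__main__":
    print("Python OK:", check_python())
    for name, found in check_packages().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```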

March 10, 2026 · 2 min · 229 words · codefrydev