Reinforcement Learning: Agents

Module 1 of 11

1. The Environment

1. The Loop

Agent: Takes Action ($A_t$).
Environment: Returns State ($S_{t+1}$) and Reward ($R_{t+1}$).
Goal: Maximize cumulative reward.

Report an issue or suggest an improvement

Mark as Completed

TensorLearn - AI Engineering for Professionals