TensorLearn
Reinforcement Learning: Agents
Module 1 of 8

1. The Environment

1. The Loop

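  • Agent: Observes the current state ($S_t$) and takes an action ($A_t$).
  • Environment: Responds with the next state ($S_{t+1}$) and a reward ($R_{t+1}$).
  • Goal: Maximize cumulative reward, i.e. the sum of the rewards collected over time.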
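
The loop above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: `CorridorEnv`, `random_policy`, `always_right`, and `run_episode` are hypothetical names invented for this example, and the reward values (+1.0 at the goal, -0.1 per other step) are arbitrary assumptions.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: walk from cell 0 to the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        # Start every episode in the leftmost cell and return the initial state.
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the position is clamped to the corridor.
        self.state = max(0, min(self.length - 1, self.state + action))
        done = self.state == self.length - 1
        # Assumed reward scheme: +1.0 on reaching the goal, -0.1 otherwise.
        reward = 1.0 if done else -0.1
        return self.state, reward, done


def random_policy(state, rng):
    # Agent that ignores the state and picks a direction at random.
    return rng.choice([-1, 1])


def always_right(state, rng):
    # Agent that always moves toward the goal.
    return 1


def run_episode(env, policy, rng, max_steps=100):
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state, rng)             # agent takes A_t
        state, reward, done = env.step(action)  # environment returns S_{t+1}, R_{t+1}
        total_reward += reward                  # accumulate the cumulative reward
        if done:
            break
    return total_reward


if __name__ == "__main__":
    rng = random.Random(0)
    print("random policy:", run_episode(CorridorEnv(), random_policy, rng))
    print("always right: ", run_episode(CorridorEnv(), always_right, rng))
```

Passing the policy in as a plain function keeps the agent and the environment separate, so different agents can be compared on the same environment by their cumulative reward.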

