Tensor
Learn
Home
Courses
Resources
About
Contact
Back to Course
Reinforcement Learning: Agents
Module 1 of 8
1. The Environment
1. The Loop
Agent
: Takes Action ($A_t$).
Environment
: Returns State ($S_{t+1}$) and Reward ($R_{t+1}$).
Goal
: Maximize cumulative reward.
Report an issue or suggest an improvement
Mark as Completed
Next Lesson
TensorLearn - AI Engineering for Professionals