Reinforcement Learning: Agents
Module 5 of 8
5. Actor-Critic (A2C)
1. Two Brains
- Actor: Decides WHAT to do (Policy).
- Critic: Judges how good the action was (Value Function).
The two are trained together. The Critic's value estimate serves as a baseline for the Actor's policy-gradient update, which reduces the variance of that update.
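To make the Actor/Critic split concrete, here is a minimal sketch of the two loss terms for a single transition. It assumes a discrete action space with a softmax policy and a one-step TD target; the function name `a2c_losses`, its parameters, and the discount `gamma=0.99` are illustrative choices, not something the module specifies.

```python
import math

def softmax(logits):
    """Convert raw action scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def a2c_losses(logits, action, reward, value, next_value, gamma=0.99, done=False):
    """Return (advantage, actor_loss, critic_loss) for one transition.

    logits      -- Actor output for the current state (hypothetical input)
    value       -- Critic estimate V(s) for the current state
    next_value  -- Critic estimate V(s') for the next state
    """
    # One-step TD target: r + gamma * V(s'), with no bootstrap at episode end.
    target = reward + (0.0 if done else gamma * next_value)
    # Advantage: how much better the outcome was than the Critic expected.
    advantage = target - value
    # Actor loss: increase the log-probability of actions with positive advantage.
    # In a real autodiff setup the advantage is treated as a constant here.
    log_prob = math.log(softmax(logits)[action])
    actor_loss = -log_prob * advantage
    # Critic loss: squared error between V(s) and the TD target.
    critic_loss = advantage ** 2
    return advantage, actor_loss, critic_loss

# Example: two equally likely actions, terminal step with reward 1.0,
# and a Critic that predicted 0.5 for the current state.
adv, actor_loss, critic_loss = a2c_losses(
    logits=[0.0, 0.0], action=0, reward=1.0, value=0.5, next_value=0.0, done=True
)
print(adv, actor_loss, critic_loss)
```

Because the Actor is scaled by the advantage rather than the raw return, an action is reinforced only to the extent that it beat the Critic's expectation, which is where the variance reduction comes from.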