Back to Course
Reinforcement Learning: Agents
Module 2 of 8
2. Q-Learning (Table)
1. The Cheat Sheet
Imagine a table where rows are States and columns are Actions. The cell Value is "How good is this action?". We update this table as we explore. $$ Q(s,a) leftarrow Q(s,a) + alpha [R + gamma max Q(s',a') - Q(s,a)] $$