2. Q-Learning (Table)

1. The Cheat Sheet

Imagine a table where rows are States and columns are Actions. The cell Value is "How good is this action?". We update this table as we explore. $$ Q(s,a) leftarrow Q(s,a) + alpha [R + gamma max Q(s',a') - Q(s,a)] $$

Report an issue or suggest an improvement

Mark as Completed

Previous Next Lesson