TensorLearn
Back to Course
Reinforcement Learning: Agents
Module 2 of 8

2. Q-Learning (Table)

1. The Cheat Sheet

Imagine a table where rows are States and columns are Actions. The cell Value is "How good is this action?". We update this table as we explore. $$ Q(s,a) leftarrow Q(s,a) + alpha [R + gamma max Q(s',a') - Q(s,a)] $$

Mark as Completed

TensorLearn - AI Engineering for Professionals