My Knowledge Base

❯

❯

TD Error

Jul 26, 20261 min read

machine_learning/reinforcement_learning

Definition

The TD error $δ$ is the difference between the target and the current prediction.

TD error for State-Value Function

$δ_{t} = r_{t + 1} + γV (s_{t + 1}) - V (s_{t})$

TD error for SARSA

$δ_{t} = r_{t + 1} + γ Q (s_{t + 1}, a_{t + 1}) - Q (s_{t}, a_{t})$

TD error for Q-Learning

$δ_{t} = r_{t + 1} + γ max_{a} Q (s_{t + 1}, a) - Q (s_{t}, a_{t})$

Graph View

Definition
TD error for State-Value Function
TD error for SARSA
TD error for Q-Learning

Backlinks

Prioritized Replay
Reinforcement Learning Note

Created with Quartz v4.5.1 © 2026

GitHub
Discord Community