Definition

The TD error $\delta$ is the difference between the target and the current prediction.

TD error for State-Value Function

$$\delta_t = r_{t+1} + \gamma V(s_{t+1}) - V(s_t)$$

TD error for SARSA

$$\delta_t = r_{t+1} + \gamma Q(s_{t+1}, a_{t+1}) - Q(s_t, a_t)$$

TD error for Q-Learning

$$\delta_t = r_{t+1} + \gamma \max_a Q(s_{t+1}, a) - Q(s_t, a_t)$$
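The three TD errors defined above can be computed directly from tabular value estimates. The following is a minimal sketch, not a reference implementation: the function names, the list-based tables `V` and `Q`, and the numeric values are all hypothetical and chosen only for illustration. It also assumes the next state is non-terminal, so the bootstrap term is always included.

```python
def td_error_state_value(V, s, r, s_next, gamma):
    """delta_t = r_{t+1} + gamma * V(s_{t+1}) - V(s_t)."""
    return r + gamma * V[s_next] - V[s]


def td_error_sarsa(Q, s, a, r, s_next, a_next, gamma):
    """delta_t = r_{t+1} + gamma * Q(s_{t+1}, a_{t+1}) - Q(s_t, a_t)."""
    return r + gamma * Q[s_next][a_next] - Q[s][a]


def td_error_q_learning(Q, s, a, r, s_next, gamma):
    """delta_t = r_{t+1} + gamma * max_a Q(s_{t+1}, a) - Q(s_t, a_t)."""
    return r + gamma * max(Q[s_next]) - Q[s][a]


# Illustrative tables: 2 states, 2 actions (values are made up).
V = [0.5, 1.0]
Q = [[0.25, 0.5],
     [1.0, 3.0]]

gamma = 0.5   # discount factor (assumed)
r = 1.0       # reward r_{t+1} observed after acting in state 0

delta_v = td_error_state_value(V, s=0, r=r, s_next=1, gamma=gamma)
delta_sarsa = td_error_sarsa(Q, s=0, a=1, r=r, s_next=1, a_next=0, gamma=gamma)
delta_q = td_error_q_learning(Q, s=0, a=1, r=r, s_next=1, gamma=gamma)

print(delta_v)      # 1.0
print(delta_sarsa)  # 1.0
print(delta_q)      # 2.0
```

SARSA and Q-Learning differ only in the target: SARSA bootstraps from the action actually taken next ($a_{t+1}$), while Q-Learning bootstraps from the greedy action in $s_{t+1}$. In this example that is why the Q-Learning error is larger.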