Definition Q(St,At)←Q(St,At)+α[Gt:t+n−Q(St,At)] where Gt:t+n:=k=0∑n−1γkRt+k+1+γnQ(St+n,At+n) is the n-Step Return.