Definition Gt:=k=0∑∞γkRt+k+1=Rt+1+γGt+1 where γ∈[0,1] is a discount factor Return Gt is the total discounted Reward from time step t.