Definition
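n-step TD learning bridges the gap between one-step TD and Monte Carlo (MC) methods: the value estimate is updated toward a target built from n sampled rewards plus a bootstrapped value estimate. In standard notation, the update is

$$V(S_t) \leftarrow V(S_t) + \alpha \left( G_t^{(n)} - V(S_t) \right),$$

where $G_t^{(n)} = R_{t+1} + \gamma R_{t+2} + \dots + \gamma^{n-1} R_{t+n} + \gamma^{n} V(S_{t+n})$ is the n-step return.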
Examples
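Larger n reduces bias but increases variance: the target relies less on the bootstrapped value estimate and more on sampled rewards. With n = 1 the target is the one-step TD target; once n reaches the end of the episode it is the full MC return.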
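To make the role of n concrete, the following is a minimal sketch of the n-step return and the corresponding value update for a single recorded episode. It assumes the standard form of the return (n discounted rewards plus a discounted bootstrapped value). The function names, the list-based episode layout, and the numbers are illustrative assumptions, not taken from the original text.

```python
def n_step_return(rewards, values, t, n, gamma):
    """Return the n-step return from time step t (illustrative sketch).

    Assumed layout: rewards[k] is the reward received after leaving
    state S_k, and values[k] is the current estimate V(S_k). The
    episode terminates after the last reward.
    """
    T = len(rewards)              # terminal time step
    end = min(t + n, T)           # do not look past the end of the episode
    g = 0.0
    for k in range(t, end):
        g += gamma ** (k - t) * rewards[k]
    if end < T:
        # Episode not finished after n steps: bootstrap from V(S_{t+n}).
        g += gamma ** n * values[end]
    return g


def n_step_td_update(values, rewards, t, n, gamma, alpha):
    """Move V(S_t) a fraction alpha toward the n-step return, in place."""
    target = n_step_return(rewards, values, t, n, gamma)
    values[t] += alpha * (target - values[t])
    return values[t]


# Hypothetical three-step episode with a single reward at the end.
rewards = [0.0, 0.0, 1.0]
values = [0.5, 0.5, 0.5]
for n in (1, 2, 3):
    print(n, round(n_step_return(rewards, values, 0, n, 0.9), 3))
```

With these made-up numbers the target from the first state is about 0.45 for n = 1, 0.405 for n = 2, and 0.81 for n = 3. The first two lean on the initial value guess of 0.5, while the last uses only observed rewards and equals the MC return.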