My Knowledge Base

❯

❯

n-Step TD

Mar 12, 20261 min read

machine_learning/reinforcement_learning

Definition

$V (S_{t}) \leftarrow V (S_{t}) + α [G_{t : t + n} - V (S_{t})]$ where $G_{t : t + n}$ is the n-Step Return.

The n-step TD learning is a method that bridges the gap between TD and MC methods.

Examples

$n = 1$ : Temporal Difference Learning
$n = \infty$ : Monte Carlo Method

Larger $n$ reduces bias but increases variance.

Graph View

Definition
Examples

Backlinks

Reinforcement Learning Note
n-step Off-Policy Learning

Created with Quartz v4.5.1 © 2026

GitHub
Discord Community