Definition

TD($\lambda$) is a temporal-difference (TD) algorithm that uses the $\lambda$-return as the target for value function updates: $V(S_{t}) \leftarrow V(S_{t}) + \alpha [G_{t}^{\lambda} - V(S_{t})]$, where $G_{t}^{\lambda}$ is the $\lambda$-return and $\alpha$ is the step size.
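
The note does not spell out the $\lambda$-return itself. As a sketch, assuming the standard episodic definition with discount factor $\gamma$, terminal time $T$, and $n$-step returns $G_{t:t+n}$ (none of these symbols appear in the note above), it can be written as:

$$G_{t:t+n} = R_{t+1} + \gamma R_{t+2} + \dots + \gamma^{n-1} R_{t+n} + \gamma^{n} V(S_{t+n})$$

$$G_{t}^{\lambda} = (1 - \lambda) \sum_{n=1}^{T-t-1} \lambda^{n-1} G_{t:t+n} + \lambda^{T-t-1} G_{t}$$

Under this definition, $\lambda$ sets how quickly the weight on longer $n$-step returns decays: each additional step is weighted by a further factor of $\lambda$, and the weight remaining at the end of the episode goes to the full return $G_{t}$.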

Examples

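$TD(0)$: one-step Temporal Difference learning ($\lambda = 0$, so the target is $R_{t+1} + \gamma V(S_{t+1})$)
$TD(1)$: equivalent to the Monte Carlo method ($\lambda = 1$, so the target is the full return $G_{t}$)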
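
The two special cases can be checked numerically. The following is a minimal sketch, not a reference implementation: it assumes an episodic task, an offline (end-of-episode) forward-view update, a trajectory in which every state is visited once so that values can be indexed by time step, and a terminal value of 0. The function names `lambda_returns` and `td_lambda_update` and the discount parameter `gamma` are illustrative assumptions. It uses the recursive form $G_{t}^{\lambda} = R_{t+1} + \gamma [(1 - \lambda) V(S_{t+1}) + \lambda G_{t+1}^{\lambda}]$.

```python
def lambda_returns(rewards, values, gamma, lam):
    """Compute the lambda-return for every time step of one episode.

    rewards[t] is R_{t+1}; values[t] is the current estimate V(S_t).
    Assumption: the state after the last reward is terminal with value 0.
    """
    T = len(rewards)
    returns = [0.0] * T
    next_return = 0.0  # lambda-return at the terminal step
    next_value = 0.0   # V(terminal), assumed to be 0
    for t in reversed(range(T)):
        # Blend the bootstrapped estimate and the longer return by lambda.
        returns[t] = rewards[t] + gamma * (
            (1 - lam) * next_value + lam * next_return
        )
        next_return = returns[t]
        next_value = values[t]
    return returns


def td_lambda_update(values, rewards, alpha, gamma, lam):
    """Apply V(S_t) <- V(S_t) + alpha * (G_t^lambda - V(S_t)) for each step.

    Assumption: each state appears once, so values are indexed by time step.
    """
    targets = lambda_returns(rewards, values, gamma, lam)
    return [v + alpha * (g - v) for v, g in zip(values, targets)]


rewards = [1.0, 0.0, 2.0]
values = [0.5, 0.5, 0.5]

# lambda = 0: one-step TD targets R_{t+1} + gamma * V(S_{t+1})
print(lambda_returns(rewards, values, gamma=0.5, lam=0.0))

# lambda = 1: Monte Carlo targets (full discounted returns)
print(lambda_returns(rewards, values, gamma=0.5, lam=1.0))
```

With `lam=0.0` each target depends only on the next reward and the next state's value estimate, and with `lam=1.0` the value estimates drop out entirely and the targets are the discounted sums of rewards. Intermediate values of `lam` interpolate between the two.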
