My Knowledge Base

Home

❯

3. Resource

❯

TD(lambda)

TD(lambda)

Dec 10, 20251 min read

  • machine_learning/reinforcement_learning

Definition

TD(λ) is a TD algorithm that uses the lambda-Return for value function updates. V(St​)←V(St​)+α[Gtλ​−V(St​)] where Gtλ​ is the lambda-Return

Examples

  • TD(0): Temporal Difference Learning
  • TD(1): Monte Carlo Method

Graph View

  • Definition
  • Examples

Backlinks

  • Reinforcement Learning Note

Created with Quartz v4.5.1 © 2025

  • GitHub
  • Discord Community