My Knowledge Base

Home

❯

3. Resource

❯

Generalized Policy Iteration

Jul 26, 20261 min read

machine_learning/reinforcement_learning

Definition

Generalized policy iteration uses the repeatedly approximated value function to the true value of the current policy (sample backup) and the policy is repeatedly improved to approach the optimality.

Graph View

Backlinks

Monte Carlo Method
Reinforcement Learning Note
Temporal Difference Learning

GitHub
Discord Community

My Knowledge Base

Explorer

Generalized Policy Iteration

Definition

Graph View

Backlinks