Definition

Distributional RL treats the return as a random variable. Instead of the action-value function Q(s, a), which is only the expected return, it works with the random return Z(s, a), whose distribution is called the action-value distribution.
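
To make the distinction concrete, here is a minimal sketch in Python that contrasts the full distribution of returns with the single number an action-value function would keep. Everything in it is an assumption made for illustration and is not part of the definition above: the toy process (a reward of 1 with probability 0.5 at each of three steps), the discount factor, and the names sample_return, return_distribution, and q_value.

```python
import random
from collections import Counter


def sample_return(rng, gamma=0.9, horizon=3):
    """Sample one discounted return from a hypothetical toy process.

    Assumption: at every step the reward is 1.0 with probability 0.5
    and 0.0 otherwise, for a fixed state-action pair.
    """
    g = 0.0
    for t in range(horizon):
        reward = 1.0 if rng.random() < 0.5 else 0.0
        g += (gamma ** t) * reward
    return g


rng = random.Random(0)  # fixed seed so the sketch is reproducible
samples = [sample_return(rng) for _ in range(10_000)]

# Empirical action-value distribution: probability of each return value.
counts = Counter(round(g, 2) for g in samples)
return_distribution = {g: c / len(samples) for g, c in sorted(counts.items())}

# The action-value function keeps only the mean of that distribution.
q_value = sum(samples) / len(samples)

for g, p in return_distribution.items():
    print(f"return {g:.2f}: probability {p:.3f}")
print(f"expected return (Q): {q_value:.3f}")
```

In this toy setting the return can take eight distinct values, while the action-value function summarizes all of them as one average. A distributional method would try to learn the whole table of probabilities rather than only that average.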
