Definition

An action-value function is the expected discounted Return of taking action starting in state under policy .

Optimal Action-Value Function

An optimal action-value function is the maximum possible action-value function (an action-value function under optimal policy).