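Definition. Consider two policies $\pi$ and $\pi'$. If

$$\forall s \in S,\quad Q^{\pi}(s, \pi'(s)) \ge V^{\pi}(s),$$

then

$$\forall s \in S,\quad V^{\pi'}(s) \ge V^{\pi}(s).$$

Because the inequality is weak, this implies that $\pi'$ is at least as good as $\pi$ in every state, not necessarily strictly better.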
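The statement can be checked numerically on a toy problem. The sketch below is only an illustration under stated assumptions: the two-state, two-action deterministic MDP, the discount factor `GAMMA`, and the helper names `evaluate`, `q_value`, and `greedy` are all hypothetical and do not come from the text. It evaluates a policy $\pi$, builds $\pi'$ by acting greedily with respect to $Q^{\pi}$ (one way to satisfy the premise), and then compares $V^{\pi'}$ with $V^{\pi}$.

```python
# Hypothetical toy MDP (not from the text): 2 states, 2 actions,
# deterministic transitions, discount factor GAMMA.
GAMMA = 0.9
NEXT_STATE = [[0, 1], [0, 1]]      # NEXT_STATE[s][a] is the successor state
REWARD = [[0.0, 1.0], [0.0, 2.0]]  # REWARD[s][a] is the immediate reward
N_STATES, N_ACTIONS = 2, 2


def evaluate(policy, sweeps=1000):
    """Approximate V^pi by repeatedly applying the Bellman expectation backup."""
    v = [0.0] * N_STATES
    for _ in range(sweeps):
        v = [
            REWARD[s][policy[s]] + GAMMA * v[NEXT_STATE[s][policy[s]]]
            for s in range(N_STATES)
        ]
    return v


def q_value(v, s, a):
    """Q^pi(s, a) computed from the state values v = V^pi."""
    return REWARD[s][a] + GAMMA * v[NEXT_STATE[s][a]]


def greedy(v):
    """Policy that picks, in each state, an action maximising Q^pi(s, a)."""
    return [
        max(range(N_ACTIONS), key=lambda a: q_value(v, s, a))
        for s in range(N_STATES)
    ]


pi = [0, 0]              # initial policy: always take action 0
v_pi = evaluate(pi)
pi_new = greedy(v_pi)    # candidate pi'
v_new = evaluate(pi_new)

# Premise: Q^pi(s, pi'(s)) >= V^pi(s) for every state s.
premise = all(q_value(v_pi, s, pi_new[s]) >= v_pi[s] for s in range(N_STATES))
# Conclusion: V^pi'(s) >= V^pi(s) for every state s (small numerical tolerance).
conclusion = all(v_new[s] >= v_pi[s] - 1e-9 for s in range(N_STATES))

print("pi' =", pi_new)
print("premise holds:", premise)
print("conclusion holds:", conclusion)
```

A greedy $\pi'$ always satisfies the premise, since the maximum over actions of $Q^{\pi}(s, a)$ is at least $Q^{\pi}(s, \pi(s)) = V^{\pi}(s)$; any other $\pi'$ that meets the premise could be substituted for `greedy(v_pi)` in the same check.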