Definition. Suppose the number of data points is $n$ and the number of classes is $K$. Then the cross-entropy loss is defined as

$$R(\theta) = -\sum_{i=1}^{n} \sum_{k=1}^{K} y_{ik} \ln f_k(x_i)$$
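To make the double sum concrete, here is a minimal Python sketch of the formula. It rests on assumptions the definition does not state: that `y[i][k]` is a one-hot label indicator (1 if data point i belongs to class k, else 0) and that `probs[i][k]` holds the model output f_k(x_i) as a probability. The function name `cross_entropy_loss` and the clipping constant `eps` are hypothetical, introduced only for illustration.

```python
import math


def cross_entropy_loss(y, probs, eps=1e-12):
    """Compute R = -sum_i sum_k y[i][k] * ln(probs[i][k]).

    y     : n rows of K label indicators (assumed one-hot).
    probs : n rows of K predicted probabilities, standing in for f_k(x_i).
    eps   : assumed lower bound on probabilities, used only to avoid ln(0).
    """
    if len(y) != len(probs):
        raise ValueError("y and probs must have the same number of rows")
    total = 0.0
    for y_i, p_i in zip(y, probs):        # outer sum over data points i
        if len(y_i) != len(p_i):
            raise ValueError("each row must have K entries")
        for y_ik, p_ik in zip(y_i, p_i):  # inner sum over classes k
            total += y_ik * math.log(max(p_ik, eps))
    return -total


# Example with n = 2 data points and K = 3 classes.
y = [[1, 0, 0], [0, 1, 0]]
probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
loss = cross_entropy_loss(y, probs)  # equals -(ln 0.7 + ln 0.8)
```

With one-hot labels, only the term for the true class of each data point survives the inner sum, so the loss reduces to the negative log-probability assigned to the correct classes.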