Definition

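Suppose $x_i \in \mathbb{R}^p$ and $y_i \in \{1, 2, \dots, K\}$ for $i = 1, \dots, N$. Define

$$\hat{p}_{mk} = \frac{1}{N_m} \sum_{x_i \in R_m} I(y_i = k),$$

the proportion of class $k$ observations in node $m$, where $N_m$ is the number of observations in node $m$. We classify the observations in node $m$ to the majority class $k(m) = \arg\max_k \hat{p}_{mk}$. Suppose the number of terminal nodes in a tree $T$ is $|T|$ and $R_m$ is the split region corresponding to the $m$th terminal node. For a tree $T$, define the cost complexity function

$$C_\alpha(T) = \sum_{m=1}^{|T|} N_m Q_m(T) + \alpha |T|,$$

where $Q_m(T)$ is an impurity function and $\alpha \ge 0$ is a tuning parameter regularizing complexity.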

The tree that minimizes the cost complexity function $C_\alpha(T)$ is selected as optimal.
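The selection rule can be illustrated with a minimal Python sketch. It assumes the cost complexity function weights each terminal node's impurity by its observation count and adds a penalty of the tuning parameter times the number of terminal nodes. The representation of a tree as a list of per-leaf label lists, the function names, and the label data are all hypothetical and chosen only for illustration. The Gini index is used as the impurity function here, but any impurity function could be passed in.

```python
from collections import Counter


def gini(labels):
    """Gini index of one node: sum over classes of p * (1 - p)."""
    n = len(labels)
    return sum((c / n) * (1 - c / n) for c in Counter(labels).values())


def cost_complexity(leaves, alpha, impurity=gini):
    """Weighted impurity of all terminal nodes plus alpha times their count.

    `leaves` is a hypothetical tree representation: one list of class
    labels per terminal node.
    """
    fit = sum(len(labels) * impurity(labels) for labels in leaves)
    return fit + alpha * len(leaves)


def select_tree(candidates, alpha):
    """Return the candidate tree with the smallest cost complexity."""
    return min(candidates, key=lambda leaves: cost_complexity(leaves, alpha))


# Two candidate trees over the same eight observations (made-up labels).
small_tree = [[0, 0, 0, 1], [1, 1, 1, 0]]    # 2 impure leaves
large_tree = [[0, 0, 0], [1], [1, 1, 1], [0]]  # 4 pure leaves

for alpha in (1.0, 2.0):
    best = select_tree([small_tree, large_tree], alpha)
    print(alpha, "small" if best is small_tree else "large")
```

In this toy data the larger tree fits perfectly, so it wins when the penalty is small, while a larger tuning parameter favours the tree with fewer terminal nodes.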

Impurity Functions

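For node $m$, with $\hat{p}_{mk}$ the proportion of class $k$ observations, $N_m$ the number of observations, and $k(m)$ the assigned class:

  • Misclassification error: $\frac{1}{N_m} \sum_{x_i \in R_m} I(y_i \ne k(m)) = 1 - \hat{p}_{mk(m)}$
  • Gini index: $\sum_{k \ne k'} \hat{p}_{mk} \hat{p}_{mk'} = \sum_{k=1}^{K} \hat{p}_{mk} (1 - \hat{p}_{mk})$
  • Cross-entropy or deviance: $-\sum_{k=1}^{K} \hat{p}_{mk} \log \hat{p}_{mk}$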
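
The three measures can be compared on a single node with a short Python sketch. It assumes the standard definitions (one minus the largest class proportion, the sum of p(1 - p) over classes, and the negative sum of p log p with the natural logarithm). The function names and the example labels are hypothetical.

```python
import math
from collections import Counter


def class_proportions(labels):
    """Proportion of each class that is present in the node."""
    n = len(labels)
    return [count / n for count in Counter(labels).values()]


def misclassification_error(labels):
    """One minus the proportion of the majority class."""
    return 1.0 - max(class_proportions(labels))


def gini_index(labels):
    """Sum over classes of p * (1 - p)."""
    return sum(p * (1.0 - p) for p in class_proportions(labels))


def cross_entropy(labels):
    """Negative sum over classes of p * log(p), natural logarithm.

    Only classes present in the node are counted, so p > 0 and the
    logarithm is always defined.
    """
    return -sum(p * math.log(p) for p in class_proportions(labels))


node = [0, 0, 0, 1]  # made-up labels: three of class 0, one of class 1
print(misclassification_error(node))  # 0.25
print(gini_index(node))               # 0.375
print(cross_entropy(node))
```

All three measures are zero for a pure node and grow as the class mix becomes more even.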