Definition

Neural network can be thought as a non-linear generalization of linear model.

The derived features are constructed by an Activation Function and linear combinations of the inputs. where is an Activation Function

Output nodes are the linear combinations of And the output is modeled by a function of a linear combinations of where is called an output function.

Facts

The output function varies by the problem. For regression is Identity Function, and for -class classification Softmax Function is used as the .

For regression problem, Sum of Squared Errors Loss is used as Loss Function. For classification problem, we use Cross-Entropy Loss

With the softmax activation function and the Cross-Entropy Loss, the neural network model is exactly a linear Logistic Regression model in the hidden units.

The parameters of a neural network are estimated by Backpropagation.

Neural network is especially effective in problem with a high signal-to-noise ratio.