Definition

A neural network can be thought of as a non-linear generalization of the linear model.

The derived features $Z_{m}$ are constructed by applying an Activation Function $\sigma$ to linear combinations of the inputs $X$: $Z_{m} = \sigma(\alpha_{0m} + \alpha_{m}^{\top} X), \quad m = 1, \dots, M$.

The output nodes $T_{k}$ are linear combinations of the derived features $Z = (Z_{1}, \dots, Z_{M})$: $T_{k} = \beta_{0k} + \beta_{k}^{\top} Z, \quad k = 1, \dots, K$.

The output is then modeled as a function of the vector $T = (T_{1}, \dots, T_{K})$: $f_{k}(X) = g_{k}(T), \quad k = 1, \dots, K$, where $g_{k}$ is called an output function.
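The two equations above can be sketched as a forward pass in plain Python. This is a minimal illustration, not a reference implementation: the function names `sigmoid` and `forward`, the choice of the logistic sigmoid as the Activation Function, and the toy weights are all assumptions made for the example.

```python
import math

def sigmoid(v):
    """Logistic sigmoid, one common choice for the activation function (assumed here)."""
    return 1.0 / (1.0 + math.exp(-v))

def forward(x, alpha0, alpha, beta0, beta, g=lambda t: t):
    """Forward pass of a single-hidden-layer network.

    x      : list of p inputs
    alpha0 : list of M hidden biases (alpha_{0m})
    alpha  : M lists of p weights (alpha_m)
    beta0  : list of K output biases (beta_{0k})
    beta   : K lists of M weights (beta_k)
    g      : output function applied to the whole vector T (identity by default)
    """
    # Z_m = sigma(alpha_{0m} + alpha_m^T x)
    z = [sigmoid(a0 + sum(w * xi for w, xi in zip(a, x)))
         for a0, a in zip(alpha0, alpha)]
    # T_k = beta_{0k} + beta_k^T Z
    t = [b0 + sum(w * zm for w, zm in zip(b, z))
         for b0, b in zip(beta0, beta)]
    # f_k(X) = g_k(T)
    return z, t, g(t)

# Hypothetical toy network: p = 2 inputs, M = 2 hidden units, K = 1 output
z, t, f = forward(
    x=[1.0, -1.0],
    alpha0=[0.0, 0.0],
    alpha=[[1.0, 1.0], [2.0, 0.0]],
    beta0=[0.5],
    beta=[[1.0, -1.0]],
)
```

Passing a different `g` (for example a softmax) changes only the last step; the hidden features `z` and the linear outputs `t` are computed the same way.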

Facts

The output function $g_{k} (T)$ varies by the problem. For regression, $g_{k}$ is the Identity Function, $g_{k}(T) = T_{k}$. For $K$-class classification, the Softmax Function is used as $g_{k}$: $g_{k}(T) = e^{T_{k}} / \sum_{l=1}^{K} e^{T_{l}}$.
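As a rough sketch, the two output functions can be written as follows in plain Python. The names `identity` and `softmax` are chosen for this example, and subtracting the maximum score before exponentiating is a common numerical-stability trick rather than part of the definition.

```python
import math

def identity(t):
    """Output function for regression: g_k(T) = T_k."""
    return list(t)

def softmax(t):
    """Output function for K-class classification: g_k(T) = exp(T_k) / sum_l exp(T_l)."""
    m = max(t)  # shifting by max(t) leaves the result unchanged but avoids overflow
    exps = [math.exp(v - m) for v in t]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for K = 3 classes
probs = softmax([2.0, 1.0, 0.1])
```

The softmax outputs are positive and sum to one, so they can be read as estimated class probabilities.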

For regression problems, the Sum of Squared Errors Loss is used as the Loss Function. For classification problems, the Cross-Entropy Loss is used.
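A minimal sketch of the two losses, assuming targets and predictions are given as lists of per-observation vectors. The function names and the small `eps` guard against `log(0)` are assumptions of this example, not part of the definitions.

```python
import math

def sum_of_squares(y, f):
    """R = sum_i sum_k (y_ik - f_k(x_i))^2."""
    return sum((yk - fk) ** 2
               for yi, fi in zip(y, f)
               for yk, fk in zip(yi, fi))

def cross_entropy(y, f, eps=1e-12):
    """R = -sum_i sum_k y_ik * log f_k(x_i); y is one-hot, f holds class probabilities."""
    return -sum(yk * math.log(max(fk, eps))  # eps only guards against log(0)
                for yi, fi in zip(y, f)
                for yk, fk in zip(yi, fi))

# Hypothetical toy data: two observations each
sse = sum_of_squares([[1.0], [2.0]], [[0.5], [2.5]])
ce = cross_entropy([[1, 0], [0, 1]], [[0.5, 0.5], [0.25, 0.75]])
```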

With the Softmax Function as the output function and the Cross-Entropy Loss, the neural network model is exactly a linear Logistic Regression model in the hidden units $Z_{m}$, and all the parameters are estimated by maximum likelihood.

The parameters (weights) of a neural network are estimated by gradient descent on the Loss Function, with the gradients computed by Backpropagation.
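To make this concrete, here is a small sketch of backpropagation for a single-hidden-layer network, followed by a few gradient-descent updates. It assumes a sigmoid Activation Function, the identity output function, and squared-error loss on a single observation; the function name `loss_and_grads`, the learning rate, and the toy weights are all hypothetical.

```python
import math

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def loss_and_grads(x, y, alpha0, alpha, beta0, beta):
    """Squared-error loss and its gradients for one observation (x, y)."""
    M, K = len(alpha), len(beta)
    # Forward pass: hidden features Z_m, then outputs f_k = T_k (identity output)
    z = [sigmoid(alpha0[m] + sum(a * xi for a, xi in zip(alpha[m], x)))
         for m in range(M)]
    f = [beta0[k] + sum(b * zm for b, zm in zip(beta[k], z)) for k in range(K)]
    loss = sum((y[k] - f[k]) ** 2 for k in range(K))

    # Backward pass: output-layer errors delta_k = dR/dT_k
    delta = [-2.0 * (y[k] - f[k]) for k in range(K)]
    # Hidden-layer errors s_m = sigma'(.) * sum_k beta_km * delta_k,
    # using sigma'(v) = sigma(v) * (1 - sigma(v)) for the sigmoid
    s = [z[m] * (1.0 - z[m]) * sum(beta[k][m] * delta[k] for k in range(K))
         for m in range(M)]

    grads = {
        "beta0": delta,
        "beta": [[delta[k] * z[m] for m in range(M)] for k in range(K)],
        "alpha0": s,
        "alpha": [[s[m] * xi for xi in x] for m in range(M)],
    }
    return loss, grads

# Hypothetical toy problem: p = 2 inputs, M = 2 hidden units, K = 1 output
x, y = [1.0, -1.0], [1.0]
alpha0, alpha = [0.1, -0.2], [[0.5, -0.3], [0.2, 0.4]]
beta0, beta = [0.0], [[0.7, -0.6]]

lr = 0.1  # learning rate, chosen arbitrarily for this example
losses = []
for _ in range(50):
    loss, g = loss_and_grads(x, y, alpha0, alpha, beta0, beta)
    losses.append(loss)
    # Gradient-descent update: theta <- theta - lr * dR/dtheta
    beta0 = [b - lr * gb for b, gb in zip(beta0, g["beta0"])]
    beta = [[b - lr * gb for b, gb in zip(bk, gk)]
            for bk, gk in zip(beta, g["beta"])]
    alpha0 = [a - lr * ga for a, ga in zip(alpha0, g["alpha0"])]
    alpha = [[a - lr * ga for a, ga in zip(am, gm)]
             for am, gm in zip(alpha, g["alpha"])]
```

The backward pass reuses the quantities computed in the forward pass, which is what makes backpropagation cheap: the hidden-layer errors `s` are obtained from the output-layer errors `delta` by the chain rule. A finite-difference comparison against `grads` is a simple way to sanity-check such code.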

Neural networks are especially effective in problems with a high signal-to-noise ratio.
