author: jordi1215
score: 10 / 10

The paper introduces a new generalization of ReLU, called the Parametric Rectified Linear Unit (PReLU), which adaptively learns the parameters of the rectifiers and improves accuracy at negligible extra computational cost.
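For reference, the activation the paper defines is

```latex
f(y_i) =
\begin{cases}
  y_i,       & y_i > 0 \\
  a_i\, y_i, & y_i \le 0
\end{cases}
\;=\; \max(0, y_i) + a_i \min(0, y_i)
```

where $a_i$ is a learnable coefficient, either one per channel or shared across all channels of a layer. Setting $a_i = 0$ recovers ReLU, and fixing it to a small constant recovers Leaky ReLU.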

PReLU introduces a very small number of extra parameters. It can be trained using backpropagation and optimized simultaneously with the other layers.
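A minimal NumPy sketch (not the authors' code; here the slope `a` is a single shared scalar for simplicity) of the forward pass and the gradients that make `a` trainable by backpropagation:

```python
import numpy as np

def prelu_forward(y, a):
    """PReLU: pass positive pre-activations through, scale negatives by the learned slope a."""
    return np.where(y > 0, y, a * y)

def prelu_grad_a(y, grad_out):
    """Gradient of the loss w.r.t. a: d f/d a = 0 for y > 0, and y for y <= 0,
    accumulated over all positions (channel-shared case)."""
    return np.sum(grad_out * np.where(y > 0, 0.0, y))

def prelu_grad_y(y, a, grad_out):
    """Gradient of the loss w.r.t. the pre-activation y: 1 for y > 0, a otherwise."""
    return grad_out * np.where(y > 0, 1.0, a)
```

The paper initializes each $a_i$ at 0.25 and updates it with the same momentum SGD used for the weights.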

ReLU expedites convergence of the training procedure and leads to better solutions than conventional sigmoid-like units.

The paper also studies the difficulty of training rectified models that are very deep. By explicitly modeling the nonlinearity of rectifiers (ReLU/PReLU), the team derives a theoretically sound initialization method, which helps with convergence of very deep models (e.g., with 30 weight layers) trained directly from scratch. This gives more flexibility to explore more powerful network architectures.
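A minimal NumPy sketch of this initialization (often called "He" or "MSRA" initialization; the function name `msra_init` is mine, not the paper's): zero-mean Gaussian weights with variance $2 / n_l$ for ReLU layers, where $n_l$ is the fan-in, generalizing to $2 / \big((1 + a^2)\, n_l\big)$ for PReLU:

```python
import numpy as np

def msra_init(fan_in, shape, a=0.0, rng=None):
    """Zero-mean Gaussian init with std = sqrt(2 / ((1 + a**2) * fan_in)).

    a = 0 recovers the ReLU case derived in the paper; a = 0.25 matches
    the initial PReLU slope the paper uses.
    """
    rng = rng if rng is not None else np.random.default_rng()
    std = np.sqrt(2.0 / ((1.0 + a ** 2) * fan_in))
    return rng.normal(0.0, std, size=shape)

# Example: a 3x3 conv layer with 64 input channels and 128 filters;
# fan_in = k * k * c_in, as in the paper's derivation.
w = msra_init(fan_in=3 * 3 * 64, shape=(128, 64, 3, 3), a=0.25)
```

The key point of the derivation is that keeping this variance constant across layers prevents the signal from exponentially shrinking or exploding, which is what stalls very deep rectified networks initialized the older "Xavier" way.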

Networks with PReLU and this initialization surpass human-level performance on the classification task of the 1000-class ImageNet dataset (4.94% top-5 error vs. the reported 5.1% human-level error).

TL;DR: PReLU adaptively learns the negative slopes of rectifier units at negligible extra cost, and together with a rectifier-aware weight initialization it lets very deep networks train from scratch and surpass human-level accuracy on ImageNet classification.