Search Results for “rectifier (neural networks)”

Source: Rectifier (neural networks)

In the context of artificial neural networks, the rectifier or ReLU (rectified linear unit) activation function is an activation function defined as the non-negative part of its argument, i.e., the ramp function:

ReLU
⁡
(
x
)
=

x

+

=
max
(
0
,
x
)
=

x
+

|

x

|

2

=

{

x

if

x
>
0
,

0

x
≤
0

{\displaystyle \operatorname {ReLU} (x)=x^{+}=\max(0,x)={\frac {x+|x|}{2}}={\begin{cases}x&{\text{if }}x>0,\\0&x\leq 0\end{cases}}}

where

x

{\displaystyle x}

is the input to a neuron. This is analogous to half-wave rectification in electrical engineering.
ReLU is one of the most popular activation function for artificial neural networks, and finds application in computer vision and speech recognition using deep neural nets and computational neuroscience.
It was first used by Alston Householder in 1941 as a mathematical abstraction of biological neural networks. It was introduced by Kunihiko Fukushima in 1969 in the context of visual feature extraction in hierarchical neural networks. It was later argued that it has strong biological motivations and mathematical justifications. In 2011, ReLU activation enabled training deep supervised neural networks without unsupervised pre-training, compared to the widely used activation functions prior to 2011, e.g., the logistic sigmoid (which is inspired by probability theory; see logistic regression) and its more practical counterpart, the hyperbolic tangent.

Advantages

Potential problems

Variants

= Piecewise-linear variants

= Other non-linear variants

See also

References

Kata Kunci Pencarian:

Recent Movies

Recent Movies

Categories

Recent Movies