Introduction to Recurrent Neural Networks

What is RNN

The networks are recurrent because they performance same computations for all the elements of a sequence of input, and the output of each element dependents, in addition to current input, from all the previous commutations.

Why RNN

  • Sequential type information of the inputs
    Video Analysis
    Speech Recognition
    Machine Translation
  • RNN have proved to have excellent performance in such problems

RNN Procedure

Introduction to Recurrent Neural Networks

Sigmoid Gradient

Introduction to Recurrent Neural Networks

The Vanish Gradient Problem

Consider the recurrent networks:

ht=σ(Uxt+Vht1)

then,
h3=σ(Ux3+V(σ(Ux2+V(σ(Ux1)))))

E3U=E3out3out3h3h3h2h2h1h1U

LSTM Cell

Introduction to Recurrent Neural Networks

  • Input Gate

    g=tanh(bg+xtUg+ht1Vg)

    i=σ(bi+xtUi+ht1Vi)

    outi=gi

  • forget gate

    f=σ(bf+xtUf+ht1Vf)

    st=st1f+gi

  • output gate

    o=σ(bo+xtUo+ht1Vo)

    ht=tanh(st)o

Reducing The Problem

stst1=f

Reference