All technological notes.
Recurrent Neural Networks
RNN vs traditional deep neural networks:
Variants:
Different amounts of inputs
Usage:


- Input Layer
Hidden Layer:
- Linear function(input) + Linear function(Feedback) -> Hidden state
- Non-linear function -> Non-lear ouput
- Output Layer:
- Linear function -> Logit State
- Softmax function -> Softmax

- Hidden Layer’s
- Linear function(input): Yellow
- Non-linear function -> Non-lear ouput: Green
- Linear function -> Logit State: Output
- Linear function(Feedback): Orange
vanishing gradient problem
The model learns from a change in its gradient; this change affects the network’s output.
原因: