What is Sequence Model?
SM split data into …
SM split data into small chunks / sequences of data to solve classification problems
What are the 4 problems SM solve?
Within SM, what is neurons with recurrence?
Neurons wif Recurrence is the computation of … curr input and previous output …
Neurons with recurrence is the computation at each time step of the product of current input and the output of previous time step(Past memory).
How does RNNs work?
Applys a ___ relation at every ___ to process a sequence.
It apply a recurrence relation at every time step to process a sequence:
ht = fw(xt, ht-1)
What is RNNs intuition?
Give the idea of process
Input Weight vector -> Update hidden state -> Output vector/Pred output
What is the computation of RNN across time?
What are the 4 Design Criteria for SM?
What is the technique called that transform language into indexes?
Give 1 word
Embedding / Encoding
What are the 4 criteria to model sequences?
RNNs meet these criterias
The standard RNN gradient flow consists of repeated computation of weight matrices. What are the 2 issues with this?
Why is vanishing gradients a big problem?
It causes the model to…
It causes the model to lose the ability to learn something useful.
What is the most robust way to mitigate vanishing gradients?
___ cells: Use __ to add or remove info.
Gated cells: Use gates to selectively add or remove info within each recurrent unit.
What is the Long short-term memory (LSTM) key concept?
__ & __ information
Forget & Store information
How to build a more effective Sequence Model?
Use __ to model seq without recurrence.
Use self-attention to model sequence without recurrence.
What is the architecture of Transformer in AI?
Self-attention is the foundation mechanism build in NN of Transformer.
What are the 4 steps of self-attention with NN?