CBOW steps
From the input matrix, take the embeddings of the context words
Average the context embeddings to get v'
Multiply this v' vector by the output matrix
Get the scores and apply softmax
Learn from the result and backpropagate to tune the input and output matrices (a NumPy sketch follows)
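A minimal NumPy sketch of one CBOW training step, matching the steps above. The vocabulary size, embedding dimension, and word indices are illustrative assumptions, not from any specific dataset.

```python
import numpy as np

V, D = 10, 4                                # assumed vocab size, embedding dim
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(V, D))   # input (embedding) matrix
W_out = rng.normal(scale=0.1, size=(D, V))  # output matrix

context = [1, 3, 5, 7]                      # illustrative context word indices
target = 4                                  # illustrative center word index

v_prime = W_in[context].mean(axis=0)        # look up and average context embeddings
scores = v_prime @ W_out                    # project onto vocabulary scores
probs = np.exp(scores - scores.max())
probs /= probs.sum()                        # softmax

# Cross-entropy gradient, then update both matrices
d_scores = probs.copy()
d_scores[target] -= 1.0
lr = 0.1
d_v = W_out @ d_scores
W_out -= lr * np.outer(v_prime, d_scores)
W_in[context] -= lr * d_v / len(context)    # gradient splits over the averaged words
```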
Transposed convolution output dimension calculation
o = (i - 1) * s + k - 2p, where i is the input size, s the stride, k the kernel size, and p the padding (assuming dilation 1 and no output padding)
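A quick sanity check of the formula, assuming PyTorch is available; the sizes are arbitrary examples.

```python
import torch
import torch.nn as nn

i, k, s, p = 8, 3, 2, 1                     # example input size, kernel, stride, padding
x = torch.randn(1, 1, i, i)
deconv = nn.ConvTranspose2d(1, 1, kernel_size=k, stride=s, padding=p)

expected = (i - 1) * s + k - 2 * p          # (8-1)*2 + 3 - 2*1 = 15
print(deconv(x).shape[-1], expected)        # both print 15
```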
Limitations of one-hot encoding
Produces sparse, high-dimensional vectors and captures no semantic relationships between words
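A small sketch of the problem, using an illustrative toy vocabulary: every pair of distinct one-hot vectors has dot product zero, so related and unrelated words are equally dissimilar, and vector length grows with vocabulary size.

```python
import numpy as np

vocab = ["king", "queen", "apple", "banana", "car"]  # illustrative vocabulary
one_hot = np.eye(len(vocab))                          # one row per word

# "king" is exactly as dissimilar to "queen" as to "car": no semantics captured
print(one_hot[0] @ one_hot[1])   # king . queen -> 0.0
print(one_hot[0] @ one_hot[4])   # king . car   -> 0.0
```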
When does a TF-IDF vector outperform bag of words?
When the discriminative power of a word is crucial and frequent common words are noisy
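A sketch of the contrast, assuming scikit-learn is available; the corpus is a made-up example. TF-IDF downweights "the" (present in every document) while boosting the rarer, more discriminative "refund", whereas bag of words uses raw counts alone.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

docs = ["the service was fine",
        "the delivery was late",
        "the customer wants a refund"]

bow = CountVectorizer().fit(docs)       # raw counts: "the" weighs as much as anything
tfidf = TfidfVectorizer().fit(docs)

print(tfidf.idf_[tfidf.vocabulary_["the"]])     # low idf -> downweighted
print(tfidf.idf_[tfidf.vocabulary_["refund"]])  # high idf -> boosted
```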
Trade-off between CBOW and Skip-gram
CBOW is faster and better for frequent words (averaging the context smooths toward common patterns)
Skip-gram is slower but better for rare words (each context-target pair is a separate training example, giving rare words more signal)
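If gensim is installed, the two architectures are just a flag away from each other; the sentences below are placeholder token lists.

```python
from gensim.models import Word2Vec

sentences = [["the", "cat", "sat"], ["the", "dog", "ran"]]  # illustrative corpus

cbow = Word2Vec(sentences, vector_size=50, min_count=1, sg=0)  # sg=0: CBOW
skip = Word2Vec(sentences, vector_size=50, min_count=1, sg=1)  # sg=1: Skip-gram
```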