Concepts
What’s an n-gram?
A contiguous sequence of n words (or tokens): a 1-gram is 1 word, a 5-gram is 5 consecutive words, etc.
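The definition above can be sketched with a minimal word-level n-gram extractor (assumes whitespace-separated words; real tokenizers are more involved):

```python
# Slide a window of size n over the word list to collect n-grams.
def ngrams(text, n):
    words = text.split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]

print(ngrams("the cat sat on the mat", 2))
# bigrams: ('the', 'cat'), ('cat', 'sat'), ('sat', 'on'), ...
```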
Concepts
What is Tokenization?
Convert raw text into sequence of tokens
Concepts
What happens to punctuation in Tokenization?
Usually kept – each punctuation mark typically becomes its own token
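A toy regex tokenizer illustrates punctuation becoming separate tokens (a sketch only – production subword tokenizers like BPE work differently):

```python
import re

# \w+ matches runs of word characters; [^\w\s] matches any single
# character that is neither a word character nor whitespace,
# so each punctuation mark comes out as its own token.
def tokenize(text):
    return re.findall(r"\w+|[^\w\s]", text)

print(tokenize("Hello, world!"))  # ['Hello', ',', 'world', '!']
```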
Concepts
Is a word just a token?
Not necessarily – some words are split into multiple tokens
Concepts
Example of a single word split into multiple tokens?
“Richard” might become “Rich” + “ard”, since subword tokenizers (e.g. BPE) split rarer words into more frequent pieces
Concepts
What is a Context Window?
Max tokens the model can consider at once.
Concepts
How does Context Window affect chatbots and prompting?
Limits how much conversation history and prompt text can fit into a single model input
Concepts
How does Context Window affect image or video input?
Images and video are also encoded as tokens, so their size counts against the window along with any text context or prompts
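One common way to stay inside a context window is to drop the oldest tokens; a hedged sketch (the window size here is an invented number, and real systems often summarize rather than truncate):

```python
# Keep only the most recent tokens so the input fits the window.
def fit_context(tokens, max_tokens=8):
    return tokens[-max_tokens:]

history = ["sys", "hi", "how", "are", "you", "I", "am", "fine", "thanks", "!"]
print(fit_context(history))  # drops the 2 oldest tokens
```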
Concepts
What’s the relation between tokens and vectors in a vector DB?
A single token (like “cat”) corresponds to a single vector of many float values
Concepts
Why does a single token have a vector with many values?
Captures many types of semantic meaning, sentiment, syntactic role, …
Concepts
Example of how vectors are useful for retrieval in RAG?
Find semantically similar text by searching for the vectors nearest to the query’s vector
Concepts
What’s the technical name for this similarity search between vectors?
k-nearest neighbor
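A brute-force k-nearest-neighbor sketch using cosine similarity (the 2-D vectors and vocabulary are invented for illustration; real embeddings have hundreds of dimensions and vector DBs use approximate indexes):

```python
import math

# Cosine similarity: dot product divided by the product of norms.
def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Score every stored vector against the query, keep the top k.
def knn(query, vectors, k):
    ranked = sorted(vectors.items(),
                    key=lambda kv: cosine(query, kv[1]),
                    reverse=True)
    return [name for name, _ in ranked[:k]]

vecs = {"cat": [1.0, 0.9], "kitten": [0.9, 1.0], "car": [1.0, -0.8]}
print(knn([1.0, 1.0], vecs, 2))  # "cat" and "kitten" beat "car"
```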
Hyperparameters
What are hyperparameters?
Settings that define the model structure, algorithm and process
Hyperparameters
When do you set hyperparameters?
Set before training begins – hyperparameters define how the model learns, and are not themselves learned from the data
Hyperparameters
Example hyperparameters?
Learning rate, batch size, number of epochs
Hyperparameters
What do you get by tuning hyperparameters?
Reduce overfitting, improve accuracy
Hyperparameters
What is Learning Rate?
How large or small the steps are when updating weights during training
Hyperparameters
What happens if you set a small learning rate?
Converges slowly, but the final weights tend to be accurate
Hyperparameters
What happens if you set a high learning rate?
Quicker to converge, but could over-shoot the right weights and not be accurate enough
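The trade-off on the two cards above can be seen with toy gradient descent on f(w) = (w − 3)², where the true optimum is w = 3 (the function and the learning rates are illustrative assumptions):

```python
# Repeatedly step opposite the gradient; step size scales with lr.
def descend(lr, steps=50, w=0.0):
    for _ in range(steps):
        grad = 2 * (w - 3)   # derivative of (w - 3)^2
        w -= lr * grad
    return w

print(descend(0.01))  # small lr: after 50 steps, still far from w = 3
print(descend(0.5))   # moderate lr: converges to w = 3
print(descend(1.1))   # too large: overshoots each step and diverges
```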
Hyperparameters
What is Batch Size?
Number of training examples used to compute one weight update
Hyperparameters
What happens with small batch sizes?
Noisier, less stable updates (each gradient is estimated from a small sample), but more weight updates per pass and often better generalization
Hyperparameters
What happens with large batch sizes?
More stable gradient estimates and faster per epoch, but needs more memory and can generalize worse
Hyperparameters
What is Number of Epochs?
How many times the model will iterate over the entire training set
Hyperparameters
What happens with too few Epochs?
Underfitting – the model hasn’t seen the data enough times to learn its patterns
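How epochs and batch size together shape a training loop can be sketched as follows (the data, sizes, and counter are stand-ins, not a real framework):

```python
data = list(range(10))      # 10 training examples
batch_size = 4              # hyperparameter: examples per weight update
epochs = 3                  # hyperparameter: full passes over the data

updates = 0
for epoch in range(epochs):             # one epoch = one pass over all data
    for i in range(0, len(data), batch_size):
        batch = data[i:i + batch_size]  # one weight update per batch
        updates += 1

print(updates)  # 3 epochs x 3 batches (4 + 4 + 2 examples) = 9 updates
```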