What is a GAN?
A generative model with two neural networks — a Generator (creates fake data) and a Discriminator (detects fake vs real) — trained in competition to produce realistic synthetic data (e.g., images).
Key AWS-relevant idea about GANs?
Used for synthetic data generation, image creation, and data augmentation in ML workflows.
What is a VAE?
A generative model that encodes input data into a probability distribution (latent space) and then decodes samples from that distribution to generate new data.
GAN vs VAE difference?
VAEs optimize probabilistic reconstruction (more stable training), while GANs use adversarial training (often sharper outputs).
What is a Diffusion Model?
A generative model that gradually adds noise to data and then learns to reverse the noise process to generate high-quality synthetic outputs.
Why diffusion model is important?
Powers modern image generators (e.g., Stable Diffusion-type models). Known for stability and high-quality outputs.
What is RAG?
A technique that combines retrieval systems (e.g., vector databases) with LLMs to fetch relevant documents before generating a response.
Why use RAG?
Improves accuracy, reduces hallucinations, and enables models to use up-to-date or private enterprise data.
What is BERT?
BERT (Bidirectional Encoder Representations from Transformers) is a transformer-based language model that understands context by reading text bidirectionally (left and right).
What is BERT mainly used for?
Text classification, sentiment analysis, question answering, and named entity recognition.
What is ROUGE?
A metric used to evaluate text summarization by measuring overlap between generated text and reference summaries (recall-focused).
What does ROUGE measure?
N-gram overlap, longest common subsequence, and recall similarity.
What is BLEU?
A metric used to evaluate machine translation by measuring precision-based n-gram overlap between generated and reference text.
BLEU vs ROUGE difference?
BLEU focuses on precision (common in translation). ROUGE focuses on recall (common in summarization).
What is SVD?
A matrix factorization technique that decomposes a matrix into three matrices: A = U Σ V^T
Why is SVD important in ML?
Used for dimensionality reduction, noise reduction, and recommendation systems.
Where is SVD commonly applied in NLP?
In Latent Semantic Analysis (LSA).
What is Word2Vec?
A neural network model that learns dense vector representations (embeddings) of words based on context.
Two Word2Vec architectures?
Word2Vec Key idea?
Words appearing in similar contexts have similar vector representations.
What is PCA?
A dimensionality reduction technique that transforms data into new orthogonal axes (principal components) that maximize variance.
Core idea formula:
Maximize Variance = λ
Why use PCA?
Reduce features while retaining most important information.
What is LSA?
An NLP technique that uses SVD on a term-document matrix to uncover hidden (latent) relationships between words and documents.