Block 1 Flashcards

(114 cards)

1
Q

What is the relationship hierarchy between AI, Machine Learning, and Deep Learning?

A

AI is the overarching field; Machine Learning is a subset of AI; Deep Learning is a specialized subset of ML.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the primary characteristic of Machine Learning (ML) regarding programming?

A

ML allows systems to learn from data and improve performance without being explicitly programmed for every rule.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which technology simulates human intelligence to solve problems, recognize images, and write text?

A

Artificial Intelligence (AI).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What specialized field of ML uses multi-layered neural networks inspired by the human brain?

A

Deep Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What capability allows computers to derive insights and make decisions based on visual inputs like images and videos?

A

Computer Vision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which technology enables computers to interpret, manipulate, and comprehend human language (e.g., sentiment analysis)?

A

Natural Language Processing (NLP).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three types of layers found in a Neural Network?

A

Input layer, Hidden layer(s), and Output layer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the process called where a neural network learns from mistakes by adjusting weights to reduce error?

A

Backpropagation (or Backward Propagation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the term for using a trained ML model to make predictions on new, unseen data?

A

Inference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which inference type processes data points individually as they arrive, providing immediate, low-latency predictions?

A

Real-time (or Online) Inference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which inference type is best for processing large datasets all at once (e.g., nightly reporting) where immediate results are not critical?

A

Batch (or Offline) Inference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What distinguishes Supervised Learning from other ML methods?

A

It requires a labeled dataset (input paired with correct output) to train the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which ML method is used for finding patterns in unlabeled data, such as customer segmentation or anomaly detection?

A

Unsupervised Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which ML method involves training a model to make sequences of decisions through a system of rewards and penalties?

A

Reinforcement Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is “Labeled Data”?

A

Data that includes both the input features and the corresponding correct output (or target).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What type of data typically lacks a predefined model, such as text documents, images, and audio files?

A

Unstructured Data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What specific architecture do Large Language Models (LLMs) use to understand and generate human-like text?

A

Transformer architecture.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is “Bias” in the context of an AI model?

A

Systematic errors in a model that result in unfair, inaccurate, or discriminatory outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the term for how well a model learns patterns in training data to generalize to new data?

A

Fit (e.g., Underfitting or Overfitting).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the definition of a “feature” in a dataset?

A

An individual measurable property or characteristic of the data being observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Identify a scenario where AI/ML solutions are generally NOT appropriate.

A

When a specific, deterministic outcome is needed instead of a prediction, or when data is insufficient/poor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is “Accountable AI”?

A

The requirement for human oversight in AI decisions with significant consequences (e.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which ML technique is used to predict numerical outcomes, such as stock prices or house values?

A

Regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Which Amazon SageMaker algorithm trains multiple models in parallel to optimize regression predictions?

A

Linear Learner algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Which AWS service enables users to run ML prediction queries directly within their data warehouse using SQL?
Amazon Redshift ML.
26
Spam detection and sentiment analysis are examples of which Machine Learning method?
Classification (Supervised Learning).
27
Which Unsupervised Learning technique discovers patterns or groupings (like customer segmentation) without predefined labels?
Clustering.
28
What are the four core functional stages of the Amazon SageMaker workflow?
Prepare, Build, Train & Tune, Deploy & Manage.
29
What is the "Amazon SageMaker Lakehouse" architecture?
A hybrid data management architecture that combines the scalability of data lakes with the performance of data warehouses.
30
Which Amazon SageMaker feature provides a managed Jupyter server for running code and experimenting with models?
SageMaker Notebooks.
31
How does we distinguish between Amazon SageMaker and Amazon Bedrock?
SageMaker is for building models from scratch ("building the robot"), while Bedrock is for deploying pre-trained Large Language Models (LLMs).
32
In the context of SageMaker architecture, what is "Boto3"?
The AWS Software Development Kit (SDK) for Python used to interact with AWS services.
33
What policy language can be used with AWS Verified Access to create custom, zero-trust API permissions?
CEDAR language.
34
What is the primary interface for building, training, and deploying ML models in the SageMaker ecosystem?
Amazon SageMaker Studio (referred to as SageMaker Unified Studio in newer iterations).
35
Which managed SageMaker component is used to leverage optimized algorithms for tasks like image classification without managing infrastructure?
SageMaker Managed Containers.
36
What is the primary function of Amazon Transcribe?
It is a fully managed Automatic Speech Recognition (ASR) service that converts speech to text.
37
Which Amazon Transcribe feature allows it to distinguish between different speakers in an audio file?
Automatic speaker identification (diarization).
38
If a developer needs to remove personally identifiable information (PII) from an audio stream automatically, which service should they use?
Amazon Transcribe (using its redaction feature).
39
What is the primary difference between Amazon Transcribe and Amazon Polly?
Amazon Transcribe converts speech-to-text (ASR), whereas Amazon Polly converts text-to-speech (Voice Generation).
40
Which service uses "Neural Machine Translation" to provide high-quality, fast language translation?
Amazon Translate.
41
How can Amazon Translate be configured to correctly translate industry-specific terminology or brand names?
By using Custom Vocabulary.
42
Which AWS service is best suited for analyzing customer reviews to determine if the sentiment is positive, negative, neutral, or mixed?
Amazon Comprehend.
43
What specific feature of Amazon Comprehend is used to identify people, places, brands, and dates within a text document?
Entity Recognition.
44
What AWS service allows developers to build conversational interfaces (chatbots and voice bots) using voice and text?
Amazon Lex.
45
What specific AI technology does Amazon Lex use to determine the "intent" behind a user's input?
Natural Language Understanding (NLU).
46
Which AWS cloud-based contact center service integrates natively with Amazon Lex to create automated customer service flows?
Amazon Connect.
47
What is the primary use case for Amazon Polly?
To turn text into lifelike, natural-sounding speech (Text-to-Speech).
48
Which service enables the creation of audio for media from text (e.g., converting articles to speech) at a low cost?
Amazon Polly.
49
Is Amazon Polly an open-source framework that developers can download and modify?
No, it is a fully managed cloud AI service accessed via API.
50
Which service is capable of "transliterating" media in real-time?
Amazon Transcribe.
51
To detect the dominant language in a document before translating it, which service API can be called?
Amazon Comprehend (or Amazon Translate, which also has automatic language identification).
52
What is the phase in the ML pipeline involved in analyzing and visualizing data to detect anomalies and understand its structure?
Exploratory Data Analysis (EDA).
53
Which AWS service is used to label datasets for supervised learning tasks?
Amazon SageMaker Ground Truth.
54
What is the process of creating new input variables from existing data to improve model performance called?
Feature Engineering.
55
What is "Data Preprocessing"?
Transforming raw data into a clean format (cleaning, normalization, encoding) suitable for ML training.
56
What are the settings that control the training process (e.g., learning rate, batch size) which are set before training begins?
Hyperparameters.
57
Which Hyperparameter Tuning strategy involves systematically exploring a predefined set of values?
Grid Search.
58
Which Hyperparameter Tuning strategy treats tuning as a regression problem to find optimal values?
Bayesian Optimization.
59
What is the term for taking a pre-trained model (like BERT or ResNet) and fine-tuning it on a new, specific dataset?
Transfer Learning.
60
What is the primary benefit of using a Managed API Service (like SageMaker Endpoint) over a self-hosted API?
It handles underlying infrastructure, automatic scaling, and security patches automatically.
61
Which deployment method gives the user full control over the hardware, software, and network configuration (e.g., on EC2)?
Self-hosted API.
62
Which Amazon SageMaker tool provides a low-code visual interface to import, clean, transform, and visualize data?
Amazon SageMaker Data Wrangler.
63
What is the specific purpose of the "Offline Store" in Amazon SageMaker Feature Store?
Storing historical feature data for model training and batch inference.
64
What is the specific purpose of the "Online Store" in Amazon SageMaker Feature Store?
Providing low-latency access to features for real-time inference.
65
What critical issue does SageMaker Feature Store help prevent by ensuring features are consistent between training and inference?
Training-serving skew.
66
Which tool captures data from incoming requests and predictions to compare them against a training baseline?
Amazon SageMaker Model Monitor.
67
What is "Data Drift"?
A change in the statistical properties of input data over time, causing model performance to degrade.
68
Which SageMaker feature automates the end-to-end ML lifecycle (workflow orchestration)?
Amazon SageMaker Pipelines.
69
What is the "Validation Dataset" used for?
Assessing the model's performance on unseen data during the training phase to tune hyperparameters.
70
Which SageMaker tool centralizes model tracking and versioning?
Amazon SageMaker Model Registry.
71
What is "Imputation" in the context of Data Cleaning?
The process of replacing missing values in a dataset with substituted values (e.
72
What is the basic unit of text (word, part of a word, or character) that a Generative AI model processes and generates?
Token
73
What is the process of breaking down large documents into smaller, manageable sections to improve data processing and retrieval?
Chunking
74
What are the numerical representations of real-world objects or data points that capture relationships and meanings for AI systems?
Embeddings (or Vectors)
75
What is the term for natural language text used to request a specific task from a Generative AI model?
Prompt
76
What is the iterative process of refining inputs (choosing specific words, formats, and phrases) to guide a Generative AI model to the desired output?
Prompt Engineering
77
Which neural network architecture uses "self-attention" to process entire sequences of text in parallel and handle long-range dependencies?
Transformer-based Large Language Model (LLM)
78
What type of model is trained on massive amounts of generalized, unlabeled data (like the World Wide Web) and serves as a starting point for various downstream tasks?
Foundation Model (FM)
79
Which type of AI model can process and integrate multiple data types simultaneously, such as text, images, audio, and video?
Multi-modal model
80
Which type of Generative AI model creates photorealistic images by gradually removing "Gaussian noise" through a reverse process?
Diffusion Model
81
What is "Gaussian Noise" in the context of training image generation models?
Random noise (white noise) following a normal distribution added to images so the model learns to reconstruct them.
82
What is "Planogram Optimization" in retail AI use cases?
Using Generative AI to dynamically update virtual shelf layouts based on sales trends and inventory.
83
What are the seven stages of the Foundation Model Lifecycle in order?
Data Selection, Model Selection, Pre-training, Fine-tuning, Evaluation, Deployment, Feedback
84
Which data preparation technique involves enhancing a dataset by adding variations (e.g., different formats) to improve model robustness?
Data Augmentation
85
Which data preparation technique involves ensuring the dataset represents different categories equally to prevent model bias?
Data Balancing
86
What is the "Context Window" of a Foundation Model?
The maximum number of input tokens the model can consider at one time when making predictions.
87
Which AWS Marketplace model provider is known for "Jurassic" models focused on language generation?
AI21 Labs
88
Which AWS Marketplace model provider is known for "Claude" models, which focus on AI safety and alignment?
Anthropic
89
Which model family is specifically developed by AWS for a range of applications?
Amazon Titan
90
What distinguishes "Fine-tuning" from "Pre-training"?
Fine-tuning trains a pre-existing model on a smaller, labeled, task-specific dataset, whereas pre-training uses massive generalized datasets.
91
What is "Transfer Learning"?
Leveraging a pre-trained model and fine-tuning it on a specific task to reduce training time and resources.
92
Which regularization technique involves randomly ignoring neurons during training to prevent overfitting?
Dropout
93
What is "ROUGE" or "BLEU" typically used for in AI?
Automatic Evaluation metrics (commonly for text summarization and translation).
94
Which evaluation type assesses a model's ability to handle noisy, adverse, or adversarial inputs?
Robustness Testing
95
Which metric measures the harmonic mean of Precision and Recall?
F1 Score
96
Which metric specifically measures how well a probability distribution or language model predicts a sample?
Perplexity
97
What does a "Confusion Matrix" visualize?
The performance of a classification model by showing True Positives, False Positives, True Negatives, and False Negatives.
98
What are the three primary advantages of Generative AI described in this lesson?
Adaptability, Responsiveness, and Simplicity (in deployment and use).
99
Which GenAI advantage refers to the ability to be fine-tuned for industry-specific language or personalized user preferences?
Adaptability.
100
What is the main phenomenon where an AI system generates confident outputs that are factually incorrect, fabricated, or unreliable?
AI Hallucination.
101
What is the root cause of AI hallucinations?
LLMs predict responses based on probabilistic patterns in training data rather than having a true understanding of facts, leading to fabrications.
102
What specific AWS tool, used in Amazon Bedrock Guardrails, employs mathematical and logic-based verification to combat hallucinations?
Automated Reasoning checks.
103
What does the "Interpretability" disadvantage refer to?
The challenge in understanding and explaining how "black box" models (like generative adversarial networks or large language models) make decisions.
104
What is "Nondeterminism" in GenAI?
The characteristic of a system that can produce different outputs from the exact same input under varying conditions.
105
What is the primary operational risk posed by Nondeterminism?
It compromises the need for consistent, repeatable outcomes, making transparency and reliability difficult.
106
What is the first crucial step when selecting the appropriate Generative AI model type for a project?
Identifying the Use Cases (e.g., text summarization, image generation, etc.).
107
Which AWS service provides a Generative AI Best Practices Framework to help organizations continuously monitor and measure their compliance posture?
AWS Audit Manager.
108
What metric evaluates how well a GenAI model can perform across different domains or types of tasks?
Cross-Domain Performance (Generalization).
109
Which business metric measures the effectiveness of AI in influencing user behavior to achieve specific goals, such as sign-ups or purchases?
Conversion Rate.
110
What is "Ground Truth" in evaluation?
A set of known, correct answers used as a reliable benchmark against which AI-generated outputs are compared to determine accuracy.
111
Which three specific evaluation algorithms are mentioned in the course for measuring the accuracy of generated text?
BERTScore, ROUGE, and F1 Score.
112
Which specific evaluation metric is superior at measuring the semantic similarity (meaning) between generated text and the ground truth, even with paraphrasing?
BERTScore.
113
Which evaluation metric is the standard choice for measuring the recall of overlapping content (n-gram overlap) and is typically used for summarization tasks?
ROUGE (Recall-Oriented Understudy for Gisting Evaluation).
114
Which financial metric measures the average revenue generated from each active user over a specific time period?
Average Revenue Per User (ARPU).