AWS ML, AI, & Data Analytics Flashcards

(23 cards)

1
Q

What is Machine Learning (ML)?

A

Technology that trains algorithms with data to make predictions/decisions without being explicitly programmed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Artificial Intelligence (AI)?

A

Broader field simulating human-like cognitive abilities (learning, reasoning, recognition). ML is a subset of AI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name 4 types of ML data sets

A

Supervised (labeled data), Unsupervised (find patterns in unlabeled data), Semi-supervised (mix of both), Reinforcement learning (trial-error with rewards).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Difference AI vs ML (exam level)?

A

AI = goal of simulating human intelligence. ML = practical approach using algorithms/data to achieve that intelligence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

AWS service for building, training, and deploying ML models?

A

Amazon SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Use cases of SageMaker

A

Fraud detection, churn prediction, personalized recommendations, document analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which AWS AI service builds chatbots/voice assistants?

A

Amazon Lex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Core concept of Lex (exam tip)?

A

Conversational interface → intents, utterances, slots, fulfillment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

AWS AI service for enterprise search with NLP and LLM integration?

A

Amazon Kendra

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Use cases of Kendra

A

Intelligent search across documents (S3, Salesforce, Slack), FAQs, employee productivity, self-service bots.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

AWS service to run SQL queries on S3 data lakes?

A

Amazon Athena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Athena is…

A

Serverless, interactive analytics tool (analyze data in S3 with SQL/Python).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Common use case of Athena

A

Query raw data in S3 without ETL, ad-hoc analytics, BI queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

AWS service for real-time streaming analytics?

A

Amazon Kinesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Kinesis products (exam tip)

A
  1. Video Streams (stream video)
  2. Data Streams (real-time data capture)
  3. Firehose (load into AWS stores like S3/Redshift)
  4. Data Analytics (analyze streams in SQL/Flink).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

AWS service for ETL (Extract, Transform, Load)?

17
Q

What is AWS Glue used for?

A

Discover, prepare, and integrate data across sources; build data catalog; run serverless ETL jobs into data lakes/warehouses.

18
Q

Glue + Athena relationship

A

Glue catalogs & prepares data → Athena queries it serverlessly with SQL.

19
Q

AWS service for Business Intelligence (BI) dashboards?

A

Amazon QuickSight

20
Q

Benefits of QuickSight

A

Interactive dashboards, paginated reports, embedded analytics, NLQ (“Q”), scales to 1000s of users, pay-per-session.

21
Q

AWS service for Big Data frameworks like Spark, Hive, Presto?

A

Amazon EMR (Elastic MapReduce)

22
Q

EMR is used for…

A

Processing petabytes of data, big data analytics, data pipelines, ML preprocessing using open-source frameworks.

23
Q

EMR vs SageMaker (exam difference)

A

EMR = process big data at scale (data engineering). SageMaker = train & deploy ML models (data science).