What is Machine Learning (ML)?
Technology that trains algorithms with data to make predictions/decisions without being explicitly programmed.
What is Artificial Intelligence (AI)?
Broader field simulating human-like cognitive abilities (learning, reasoning, recognition). ML is a subset of AI.
Name 4 types of ML data sets
Supervised (labeled data), Unsupervised (find patterns in unlabeled data), Semi-supervised (mix of both), Reinforcement learning (trial-error with rewards).
Difference AI vs ML (exam level)?
AI = goal of simulating human intelligence. ML = practical approach using algorithms/data to achieve that intelligence.
AWS service for building, training, and deploying ML models?
Amazon SageMaker
Use cases of SageMaker
Fraud detection, churn prediction, personalized recommendations, document analysis.
Which AWS AI service builds chatbots/voice assistants?
Amazon Lex
Core concept of Lex (exam tip)?
Conversational interface → intents, utterances, slots, fulfillment.
AWS AI service for enterprise search with NLP and LLM integration?
Amazon Kendra
Use cases of Kendra
Intelligent search across documents (S3, Salesforce, Slack), FAQs, employee productivity, self-service bots.
AWS service to run SQL queries on S3 data lakes?
Amazon Athena
Athena is…
Serverless, interactive analytics tool (analyze data in S3 with SQL/Python).
Common use case of Athena
Query raw data in S3 without ETL, ad-hoc analytics, BI queries.
AWS service for real-time streaming analytics?
Amazon Kinesis
Kinesis products (exam tip)
AWS service for ETL (Extract, Transform, Load)?
AWS Glue
What is AWS Glue used for?
Discover, prepare, and integrate data across sources; build data catalog; run serverless ETL jobs into data lakes/warehouses.
Glue + Athena relationship
Glue catalogs & prepares data → Athena queries it serverlessly with SQL.
AWS service for Business Intelligence (BI) dashboards?
Amazon QuickSight
Benefits of QuickSight
Interactive dashboards, paginated reports, embedded analytics, NLQ (“Q”), scales to 1000s of users, pay-per-session.
AWS service for Big Data frameworks like Spark, Hive, Presto?
Amazon EMR (Elastic MapReduce)
EMR is used for…
Processing petabytes of data, big data analytics, data pipelines, ML preprocessing using open-source frameworks.
EMR vs SageMaker (exam difference)
EMR = process big data at scale (data engineering). SageMaker = train & deploy ML models (data science).