Big Data & Machine Learning Flashcards

Question 1

Q

Why do we need machine learning in big data analytics?

Answer

A

The increasing volume of data requires efficient and automated ways to analyze it, which machine learning offers.

Question 2

Q

According to machine learning taxonomy, what are the 3 types of learning?

Answer

A

1- Supervised Learning.
2- Unsupervised Learning.
3- Reinforcement Learning.

Question 3

Q

What is supervised learning?

Answer

A

A machine learning technique in which the model is trained on a labeled training set.

Question 4

Q

What are the most popular supervised learning tasks?

Answer

A

1- Regression
2- Classification

Question 5

Q

What is unsupervised learning?

Answer

A

A machine learning technique in which the model is trained on an unlabeled training set.

Question 6

Q

What are the most popular unsupervised learning tasks?

Answer

A

1- Clustering
2- Anomaly Detection
3-Dimensionality Reduction
4- Association Rule Learning

Question 7

Q

What is clustering?

Answer

A

A process of grouping similar data points into clusters without the need to label the data.

Question 8

Q

What is anomaly detection?

Answer

A

A process of detecting deviations from normal data behavior.

Question 9

Q

What is dimensionality reduction?

Answer

A

A process of reducing the number of dimensions while retaining as much relevant information as possible without the need of labels.

Question 10

Q

What is association rule learning?

Answer

A

discovering interesting relationships between variables in large datasets.

Question 11

Q

What is reinforcement learning?

Answer

A

A machine learning technique in which the agent learns by making decisions and getting a reward if the decision is correct and a penalty if the decision is wrong.

Question 12

Q

What are the problems caused by high-dimensional data?

Answer

A

It make it difficult for the model to find patterns.

Big Data & Machine Learning Flashcards

(12 cards)