Big Data & Machine Learning Flashcards

(12 cards)

1
Q

Why do we need machine learning in big data analytics?

A

The increasing volume of data requires efficient and automated ways to analyze it, which machine learning offers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

According to machine learning taxonomy, what are the 3 types of learning?

A

1- Supervised Learning.
2- Unsupervised Learning.
3- Reinforcement Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is supervised learning?

A

A machine learning technique in which the model is trained on a labeled training set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the most popular supervised learning tasks?

A

1- Regression
2- Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is unsupervised learning?

A

A machine learning technique in which the model is trained on an unlabeled training set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the most popular unsupervised learning tasks?

A

1- Clustering
2- Anomaly Detection
3-Dimensionality Reduction
4- Association Rule Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is clustering?

A

A process of grouping similar data points into clusters without the need to label the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is anomaly detection?

A

A process of detecting deviations from normal data behavior.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is dimensionality reduction?

A

A process of reducing the number of dimensions while retaining as much relevant information as possible without the need of labels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is association rule learning?

A

discovering interesting relationships between variables in large datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is reinforcement learning?

A

A machine learning technique in which the agent learns by making decisions and getting a reward if the decision is correct and a penalty if the decision is wrong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the problems caused by high-dimensional data?

A

It make it difficult for the model to find patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly