Data Analysis Flashcards

(21 cards)

1
Q

What is descriptive statistical analysis?

A

It uses numbers to describe the qualities of a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is inferential statistical analysis?

A

It draws conclusions about a larger population based on a sample group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is associational statistical analysis?

A

It is used to make predictions and find causation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is predictive analysis?

A

It uses statistical algorithms and machine learning to predict future events and behavior.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is prescriptive analysis?

A

It helps organizations use data to guide decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is exploratory data analysis?

A

It identifies patterns and trends in a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is causal analysis?

A

It determines causation or why things happen, used in quality assurance and investigations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What role does metadata play in unstructured data analysis?

A

It provides information about data for management, storage, and analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Natural Language Processing (NLP)?

A

A machine learning method to analyze the meaning of unstructured text data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are images analyzed in unstructured data?

A

By understanding unstructured information, e.g., diagnosing medical conditions from x-rays.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is supervised machine learning?

A

It requires labeled input and output data for training and is used for classification and prediction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is unsupervised machine learning?

A

It uses raw, unlabeled data to identify patterns and cluster similar data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the main uses of unsupervised machine learning?

A

Clustering datasets, understanding relationships, and initial data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are key differences between supervised and unsupervised learning?

A

Supervised needs labeled data and is used for classification/prediction; unsupervised finds relationships and is less explainable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is data dredging different from data mining?

A

Data dredging lacks a hypothesis and produces patterns by chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Quantitative data, name 4 features

A

Expressed as a numerical value
Analysed using computational techniques and algorithms
Measured objectively
Answers questions like ‘how much’ and ‘how often’

17
Q

Qualitative data, name 4 features

A

Represented as a name or symbol
Organised into themes
Measured subjectively
Answers questions such as ‘why or ‘how’

18
Q

Mean

A

Mathematical average of a range of numbers.

19
Q

Median

A

The midpoint in a range of numbers when in numerical order

20
Q

Mode

A

The most commonly occurring number in the data set.

21
Q

Skewness

A

Skewness indicates how symmetrical a range of numbers is