3.3 - Sampling & Data Exploration Flashcards

Learn how to select representative data samples and explore datasets to discover trends, patterns, and anomalies (12 cards)

1
Q

Why is sampling important in data analysis?

A

Sampling lets analysts draw inferences about a whole population from a subset, without the time and cost of collecting data on every member.

2
Q

What is data exploration?

A

Examining and summarizing data to understand its characteristics, spot patterns, and detect anomalies.

3
Q

Name two common sampling methods.

A

Random sampling, stratified sampling

4
Q

A ______ sample gives every member of the population an equal chance of being selected.

A

Random
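
Example: a minimal sketch of drawing a simple random sample with Python's standard library. The population of 1,000 customer IDs, the sample size of 50, and the seed are illustrative assumptions, not part of the card.

```python
import random

# Hypothetical population: 1,000 customer IDs (illustrative data only).
population = list(range(1, 1001))

# A seeded generator makes the draw reproducible; the seed value is arbitrary.
rng = random.Random(42)

# Random.sample draws without replacement, so every member of the
# population has the same chance of ending up in the sample.
sample = rng.sample(population, k=50)

print(len(sample), "members selected, for example:", sorted(sample)[:5])
```

Sampling without replacement means no member can appear twice, which is usually what a survey-style sample calls for.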

5
Q

A ______ sample ensures all subgroups are proportionally represented.

A

Stratified
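
Example: one way to sketch proportional stratified sampling, assuming each record carries a subgroup label. The helper `stratified_sample`, the region names, and the 10% fraction are hypothetical choices made for illustration.

```python
import random
from collections import Counter, defaultdict

def stratified_sample(records, key, fraction, seed=0):
    """Draw the same fraction from every subgroup (stratum)."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[key(record)].append(record)  # group records by subgroup label

    sample = []
    for members in strata.values():
        # Keep at least one member so small subgroups are not dropped.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population: 600 north, 300 south, 100 west records.
records = (
    [("north", i) for i in range(600)]
    + [("south", i) for i in range(300)]
    + [("west", i) for i in range(100)]
)

sample = stratified_sample(records, key=lambda r: r[0], fraction=0.1)
counts = Counter(region for region, _ in sample)
print(dict(counts))
```

Because each stratum contributes the same fraction, the 6:3:1 split of the population carries over to the sample.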

6
Q

What is one risk of using a non-representative sample?

A

It can lead to biased conclusions that don’t reflect the population accurately.

7
Q

Name one technique for exploring data visually.

A

Examples: histograms, scatter plots, box plots, bar charts.
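
Example: a rough text-only histogram, sketched with the standard library to show what a histogram summarizes. In practice a plotting library would normally be used; the helper `text_histogram`, the ages, and the bin width are assumptions for illustration.

```python
from collections import Counter

def text_histogram(values, bin_width):
    """Return one line per bin, with one '#' per value in that bin."""
    # Map each value to the lower edge of its bin, then count per bin.
    bins = Counter((v // bin_width) * bin_width for v in values)
    lines = []
    for start in sorted(bins):
        end = start + bin_width - 1
        lines.append(f"{start:>3}-{end:<3} {'#' * bins[start]}")
    return lines

# Hypothetical ages of ten survey respondents.
ages = [21, 23, 25, 31, 34, 35, 38, 42, 47, 63]

lines = text_histogram(ages, bin_width=10)
for line in lines:
    print(line)
```

The gap between the 40s bin and the lone value in the 60s is the kind of feature a histogram makes visible at a glance.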

8
Q

Which of these is NOT typically part of data exploration?
A) Summarizing variables
B) Identifying outliers
C) Running a full regression model
D) Checking distributions

A

C) Running a full regression model

9
Q

Looking for unusual values that may indicate errors or interesting cases is called ______.

A

Outlier detection / anomaly detection
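
Example: a minimal sketch of one common outlier rule, the 1.5 × IQR fence. The card does not name a specific method, so this rule, the helper `iqr_outliers`, and the data are assumptions chosen for illustration.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    # quantiles(n=4) returns the three quartile cut points: Q1, median, Q3.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Hypothetical delivery times in minutes; 95 looks suspicious.
values = [12, 14, 15, 15, 16, 17, 18, 19, 21, 95]

outliers = iqr_outliers(values)
print(outliers)
```

A flagged value is only a candidate: it may be a data-entry error or a genuinely interesting case, so it deserves a closer look before it is dropped.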

10
Q

Why is data exploration iterative?

A

Initial exploration may reveal new questions, missing data, or interesting patterns that need deeper investigation.

11
Q

Give an example of a summary statistic used in data exploration.

A

Examples: mean, median, mode, standard deviation, counts.
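
Example: a short sketch computing these statistics with Python's `statistics` module. The values are made-up illustration data.

```python
import statistics

# Hypothetical measurements (illustrative data only).
values = [3, 5, 5, 6, 7, 8, 8, 8, 10]

summary = {
    "count": len(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "mode": statistics.mode(values),
    "stdev": statistics.stdev(values),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Comparing the mean with the median is a quick first check for skew: a large gap between them suggests a long tail or outliers.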

12
Q

How do sampling and exploration help in decision-making?

A

They provide a manageable, representative view of data and highlight trends, patterns, or issues before deeper analysis.
