Why is probability important in data analysis?
It helps quantify uncertainty and make informed predictions and decisions.
What does “uncertainty” mean in data?
Uncertainty is the lack of complete certainty about outcomes or measurements in data.
What is a probability?
A measure of how likely an event is to occur, usually expressed between 0 (impossible) and 1 (certain).
An event that is guaranteed to happen has a probability of ______.
1
An event that cannot happen has a probability of ______.
0
Name one example of probability in everyday data.
Examples: likelihood of rain, customer making a purchase, rolling a 6 on a die.
What is the difference between theoretical and empirical probability?
Theoretical is based on reasoning (e.g., coin toss), empirical is based on observed data (e.g., past sales).
Probability can help analysts manage ______ when making decisions.
Risk / uncertainty
Which of the following is an example of uncertainty in data?
A) Measurement error
B) Random variation
C) Unknown future outcomes
D) All of the above
D) All of the above
What is a simple way to visualize probability distributions?
Examples: histograms, probability density plots, bar charts for discrete outcomes.
Why do analysts need to consider uncertainty when interpreting results?
Ignoring uncertainty can lead to overconfidence and poor decision-making.
Name one method to reduce uncertainty in data analysis.
Examples: increase sample size, improve data quality, use statistical models, gather more information.