WHAT IS STATISTICS
Statistics IS the science of gathering, describing, and
analyzing data
Variable
-value that changes among a group
-numerical or categorical
-of one usually
random variable
values determined by chance
numerical variable
-values with equal units eg. weight in pounds, time in hours
categorical variable
-places person or thing into a category
data
counts, measurements, observations about variables in a group
datum
single value
population
-whole, consists of all things being studied
-particular group of interest
-group i want to know about
sample
-within the whole
-group within the population being studied
-group i do know about
parameter
P for population
-numerical population
-fixed
-parameters unknown
-“the population mean”
statistic
S for sample
-actual numerical sample
-must contain population characteristics (representative sample)
-change with sample
-“the sample mean”
Descriptive Statistics
Definition: Methods for organizing, summarizing, and presenting data.
Purpose: To describe what the data show.
Examples: Mean, median, mode, Standard deviation, range, Graphs
Key Point: They do not go beyond the data at hand
Inferential Statistics
Definition: Methods for making predictions, decisions, or generalizations about a population based on a sample of data.
Purpose: To draw conclusions or test hypotheses.
Examples: Hypothesis testing
Key Point: beyond the observed data.
Probability
-studies randomness
-deals with chance
Qualitative data
-aka categorical
Definition: Describes qualities, characteristics, or categories.
Nature: Non-numerical (though sometimes represented with numbers as labels).
Examples: Hair color, Types of cars, Yes/No responses, gender
Key Point: Answers “what kind?” or “which category?”
Quantitative Data
-aka counted
Definition: Describes quantities or amounts that can be measured or counted.
Nature: Numerical
Examples: Height in cm, Age in years
,Number of pets
Key Point: Answers “how much?” or “how many?”
will qualitive have discrete or continuous
no
discrete variable
Discrete → countable numbers (e.g., number of fries, number of siblings, 0,1,2,3)
-“number of”
Continuous Variable
Continuous → measurable numbers that can take on any value in a range (e.g., height, weight, temperature)
-fractions and decimals
Qualitative data charts
pie and bar:
-side by side pie and bar
-no missing data : other category
-pareto: bars sorted by size
How to select a sample
-must be a representative sample
-has the same relevant characteristics as the defined population and does not favor
-allows people to study a population without studying every single individual in that population.
-is a valuable research tool.
Random sampling
Selected by using chance or
random numbers
- Each individual subject
(human or otherwise) has
an equal chance of being
selected
-Examples: Drawing names from a hat
Systematic Sampling
Select a random starting point and then select every nth subject in the population
Convenience Sampling
Definition: Choosing a sample based on convenience, not random selection.
Downside: Often biased
-examples: A researcher surveys their classmates because they’re nearby.