Statistics (singular)
way of reasoning, along with a collection
of tools and methods, designed to help us understand
the world with the use of data.
statistics (plural)
calculations made from
data.
data
values with a context.
Why we need to collect data?
to find answers to the questions that cannot be
answered otherwise
or
an experiment usually results in different outcomes
(variability) and we want to study the reason(s) for the
variability
population
entire collection of individuals
sample
subset of the population, selected for study
in some prescribed manner
parameter
numerical summary that describes a
characteristic of the population (often unknown)
statistic
numerical summary that describes a
characteristic of the sample (is known once the data are
observed)
Subjects or participants –
– people on whom we experiment
Respondents
individuals who answer a survey
Experimental units
animals, plants, and inanimate subjects
Variables
characteristics recorded about each individual
Types of Variables
Categorical
(Qualitative)
Numerical
(Quantitative)
Categorical/Qualitative Variable
places a subject into one of several groups or
categories (or levels).
Types of Categorical/Qualitative Variable
Nominal
Ordinal
Nominal
the levels have no order
Ordinal
the levels have some order
Numerical/Quantitative Variables
measures a numerical quantity or amount in each subject
types of Numerical/Quantitative Variables
Discrete
Continuous
Discrete
can only take on distinct values
Continuous
can take on any value in a given interval
Census
special sample that includes everyone and “samples” the entire population.
problems with taking a census:
too expensive
Undercoverage (may not actually include everyone) Too time-consuming
sample statistics
summaries that are found from data in a sample