Module 1 Flashcards

(56 cards)

1
Q

Statistics (singular)

A

way of reasoning, along with a collection
of tools and methods, designed to help us understand
the world with the use of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

statistics (plural)

A

calculations made from
data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

data

A

values with a context.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why we need to collect data?

A

to find answers to the questions that cannot be
answered otherwise

or

an experiment usually results in different outcomes
(variability) and we want to study the reason(s) for the
variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

population

A

entire collection of individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

sample

A

subset of the population, selected for study
in some prescribed manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

parameter

A

numerical summary that describes a
characteristic of the population (often unknown)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

statistic

A

numerical summary that describes a
characteristic of the sample (is known once the data are
observed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Subjects or participants –

A

– people on whom we experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Respondents

A

individuals who answer a survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Experimental units

A

animals, plants, and inanimate subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Variables

A

characteristics recorded about each individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Types of Variables

A

Categorical
(Qualitative)

Numerical
(Quantitative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Categorical/Qualitative Variable

A

places a subject into one of several groups or
categories (or levels).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Types of Categorical/Qualitative Variable

A

Nominal
Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Nominal

A

the levels have no order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Ordinal

A

the levels have some order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Numerical/Quantitative Variables

A

measures a numerical quantity or amount in each subject

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

types of Numerical/Quantitative Variables

A

Discrete
Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Discrete

A

can only take on distinct values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Continuous

A

can take on any value in a given interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Census

A

special sample that includes everyone and “samples” the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

problems with taking a census:

A

too expensive

Undercoverage (may not actually include everyone) 

Too time-consuming
24
Q

sample statistics

A

summaries that are found from data in a sample

25
types of conclusions you can make with sample statistics.
Population Inference Causal (cause-and-effect) Inference
26
Population Inference
Results from the sample can be generalized to an entire population (as estimates)
27
Causal (cause-and-effect) Inference
The difference in the responses is caused by the difference in treatments when comparing the results from two treatment groups.
28
why is random sampling important?
eliminate the effect of unknown extraneous factors there's no way to fix a biased sample and no way to salvage useful information from it.
29
Random Sampling Methods
Simple Random Samples (SRS) Stratified Random Sampling Systematic Random Sampling Cluster Random Sampling
30
Simple Random Samples (SRS)
each sample of size n in the population has the same chance of being selected.
31
sampling variability
Samples drawn at random generally differ from one another.
32
Stratified Random Sampling
the population is first divided into strata; then take an SRS within each stratum before the results are combined.
33
strata
different homogeneous groups
34
benefits of Stratified Random Sampling
can reduce bias can also reduce the variability of our results.
35
Systematic Random Sampling
Start from a randomly selected individual, then sample every Nth person.
36
benefits of Systematic Random Sampling
Systematic sampling can be much less expensive than true random sampling.
37
when is Systematic Random Sampling useful?
When there is no reason to believe that the order of the list could be associated in any way with the responses sought, systematic sampling can give a representative sample.
38
Cluster Random Sampling
Splitting the population into clusters, select one or a few clusters at random and perform a census within each of them.
39
cluster
Splitting the population into similar groups
40
What type of sampling is this? a store owner decides to randomly test the hygiene of his staff. There are 10 departments in his store, and each department has 20 staff members. He plans to test 40 of his staff members by randomly choosing two departments and check everyone in these two departments
Cluster Random Sampling
41
What type of sampling is this? you want to estimate the proportion of individuals that support federal party X in your area. You set up a booth in your area and ask every 50th person your question.
Systematic Random Sampling
42
What type of sampling is this? you want to estimate the proportion of Canadians that support federal party X based on an appropriate representation from each province. You break up the population by province and select an SRS from each province.
Stratified Random Sampling
43
What type of sampling is this? you put all the names of the individuals in the population in a box and draw names to complete the sample
Simple Random Samples (SRS)
44
Bias
tendency for a sample to differ from the corresponding population in some systematic way.
45
Sources of Bias:
Selection Bias Response Bias Voluntary Response Bias Nonresponse Bias
46
Selection Bias
Undercoverage - when some portion of the population is not sampled at all or has a smaller representation in the sample than it has in the population.
47
Response Bias
anything in the survey design that influences the responses. respondents may lie, especially if asked about illegal or unpopular behavior.
48
Voluntary Response Bias
when individuals can choose on their own whether to participate in the sample.
49
Nonresponse Bias:
when a large proportion of those sampled fail to respond.
50
when can we make Causal (cause-and-effect) Inference?
when we have random allocation
51
Lurking variables
variables that are related to both group membership and to the response. These are other variables that could possibly explain the result.
52
two main types of study designs:
Observational Studies Randomized Experiment
53
Observational Study
investigator observes individuals and measures variables of interest but does NOT attempt to influence the responses.
54
Randomized Experiment
study design that allows us to prove a cause-and-effect relationship. `
55
characteristics of an experiment:
Manipulates factor levels to create treatments. Randomly assigns subjects to these treatment levels. Compares the responses of the subject groups across treatment levels.
56
Random Allocation
the individuals are randomly assigned to different treatment groups