Marginal Distribution of a Variable
A frequency or relative frequency distribution of either the row or column variable in the contingency table.
A marginal distribution removes the effect of either the row variable or the column variable in the contingency table.
Conditional Distribution
A list of the relative frequency of each category of the response variable, given a specific value of the explanatory variable in a contingency table.
Contingency Table
A table that relates two categories of data. Also called a two-way table.
Lurking given by E=μ=np.
An explanatory variable that was not considered in the observational study, but that affects the value of the response variable. In addition, lurking variables are typically related to explanatory variables considered in the study.
Simpson’s Paradox
Describes a situation in which an association between two variables inverts or goes away when a third variable is introduced to the analysis.
Multiplication Rule for Independent Events
If two events E and F are independent, then:
P(EandF) = P(E) ⋅ P(F)
The expected value of a binomial random variable for n independent trials of a binomial experiment with probability of success p
It is given by E = μ = np.
Chi-Square Test for Independence
Used to determine whether there is an association between a row variable and a column variable in a contingency table constructed from sample data.
The null hypothesis is that the variables are not associated, or independent.
The alternative hypothesis is that the variables are associated, or dependent.
Finding the Expected Frequencies in a Chi-Square Test for Independence
Multiply the cell’s row total by its column total and divide this result by the table total. That is,
Chi-Square Test for Independence Using the TI-84 Calculator
Chi-Square Test for Independence or Homogeneity of Proportions
Chi-Square Test for Homogeneity of Proportions
A test of whether different populations have the same proportion of individuals with some characteristic.
Chi-Square Test for Homogeneity of Proportions Using TI-84 Calculator
What if the requirements for performing a chi-square test are not satisfied?
The researcher has one of two options:
(1) combine two or more columns (or rows) to increase the expected frequencies or
(2) increase the sample size.