Pearson Chi-squared test
aka Chi-square test; non-parametric test in two forms:
for goodness of fit:
for independence:
G-test
a goodness of fit test based on likelihood-ratio or maximum likelihood statistical significance test
G^2 = 2 sum_i x_i log(x_i / m_i), where
alternative to Pearson Chi-squared (with usually similar results)
goodness of fit tests
contingency table
inferences on correlations
Pearson–there exists a kind of extension to 1-dim linear regression to test if 2 variables have non-trivial correlation; assumes normal distributions of X and Y
Spearman–transform Spearman’s rank correlation to a value whose point estimate is approximately normal; does not make normality assumptions on X and Y
inference on a population mean
comparing two population means
population proportions, binary case
population proportions, multinomial case
population proportions, comparing more than two populations
Marascuillo procedure
population proportions, fitting to a given distribution
eg we suspect a population follows a Poisson distribution
ANOVA
MANOVA
tests against a suspected distribution
we have samples, and want a test for if those samples “fit” a suspected distribution:
parametric vs non-parametric tests
a parametric test is used when the populations have known distributions; this makes assumptions about the parameters of the population distribution (usually a normal distribution)
non-parametric tests are useful for when the populations in question have unknown distributions (though the test may impose some conditions on the populations or samples)
mixed categorical and numeric cases