Olivia notices that her students are worse than previous students she has taught. Experience has shown that scores obtained in a particular test are normally distributed with mean score 70 and variance 36. When the test is taken by a random sample of 36 students, the mean score is 68.5. A test for whether these students have not performed as well as expected is carried out.
One Sample z-test
large sample (known population variance)
Orla times her 100 m sprint times and assumes they follow a normal distribution. She trains intensively for a week and then runs 100 m on each of 5 consecutive days. A test is carried out to see whether the training has improved her times.
One sample t-test
small sample, unknown population variance
Total Nutrition are investigating the effects of adding certain vitamins to a diet. 64 two-week old rats were given a vitamin supplement in their diet for a period of one month, after which time their masses were noted. A control group of 36 rats of the same age were fed on an ordinary diet and their masses were also noted after one month. A test is carried out to see if the experimental group have a greater mass than the control group.
Two sample z-test
Often called difference of two means, z-test. Large samples
Brighton politicians are conducting a public opinion poll where 1000 randomly chosen electors were asked whether they would vote for the Green Party again at the next election. The Green party believe that 40% of electors would vote for them. A test is carried out to see whether they are overestimating their support.
One sample binomial proportion
Tesco purchase batches of Golden Delicious apples and test for bruising before resale. A random sample of 1000 apples contained 30 which were bruised. A second random sample of 2000 apples contained 78 bruised apples. A test is carried out to determine whether the proportions differ between the samples.
Two sample binomial proportions
Often known as difference of two proportions
Will randomly selects his journey times to work and back. He records his journey times over the course of 8 randomly selected mornings and 10 randomly selected evenings and tests whether there is a difference between his morning and evening commutes. He assumes the distributions of journey times are not normally distributed.
Wilcoxon rank sum
Non-parametric, independent samples
Steve thinks his students are better than previous students he has taught. Experience has shown that scores obtained in a particular test have average score 70, with a skewed distribution. The test is taken by a random sample of 16 students. A test for whether these students have performed better than expected is carried out.
Sign test
Non-parametric, one sample, not symmetrical
Organic nylon is tested for quality by sampling 35 from a large batch and their tensile strength is recorded. The tensile strength of synthetic nylon is 12 400 PSI. A test is carried out to see whether organic nylon is less strong than synthetic nylon.
One sample z-test
large sample, uses CLT
Simon thinks his times for running the 1500 m race follow a skewed distribution. He trains intensively for a week and then runs 1500 m on each of 5 consecutive days. A test is carried out to see whether the training has improved his times.
Sign test
Non-parametric, one sample, not symmetrical
Omar notices his times for running the 100 m race follow a normal distribution with variance 14.2. He trains intensively for a week and then runs 100 m on each of 10 consecutive days. A test is carried out to see whether the training has improved his times.
One sample z-test
small sample from normally distributed population with known variance
When Sam runs a 100 m race, he finds his times follow a symmetrical distribution. He trains intensively for a week and then runs 100 m on each of 10 consecutive days. A test is carried out to see whether the training has improved their times.
Wilcoxon signed-rank
Non-parametric, one sample, symmetrical
Ollie and June notice that certain vitamins may affect the mass of rats. An investigation was carried out to assess the effects of adding certain vitamins to the diet. Three groups were set up: a group of 10 rats received a vitamin supplement for 2 months; a group of 8 rats received a vitamin supplement for 1 month and the usual diet for 1 month; a group of 9 rats received the usual diet for 2 months. A test is carried out to see if there is a difference in masses between the groups.
One-factor ANOVA
More than two categories - one factor (diet)
Greengrocers find produce can sometimes be defective. Samples of size 5 are selected regularly from large batches and tested. During one week 500 samples are taken and the number of defective items in each sample recorded. A test to determine whether a Binomial distribution is a suitable model is carried out.
Goodness of fit
Test for suitability of a model - Binomial
Peter thinks the reaction times are affected by how much fluid is consumed in the day. A sample of 15 volunteers are asked to participate in a reaction test twice; once on a day where they drink 2 litres of water before the test and once on a day where they drink 500 ml of water before the test. The reaction times are then recorded and analysed. It is assumed that the differences in reaction times is normally distributed
Paired t-test
Same volunteers in two conditions (paired), normally distributed
Previous work indicate that reactions are affected by how much fluid is consumed in the day. A sample of 15 volunteers are asked to participate in a reaction test twice; once on a day where they drink 2 litres of water before the test and once on a day where they drink 500 ml of water before the test. The reaction times are then recorded and analysed. It is assumed that the differences in reaction times is not normal, but symmetrically distributed
Paired Wilcoxon signed-rank
same volunteers in two conditions (paired), non-parametric, symmetrical
Previous studies indicate that reactions are affected by how much fluid is consumed in the day. A sample of 15 volunteers are asked to participate in a reaction test twice; once on a day where they drink 2 litres of water before the test and once on a day where they drink 500 ml of water before the test. The reaction times are then recorded and analysed. No assumptions are made about the differences in reaction times
Paired sign test
same volunteers in two conditions (paired), non-parametric, not symmetrical
Callum thinks an investigation should be carried out into whether there is an association between the age of a person and the amount of calories consumed. People from three age categories (20s, 30s and 40s) where asked about their diet and placed into 4 categories (low, medium, high, very high).
Contingency table - chi-squared
Association test; catagorised, qualitative data
Pat presumes that the age of a person is associated with the amount of calories consumed. A sample of 15 people is taken and the age of each person and the average amount of calories they consumed was recorded. It is assumed that the underlying population follows a bivariate normal distribution.
Product moment correlation coefficient (PMCC)
Test for assocaition, bivariate normal data
Sian researched if there is an association between the age of a person and the amount of alcohol consumed. A sample of 12 people is taken and the age of each person and the average amount of alcohol they consumed was recorded. It is assumed that the underlying population does not follow a bivariate normal distribution.
Spearman’s rank correlation coefficient
Test for association, not bivariate normal data
Sarah researched if there is an association between the weight of a person and the amount of alcohol consumed. A sample of 17 people is taken and the weight of each person and the average amount of alcohol they consumed was recorded. No assumptions about the underlying population are made.
Spearman’s rank correlation coefficient
Test for association, not bivariate normal data
General fraud charges are suspected to occur randomly, independently and at a constant average rate. In one year, the Crown Prosecution Service recorded the number of fraud charges to test this theory.
Goodness of fit
Suspect the condidtion of a Poisson - Is it a suitable model?
Garden fires are suspected to be random, independent and at a constant average rate. In one year, 400 gardens are chosen and the time between fires in those gardens are recorded to test the theory.
Goodness of Fit
Suspect the condidtion of a Exponential - Is it a suitable model?
Three age groups are studied to see if there is a difference in the amount of calories consumed. It is also considered that a person’s BMI may have an effect so this is also considered in the analysis.
Two-factor ANOVA
More than two categories for each of two factors (age and BMI)
Pinewood Productions believe that if they spend more on television advertising, the more sales will be made. A sample of 15 days with the amount of expenditure on television advertising and the number of sales made on that day are recorded. A scatter diagram of the data shows an elliptical shape.
Product moment correlation coefficient
Test for association, elliptical shape suggests close to linear therefore bivariate normal