LOS 11a: Define a hypothesis, describe the steps of hypothesis testing, and describe and interpret the choice of the null and alternative hypotheses
LOS11b: Distinguish between one-tailed and two-tailed tests of hypotheses
LOS 11c: Explain a test statistic, Type I and Type II errors, a significance level, and how significance levels are used in hypothesis testing
LOS 11d: Explain a decision rule, the power of a test, and the relation between confidence intervals and hypothesis tests
A hypothesis is a statement about the value of a population parameter developed for the purpose of testing a theory.
Steps to the test
1)The null hypothesis (H0) generally represents the status quo, and is the hypothesis that we are interested in rejecting. This hypothesis will not be rejected unless the sample data provides sifficient evidence to reject it. Can be stated as
2) The alternative hypothesis (Ha) is essentially the statement whose validity we are trying to evaluate. The alternate hypothesis is the statement that will only be accepted if the sample data provides convincing evidence of its truth. Can be stated as:
3) a hypothesis test is always conducted at a particular level of significance. Essentially it involves the comparison of a sample’s test stastic to a critical value
One-Tailed vs Two-Tailed Tests
Under one-tailed tests, we assess whether the value of a population parameter is either greater than or less than a given hypothesized value. They can be stated as:
The following rejection rules apply when trying to determine whether a population mean is greater than the hypothesized value
And the following rejection rules apply when trying to determine whether a population mean is less than the hypothesized value
Under two tailed tests, we asses whether the value of the population parameter is simply different from a given hypothesized value. :
Rejection Rules for a 2-tailed Hypothesis test
Type I and Type II Errors
Two types of errors can be made when conducting a hypothesis test:
The significance level represents the probability of making a Type I error. A 5% significance level means there is a 5% chance of rejecting the null when it is actually true.
If we were to fail to reject the null hypothesis given the lack of overwhelming evidence in favor of the alternative, we risk a Type II error. Sample size and the choice of significance level together determine the probability of Type II error.
the power of a test is the probability of correctly rejecting the null hypothesis when it is false
relationship Between Confidence intervals and Hypothesis tests
LOS 11f: Explain and interpret the p-value as it relates to hypothesis testing
The p-value is the smallest level of significance at which the null hypothesis can be rejected. It represents the probability of obtaining a critical value that would lead to rejectiong of the null hypothesis.
LOS 11e: Distinguish between a statistical result and an economically meaningful result
Even though a trading strategy might provide a statistically significant return of greater than zero, it does not mean that we can gurantee that trading on this strategy would result in economically meaningful positive returns. The returns may not be economically significant after accounting for taxes, transactions costs, and risks.
LOS 11g: Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the population mean of both large and small samples when the population is normally or approximately distributed and the variance is 1) known or 2) unknown
Tests concerning a single mean
In the process of hypothesis testing, the decision whether to use critical values based on the z-distribution of the t-distribution depends on sample size, the distribution of the population and whether the variance of the population is known
The t-test is used when the variance of the population is unknown and either of the conditions below holds
The test statistic for hypothesis test concerning the mean of a single population is:
In a t-test, the sample’s t-stat is compared to the critical t-value with n-1 degrees of freedom, at the desired level of significance.
The z-test can be used to conduct hypothesis tests of population mean when the population is normally distributed and its variance is known
The z-test can also be used when the population’s variance is unknown, but the sample size is large
In a z-test the z-stat is compared to the critical z-value at the given level of significance.
LOS 11h: Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the equality of the population means of two at least approximately normally distributed populations, based on independent random samples with 1) equal or 2) unequal assumed variances
Tests relating to the Mean of Two Populations
Hypotheses describing the tests of means of two populations can be structured as:
Test for means when Population Variances are Assumed Equal
In the case of assumed equal variance, we will used a t-test with a pooled variance. If we are testing if the difference between the means is 0, we have a two-tailed test. If we are testing for greater than it is a one tail test.
So we would find our t-stat range from the table using the appropriate degrees of freedom, and then calculate our t-score for the hypothesis. If the t-score falls in the range of the t-stat, we fail to reject the hypothsis. If it falls outside the range, we reject the hypothesis.
When variances are assumed unequal
Again we would use a t-stat test but this time with variances not pooled
LOS 11i: Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the mean difference of two normally distributed populations.
When the samples of the two populations whose means we are comparing are dependent, the paired comparisons test is used. Dependence can result from events that affect both populations. The hypotheses are structed as:
Where :
This will be a two tailed t-test with paired comparisons
LOS 11j: Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the variance of a normally distributed population
Tests relating to the variance of normally distributed populations can be one-tailed or two-tailed.
One Tailed Tests:
H0 : σ2 <= σ02 vs Ha: σ2 > σ02 To test if population variance is greater than the
hypothesized variance
H0 : σ2 => σ02 vs Ha: σ2 < σ02 To test if pop is less than hypothesized
Two-tailed Tests
H0 : σ2 = σ02 vs Ha: σ2 does not equal σ02
Hypothesis tests for testing the variance of a normally distributed population involve the use of the chi-square distribution. Three important features of the chi-square distribution are:
Chi-square valies correspond the the probabilities in the right tail of the distribution. So if we were doing a two-tailed test at 5% significance, we would want the critical values at the .975 mark and the .025 mark, giving us 5% in total.
The test is calculated as:
LOS 11j: Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the equality of the variance of two normally distributed populations, based on two independent random samples
Hypotheses related to the equality of the variance of two populations are tested with an F-test, which is used under the assumptions that:
One-Tailed tests:
H0 : σ12 <= σ22 vs Ha : σ12> σ22
H0 : σ12 => σ22 vs Ha : σ12< σ22
Two-Tailed Tests:
H0 : σ12 = σ22 vs Ha : σ12 does not equal σ22
The test stat for the F-test is given by:
NOTE- always put the larger variance in the numerator when calculating the F-Test.
Features of the F-Distribution
The rejection region for an F-test, whether its one-tailed or two-tailed, always lies in the right tail
LOS 11k: Distinguish between parametric and nonparametric tests and describe the situations in which the use of nonparametric tests may be appropriate
A parametric test has at least one of the following two characteristics:
A non-parametric test is not concerned with a parameter, and makes only a minimal set of assumptions regarding the population. These are used when:
The Spearman rank correlation coefficient is a non-parametric test that is calculated based on the ranks of two variables within their repective data sets. It lies between -1 and 1 where -1 denotes a perfectly inverse relationship between the ranks of the 2 variables, and 0 represents no correlation.
Often the results of parametric and non-parametric tests are both presented.
Examples: