1B Statistics Flashcards

(77 cards)

1
Q

What is the arithmetic mean and its advantages and disadvantages?

A

Arithmetic mean = sum of all sample values/sample size

Advantage: uses all values in the data, so statistically efficient

Disadvantage: vulnerable to outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the median and its advantages and disadvantages?

A

Median = list all the observations in order. Median is the middle value (for an odd number of observations), or the average of the two middle observations (for an even number of observations)

Advantage: not vulnerable to outliers

Disadvantage: not statistically efficient as it does not make use of all the individual data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the mode? When might it be used?

A

Most common value. Not used much in statistical analysis. Can be used for categorical data to describe the most frequent category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the range and IQR? Why is IQR useful?

A

Range: smallest to largest value

IQR: 25% to 75%. Can be obtained from a list of values by first obtaining the median and then halving the two halves to get LQ and UQ.

Advantage of IQR is that it is not vulnerable to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the standard deviation and variance?

A

The variance is the average squared deviation of each number from the sample mean. It is a statistical measure of spread.

The standard deviation is the square root of the variance. It is also a statistical measure of spread around the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is the standard deviation useful? When should it not be used?

A

NB: the standard deviation is useful in medicine because for data that follow a normal distribution 95% of observations will be within 2 SDs of the mean, known as the reference range.

SD should not be used for skewed data. IQR should be used instead.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define sensitivity and specificity

A

Sensitivity = true positives/all those who have the disease - P(T+|D+)

Specificity = true negatives/all those who do not have the disease - P(T-|D-)

NB these are NOT affected by prevalence since it is a characteristic of the test, not the population. Although for rare diseases can be tricky to accurately measure sensitivity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define false negative rate and false positive rate

A

False negative rate = 1 - sensitivity

False positive rate = 1 - specificity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Outline the rules of elementary probability theorem

A

P(A or B) when mutually exclusive = P(A) + P(B)

P(A or B) when not mutually exclusive = P(A) + P(B) – P(A and B)

P(A and B) = P(A) x P(B|A) = P(B) x P(A|B) (this is conditional probability e.g. probability of getting neuropathy if someone is diabetes)

P(A and B) = P(A) x P(B) (this is for independent events e.g. probability of being blood group O and getting diabetes - assuming they are completely unrelated)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the formula for Bayes’ Theorem?

A

P(A) x P(B|A) = P(B) x P(A|B)

Can be rearranged to give formula for Bayes’ theorem

P (B|A) = [ P(A|B) x P(B) ] / P(A)

Thus, the probability of B given A is the probability of A given B, times the probability of B divided by the probability of A.

This is derived from the multiplication rule for conditional probability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define PPV and NPV

A

PPV = true positive/all those who tested positive

NPV = true negatives/all those who tested negative

NB these values ARE affected by prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does Bayes’ theorem relate to a diagnostic/screening test?

A

From the multiplication rule: P(T+ and D+) = P(T+|D+) x P(D+)

P(T+|D+) is the sensitivity of the test and P(D+) is the prevalence

Diagnostic process can be summarised by Bayes’ Theorem in this way:
P(D+|T+) = [ P(T+|D+) x P(D+) ] / P(T+)

P(D+) is the a priori probability.

P(D+|T+) is the a posteriori probability (aka the probability based on empirical evidence/observation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can Bayes’ theorem be summarised through odds?

A

Bayes theorem can be summarised by:

Odds of disease after test = odds of disease before test x likelihood ratio

Odds before test = The probability that an individual has coronary heart disease, before testing, is 0.70, and so the odds are 0.70/(1-0.70)=2.33 (which can also be written as 2.33:1).

Positive likelihood ratio = Sensitivity / (1 - specificity)

The usefulness of a diagnostic/screening test will depend upon the prevalence of the disease in the population to which it has been applied. In general, a useful test is one which considerably modifies the pre-test probability. If the disease is very rare or very common, then the probabilities of disease given a negative or positive test are relatively close and so the test is of questionable value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can probabilities be used to determine if events are independent or not?

A

If the test and having the diagnosis were completely independent we would expect: P(D+ and T+ )= P(T+) x P(D+).

Therefore we can use this formula [P(D+ and T+)] – [ P(D+) x P(T+)] as a crude estimate of whether events are independent. If there is a difference P(D+ and T+) and P(D+) x P(T+), this suggests the events are NOT independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between independent and mutually exclusive events?

A

Mutually exclusive events cannot happen together (P(A and B) = 0), like flipping heads or tails on one coin, while independent events do not affect each other’s probability, like flipping two coins (P(A and B) = P(A) * P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is sampling error?

A

The uncertainty, caused by observing a sample rather than the whole population.

Generally, the larger the sample, the smaller the sampling error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the standard error?

A

The standard error estimates how precisely a population parameter (e.g. mean, difference between means, proportion) is estimated by the equivalent statistic in a sample. (The standard error is a way of measuring the likely sampling error)

The standard error is the standard deviation of the sampling distribution of the statistic.

With normally distributed values and/or large samples, 1.96 SEs around the sample mean produce a range of values which will include the true mean with 95% confidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How do you calculate the standard error of the mean and a confidence interval for the SEM?

How do you calculate the SE for a difference in means and the CI for this?

A

Standard error of the mean = SD/ √𝑛

95% CI = sample mean ± (1.96 x SE)

See this page for formula for SE of difference in means: https://www.healthknowledge.org.uk/index.php/public-health-textbook/research-methods/1b-statistical-methods/methods-quantification-uncertainty

95% CI for difference in means = (mean1 - mean2 ) ± (1.96 x SE of difference in means)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do you calculate the standard error of proportion/percentage?

How do you calculate the CI for a proportion/percentage?

How do you calculate the standard error for a difference in proportions?

How do you calculate the CI for a difference in proportions?

A

95% CI for a proportion = proportion ± (1.96 x SE)

95% CI for a difference in proportions = (p1 - p2) ± (1.96 x SE for the difference)

See this page for SE formulae: https://www.healthknowledge.org.uk/index.php/public-health-textbook/research-methods/1b-statistical-methods/methods-quantification-uncertainty

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What values are used for 90% and 99% CIs?

A

90% = 1.65
99% = 2.58

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Describe interpretation/comparison of two 95% confidence intervals

A

95% confidence intervals do not overlap: Significant difference at the 5% significance level (i.e. strong evidence of a true difference).

95% confidence intervals overlap but the point estimates are outside the confidence intervals of the other: Unclear - Requires calculation of a significance test.

Point estimate of one sample falls within the 95% confidence intervals of the other: No significant difference at the 5% significance level (i.e. no strong evidence of a true difference).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the normal distribution?

A

Normal distribution describes continuous data which have a symmetric distribution, with a characteristic ‘bell’ shape.

Described by μ the population mean, or centre of the distribution, and σ the population standard deviation. It is symmetrically distributed about the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the uses of the normal distribution?

A
  • Describes many variables in biology e.g. height, birthweight, systolic BB
  • Describes the sampling distribution of the mean (this is referring to when you take lots of different samples from a population - this will hold true even if the underlying population data is not itself normally distributed, the sampling distribution of the mean will still be normally distributed)
  • Other distributions approximate the normal distribution when sample sizes are sufficiently large
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the binomial distribution and its uses?

A

Constructed from sample size n and constant true probability π.

It shows the frequency of events with two possible outcomes e.g. success and failure. In this example π would be the treatment success rate.

The distribution will depict the probability of different numbers of successes.

Will approximate normal distribution for large samples.

Uses:
- Discrete data with two outcomes
- Sampling distribution of proportions

NB since a proportion or probability cannot be negative, binomial will not have negative values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is the Poisson distribution and its uses?
Poisson distribution shows the frequency of events over time in which events occur independently. λ is the mean number of events per interval. This is equal to the variance. Example: deaths, hospital admissions X axis shows the number of events (x). Y axis shows the probability of x number of events. For small samples Poisson will be asymmetrical. For large it will approximate normal. Uses: - Used for discrete quantitative data e.g. counts or rates - Since it predicts randomly occurring events, it can be used to determine whether observed events are occurring randomly or not
26
What is the Student's t distribution and its uses?
Continuous probability distribution with a similar shape to the Normal distribution but with wider tails. Uses: - Small sample sizes e.g. n<30
27
What is the chi-squared distribution and it uses?
The chi-squared distribution is continuous probability distribution whose shape is defined by the number of degrees of freedom. It is a right-skew distribution, but as the number of degrees of freedom increases it approximates the Normal distribution. Uses: - Used in chi-squared test
28
What is a Type 1 error?
This is a false positive. It is when the null hypothesis is wrongly rejected i.e. the study shows an effect which in reality does not exist. Denoted by α. It is equivalent to the p-value e.g. 0.05 or 0.001. This is because the definition of the p-value is the probability that the result, or a more extreme result is due to chance. Alpha represents the same thing i.e. that the result is due to chance, not the true effect.
29
What is a Type 2 error?
This is a false negative. It is when the null hypothesis is NOT rejected when it SHOULD HAVE BEEN. The study does not detect a difference that in reality existed. Occurs when sample size too small. Denoted by β
30
Relationship between power and Type 1/2 errors?
Power = 1 - β
31
What do you need to calculate sample size? How do they each affect sample size?
1. Precision/Significance level aka p-value aka Type I error 2. Power (probability that study will be able to detect a true difference, where one exists 3. Clinically meaningful effect size 4. Prevalence. For cohort and intervention studies this is the disease prevalence in the unexposed. For case-control studies it is the prevalence of exposure in controls. But think logically about whether it is relevant to the study design e.g. won't be relevant to a smoking cessation intervention, where everyone already smokes. - Smaller significance level needs higher sample size - Higher power needs higher sample size - Smaller effect size requires larger sample size - Low prevalence requires large sample size
32
What is the issue with multiple testing? When does it happen and how do you prevent it?
Running lots of tests means you will eventually get a clinically significant test even when the difference does not truly exist. For p=0.05 you would imagine 1/20 results would be a false positive. Can then lead to publication bias. Happens when: 1. Many outcomes are tested for significance 2. In a trial, one outcome is tested a number of times during the follow up 3. Many similar studies are being carried out at the same time. Ways to combat: 1. To specify clearly in the protocol which are the primary outcomes (few in number) and which are the secondary outcomes. 2. To specify at which time interim analyses are being carried out, and to allow for multiple testing. 3. To do a careful review of all published and also unpublished studies. Of course, the latter, by definition, are harder to find.
33
What is the Bonferroni correction?
If doing n independent tests one should specify the type I error rate as α/n rather than α.
34
What are dot plots?
Simple data visualisation that displays frequency or distribution by placing dots (representing individual data points) above a labeled number line
35
What are histograms?
Graph of continuous variable, grouped into several non-overlapping and equal intervals. The individual rectangles are called 'bins'. Usually would have 5-15 bins.
36
What is a box and whisker plot? What is it useful for?
Shows the median, LQ, UQ and range. A variation is for the two ends of the whiskers not to be the range, and to have the outliers represented by dots. Useful for: - Showing skew via position of median relative to LQ and UQ - Comparing groups
37
What is a scatterplot useful for? What are its advantages and disadvantages?
Showing relationship between two continuous variables Advantages: - Retain all the data values - Make outliers apparent Disadvantages: - Hard to visualise individual results for very large datasets - Weak relationships may not be apparent
38
What are bar charts used for?
Categorical data. Note there is a gap between each bar. Height of the bar shows frequencies or relative frequencies.
39
What is the correlation coefficient (r) and what do its values mean?
Statistical technique used to measure the strength of linear association between two continuous variables. The correlation coefficient (r) lies between -1 and +1 (inclusive). If r = 1 or -1, there is perfect positive (1) or negative (-1) linear relationship If r = 0, there is no linear relationship between the two variables Conventionally 0.8 ≤ |r| ≤ 1.0 = very strong relationship 0.6 ≤ |r| < 0.8 = strong relationship 0.4 ≤ |r| < 0.6 = moderate relationship 0.2 ≤ |r| < 0.4 = weak relationship 0.0 ≤ |r| < 0.2 = very weak relationship NB correlation only measures LINEAR relationships. A U-shaped relationship may have a correlation of 0.
40
What are the other names for the correlation coefficient?
When calculated using the observed data, it is commonly known as Pearson's correlation coefficient. When using the ranks of the data, instead of the observed data, it is known as Spearman's rank correlation.
41
What is the square of the correlation coefficient?
The square of the correlation coefficient (r2) indicates how much of the variation in variable y is accounted for (or “explained”) by the variable x. For example, if r = 0.7, then r2 = 0.49, which suggests that 49% of the variation in y is explained by x.
42
What is the equation for a regression line for simple linear regression? How is this calculated? How can the equation be used for a statistical test of a linear relationship?
y = a + bx a = constant (y intercept) b = gradient (regression coefficient) The model is fitted by choosing a and b such that the sum of the squares of the prediction errors (the difference between the observed y values and the values predicted by the regression equation) is minimised. This is known as the method of least squares. The method produces an estimate for b, together with a standard error and confidence interval. From this, one can test the statistical significance of b. In this case, the null hypothesis is that b = 0, i.e. that the variation in y is not predicted by x.
43
When is multiple linear regression used?
Can be used for quantitative response variables with either continuous or categorical explanatory variables.
44
When is logistic regression used?
Used when the response variable is binary, being either an event (e.g. death or cure) or no event (e.g. survival or not cured). The explanatory variables can be either binary, ordinal, categorical or continuous.
45
What are the two types of life tables?
1. Cohort life tables These show the probability of death at each age in a described group of individuals that has been followed over time. Cohort life tables are frequently used for survival analyses. This tracks a specific birth year across their life, including projected future mortality improvement. 2. Period life tables These give the current probability of death in given population at different ages. Period life tables are often used in demography. This uses current mortality rates only.
46
What is a survival curve?
- Any curve that shows probability of surviving beyond a given time - Y-axis: survival probability (0–1) - X-axis: time - Can be theoretical, model-based, or empirical
47
What does survival analysis measure?
- Time to event outcome - Follow up times differ - Censoring present
48
What are specific examples of what survival analysis measures?
- Time to death - Time to relapse - Time to first event
49
What is a censored observation and why does it occur?
A censored observation is one where there is incomplete data about when exactly the person experienced the outcome. Censored observations occur in two main ways: 1. Before the study completes, a subject may withdraw, or be lost to follow-up. 2. On completion of the study, subjects who have not yet experienced an event.
50
What is a hazard ratio and how do you interpret?
Hazard = instantaneous event rate at any given time (not a risk, not a probability) Hazard ratio compares hazards between groups HR < 1 → lower event rate in exposed group A hazard ratio of 0.75 suggests a 25% lower event rate at any given time in the intervention group. NB hazard ratio is NOT equivalent to the risk ratio
51
What is a Kaplan-Meier curve specifically?
- A non-parametric estimate of the survival function - Built directly from observed event times - Accounts explicitly for censoring - Stepwise drops occur only at event times NB there is no adjustment for covariates, KM is not regression it is JUST descriptive
52
How to interpret a KM curve?
1. Starting point - All groups start at survival = 1.0 2. Drops in the curve - Each drop = an event - Bigger drops = more events at that time 3. Censoring - Tick marks show censored observations - Censored individuals contribute follow-up time up to that point - The censored individuals are those whose follow-up ended without an event e.g. loss to follow up - Therefore KM curve does not drop because they was no event 4. Separation of curves - Persistent separation suggests a difference in survival experience 5. Number at risk - Reliability decreases as numbers at risk fall - Late divergence should be interpreted cautiously NB they do NOT show hazards or hazard ratios
53
What does the log rank test do (in relation to KM curves)?
- Null hypothesis: no difference in survival functions - Compares observed vs expected events over time - Gives a global p-value NB does not give an effect size, does not adjust for covariates, no info on how big the difference is
54
What are proportional hazards, when are they plausible and when are they violated?
What proportional hazards means - The ratio of hazards between groups is constant over time. - In plain English: One group is consistently “riskier” than the other The relative difference does not change over time When proportional hazards is plausible: - Curves separate early - Remain roughly parallel - Do not cross When proportional hazards is violated: - Curves cross - Separation changes markedly over time - Early benefit disappears or reverses Exam sentence - The roughly parallel separation of the curves suggests that the proportional hazards assumption is reasonable. Or, if violated: - The crossing of the curves suggests that the proportional hazards assumption may not hold.
55
What is Cox regression?
- Semi-parametric statistical method - Used to in survival analysis i.e. for time to event data - Estimates a hazard ratio comparing groups - Allows adjustment for multiple covariates, including changes in co-variates over time - Allows for censoring - Does not involve specifying the baseline hazard
56
What does the hazard ratio from Cox mean and how do you interpret?
What the hazard ratio from Cox means: - The hazard ratio represents the relative event rate at any given time, averaged over the follow-up period, assuming proportional hazards. Example interpretation: - A hazard ratio of 0.70 indicates a 30% lower event rate at any given time in the exposed group, assuming proportional hazards.
57
How do you interpret Cox when proportional hazards may not hold?
If proportional hazards are violated, the Cox model still provides a summary estimate, but the hazard ratio should be interpreted as an average effect over time.
58
What is the difference between KM curves and Cox adjusted curves
KM: - Descriptive - No adjustment for covariates - Stepwise - Non parametric - Difference in survival tested with a log rank test Cox adjusted curves - Model based - Adjust for covariates - Often smooth - Semi-parametric - Can calculate a hazard ratio (effect estimate) and CI and p-value
59
What is heterogeneity?
Heterogeneity means that the results of the included studies are genuinely different from each other beyond what would be expected by random sampling error alone. Put simply, the effect sizes are not all estimating the same underlying “true” effect. Therefore it may not be appropriate to pool them.
60
What are the different types of heterogeneity?
Clinical heterogeneity refers to differences in the specific research question that was studied, such as differences in the eligible populations, in the interventions and controls, and in the outcome measures. Methodological heterogeneity describes a variability in study design and in the risk of bias. This can include differences in the interventions given, and in how the outcomes were defined and measured, as well as variations in the use of blinding and allocation concealment. Such methodological heterogeneity may result in different studies actually measuring slightly different things. Statistical heterogeneity refers to variability in the “true” intervention effects in different studies, and it arises as a consequence of clinical and/or methodological heterogeneity. It results in a variation in effect sizes that are larger than can be expected by chance. Statistical heterogeneity can be identified using Cochran’s Q statistic (a form of chi-squared test of the null hypothesis that the true effect in all included studies are the same), or the I2 test (which uses Cochran’s Q statistic to give a percentage score for heterogeneity, with higher percentages indicating greater heterogeneity).
61
What is Cochran's Q?
This is calculated as the weighted sum of squared differences between the effects from individual studies and the pooled effects from all included studies. The Q statistic has a chi-square distribution with (k-1) degrees of freedom, where k is the number of included studies. The resulting Q statistic can be used to generate a p value for the null hypothesis of no heterogeneity. Note that Cochran’s Q has a low power to detect heterogeneity when the number of studies is small (e.g. < 20), as is the case with most meta-analyses. To compensate for this, a higher significance level may be used to determine statistical significance (e.g. p < 0.10).
62
What is the I-squared statistic?
The I2 statistic estimates the proportion of variation across included studies that is secondary to heterogeneity (rather than chance). It is calculated using the Q statistic. An I2 of zero means that all the variability in effect sizes seen is due to sampling error and not heterogeneity. An I2 value of above 30% may represent at least moderate heterogeneity, but this result needs to be interpreted in context of the actual clinical or methodological features that may have led to the heterogeneity.
63
What is a funnel plot?
A funnel plot is a specific type of scatterplot, plotting the intervention effect sizes from different studies (on the x-axis) against some measure of the study size or precision (e.g. the inverse of standard error, on the y-axis).
64
How can you tell if a funnel plot shows publication bias? If there is evidence of publication bias, what can you do to adjust for this?
Because the precision of the estimate of the effect size increases with the size of the study, the smaller studies will have more widely scattered effect sizes towards the bottom of the scatterplot, and this variability will reduce as the study sizes increase. The premise is that publication bias will result in smaller studies with non-significant outcomes not being published. If publication bias is present it will result in an asymmetric appearance of the funnel plot, with a unilateral gap towards the bottom of the funnel where the results of the small, negative, unpublished studies should have been. Techniques exist to modify summary estimates based on funnel plots using statistical estimation of missing data e.g. “trim and fill”.
65
How does publication bias affect meta-analysis results?
It will result in an overestimation of the true treatment effect.
66
What chi-squared value corresponds to p<0.05
More than 3.84 Chi squared of less than 3.84 is therefore not significant at the 5% level
67
What test is used for categorical paired data and how do you use it?
McNemar's test - This is specifically for paired data e.g. case-control study or repeated measurements in one participant - Data is presented in matched pairs - each cell is a matched pair - Therefore the total of the values is half the sample size - Interested in DISCORDANT PAIRS - Case exposed/control unexposed = r - Case unexposed/control exposed = s OR = r/s McNemar's = (r-s)SQUARED/(r+s)
68
What is absolute risk reduction and relative risk reduction?
Absolute risk reduction = risk in A - risk in B Relative risk reduction = (risk in A - risk in B)/risk in A
69
What is ANOVA and what are the different types?
- Used for comparing the means of an outcome variable across two or more exposures variables, by comparing within-group and between-group variance - It is a special case of multiple regression - Most datasets for which ANOVA is appropriate can be analysed by regression too and yield the same results ONE-WAY ANOVA - One way analysis of variance is used when the exposure groups being compared are defined by one exposure e.g. socioeconomic status --> Assesses how much of the overall variation in the outcome is attributable to differences between the exposure group means --> Compared using an F-test, sometimes called the variance-ratio test --> Where there are only two groups, the one-way ANOVA gives exactly the same result as a t-test TWO WAY ANOVA - Two-way analysis of variance is used when subdivision is based on two factors e.g. age and sex MANOVA (Multivariate analysis of variance) - Comparing two outcome variables simultaneously across exposure sub-categories - E.g. the means of dependent variances reading, writing and maths may be tested across due exposure groups male and female
70
What is the relationship between probability and odds?
Prob (A) = Odds (A)/ (1 + Odds (A)) Odds (A) = Prob (A)/(1 - Prob (A)) Therefore when probability is small, odds and probability will be similar. This is because in this situation (1-Prob A) is close to 1, so you are literally just dividing by close to 1, so you will get a similar number Odds are always bigger than probability, since (1 - Prob) is less than 1 (so you are dividing by a number less than one)
71
Assumptions of the unpaired t-test
- Sampling distributions are normally distributed - Data are measured at the interval, or ratio level (i.e. they are continuous) - Homogeneity of variance of the populations - Scores are independent - No extreme outliers
72
Assumptions of the paired t-test
- The differences in each pair are normally distributed - Data are measured at the interval, or ratio level (i.e. they are continuous) - Data consist of two categorical related groups e.g. same subject before and after - The pairs (not the individuals within each pair) are independent of each other - No extreme outliers
73
What are the different statistical tests for paired data?
Continuous (normal) outcome: Paired t-test Categorical outcome: McNemar's Ordinal, or continuous non-normal: Wilcoxon
74
What are different parametric tests and their non-parametric equivalents?
Paired t-test --> Wilcoxon Signed Rank Unpaired t-test --> Mann-Whitney U test Pearson --> Spearman ANOVA --> Kruskal-Wallis
75
Why are two-sided p-values used?
Two-sided P values are a test of a non-directional hypothesis that are used when we don't know which way the exposure will influence the outcome (increase vs decrease).
76
Why is log transformation of variables sometimes important?
It would mean that the differences in size of the practice lists (unequal homogeneity of variance) would give rise to unequal standard deviations for those populations (within groups) and lead to an invalid result. Logarithmic transformation of list sizes will reduce the within group error variance.
77
What is the Kruskal-Wallis test?
- Non-parametric equivalent of ANOVA - Rank based test - Used to compare whether two or more independent groups differ