1b Statistics Flashcards

Question

Odds.

Answer 1

If p1 is the probability of a success, the odds is the ratio of the probability of a success to a failure p1/(1-p1).

Answer 2

Used as a summary measure for binary outcomes. If p1 is the probability of a success in one group and p2 the probability of success in another, then the odds ratio is {p1/(1-p1)}/{p2/(1-p2)}.

Answer 3

Used to analyse survival data. The main assumption is if an explanatory variable is binary, then the hazard ratio for this variable is constant over time.

Answer 4

The distribution of a count variable when the probability of an event is constant. The Poisson distribution is used to describe discrete quantitative data such as counts in which the population size n is large, the probability of an individual event is small Typical examples: - number of deaths in a town from a disease per day, - number of admissions to a particular hospital Poisson distribution describes the distribution of binary data from an infinite sample. Thus it gives the probability of getting r events in a population.

Answer 5

Used to analyze data when the outcome variable is a count.

Answer 6

The Power of a test is the probability of declaring a result significant when the null hypothesis is false. It is denoted by 1-beta

Answer 7

PPV is the probability that a subject who tests positive will be a true positive i.e. has the disease and is correctly classified i.e. how good the test is at finding people with disease in screening situation, prevalence is small & PPV low. PPV & NPV both depend on prevalence of disease, sensitivity & specificity PPV=A/(A+B)

Answer 8

NPV is the probability that a subject who is test negative will be a true negative i.e. someone. doesn't have disease and is correctly classified... how good a test is at identifying people without disease NPV=D/(C+D)

Answer 9

A phenomenon when some studies which have been conducted fail to be published. It usually occurs because studies that have positive findings are more likely to be written up and submitted for publication, and editors are more likely to accept them

Answer 10

Assuming that the null hypothesis is true, the p-value is the probability due to chance alone of obtaining a result at least as extreme as the observed result. �calculate using significance test �P < 0.05: considered statistically significant �P > 0.05: result not statistically significant. Two groups are not significantly different and chance can not be excluded as potential explanation of association

Answer 11

A model with more than one random (or error) term. The assumption is that if the study was done again, the terms would estimate different population parameters, in contrast to a fixed effects model. Thus in a longitudinal study, the effect of a patient on the intervention effect is assumed random.

Answer 12

Used as a summary measure for binary outcomes for prospective studies. If p1 is the probability of success in one group and p2 the probability of success in another, the relative risk is p1/p2. If p1 and p2 are the incidences of an event, then the relative risk is also the incidence rate ratio

Answer 13

- A type I error occurs when the null hypothesis is rejected when it is true. A type I error rate is the expected probability of making a type I error, and this should be decided before collecting data. It is essentially the expected **false positive rate (significance level)** of the test and is often denoted by α (usually set at 0.05) - A type two error occurs when a study fails to reject a null hypothesis when it is false, i.e. the alternative hypothesis is true. A type 2 error rate is essentially a **false negative** rate, and is often denoted by β.

Answer 14

Standard error measures how precisely a population measure (eg mean/proportion/rate) is estimated by a sample measure (ie the amount of variability in the sample measure)

Answer 15

Likelihood ratio: sensitivity/(1-specificity) Number of times more likely to have got +ve result when have the disease

Answer 16

A sensitivity analysis varies each input to see which are the most important drivers of the final result

Answer 17

- smaller effects require a **larger sample size** to achieve adequate power, holding all other study components constant. - need more participants in order to detect a small effect. - If the trial effect size is smaller than expected, this would be expected to **decrease the power**

Answer 18

Non-parametric rank-based test for more than 2 independent variables For continuous outcome and unordered categorical exposure Parametric equivalent: one-way ANOVA

Answer 19

Non-parametric rank-based test for 2 independent variables AKA Wilcoxon Rank Sum test Parametric Equivalent: unpaired t test

Answer 20

Non parametric rank test for paired small samples Parametric equivalent: paired t test

Answer 21

- Low power (type 2 errors are more likely). - Calculating confidence intervals is more difficult. - Can only generally be used for simple bivariate analysis (i.e. unable to adjust for con- founding or test for interaction).

Answer 22

1. Independence: observations in samples are independent of other sample. 2. Approximately normal distributed. 3. Homogeneity of Variances: Both samples have approximately the same variance. 4. Random Sampling: Both samples obtained using random sampling method. 5. Data are measured at the interval (or ratio) level

1b Statistics Flashcards

(46 cards)