What are biostatistical methods used for?
They are designed to quantify risks and health outcomes in the population and to establish evidence for associations.
Biostats provides the foundation for evidence based PH
What is statistical bias?
A systematic error in the estimation of a quantity of interest that causes the expected value of a statistic to differ from the true population value.
What is sample selection bias?
When individuals of the sample aren’t representative of the population.
What is a random sample and what are the benefits and limitations?
Every individual in the population has an equal chance of being selected.
Benefits –> representative
Limitation –> not helpful to study rare conditions
Challenge –> getting the data (logistics)
What is population surveillance, what is it used for and what is one example of it?
Systematic collection of data in the population to examine risk behaviors and health trends
Descriptive statistics are computed to summarize the prevalence of risk factors and outcomes
Example: NHANES –> health and nutritional data from interviews and physical exams. yearly cross-sectional capture of the US population. Uses a probability sampling design instead of random sample to get data for specific groups.
How do you account for sampling variability?
Low sampling variability –> repeated sampling from population would yield similar answers
How do you establish an association and what does the strength of the evidence depend on?
As the exposure changes the outcome tends ro change with it
What is statistical power? why is it useful?
The probability that your proposed study will give a statistically significant result.
What is the relationship between measurement error and the strength of evidence?
What has to happen for something to be statistically significant? and what does it depend on?
The change we observe would have to be more extreme than what we would expect to see by chance.
depends on: sample size, magnitude of the chanage and consistency
What is a p value?
P values give the probability of getting the observed value or something more extreme under the assumption that there is no actual effect.
Smaller p - values provide stronger evidence
What is a confounder and what are examples of real confounders that have affected real data?
The effect of a hidden variables (s) that distorts the relationship between exposure and outcome.
How can you eliminate confounding variables from affecting you data?
Difference between statistical and practical significance
Statistical - evidence of a non-zero effect, not necessarily a large effect so it might not have practical significance.
What is a meta-analysis and why is it useful?
When you combine evidence from different studies.
replication of results is important to build strong evidence.
What is the target population?
Group you ideally want to generalize results to.
What is the source population?
Specific individuals from which a representative sample will be drawn - they are at risk of being a case
Study population?
group you can get access to, realistically generalize results too
sample
group you actually ends up in your study analysis