What is Confounding?
–Confounding is bias in the estimation of the effect of exposure on disease occurrence, due to a lack of comparability between exposed and unexposed population.
–Occurs when the substitute population is NOT equivalent to the counterfactual condition.
(AKA, the substitute population does not show the outcome in the exposed population WITHOUT the exposure.)
How do we Identify and Control for Empirical Manifestations of Confounding?
Practically, there is no empirical method for directly examining the correctness of the comparability assumption
We search for differences in the distributions of risk factors among the exposure groups = confounding variables
Interaction
Example : smoking, lung cancer, and asbestos
Occurs when the association between the exposure and the disease varies by levels of a third factor
ie when the magnitude of effect is “modified” by varying levels of a 3rd factor
Ex: the association btwn smoking + lung cancer varies by exposure to asbestos.
—(risk is much higher and smoking + asbestos exposures are present together)
Example of Confounding
smoking, lung cancer, and age
If smokers are older than non-smokers, how would we know whether the observed association between smoking + lung cancer is due to smoking or to age?
(if we have adequately measured confounders in all subjects, we can correct or control for their distorting effect in the analysis)
Comparability-based confounding
The observed value of the outcome measure in the exposed group is compared with the expected value that would have been observed in the exposure group if it had not been exposed.
The unexposed group’s actual outcome is used as a proxy for the exposed group’s unobserved value. When the two groups experiences differ, the groups are noncomparable
=confounding occurs
Collapsibility-based confounding
Confounding is a failure of the estimate for an adjusted effect parameter to equal the estimate for the crude parameter that is obtained when a covariate is ignored (collapsed).
Confounding is equated with non-collapsibility (or change in the estimated parameter)
Properties of a Confounder
Property 1:
A Confounder must be the cause of the disease
Meaning?
Covariate must be a risk factor for the disease in the unexposed base population.
OR, it can be a marker for another, often unmeasured risk factor.
Meaning: the association can be observed in the unexposed group in both cohort and case-control studies.
C is not a confounder if:
Property 2:
A confounder must be associated with the exposure in the base population
Covariate must be associated with exposure status in the total base population.
Property 1 and 2 depend on:
–Prior knowledge of covariate associations or effects in the base population, which may conflict with associations observed in the data
Property 3:
A confounder must not be affected by the exposure or the disease
C is not a confounder if its association with E in the base population is due entirely to the effect of E on C –even if C is a proxy/risk factor for D
C is not a confounder if:
*often requires prior knowledge
Positive v Negative Confounding
When X is a Risk Factor:
When X is a Protective Factor:
Positive confounding: crude OR/RR > Adjusted OR/RR
Negative confounding: crude OR/RR < Adjusted OR/RR
Stratification Methods:
Mantel-Haenszel method
Advantages and Disadvantages of Stratification Methods
Advantage:
-easy to understand and compute
Disadvantage:
How to Identify a Confounder
Identifying a Confounder: Prior Knowledge
Prior knowledge of a causal relationship from previous empirical studies, biologic plausibility, or theories/models
existing DAG’s
Identifying a Confounder: Change in Estimate Strategy
A crude effect estimate is compared to the adjusted effect estimate.
Magnitude of confounding = (crude - adjusted / adjusted)
Methods of Controlling for Confounding
Controlling for Confounding:
Design and Conduct of a Study
Matching
Method used to control for confounders in observational studies
- Restrict eligibility of unexposed subjects by making them similar/comparable to exposed subjects with respect to matching variables (confounders)
Matching in Cohort Studies
Matching in Case-Control Studies
Matching is more common in case-control studies, because there is more likely to be a relative shortage of cases than there is to be a shortage of controls
Controls are matched to the cases
Types of Matching:
Individual matching
Frequency matching
Individual Matching