Methodology Flashcards

(95 cards)

1
Q

Intention to treat (ITT)

A

Analyzes patients according to their original group assignment, regardless of whether they completed the study or adhered to treatment

2
Q

Benefits of ITT

A

Preserves randomization
Minimizes bias
Reflects real-world effectiveness
Avoids overestimating efficacy: gives a robust, conservative estimate
Ensures comparability of groups over time, essential for valid causal inference
Protects the internal validity of the trial and ensures the results are both scientifically rigorous and clinically relevant

3
Q

How does ITT preserve randomization

A

Keeps participants in their originally assigned groups, regardless of adherence or dropout.
This maintains the balance of confounding variables achieved through randomization, preventing unknown systematic bias from dropouts.
Allows a fair and unbiased comparison of treatment effects across groups, regardless of deviations in participant behavior or protocol adherence

4
Q

How does ITT minimize bias

A

Prevents attrition bias (i.e., bias introduced when analyzing only those who complete the study).
Avoids cherry-picking favorable results by including all participants

5
Q

How does randomization reduce bias

A

by assigning treatment independent of potential confounders
Minimized selection bias

5
Q

How does PP analysis overestimate efficacy

A

Per-protocol or as-treated analyses can inflate treatment effect estimates by excluding non-adherent or drop-out participants.
ITT gives a more conservative and robust estimate of the intervention’s impact.
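A minimal simulation makes the inflation concrete. Everything here is hypothetical (a made-up 2-point drug effect and a made-up dropout rule tied to baseline severity); it is a sketch of why excluding non-adherers flatters the drug, not anyone's actual trial:

```python
import random

random.seed(0)
n = 1000
# Hypothetical trial: the drug adds 2 points, but the sickest patients
# (baseline below -0.5) stop adhering and get no benefit.
control = [random.gauss(0, 1) for _ in range(n)]
treated, adherent = [], []
for _ in range(n):
    baseline = random.gauss(0, 1)
    adheres = baseline > -0.5            # non-random dropout: tied to severity
    treated.append(baseline + (2.0 if adheres else 0.0))
    adherent.append(adheres)

def mean(xs):
    return sum(xs) / len(xs)

# ITT: everyone analyzed as randomized (diluted but honest)
itt_effect = mean(treated) - mean(control)
# Per-protocol: silently drops the sickest patients, inflating the effect
pp_effect = mean([y for y, a in zip(treated, adherent) if a]) - mean(control)
print(f"ITT: {itt_effect:.2f}  PP: {pp_effect:.2f}")
```

The PP estimate comes out larger because the excluded patients are systematically sicker, not a random subset.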

6
Q

How does ITT preserve statistical power

A

By maintaining the original, planned sample size throughout the analysis.
Gives a more conservative estimate of treatment effects; assumes dropouts are random (could still be biased if dropout is actually non-random, e.g., due to treatment AEs)

7
Q

Why do you need to do sensitivity analysis in ITT

A

Need to conduct sensitivity analyses to test how robust results are under different assumptions about missing data, dropout, etc.
Use LOCF or multiple imputation to account for missing data
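As a sketch of the simpler of those two options, LOCF fits in a few lines (multiple imputation is more involved); the visit scores below are invented:

```python
def locf(series):
    """Last Observation Carried Forward: replace each missing value (None)
    with the most recent observed one; leading missings stay missing."""
    filled, last = [], None
    for value in series:
        if value is not None:
            last = value
        filled.append(last)
    return filled

visits = [5.0, 4.2, None, None, 3.8, None]   # hypothetical symptom scores
print(locf(visits))  # [5.0, 4.2, 4.2, 4.2, 3.8, 3.8]
```

Note that LOCF assumes no change after dropout, itself a strong assumption, which is exactly why the card calls for testing robustness under several strategies.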

8
Q

Why choose RCT

A

Random assignment reduces selection bias
Gold standard for cause-effect determination
Control group comparison
Justifies use of parametric tests (since any skewness should be similar in both groups)
supports generalizability
Enables calculation of risk ratios, NNT, confidence intervals
Minimizes confounding
Standardization/reproducibility of intervention
I.e., the most reliable method for evaluating intervention efficacy, due to methodological rigor and the ability to control bias

9
Q

How does RCT reduce selection bias

A

Balances known and unknown confounding variables between groups

11
Q

How does RCT assess cause and effect relationships

A

Controls for temporal ambiguity—intervention clearly precedes the outcome

12
Q

How does blinding reduce observer bias

A

Single- or double-blinding prevents knowledge of group assignment from influencing outcomes or assessment.

13
Q

How does RCT minimize confounding

A

Unlike observational studies (e.g., cohort or case-control), RCTs actively control for confounding at the design level, not just in analysis.

15
Q

How does RCT ensure statistical validity

A

Randomization and allocation concealment ensure that treatment and control groups are comparable at the start of the study, eliminating selection bias and balancing both known and unknown confounding factors. This rigor means observed differences in outcomes can be attributed to the intervention rather than pre-existing differences, maximizing internal validity.
Randomization justifies use of parametric tests and supports generalizability when well-powered.
Enables calculation of risk ratios, NNT (number needed to treat), and confidence intervals.

16
Q

Stratified vs. Enriched

A

Primary difference lies in patient selection: stratified trials enroll a broad population but balance subgroups, while enriched trials exclude patients unlikely to respond.
Stratified: randomizing within biomarker-defined groups (each treatment arm contains both biomarker-positive and -negative patients); enriched: selecting only for the biomarker.
I.e., different biomarker statuses, same treatment comparison in all groups vs. same biomarker status in all groups, different treatments

17
Q

Define stratified

A

Stratified: ensures a characteristic/stratum is balanced across treatment groups.
Definition: assigning participants to groups so that key characteristics are balanced across them.
Divide into strata based on the characteristic, then randomize within each stratum.
I.e., participants are subgrouped into strata first, and randomization is done AFTER, within each subgroup
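The divide-then-randomize step can be sketched like this (the participant records and the `biomarker` stratum are hypothetical):

```python
import random

def stratified_randomize(participants, stratum_key, seed=42):
    """Randomize 1:1 within each stratum so the characteristic is
    balanced across arms (any odd participant out goes to control)."""
    rng = random.Random(seed)
    strata = {}
    for p in participants:
        strata.setdefault(stratum_key(p), []).append(p)
    assignment = {}
    for members in strata.values():
        rng.shuffle(members)
        half = len(members) // 2
        for i, p in enumerate(members):
            assignment[p["id"]] = "treatment" if i < half else "control"
    return assignment

people = [{"id": i, "biomarker": "pos" if i < 10 else "neg"} for i in range(20)]
arms = stratified_randomize(people, lambda p: p["biomarker"])
```

Each biomarker stratum ends up with exactly 5 treatment and 5 control participants, which simple randomization would only achieve on average.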

18
Q

Define enriched design

A

Enriched: selecting participants based on a characteristic expected to increase the likelihood of success (not always success per se; the idea is to select the best chance of observing a drug effect, which may depend on the biomarker goal).
Definition: intentionally selecting a group with a characteristic likely to benefit.
Enriched design focuses on one biomarker group: biomarker-positive patients who do/don't receive the treatment.
I.e., same biomarker, different treatment

19
Q

Why choose enriched design

A

Ensures the trial focuses on those most likely to respond, increasing the chance of signal detection.
Can be biomarker-driven, clinical-characteristics-driven, or based on predictive modeling.
Focuses on one biomarker group: biomarker-positive patients who do/don't receive the treatment. Ensures groups are comparable pre-treatment, so post-treatment differences can be confidently attributed to the intervention rather than other factors

21
Q

Define confound

A

Confounding = third variable associated with both the exposure and outcome, thus potentially distorting the relationship between them
Often demographic, clinical, environmental

22
Q

How to address confounds

A

Start by reviewing existing literature to identify possible known confounders
Then control for in statistical tests by adding as covariates

23
Q

How to account for dropout rates

A

ITT
Multiple imputation/FIML
Document reasons
Compare completers and dropouts to identify systematic patterns

24
How to address heterogeneity in meta analysis
Investigate sources via subgroup analyses or meta-regression. Consider excluding outliers or low-quality studies in sensitivity analysis.
25
Define heterogeneity in meta analysis
Defined: degree of variability in effect estimates between trials, reported in a forest plot. Quantitative: e.g., I² compares overlap between study CIs; a relative measure showing what share of total variability is due to heterogeneity rather than chance. An outcome of 0 doesn't mean NO heterogeneity. Qualitative: balance of studies on each side of the line of no effect
26
Define sensitivity analyses
Test robustness of findings by varying inclusion criteria or removing studies, or by subsetting on pre-specified factors, and comparing the meta-analysis results. Lets you check whether conclusions hold within a portion of the cohort. MUST be pre-specified in the protocol, otherwise it is data-driven; fine to add post hoc, but explain why
27
Define subgroup analyses
Explore effects in subgroups (e.g., by age, sex, dosage, study quality) by separating and comparing them. MUST be pre-specified, ideally with a reference justifying every variable selected. Split/dichotomize studies by a variable to compare effect sizes and heterogeneity
28
Fixed vs random effects meta analysis
Fixed-effect model: assumes one true effect size. Random-effects model: assumes effect size may vary across studies (more realistic with heterogeneity).
30
Why test heterogeneity in meta analysis
Determines whether studies are consistent enough to be combined into one estimate
31
How to measure study heterogeneity
Assessed using statistical tests of variability: the I² statistic and Cochran's Q
32
What is the I2 measure of heterogeneity
A percentage describing the proportion of total variability that is due to study heterogeneity (relative to total variability, including random error). More common because more easily interpretable; >50% = high, suggesting substantial heterogeneity between studies. E.g., I² = 60% means 60% of the variability in antidepressant effect is due to differences in the true effects of the drug, and 40% is due to random sampling error. Compares overlap between study CIs; a relative measure, and an outcome of 0 doesn't mean NO heterogeneity
33
What is tau2
A continuous measure showing the actual/absolute variance in true effects between studies; a more direct measure of the magnitude of differences in the true effect. E.g., tau² = 0.25 means a true variance of 0.25 in effect size, so the antidepressant effect differs by this amount across studies
34
What is meta regression
Meta-regression: looks at study-level potential moderators of effect size against the actual effect sizes across studies. Tests whether study factors like design, sample size, measurement tool, or population influence the overall meta-analysis. Preferred alternative to subgroup analyses, as it keeps variables continuous and is therefore more informative (though splitting is a form of meta-regression)
35
How to do meta regression
Effect sizes are regressed onto study-level characteristics: like linear regression, coefficients tell you how much each moderator is associated with the effect size. The effect size for each study becomes the dependent variable. Plot the study-level X variable against SMD and fit a regression line
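For a single moderator, that weighted regression has a closed form. A sketch with invented effect sizes, SEs, and doses (inverse-variance weights, no tau² adjustment, so this is a fixed-effect version):

```python
def meta_regression(effects, ses, moderator):
    """Weighted least squares: regress study effect sizes on one
    study-level moderator, weighting each study by inverse variance."""
    w = [1 / se ** 2 for se in ses]
    sw = sum(w)
    xbar = sum(wi * x for wi, x in zip(w, moderator)) / sw
    ybar = sum(wi * y for wi, y in zip(w, effects)) / sw
    slope = (sum(wi * (x - xbar) * (y - ybar)
                 for wi, x, y in zip(w, moderator, effects))
             / sum(wi * (x - xbar) ** 2 for wi, x in zip(w, moderator)))
    intercept = ybar - slope * xbar
    return intercept, slope

# Hypothetical: SMDs rise with dose (mg) across three equally precise studies
intercept, slope = meta_regression([0.1, 0.2, 0.3], [0.1, 0.1, 0.1], [10, 20, 30])
print(slope)  # 0.01 SMD per mg
```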
36
When to do meta regression
Use when there is high effect-size heterogeneity across studies, to identify where it is coming from, or when you want to know the effects of moderators
37
How to do subgroup analysis in meta analysis
Divide studies based on the moderator variable; calculate the pooled effect size and heterogeneity for each group (>50% suggests the group is still too diverse). Compare effect sizes and heterogeneity between subgroups: Q-test for statistical differences in effect size
38
What is Cochrans Q
Complement to I²; a statistical test for the presence of effect heterogeneity (I² is computed from Q)
39
What does Cochran’s Q tell you
A significant p-value (typically <0.05) rejects the null hypothesis, indicating that the true treatment effects are not consistent. Can be underpowered with small numbers of studies
40
Considerations before subgroup analysis
Can still do it with 3 studies, but with low power and risk of overfitting. Check heterogeneity first; no point doing it if heterogeneity is low. Alternatives: sensitivity analysis excluding one study at a time, qualitative trend descriptions
41
Define meta analysis
A weighted pooled effect size estimate, with study weight determined by:
1) Study size (larger = more impact)
2) Frequency of events (more events = more statistically informative, even if the sample is smaller)
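The weighting idea can be sketched directly with inverse-variance weights (the study effects and SEs below are invented):

```python
def fixed_effect_pool(effects, ses):
    """Inverse-variance weighted pooled effect: more precise (usually
    larger) studies get proportionally more weight."""
    weights = [1 / se ** 2 for se in ses]
    pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
    se_pooled = (1 / sum(weights)) ** 0.5
    return pooled, se_pooled

effects = [0.30, 0.10, 0.25]   # hypothetical standardized mean differences
ses = [0.05, 0.20, 0.10]       # study 1 is the most precise, so it dominates
pooled, se = fixed_effect_pool(effects, ses)
print(f"{pooled:.3f} +/- {1.96 * se:.3f}")
```

The pooled value lands near study 1's estimate because its weight (1/SE²) is several times the others'.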
42
Define fixed effect MA
Fixed/common effect (singular): assumes there is a single true value (effect of an intervention) that all studies are estimating. Study outcomes are distributed around this value, showing how big the chance error around the true value is. The only thing preventing recovery of the true value is chance error, which shrinks with bigger sample size, which is why bigger studies are given more weight. Similar idea to a CI: a range of uncertainty around a true effect
43
Define random effects MA
Random effects (plural): assumes there is no single measurable true value; instead, each study estimates a distribution of effects around a mean. Measures both chance error and the random effect, which is why weight is spread more evenly across studies (though sample size still matters); CIs are larger. Even with an infinite sample you would never get one true effect; the best case is a distribution around it, and effects are always somewhat random with humans. No harm in assuming random effects even if the effect is truly fixed: you would still recover the fixed result, just without a distribution around it
44
What is a forest plot
A visual of the pooled estimate: summarizes between-group effects at the study level around a middle line of no difference, culminating in the weighted pooled estimate. For each study: N, mean, SD for the TRT and CTL groups. Plot: mean difference between groups with a CI bar; marker size shows study weight. Middle line: no effect. Bottom: summed totals of MD, CI, and weight across all studies; weights add to 100%. Final estimate: diamond of the weighted mean difference, with its own prediction interval (red)
45
Define funnel plot
Funnel plot: assesses small-study effects, NOT publication bias per se. Plot each study's ES against its SE (a proxy for study size), with a middle line at the pooled effect. Assumption: larger studies sit closer to the real effect (small SE, central ES; the funnel's point), while smaller studies spread more widely around the estimate (large SE, ES deviating from the middle; the funnel's base). Visual evidence of problems affecting the pooled estimate: dots outside the funnel, gaps on one side. Contour-enhanced version: shows the likelihood of outliers with p-value contours
47
Limitation to meta regression
Issues with moderator choice: study-level properties can't capture individual-level properties (e.g., only the mean age of subjects is available, not each subject's age)
48
Define RoB (risk of bias)
An estimate of the strength/certainty of results (not really "quality"). RoB 2: updated to assess specific outcomes; only done for the primary outcome of interest
49
Define risk ratio
Ratio of the risk (probability) of event out of all event possibilities in TRT relative to CTL
50
Define odds ratio
Ratio of the odds of event versus non-event outcomes in TRT relative to CTL
51
Define Relative risk ratio: RRR (relative risk reduction)
Percentage by which the risk of the event is reduced in TRT relative to CTL
52
Risk difference: RD (absolute risk reduction: ARR)
Absolute difference in risk of event in TRT compared to CTL
53
Define Number needed to treat: NNT
Number of people needed to reach one occurrence more/less in TRT vs. CTL
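All of the measures on cards 49-53 fall out of a single 2x2 table; a sketch with invented counts:

```python
def effect_measures(events_trt, n_trt, events_ctl, n_ctl):
    """RR, OR, RRR, ARR and NNT from a 2x2 table of event counts."""
    risk_t, risk_c = events_trt / n_trt, events_ctl / n_ctl
    rr = risk_t / risk_c                                   # risk ratio
    odds_ratio = ((events_trt / (n_trt - events_trt))
                  / (events_ctl / (n_ctl - events_ctl)))   # odds ratio
    arr = risk_c - risk_t        # absolute risk reduction (risk difference)
    return {"RR": rr, "OR": odds_ratio, "RRR": arr / risk_c,
            "ARR": arr, "NNT": 1 / arr}

m = effect_measures(events_trt=10, n_trt=100, events_ctl=20, n_ctl=100)
print(m)  # RR 0.5, OR ~0.44, RRR 0.5, ARR 0.1, NNT 10
```

Note how OR (0.44) is more extreme than RR (0.5) even here: the two only coincide when the event is rare.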
54
GRADE assessment
GRADE: additional domains of evidence beyond RoB, giving confidence in the overall MA estimate. Upgrading: large effect, dose-response, direction of residual confounding and biases. Downgrading: risk of bias, inconsistency (no overlap between CIs; NOT the same as heterogeneity), indirectness/generalizability, imprecision (CI boundary around no effect), publication bias
55
Forest plot for Dichotomous studies
Same as for continuous outcomes, but the mean difference is replaced by the OR/RR. For each study: n/N for the TRT and CTL groups (number of events/total group sample). Middle line of no effect = 1 (for a harmful event, right of 1 favours CTL, left favours TRT). Bottom: summed totals, weights adding to 100%. Estimate: test for overall effect, with p-value
56
Define Full Information Maximum Likelihood (FIML)
For missing data. Works by maximizing the likelihood of the observed data for each individual, allowing valid parameter estimation without imputing missing values, making it a statistically robust method under the MAR assumption
57
Positive of FIML
Uses all available data, even when some values are missing, without imputing them explicitly; works directly with the likelihood function based on the observed data.
Provides unbiased and efficient parameter estimates under MAR.
Integrates missing-data handling into the model estimation process, using the non-missing values to inform estimation.
Preserves sample size and statistical power by including incomplete cases, unlike listwise deletion (which drops them); cases with partial data still contribute to the analysis.
Avoids problems of imputation, like underestimating standard errors.
Widely supported in structural equation modeling and mixed models
58
Assumptions of FIML
Assumes data are missing at random (MAR): missingness is related to observed data but not to the missing values themselves. Requires appropriate model specification
59
FIML vs other
Better than traditional methods (e.g., complete-case analysis), which assume MCAR or ignore the missingness mechanism. "FIML is preferred because it efficiently uses all available data, reduces bias, and maintains statistical power, making it a robust approach for handling missing data in RCTs."
60
How FIML works
Model-based estimation: assumes a statistical model (e.g., linear regression, SEM). Calculates the likelihood (i.e., probability) of the observed data given the model parameters, for each case. Maximizes the total likelihood across all individuals to estimate the best-fitting model parameters
61
Define Multiple Imputation (MI)
Creates multiple complete ("filled-in") datasets by imputing missing values based on observed data patterns, analyzes them separately, then pools the results to account for imputation uncertainty. Assumes data are missing at random (MAR)
62
What is Maximum Likelihood Estimation
Estimates parameters directly from incomplete data under MAR assumptions.
63
Single imputation methods
Mean imputation: replaces missing values with the mean.
Last Observation Carried Forward (LOCF): carries the last available data point forward
64
Issues with single imputation
These underestimate variability and can bias results.
65
Principal component analysis
PCA: unsupervised; reduces the number of predictors while capturing their maximum variance, irrespective of any relationship to outcomes, i.e., pure dimensionality reduction. Unguided
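A sketch of the "unguided" part: PCA via SVD never sees an outcome variable, only the predictors (the toy matrix is invented):

```python
import numpy as np

def pca(X, n_components):
    """PCA via SVD on mean-centered data: components are the directions
    of maximum variance, chosen with no reference to any outcome."""
    Xc = X - X.mean(axis=0)
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = Xc @ Vt[:n_components].T     # data projected onto components
    explained = S ** 2 / (S ** 2).sum()   # variance share per component
    return scores, explained[:n_components]

# Two strongly correlated predictors collapse onto one component
X = np.array([[0.0, 0.1], [1.0, 2.0], [2.0, 3.9], [3.0, 6.0]])
scores, explained = pca(X, 1)
```

Because the two columns are nearly collinear, the first component alone captures almost all of the variance.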
66
Factor analysis
Vs. FA: unsupervised; identifies the unobserved latent factors explaining relationships among predictors, i.e., uncovering latent structure. Guided, but only one predictor
67
PLS regression
A multivariate statistical method that models relationships between two data matrices: predictors (X) and responses (Y). Extracts latent variables (components) from the predictors that explain maximum covariance with the responses. Combines features of principal component analysis (PCA) and multiple regression. Useful when predictors are highly collinear or when the number of predictors exceeds the number of observations
68
Assumptions in statistics
Normality, Homogeneity of variance, Independence of observations
69
Define linearity assumption
Models assume a linear relationship between predictors and the outcome
70
Define Homogeneity of variance assumption
Variance should be similar across groups or levels of predictors
71
Define independence of observations assumption
Observations (and residuals) should be independent of one another. In regression, also check for absence of multicollinearity: inter-correlation between predictors distorts estimates
72
Define normality assumption
The data (or model residuals) are normally distributed
73
How to test normality assumption
Via: histogram, quantile-quantile plot, box plot. Tests: Shapiro-Wilk (better for small n), Kolmogorov-Smirnov, Anderson-Darling
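A quick sketch of the Shapiro-Wilk check with SciPy (simulated data; the exponential sample is deliberately skewed):

```python
import random
from scipy import stats

random.seed(1)
normal_sample = [random.gauss(0, 1) for _ in range(50)]
skewed_sample = [random.expovariate(1.0) for _ in range(50)]

w_n, p_n = stats.shapiro(normal_sample)   # expect no evidence against normality
w_s, p_s = stats.shapiro(skewed_sample)   # expect a tiny p: normality rejected
print(f"normal p={p_n:.3f}, skewed p={p_s:.4f}")
```

A non-significant p does not prove normality; it only means the test found no evidence against it, which is why the plots above are checked alongside it.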
74
How to test homogeneity of variance assumption
Levene's test (more robust to non-normality), Bartlett's test, residual plots. In regression, plot residuals vs. fitted values and look for random scatter
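Levene's test is equally short in SciPy (invented groups with visibly different spread):

```python
from scipy import stats

group_a = [10, 12, 11, 13, 12, 11, 10, 12]   # tight spread
group_b = [5, 20, 2, 25, 9, 18, 1, 24]       # much wider spread
stat, p = stats.levene(group_a, group_b)     # default centers on the median
print(f"W={stat:.1f}, p={p:.4f}")            # small p: unequal variances
```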
75
How to test linearity assumption
Scatter plots between predictor and outcome. Residuals vs. predicted values: looking for random scatter
76
Test independence of observations assumption
Test for autocorrelation in regression residuals. Absence of multicollinearity (regression), since inter-correlation between predictors distorts estimates: correlation matrix of predictors, variance inflation factor, condition index and eigenvalues
77
Test for outliers
Box plots, z-scores Cook’s distance, leverage values
78
Define mixed effects models
Account for random effects of patients, i.e., individual variability between subjects. Especially important in repeated measures designs, where measurements are inter-correlated over time
79
Mixed effects models vs. Repeated measures ANOVA
RM-ANOVA: no random effects, limited to simple balanced designs, assumes sphericity
80
Define power analysis
Estimates the required N based on a fixed effect size, significance threshold, and power; all are interconnected
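That interconnection can be shown with the standard normal-approximation formula for a two-sample comparison (a sketch; it ignores the small t-distribution correction, so real software gives slightly larger N):

```python
from scipy.stats import norm

def n_per_group(d, alpha=0.05, power=0.80):
    """Approximate N per group for a two-sample test of Cohen's d:
    n = 2 * ((z_{1-alpha/2} + z_{power}) / d) ** 2."""
    z_alpha = norm.ppf(1 - alpha / 2)
    z_beta = norm.ppf(power)
    return 2 * ((z_alpha + z_beta) / d) ** 2

print(round(n_per_group(0.5)))   # medium effect: ~63 per group
print(round(n_per_group(0.2)))   # small effect: ~392 per group
```

Halving the detectable effect size roughly quadruples the required N, which is the "all interconnected" point in practice.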
81
Define power
Probability of correctly rejecting the null when it is false (i.e., detecting a true effect); equals 1 − β, where β is the Type II error rate. 80% power = 80% chance of detecting the effect if it exists
82
Define P value
Probability of observing a result at least as extreme as the one obtained, assuming the null is true; the significance threshold α is the Type I error rate (rejecting the null when it is true)
83
Define linear contrast
Linear combination of group means; tests a specific comparison between levels of fixed effects. More specific than the overall ANOVA group difference
85
Parametric vs non parametric test
Parametric tests make assumptions about the parameters of the population distribution from which the sample is drawn, most often that the population data are normally distributed. Non-parametric tests are "distribution-free" and can therefore be used for non-normal variables when comparing groups. E.g., Pearson vs. Spearman's correlation
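The Pearson vs. Spearman example on the card is easy to see on a monotonic but non-linear toy relationship:

```python
from scipy import stats

x = [1, 2, 3, 4, 5, 6]
y = [v ** 2 for v in x]          # monotonic but curved

r, _ = stats.pearsonr(x, y)      # < 1: penalized for the non-linearity
rho, _ = stats.spearmanr(x, y)   # 1: the ranks agree perfectly
print(f"Pearson r={r:.3f}, Spearman rho={rho:.3f}")
```

Spearman's rho is just Pearson's r computed on ranks, which is why it is immune to any monotonic distortion.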
86
Why not always use non parametric tests if they work for both normal and non normal?
Parametric tests have more statistical power when their assumptions hold, support more flexible modelling, and estimate parameters that are directly interpretable for the population
87
Difference between Cochran's Q, I² and tau² in meta-analysis
All assess heterogeneity (variance) between studies but differ in purpose: Q detects its presence, I² quantifies the percentage of total variance due to heterogeneity, and tau² estimates the absolute variance of true effect sizes across studies (presence, percent, absolute)
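All three come from the same ingredients. A sketch with invented effects and SEs, using the DerSimonian-Laird estimator for tau²:

```python
def heterogeneity(effects, ses):
    """Cochran's Q (presence), I-squared (percent), DerSimonian-Laird
    tau-squared (absolute variance) from study effects and SEs."""
    w = [1 / se ** 2 for se in ses]
    pooled = sum(wi * e for wi, e in zip(w, effects)) / sum(w)
    q = sum(wi * (e - pooled) ** 2 for wi, e in zip(w, effects))
    df = len(effects) - 1
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)
    return q, i2, tau2

# Three equally precise studies with widely spread effects
q, i2, tau2 = heterogeneity([0.1, 0.5, 0.9], [0.1, 0.1, 0.1])
print(f"Q={q:.0f}, I2={i2:.2f}%, tau2={tau2:.2f}")  # Q=32, I2=93.75%, tau2=0.15
```

Note how I² is derived from Q via (Q − df)/Q, and tau² rescales the same excess into absolute effect-size units.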
88
How do you interpret Cochran’s Q
Cochran's Q is a weighted sum of squared differences between individual study effects and the overall meta-analytic effect. It is a significance test of the null hypothesis that all studies share a common effect size (homogeneity). Interpretation: a significant p-value (typically <0.10, due to low power) indicates that variation among studies is greater than expected by chance alone, suggesting heterogeneity exists
89
How to interpret I2
Estimates the percentage of total variability across studies that is due to heterogeneity rather than chance. Interpretation: 0-25%: low; 25-50%: moderate; 50-75%: substantial/high; >75%: high/significant heterogeneity
90
Interpret tau2
τ² (tau-squared) represents the absolute between-study variance in a random-effects model; it measures the variance of the underlying true effect sizes. Interpretation: tau² = 0 indicates no between-study variance (homogeneous), while a higher value indicates greater variance. More robust than I² because the number of studies and their precision don't matter, but it needs context (the effect-size scale) to interpret the outcome
91
Why is high I² interpreted as bad
A high I² indicates that a large percentage of the total variance across studies is due to heterogeneity in the true effect rather than sampling error (random chance). It suggests the studies are not estimating the same underlying effect, making a single pooled average ("mean effect") potentially meaningless
92
Interpret meta regression
As with any regression, look at significance. Intercept: predicted effect size when all predictors are zero (or at their reference level). Coefficient (B): change in the average effect size for a one-unit increase in the moderator. P-value: if significant and the 95% CI does not include zero, the moderator has a significant impact on the effect size. Residual heterogeneity (I² or tau²): assesses whether the model explains the differences between studies; a significant reduction compared to a model without predictors suggests the covariates explain the variation. R² analog (amount of variance explained): the percentage of true between-study variance accounted for by the covariates
93
Difference risk and odds ratio
Risk ratios (RR) compare the probability of an outcome between groups, while odds ratios (OR) compare the odds of the outcome occurring vs. not occurring. RR is used in prospective studies (cohort/RCTs) for intuitive interpretation, whereas OR is used in retrospective studies (case-control) and logistic regression. The basic difference: the odds ratio is a ratio of two odds, whereas the relative risk is a ratio of two probabilities
94
Define homogeneity of variance
Assumption that the different samples or groups being compared have approximately equal variances (spread of scores around their respective means). A key requirement for ANOVA and t-tests, which compare mean differences; violating it can lead to incorrect conclusions