Appraising SRs and meta-analysis
Evidence on SRs that physical therapists use should FIRST be evaluated with the
following questions that are adapted from “Critical Appraisal Worksheets” developed
by the Oxford Center for Evidence-based Medicine. Diane Jewell (2018) p.342
Appraisal of validity of Systematic Reviews (Interventions)
1. Did the investigators limit the review to high-quality studies? (e.g.,
study design being true experimental or quasi-experimental for
intervention studies)
Appraisal of validity of Systematic Reviews (Interventions)
2. Did the investigators implement a comprehensive search and study
selection process?
Assessment of Study Credibility – Systematic Reviews
3. Did the investigators assess the quality of individual studies with
standardized processes and/or tools? (e.g., PEDro scale; Cochrane RoB)
Example of Cochrane Risk of Bias
Assessment of Study Credibility – Systematic Reviews
4. Did the investigators provide details about the research validity
(or quality) of studies included in the review?
Grading of Recommendations Assessment, Development and Evaluation
(GRADE)
Assessment of Study Credibility – Systematic Reviews
5. Did the authors address publication bias?
Publication bias:
i. only studies in English are included in the review
✓Authors are expected to explicitly discuss about it (e.g., no author reads non-
English-language publication)
ii. preference for publication is given to studies showing statistically
significant beneficial treatment effects;
✓the tendency of journal editors to publish studies based on direction and
statistical significance of the outcomes;
✓out of authors’ control;
Raising concerns about an accessibility of ‘unpublished findings’ grey literature:
“composed of nonpublished information, often on websites, in reports,
conference proceedings, theses/dissertations. These sources can be valuable to a
full understanding of the scope of knowledge on the topic”
* Addresses the need to interpret the review’s results in the context of
this potential problem; EX: PRISMA (Eligibility criteria)
✓Specify study characteristics (e.g., PICOS, length of follow-up) and report
characteristics (e.g., years considered, language, publication status) used
Assessment of Study Credibility – Systematic Reviews
6. If this is a meta-analysis, did the investigators use individual patient data in the analysis?
Study Credibility – Systematic Reviews
Should You Use this Evidence?
7. How confident are you in the research validity of this review?
This checklist refers to adequacy of reporting in each component of the
review, including
* a clear research question,
* criteria for selection of studies,
* a comprehensive search,
* reliability of reviewers, and
* assessing risk of bias.
* For a meta-analysis, use of appropriate
statistical estimates,
* evaluation of heterogeneity and
* publication bias.
* The appraiser is expected to make a
judgment regarding the overall confidence in
the results of the SRs (high, moderate, low,
and critically low) based on the number of
noncritical weaknesses and critical flaws in
the SRs identified by the checklist
Evidence and the Patient/Client’s Perspective
Consider the following questions:
▪ Is the review high quality (e.g., does the design minimize bias)? (Quality)
▪ Are the results important enough to use? (Importance) clinical vs.
statistical
▪ Was your patient/client represented in the review? (Similarity)
▪ Is the element of interest available, practical and safe for application in
the clinical setting? (Practicality)
AND
▪ Patient’s values and preferences
➢ Cost – financial, time, personal
➢ Belief in the value of scientific evidence
➢ Previous experiences
Homogeneity or heterogeneity of findings among
studies leads to
Heterogeneity (or homogeneity) can be assessed
using either
✓the ‘eyeball’ test: one looks for overlap of the confidence
intervals of the trials with the summary estimate; or
✓more formally with statistical tests such as I2 statistic
(based on the Cochran Q test). A value of 0% indicates no
heterogeneity, and higher percentages suggest greater
heterogeneity.
Statistics commonly reported for meta-analysis according to
type of research design (or research question)
✓Effect size (ES) or Standardized effect size (intervention studies)
✓Number needed to treat (NNT)
✓Likelihood ratio (LR; more discussion in Diagnostic studies)
✓Odds ratio (OR)
✓Relative risk (RR; aka risk ratio)
Odds ratio and relative risk ratio helps with case control or cohort study!
Meaning of meta-analysis results
Displayed as a forest plot illustrating weighted ES
by individual studies and a cumulative summary
Effect size:
the magnitude of the difference
between two mean values; may be standardized
95% Confidence Interval (CI)
a range of scores within which the true score is estimated to lie within a 95% probability 10
effect size notes….
Mean Score (Exp) – Mean Score (Control)/Pooled Standard Deviation
Effect size (=a/b)
the magnitude of the difference
between two mean values
➢ Between-group difference (a)
➢ Within-group variability (b)
* strength of Effect size
➢ 0.2 small
➢ 0.5 medium
➢ 0.8 large
Effect Size (standardized) scores
Number Needed to Treat (NNT)
✓“the number of subjects treated with an experimental
intervention over the course of a study required to achieve one good outcome or prevent one bad outcome.”
Positive Likelihood Ratio
(LR+)
✓The likelihood that a positive (+) test result
was observed in a person with the disorder v. in a
person without the disorder of interest
✓The likelihood that a
negative (-) test result is
observed in a person with
the disorder v. in a person
without the disorder of
interest