AMSTAR is a reliable and valid measurement tool to assess the methodological quality of systematic reviews

Beverley J Shea; Candyce Hamel; George A Wells; Lex M Bouter; Elizabeth Kristjansson; Jeremy Grimshaw; David A Henry; Maarten Boers

doi:10.1016/j.jclinepi.2008.10.009

AMSTAR is a reliable and valid measurement tool to assess the methodological quality of systematic reviews

J Clin Epidemiol. 2009 Oct;62(10):1013-20. doi: 10.1016/j.jclinepi.2008.10.009. Epub 2009 Feb 20.

Authors

Beverley J Shea¹, Candyce Hamel, George A Wells, Lex M Bouter, Elizabeth Kristjansson, Jeremy Grimshaw, David A Henry, Maarten Boers

Affiliation

¹ Community Information and Epidemiological Technologies (CIET), Institute of Population Health, Ottawa, Ontario, Canada. bshea@ciet.org

PMID: 19230606
DOI: 10.1016/j.jclinepi.2008.10.009

Abstract

Objective: Our purpose was to measure the agreement, reliability, construct validity, and feasibility of a measurement tool to assess systematic reviews (AMSTAR).

Study design and setting: We randomly selected 30 systematic reviews from a database. Each was assessed by two reviewers using: (1) the enhanced quality assessment questionnaire (Overview of Quality Assessment Questionnaire [OQAQ]); (2) Sacks' instrument; and (3) our newly developed measurement tool (AMSTAR). We report on reliability (interobserver kappas of the 11 AMSTAR items), intraclass correlation coefficients (ICCs) of the sum scores, construct validity (ICCs of the sum scores of AMSTAR compared with those of other instruments), and completion times.

Results: The interrater agreement of the individual items of AMSTAR was substantial with a mean kappa of 0.70 (95% confidence interval [CI]: 0.57, 0.83) (range: 0.38-1.0). Kappas recorded for the other instruments were 0.63 (95% CI: 0.38, 0.78) for enhanced OQAQ and 0.40 (95% CI: 0.29, 0.50) for the Sacks' instrument. The ICC of the total score for AMSTAR was 0.84 (95% CI: 0.65, 0.92) compared with 0.91 (95% CI: 0.82, 0.96) for OQAQ and 0.86 (95% CI: 0.71, 0.94) for the Sacks' instrument. AMSTAR proved easy to apply, each review taking about 15 minutes to complete.

Conclusions: AMSTAR has good agreement, reliability, construct validity, and feasibility. These findings need confirmation by a broader range of assessors and a more diverse range of reviews.

Publication types

Validation Study

MeSH terms

Evidence-Based Medicine / methods
Evidence-Based Medicine / standards*
Feasibility Studies
Humans
Observer Variation
Quality Control
Reproducibility of Results
Research Design / standards
Review Literature as Topic*