Original Article
GRADE guidelines: 20. Assessing the certainty of evidence in the importance of outcomes or values and preferences—inconsistency, imprecision, and other domains

https://doi.org/10.1016/j.jclinepi.2018.05.011Get rights and content

Abstract

Objective

To provide Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) guidance for assessing inconsistency, imprecision, and other domains for the certainty of evidence about the relative importance of outcomes.

Study Design and Setting

We applied the GRADE domains to rate the certainty of evidence in the importance of outcomes to several systematic reviews, iteratively reviewed draft guidance, and consulted GRADE members and other stakeholders for feedback.

Results

We describe the rationale for considering the remaining GRADE domains when rating the certainty in a body of evidence for the relative importance of outcomes. As meta-analyses are not common in this context, inconsistency and imprecision assessments are challenging. Furthermore, confusion exists about inconsistency, imprecision, and true variability in the relative importance of outcomes. To clarify this issue, we suggest that the true variability is neither equivalent to inconsistency nor imprecision. Specifically, inconsistency arises from population, intervention, comparison and outcome and methodological elements that should be explored and, if possible, explained. The width of the confidence interval and sample size inform judgments about imprecision. We also provide suggestions on how to detect publication bias and discuss the domains to rate up the certainty.

Conclusion

We provide guidance and examples for rating inconsistency, imprecision, and other domains for a body of evidence describing the relative importance of outcomes.

Introduction

The Grading of Recommendations, Assessment, Development and Evaluation (GRADE) working group developed a widely accepted approach to rate the certainty of a body of evidence in the contexts of systematic reviews, health technology assessment, health-care recommendations, and decision support [1], [2], [3], [4]. This is the 20th in the ongoing series of articles describing the GRADE approach in the Journal of Clinical Epidemiology and complements articles in this and other journals. We previously described the reasons for decreasing and increasing the certainty of a body of evidence; how an overall rating of the evidence is performed [5]; how to create GRADE evidence profiles and summary of findings tables [6], [7], [8], [9]; how evidence is used to move to recommendations and decisions [10], [11], [12], [13], [14], [15]; how evidence is dealt with in particular circumstances of diagnostic, prognostic, equity-related, multiple treatment comparison, environmental and public health questions [16], [17], [18], [19], [20], [21]; how GRADE applies to rapid advice [22]; how GRADE deals with new risk of bias tools [23]; and when there is missing outcome data [24].

Decisions in health care require an assessment of the certainty of how much people value the importance of the outcomes that researchers and practitioners intend to affect [10], [11], [12], [13], [14], [15]. Indeed, a variety of instruments are in use to elicit the relative importance of outcomes, including health state value or utility, or willingness to pay instruments. Our previous article in this series introduced the topic and the typical instruments in some detail [26]. For example, patients with severe or very severe chronic obstructive pulmonary disease (COPD) are willing to pay $13.46 to avoid mild side effects and $67.51 to gain symptom relief [25]. This suggests patients place more value on avoiding no symptom relief than avoiding mild side effects. We also described the terminology regarding the relative importance of outcomes and how it relates to the concept of values and preferences. The term outcome includes “health state” and non-health states that are related to the interventions under consideration, a broad set of the outcomes directly and indirectly related to health or a disease or non–health consequences. We then introduced the GRADE approach to rate the certainty of research evidence that focuses on this gap area, the rating of the certainty of a body of evidence about the relative importance of outcomes with a focus on risk of bias and indirectness [26]. We also introduced our rationale and explanations about the terminology of outcome importance [26]. In this article, we will provide guidance on rating of a body of evidence about the relative importance of outcomes dealing with the GRADE domains' inconsistency and imprecision and describe other concepts related to publication bias, rating up the certainty of evidence and variability in estimates. This guidance informs the certainty in the evidence about the relative importance of the outcome (values) and, thus, the evidence to decision framework. However, it will also affect the assessment of the balance between desirable and undesirable health outcomes and eventually affect the balance of the overall desirable and undesirable consequences (including the consequences on other criteria in the evidence to decision framework, for example, ethical consequences or consequences on health equity). In addition, our approach will be of use for those using utilities and values and preferences apart from GRADE evidence to decision frameworks; in particular, those conducting health technology assessments and decision modeling.

Section snippets

Methodology

We described the detailed methods for this work in the previous article [26]. Briefly, we used an iterative multipronged approach to develop guidance for assessing the certainty of a body of evidence addressing the relative importance of outcomes. We applied the same GRADE domains (risk of bias, inconsistency, indirectness, imprecision, publication bias, and domains to rate up the evidence) to the evidence describing the relative importance of outcome ratings systematic reviews and developed

Inconsistency

According to the GRADE approach, raters can lower the certainty of the evidence if there is unexplained inconsistency or heterogeneity. However, assessment of inconsistency of evidence about the relative importance of outcomes is challenging for several reasons. First, the existing systematic reviews or health technology assessments often lack a clear definition of the relative importance of outcomes or values and preferences and include a diverse set of methods and instruments to assess them

Imprecision

Rating imprecision for the relative importance of outcomes includes an assessment of both the CI and sample size for the body of evidence. This assessment is often challenging because there are meta-analyses rarely and, thus, no calculated CIs. For the same reason, there is no simple way to calculate the minimum sample size to produce a sufficiently narrow estimate with sufficient power for the relative importance of the outcomes [37]. However, we suggest raters take the following approaches

Publication bias

Publication bias may be important for evidence addressing the relative importance of outcomes. Although the reasons for publication bias for this type of evidence may differ importantly from those of intervention studies where for-profit interest often play a role, other reasons for failure to publish (in a timely manner) may be similar. Conceivable reasons for delayed or unsuccessful publication include the results are not consistent with previous research results, results are redundant, or

Rating up

The theoretical basis and empirical examples for using existing domains for rating up the certainty of evidence (a large effects or associations, dose-response gradient and direction of plausible residual confounding) of the relative importance of outcomes is limited. Thus far, we do not have clear guidance for when the evidence of the relative importance of outcomes should be rated up but we will describe some plausible scenarios here.

The certainty of a body of evidence summarizing the

Distribution across individuals and decision-making scenarios

We developed GRADE guidance for rating the certainty (or quality) of evidence. In this section, we will describe how it relates to decision-making and how it informs the GRADE EtD criterion “how much people value the main outcomes” [10]. Until we will have developed further guidance, we suggest to not rate the certainty in the variability of the relative importance of outcomes but making the potential for underlying variability transparent. The term “variability” of values is used ambiguously.

A practical example of assessing the overall certainty in the relative importance of an outcome across a body of evidence

We describe how the GRADE approach can be applied to assess the certainty of evidence for the relative importance of outcomes. The ratings start as “high” for all outcome assessments. Raters lower the certainty to moderate, low, or very low if one or more of the risk of bias, inconsistency, indirectness, imprecision or publication bias are judged as serious or very serious for the body of evidence, and consider the upgrading domains, to determine the final assessment of certainty of evidence (

Summary

This and the prior article in this series describe the GRADE approach for rating the certainty of evidence in the relative importance of outcomes or values and preferences [26]. Both the expansion of GRADE to this field of evidence and the assessment of a body of evidence in this area, in general, are innovative. The approach should be useful for systematic reviews, health technology assessment, decision modeling, and guidelines. The major challenge of rating the evidence about relative

Acknowledgments

The authors are grateful to Dr. Amiram Gafni from McMaster University for the comments on the manuscripts.

Authors' contributions: H.J.S. conceived of the project and approach; Y.Z., P.A., G.G., and H.J.S. designed the methodology for this project. Y.Z., J.J.Y.N., and Y.C. summarized the certainty assessment of relevant items in systematic reviews; Y.Z., P.A., G.G., and H.J.S. proposed the subdomains for the certainty of evidence assessment. All authors participated in methodological

References (51)

  • R.L. Morgan et al.

    GRADE: assessing the quality of evidence in environmental and occupational health

    Environ Int

    (2016)
  • K.A. Thayer et al.

    Using GRADE to respond to health questions with different levels of urgency

    Environ Int

    (2016)
  • G.H. Guyatt et al.

    GRADE guidelines 17: assessing the risk of bias associated with missing participant outcome data in a body of evidence

    J Clin Epidemiol

    (2017)
  • K. Sepucha et al.

    How to define and measure concordance between patients' preferences and medical treatments: a systematic review of approaches and recommendations for standardization

    Patient Educ Couns

    (2010)
  • G.H. Guyatt et al.

    GRADE guidelines: 7. Rating the quality of evidence–inconsistency

    J Clin Epidemiol

    (2011)
  • G.H. Guyatt et al.

    GRADE guidelines 6. Rating the quality of evidence–imprecision

    J Clin Epidemiol

    (2011)
  • H.J. Schunemann

    Interpreting GRADE's levels of certainty or quality of the evidence: GRADE for statisticians, considering review information size or less emphasis on imprecision?

    J Clin Epidemiol

    (2016)
  • G.H. Guyatt et al.

    GRADE guidelines: 5. Rating the quality of evidence–publication bias

    J Clin Epidemiol

    (2011)
  • G.H. Guyatt et al.

    GRADE guidelines: 9. Rating up the quality of evidence

    J Clin Epidemiol

    (2011)
  • H.J. Schunemann et al.

    Evaluation of the minimal important difference for the feeling thermometer and the St. George's Respiratory Questionnaire in patients with chronic airflow obstruction

    J Clin Epidemiol

    (2003)
  • K.B. Slot et al.

    Thrombolytic treatment for stroke: patient preferences for treatment, information, and involvement

    J Stroke Cerebrovasc Dis

    (2009)
  • R. Thomson et al.

    Decision analysis and guidelines for anticoagulant therapy to prevent stroke in patients with atrial fibrillation

    Lancet

    (2000)
  • H.J. Schunemann et al.

    Letters, numbers, symbols and words: how to communicate grades of evidence and recommendations

    CMAJ

    (2003)
  • D. Atkins et al.

    Grading quality of evidence and strength of recommendations

    BMJ

    (2004)
  • P. Alonso-Coello et al.

    GRADE Evidence to Decision (EtD) frameworks: a systematic and transparent approach to making well informed healthcare choices. 1: Introduction

    BMJ

    (2016)
  • Cited by (141)

    View all citing articles on Scopus

    Ethics approval and consent to participate: Not required. This study does not involve de novo patient data collection. No patient informed consent and institutional review board approval have been sought.

    Consent for publication: Not applicable.

    Availability of data and materials: The data sets supporting the conclusions of this article are included within the article and its additional file.

    Funding: This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors. It was funded through internal research funds at McMaster University available to HJS. GH is supported by a CIHR New Investigator Salary Award and a The Arthritis Society Young Investigator Salary Award, neither of which is directly related to this research project.

    Conflict of interest: All authors have completed the ICMJE uniform disclosure form at http://www.icmje.org/conflicts-of-interest/); no financial relationships with any organizations that might have an interest in the submitted work in the previous 3 years, no other relationships or activities that could appear to have influenced the submitted work. Authors are members of the GRADE Working Group.

    View full text