Article Text

Download PDFPDF

Empirical evaluation of SUCRA-based treatment ranks in network meta-analysis: quantifying robustness using Cohen’s kappa
  1. Caitlin H Daly1,
  2. Binod Neupane1,
  3. Joseph Beyene1,2,
  4. Lehana Thabane1,3,
  5. Sharon E Straus4,5,
  6. Jemila S Hamid1,6
  1. 1 Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
  2. 2 Department of Mathematics and Statistics, McMaster University, Hamilton, Ontario, Canada
  3. 3 Biostatistics Unit, Father Sean O’Sullivan Research Centre, St. Joseph’s Healthcare Hamilton, Hamilton, Ontario, Canada
  4. 4 Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, Ontario, Canada
  5. 5 Department of Medicine, University of Toronto, Toronto, Ontario, Canada
  6. 6 Clinical Research Unit, Children’s Hospital of Eastern Ontario, Ottawa, Ontario, Canada
  1. Correspondence to Dr Jemila S Hamid; jehamid{at}cheo.on.ca

Abstract

Objective To provide a framework for quantifying the robustness of treatment ranks based on Surface Under the Cumulative RAnking curve (SUCRA) in network meta-analysis (NMA) and investigating potential factors associated with lack of robustness.

Methods We propose the use of Cohen’s kappa to quantify the agreement between SUCRA-based treatment ranks estimated through NMA of a complete data set and a subset of it. We illustrate our approach using five published NMA data sets, where robustness was assessed by removing studies one at a time.

Results Overall, SUCRA-based treatment ranks were robust to individual studies in the five data sets we considered. We observed more incidences of disagreement between ranks in the networks with larger numbers of treatments. Most treatments moved only one or two ranks up or down. The lowest quadratic weighted kappa estimate observed across all networks was in the network with the smallest number of treatments (4), where weighted kappa=40%. In the network with the largest number of treatments (12), the lowest observed quadratic weighted kappa=89%, reflecting a small shift in this network's treatment ranks overall. Preliminary observations suggest that a study’s size, the number of studies making a treatment comparison, and the agreement of a study’s estimated treatment effect(s) with those estimated by other studies making the same comparison(s) may explain the overall robustness of treatment ranks to studies.

Conclusions Investigating robustness or sensitivity in an NMA may reveal outlying rank changes that are clinically or policy-relevant. Cohen’s kappa is a useful measure that permits investigation into study characteristics that may explain varying sensitivity to individual studies. However, this study presents a framework as a proof of concept and further investigation is required to identify potential factors associated with the robustness of treatment ranks using more extensive empirical evaluations.

  • mixed-treatment comparisons
  • network meta-analysis
  • ranks
  • SUCRA
  • robustness
  • kappa

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Footnotes

  • Contributors CHD conceptualised and designed the study, analysed the data, interpreted the results of the empirical evaluation, and drafted and revised the manuscript. BN and JB provided input into the study design, acquired the data and revised the manuscript. LT and SES provided input into the study design and revised the manuscript. JSH conceptualised and designed the study and revised the manuscript. All the authors approved the final version of the submitted manuscript.

  • Funding This work was supported in part by an Ontario Graduate Scholarship granted to CHD.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement Data may be obtained from a third party and are not publicly available.