Article Text


Reporting of harms data in RCTs: a systematic review of empirical assessments against the CONSORT harms extension
  1. Alex Hodkinson,
  2. Jamie J Kirkham,
  3. Catrin Tudur-Smith,
  4. Carrol Gamble
  1. Department of Biostatistics, MRC North West Hub for Trials Methodology Research, University of Liverpool, Liverpool, UK
  1. Correspondence to Alex Hodkinson; ahoddy{at}


Objective To determine the standard of reporting of harms-related data, in randomised controlled trials (RCTs) according to the Consolidated Standards of Reporting Trials (CONSORT) statement extension for harms.

Design Systematic review.

Data sources The Cochrane library, Ovid MEDLINE, Scopus and ISI Web of Knowledge were searched for relevant literature.

Eligibility criteria for selecting studies We included publications of studies that used the CONSORT harms extension to assess the reporting of harms in RCTs.

Results We identified 7 studies which included between 10 and 205 RCTs. The clinical areas of the 7 studies were: hypertension (1), urology (1), epilepsy (1), complimentary medicine (2) and two not restricted to a clinical topic. Quality of the 7 studies was assessed by a risk of bias tool and was found to be variable. Adherence to the CONSORT harms criteria reported in the 7 studies was inadequate and variable across the items in the checklist. Adverse events are poorly defined, with 6 studies failing to exceed 50% adherence to the items in the checklist.

Conclusions Readers of RCT publications need to be able to balance the trade-offs between benefits and harms of interventions. This systematic review suggests that this is compromised due to poor reporting of harms which is evident across a range of clinical areas. Improvements in quality could be achieved by wider adoption of the CONSORT harms criteria by journals reporting RCTs.

Statistics from

Article summary

Strengths and limitations of this study

  • This is the first study to systematically review empirical studies assessing the quality of reporting according to the CONSORT-harms guideline.

  • The review was strengthened by its assessment of quality of the included studies across four key domains.

  • This study should be regarded as a reflection of reporting standards in general rather than an assessment of adherence to the CONSORT-harms extension.

  • Some included studies contained trials reported prior to the publication of the CONSORT harms guideline; we did not extract these results.

  • We have not assessed changes in reporting over time.


Every healthcare intervention is associated with a risk of harmful or adverse events, that must be balanced against the potential favourable outcomes.1

The Consolidated Standards of Reporting Trials (CONSORT) statement aims to improve the quality of published reports of randomised controlled trials (RCTs) and has been widely endorsed by healthcare journals, leading to improvements in quality when used by manuscript authors and peer reviewers.2–4 However, some reports suggest that assessment and reporting of harms in clinical trials may be suboptimal.5–7

The standard CONSORT statement8 is primarily aimed at reporting the intended, usually beneficial effects of intervention(s) with only one item (item 19) devoted to unintended adverse events (harms) in the original 2001 checklist. Owing to accumulating evidence that reporting on harms-related data in RCTs was of poor quality with an imbalanced ratio of benefit–harms reporting, a CONSORT statement extension was developed in 2004 to improve harms reporting (CONSORT-harms) and to help address perceived shortcomings in measurement, analysis and reporting of harms data.9 The subsequent update of the standard CONSORT statement, published in 2010,10 now specifically refers to the additional CONSORT-harms extension but it is still unclear whether authors and journals routinely adopt the use of this extension. The aim of this paper is to systematically review the evidence from previously conducted empirical studies that have assessed the adequacy of harms reporting in RCTs using the CONSORT-harms extension as a benchmark.


A protocol for the systematic review was developed by AH, CTS, CG and JJK.

Study inclusion criteria

We included published and unpublished research that evaluated the quality of harms reporting in RCTs against the CONSORT-harms recommendations.9 No restriction was placed on the clinical area or type of intervention studied. Excluded studies were those that assessed harms reporting using assessment criteria other than CONSORT-harms and studies that assessed harms reporting using study designs for which the CONSORT guideline was not intended (eg, observational studies).

Identification of studies

AH, CTS and CG developed the search strategy with support from an information specialist which was then implemented by AH in the following databases: the Cochrane Methodology register, Database of Abstracts of Reviews of Effects (DARE), Ovid MEDLINE, Scopus and ISI Web of Knowledge. Conference abstracts were searched for in the Web of Knowledge Conference Proceedings Citation Indexes (CPCI-S or CPCI-SSH) and the Zetoc database.11 An unpublished Masters dissertation involving one of the authors (JJK) was also obtained. Date filters were not used during the search criteria; our interest lies only within reviews published after 2004, with the cut-off date June 2012.

Titles and abstracts of reports identified by the search were screened by AH and full articles obtained for all potentially eligible studies. Each full article was assessed independently by two reviewers (AH and CTS) to determine eligibility.

Quality assessment

Two reviewers (AH and JJK) independently assessed the methodological quality of each study using the Cochrane Risk of Bias (RoB) tool12 as a guideline to cover the following aspects. Criteria were graded as low risk, high risk or unclear as indicated.

  1. Were the trials included in the study a representative sample, for example, unselected journals, and reasonable time scale?

    Low risk of bias: studies included trials from a primary search of all the available literature.

    High risk of bias: studies were highly selective of the trials included, for example, high-impact journals or specialised-journals only.

    Unclear risk of bias: not stated how studies were selected.

  2. During the data extraction of CONSORT-harms criteria, were reviewers blinded to study authors, institution, journal name and sponsors?

    Low risk of bias: reviewers were blinded.

    High risk of bias: reviewers were not blinded.

    Unclear risk of bias: not stated.

  3. Is there evidence of selective outcome reporting in the study (ie, were all CONSORT-harms recommendations considered and if not were suitable reasons provided)?

    Low risk of bias: studies that considered all CONSORT-harms criteria or reasons for excluding specific criteria were transparent and justified.

    High risk of bias: studies did not consider all CONSORT-harms criteria.

    Unclear risk of bias: unclear whether all CONSORT-harms criteria were considered.

  4. Did more than one reviewer assess the CONSORT-harms criteria for each primary RCT, with a description of how agreement was achieved?

    Low risk of bias: data extraction was completed independently by two people or reasonable attempts were made to maximise data extraction reliability.

    High risk of bias: data extraction not completed independently by two people.

    Unclear risk of bias: not stated.

Data collection and extraction

Two reviewers (AH and JJK) independently extracted the data and any discrepancies were resolved through a consensus discussion with a third reviewer (CTS). Data extraction included

  • Study characteristics: inclusion criteria including clinical area, types of interventions, databases or journals searched within the study and any search date restrictions.

  • Sample size (defined by the number of RCT reports assessed for reporting quality).

  • Reporting quality: inclusion of any of the 10 recommendations from the 2004 CONSORT-harms checklist (table 1 and supplemental data: see CONSORT plots).

Table 1

The 10 CONSORT-harms recommendations9

Lead authors were contacted through email with any queries relating to the quality assessment or data extraction.

Data analysis and presentation

For each study, the percentage of included RCTs that satisfied each CONSORT-harms recommendation is presented with 95% CIs. Some studies had presented data for individual items described within each of the 10 criteria rather than overall data. These are presented as such in tables with footnotes to provide further explanation. Forest plots were used to graphically depict the levels of adherence to the CONSORT harms recommendations so that readers can easily discern the extent of compliance and heterogeneity between studies with the I2 statistic (included as supplementary material online). We refrained from statistically combining results from the different studies due to the differences in their study characteristics. In accordance with the Cochrane Handbook, I2 statistics were interpreted as (0–40%, might not be important; 30–60%, may represent moderate heterogeneity; 50–90% may represent substantial heterogeneity; 75–100%, considerable heterogeneity).13


The search strategy identified 5083 potentially eligible study cohorts from which seven studies assessing the quality of reporting across almost 800 RCTs were included (figure 1).

Figure 1

Flow diagram of study identification and selection.

Five studies14–18 (with the study19 recently published) contained trials focusing on specific clinical areas with two20 ,21 covering multiple clinical areas (table 2). Four studies14 ,17 ,20 ,21 included trials using drug interventions, one comparing acupuncture18 and another alternative complementary medicines,16 the interventions were unclear in one study.15 MEDLINE was used by four of the studies17 ,18 ,20 ,21 to identify the relevant literature; three14 ,16 ,17 used the Cochrane database of RCTs and three15–17 searched specialised-journal databases. The date restrictions used in the search strategy of each study ranged from a 1 year period up to a 9 years span. The studies were published after 2008, 4 years after the release of the harms extension with three15 ,17 ,21 of them including trials that had been published before the publication of CONSORT-harms. Five studies14–17 ,20 excluded trials published in a non-English language.

Table 2

Characteristics of included reviews

Risk of bias

Lead authors were contacted by email with any queries relating to the quality of their study, or CONSORT criteria; however, two authors failed to respond.15 ,18 The risk of bias for the seven included studies, assessed across four domains, is summarised in table 3. Six studies14 ,15 ,17 ,18 ,20 ,21 were classified as high risk for bias for at least one domain with two of these studies14 ,20 classified as high risk for three domains. Four studies13 ,14 ,20 ,21 did not include trials of a representative sample targeting specific journals rather than a database search. Blinding of assessors was only implemented in two studies15 ,16 with one unclear.18 Most studies used all the CONSORT-harms criteria with the exception of the subgroup analysis item; one study16 however, discarded the use of recommendation eight, since it was captured elsewhere within the data extraction, and recommendation 10, which was considered too vague to assess with any objectivity. Reporting of the assessment within three15 ,18 ,20 of the seven identified studies was unclear and authors were contacted. The authors did not respond for two studies15 ,17 and in another study20 a response was received but some details remained unclear. Six studies15–18 ,20 ,21 had used two independent data extractors while one study14 had not and was classified as high risk of bias for this domain.

Table 3

Risk of bias assessment

CONSORT harms recommendations

Results extracted for the CONSORT-harms criteria (table 4) demonstrate variability in the level of adherence to items. Heterogeneity is highlighted by the individual Forest plots where inflated I2 values of over 85% are represented for all recommendations, denoting considerable heterogeneity.

Table 4

CONSORT harms criteria reported across included reviews

Of the six studies that assess inclusion of harms in the title and abstract of their included RCTs, three16 ,20 ,21 reported compliance in over 70% of RCTs, but three14–16 reported compliance in less than 30% of RCTs. The introduction section of the included RCTs reflect an imbalance in the reporting benefit–harms, with one study16 reporting that less than 5% of RCTs had mentioned harms in the introduction, and one study17 reporting more than 70% of its included RCTs has satisfied this criteria.

The definition of adverse events in reports is unsatisfactory with most studies14–16 ,17 ,20 indicating that fewer than 20% of RCTs satisfy these criteria adequately. The collection of harms-related information is described by more than 80% of RCTs in two studies,20 ,21 but this high level is not consistent across the other five studies with one study14 suggesting that as few as 10% of RCTs had provided an adequate description. The analysis and coding of adverse events is poorly described, with less than 50% of RCTs satisfying this criteria across six studies,13 ,16–18 ,20 ,21 with one of these studies13 indicating that none of the RCTs had provided an adequate description. The reporting of participant withdrawals due to harms was inconsistent with two studies15 ,16 suggesting infrequent reporting (less than 40% of RCTs had mentioned withdrawals), three studies13 ,20 ,21 suggesting occasional reporting (50–60% of RCTs had mentioned withdrawals) and two studies17 suggesting that reporting of withdrawals was quite common (approximately 70% of RCTs had mentioned withdrawals).

When providing the denominators within trial reports, the results were also varied across studies, with three17 ,20 ,21 studies identifying more than 70% of trials that satisfied this criterion, but two studies13 ,15 identifying less than 20% adherence. The risk and severity grading of adverse events, is detailed in more than 70% of trial across two studies,20 ,21 but the reporting is inadequate in three studies.13 ,15 ,17 An assessment of reporting of harms within subgroup analysis was only carried out within one study.21

Four studies14 ,15 ,17 ,21 assessed their included RCTs for a balanced report on the benefits and harms within their discussion: one study13 identified a very low percentage (<10%), two studies14 ,16 identified a moderate percentage (approximately 60%), and one study21 identified a high percentage (over 80%) of trials that met this criterion.


Summary of findings

This is the first study to systematically review empirical studies assessing the quality of reporting according to the CONSORT-harms guideline.9 Data were extracted from seven studies that had each assessed the quality of reporting across almost 800 RCTs from a range of clinical specialities. Eight years have now passed since the release of the harms extension, allowing adequate time for the guideline implementation. This review highlights that the reporting of harms in RCTs is inconsistent, and at times very poor. Heterogeneity is easily discerned between studies for each recommendation. Further adherence to the CONSORT-harms is needed.

The standard CONSORT is well established in health research with building evidence to support the use of the guideline.5 ,6 Currently the standard CONSORT is endorsed by over 50% of the core medical journals in the abridged Index Medicus on PubMed.22 In a review23 of 116 health research journals, 41 provided online instructions to authors. Almost half (19/41 (46%)) mentioned the standard CONSORT guideline but none referred to the CONSORT extension for harms.

Strengths and weaknesses of the study

In this study we have focused on assessing reporting according to the CONSORT-harms criteria only. The included studies contained trials reported prior to the publication of the CONSORT-harms guideline. However, we have not assessed changes in reporting over time. Nevertheless, our results support those from previous studies3 ,4 that used various guidelines published before the release of the CONSORT-harms extension. This study should be regarded as a reflection of reporting standards in general rather than an assessment of adherence to the CONSORT-harms extension.

This review was strengthened by its assessment of quality of the included studies across four key domains. With the guidance of the Cochrane review12 we have designed a RoB tool to perform a generalisable assessment of the included studies. In this assessment only the one study15 demonstrated low RoB across all four of the assessment criteria. No restriction was placed on the inclusion criteria of the identified studies such that the time span and clinical areas of their included studies varied. While this is a-strength in terms of generalisability of results, it may also be considered as a level of heterogeneity that cannot be explored due to the limited number of studies.

Conclusions and implications

Complete and accurate reporting is essential to guide decisions on advances in medical interventions. The responsibility to ensure greater balance between reporting of both benefits and harms lies with authors of research and journals publishing that research. We recognised that journals have limited space for the reporting of all outcomes which can lead to selective outcomes reporting.24 ,25 We recommend the use of supplementary online tables to help summarise key results on harms.

Further dissemination strategies should be used to ensure that trial journal editors and trial investigators are aware of the importance of adequate reporting of harms-related data in RCTs. As it stands, it is unclear as to whether the problem of the poor reporting of harms data in trial publications is a result of the lack of awareness of the CONSORT for harms statement, or journals and peer reviewers not implementing this guideline. The most effective strategy would follow that of the CONSORT statement with the extension for harms comprehensively incorporated in journal requirements along with clear instructions to peer reviewers for guidelines of acceptance.


The authors are grateful to Su Golder and Fiona Beyer York University for expanding on the item words used within the databases and Alison Beamond University of Liverpool, for recommending databases to search conference abstracts.


View Abstract
  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

    Files in this Data Supplement:


  • CT-S and CG are joint senior authors

  • Contributors AH and CTS carried out all screening of literature; AH and JJK extracted data; AH, JJK, CTS and CG interpreted results and drafted the manuscript.

  • Funding This work was supported by the award of a Capacity-Building Studentship to AH from the Medical Research Council (MRC) (grant number G1000397 – 1/1) North West Hub for Trials Methodology Research, UK (grant number G0800792).

  • Competing interests None.

  • Patient consent Obtained.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement Data extraction form and protocol available on request from

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.