Reporting quality of randomised controlled trial abstracts among high-impact general medical journals: a review and analysis
Meredith Hays1,2, Mary Andrews1,2, Ramey Wilson1,2, David Callender2, Patrick G O'Malley1,2, Kevin Douglas1,2

1Department of Medicine, Uniformed Services University, Bethesda, Maryland, USA
2Department of Internal Medicine, Walter Reed National Military Medical Center, Bethesda, Maryland, USA

Correspondence to Dr Meredith Hays; mhays24{at}gmail.com

Abstract

Objective The aim of this study was to assess adherence to the Consolidated Standards of Reporting Trials (CONSORT) for Abstracts by five high-impact general medical journals and to assess whether the quality of reporting was homogeneous across these journals.

Design This is a descriptive, cross-sectional study.

Setting Randomised controlled trial (RCT) abstracts in five high-impact general medical journals.

Participants We used up to 100 RCT abstracts published between 2011 and 2014 from each of the following journals: The New England Journal of Medicine (NEJM), the Annals of Internal Medicine (Annals IM), The Lancet, the British Medical Journal (The BMJ) and the Journal of the American Medical Association (JAMA).

Main outcome The primary outcome was per cent overall adherence to the 19-item CONSORT for Abstracts checklist. Secondary outcomes included per cent adherence in checklist subcategories and the homogeneity of reporting quality across the individual journals.

Results Search results yielded 466 abstracts, 3 of which were later excluded as they were not RCTs. Analysis was performed on 463 abstracts (97 from NEJM, 66 from Annals IM, 100 from The Lancet, 100 from The BMJ, 100 from JAMA). Analysis of all scored items showed an overall adherence of 67% (95% CI 66% to 68%) to the CONSORT for Abstracts checklist. The Lancet had the highest overall adherence rate (78%; 95% CI 76% to 80%), whereas NEJM had the lowest (55%; 95% CI 53% to 57%). Adherence rates to 8 of the checklist items differed by >25% between journals.

Conclusions Among the five highest impact general medical journals, there is variable and incomplete adherence to the CONSORT for Abstracts reporting checklist of randomised trials, with substantial differences between individual journals. Lack of adherence to the CONSORT for Abstracts reporting checklist by high-impact medical journals impedes critical appraisal of important studies. We recommend diligent assessment of adherence to reporting guidelines by authors, reviewers and editors to promote transparency and unbiased reporting of abstracts.

  • randomized controlled trials
  • CONSORT for Abstracts
  • compliance
  • guideline*
  • quality of report*

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/


Strengths and limitations of this study

  • Data were gathered through an objective extraction process.

  • Our study benefitted from a large sample size.

  • Reviewers were blinded to journal identity, and articles were randomly assigned to reviewers for scoring.

  • Articles were from 2011 to 2014, providing researchers and editors adequate time to fully implement the checklist guidelines published in 2008.

  • Our study conclusions may not be applicable to journals not included in our analysis.

Background

Randomised controlled trials (RCTs) are considered the gold standard of evidence for interventions, but assessing trial validity depends on the quality and transparency of the study report.1,2 This requires the inclusion of key study information in abstracts and manuscripts so that readers can properly assess the validity and generalisability of each study and apply the findings to their patient population. Responsibility extends to peer reviewers and medical journal editors, who must verify that the information needed to evaluate study quality is reported in abstracts and manuscripts; guidelines exist to ensure that the essential elements are reported in both.3–6 The abstract may be the only portion of the study that is read, and without clear reporting, this can lead to misinterpretation and poor patient outcomes.7 One study found that as many as 63% of practising internal medicine physicians relied solely on the abstract when reading general internal medicine or general medical journals, a figure that was consistent for physicians with and without formal epidemiological training.7

The Consolidated Standards of Reporting Trials (CONSORT) Statement was initially developed in 1996 to improve the reporting of RCTs in journals.3–6,8 These guidelines underwent subsequent modification in 2001, 2006 and 2010 to improve the manuscript reporting process.8 In 2008, the CONSORT for Abstracts checklist was created as an extension of the original CONSORT Statement in order to improve the reporting of RCT abstracts in journals and conference proceedings, thereby allowing readers to quickly assess the validity and applicability of a trial.4

Editors of high-impact journals have endorsed the use of the CONSORT guidelines to facilitate transparent and unbiased reporting of trial results.9,10 Articles published in high-impact journals are commonly cited in the medical literature and frequently reported in the lay press;11 accordingly, the importance of complete and unbiased reporting in these journals is paramount. However, two previous studies examining high-impact medical journals highlighted the lack of adherence to reporting guidelines in RCT abstracts.12,13 A number of other studies have also assessed the quality of reporting of RCT abstracts using various methodologies, but many were limited in rigour by small sample sizes, limited blinding and lack of evaluation of inter-rater agreement.14–21 We conducted this study to rigorously assess the adherence of high-impact, high-visibility general medical journals to the reporting quality standards set forth in the CONSORT for Abstracts checklist. The primary outcome was per cent overall adherence to the 19-item CONSORT for Abstracts checklist. Secondary outcomes included per cent adherence in checklist subcategories and the homogeneity of reporting quality across the individual journals.

Methods

Search strategy and study selection

We conducted a descriptive, cross-sectional study of RCT abstracts in the five journals with the highest impact factors in 2014.22 We included abstracts published between 2011 and 2014 in The New England Journal of Medicine (NEJM), the Annals of Internal Medicine (Annals IM), The Lancet, the British Medical Journal (The BMJ) and the Journal of the American Medical Association (JAMA) that reported the main results of parallel-group RCTs. We excluded observational or cohort studies, interim analyses, economic analyses of RCTs, post-trial follow-up studies, subgroup and secondary analyses of previously reported RCTs, editorials, letters and news reports. An author (KD) not involved in the abstract scoring applied the search strategy (figure 1) on 1 December 2014 to identify up to 100 of the most recent RCTs published in each of the top five general medical journals that met the eligibility criteria. Abstract selection for a particular journal started with those published in 2014, proceeded backwards in time and stopped when 100 eligible abstracts for that journal had been identified or the search for the year 2011 was completed, whichever came first. Abstracts were stored in EndNote X7 (Thomson Reuters, Philadelphia, Pennsylvania, USA). An author (DC) not involved in abstract scoring or data analysis imported abstracts from EndNote X7 into Excel (Microsoft, Redmond, Washington, USA). A customised Excel macro removed the PubMed identification code, journal name, author names and journal-specific subheadings to ensure that reviewers were blinded to journal. The same author (DC) maintained the key linking journal number (J1–J5) to journal name. To avoid bias, the authors remained blinded to journal identity through data analysis and synthesis.
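For readers who script this kind of pipeline, the de-identification step can be reproduced in a few lines. The study itself used a customised Excel macro; the Python sketch below only mirrors the idea, and the file and field names (pmid, journal, authors, abstract) are hypothetical rather than taken from the authors' workflow.

```python
# Minimal sketch of journal-blinding a set of abstract records.
# Hypothetical CSV columns: pmid, journal, authors, abstract.
import csv
import random

def deidentify(in_path: str, out_path: str, key_path: str) -> None:
    with open(in_path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))

    # Map each journal name to a blinded code (J1-J5); the key is kept
    # in a separate file by an author not involved in scoring.
    journals = sorted({row["journal"] for row in rows})
    key = {name: f"J{i + 1}" for i, name in enumerate(journals)}

    random.shuffle(rows)  # random order before assignment to reviewers
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["journal_code", "abstract"])
        writer.writeheader()
        for row in rows:  # drop pmid, journal and author names entirely
            writer.writerow({"journal_code": key[row["journal"]],
                             "abstract": row["abstract"]})

    with open(key_path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(sorted(key.items()))

deidentify("abstracts.csv", "blinded_abstracts.csv", "journal_key.csv")
```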

Figure 1

MEDLINE search strategy.

An a priori sample size calculation using a β of 0.2 (power of 0.8) and an α of 0.05 indicated that a minimum of 58 studies per journal would be needed to detect a difference in adherence of 25% or greater between any two journals using the two-sample proportions Pearson's χ2 test. The 25% threshold was chosen by the authors as the minimum difference they considered meaningful between any two journals.
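For illustration, this calculation can be approximated with the standard normal-approximation formula for comparing two independent proportions. The protocol does not state the baseline proportions used, so the 50%-centred pair below is our assumption; it happens to yield a minimum close to the reported 58 abstracts per journal.

```python
# Normal-approximation sample size for detecting a difference between
# two independent proportions (two-sided alpha, power = 1 - beta).
from math import ceil
from scipy.stats import norm

def n_per_group(p1: float, p2: float,
                alpha: float = 0.05, beta: float = 0.2) -> int:
    z_alpha = norm.ppf(1 - alpha / 2)
    z_beta = norm.ppf(1 - beta)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p1 - p2) ** 2)

# Assumed 25-point gap centred on 50% adherence (37.5% vs 62.5%);
# yields 59 per journal, close to the paper's reported minimum of 58.
print(n_per_group(0.375, 0.625))
```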

Checklist development, application and inter-rater agreement

We determined compliance with each aspect of the CONSORT for Abstracts checklist through the use of a 19-item checklist (figure 2). This checklist was developed in an iterative manner by the authors by expanding the published CONSORT for Abstracts checklist to allow for evaluation of each component of the recommendations.4 Discrepancies between authors over the application of checklist items were resolved by consulting the published explanation of the CONSORT for Abstracts checklist4 and by adding instructions and examples to the checklist items as shown in figure 3. Prior to the scoring of study abstracts, raw inter-rater agreement for each item on the 19-item checklist was evaluated through a test run of 32 RCT abstracts published before 2011 and scored independently by three physician authors (MA, MH and RW) with graduate-level training in epidemiology and critical appraisal. We chose raw per cent agreement as a measure of inter-rater reliability for its simplicity and because of the known difficulties with chance-corrected measures of agreement such as Cohen's κ, which can produce misleadingly low values in the setting of high per cent agreement.15,23–25 After ensuring adequate inter-rater agreement with this sample, each study abstract was scored by a single author (MA, MH or RW).
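Raw per cent agreement itself is straightforward to compute. A minimal sketch with invented scores (1 = adherent, 0 = not adherent), averaging pairwise agreement across the three raters for one checklist item:

```python
# Raw per cent agreement for one checklist item across three raters.
# Each inner list holds one abstract's three independent scores.
from itertools import combinations

def raw_agreement(scores: list[list[int]]) -> float:
    agree = total = 0
    for row in scores:
        for a, b in combinations(row, 2):  # every pair of raters
            agree += a == b
            total += 1
    return agree / total

ratings = [[1, 1, 1], [1, 0, 1], [0, 0, 0], [1, 1, 0]]  # toy data
print(f"raw agreement: {raw_agreement(ratings):.0%}")
```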

Figure 2

CONSORT for Abstracts checklist.

Figure 3

Flow diagram of the study.

Data extraction and analysis

Study abstracts were randomly ordered using a computer-generated sequence in Excel (Microsoft) and divided among the three physician authors for review. Each item on the checklist was scored dichotomously (figure 3). The proportion of abstracts adherent to each checklist item was calculated for the entire sample and for the abstracts published in each journal. A χ2 test of homogeneity with a significance level of 0.05 was used to test the null hypothesis that the proportion of abstracts adherent to checklist items was homogeneous across journals.26
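As a sketch of the homogeneity test for a single checklist item, the 2×5 contingency table of adherent versus non-adherent counts per journal can be passed to a standard χ2 routine. The adherent counts below are invented; only the per-journal sample sizes are taken from the study.

```python
# Chi-squared test of homogeneity: is the proportion of abstracts
# adherent to one checklist item the same across the five journals?
import numpy as np
from scipy.stats import chi2_contingency

adherent = np.array([40, 30, 85, 60, 55])       # J1..J5 (invented)
scored = np.array([97, 66, 100, 100, 100])      # abstracts per journal
table = np.vstack([adherent, scored - adherent])  # 2 x 5 table

chi2, p, dof, _ = chi2_contingency(table)
print(f"chi2 = {chi2:.1f}, dof = {dof}, p = {p:.4f}")
# The null of homogeneous adherence is rejected when p < 0.05.
```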

Descriptive analysis was performed using STATA V.13 (StataCorp, College Station, Texas, USA). The proportion of adherence was determined across all journals and checklist items, for all checklist items by individual journal and for all journals by individual checklist item. This represented the average of the adherence rates for all checklist items, weighted by the number of abstracts scored for each item. The number of abstracts scored differed only for blinding, which comprised two items: item 10 assessed whether blinding was reported at all, whereas item 11 looked for a detailed description. If the study was described as blinded or masked to group assignment, the abstract was rated adherent only if it also stated who was blinded or masked, in keeping with the CONSORT for Abstracts checklist. If the study was not described as blinded or masked to group assignment, the abstract was not scored for blinding.
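The abstract-weighted average described here reduces to pooling: the total number of adherent scores divided by the total number of scored items, to which a normal-approximation 95% CI can be attached. A minimal sketch with invented per-item counts (the last pair mimics the blinding-detail item, which is scored on fewer abstracts by design):

```python
# Overall adherence as the abstract-weighted average of per-item rates,
# i.e. total adherent scores over total scored items, with a normal
# approximation 95% CI. The (adherent, scored) pairs are invented.
from statsmodels.stats.proportion import proportion_confint

items = [(310, 463), (447, 463), (120, 463), (150, 240)]

adherent = sum(a for a, _ in items)
scored = sum(n for _, n in items)
low, high = proportion_confint(adherent, scored, alpha=0.05, method="normal")
print(f"overall adherence {adherent / scored:.0%} "
      f"(95% CI {low:.0%} to {high:.0%})")
```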

The raters (MA, MH and RW) were all trained clinicians with expertise in clinical epidemiology and critical appraisal.

Results

Journal characteristics

Individual journal characteristics are listed in table 1.

Table 1

Journal characteristics

Study characteristics

The PubMed search yielded 466 study abstracts from the top five general medical journals (100 from NEJM, 66 from Annals IM, 100 from The Lancet, 100 from The BMJ and 100 from JAMA), 3 of which were later excluded during abstract scoring (NEJM) because they were not RCTs (figure 3). Of note, Annals IM published fewer RCTs (n=66) during the study timeframe than the other journals in our study. Mean agreement among the three reviewers for checklist items was 84% in the pre-study run-in.

Assessment of reporting quality of the CONSORT for Abstracts checklist items

Overall, adherence to the CONSORT for Abstracts checklist varied among journals (table 2). Analysis of all scored items showed an overall adherence of 67% (95% CI 66% to 68%). Adherence was lowest for the reporting of allocation concealment and random sequence generation. Conformity to the checklist was highest for reporting clear interpretations of the trial, stating trial objectives clearly and including trial registration data. Adherence rates for 8 of the checklist items differed by >25% between journals. The Lancet had the highest overall adherence rate (78%; 95% CI 76% to 80%), whereas NEJM had the lowest (55%; 95% CI 53% to 57%).

Table 2

Adherence by checklist item

When comparing compliance among the individual journals by checklist items, NEJM lagged behind in more categories (five) than all other journals combined, but led the other journals in reporting of harms. The BMJ and JAMA lagged behind the other journals in reporting the funding source of the study. Overall adherence rates displayed substantial heterogeneity among journals. The Lancet had the highest overall adherence rate, whereas NEJM had the lowest (table 3).

Table 3

Adherence to checklist items by individual journal

Discussion

This descriptive, cross-sectional analysis of adherence to the CONSORT for Abstracts checklist in the five highest impact general medical journals from 2011 to 2014 showed an overall adherence of 67%, with markedly lower adherence to individual checklist items (down to 0% for some items in some journals) and substantial variability across journals. Reporting of allocation concealment and random sequence generation in the abstract text was uncommon (<25%) across all journals except The Lancet. The Lancet showed the highest rate of overall adherence and NEJM the lowest (78% and 55%, respectively), although this difference did not meet our prespecified criterion for a meaningful difference of 25%.

Similar to prior work, we found that incomplete adherence to abstract reporting guidelines persists,12,13,27–38 particularly for domains known to influence study results (eg, allocation concealment and blinding).12,13,15 Our study improved on previous work as a large descriptive, cross-sectional study with a larger sample (n=463) of recently published abstracts in high-impact journals, where results receive the greatest attention from the clinical research and practising communities. This allowed us to demonstrate, with statistical significance, the heterogeneity across journals. By examining abstracts published between 2011 and 2014, this study provides an updated view of the state of adherence to reporting guidelines since Ghimire et al,12 which evaluated abstracts published in 2010. Although comparison with Ghimire's work suggests modest improvement in areas such as blinding, our study showed that adherence is still suboptimal (<60% for many items). There is also wide variation in reporting between journals, indicating an opportunity to standardise abstract reporting across the medical literature. Hopewell et al4 examined the impact of editors' implementation of the CONSORT for Abstracts checklist, and their results suggested that effective application of the guidelines led to improved reporting of RCT abstracts: active implementation led to immediate improvement in the mean number of reported checklist items. The call for improvement in the reporting of RCTs and their abstracts is not new.12,28–38 Indeed, many high-impact journals have endorsed the use of the CONSORT Statement and the CONSORT for Abstracts checklist.9 Various iterations have been created in an effort to improve reporting across a variety of research venues.36

Journals whose editors endorse and enforce the checklist show evidence of improved abstract reporting.13 The five high-impact general medical journals examined in our study have endorsed the use of CONSORT as well as the CONSORT for Abstracts checklist, either through their instructions to authors, inclusion on the CONSORT website or both (table 1), although some were more explicit than others. Suboptimal adherence therefore suggests that what is lacking is enforcement. Journals of the calibre featured in this study would be expected to be better able to enforce guidelines, given their presumably more robust editorial staff and more rigorous copy-editing processes. Their failure to enforce the guidelines suggests that further steps are necessary to maintain adherence. One suggestion would be to improve communication of expectations by making the CONSORT for Abstracts checklist a more obvious requirement. Also, our inter-rater agreement on the run-in data was only 84%, which may indicate a need to make the CONSORT checklist less ambiguous for authors, peer reviewers and editors in order to achieve improved adherence.

Our study had several potential limitations. First, our conclusions may not apply to journals not included in our analysis. Second, we did not analyse temporal trends within our timeframe; if such trends exist, it may be misleading to identify one journal as lagging behind the others when rates of improvement also differed between journals. Third, we considered all checklist items to be of equal importance, but experts may differ on the relative importance of each item. Finally, our inter-rater agreement averaged only 84%.

Our study's strengths included robust and reproducible data extraction, blinding of reviewers to journal identity, random assignment of articles to reviewers and inclusion of studies from a time period (2011–2014) that provided adequate opportunity for implementation of the CONSORT for Abstracts checklist after its publication in 2008.

In conclusion, the CONSORT for Abstracts is a valuable tool for improving transparency of reporting of clinical trial results. However, our findings indicate a need for systematic editorial and reviewing processes to improve adherence to these guidelines and the transparency of abstract reporting in high-impact medical journals. If we are going to realise the full potential of the CONSORT for Abstracts checklist in improving the quality of abstract reporting, it is critical that editors, peer reviewers and authors commit to its conscientious application.


Footnotes

  • Contributors KD had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. KD and PGO'M were responsible for the initial study conception, design and protocol. KD, PGO'M, MA, RW, MH and DC made substantial contributions to the initial drafting of the manuscript. KD, PGO'M, MA and MH made critical revisions of the submitted manuscript. DC was responsible for de-identifying and randomly distributing abstracts to reviewers and maintaining the key to journal identity for each abstract. MA, RW and MH were responsible for abstract scoring. MH was responsible for the initial draft and for overseeing and integrating individual revisions to the manuscript. By-line: KD, PGO'M, MA, RW, DC and MH.

  • Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.

  • Disclaimer The views expressed are those of the authors solely and are not to be construed as representing the views of the Department of Defense, the Department of the Navy, the Department of the Army or the Uniformed Services University of the Health Sciences.

  • Competing interests None declared.

  • Ethics approval Neither patients nor patient information was used in this study. This study was deemed exempt by the Institutional Review Board.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement Extra data can be accessed via the Dryad data repository at http://datadryad.org/ with the doi:10.5061/dryad.21b04.