Objectives To describe how adverse event (AE) rates were monitored and estimated nationally across all Norwegian hospitals from 2010 to 2013, and how they developed during the monitoring period. Monitoring was based on medical record review with Global Trigger Tool (GTT).
Setting All publicly and privately owned hospitals were mandated to review randomly selected medical records to monitor AE rates. The initiative was part of the Norwegian patient safety campaign, launched by the Norwegian Ministry of Health and Care Services. It started in January 2011 and lasted until December 2013. 2010 was the baseline for the review. One of the main aims of the campaign was to reduce patient harm.
Method To standardise the medical record reviews in all hospitals, GTT was chosen as a standard method. GTT teams from all hospitals reviewed 40 851 medical records randomly selected from 2 249 957 discharges from 2010 to 2013. Data were plotted in time series for local measurement and national AE rates were estimated, plotted and monitored.
Results AE rates were estimated and published nationally from 2010 to 2013. Estimated AE rates in severity categories E-I decreased significantly from 16.1% in 2011 to 13.0% in 2013 (−3.1% (95% CI −5.2% to −1.1%)).
Conclusions Monitoring estimated AE rates emerges as a potential element in national systems for patient safety. Estimated AE rates in the category of least severity decreased significantly during the first 2 years of the monitoring.
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Strengths and limitations of this study
The study covers, with few exceptions, all publicly and privately owned hospitals in Norway.
The study is based on reviews of 40 851 hospital discharges.
The Global Trigger Tool (GTT) reviewer teams applied the same standards and had the same training.
The reviews were done by many teams and not only by one, which increases variability in results related to discretion.
The size of reviewed samples was the same independent of hospital size.
Based on several studies,1–5 the WHO has estimated that about 10% of patients are harmed in relation to hospital care in Western countries.6 ,7 This has created momentum for large-scale interventions in many countries.8 ,9 To facilitate monitoring of adverse events (AEs) within reasonable use of resources, the Institute for Healthcare Improvement (IHI) developed Global Trigger Tool (GTT) as a standard tool.9 ,10 The GTT definition of an AE is: “unintended physical injury resulting from or contributed to by medical care that requires additional monitoring, treatment or hospitalisation, or results in death”. Studies that used GTT in major US hospitals found AE rates of 27.7% and 33.2%.11 ,12 The GTT definition is at variance with the WHO definition, which may explain some of the discrepancy from the WHO estimate.
A research scan from the Health Foundation has considered GTT to compare well to other approaches, to be relatively sensitive and to identify significantly more AEs compared with self-reporting or other chart audit approaches. Most of the studies describing its use are based on large samples from multiple hospitals. Although the GTT is considered relevant for measuring harm at national level, few studies describe the process of doing so.13 ,14
The Norwegian Ministry of Health and Care Services mandated all publicly and privately owned hospitals to review randomly selected medical records from 2010. GTT was chosen as a standard method in order to be able to aggregate results. The review started on medical records from March 2010 to make a baseline. Results have been published in annual reports since 2011 (for 2010–2013).15 Hospitals plotted data in diagrams on a web-based tool, facilitated by IHI, called Extranet, together with an additional p-control chart tool. This has enabled them to follow and analyse their own results with statistical process control. The initiative from the health authorities was part of the planned national patient safety campaign, called ‘In Safe Hands’, which was launched in January 2011. The campaign aimed to reduce harmful events to patients, to establish competence and hospital routines for patient safety and to improve patient safety culture.
The aim of this study is to describe how the GTT method was applied at national level for monitoring AEs, and how the estimates developed during the monitoring period.
Setting and inclusion criteria
All 19 publicly owned hospitals in Norway participated in the 2011 and 2012 reviews. Some of these hospitals consist of several hospital locations, organised together under the same board and Chief Executive Officer (CEO). Eighteen participated in 2010 and 2013. All five privately owned but publicly funded hospitals participated in 2010, 2012 and 2013, four in 2011. According to the GTT standard methodology, discharges from paediatric, psychiatric or rehabilitation units were excluded. Patients had to be admitted to the hospitals for more than 24 h, to be included.
Translation and validation of the instrument
The original GTT white paper was translated to Norwegian by a professional translator. The professional translation was validated by two clinical consultants (attending) who read it through and gave feedback. It was not back translated since it was considered a tool and not a questionnaire. The translated version was slightly modified by the campaign secretariat, regarding these three triggers on the trigger list:
A medication commonly used against allergic reactions;
The denomination used for blood glucose was changed;
A blood test commonly used to monitor anticoagulation status.
Additional input for minor improvement of the translation came from record reviewers during the first months of the review in 2011. A protocol for how to perform the GTT review in the Norwegian campaign context was provided.16 It included a frequently asked questions (FAQ) list corresponding to the Norwegian context, in addition to the standard list.
Training and standardisation
GTT teams were trained according to the white paper standard and this is further described in the online supplementary appendix. The teams:
Read the white paper and FAQ list before participating in the standardised 1-day course.
Reviewed another 40 medical records before conducting regular reviews.
Gathered at annual national meetings, to audit how the tool was applied.
Standardisation nationally was promoted through the following rules:
Hospitals were required to establish at least one GTT team consisting of a doctor and two nurses.
Each GTT team reviewed 240 medical records per year, strictly according to the IHI method.
All eligible hospital discharges had to be included in the review.
If there were more than one GTT team in a hospital, each team would make a sample of 240 discharges from different parts of the total population of eligible discharges that could not overlap.
Identified AEs were categorised according to the severity categories E to I (description given in table 1), which are standard according to the GTT method, as well as type. GTT teams could not categorise AEs according to more than one severity.
The numbers of discharges with AEs, and AE severities and types per specified discharge were reported in an Excel template, which was annually sent to the campaign secretariat. In addition, the teams reported the total number of discharges that investigated records had been randomly selected from. This number was used for weighting the team results, when making national estimates. National estimated AE rates were presented in a national report as per cent discharges with at least one AE in severity categories E to I compounded to E-I, and F to I compounded to F-I, together with a 95% CI.15 Discharges with multiple AEs in different severity categories were reported in each of the different categories. The compounding of the E-I and F-I categories took this into account by not counting these discharges more than once.
Information that could be used to identify individual patients, for example, age, gender and date of discharge, were not reported, according to regulations for personal protection.
Cross-sectional analysis was performed on data annually reported in Excel templates from the GTT teams to calculate national AE rates with associated 95% CIs, divided into types and severities (table 1). Types of AEs were not estimated in this study, but are reported elsewhere.15 The national AE rates were calculated as a weighted average of individual means for the GTT teams from all hospitals. Since AEs according to type and severity were not symmetrically distributed across the GTT teams, non-parametric CIs for the national AE rates were calculated using 10 000 bootstrap simulations.17 The bootstrap was performed by sampling with replacement from the individual means for all GTT teams. An observation was drawn from the data with probability equal to the weight of the GTT team. For all analyses, p<0.05 was regarded as statistically significant. Microsoft Excel V.2010 was used for collecting and managing the data, cross-sectional analysis and for producing the graph. Bootstrap CIs, based on 10 000 simulations, were calculated with the R statistical software.18
At local level, AE data were measured, registered and plotted in run charts twice a month in Extranet by the GTT teams. Data could also be exported to make p-control diagrams providing analysis over the development of AEs over time, both at local and national level. Local p-control diagrams are not shown here. The national p-control diagram is given in figure 1. The chart was made based on bimonthly data registered in Extranet and the national AE rate is calculated as a weighted average of individual means for all GTT teams. The number of GTT teams contributing to each data point in the chart varied between 32 and 36, meaning that not all of the GTT teams have registered data twice a month in Extranet (39–47 teams did the annual reviews, see table 2). Numbers of discharges reviewed at each point varied between 7 and 20 for each team, and between 320 and 381 at national level. The bimonthly data from Extranet was only used to develop the national p-control diagram; the rest of the analysis was based on data annually reported from the GTT teams in the Excel templates.
To explore the relation between size of hospitals and AE rates for the severity categories E-I, a mixed-effect model with the percentage of discharges with AEs as response was fitted. Year and the number of discharges (used as a proxy for hospital size, reflecting hospital's workload) were included as fixed effects, and hospital was included as a random effect. The data were weighted according to the proportion of medical records that were sampled from each hospital. A 95% bootstrap CI based on 10 000 simulations was used to make inferences of the parameter of interest, the effect of number of discharges. Model fit was assessed using the marginal R2 for mixed-effect models.19
To explore how the national AE rates and severity develop over time, we looked at the differences between the AE rates in 2013 and 2011. To control for the dependence between hospitals at the two time points, the pairwise differences in each hospital were used. Ninety-five per cent CIs for the mean of the hospital pairwise differences were calculated using 10 000 bootstrap simulations (table 1). The analysis of the individual severity categories were considered supplemental to the analysis of the E-I and F-I categories. This is elaborated in the discussion. Since the categories overlapped, usual multiple comparison procedures, such as the Bonferroni correction, were inappropriate.
In 2010, 85% of the country's eligible discharges from March to December were included in the hospital teams’ sample frames. From 2011 and onwards, 98% or more of eligible discharges were included in the study.
Table 2 shows annual estimates and 95% CIs of discharges with at least one AE in severity categories E-I and F-I from 2010 to 2013. These are the parameters that were made public in annual reports.15 The table also gives annual numbers of reviewed discharges, numbers of discharges the records were randomly selected from and number of reviewing GTT teams.
The estimates of discharges with at least one AE in severity categories E-I show a significant decrease from 2011 to 2012 (−2.4% (−4.5% to −0.3%)) and from 2011 to 2013 (−3.1% (−5.2% to −1.1%)). The decrease from 2012 to 2013 is not statistically significant (−0.7% (−2.7% to 1.2%)). The estimates of discharges with at least one AE in severity categories F-I do not show any statistically significant decrease.
The model used to estimate the relation between size of hospitals and AE rates had an acceptable fit, with a marginal R2 of 0.31. Estimated AE rates in severity categories E-I were reduced by 0.09 percentage points (95% CI −0.15 to −0.04) for each 1000 additional discharges, which we used as a proxy for hospital size.
As seen from table 1, estimated decline in national AE rates from 2013 to 2011 was significant for the severity categories E, H and E-I when looking at the pairwise differences in each hospital.
This study describes the variation in estimates of AEs in Norwegian hospitals during a 4 years period obtained as a result of national monitoring. To what extent could the results be biased by the way the review was organised? The reviewers’ experiences from various specialties may have influenced their view on whether, for example, an AE caused intermediate or permanent harm. That could influence whether an AE was categorised as severity F or G. To reduce this potential for bias, we constructed the compounded severity category F-I. Although teams were trained in the same way (except one hospital that had already started), they could have varied in how carefully they examined the medical records. Teams that spend more time on reading nurse notes might find more AEs in the category E, than teams that read them more briefly. Such AEs could, for example, be mild decubitus ulcers that only demanded some cream put on, or falls that gave small bruises, only demanding adhesive plaster. This could have influenced hospital results. To reduce this potential for bias, we constructed the compounded severity category E-I, and counted discharges with multiple AEs only once. Team adjustment of interpretation and application of the GTT definitions and method over time could have influenced the AE estimates and contributed to the decline. Brian Bjørn in the Danish society for patient safety has communicated to the authors that in their experience AE rates of least severity, in the E category, can be exposed to variability in the teams application of the GTT definition over time. To counter we had annual national gatherings where Norwegian GTT teams audited cases and routines, to synchronise and solidify their practices. The meetings contributed to correction of occasional deviation in practice, but did not reveal widespread changes in how teams had applied the method and definitions. We can, however, not be certain about this without scientific verification. The validity of our AE estimates is to some extent confirmed by that they correlate with patients reported perceptions of patient safety surveyed in the same hospitals.20
The results indicate a decline in AEs in the compounded category of E-I and in the category of least severity, the E category. One could easily conclude that the significant decrease in the E category fully explains the decrease in the E-I category. Non-significant decreases in the other severity categories as well, contradict that. They can also contribute to the observed effect in the E-I category. The E-I category is not just the sum of the individual subcategories. One discharge with two AEs in the E and G categories would only be counted once in the E-I category, but twice in the individual subcategories, one in the E category and one in the G category. The observed decline of AEs in the H category could mean that fewer patients experienced severe AEs requiring lifesaving intervention. We consider that the result should be interpreted with caution since the CI is near zero. When we explored the relation between number of discharges in the hospitals and AE rates, we found that hospitals with fewer discharges were more likely to have slightly higher estimates of AEs than those with many discharges. Causality between the decline in AEs and the interventions of the national patient safety campaign15 could not be inferred.
There was a public criticism in 2012 regarding why possible confounding effects of demographic characteristics, like, for example, gender and age, had not been analysed. Collection of demographic data had been prevented by regulations regarding personal protection. In addition, it had been considered that the risk for the sample to be skewed regarding demographic characteristics was minute since the random samples from each year in the study were very large and had been drawn from more or less all somatic hospital discharges in the country.
GTT may reveal risks that are not necessarily captured by diagnosis-based quality registers. This is because the GTT review is done regardless of the patients’ somatic diagnosis. An additional value is that GTT supports plotting results in run charts and control charts for local monitoring, based on small data samples over time. Other opportunities lie in adapting the GTT method, so that its definitions of AEs are applied to more specific clinical contexts, and exploring how AE rates can be used for improvement purposes at hospital and clinical level.
The Norwegian patient safety campaign initiated medical record review in all hospitals to monitor AE rates. The idea was that the information could motivate hospital CEO's efforts to reduce risk for harming patients, and that the monitoring could inform the Norwegian Ministry of Health and Care Services in their governing efforts. Since the GTT tool has facilitated the Ministry to mandate 50% reduction in preventable AEs over the next 5 years, it can be said to have succeeded as a surveillance tool. But we have had little systematic information about how CEOs have used the information to monitor or improve patient safety at hospital level. It is reported that clinicians get frustrated by the data, because they mostly exist at hospital level, and therefore are not relevant to evaluate interventions at clinical level. To further expand understanding on how structural conditions and organisational culture influence AE rates, AE estimates could be compared with surveys of workplace environment and patient safety culture. These are opportunities for future research.
The study describes a successful experience of monitoring national estimates of AEs based on medical record reviews in all hospitals. Estimated AE rates in the category of least severity decreased significantly during the first 2 years of the monitoring. Smaller hospitals had slightly higher AE rates than larger hospitals. Monitoring AE estimates emerges as a potential element in national systems for patient safety.
The authors thank all GTT teams in the hospitals for their contributions to the study. They also thank the three referees for valuable comments, which have led to great improvements to the quality of the paper.
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
- Data supplement 1 - Online supplement
Contributors ETD was responsible for designing the study, training of reviewer teams, data collection, analysis and interpretation of data, and writing the manuscript. GB participated in designing the study and revising the manuscript critically. JCL participated in the statistical analysis and interpretation, in addition to revising the manuscript. MH participated in data collection, and was responsible for the statistical analysis and interpretation, in addition to revising the manuscript critically. All authors read and approved the final manuscript.
Funding Reviewer teams were trained by the National Patient Safety campaign, The Norwegian Directorate of Health. The teams work was funded by the hospitals. The research was funded by Akershus University Hospital's Health Services Research Unit and by Norwegian Computing Center.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement No additional data are available.