Objectives We evaluated whether interactive, electronic, dynamic, diagnose-specific checklists improve the quality of referral letters in gastroenterology and assessed the general practitioners’ (GPs’) acceptance of the checklists.
Design Randomised cross-over vignette trial.
Setting Primary care in Norway.
Participants 25 GPs.
Intervention The GPs participated in the trial and were asked to refer eight clinical vignettes in an internet-based electronic health record simulator. A referral support, consisting of dynamic diagnose-specific checklists, was created for the generation of referral letters to gastroenterologists. The GPs were randomised to refer the eight vignettes with or without the checklists. After a minimum of 3 months, they repeated the referral process with the alternative method.
Main outcome measures Difference in quality of the referral letters between referrals with and without checklists, measured with an objective Thirty Point Score (TPS).
Difference in variance in the quality of the referral letters and GPs’ acceptance of the electronic dynamic user interface.
Results The mean TPS was 15.2 (95% CI 13.2 to 16.3) and 22.0 (95% CI 20.6 to 22.8) comparing referrals without and with checklist assistance (p<0.001), respectively. The coefficient of variance was 23.3% for the checklist group and 39.6% for the non-checklist group. Two-thirds (16/24) of the GPs thought they had included more relevant information in the referrals with checklists, and considered implementing this type of checklists in their clinical practices, if available.
Conclusions Dynamic, diagnose-specific checklists improved the quality of referral letters significantly and reduced the variance of the TPS, indicating a more uniform quality when checklists were used. The GPs were generally positive to the checklists.
- referral letters
- General practice
- Health & safety
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Statistics from Altmetric.com
Strengths and limitations of this study
We used vignettes to standardise the setting, making the results objective, quantifiable and comparable.
The randomised cross-over design makes comparison of individual general practitioners’ (GPs’) changes in referral letter quality possible, regardless of the GPs’ initial quality level for referral letters.
Score for objective measurements of the quality of referral letters developed by our research team
Vignette design may have appeared unrealistic to the participating GPs, and may have resulted in frustration due to challenges in the virtual communication.
Appropriateness and structure of the referral letters were not measured.
High-quality, written communication is essential for adequate management of patients referred from primary to secondary care. The referral letter is frequently the only information available to the specialist when deciding the patient’s priority and selecting appropriate work-up or examinations before the first consultation at the outpatient clinic.
Referral rates from general practitioners (GPs) to secondary healthcare services are increasing1 2 and challenge the capacity of the secondary care outpatient clinics. A lowered threshold for referral may also potentially cause medical overuse3 and reduced effectiveness of the healthcare system. A considerable proportion of referral letters are of low quality or inappropriate.4–20 Such letters are a challenge for the consultants when assessing the relevance and the priority of the referrals.15 A discrepancy has also been observed between the GPs’ and the specialists’ considerations of referral letters in terms of quality and content.21 The lack of essential information may reduce the quality, safety and cost-effectiveness of the healthcare system due to the scheduling of potentially erroneous work-up or waiting times, or even unnecessary/redundant procedures.3 22
The use of electronic health record (EHR) systems and electronic referrals has increased substantially in the last decade,23 and electronic referrals have improved referral quality.24 25 This shift to increased digitalisation of the healthcare services has also opened for new solutions to facilitate the referral process. Nevertheless, menu-driven structured report generation is neither available in the GPs’ EHR nor in the hospital EHR. Initiatives to increase appropriateness of referrals have been implemented and tested with varying success.26–30 According to a Cochrane review, the implementation of structured referral paper templates has been one of few interventions with a documented effect on referral quality,26 and electronic checklists have been shown to decrease the time spent evaluating referral letters.27 However, the effect of interactive, electronic, dynamic checklists on the quality of referral letters has to our knowledge never been evaluated, neither in a clinical setting nor in a completely standardised trial, and it would be important to test the solution in a virtual setting before launching expensive development and implementation in the GPs’ EHR system.
The aim of the present trial was to assess whether interactive, electronic, dynamic and diagnose-specific checklists improved referral quality and reduced the variation in the quality for referral letters in gastroenterology, one of the major specialities in internal medicine and with the majority of patients followed in the outpatient clinic. To avoid bias from clinical and organisational variation, we wanted to perform the trial in an entirely standardised setting by using vignettes.31 32
We further wanted to assess whether the electronic dynamic user interface was well accepted by the GPs.
We hypothesised that referral letters generated with the use of interactive checklists contain more relevant information with less variation than free-text referral letters.
The primary end point of the trial was the quality of the referrals measured by a Thirty Point Score (TPS).33 The secondary end points were the variance in the quality of the referrals, and the user satisfaction of the GPs.
Between 30 April 2014 and 6 October 2014, we recruited GPs to participate in the trial, mainly in groups through already established mandatory educational groups for GPs in Norway. All MDs working in general practice in Norway were eligible for participation in the trial. GPs in the Asker and Bærum region (n=135), as well as some GPs in Oslo (n=9) and Bergen (n=4) were contacted directly and offered to participate in the trial. Additionally an email with information about the trial and invitation to participate was distributed through a national email-based debate forum for GPs in Norway (Eyr). The GPs did not receive any material compensation for participating in the trial.
We designed the study as a two-armed cross-over trial, where we block-randomised participating GPs to refer eight virtual patients either with an electronic free-text referral or with a combination of the free-text referral together with electronic checklists as referral support. Dropouts after randomisation, before starting the trial (eg, those who did not show up on the agreed date for participation), were replaced by new GPs by continuing downwards on the randomisation key. The randomisation was done before the day of the participation by using permuted block randomisation with different block sizes, generated through the website www.randomization.com.
After a minimum of 3 months, the GPs referred the same eight virtual patients again with the alternative referral letter interface, for example, those who had referred with standard free text in the first round now referred with the checklist support. We chose the 3-month interval to avoid recollection of the checklist items by the GPs who had used these in the first round.
We instructed the GPs to create the referral letters the same way they normally do with a similar real patient, using the same structure, contents and time.
In the first round of the trial, we gathered groups of GPs (n=1–7) for participation together in the hospital computer room/other venue with computer/internet access. One investigator from the study team was present to give information technology support on how to get started, and also to facilitate the communication with the vignettes when necessary, for example, by suggesting alternative phrasing of questions to the vignettes when the simulator failed to give appropriate responses. This was necessary in the beginning as the simulator was sensitive to spelling and did not have a complete natural language. The GPs were quickly accustomed to the form of communication, and mostly required minimal support. In the second round, the GPs could choose whether they wanted to complete the trial in the same way, or if they wanted to do it from their offices or homes at a time of their convenience.
Interactive Dynamic Referral Interface programme
We created an EHR simulator for generating referral letters combined with a virtual patient simulator, in cooperation with Microsoft Norway AS (figure 1). The user interface resembled common EHR systems for primary care, with a section displaying the patient’s medical history, current medication, allergies and family history. It was possible to transfer this information directly to the referral letter by clicking an interactive button. The randomisation procedure determined whether a section for generating standard free-text referral letters or the semistructured referral Interactive Dynamic Referral Interface (IDRI) was activated in the user interface.
The patient simulator, based on a chat functionality, was displayed on one side of the interface. An initial statement indicated the patient’s reason for seeking medical care, for example, “Hi, I am really troubled by loose bowels lately, and it is getting worse. What do you think it could be?” Necessary information regarding relevant symptoms and findings could be obtained by chatting with the virtual patient. The GP could write questions to the patient in the dialogue box, either in whole sentences or using keywords, and the simulator would provide the patient’s answer. In addition, it was possible to order relevant laboratory and radiological examinations. The results of the tests were displayed immediately on the screen and could be transferred to the referral letter. The information provided by the virtual patient during the ‘consultation’ could be written in the referral section using free text or registered by using the checklist function when activated.
The EHR simulator was set up according to the randomisation allocation before the day of the participation by the trial investigator who was also present at the first round of the trial.
The programme is available on the IDRI webpage: www.idri.no (username: IDRIopen, password open123).
We created eight vignettes (virtual patient cases) presenting symptoms and findings within the gastrointestinal (GI) field. The symptoms were chosen according to the main clinical situations in the Norwegian Prioritisation Guidelines for Gastroenterology (NPGg),34 namely dyspepsia, change of bowel habit, diarrhoea, rectal bleeding, long-standing abdominal pain, constipation dysphagia and jaundice/elevated liver enzymes.
Subsequently, we integrated the eight vignettes in the virtual patient simulator with a unique set of answers to anamnestic questions and to laboratory and radiology tests that were made available through using the chat function or ordering the tests in the EHR simulator.
Interactive dynamic referral checklists/support
Sending a referral letter from primary care in Norway requires that at least one International Classification of Primary Care (ICPC)-235 diagnosis is stated in the referral letter. We made a selection of relevant ICPC-2 (2005 version)35 diagnoses for digestive diseases (selected from D01to D99), omitting, for example, diagnoses for acute and paediatric diseases as well as non-GI-specific diseases and anal/oropharyngeal diseases. The selection was made based on which diagnoses could be relevant for the eight vignettes. We also added T08, weight loss, as a potential diagnosis.
When the IDRI functionality was turned on, the GPs’ selection of ICPC-2 diagnosis activated the corresponding checklist. In total, we generated 10 checklists. The list of ICPC-2 codes used and the corresponding checklists can be found in the online supplementary appendix 1.
The checklists were made based on criteria for referral letters stated respectively in the NPGg,34 the Norwegian Handbook for doctors36 and UpToDate.37 After creating an initial draft for the checklists, we adjusted and reduced the content of the checklists based on feedback from experienced gastroenterologists in the study team as well as clinical gastroenterologists.33 The selection of the checklist items was done based on which clinical information items were considered most valuable for assessing and prioritising referral letters in gastroenterology. The checklists can be accessed through the IDRI web page, and a paper-based example can be seen in the online supplementary appendix 2.
The checklists consisted of drop-down menus with check boxes or free-text fields with symptom-specific or finding-specific questions where the GP had the possibility to choose the appropriate variable. Depending on the answers, new checklist items were activated if relevant. The use of checklists was not mandated to refer the virtual patient.
Primary outcome: quality of the referral letters
We assessed all referral letters generated in the trial by using a predeveloped score, the TPS, for objectively measuring quality of referral letters.33 The TPS is a symptom-specific score that consists of the 15 most important variables for assessing and prioritising referrals for nine important GI symptoms (dyspepsia, change of bowel habit, diarrhoea, rectal bleeding, long-standing abdominal pain, constipation dysphagia, jaundice/elevated liver enzymes and weight loss). Points are assigned to the referrals depending on whether the variable/item is described in the referral or not. Both positive (eg, the patient has seen red blood in the stool) and negative (eg, the patient has not seen any blood in the stool) findings are assigned points if adequately described. The five most important variables are classified with 3 points, the next five with 2 points and the last five with 1 point, resulting in a maximum score value of 30 points.
One investigator from the study team scored all the referral letters in the present trial. The investigator was not blinded to the intervention, as the checklists with the GPs’ answers were displayed together with the referral letters as a supplement to the information in the letters.
As secondary outcomes, we also assessed the difference in variance in the quality of the referral letters and compared the frequencies of which important variables were included in referral letters generated with and without checklists.
To give an example of the frequency of individual variables in the referral letters, we did an a priori selection of some variables that we considered relevant for the referrals. We analysed six variables that were relevant for all the referral letters, four variables that were relevant for only the lower abdomen cases (diarrhoea, change of bowel habit, constipation, long-standing abdominal pain and rectal bleeding) and two variables that were relevant for only the upper abdomen cases (dyspepsia, dysphagia). For one variable, we only used the dyspepsia case as the dysphagia case would have included information of the presence of dysphagia by default in the referral letter.
In our previous paper, we reported a mean TPS of 13.3 (SD 4.9) for standard referral letters in gastroenterology without the use of a computer-based checklist.33 We expected an increase of 30% in the score when using the checklist, as well as a smaller variance of the score. The sample size calculation, comparing a mean TPS of 13.3 in the standard referrals with an expected mean of 17.3 in the referrals with checklist, with a two-sided type 1 error probability of 0.05 and a power of 0.80, yielded a minimal sample size of 21 referral pairs needed. In order to secure this quantity of referrals from each indication, we included as many GPs as possible from the local community. This resulted in 25 GPs completing the cross-over study, producing between 21 and 24 referrals per indication.
Descriptive variables are reported as means or proportions with 95% CI. We compared the mean overall TPS between referral letters with and without checklist using a multilevel linear regression model, adjusting for clinical case and the cluster GP. Paired t-test was used to compare the mean TPS stratified by clinical case. In a sensitivity analysis, we tested if the differences in the mean TPS of referral letters with or without checklist were influenced by whether the checklist was used in the first or second round (check for any learning effect). To do this we fitted a mixed linear regression model, adjusted for clinical case and added an interaction term between use of checklist and time of use of checklist (first or second round). We also performed a multivariable linear regression analysis to assess whether the scores differed for age, gender and clinical case. To assess the variance of the TPS between referral letters with and without checklist, we calculated the variance and coefficient of variance (CV) (= [SD/mean]*100%) for all referral letters with and without checklist and displayed box plots showing the median, IQR and minimum/maximum values by type of clinical case. We performed multivariable logistic regression models to compare the proportion of single variables in referral letters generated with or without checklist, adjusting for the clinical case and accounting for the cluster GP. A p value <0.05 was considered statistically significant. The statistical analyses were conducted using SPSS V.23.0 and STATA V.14 (StataCorp).
Between 30 April 2014 and 6 October 2014, 55 GPs were randomised and 45 GPs were included in the first period of the trial. Ten randomised GPs did not show up at the assigned date for participation and were therefore not included in the study. Of the 45 who attended, 25 (55.6%) participated in both rounds of the cross-over trial. The second round was completed between 3 December 2014 and 5 July 2015. The inclusion was ended because the targeted N for paired referrals was reached. One GP was excluded after the first round because he did not activate the ICPC-2 code in the checklist and consequently did not receive the intervention. Some GPs omitted the diagnosis in the individual referral letters or did not complete all of the eight referral letters in each round, resulting in 21–24 pairs of referral letters per clinical case (flow chart, online supplementary appendix 3). The participating GPs were, on average, 53 years old and more women (58%) than men participated (table 1). Sixty-four per cent (16/25) of the GPs who completed the trial had 20 years or more of experience as a doctor. Of the 25 GPs completing both rounds of the trial, 24 GPs answered the final questionnaire about the user friendliness of the system. In the counties Asker and Bærum in Norway, the average age of GPs is 50.8 years, with 47% female GPs.38
Primary outcome: quality of referral letter by TPS
The mean TPS was higher in referral letters with checklist than without checklist overall (mean Δ=6.8, 95% CI 5.1 to 8.5, p<0.001) and across all clinical cases (range of mean Δ=3.8 to 10.0), but differed significantly for the different cases (global p value for clinical case <0.001, from multivariable linear regression adjusting for time of checklist and GP cluster) (table 2 and figure 2). The smallest difference was observed in the abdominal pain referral letters (Δ=3.8 (0.8 to 6.8) and the biggest difference in the dyspepsia referral letters (Δ=10.0 (7.8 to 12.1). Multivariable regression analysis did not show any influence of gender and age of the GP on the quality of the referral letters. In a sensitivity analysis, we tested whether the increase in score differed between GPs who had the checklist in the first or second round and found no difference (p=0.303 for interaction between use of checklist and time of checklist in a multivariable mixed linear regression model) (figure 3)
Secondary outcome: variance in the quality of referral letters and frequency of specific variables
The variance in the checklist referral letters was 26.5, with a range from 4 to 30 points. The variance in the non-checklist referral letters was 36.2, with a range from 0 to 26 points. The CV was 23.3% for the checklist group and 39.6% for the non-checklist group. The box plots in figure 4 and line graphs in figure 2 graphically display the larger TPS variance in referral letters without checklist; however, the pictures differ by type of clinical case.
When looking at essential clinical variables used to differentiate between serious and less serious conditions, the difference between the checklist referrals and the non-checklist referrals was also considerable. These variables are shown in table 3.
Secondary outcome: acceptance of checklist among GPs
The GPs were generally positive on the checklists/referral support (figure 5). Two-thirds, 66.7% (16/24, 95% CI 44.7% to 84.4%), thought they had included more relevant information in the referrals when using the checklists, and that they had included information in the referrals that they would not have included if they had not had access to the checklists. Seventy-six per cent (76.2%) (16/21, three missing, CI 52.8% to 91.8%) reported that they would consider implementing the checklists in their clinical practices if available and 95.5% (21/22, two missing, CI 77.2% to 99.9%) reported that the checklists had potential for improving the collaboration between the primary and the secondary healthcare services. The GPs largely found the size of the checklists to be appropriate, but 33.3% (8/24, CI 15.6% to 55.3%) found them to contain too many questions. A combination between checkboxes and drop-down menus were preferred by 47.8% (11/23, one missing, CI 26.8 to 69.4).
We have evaluated the effect of dynamic diagnose-specific checklists on the quality of referral letters in a standardised setting using vignettes.
Statement of principal findings
We found a significantly higher quality of referral letters with more important clinical variables included when they were written with access to checklists, compared with the ones written using standard free-text templates. The variance in the quality of the referral letters was smaller in the checklist referral letters than in the non-checklist referral letters. The majority of GPs found the checklists useful and would consider using them in clinical practice. The trial was set in Norway, using vignettes from the field of gastroenterology. However, we believe that the results are likely to be transferable to other medical specialties and applicable to other countries with similar referral systems.
Comparison with existing literature
To our knowledge, no other trials have rigorously evaluated the effect of electronic interactive checklists on referral letter quality in a standardised virtual setting.
When exploring relevant literature, the general tendency is that studies aiming to improve referral quality and appropriateness have been largely ineffective. The Cochrane review on such interventions published in 200826 identified only a few successful studies, but with the main focus on referral rates39 and referral appropriateness.40 They concluded that structured referral templates may improve referral practice, but none of the included studies had used electronic referral forms or checklists. Thus, these conclusions may not be valid for comparison with the present trial.
For clinical decision support systems, a review from 2005 stated four important features associated with a beneficial effect, including (1) Automatic provision of the support as a part of clinician workflow, (2) provision of recommendations rather than just assessments, (3) provision of decision support at the time and localisation of the decision-making and (4) being computer based.41 These requirements are probably also valid for our intervention and care should be taken to integrate checklist solutions in the EHR referral systems to minimise any extra workload on the GPs receiving the support.
A few important later studies supporting the use of checklists/referral templates should be mentioned. Rokstad et al 27 made a direct implementation of checklists in existing EHR systems of GPs and found that this electronic optional guideline tool for referrals resulted in higher quality of referral letters and 34% less time spent by the specialist on evaluating each referral letter. Wåhlberg et al 29 assessed the effect of paper referral templates distributed to local GP clinics, and observed an 18% improved quality of referral letters compared with a control group. These clinical studies support the findings of the present trial, and the evidence seems to indicate that checklists are in fact useful for both the referring and the receiving clinician.
In a study of GI endoscopy reports, structured electronic checklists increased the quality of the documentation, and concluded that this was likely due to the reminder effect of the checklist.42 It also concluded that a combination of free text and checklists seems to be the best way to document the procedure.42 These conclusions form the basis for the choice of referral design in the present study, also allowing for a combination of free text and the electronic checklist.
To measure the effect of a checklist on the quality of the referral letter can be challenging because implementation in clinical practice is subject to bias both from patient case mix and from variations in the physicians’ time, stress level, etc. Vignette studies are validated against standardised patients (gold standard) and medical record extraction as an appropriate way of studying quality and variation of physician practice,31 32 43 44 and have been used in various studies.45–53 A virtual solution using clinical vignettes was therefore chosen as the most appropriate way to standardise the setting.31
Strengths and limitations of the trial
The current trial has several strengths. First, we used vignettes to standardise the clinical setting, making the results objective, quantifiable and comparable in a way which would not have been possible using real patients. Additionally, the randomised cross-over design made comparison of individual GPs’ changes in referral letter quality possible, regardless of the GPs’ initial quality level for referral letters.
However, the study has some weaknesses that we would like to address. First, we used a score (TPS) for objective measurements of the quality of referral letters developed by our research team, which could potentially have influenced the results of the current study. This score was validated in another study,33 and the results from this study show that the mean TPS for real referrals in gastroenterology is 13.7, thus somewhat lower than the TPS for free-text referrals in the present study. This most likely reflects a volunteer effect, where the participating GPs may have written referrals of higher quality than what they would have done under normal conditions in a clinical setting. However, the cross-over design makes us able to see past this potential bias.
Second, we did not achieve blinding of the TPS assessor due to the very apparent presence of checklist items in the referrals.
Third, the design of the vignettes may not have appeared realistic to the participating GPs, and also may have resulted in frustration due to challenges in the virtual communication. The alternative would have been to use standardised patients43 or to evaluate the effect of checklists in a clinical setting. However, standardised patients would have been expensive and required much more resources, and real patients would have required implementing our system in existing EHR systems before actually knowing the effect of the intervention.
Lastly, the TPS does not measure all aspects of referral quality but rather quantifies the amount of relevant information covered in each referral letter. Thus, appropriateness and structure of the referral letter are not measured by the score.33 However, the checklists did not aim to improve these aspects but rather the amount of relevant information in the referral letters, and the TPS is well suited for this purpose.
Implications of the study
The findings of the present study support the findings of Rokstad et al 27 and Wåhlberg et al,29 and the evidence supporting the effect checklists on referral letter quality now seems to be well documented. EHR providers should be encouraged to cooperate with both primary and secondary care specialists in developing and implementing appropriate electronic checklists and conduct randomised controlled trials to assess whether it also has an impact on patient outcome or healthcare costs.
The results from the GP survey indicate that a somewhat shorter checklist may be preferred to the one used in the present study.
Unanswered questions/future research
The present study have demonstrated an effect of electronic dynamic checklists on the quality of the referral letters, but the effect on the clinical management of the referred patients is still unknown.
Dynamic diagnose-specific checklists have a positive effect on the quality of referral letters in gastroenterology. The effect is most likely present also for other medical specialties, and corresponding checklists could be developed. GPs are largely positive to the idea of a checklist for referrals. The impact on the clinical management of the referrals remains unanswered.
A special thanks to the participating GPs for their contribution to the project. We could not have done it without you. We would also like to thank the eHealth department in the Vestre Viken Hospital Trust for the continued support throughout the project, as well as Murad Sæther in Microsoft AS for the help in developing the EHR simulator.
Contributors SLE, TdL and LA designed the study. SLE performed the data collection. SLE, CB and CSR performed the power and data analysis. SLE drafted the paper. All authors critically reviewed and improved it. SLE is guarantor. All authors had access to all the data and take responsibility for the integrity of the data and the analysis.
Competing interests None declared.
Ethics approval The study was reported to and approved by the Data Protection Official for research. The Regional Ethics Committee considered the study outside its mandate, and its approval was not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement Full data set and statistical code can be made available from the corresponding author. Participants’ consent was not obtained in accordance with Norwegian Data Protection legislation, but the presented data are anonymised and risk of identification is low.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.