Article Text


Impact of referral templates on patient experience of the referral and care process: a cluster randomised trial
  1. Henrik Wåhlberg1,2,
  2. Tonje Braaten1,
  3. Ann Ragnhild Broderstad2,3
  1. 1Department of Community Medicine, UiT The Arctic University of Norway, Tromso, Norway
  2. 2University Hospital of North Norway, Harstad, Norway
  3. 3Centre for Sami Health Research, UiT The Arctic University of Norway, Tromso, Norway
  1. Correspondence to Dr Henrik Wåhlberg; henrik.wahlberg{at}


Objectives To evaluate if a referral intervention improves the patient experience of the referral and treatment process.

Setting Interface between 14 primary care surgeries and a district general hospital.

Participants The 14 general practitioner (GP) surgeries (7 intervention, 7 control) in the area around the University Hospital of North Norway Harstad were randomised and all completed the study. Consecutive individual patients were recruited at their hospital appointment. A total of 500 patients were recruited with 281 in the intervention and 219 in the control arm.

Interventions Dissemination of referral templates for 4 diagnostic groups (dyspepsia, suspected colorectal cancer, chest pain and chronic obstructive pulmonary disease) coupled with intermittent surgery visits by study personnel. The control arm continued standard referral practice. The intervention was in use for 2.5 years.

Outcome The main outcome was a quality indicator score. This paper reports a secondary outcome, the patient experience, as measured by self-report questionnaires. GPs in the intervention group could not be blinded. Patients were blinded to intervention status. Analysis was based on single-question comparison with a questionnaire subscore used to assess the effect of clustering.

Results On the individual questions, overall satisfaction was very high with minor differences between the intervention and control group. Interestingly, the most negative responses, in both groups concerned questions relating to patient interaction and information. Very little evidence of clustering was found with an estimated intracluster correlations coefficient at 1.21e−11.

Conclusions In total, this indicates no clear effect of the implementation of referral templates on the patient experience, in a setting of generally high patient satisfaction.

Trial registration number NCT01470963; Results.

  • Patient experience
  • Referral
  • General practitioner

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See:

Statistics from

Strengths and limitations of this study

  • Clinically relevant research in a regular district hospital setting.

  • High response rate.

  • Newly developed questionnaire hampers wider generalisation.


Evaluation of patient experience and satisfaction is widespread with a wealth of literature concerning the development and use of questionnaires.1–5 The evaluation of patient experience can help drive quality improvement,6 and improved patient experience is associated with safety and clinical effectiveness.7

Care coordination is an important aspect of a well-functioning high-quality health service. It has been defined as ‘the deliberate integration of patient care activities between two or more participants involved in a patient's care to facilitate the appropriate delivery of health care services’.8 In the USA, the National Quality Forum (NQF) has published preferred practices for care coordination, including transitions of care.9 This report includes clear recommendations for participation of the patient, or his/her designee, in the decision, planning and execution of a care transition. This is important, as exemplified by a recent Australian article, where patients with colorectal cancer perceived that poor information exchange led to suboptimal care.10 Hence assessing patient experience of the referral process may be beneficial in assessing the effect of a referral intervention.

This article presents the patient experience aspect of a cluster randomised study evaluating the effect of the implementation of referral templates for four diagnostic groups—dyspepsia, suspected colorectal cancer, chest pain and chronic obstructive pulmonary disease (COPD)—in the patient referral pathway.11 Previously, we have shown that the referral templates led to increased referral quality,12 and the effect on the main outcome, quality of care at the hospital, is in publication. This publication aims to assess whether the implementation of a referral template in the transition of care from the general practitioner (GP) to the hospital has affected the patient experience of the care process.


Study setting

In Norway, the healthcare system is quite uniformly organised throughout the country. GPs act as gatekeepers to secondary care,13 with specialist health services delivered by governmentally owned regional health authorities, mainly through public hospitals. Some specialist outpatient care is delivered by private specialists, but this is mainly purchased by the regional health authorities. The access to private specialists in the geographical area of the current study is very limited.

Study design

The study was designed as a cluster randomised study with the GP surgery as the clustering unit. A total of 14 surgeries were randomised, 7 to the intervention and 7 to the control group. The clustered design was chosen to avoid possible spill-over effect from the intervention to control GPs. Randomisation was performed by simple drawing by a person not connected to the research team, stratified by town versus countryside location of surgery.

As the intervention was to be actively used by the GPs, the referring GP could not be blinded. Patients, hospital doctors and outcome evaluators were blinded to the intervention status of the patients. Owing to the design of the intervention, the referral letter would sometimes reveal the intervention status, if the electronic template was used. No separate sample size calculation was performed for the patient experience outcome. The full study details are published in the methods paper.11


The intervention consisted of the distribution of four separate referral templates to the intervention surgeries. These templates covered four clinical areas (dyspepsia, suspected colorectal malignancy, chest pain and COPD). The templates were to be used when initiating a new referral to the medical outpatient clinic at the University Hospital of North Norway, Harstad (UNN Harstad). The templates were distributed by the corresponding author (HW) during educational and/or lunch meetings and were provided as laminated reference sheets or in electronic form. In addition, follow-up visits were conducted regularly during the study period and intermittent mail leaflets and reminders were distributed to the intervention surgeries. Control offices continued standard referral practice.


The main outcome in the project was the quality of care delivered to each individual patient. In addition, health process indicators such as correct prioritisation were recorded and referral quality was also compared between the intervention and control group. The current paper presents the patient experience aspect of the study, as measured by self-report questionnaires.


The 14 GP surgeries primarily served by UNN Harstad were included in the randomisation process. In 2013, these surgeries had a total list size of 39 523 patients. The individual patients were recruited from consecutive new patients within one of the four clinical areas referred to, at the medical outpatient clinics at UNN Harstad. Study information and a consent form were sent to each individual patient together with their appointment letter. Further information, including a new consent form if appropriate, was provided at their hospital appointment. The individual patients were analysed as part of the intervention or control group depending on the intervention status of the GP surgery they were referred from. Children (<18 years of age) and patients with reduced capacity to consent were excluded from the study.


The study recruited patients for ∼2.5 years and a total of 538 patients were included with 290 in the intervention arm and 227 in the control arm. The remaining 21 patients were referred from GP surgeries outside the regular area of UNN Harstad, and as such neither in the intervention nor the control group. These 21 were excluded, together with 17 patients who did not fill the inclusion criteria. In total, this left 281 patients in the intervention arm and 219 patients in the control arm (figure 1).

Figure 1

Patient inclusion and questionnaire response.

Questionnaire development

Multiple tools exist for measuring different aspects of care coordination14 and patient experience; however, no complete questionnaire was located that covered the area in the current study completely. Therefore, a questionnaire was developed by combining validated questionnaires regarding patient experiences and care coordination. The questions used were the full version of the Generic Short Patient Experiences Questionnaire (GS-PEQ),15 together with two further questions used in patient experience questionnaires in the Norwegian healthcare system (questions 11 and 12)16 and the two questions about health interaction from the Commonwealth Fund Survey 2010.17 Three further questions were added to assess (1) who referred the patient, (2) if the referral was seen as appropriate and (3) an overall evaluation of the institution. Table 1 presents the questions in the questionnaire. GS-PEQ and questions 11–12 use Likert-style response categories. The health interaction questions had a yes/no response. The full questionnaire, including the demographic questions, is available on request.

Table 1

Questionnaire details

The questionnaire was piloted for content validity with four local health professionals; these felt that it covered the important aspects of patient experience and care coordination. It was then piloted with five outpatients with a median age of 72 years (average 68.8 years) to ensure face validity and acceptability. Two patients needed clarification on one of the questions before they felt they could answer, and the wording of this question was adjusted accordingly. The patients felt the questionnaire was acceptable, with logical response categories and that the questions covered their clinical path during the referral process well. These patients did not take part when the project was later initiated. No further formal evaluation of the questionnaire was carried out.

Questionnaire distribution

The questionnaire was mailed to patients who had consented to take part in the referral project presented above. To increase response rates, a prepaid response envelope was included, addresses were handwritten, the questionnaire was kept short and association with research bodies was indicated.3 For non-responders, one reminder was sent ∼1 month after the first questionnaire, with a new questionnaire and prepaid response envelope.


The study followed the principles of the Helsinki Declaration. Before recruitment started, it was presented to the Regional Committee for Medical and Health Research Ethics Northern Norway, who determined it not to be within the scope of the Health Research Act (REK NORD 2010/2259). The project has been approved by the Data Protection Officer for Research. The study is registered at, with trial registration number NCT01470963. All patients provided written informed consent.


To further aid the assessment of clustering, missing data were imputed. For the imputation, answers set as ‘not applicable’ were counted as missing. Missing data were seen to be random and multiple imputation using chained equations was employed. This has been shown to perform well for a variety of variable scaling types.18 Every variable used in further statistical analysis was entered into the imputation model, as failure to do so may bias estimates towards the null.19 The ordinal response scales for each single question were to be combined into a continuous score, and as such, it was determined that imputation with predictive mean matching was appropriate. As shown by van Buuren,19 the number of iterations can usually be quite low, between 5 and 20. In this study, the Stata standard of 10 iterations as burn-in period was used.

Statistical analysis

Results are presented on single question basis with comparison between the two groups using the Mann-Whitney U test for ordinal data and χ2 test for nominal data. No correction for clustering was made as the estimated ICC was very low (shown below). Aggregation of scores was postulated in the methods paper,11 but discarded as a main outcome as properties of the questionnaire, with a ‘not applicable’ answering category, are not easily suitable for such an approach. However, to assess the effect of clustering, a sum of scores from the GS-PEQ part of the questionnaire was calculated and a multilevel regression model was built with the GS-PEQ score from the questionnaire as the dependent variable. Intervention status was included in the model as this is the main point of interest. Gender, age, education level and self-perceived health were included in the model, as these tend to influence patient experience.20–22 Age was centred to ease interpretation in a mixed model analysis.23 Self-perceived health was reported on a five-level Likert-style scale and education level in four categories. Both were included as dummy variables in the analysis. Other confounding variables were assessed in the model and included if their inclusion led to a >10% change in the main outcome when added to the base model (main outcome+intervention status). Relevant interactions were checked for relevant variables, where p<0.10 was set as the significance level. As imputation was used, Monte Carlo error estimates were employed to assess the level of simulation error, as suggested by White et al.24 Normal evaluation of multilevel models with loglikelihood ratio tests were not carried out, as this is not well defined for multilevel models with imputed data. The analysis employed restricted maximum likelihood techniques throughout, as suggested when the number of clusters is small.25 As described, multiple imputation was used to account for missing data in the multilevel regression model assessing the effect of clustering. Stata V.13.1 (StataCorp 2013, Texas, USA) were used for all analysis.


Response rate

The response rate was 69.4% before reminders were sent out, rising to 82.0% after reminders (figure 1). The mean age for responders was 61 years and for non-responders 47 years (t-test <0.0001). There was no significant gender difference between the responders and non-responders, and the response rate did not differ significantly between the intervention and control group (χ2 test).

Missing data

Missing data for most questions were low, ranging from 0 to 11 out of 410 answered questionnaires. Statistically, these were considered missing completely at random (MCAR) with no clear relation to either age, gender, self-perceived health, disease severity or other variables.26 However, questions 6, 10 and 12 in the general part of the questionnaire had higher amounts of not applicable ranging from 14 to 34 representing 3.4–8.3% of returned questionnaires. In these questions, the word ‘treatment’ was used. This was intended to cover the medical examination and interventions during the outpatient visit. However, it seems that this has been misunderstood by several patients. It seems reasonable to assume that patients who underwent ‘only’ diagnostic evaluation felt that they had received no ‘treatment’, and hence felt unable to answer the question. This was also highlighted by one patient in a free-text response in the questionnaire. ‘Not applicable’ to questions 6, 10 and 12 did not vary significantly with age (t-test), intervention group status (χ2 test), gender (χ2 test) or self-reported health (χ2 test). This was treated as missing at random for imputation purposes (MAR).26 Question 14 had a missing rate of 15.9% but also yielded a high level of not applicable responses, at 46.0% of returned questionnaires. This was expected, as many people will not have had a new appointment with their GP following the hospital outpatient evaluation. It is also reasonable to assume that the high amount of missing was related to the same concept. The response ‘not applicable’ did not significantly vary with age (t-test p=0.868), intervention group status (χ2 test p=0.064) or self-reported health (χ2 test p=0.459).

A histogram of responses to questions with five categories showed all response sets to be skewed to the left. However, earlier work has indicated that multiple imputation can perform well, even when the categorical variable is non-normally distributed, as long as MAR does not exceed 10%.27 In a 2010 article, Finch26 argues that multiple imputation performs well for imputation of missing categorical questionnaire data. There was no association between levels of missing data and the multilevel structure of the data.

Baseline characteristics

Baseline characteristics are presented in table 2. There was no major difference between the intervention and control group with regard to gender, age, urban or rural residency or questionnaire response. The effect of the referral intervention on referral quality has previously been shown to be clinically significant with an effect of 18% (95% CI 11 to 25, p<0.001).12 However, this was for the full data set of 500 patients. To ensure that this was also representative of the subpopulation who answered the questionnaire, the multilevel regression model was employed using data from only the 410 patients who answered it. This showed an intervention effect of 19% (95% CI 12 to 27, p<0.001) on referral quality, well within the 95% CI of the full analysis.

Table 2

Selected patient baseline characteristics by intervention status

Questionnaire results

Overall satisfaction with services was high and as presented in table 3, there was little difference between the intervention and control group for the individual questions. Using the Mann-Whitney U test, χ2 test and Fisher exact test, only two questions had significant p values (Q14 and Q17); however, in these questions, the absolute difference in numbers was very small. All response sets were skewed to the left, that is, towards more positive responses.

Table 3

Questionnaire results

Interestingly, the highest numbers of scores indicating dissatisfaction were for questions 4 and 6, for the intervention and control group patients. These questions concern patient interaction and information on the treatment process.

The Cronbach α for questions 1–15 was 0.83 and for questions 1–10 0.88.

Assessment of clustering effect

In the regression model, no significant difference was seen in the GSPEQ score between the intervention and control group with the regression coefficient 0.55 (95% CI −0.37 to 1.47, p=0.24) when taking clustering into account and adjusted for confounding variables 0.57 (95% CI −0.31 to 1.46, p=0.20). No significant interaction was found, and the result was not confounded by GP specialist status, GP gender, specialist status of hospital doctor or seriousness of final diagnosis. The Monte Carlo error estimates were within the limits recommended.24 Initial multilevel analysis of the data revealed virtually no variance of the intercepts. The ICC was estimated at 1.21e−11. Hence, very little of the variation in the data was related to the clustered design.


In the presentation of the data from each question in table 3, it is quite clear that, for the most part, patients in this project report positive experiences, with no differences between the intervention and control group. It hence seems that although the intervention has increased the referral quality significantly,12 this has not translated into a more positive patient experience with the referral process and treatment, as measured by self-report questionnaires. In the current study, indepth data analysis with imputation and multilevel regression modelling was employed to further explore the effect of clustering. No clear effect of clustering was found.

A strength of the current study is the fairly high response rate (82.0%) compared with other mail response studies.28 However, the potential for non-response bias is always present. Others have previously shown the effect of this to be small.29 ,30 Earlier Norwegian studies have suggested only minor differences between answers provided by responders and non-responders, when the latter have been obtained through telephone follow-up interviews.31–33 A clear limitation is the use of short-form questionnaires with single items, which may be less valid than longer forms.34 However, shorter forms will increase the response rate.4 ,35 The current project aimed to assess the effect of a health system intervention and the patient experiences with care after this intervention. We hence decided to keep the questionnaire short to enable a high response rate and keep the patient and staff workload manageable.

The current project used a newly developed questionnaire to assess patient experience by combining previously validated questions. The general nature of the final questionnaire may be seen as a weakness, as small changes in the patient experience induced by the intervention may have been missed. Further piloting might have revealed more clearly if the questionnaire did indeed assess the patient experience with the referral and care process in an adequate way. However, in this clinically oriented project, the authors hoped that a more general questionnaire would highlight whether the intervention would cause a more overall positive, or even a negative, change. It is probable that for each individual patient, it is the experience with the entire process that matters, as opposed to the experience of a subpart of the process. If large-scale implementation of referral guidance is contemplated, a more specific questionnaire may need to be validated.

An additional weakness was the lack of a sound analytical plan proposed in the methods paper.11 To ensure transparency, the analysis presented in this paper is therefore simple and based on single-question assessment. Given the clustered nature of the study, an assessment of clustering is given for a subsection of the questionnaire, but very little effect was seen.

Comparison with other studies was difficult as no clearly comparable analysis was found, except for the two health interaction questions. In the current study, 1.7% in the intervention group and 3.5% in the control group felt the hospital specialist lacked information from the GP. About 4.2% in the intervention and 13.1% in the control group felt the GP lacked information from the specialist. In the Norwegian part of the 2010 Commonwealth Fund Survey, the same questions gave much higher negative ratings, with 12.1% indicating that the specialist lacked information from the GP and 38.3% indicating that the GP lacked information from the hospital.17 Data from the 2013 Commonwealth Fund Survey suggest similar ratings as in 2010, although the wording of the questions is slightly different.36 A Norwegian report concerning patient experience as inpatients also suggests higher dissatisfaction with co-operation between the hospital and the GP37 than in the current study. In total, this clearly suggests that the patient experience of the GP/specialist communication is better in a small district hospital than the country average suggests. It is therefore possible that the effect of the intervention on patient experience could have been higher if the level of dissatisfaction with the healthcare cooperation had been higher in the local population. However, this also may suggest that although the hospital consultants often feel information is lacking in the referrals,38 ,39 this is not necessarily experienced as a problem by patients.

In the current study, two questions were answered more negatively than others. These questions therefore probably provide the most interesting points for further quality improvement at the local facility. These two questions represent areas where communication is the main concept, namely patient involvement in the treatment process and information from doctors to patients. Others have previously shown communication and information errors as a cause for dissatisfaction,40 and in other jurisdictions even malpractice claims.41 ,42


In this project, patient satisfaction, as measured by patient experience questionnaires, was generally high, with no major differences between the intervention and control group. No clear effect of the implementation of referral templates on patient satisfaction was evident.

Interestingly, the most negative feedback, from the intervention and control group, was concerning patient interaction, involvement and information. Effective communication and involving patients in decision-making may help to increase patient satisfaction to an even higher level.


View Abstract


  • Contributors The administration and daily running of the study was performed by HW, who was also the grant holder. ARB and HW developed the questionnaire. All authors participated in the analysis and interpretation of the data. All authors revised drafts of the manuscript and approved the final version.

  • Funding This work was funded by a research grant from the Northern Norway Regional Health Authority (Helse Nord RHF) with grant number HST1026-11.

  • Competing interests None declared.

  • Ethics approval The project has been approved by the Data Protection Officer for Research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement No additional data are available.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.