Article Text

Download PDFPDF

Original research
Characterising and predicting persistent high-cost utilisers in healthcare: a retrospective cohort study in Singapore
  1. Sheryl Hui Xian Ng1,2,
  2. Nabilah Rahman1,2,
  3. Ian Yi Han Ang1,2,
  4. Srinath Sridharan1,
  5. Sravan Ramachandran1,
  6. Debby Dan Wang1,
  7. Astrid Khoo2,
  8. Chuen Seng Tan1,
  9. Mengling Feng1,
  10. Sue-Anne Ee Shiow Toh2,3,4,
  11. Xin Quan Tan1,2
  1. 1 Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
  2. 2 Regional Health System Office, National University Health System, Singapore, Singapore
  3. 3 Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
  4. 4 Singapore Population Health Improvement Centre (SPHERiC), National University Health System, Singapore, Singapore
  1. Correspondence to Dr Xin Quan Tan; kyletanxq{at}gmail.com

Abstract

Objective We aim to characterise persistent high utilisers (PHUs) of healthcare services, and correspondingly, transient high utilisers (THUs) and non-high utilisers (non-HUs) for comparison, to facilitate stratifying HUs for targeted intervention. Subsequently we apply machine learning algorithms to predict which HUs will persist as PHUs, to inform future trials testing the effectiveness of interventions in reducing healthcare utilisation in PHUs.

Design and setting This is a retrospective cohort study using administrative data from an Academic Medical Centre (AMC) in Singapore.

Participants Patients who had at least one inpatient admission to the AMC between 2005 and 2013 were included in this study. HUs incurred Singapore Dollar 8150 or more within a year. PHUs were defined as HUs for three consecutive years, while THUs were HUs for 1 or 2 years. Non-HUs did not incur high healthcare costs at any point during the study period.

Outcome measures PHU status at the end of the third year was the outcome of interest. Socio-demographic profiles, clinical complexity and utilisation metrics of each group were reported. Area under curve (AUC) was used to identify the best model to predict persistence.

Results PHUs were older and had higher comorbidity and mortality. Over the three observed years, PHUs’ expenditure generally increased, while THUs and non-HUs’ spending and inpatient utilisation decreased. The predictive model exhibited good performance during both internal (AUC: 83.2%, 95% CI: 82.2% to 84.2%) and external validation (AUC: 79.8%, 95% CI: 78.8% to 80.8%).

Conclusions The HU population could be stratified into PHUs and THUs, with distinctly different utilisation trajectories. We developed a model that could predict at the end of 1 year, whether a patient in our population will continue to be a HU in the next 2 years. This knowledge would allow healthcare providers to target PHUs in our health system with interventions in a cost-effective manner.

  • healthcare costs
  • high utiliser
  • persistence
  • machine learning
http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • The study utilised a large longitudinal database of 9 years, allowing the development of a stable definition of PHUs.

  • The prediction models developed contained features commonly available in most administrative databases, increasing applicability in other health systems.

  • Internal validation using a wide range of hyperparameters, as well as an additional round of external validation, ensured the robustness of the models selected.

  • The PHU definition did not include patients with only high outpatient utilisation and no inpatient utilisation, limiting generalisability of our findings to HUs with at least one inpatient admission.

  • External validation of the final prediction model was carried out on an external data set which did not exclude patients who died and did not have polypharmacy score as a feature, resulting in findings which must be interpreted with caution.

Background

The disproportionate concentration of the healthcare utilisation and expenditure in a small group of users is a common phenomenon observed worldwide.1–3 In the USA, the Agency for Healthcare Research and Quality reported that 1% of users in 2008 accounted for nearly 20% of all healthcare expenditure, and that 5% of users accounted for nearly 50% of expenditure.4 In Ontario, Canada, patients in the top 5% of expenditure accounted for 61% of combined hospital and nursing home expenditure and almost all of mental health and rehabilitation costs.5 6 The definition of high utilisers (HU) ranges from the top 1% to 20% in healthcare expenditure of each cohort. These HUs have been characterised as a population with high levels of chronic disease and prevalence of mental illness,1 and have been the traditional target for interventions designed to reduce healthcare utilisation.7 Unfortunately, heterogeneity in the definition of HUs and intervention design, as well as a lack of high-quality evidence in scientific literature, renders evaluation of effectiveness of these interventions difficult.7–10 Available robust evidence on these interventions is mixed, with some studies suggesting the need for identifying and targeting of specific patient groups for an intervention to be effective.7 9 11 High utilisation in the bulk of HUs tends to taper off over time and naturally regress to lower utilisation, and could also potentially explain why interventions have not been successful in reducing cost or utilisation when compared with controls.12 13 A subgroup of patients, who do not exhibit such a regression-to-the-mean effect and experience prolonged high utilisation, are termed persistent HUs (PHUs), and have been defined as individuals who achieve HU status for more than two consecutive years.2 14 Various studies suggest only 21%–40% of HUs are PHUs who remain in the top 5%–10% of expenditure 2 years later.2 15–19 Identifying this subgroup of patients with persistent high cost for active targeted interventions could achieve better results in long-term reduction of healthcare utilisation and cost to the healthcare system.

The public healthcare system in Singapore serves around 80% of acute inpatient needs, and public hospitals receive government subvention to provide subsidised care to the resident population.20 Patients may receive government subsidies, insurance coverage or funding from safety net programme before they pay out-of-pocket for acute care.20 21 PHUs would hence impose a resource burden for both the government and the health system. Moreover, due to constraints on capacity from an ageing population, the government has put in place measures to prepare for the population’s long-term care needs and move care to the community setting.22 Hence, developing interventions for PHUs would reduce avoidable admissions and shift suitable patients to be cared for in the community setting, optimising the use of finite resources and improving quality of care for patients.

Given the prolonged high utilisation by PHUs, there is opportunity for early identification of these patients for intervention to shorten the period of high utilisation. However, it is only meaningful to identify these patients early on before they have truly reached PHU status, and so predictive models would best serve this purpose. To our knowledge, only two studies have developed models for prediction of PHU status within a HU population which incorporate an assessment of model performance.19 23 Models from both studies achieved an area under curve (AUC) score exceeding 0.90, denoting outstanding predictive abilities.24 With the ubiquitous adoption of electronic health records and data sets of increasing size and complexity in recent years,25–27 machine learning algorithms have been utilised widely due to their ability to learn and identify patterns in the data in absence of parametric assumptions.28 While some studies have attempted to predict HUs using machine learning,29–32 none of these have been designed to predict persistence. Building a prediction model for PHUs using machine learning methods would not only aid in the prospective identification of PHUs but also contribute to the limited literature on predictive modelling for persistence. Furthermore, most studies on HUs and persistence have been based in the USA1; our study originating from a developed multi-ethnic Asian country will add to the literature in this field.

With the objectives of stratifying HUs for targeted interventions and contributing to the gap in literature on prediction of PHUs, our study aims to characterise PHUs and subsequently predict PHU status within a HU population using machine learning algorithms. We will divide the study population into three groups: (i) PHUs, (ii) transient high utilisers (THUs), (iii) non-HUs, describe these groups and compare their characteristics to better understand the profiles associated with persistence. Subsequently, we will use machine learning techniques to develop models that distinguish PHUs from a pool of HUs using data from the first year observed. We will then perform both internal and external validation of the model to evaluate the performance of the models and demonstrate the feasibility of predicting PHUs for potential targeted interventions. The findings from this study will add invaluable insight into the study of persistence in high healthcare utilisation, and inform future trials to test effectiveness of interventions in reducing healthcare utilisation in a persistently high utilisation group.

Methods

Description of data source

We examined data from 2005 to 2013 comprising demographics, utilisation, billing, medication and diagnosis data for both inpatient and outpatient care in National University Hospital, a 1000-bed Academic Medical Centre (AMC) in Singapore for this retrospective study. Details on the data set and its preparation have been reported separately.33 The study cohort consisted of 549 109 patients aged 21 years and above who had at least one record of either an inpatient admission, specialist consultation, therapy visit or emergency attendance between 1 January 2005 and 31 December 2013.

Definitions of PHUs and THUs

Due to lack of access to standardised cost data, we used expenditures as an estimate, a common approach adopted in HU studies.1 In our study, we defined expenditure as the total price of a service charged to a patient before any government subsidies or third-party payments (eg, payments by insurers or employers), adjusted to 2015 prices. While these billed charges may not be reflective of the true cost of service delivery, in absence of actual cost data, these charges serve as a good proxy for cost as they are often correlated with the actual cost, and systematically higher by a specified cost-to-charge ratio.34 35 We computed the expenditure for each patient over all inpatient and outpatient settings in the hospital for each observation year and classified patients as HUs if they incurred expenditure in the top 10%, or at least SG$8150 within the year. Details on the derivation of the HU cut-off are provided in online supplementary file 1. With our key objective of identifying a group of patients with extended high utilisation that could potentially be targeted for intervention, we defined PHUs in our study as those who met the HU criteria for three consecutive years, and THUs as patients who met the HU criteria for at least 1 year, but less than three consecutive years. Those who did not meet the criterion during the study period were classified as non-HUs.

Supplemental material

To divide the population into groups, we first determined if a patient met the HU criteria at any point in time, by examining each inpatient episode of the patient. If the patient had accumulated healthcare expenditure of SG$8150 in the 1 year prior to and inclusive of the discharge date, the patient would be classified as a HU (figure 1). Only the first discharge date that fulfilled the criterion was considered. That discharge date would then be fixed as the end of Year 1 (Y1) for that patient; Year 2 (Y2) would be the next year from that date, and Year 3 (Y3) the following year. HUs who had expenditures of at least SG$8150 in Y2 and Y3 were classified as PHUs, while the remaining HUs were classified as THUs. For non-HUs, the discharge date of the index admission was defined as the end of Y1, and Y2 and Y3 were correspondingly constructed. In identifying HUs, Y1 was defined as the first 365-day period fulfilling the HU criterion as opposed to looking at expenditure in a calendar year. A calendar year was not used since patients who made contact with the hospital in the later part of the calendar year would be less likely to fulfil the HU criteria as they would have had less time to accumulate expenditure, thus introducing bias.

Figure 1

Illustration of persistent high utiliser (PHU)/transient high utiliser (THU) cohort generation using three hypothetical patients.

In defining PHUs and THUs, we excluded patients who had no inpatient admissions throughout the study period, as well as patients who recorded their first contact in 2012 and 2013 due to non-availability of their respective Y2 and/or Y3 data. We also excluded patients who died in Y1, as these patients would not have been a candidate at the end of Y1 for prediction of Y2 and Y3 HU status. However, we included patients that died in Y2 and Y3, given that we are trying to predict HU status using only Y1 data, and excluding those that died in Y2 and Y3 would introduce bias by leaving only healthier individuals in the pool. We included non-HUs as a separate cohort for descriptive comparison as well.

Comparison of PHU, THU and non-HU groups

We first described the profiles of the three groups defined in this study. Patient demographics at first visit as well as clinical complexity during Y1 were reported. Clinical complexity was measured by the Charlson Comorbidity Index (CCMI) and polypharmacy score (PPS). PPS measured the maximum number of unique dispensed medications a patient was prescribed in a visit, and demonstrated convergent and criterion validity when compared with CCMI and measures of healthcare utilisation.33 In addition, we reported their expenditure, utilisation (inpatient, length of stay (LOS) and outpatient visits), as well as common diagnoses and subspecialties visited in Y1. To compare between PHU and THUs, as well as between PHU and non-HUs, we used χ2 and two-sided Wilcoxon rank-sum tests, respectively, to compare categorical and continuous characteristics.36 Within each set of comparisons with a control group, we adjusted the p values using Bonferroni’s correction,37 and assessed statistical significance using a threshold of 0.05. Thereafter, we examined the utilisation patterns of the PHU, THU and non-HU groups over a 3-year period.

Feature extraction for prediction

To construct our predictive models to identify PHUs from a HU population, we first considered the same patient-level information from the descriptive analyses as features, or predictors, in the models, with the exception of mortality. In addition, we included bill components in the model. For each visit billed to the patient, we identified the different bill components for that particular visit. The bill components included expenditure on prescription drugs, laboratory and radiology services, surgical procedures, specialised investigations (non-routine diagnostics), doctor consultations, therapy (physiotherapy, occupational therapy or speech therapy), miscellaneous procedures as well as consumables, implants and appliances. We subsequently aggregated the visit-level expenditure information into a total amount spent per patient for Y1, and calculated the proportion of the Y1 bill that each component accounted for. We reviewed all features and subsequently dropped features which had missing categorical values (eg, housing type), exhibited collinearity or had zero variance. A description of features and hyperparameters used in our model is tabulated in online supplementary file 1.

Predictive model building and validation

Given the size and complexity of our data set, we opted to use machine learning algorithms for the prediction of PHUs as they have been shown to be adept at handling large volumes of high-dimensional data in healthcare settings to enable high-quality predictions.38–40 We tested three algorithms: penalised regression, support vector machine and extreme gradient boosting (XGBoost), and evaluated their performance using standard metrics of sensitivity and specificity, as well as AUC with a two-sided DeLong’s test.41 42 For internal validation of our model, we performed repeated cross-validation with model construction on 70% of data and evaluation of performance on the remaining 30%. As the data set was imbalanced with a small number of PHUs relative to the THUs, the THUs were under-sampled within each training iteration of cross-validation.43 44

In addition, to increase applicability of the model as a predictive tool to be used as point of discharge, we identified a final model that would achieve similar predictive performance using only a subset of features. We tested sub-models containing different number of features with the highest variable importance against the full model using a two-sided DeLong’s test. This was to identify a final sub-model with the lowest number of features that had an AUC not significantly different from that of the full model.42 We reported the top features ranked by relative importance, or the improvement in accuracy the feature brought to its branch relative to the highest ranked, which would range from 0 to 100.

We then performed external validation for the final model. An independent data set of adult patients from the same administrative database for 2013 to 2016 was extracted. The HU classification process detailed in figure 1 was applied to this validation set and the final model was fitted to this data set to assess the performance of the algorithm when applied to new data. Due to lack of information on the exact medications patients were billed for in the external validation set, we performed both internal and external validation of the final model without PPS as a feature. The model without PPS as a feature was derived using the same procedure as the original final model. The same model performance metrics were reported. We used RStudio and the package caret for the computation described earlier.45 46

Patient and public involvement

The study was designed and conducted without patient or public involvement.

Results

Comparison of characteristics of PHUs, THUs and non-HUs

A total of 152 497 patients were included in the study (figure 2). There were 5094 PHUs, 62 159 THUs and 85 244 non-HUs identified (table 1). PHUs were statistically different from both THUs and non-HUs in all demographics, disease complexity and utilisation compared. Overall median expenditure across all settings was highest in the PHUs, followed by the THUs and non-HUs (PHUs: $15,015; THUs: $13,727; non-HUs: $2,942). However, median inpatient expenditure was highest in the THUs (PHU: $10,743; THU: $12,226; non-HU: $2,395). Notably, PHUs and THUs had median ages 19 and 15 years greater than the non-HUs, respectively, at the point of first contact during the study period. Median values of CCMI were highest for the PHUs (PHUs: 2; THUs: 1; non-HUs: 0), while the median PPS values were similar for PHUs and THUs. The percentage of patients that died during the study period was also highest in the PHU group (43.5%) compared with the THU group (19.7%) and the non-HU group (5.8%).

Table 1

Demographics, disease complexity and utilisation of PHUs, THUs and non-HUs

Figure 2

Inclusion and exclusion criteria for persistent high utiliser (PHU)/transient high utiliser (THU) cohort.

As seen in online supplementary file 2, diabetes, hypertension and lipid disorders were found to be prevalent conditions across all three groups. However, the highest prevalence rates were consistently seen in the PHUs and lowest in the non-HUs. In addition, visits to the cardiac subspecialty were also common to the groups. We also observed a segment of patients unique to the PHUs who were likely to be undergoing treatment for kidney diseases, from the high prevalence of chronic kidney disease (25.4%) and visits to the nephrology subspecialty (24.2%).

Supplemental material

Examining trends in total expenditure over the first 3 years, the PHUs’ expenditure generally increased, whereas the THUs and non-HUs’ spending decreased drastically from Y2 onwards (figure 3). Inpatient expenditure and visits also followed such a similar pattern for all groups. Importantly, we observed that the drop off in inpatient utilisation for THUs in Y2 and Y3 was very significant; the median inpatient utilisation across bill, LOS and visits was all zero in Y2 and Y3 (online supplementary file 3). In contrast, outpatient utilisation peaked in Y2 across all groups, with PHUs on average incurring above 20 visits in Y2 and Y3 while THUs and non-HUs had few or zero visits for both years.

Supplemental material

Figure 3

Yearly utilisation for three observation years, with yearly medians labelled. Non-HU, non-high utiliser; PHU, persistent high utiliser; THU, transient high utiliser.

Comparison of performance of candidate predictive models

Table 2 summarises the performance of algorithms used in this study for the prediction of PHUs from a pool of 67 253 patients at the end of Y1. Of the three algorithms, XGBoost performed the best in terms of AUC (83.4%, 95% CI: 82.4% to 84.4%) and sensitivity (78.7%, 95% CI: 76.5% to 80.7%). The AUC for the XGBoost model was also statistically different from the other two models compared (p<0.001). This full model was based on 514 features (online supplementary file 4), and we identified a final model with reduced number of features that exhibited similar model performance (p=0.28), which used only 51 features with the highest variable importance from the full model (figure 4). The top features for the final model were related to comorbidity, outpatient expenditure and prescription expenditure, followed by age, laboratory and radiology services, specialised investigations, therapy, doctor consultations, surgical procedures as well as consumables, appliances and implants. A description of the HUs in the study population and the external validation cohort are provided in online supplementary file 5. The AUC for the validated final model without PPS during internal validation was 83.1% (95% CI: 82.1% to 84.1%), while the AUC during external validation was slightly lower at 79.8% (95% CI: 78.8% to 80.8%) (online supplementary file 6).

Supplemental material

Supplemental material

Supplemental material

Table 2

Summary of the performance metrics of predictions made at the end of Year 1

Figure 4

Variable importance for 51 features in final model.

Discussion

Principal findings

In our study, we defined three groups of patients: PHUs who were HU for 3 years consecutively, THUs who were only HU for 1 or 2 years and non-HUs who did not qualify as an HU at any time. We described the demographics, utilisation and disease profiles of the groups and found that PHUs were distinctly different from THUs and non-HUs. Subsequently, we sought to predict which HUs would go on to be PHUs at the end of 1 year. We applied different machine learning algorithms and developed a model of excellent predictive performance. We then validated a final model with fewer features but of comparable performance for potential implementation as a tool to be used at point of discharge. The most important features in the final model were related to comorbidity, outpatient expenditure and prescription expenditure.

We identified 7% of our HU population to be PHUs, which is markedly lower than findings from other published studies on 3-year persistence.2 19 23 This could be explained by the fact that we did not capture expenditure outside the hospital setting unlike these other studies. Hence, our PHU definition may not be directly comparable to studies that incorporate home care and nursing care costs in their computation of healthcare expenditure to determine persistence.6 17 Comparing the disease profiles of the three groups of patients, the highest prevalence of lipid disorders, diabetes, hypertension and cardiac conditions was observed in the PHUs. Coupled with the older age and higher CCMI, PHUs exhibited a more medically complex profile compared with THUs and non-HUs. Of note, PHUs have been found to be older than THUs in some studies19 23 47 and younger in others,16 17 47 48 and this lack of uniform association of age with persistence may be attributable to the heterogeneity in populations studied. In contrast, the higher comorbidity burden in PHUs is consistent with most studies on persistence.14 17 19 23 We also found that the percentage of PHUs that died in our cohort was high (43.5%), especially when compared with the THU group (19.7%) and the non-HU group (5.8%). This is not unexpected given the PHUs had higher comorbidity.

In addition, the PHUs had greatly different expenditure and utilisation trajectories from the THUs. We observed that THUs’ overall utilisation and expenditure fell significantly in Y2 and Y3 from Y1, illustrating the phenomenon of regression to the mean. This trend could have been due to an acute medical need in the THUs which tapered off after the initial period of high utilisation, and has been noted in other HU populations.13 16 19 49 50 In contrast, PHUs’ utilisation and expenditure continued to increase, motivating the need to target PHUs specifically for intervention instead of HUs in general.

Strengths

Our study utilised a large longitudinal database over a 9-year period, which allowed us to study the persistence of high utilisation in a large population over a sufficiently long period. We were thus able to develop a stable definition for PHUs and a predictive model that performed well in identification of PHUs. Furthermore, the approach we took in preparing and selecting features and developing the predictive model allows for easy deployment in a real-world setting. The model can be applied at the point of discharge by drawing on data retrospectively, and contains features already commonly available in most administrative databases. In addition, we performed internal validation using repeated cross-validation and tested three models with a wide range of values for each parameter to ensure the robustness of the model selected. We also validated our final model on an external data set with model performance similar to results from internal validation, illustrating the generalisability and applicability of our predictive tool in other patient cohorts.

Limitations

There are some limitations arising from our methodology. We defined PHUs using the index admission as a frame of reference for accumulation of resource use, and so this definition cannot be applied to patients who incurred sizeable outpatient utilisation without any inpatient admissions. Our examination of persistent high users would hence only be generalisable to patients with conditions and care needs relevant to the inpatient setting. As the observation year was anchored at the first inpatient admission which met the HU criteria, it also did not include instances where patients met the PHU criteria more than once during the study period. However, in the real-world setting, this is less of an issue as patients who have met the criteria once would already be identified as a high-risk patient by the healthcare system that they seek care from. Furthermore, as the 12% of patients who died in Y2 were included in the analysis, potential bias may arise. While these patients may have exhibited Y1 characteristics sufficient to be predicted as PHUs, they did not survive long enough to fulfil the PHU definition. As a result, the predictive tool we have developed would have a higher false positive rate, and the model AUC reported would be a conservative estimate, which would be caveats to keep in mind in application of the tool.

In addition, the model was designed and validated on a single AMC. Previous research in Singapore revealed low cross-utilisation across multiple hospitals by a general patient population,51 suggesting low selection and misclassification bias in our findings due to the use of data from only one hospital. Furthermore, prevalence rates of PHUs in studies outside Singapore incorporating data from multiple hospitals are similar to our findings at 1%–4%.15–17 19 This would suggest that cross-utilisation rates of THUs and non-HUs across more than one hospital would likely be low, and correspondingly bias arising from this limitation would be modest. Generalisation of the model to other healthcare systems would be dependent on data and feature availability in those settings. However, the most important features in our models are common billing components and clinical specialties routinely recorded within administrative databases, or easily generated from International Classification of Diseases diagnosis codes, increasing the ease of applying our model in other health systems.

Also, we were unable to examine in detail the differences in socioeconomic characteristics between the groups as only housing type was available as a proxy. As persistent high utilisation has been noted to be concentrated in those of lower socioeconomic status or minority races,4 16 17 47 information such as household income, employment status or receipt of financial support would greatly increase the robustness of our comparisons.

Patients who died in Y1 in the external validation set were not removed due to lack of integration of death data, and hence the performance of the external validation must be interpreted with caution. As detailed medication was also not integrated into the external validation data set, we were unable to validate our final model with PPS as a feature. However, since the validation of the modified final model without PPS was good with an AUC of 80%, it appeared that the models developed perform reasonably well as long as most important features were present.

Implications for clinicians and policymakers

Our findings suggest that our PHU population is medically complex and tend to have recurrent high utilisation across multiple settings, and interventions should be tailored to this profile. Integrated care approaches involving case management, care coordination and disease management could potentially reduce healthcare use by this group of high risk patients.7 In addition, given the high mortality rate, integrating palliative care into interventions targeted at PHUs could also be a suitable approach to manage cost and improve quality of life in this group.52 The distinctly different utilisation trajectories between PHUs and THUs reaffirm the necessity of stratifying the HU population to identify the correct subgroup of patients, to effectively deliver interventions for these high need patients.53

Our findings also suggest that there are ways to optimise the way we select patients for intervention. At present, most intervention programmes that target high utilisation patients enrol patients based on a one-off assessment of whether they incurred utilisation above a specified threshold. Such enrolment criteria would not be able to distinguish PHUs and THUs. An evaluation of a post-discharge intervention programme for HUs where enrolment was based on utilisation in Y1, and intervention was delivered in Y2, did not show an improvement of Y2 healthcare utilisation over matched controls.54 Both the HUs and control patients in this study showed similar decrease in their high utilisation over time, illustrating the phenomenon of regression to the mean. This finding supports our observation on the need for refinement of enrolment criteria to better identify and target subgroups with increased risk of prolonged utilisation (ie, PHUs). The predictive tool we have developed in this paper will facilitate the targeting of PHUs from a pool of HUs that health systems already routinely identify and intervene on. A more targeted approach would allow healthcare systems to focus such resource intensive interventions on the group that is most likely to require and benefit from these interventions.

Unanswered questions and future research

Prediction of PHUs to identify and target them early for intervention is thus crucial. In our study, we were able to show that our models can predict prospectively which patients in our health system are likely to be PHUs with protracted high levels of healthcare utilisation. While we hypothesise that these PHUs would be better targets for healthcare utilisation reducing interventions as compared with HUs as a larger group, follow-up studies would need to be conducted to validate this. Given the scope of the study, we have not tested if the PHUs respond differently to various intervention programmes aimed at reducing utilisation. Future work will involve collaborations with groups beyond the AMC in this study that are responsible for the delivery of interventions to high utilisation groups. These collaborations would test the utility of our approach in identifying PHUs and providing these patients with different targeted interventions. Separately, testing the predictive tool in different populations and with varying HU thresholds would validate our methodology in different contexts. These further studies would test both the ability of the model to identify suitable targets in a real-world setting, and the impact of interventions targeting this group of PHUs as opposed to THUs and controls.

Conclusion

Persistent high-cost utilisers are a high-risk subgroup of patients with complex clinical needs and high expenditure and utilisation. While interventions could potentially accrue the largest benefits if this subgroup is targeted effectively, few to date have focused on identifying and intervening on this group. In our study, we constructed a definition for the identification of PHUs from a HU population, profiled the PHUs and compared them with the THUs and non-HUs. We demonstrated that the HU population could be stratified to reveal a subgroup of patients with recurrent high utilisation over time, which was distinct from the majority of patients who had transient high utilisation. Subsequently, we developed a machine learning model with the ability to predict at the end of 1 year, whether a patient in our population will continue to be a HU in the next 2 years, which is a novel approach to the prediction of persistence in healthcare utilisation. This knowledge would allow healthcare providers to more effectively target patients who have persistent utilisation, with existing or new interventions to make a greater impact in a cost-effective manner.

Acknowledgments

The authors would like to thank Professor Teo Yik Ying (Saw Swee Hock School of Public Health (SSHSPH), National University of Singapore (NUS)) for his inputs and guidance, and Mark Salloway (SSHSPH, NUS) for management and provision of access to the database used in this study.

References

Footnotes

  • Correction notice This article has been corrected since it was published. The ORCID id has been updated for Nabilah Rahman.

  • Contributors The study was conceptualised by SS, SR, SAEST and XQT. Funding was acquired by XQT and AK. The database was set up by SR, SS, DDW and NR. Statistical analysis and model building were carried out by SHXN and NR, under supervision of CST and MF. Validation data were acquired by IYHA and SAEST, and external validation was conducted by NR. The initial draft was written by SS, SR and SHXN, and refined with input from XQT, IYHA, AK, NR and CST.

  • Funding The study was cofunded by the National University Health System and National University of Singapore (NUS), with support from the Singapore Population Health Improvement Centre (SPHERiC) funded through Singapore Ministry of Health’s National Medical Research Council Centre Grant Program (grant number NMRC/CG/C026/2017_NUHS).

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Ethics approval This study was approved by the Domain Specific Review Board (DSRB), National Health Group, Singapore (2016/01011) and the data were approved as a DSRB standing database: NUS-SSHSPH/2015–00032. Waiver of informed consent was granted by the review board.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement The data analysed in the study are managed by the Research Office of Saw Swee Hock School of Public Health, National University of Singapore but access is restricted and not for public. The codes for the descriptive analyses and prediction models may be requested from SHXN through personal communication.