Article Text

Original research
COVID-19 in patients with hepatobiliary and pancreatic diseases: a single-centre cross-sectional study in East London
  1. Abu Z M Dayem Ullah1,2,
  2. Lavanya Sivapalan1,
  3. Hemant M Kocher2,3,
  4. Claude Chelala1
  1. 1Centre for Cancer Biomarker and Biotherapeutics, Barts Cancer Institute, London, UK
  2. 2Barts and the London HPB Centre, The Royal London Hospital, Barts Health NHS Trust, London, UK
  3. 3Centre for Tumour Biology, Barts Cancer Institute, London, UK
  1. Correspondence to Dr Abu Z M Dayem Ullah; d.ullah{at}


Objective To explore risk factors associated with COVID-19 susceptibility and survival in patients with pre-existing hepato–pancreato–biliary (HPB) conditions.

Design Cross-sectional study.

Setting East London Pancreatic Cancer Epidemiology (EL-PaC-Epidem) Study at Barts Health National Health Service Trust, UK. Linked electronic health records were interrogated on a cohort of participants (age ≥18 years), reported with HPB conditions between 1 April 2008 and 6 March 2020.

Participants EL-PaC-Epidem Study participants, alive on 12 February 2020, and living in East London within the previous 6 months (n=15 440). The cohort represents a multi-ethnic population with 51.7% belonging to the non-White background.

Main outcome measure COVID-19 incidence and mortality.

Results Some 226 (1.5%) participants had confirmed COVID-19 diagnosis between 12 February and 12 June 2020, with increased odds for men (OR 1.56; 95% CI 1.2 to 2.04) and Black ethnicity (2.04; 1.39 to 2.95) as well as patients with moderate to severe liver disease (2.2; 1.35 to 3.59). Each additional comorbidity increased the odds of infection by 62%. Substance misusers were at more risk of infection, so were patients on vitamin D treatment. The higher ORs in patients with chronic pancreatic or mild liver conditions, age >70, and a history of smoking or obesity were due to coexisting comorbidities. Increased odds of death were observed for men (3.54; 1.68 to 7.85) and Black ethnicity (3.77; 1.38 to 10.7). Patients having respiratory complications from COVID-19 without a history of chronic respiratory disease also had higher odds of death (5.77; 1.75 to 19).

Conclusions In this large population-based study of patients with HPB conditions, men, Black ethnicity, pre-existing moderate to severe liver conditions, six common medical multimorbidities, substance misuse and a history of vitamin D treatment independently posed higher odds of acquiring COVID-19 compared with their respective counterparts. The odds of death were significantly high for men and Black people.

  • hepatobiliary disease
  • pancreatic disease
  • COVID-19

Data availability statement

All statistical data relevant to the study are included in the article or uploaded as supplemental information. Only the corresponding author had full access to all the participants’ data in the study. The authors confirm that researchers seeking the completely anonymised final analysis dataset for this work can submit a data request to the corresponding author.

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See:

Statistics from

Strengths and limitations of this study

  • First multi-ethnic population-based study on COVID-19 in patients with hepato–pancreato–biliary group of diseases.

  • Systematic identification of the effect, or the lack of it, of individual demographic and clinical factors on the infection and mortality of COVID-19 in a large cohort of over 15 000 patients, robustly controlling for potential confounders in their evaluation.

  • Access to longitudinal data from linked primary and secondary care electronic health records, and use of rule-based phenotyping algorithms allowed for improved completeness and accuracy of the explored variables.

  • Some observed increased odds of SARS-CoV-2 infection and related death could be plausibly explained by unmeasured confounding.

  • The effects reported in the study could be influenced by the relatively smaller size of COVID-19 cases within this cohort.


COVID-19 is a novel infectious disease caused by SARS-CoV-2, with a wide-ranging disease course. Infection and mortality rates of the COVID-19 pandemic have varied widely among nations and demographics,1 while risks are still being explored, identified and categorised according to the severity.2 3 There are several confirmed risk factors of COVID-19 and severe outcomes, including old age,2 4 5 chronic pulmonary disease,2 4–6 cardiovascular disease,2 5 6 hypertension,5 chronic kidney disease,2 4 6 diabetes mellitus,2 5 obesity,2 6 7 haematological diseases,2 4 malignancy2 4–6 8 and immunocompromised state such as HIV infection.2 4 9 Medical complications following hospitalisation, including acute episodes of cardiovascular, respiratory, neurological, renal or hepatic failure, have also been linked to severe outcomes.10 There are also other risk factors reported, such as smoking11 12 or being from a Black, Asian and minority ethnic (BAME) group,13–15 the effects of which are either mixed or the reasoning is not clearly understood.4 Concerns have also been raised regarding the use of various medications with respect to the risk or protective effect to COVID-19.16–18

Patients with diseases of the liver, pancreas or biliary tract (hepato–pancreato–biliary; HPB) are considered, in general, to be at risk of developing serious medical conditions. Expression of the ACE2 gene—a receptor for the SARS-CoV-2 virus—along the gastrointestinal tract is well documented, which suggests the digestive system is a potential route for COVID-19,18 making patients with a diseased HPB system susceptible to this novel infection. The prevalence of COVID-19 among patients with hepatic conditions has been explored,6 15 19 indicating severe liver disease as a moderate risk factor for COVID-19.2 In contrast, very limited data are available on the prevalence of COVID-19 among patients with pancreatic or biliary conditions,20 although pancreatic manifestations of the disease are rare.21 22 It is important that clinical characteristics of COVID-19 are investigated for the HPB group as a whole, not only because these diseases demonstrate similar clinical–biological behaviours,23 but also since they are commonly seen by a single clinical unit with specialist expertise in the management of these diseases.

The UK has been the worst affected country in Europe by COVID-19, with a reported death toll of 44 819 as of 30 June 2020.24 At the same time, London had the highest incidence and mortality rates, with 33 775 confirmed cases and 8438 deaths.25 26 Barts Health National Health Service (NHS) Trust (BHNT) is the largest NHS Trust in England and acts as provider of district general hospital facilities for around 2.5 million population of East London as well as a range of tertiary care services.27 Between 1 March and 30 June, the three boroughs in East London—Tower Hamlets, Waltham Forest and Newham—had a combined age-standardised COVID-19-related mortality rate of 195 per 100 000 people. This was significantly higher than the rest of London where the age-standardised COVID-19-related mortality rate was 156 per 100 000 people.25 East London is also one of the most ethnically diverse local areas in the country where an estimated 57% residents belong to a BAME group.28 Significant health inequalities exist within the local population including higher rates of cancer, diabetes and obesity,29 compared with the wider population. These conditions are not only known to be a precursor or consequence to HPB diseases, but also linked to COVID-19 and severe outcomes. In this study, we integrated primary, secondary and tertiary electronic healthcare records (EHRs) of patients with HPB diseases in East London. We inspected the demographics, lifestyle, comorbidities and associated medication use of these patients, and any possible links with SARS-CoV-2 infection. We also evaluated whether the effect of these prevalent factors as well as clinical observations during COVID-19-related hospitalisation are associated with mortality. This study will inform the management of this specific cohort of patients.


Study setting and data sources

All data used for this study were collected and processed under the East London Pancreatic Cancer Epidemiology (EL-PaC-Epidem) Study at BHNT. In brief, EL-PaC-Epidem is an ongoing study that ascertains patients diagnosed or reported with HPB diseases including cancers, as well as control patients (eg, small intestine, hernia), within five BHNT hospital sites (The Royal London Hospital, Newham University Hospital, St Bartholomew’s Hospital, Whipps Cross University Hospital, Mile End Hospital) between 2008 and 2021. The study is limited to the secondary use of a specified subset of patients’ retrospective EHR generated during the course of normal care of these patients. It links EHRs from different data sources (via UK unique individual NHS numbers), including primary care through general practitioners (GPs) (Discovery East London Programme data service (DDS)) and secondary or tertiary care through hospitals (BHNT Consolidated Data Extract (CDE)). Patients who have previously informed their GPs or NHS to stop sharing their personal and health records for purposes other than their individual care were automatically excluded. The current EL-PaC-Epidem Study cohort consists of 27 321 adult patients (aged 18 years or over), diagnosed or reported with at least one of the HPB conditions (online supplemental table 1) between 1 April 2008 and 6 March 2020.

Study design and population

This is a single-centre cross-sectional study using the linked EHR data of patients with a history of HPB diseases. Within this specific patient group, the study focused on the incidence of COVID-19, and examined the association of SARS-CoV-2 infection with six common medical comorbidities (ie, diabetes, hypertension, high cholesterol, cardiovascular disease, chronic respiratory disease, renal disease), lifestyle factors (ie, smoking, alcohol use, substance misuse, obesity), and use of selected prescription medications.

As the first case of COVID-19 in London was reported on 12 February 2020, we used this as the start date for this study and extracted data on a subgroup of the EL-PaC-Epidem Study cohort until 12 June 2020 (figure 1). Eligible individuals were a resident in East London and alive on the study start date (EL-HPB). Residency of East London was inferred if a patient had at least one appointment or prescription issued from a GP in East London boroughs or had a scheduled or unscheduled visit to one of the BHNT hospitals within the last 6 months (after 12 August 2019). Patients with confirmed SARS-CoV-2 infection were identified by: (1) the presence of International Classification of Diseases 10th edition (ICD-10) or Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) codes for confirmed COVID-19 diagnosis assigned in their hospital encounters or GP records during the observation period between 12 February and 12 June 2020 (online supplemental table 2) OR (2) positive record of SARS-CoV-2 RNA through BHNT oral and/or nasal swabs test during the same period. For confirmed COVID-19 cases, the earliest date of diagnosis or positive swab test was considered as the index date, whereas 12 February 2020 was considered as index date for rest of the cohort. Patients who were assigned an ICD-10 or SNOMED CT diagnosis code for suspected COVID-19, but were neither reassigned to confirmed diagnosis nor positive RNA test, were excluded from the analysis.

Figure 1

Selection of patients for the cross-sectional study. EHR, electronic health record; EL-PaC-Epidem, East London Pancreatic Cancer Epidemiology; GP, general practitioner; HPB, hepato–pancreato–biliary.

We also examined the onset-to-death distribution within the patient group with a confirmed COVID-19 diagnosis (EL-HPB-COVID). Mortality data were collected on 12 October 2020. Following the latest Public Health England definition,30 the death of a patient within 28 days of the index date is considered as a COVID-19-related death. This is different from a 60-day window that was being used in the UK prior to 12 August 2020 to define COVID-19-related death. To ensure consistency, patients with COVID-19 who survived beyond 60 days of index date are considered as survivors in the study; nine patients who died between 29 and 60 days of diagnosis were excluded from the analysis. The onset-to-death distribution was analysed in the context of the same set of comorbidities, lifestyle factors and medication use, as well as cardiovascular, respiratory and renal complications during hospital care.


All patient data were obtained from retrospective EHR, harmonised across hospital and GP coding systems where applicable, and organised into 40 primary variables across seven categories corresponding to the focus of the study (table 1). BHNT CDE uses 2011 UK census grouping to record ethnicity, ICD-10 or SNOMED diagnosis codes for clinically relevant diagnoses, and Office of the Population Censuses and Surveys Classification of Interventions and Procedures V.4 (OPCS-4) procedural codes for treatments and procedures. Physiological observations (weight, body mass index (BMI), blood pressure) and laboratory test results are available in locally developed terms. Semistructured text entries such as discharge summaries, medical history and a lifestyle questionnaire collected during the preoperative assessment, and presenting symptoms from scheduled or unscheduled hospital visits are also available. All GP records via DDS were available in Read Codes V.2 or Clinical Terminology V.3 (CTV3) codes, except the prescribed medication records and COVID-19 diagnosis which were available in SNOMED codes. For each variable, we consulted ICD-10, SNOMED, Read, CTV3 or OPCS-4 dictionaries as appropriate to construct the mapping codelists. For some variables, codelists also included keywords to conduct automated substring search within semistructured text as well as local laboratory test and physiological observation terms.

Table 1

Variables and outcomes explored in this study

Rule-based phenotyping algorithms were developed for each categorical variable to characterise patients, integrating information from multiple sources where available to counteract bias. HPB diseases were grouped into four categories (online supplemental table 1): any malignant disease, and non-malignant diseases of liver, pancreas or biliary tract. Non-malignant liver diseases were further divided into mild and moderate to severe subgroups, extending the definition from CDMF Charlson Comorbidity Index,31 whereas non-malignant pancreas or biliary diseases were divided into acute and chronic disease subgroups (online supplemental table 1). Within each disease category, a patient was assigned to chronic (or more severe) subgroup, when data indicated the history of both acute (or mild) and chronic (more severe) conditions. A patient can either be assigned to a malignant disease category or any of the non-malignant disease subgroups. Ethnicity was grouped into four categories—White, South Asian, Black and Other. White and Black ethnic groups were defined based on the 2011 UK census classification; Indian, Pakistani and Bangladeshi origin from the Asian group represented South Asian, while the rest (ie, Mixed, Chinese, other Asian and other ethnic group) were represented in the Other group. The ethnic category recorded at the GP took precedence over hospital records.

Phenotyping algorithms defining the comorbidities were based on diagnosis codes (presence) or semistructured text search (presence or absence), with the additional inclusion of procedural codes (presence), some observation or laboratory test results (presence) and related medication use (at least three prescriptions). Patients were considered to have or have had a specific medical condition if they met at least one criterion indicating the presence of the condition before the index date, otherwise they were considered negative for the condition.

Phenotyping algorithms defining the lifestyle factors were based on the longitudinal entries (current, past or never) derived from diagnosis codes and free text search, with the additional inclusion of BMI observation for obesity. Obesity was defined as BMI of 30 kg/m2 or more. Patients assigned never status at any point but having a record of current or past status before that date were reassigned to past status. The most recent lifestyle record before or on the index date was then used to assign current, past or never status to the patients. Patients with no record of a specific lifestyle factor were classified as having missing data. Patients were assigned current, past or non-user status for medication use variables based on the number of GP prescriptions issued in the last 2 years for the medicines under specific medication groups. Patients with no record of prescription for particular medications were assigned non-user status. With at least three prescriptions issued, a patient was assigned current user status if the latest issue was within 3 months preceding the index date, and past user status otherwise. Patients with record of less than three prescriptions were classified as non-user. Patients with COVID-19 were considered to have a specific complication during admitted patient care if at least one of the hospital diagnosis codes from the complications codelist was recorded during the observation period after index date, otherwise they were considered negative for the complication. A patient was considered to have a recurrent complication if they had a history of that particular comorbidity, otherwise it was considered as a novel complication.

Selection of study variables, codelist construction, and phenotyping algorithm development were done in consultation with a panel of clinicians and scientists (HK, CC, LS). A comprehensive list of codelists and phenotyping algorithms for the study variables are available on the EL-PaC-Epidem portal (

Statistical analysis

We conducted descriptive analyses for the EL-HPB cohort as a whole, by a group for patients with confirmed SARS-CoV-2 infection and the rest (herein referred to as COVID-19 and non-COVID-19, respectively). Differences in demographic and clinical characteristics between the groups were assessed with Pearson’s χ2 test, Fisher’s exact test and Kruskal-Wallis rank-sum test, as appropriate. P values less than 0.05 were considered significant. Similar descriptive analyses were performed for the EL-HPB-COVID cohort, and by survivor and deceased groups.

To explore the risk factors associated with COVID-19 susceptibility and subsequent survival, the effect size for each variable under investigation was evaluated with ORs with 95% CIs, using regression models with a binomial distribution. Crude ORs were obtained from univariable regression models, and then simultaneously controlled for a fixed set of potential confounders (gender, ethnicity, age group) using multivariable regression models with Benjamini-Hochberg correction for p value adjustment. The median age of the overall EL-HPB cohort being 57 years, a simplified binary age grouping (18–60, 61+ years) was used in multivariable regression models for comorbidity, lifestyle, medication use and post-diagnosis complication analyses. Since a participant with non-malignant HPB diagnoses for multiple organs can be represented in multiple HPB subgroups, the effect estimation for individual HPB disease variables was further mutually controlled for other HPB diseases.

We also conducted more in-depth post hoc analysis to evaluate the confounding effect of pre-existing medical conditions by adding comorbidity covariates individually in the multivariable regression models. Finally, effect modification by non-malignant HPB disease subgroups was evaluated by adding interaction terms in the models and comparing them with models lacking this interaction via the likelihood ratio test. Any potential association between HPB diseases and COVID-19 susceptibility/mortality risk factors was further evaluated in stratified analyses according to the disease subgroups.

Patients with missing data for individual categorical variables were included in the descriptive analyses and in regression models for effect estimation. All statistical analyses and visualisations were performed in R (V.3.5.1).

Patient and public involvement

Patients and the public were involved in evaluating the design of the umbrella study (EL-PaC-Epidem), particularly the notion of collection and processing of retrospective patient data without their consent. The support from NHS Confidentiality Advisory Group was obtained based on the positive opinion posed by patients and the public.


Population characteristics

The final EL-HPB cohort consisted of 15 540 patients, after applying the eligibility criteria and excluding 168 suspected but unconfirmed COVID-19 cases. By 12 June 2020, 226 (1.5%; 145 per 10 000 adult population) confirmed cases of COVID-19 were reported in this cohort (figure 1). This was more than three times higher than in the general population of East London where prevalence of COVID-19 at the same time was 41 per 10 000 adult population.25 More than half of the COVID-19 cases had some form of non-malignant liver diseases (N=138, 62.8%); however, when comparing confirmed COVID-19 cases with the non-COVID-19 cases, we observed a disproportionate infection frequency in patients with chronic pancreatic conditions (14.1% vs 8.8%) and moderate to severe liver conditions (11.4% vs 6.9%). We also observed differences in gender, ethnic origin, and age group between COVID-19 and non-COVID-19 cases (table 2). The proportion of men was higher in the COVID-19 group compared with the baseline non-COVID-19 group (53.5% vs 43.7%, p=0.005). The same trend was observed for Black population (17.7% vs 10.7%). Patients with COVID-19 were older than patients without COVID-19 (median 67 years, IQR 55.1–80.9 years vs 57.1 years, 44.8–69.2 years, p<0.001), with a steady increase in infection frequency with age. Some 78.3% of patients with COVID-19 had three or more comorbidities, with hypertension being the most common comorbidity (85.4%), followed by high cholesterol and diabetes (table 2). Only eight patients with COVID-19 had none of the six medical conditions. In general, patients with COVID-19 had a higher rate of history of smoking, drinking, substance misuse and obesity compared with the non-COVID-19 group. Consistent with the underlying prevalent comorbidities of the COVID-19 group, a history of prescription drugs use associated with managing hypertension or cardiovascular disease (ACE inhibitor, calcium channel blocker, β-blocker, aldosterone antagonists, antiarrhythmic, antiplatelet, anticoagulant), cholesterol (statin), inflammation (glucocorticoid, β2-agonists) or background HPB condition (proton pump inhibitor) was higher in patients with COVID-19 (table 2). Intake of vitamin D was also significantly higher in patients with COVID-19.

Table 2

Differences in demographic, comorbidity, lifestyle, and medication use characteristics between COVID-19 infected and non-COVID-19 groups

Between 12 February and 12 August 2020, the all-cause mortality rate in the non-COVID-19 group was 2.4%, whereas the rate in the COVID-19 group during the same period was 27.4% (table 2). When analysing the 53 deceased and 164 surviving patients with confirmed SARS-CoV-2 infection, we found differences in gender (p=0.005) and age (p<0.001); deceased patients were older than the survivors (median 80.4 years, IQR 71.7–85.1 years vs 62.9 years, 49.8–77.4 years) with steady increase in death with age becoming prominent in those above 70 years of age (table 3). We observed a higher mortality among South Asian (34% vs 29.3%) and Black (26.4% vs 13.4%) populations, which were even more pronounced when comparing with the all-cause mortality in the non-COVID-19 group (online supplemental table 3). Higher mortality was observed for patients with pancreatic and biliary disease in general, whereas patients with liver disease had higher survival rate (table 3). The median survival period for the deceased patients from the date of confirmed COVID-19 diagnosis was 11 days (IQR 2–18 days). After stratifying patients according to the comorbidities investigated, the mortality for patients with HPB diseases with COVID-19 was at least six times higher than that of patients with HPB diseases without COVID-19 (online supplemental table 3). Diabetes, hypertension, cardiovascular and renal conditions, in particular, were associated with mortality in patients with COVID-19 (table 3). All except one deceased patient had at least three additional comorbidities, compared with 71.3% of patients who survived. The deceased group had higher proportion of patients with a history of smoking and current substance misuse, but no overall differences were observed for drinking and obesity. Notable differences were observed in the use of glucocorticoid, β2-agonists and statins. Recurrent complications were more common in the deceased group compared with survivors, however frequency of novel respiratory complications was notably higher in the deceased group (39.6% vs 21.3%).

Table 3

Differences in demographic, comorbidity, lifestyle, medication use, and post-diagnosis complication characteristics between COVID-19 survivor and deceased groups

Odds of SARS-CoV-2 infection

The risk analyses showed a greater odds of COVID-19 for men, the Black community and those with moderate to severe liver disease (figure 2). Patients with chronic pancreatic and mild liver conditions were also associated with a higher odds of infection (OR 1.89, 95% CI 1.25 to 2.85, p=0.007; 1.52, 1.07 to 2.15, p=0.039); however, post-hoc adjustment for comorbidities returned a reduced non-significant positive odds (1.57, 1.04 to 2.38, p=0.084; 1.32, 0.93 to 1.88, p=0.237), with diabetes principally responsible for this reduction (online supplemental table 4). The similar association was observed for elderly patients (over 70 years) with underlying multimorbidity as confounding factor. Patients with pre-existing renal conditions had the highest odds of COVID-19 (2.93, 2.2 to 3.89, p<0.001), followed by more than twofold increased odds for patients with hypertension, diabetes, cardiovascular or chronic respiratory disease (figure 2). However, the independent effects of hypertension and high cholesterol were absent when controlled for other comorbidities (online supplemental table 4).

Figure 2

OR estimates of COVID-19 for patients with HPB diseases with specific demographic, comorbidity, lifestyle and medication use characteristics. OR estimates for demographic characteristics are mutually controlled for each other, that is, gender, ethnicity and age group. Estimates for HPB disease subgroups are further controlled for each other. For comorbidity, lifestyle and medication use characteristics, estimates are controlled for gender, ethnicity and dichotomous age group (under and over 60 years). HPB, hepato–pancreato–biliary; NSAID, non-steroidal anti-inflammatory drug.

Substance misusers had higher odds of infection, but the higher odds observed for those with a history of smoking or obesity were due to underlying comorbidities. Patients on vitamin D treatment and past users of ACE inhibitors were associated with higher odds of infection. The slightly reduced yet significantly high odds remained after controlling for comorbidities, with renal (for vitamin D users) and cardiovascular (for ACE inhibitor users) diseases being the principal source of the reduced estimates (online supplemental table 4). Higher odds were also observed for users of PPIs, glucocorticoid, β-blockers, β2-agonists, aldosterone antagonist, muscarinic antagonist, antiplatelet, antiarrhythmic and statin compared with the non-users of these respective drugs; however, post-hoc adjustment for comorbidities returned non-significant positive odds for those.

A small number of factors appeared to modify the association between HPB disease subgroups and risk of COVID-19 infection (online supplemental table 5). In patients with mild liver disease, the odds of COVID-19 infection doubled for patients with chronic pancreatic disease compared with patients with no pancreatic condition (p value for heterogeneity, p-het by liver disease=0.02). A history of substance misuse was associated with significantly higher odds of infection, particularly for patients with chronic biliary conditions (p-het by biliary disease=0.03), and mild liver conditions (p-het by liver disease=0.04).

Odds of COVID-19-related death

The risk analyses showed an increased odds of COVID-19-related death for men, individuals from the Black community and patients who had acute respiratory complications during admitted care without a history of long-standing respiratory problems (figure 3). Increased odds of death were also observed for the glucocorticoid and β2-agonists. No HPB disease subgroups were particularly more vulnerable to COVID-19-related death, although patients with chronic pancreatic condition showed a trend towards significance. Elderly patients (over 70 years), and recent users of ACE inhibitors and non-steroidal anti-inflammatory drugs were associated with higher odds of death; however, post-hoc adjustment for comorbidities returned a non-significant positive odds for these risk factors (online supplemental table 6). Stratified analyses according to HPB disease subtypes did not reveal any meaningful effect modification, principally due to small EL-HPB-COVID sample size (data not shown).

Figure 3

OR estimates of COVID-19-related death for patients with HPB diseases with specific demographic, comorbidity, lifestyle, medication use and post-COVID-19 diagnosis complication characteristics. OR estimates for demographic characteristics are mutually controlled for each other, that is, gender, ethnicity and age group. Estimates for HPB disease subgroups are further controlled for each other. For comorbidity, lifestyle, medication use and post-diagnosis complication characteristics, estimates are controlled for gender, ethnicity and dichotomous age group (under and over 60 years). Categories with OR p>0.95 are not shown. HPB, hepato–pancreato–biliary; NSAID, non-steroidal anti-inflammatory drug.


We present, for the first time, data on a large, single-centre, multi-ethnic cohort of patients with HPB diseases, where primary, secondary and tertiary care EHRs were integrated to investigate the incidence and outcome of COVID-19, to demonstrate how key demographic characteristics and a range of comorbidities, lifestyle factors and medications are associated with SARS-CoV-2 infection and outcomes in patients with HPB diseases.

Comparison with other studies

We noted higher odds of COVID-19 in patients with prior pancreatic and liver conditions. The higher odds associated with liver conditions are consistent with earlier findings.6 19 Patients with moderate to severe liver conditions had higher susceptibility to SARS-CoV-2 infection than those with milder conditions, which could be due to the increase in abnormalities of immune function with severity of this disease group.32 We speculate that reduced pancreatic function, particularly in individuals with chronic pancreatic conditions, leading to altered digestion, and therefore gut flora, may make patients more susceptible to pathogens with an enteric route of SARS-CoV-2 infection,33 34 and also contribute to the magnitude of COVID-19 severity via modulating host-immune responses.35 Surprisingly, the most vulnerable patients with cancer had a low COVID-19 incidence rate, which may reflect the effectiveness of public health interventions such as shielding.36 However, at the same time, we noted a 17% death rate in this cohort (not due to COVID-19) in 6 months (online supplemental table 3), perhaps indicating the unintended, but potentially inevitable, negative sequelae of social distancing and reduced healthcare provisions for this group of patients as resources were diverted to COVID-19-affected patients.

Men had higher odds of infection and mortality than women, which is consistent with previous reports,1 14 and could be due to a favourable genetic predisposition to the virus,37 and/or gender differences in risk behaviours. Our study also affirms older age, particularly over 70 years, as an established risk factor for COVID-19 incidence and mortality2 4 5; however, this can be largely explained by the presence of multiple comorbidities in the older age groups.38

COVID-19 statistics have highlighted a disproportionate effect on BAME ethnic groups with an increased risk of infection and poor outcomes.13–15 Our results confirm that people from Black community are at a higher risk of both COVID-19 infection and related mortality compared with the White ethnic group. Only a small part of the excess risk in the Black community is explained by multiple comorbidities. Therefore, further variables such as deprivation, occupational exposure and living conditions might be useful to explore as potential factors behind the apparent vulnerability of the Black population to COVID-19.

All comorbidities, such as diabetes, hypertension, high cholesterol, cardiovascular disease, kidney and respiratory disease, were independently associated with an increased risk of COVID-19, whereas the presence of cardiovascular disease contributed to an added risk of death, concurring with previously reported cohort studies.4–6 11 Interestingly, our results highlight that for patients without underlying respiratory issues, an acute respiratory episode due to SARS-CoV-2 infection could be indicative of a worse outcome. This is in line with previous reports describing an unexpectedly lower prevalence of chronic respiratory conditions among those who had been admitted to hospital due to COVID-1939 40; whereas severe outcomes are often a result of respiratory complications,41 42 such as acute respiratory distress syndrome and respiratory failure.

Smoking leads to severe health consequences, which explains the greater risk observed in our cohort of past smokers with high prevalence of respiratory and cardiovascular diseases. However, current smoking status appeared to have a protective effect in our cohort after adjusting for comorbidities, as has been observed by others, an aspect which cannot be mechanistically explained.5 43 Carefully designed analyses are needed to explore the association and causality between smoking status (both current and past), associated comorbidities and COVID-19.

Although substance misuse leads to a plethora of cardiorespiratory and metabolic problems, its role in COVID-19 remains unexplored. To date, this is the first study providing a concrete measure of the risk of COVID-19 for substance misusers. Our initial results showing that substance misusers are at a heightened risk for COVID-19 irrespective of the comorbidities warrants a strong case for considering it as an independent risk factor for COVID-19, and may be related to high-risk behavioural patterns.44 45

Previous studies have found a significant relationship between obesity and an increased risk of COVID-19,7 and subsequent hospitalisation,46 advanced levels of treatment15 and death.4 6 However, our study does not suggest any particular effect of obesity on COVID-19 for patients with HPB conditions, who have a higher prevalence rate of obesity compared with the UK general population (38.8% vs 26%).47 Our study suggests that the difference in effects for potential susceptibility to COVID-19 for patients with a history of obesity is attributed more to other prevalent factors—such as cardiovascular or chronic renal disease47 48 —which in turn might be the consequences of obesity in these patients’ lifetime.

Concerns have been raised regarding the use of various medications with respect to the risk of COVID-19 and the subsequent outcome; and, our analyses contribute to that discussion for some of the widely used prescription drugs. The higher odds observed for the history of various prescription drugs use are consistent with the management of underlying prevalent comorbidities of the study cohort: cardiovascular conditions (ACE inhibitor, β-blocker, aldosterone antagonists, antiplatelet, antiarrhythmic), cholesterol (statin), chronic respiratory diseases (glucocorticoid, β2-agonists, muscarinic antagonist), or background HPB condition (PPI). An important finding from our study is the significant risk observed for vitamin D users, supportive of the possible association between development of COVID-19 and vitamin D deficiency,49 50 or specific medical conditions (such as kidney failure) where vitamin D prescription is prevalent. Given that BAME communities are observed to be at a high risk of COVID-19, and there is evidence that vitamin D deficiency is particularly common in these ethnic groups,49 50 further research on the relationship between vitamin D and COVID-19 is required, with a need to exclude confounding factors such as patients’ vitamin D level. Our results also suggest that patients currently taking PPIs are more susceptible to SARS-CoV-2 infection, which concurs with a large population-based online survey conducted in the USA.51 The use of PPIs is highly prevalent in patients with HPB diseases for the management of gastrointestinal acid-related disorders, and the finding here supports the hypothesis that current use of PPIs might influence the susceptibility to SARS-CoV-2 infection in the gastrointestinal tract through reduction of stomach acid.51 52

The literature is conflicted on the potential impact of antihypertensive drugs on COVID-19, particularly those that act as inhibitors to the renin–angiotensin–aldosterone system (RAAS) and upregulate ACE2 expression, suggesting these drugs may be potential risk factors for infection,53 54 but also as having a protective effect on outcome.55 However, recent studies found no underlying association between the use of different classes of antihypertensive drugs and the risk of developing COVID-19.16 With a high percentage of patients with hypertension in the study cohort, our finding that a high risk of COVID-19 is associated with past intake of ACE inhibitors or aldosterone agonists is suggestive of the potential risk of switching from one class of antihypertensive drug to another. This contributes to the debate of whether discontinuation of RAAS inhibitors and considering alternative antihypertensive therapy in times of COVID-19 would be a good practice or not.56 A marginal association of current use of ACE inhibitors with COVID-19-related death suggests that any increased risk of mortality is likely to be small and will need to be scrutinised in future as more data accumulate.

Our study also shows that recent users of anti-inflammatory drugs, namely glucocorticoid and β2-agonists, had increased odds of COVID-19 and subsequent poor outcome. Controlling for comorbidities resulted in non-significant odds of infection for these patient subgroups, indicating underlying medical conditions—particularly those of respiratory system—to be responsible for the increased susceptibility. However, the observed harmful associations between these drugs and COVID-19-related death could not be explained by a simplified binary representation of underlying six common health conditions. Glucocorticoid drugs, for instance, are used to treat many other inflammatory conditions, notably inflammatory bowel disease (IBD), whereas HPB diseases constitute some of the most common extraintestinal manifestations of IBD. It has been shown that use of corticosteroids is associated with adverse COVID-19 outcomes among patients with IBD.17 Had it been possible to successfully control for differences in respiratory disease severity or other medical comorbidities, we speculate to see different and possibly non-significant odds of death in these patient subgroups.

Strengths and limitations of the study

A key strength of our study is that we have systematically identified the effect, or the lack of it, of individual demographic and clinical factors on the infection and mortality of COVID-19 in a cohort of over 15 000 patients, robustly corrected for potential confounders in their evaluation. Our large population is highly representative of patients with HPB diseases from diverse ethnic groups, which contributes to the generalisability of our findings. Another strength is our use of linked EHRs, harmonised for variations in coding that exist between different EHR systems. We ascertained patient demographics, lifestyle, comorbidities and medications by linking hospital records with pseudo-anonymised longitudinal primary care records, which substantially enrich the data that are recorded on hospital visits.

Retrospective EHR-based COVID-19 studies often suffer from incomplete or missing data on patient characteristics, including key variables such as BMI, ethnicity, smoking or pre-existing comorbidities.4 57 The missing data are particularly applicable to otherwise healthy patients with COVID-19 with low use of healthcare services in the past. However, our patient cohort had already been treated or managed at BHNT hospitals at least once, and often referred through primary care, which led to near-complete data for this study, an added advantage of this study. For instance, ethnicity, a common demographic feature, is missing only for 2.8% of cases in our cohort while the rate is significantly higher in other studies (up to 20% of cases).4 57 The only variable with missing data frequency over 20% in our study is substance misuse behaviour (50.4%). This is a unique lifestyle risk variable which is not yet explored—understandably due to a lack of recorded data as people often do not disclose this information to their clinicians, 58 unless manifested in physical or mental disorders. Yet, the substance misuse history of over 7600 patients included in this study provides a good indication of the impact of COVID-19 on this understudied group.

Our study also has some important limitations. One limitation is the risk of residual confounding or confounding by indication due to unmeasured or simplified binary representation of potential confounding variables. For example, the observed association between vitamin D users and risk of COVID-19 may be different if participants’ vitamin D level/deficiency status had been taken into consideration. Similarly, the observed association between COVID-19-related death and recent use of glucocorticoid or β2-agonists may reduce or get amplified if respiratory disease severity or other indications for corticosteroid use were considered.

Another critical limitation is associated with the confirmation of East London residency for the study cohort. Patients’ addresses (current or past) are not collected under the umbrella study, which considers patients with HPB conditions (with the exception of cancer) treated or managed at BHNT hospitals as East London residents during the time of their care. The Royal London Hospital hosts one of the largest HPB centres in England, and supports patients with suspected or confirmed HPB cancer from nearby geographical areas. As the umbrella study cohort is historical, we acknowledged the probability of people moving away from East London in the meantime. In absence of a patient’s current address to confirm their residency at the outset of COVID-19 pandemic in the UK, we relied on an indirect measure to infer residency. We used a strict 6-month window preceding the study to identify a patient’s interaction with East London GPs or BHNT hospitals. Thus, we believe that any supposed reduction in the cohort size due to unaccounted change of residency within that window should have affected the COVID-19 and non-COVID-19 group in equal proportion, and hence unlikely to alter the findings we report here.

Due to the rarity of the outcome (SARS-CoV-2 infection) in the full HPB cohort, the effects reported in the study could be influenced by the smaller cohort size of COVID-19 cases. We recognise that larger sample sizes of patients with COVID-19 are needed to fully understand the effect of SARS-CoV-2 in patients with HPB conditions. Our results are the first step towards this and require validation in similar national and international cohorts.


We believe that the findings from this single-centre study, focusing on patients with a particular medical condition and in an ethnically diverse area, highlight some considerations that could guide clinical care while we await an effective antiviral strategy for COVID-19. The current findings reinforce our understanding of some of the important risk factors for SARS-CoV-2 infection but with regard to pre-existing HPB conditions, and allow stratification for risk, thereby providing a tool for policymakers to divert prevention as well as treatment to a clearly identified vulnerable population.

Data availability statement

All statistical data relevant to the study are included in the article or uploaded as supplemental information. Only the corresponding author had full access to all the participants’ data in the study. The authors confirm that researchers seeking the completely anonymised final analysis dataset for this work can submit a data request to the corresponding author.

Ethics statements

Ethics approval

All data used for this study were collected and processed under the EL-PaC-Epidem Study at Barts Health NHS Trust. The study was approved by the East of England-Essex Research Ethics Committee (19/EE/0163; 17 May 2019) and supported by the NHS Confidentiality Advisory Group for collecting and processing confidential patient information without consent (19/CAG/0219; 17 January 2020).


AZMDU is supported by Health Data Research UK (HDR-UK) to conduct the umbrella study EL-PaC-Epidem, which is funded by the UK Medical Research Council. We gratefully acknowledge support provided by the Pancreatic Cancer Research Fund (PCRF), for conducting public–patient engagement activity and facilitating ethical approval for EL-PaC-Epidem. We thank Dr Charles Gutteridge, Chief Clinical Information Officer at Barts Health NHS Trust, for his help with the collection of secondary and tertiary care data. We thank Dr Kambiz Boomla, Dr John Robson, Professor Carol Dezateux, and members of the Discovery East London Programme Board and developers at Learning Health Solutions for their support in facilitating collection of primary care patient records. Finally, we acknowledge the contribution to the research made by several members of the PCRF Tissue Bank team, Bioinformatics Unit, and clinical research fellows at Barts Cancer Institute through insightful medical and scientific discussion.


Supplementary materials


  • Contributors AZMDU designed the study, and was responsible for undertaking and completing data collection, processing and analysis. HK and CC oversaw the conduct and management of the study. AZMDU, LS, HK and CC contributed to the selection of study variables and interpretation post-analysis. AZMDU wrote the first drafts of the report and all the authors made critical revisions.

  • Funding The study is conducted under an umbrella study, focusing on the epidemiology of pancreatic and other hepatobiliary cancers in East London (EL-PaC-Epidem), funded by the Medical Research Council UK (Ref: MR/S003835/1) as a UKRI/Rutherford Fellowship to the corresponding author. No additional funding has been received for this study.

  • Competing interests None declared.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.