Article Text

Download PDFPDF

Curating a knowledge base for individuals with coinfection of HIV and SARS-CoV-2: a study protocol of EHR-based data mining and clinical implementation
  1. Chen Liang1,2,
  2. Sharon Weissman2,3,
  3. Bankole Olatosi1,2,
  4. Eric G Poon4,
  5. Michael E Yarrington4,
  6. Xiaoming Li2,5
  1. 1Department of Health Services Policy and Management, University of South Carolina, Columbia, South Carolina, USA
  2. 2Big Data Health Science Center, University of South Carolina, Columbia, South Carolina, USA
  3. 3Department of Internal Medicine, University of South Carolina, Columbia, South Carolina, USA
  4. 4Department of Medicine, Duke University, Durham, North Carolina, USA
  5. 5Department of Health Promotion Education and Behavior, University of South Carolina, Columbia, South Carolina, USA
  1. Correspondence to Dr Chen Liang; cliang{at}


Introduction Despite a higher risk of severe COVID-19 disease in individuals with HIV, the interactions between SARS-CoV-2 and HIV infections remain unclear. To delineate these interactions, multicentre Electronic Health Records (EHR) hold existing promise to provide full-spectrum and longitudinal clinical data, demographics and sociobehavioural data at individual level. Presently, a comprehensive EHR-based cohort for the HIV/SARS-CoV-2 coinfection has not been established; EHR integration and data mining methods tailored for studying the coinfection are urgently needed yet remain underdeveloped.

Methods and analysis The overarching goal of this exploratory/developmental study is to establish an EHR-based cohort for individuals with HIV/SARS-CoV-2 coinfection and perform large-scale EHR-based data mining to examine the interactions between HIV and SARS-CoV-2 infections and systematically identify and validate factors contributing to the severe clinical course of the coinfection. We will use a nationwide EHR database in the USA, namely, National COVID Cohort Collaborative (N3C). Ultimately, collected clinical evidence will be implemented and used to pilot test a clinical decision support prototype to assist providers in screening and referral of at-risk patients in real-world clinics.

Ethics and dissemination The study was approved by the institutional review boards at the University of South Carolina (Pro00121828) as non-human subject study. Study findings will be presented at academic conferences and published in peer-reviewed journals. This study will disseminate urgently needed clinical evidence for guiding clinical practice for individuals with the coinfection at Prisma Health, a healthcare system in collaboration.

  • COVID-19
  • HIV & AIDS
  • Health informatics

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.


  • Contributors CL conceived the study design and drafted the manuscript. CL completed preliminary data collection. SW, BO, EG-CP, MY and XL contributed critical edits to the manuscript. All authors reviewed and approved the manuscript.

  • Funding Research reported in this publication was supported by the National Institute of Allergy And Infectious Diseases of the National Institutes of Health under Award Number R21AI170171. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; peer reviewed for ethical and funding approval prior to submission.