Methods of using real-time social media technologies for detection and remote monitoring of HIV outcomes

Prev Med. 2014 Jun:63:112-5. doi: 10.1016/j.ypmed.2014.01.024. Epub 2014 Feb 8.

Abstract

Objective: Recent availability of "big data" might be used to study whether and how sexual risk behaviors are communicated on real-time social networking sites and how data might inform HIV prevention and detection. This study seeks to establish methods of using real-time social networking data for HIV prevention by assessing 1) whether geolocated conversations about HIV risk behaviors can be extracted from social networking data, 2) the prevalence and content of these conversations, and 3) the feasibility of using HIV risk-related real-time social media conversations as a method to detect HIV outcomes.

Methods: In 2012, tweets (N=553,186,061) were collected online and filtered to include those with HIV risk-related keywords (e.g., sexual behaviors and drug use). Data were merged with AIDSVU data on HIV cases. Negative binomial regressions assessed the relationship between HIV risk tweeting and prevalence by county, controlling for socioeconomic status measures.

Results: Over 9800 geolocated tweets were extracted and used to create a map displaying the geographical location of HIV-related tweets. There was a significant positive relationship (p<.01) between HIV-related tweets and HIV cases.

Conclusion: Results suggest the feasibility of using social networking data as a method for evaluating and detecting Human immunodeficiency virus (HIV) risk behaviors and outcomes.

Keywords: Big data; Digital epidemiology; HIV detection; HIV prevention; Social networking.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Disease Outbreaks / statistics & numerical data*
  • HIV Infections / diagnosis
  • HIV Infections / epidemiology*
  • Humans
  • Internet*
  • Prevalence
  • Public Health / methods*
  • Social Media*
  • United States / epidemiology