Using text-mining techniques in electronic patient records to identify ADRs from medicine use

Br J Clin Pharmacol. 2012 May;73(5):674-84. doi: 10.1111/j.1365-2125.2011.04153.x.

Abstract

This literature review included studies that use text-mining techniques in narrative documents stored in electronic patient records (EPRs) to investigate ADRs. We searched PubMed, Embase, Web of Science and International Pharmaceutical Abstracts without restrictions from origin until July 2011. We included empirically based studies on text mining of electronic patient records (EPRs) that focused on detecting ADRs, excluding those that investigated adverse events not related to medicine use. We extracted information on study populations, EPR data sources, frequencies and types of the identified ADRs, medicines associated with ADRs, text-mining algorithms used and their performance. Seven studies, all from the United States, were eligible for inclusion in the review. Studies were published from 2001, the majority between 2009 and 2010. Text-mining techniques varied over time from simple free text searching of outpatient visit notes and inpatient discharge summaries to more advanced techniques involving natural language processing (NLP) of inpatient discharge summaries. Performance appeared to increase with the use of NLP, although many ADRs were still missed. Due to differences in study design and populations, various types of ADRs were identified and thus we could not make comparisons across studies. The review underscores the feasibility and potential of text mining to investigate narrative documents in EPRs for ADRs. However, more empirical studies are needed to evaluate whether text mining of EPRs can be used systematically to collect new information about ADRs.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Adverse Drug Reaction Reporting Systems / statistics & numerical data*
  • Algorithms
  • Data Mining / methods*
  • Drug-Related Side Effects and Adverse Reactions*
  • Humans
  • Medical Records Systems, Computerized / statistics & numerical data*
  • Natural Language Processing*
  • Pharmacovigilance*