Article Text

Case definitions in Swedish register data to identify systemic lupus erythematosus
  1. Elizabeth V Arkema1,
  2. Andreas Jönsen2,
  3. Lars Rönnblom3,
  4. Elisabet Svenungsson4,
  5. Christopher Sjöwall5,
  6. Julia F Simard1,6,7
  1. 1Clinical Epidemiology Unit, Department of Medicine, Solna Karolinska Institute, Stockholm, Sweden
  2. 2Department of Clinical Sciences, Lund University, Lund, Sweden
  3. 3Department of Medical Sciences, Science for Life Laboratories, Uppsala University, Uppsala, Sweden
  4. 4Rheumatology Unit, Department of Medicine, Karolinska University Hospital, Karolinska Institute, Stockholm, Sweden
  5. 5Rheumatology/AIR, Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
  6. 6Division of Epidemiology, Department of Health Research and Policy, Stanford School of Medicine, Stanford, California, USA
  7. 7Division of Immunology and Rheumatology, Department of Medicine, Stanford School of Medicine, Stanford, California, USA
  1. Correspondence to Dr Elizabeth V Arkema; Elizabeth.Arkema{at}ki.se

Abstract

Objective To develop and investigate the utility of several different case definitions for systemic lupus erythematosus (SLE) using national register data in Sweden.

Methods The reference standard consisted of clinically confirmed SLE cases pooled from four major clinical centres in Sweden (n=929), and a sample of non-SLE comparators randomly selected from the National Population Register (n=24 267). Demographics, comorbidities, prescriptions and autoimmune disease family history were obtained from multiple registers and linked to the reference standard. We first used previously published SLE definitions to create algorithms for SLE. We also used modern data mining techniques (penalised least absolute shrinkage and selection operator logistic regression, elastic net regression and classification trees) to objectively create data-driven case definitions. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated for the case definitions identified.

Results Defining SLE by using only hospitalisation data resulted in the lowest sensitivity (0.79). When SLE codes from the outpatient register were included, sensitivity and PPV increased (PPV between 0.97 and 0.98, sensitivity between 0.97 and 0.99). Addition of medication information did not greatly improve the algorithm's performance. The application of data mining methods did not yield different case definitions.

Conclusions The use of SLE International Classification of Diseases (ICD) codes in outpatient clinics increased the accuracy for identifying individuals with SLE using Swedish registry data. This study implies that it is possible to use ICD codes from national registers to create a cohort of individuals with SLE.

  • RHEUMATOLOGY
  • EPIDEMIOLOGY
  • STATISTICS & RESEARCH METHODS

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.