Table 1

Data sets for linkage

DatabaseDescriptionKey variables
The Connecticut Tumour Registry (CTR)The CTR is a population-based resource for examining cancer patterns in Connecticut which includes all reported cancers diagnosed in Connecticut residents since 1935, as well as follow-up, treatment and survival data. All licensed medical providers, as well as hospitals and private pathology laboratories in the state, are required by law to report cancer cases to the registry, including those that care for incarcerated individuals. The CTR is the oldest population-based cancer registry in the country. Rigorous quality control procedures, stringent requirements in case reporting, and reciprocal cancer reporting agreements with neighbouring states allow the registry to identify cancers among all Connecticut residents even when diagnosed or treated in other states. CTR data have been used widely in research into cancer aetiology, epidemiology and quality of care.Name*, date of birth*, social security number*, age, race/ethnicity, marital status, sex, residential census tract at time of diagnosis, insurance at time of diagnosis, dates of diagnosis and treatment, vital status, date of last contact, cause of death.
Connecticut Department of Correction (CDOC)The CDOC has an annual population of approximately 15 000 individuals, with disproportionate incarceration of racial and ethnic minorities (demographically similar to rates of incarceration nationwide). CDOC also has a combined criminal justice system, where jails and prisons are under the authority of a single agency. CDOC supports research aimed at improving the health of, and reducing recidivism for, justice involved individuals and has partnered with many academic institutions on federally funded grants.56Dates of incarceration, date of release (if applicable), inmate name*, any known alias(es)*, inmate number, place of incarceration, date of birth*, race, social security number*, sex, and place of birth.
  • *These variables were used in the record linkage only and were not part of the analytical dataset.