Article Text


The use and impact of cancer medicines in routine clinical care: methods and observations in a cohort of elderly Australians
  1. Sallie-Anne Pearson1,2,3,
  2. Andrea Schaffer1,3
  1. 1Faculty of Pharmacy, University of Sydney, Sydney, New South Wales, Australia
  2. 2School of Public Health, Sydney Medical School, University of Sydney, Sydney, New South Wales, Australia
  3. 3Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
  1. Correspondence to Dr Sallie-Anne Pearson; sallie.pearson{at}


Introduction After medicines have been subsidised in Australia we know little about their use in routine clinical practice, impact on resource utilisation, effectiveness or safety. Routinely collected administrative health data are available to address these issues in large population-based pharmacoepidemiological studies. By bringing together cross-jurisdictional data collections that link drug exposure to real-world outcomes, this research programme aims to evaluate the use and impact of cancer medicines in a subset of elderly Australians in the real-world clinical setting.

Methods and analysis This ongoing research programme involves a series of retrospective cohort studies of Australian Government Department of Veterans’ Affairs (DVA) clients. The study population includes 104 635 veterans who reside in New South Wales, Australia, and were aged 65 years and over as of 1 July 2004. We will investigate trends in cancer medicines use according to cancer type and other sociodemographic characteristics as well as predictors of the initiation of cancer medicines and other treatment modalities, survival and adverse outcomes among patients with cancer. The programme is underpinned by the linkage of eight health administrative databases under the custodianship of the DVA and the New South Wales Ministry of Health, including cancer notifications, medicines dispensing data, hospitalisation data and health services data. The cancer notifications database is available from 1994 with all other databases available from 2005 onwards.

Ethics and dissemination Ethics approval has been granted by the DVA and New South Wales Population and Health Service Research Ethics Committees.

Results Results will be reported in peer-reviewed publications, conference presentations and policy forums. The programme has high translational potential, providing invaluable evidence about cancer medicines in an elderly population who are under-represented in clinical trials.

Statistics from


The emergence of effective cancer medicines is acclaimed as one of the major medical advances of the 20th century.1 For some cancers, cytotoxic chemotherapy is curative, even in the setting of metastatic disease, and the promise of long-term survival makes the risk of adverse events and financial costs worthwhile.2 Adjuvant chemotherapy for breast, colorectal or lung cancer offers significant survival gains beyond surgery alone. For patients with incurable disease, chemotherapy can also lengthen survival. The prospects for patients with cancer have improved further by the recent development of molecular profiling of tumour cells. This technology has facilitated the identification of ‘novel, druggable targets’ that can be modulated to control or cure cancer.3 Moreover, ‘personalised medicine’ has the potential to optimise treatment efficacy and minimise the toxicity of therapeutic agents.3

Drug regulatory bodies and funding agencies play an important role in translating current evidence into policy and clinical practice. Data generated in clinical trials form the basis for regulatory approval because licensing organisations such as the US Food and Drug Administration (FDA) and Australia's Therapeutic Goods Administration (TGA) request that manufacturers demonstrate new medicines are safe and efficacious before approval for marketing. After market approval, governments and other third party payers have developed subsidy systems to achieve greater access to pharmaceuticals. These systems involve a delicate balance between providing access to effective treatments while ensuring long-term fiscal sustainability.4 Worldwide, many third party payers now require that healthcare technology manufacturers provide evidence of a product's cost-effectiveness, based mostly on clinical trial evidence, to support subsidy applications.

However, regulatory approval and subsidy decisions are based on the assumption that clinical trial outcomes can be extrapolated to the general population. While clinical trials are the indisputable gold standard for demonstrating drug safety and efficacy, they have significant limitations as an evidence base for regulatory and subsidy decisions.5 ,6 Some of these drawbacks include: small sample size, short duration, use of surrogate endpoints, early stopping rules that permit cross-over to experimental therapy, lack of measurement of real clinical effects, bias towards younger patients and under-representation of patients with complex health problems.7

To add to these well-established limitations, the uncertainties for decision-makers have increased due to stakeholder demand for faster access to potentially life-saving medicines. While fast-track initiatives show commendable responsiveness to stakeholder concerns, the end result is that the premarket approval evidence base is becoming increasingly limited and medicines are reaching the marketplace earlier than ever before. Clearly, no clinical trial could ever be large enough or long enough to quantify all clinical outcomes, risks or otherwise; however, the current situation has created an urgent need to better understand how cancer medicines are prescribed in routine clinical care.

Australians receive subsidised access to a broad range of cancer therapies, including conventional cytotoxics as well as the newer biological agents. Australia is unusual among developed countries as it operates under a National Medicines Policy and a Quality Use of Medicines (QUM) framework.8 We have comprehensive policies guiding the production, licensing, subsidy and use of medicines. A significant feature of our QUM framework is the internationally recognised cost-effectiveness analysis model, employed by the Pharmaceutical Benefits Advisory Committee (PBAC) and the Repatriation Pharmaceutical Reference Committee to make recommendations for medicines subsidy on the Pharmaceutical Benefits and Repatriation Pharmaceutical Benefits Schemes (PBS and RPBS), respectively.

After medicines have been subsidised for use in Australia we know little about the way in which they are used in clinical practice, their impact on resource utilisation and whether the people who will derive the most benefit from their use are gaining access to them. This situation persists despite the existence of good quality, routinely collected administrative health data. Australia is in a unique position to undertake population-based pharmacoepidemiological research due to its universal healthcare arrangement and drug coverage for all Australians.

However, health data linkage initiatives have been impeded by privacy concerns and legislation. Moreover, there have been a range of cross-jurisdictional barriers to linking dispensing data (under the custodianship of the Commonwealth) and outcomes data such as hospitalisations (under the custodianship of individual states and territories). Some of the major barriers include the variation in privacy legislation across jurisdictions, lack of transparency and variability about the requirements of jurisdictional ethics committees and the different data release policies of data custodians.

To demonstrate the consequences of this situation we have been undertaking a comprehensive review of all peer-reviewed literature of pharmacoepidemiological research undertaken in Australia using PBS or RPBS dispensing data over a 25-year period (1987–2012). We identified 196 studies, 110 of which were published between 2007 and 2012. Approximately one-third of all published studies (n=70) were based on aggregated claims level data reporting the volume of medicines dispensed over designated time periods. These studies provide little evidence on the way in which medicines are prescribed and used in the Australian population. Another third of all studies (n=91) examined clinician or patient behaviour around a particular medication (coprescribing, potential drug interactions, switching behaviours or patient adherence to therapy). Only 13% of studies linked individual-level medicines use to a health outcome. Moreover, there have only been six studies with a cancer medicines focus.9–14 As such, pharmacoepidemiological research has no systematic framework for undertaking timely studies of medicines use and health outcomes in real-world settings. In direct contrast, the Nordic countries have had long-standing and robust approaches to accessing routinely collected data for pharmacoepidemiological research. In a recent publication, they showcased their pharmacoepidemiological research with an output of 515 research papers in a period of only 6 years (2005–2010).15

There has been a notable exception in terms of advancing pharmacoepidemiological research in Australia. The Australian Government Department of Veterans’ Affairs (DVA) subsidises healthcare and pharmaceuticals for eligible veterans and their dependants, with the collected health claims data available for use in pharmacoepidemiological research.16–27 Since the DVA are responsible for all aspects of hospital and community care for their clients they have both exposure and outcomes data under their custodianship. More than one-third of all studies included in our review above used RPBS data. Moreover, more than two-thirds of all studies examining individual level medicines use in our review used RPBS data. To date, the vast majority of the DVA's pharmacoepidemiological research has been conducted under the auspices of the Veterans’ Medicines Advice and Therapeutics Education Services (Veterans’ MATES) program.28 However, cancer pharmacoepidemiology has not been a focus of the Veterans’ MATES program.

The programme of work outlined in this protocol brings together cross-jurisdictional data collections to link medicines exposure to real-world outcomes for a subset of elderly Australians residing in New South Wales (NSW), Australia's most populous state. The overall objective of the research programme is to evaluate the use and impact of cancer medicines in the real-world clinical setting. Specifically, we will investigate the trends in utilisation of chemotherapeutic and biological agents by cancer type, patterns of care following cancer diagnoses and outcomes associated with cancer medicines use.

Methods and analysis


Australia has a publicly funded universal healthcare system entitling all Australian citizens and permanent residents to a range of subsidised health services. This includes free treatment in public hospitals (funded jointly by the Commonwealth and State/Territory governments) and subsidised treatment in private hospitals (funded jointly by the Commonwealth and private health insurance). It also includes a range of subsidised outpatient services including consultations with clinicians (funded by the Commonwealth's Medicare Benefits Scheme, MBS) and medicines prescribed in hospitals and in the community (funded by the Commonwealth's PBS; medicines prescribed to public hospital inpatients are covered primarily by the hospital budget).

The DVA funds the healthcare of eligible veterans, war widows, war widowers and their dependants.29 In addition to the coverage received by the general population via the MBS and PBS, Repatriation Gold Card holders (ex-prisoners of war, World War I and II veterans and mariners, and their war widows/widowers) are also entitled to additional DVA-approved services and pharmaceutical items not available to the general population. White Card holders (other veterans or mariners who do not qualify for a Gold Card) are entitled to treatment for specific conditions approved by the DVA including a war-related or service-related injury, any malignant cancer and/or tuberculosis. All other conditions are treated and subsidised based on the same entitlements as the Australian general population. The Orange Card provides eligible veterans and mariners of British Commonwealth and allied countries who are 70 years of age or older and Australian residents subsidised access to approved pharmaceuticals according to clinical need.30 Clients can hold both White and Orange cards simultaneously.

DVA clients are a major subgroup of the Australian population. In December 2010, they constituted approximately 6% of Australians aged 65 years and older and 27% of Australians aged at least 85 years.31 Our research programme is limited to all DVA clients residing in the largest Australian state (NSW). DVA clients residing in NSW account for approximately one-third of the Australian DVA population and have a similar and gender profile to clients residing in other Australian states.31

Study design

This ongoing research programme comprises a series of retrospective cohort studies of DVA clients diagnosed with specific cancers. It involves the linkage of eight health administrative databases under the custodianship of the DVA and the NSW Ministry of Health (table 1). Data linkages of this kind have not occurred routinely in other research programmes using DVA claims data.

Table 1

Description of linked data sets

Data sources

The linked databases are described below and in table 1.

DVA client database

The DVA provide data on clients’ sex, dates of birth and death, level of healthcare entitlement history (Gold, White, Orange) and postcode of residence history mapped to statistical local areas (SLAs) and local government areas (LGAs).32 The SLAs and LGAs form part of the Australian Standard Geographical Classification,33 established to enable the development of geography-specific statistics and classification of areas according to socioeconomic profile and remoteness. We did not obtain individual-level data on the service history of the clients in our cohort; however, almost all male DVA clients are veterans of the Australian forces and almost all female clients are war widows.31


The PBS is a national programme subsidising prescription medicines for all Australians, while the RPBS subsidises additional items for DVA clients. It contains data on all dispensed pharmaceutical items (item name and strength, date of supply, quantity supplied and number of repeats) that were subsidised in whole or in part by the DVA and dispensed in the community or in a private hospital anywhere in Australia.

DVA hospital episodes

The DVA collects information on all hospitalisations to public and private hospitals across Australia where the hospital stay was subsidised by the DVA, including dates of stay, primary diagnosis and primary procedure. Claims from public and private hospitals are processed differently and there is a significant lag in the capture of public hospital claims (up to 1 year from the date of service in some instances).

DVA health services

These data contain all billed medical and allied health services performed anywhere in Australia both in-hospital and out-of-hospital that are covered under Australia's national MBS, as well as additional items covered by the DVA. MBS items include physician visits, diagnostic and therapeutic procedures and pathology tests and additional DVA items including services provided by dentists, podiatrists, chiropractors and dieticians.

NSW Central Cancer Registry

The Central Cancer Registry (CCR) records all cancer cases diagnosed in NSW residents. The registry is run according to the International Association of Cancer Registries’ rules34 and records the cancer type (ICD-O-3), date of diagnosis, degree of spread at the time of first diagnosis for solid tumours and the date and cause of death (cancer or non-cancer).35

NSW Admitted Patients Data Collection

The Admitted Patients Data Collection (APDC) is a census of all inpatient separations (ie, discharges, transfers, deaths) from all public, private and repatriation hospitals, private day procedures centres and public nursing homes in NSW and collects data on the dates of admission and separation, and up to 50 diagnoses and procedures.

Emergency Department Data Collection

The Emergency Department Data Collection (EDDC) records the date, the reason for the visit and the visit outcome (eg, admission to hospital, death or discharge) for all ED visits to 90 public EDs across NSW. There are 150 EDs in NSW, two of which are located in private hospitals. All of the larger EDs contribute data to the EDDC so the data set includes the majority of ED attendances.36 Moreover, using the APDC, it is possible to identify ED presentations which resulted in an admission to hospital.

NSW Register of Births, Deaths and Marriages (RBDM) plus Australian Bureau of Statistics Cause of Death

The RBDM records all births and deaths in NSW. Details of all registered deaths are forwarded to the ABS where causes of death codes (ICD-10) are applied to each record. The coded data then form part of the registered death information. However, there is often a significant lag between fact and cause of death data due to the time taken to process cause of death information.

Data linkage

The first linked data were obtained in 2011. The data linkage was performed by the Centre for Health Record Linkage (CHeReL) which maintains a record linkage system for health-related data in NSW and the Australian Capital Territory in accordance with all ethical, legal, privacy and confidentiality requirements. The CHeReL keeps a Master Linkage Key that consists of continuously updated links between most NSW data sets.

The DVA provided the CHeReL with an encrypted client number and personal information for all clients. The CHeReL then assigned a project person number (PPN) to each DVA client. A ‘project key’ containing the PPN and encrypted client number for each respective database is sent to the various data custodians who decrypted the client number and attached the PPNs and requested content variables. These were subsequently sent to the researchers stripped of personally identifying information such as name and address. All the data sets were joined by the researchers using the PPNs. This process is outlined in figure 1.

Figure 1

Data linkage process. CHeReL, Centre for Health Record Linkage; DVA, Department of Veterans’ Affairs; NSW, New South Wales.

The linkage used probabilistic matching37 using the record linkage software ChoiceMaker,38 together with clerical review of possible matches. Probabilistic matching assigns a weight to pairs of records based on how likely they are to be a true match based on a comparison of name, date of birth and address. Records with high weights are considered a true match while those with low weights are false matches; uncertain matches undergo clerical review. The cut-off for determining whether a match is true or false is chosen as to minimise the number of false positives, false negatives and the number of clerical reviews.39 The linkage algorithms take into account name reversals, name shortenings and a limited number of keystroke errors in date of birth. The CHeReL routinely performs these linkages, and standard operating procedures exist to fine tune the linkage sensitivity and specificity depending on the cohort data being linked. CHeReL follows established quality assurance processes for their master linkage key, linkage procedures and linkage staff. In general, these processes achieve false positive rates of 0–5/1000 and false negative rates of <1/1000. However, the quality of the probabilistic matching process is highly dependent on the quality of the data provided, and in the case of DVA clients, important linkage variables such as name, date of birth and death and address histories are well documented. For example, an audit of the first CCR, APDC and EDDC linkages found a false positive rate of 0.4% and a false negative rate of <0.5%. Moreover, individual research projects conducted under the auspices of this programme start with a comprehensive review of the cohort to minimise the inclusion of false positive cases. This is carried out by identifying and excluding cases where there are discrepancies between dates of birth and death across different databases or in circumstances where a client has observations of health resource use after their date of death.

Study population

As this is an ongoing study, the characteristics of the population change over time. Characteristics of the study population as described below are summarised for each client at 1 July 2004 or the date of their first healthcare entitlement, whichever occurred later. There were 104 635 DVA clients who resided in NSW, were aged 65 years or older and received full or partial funding of medicines and/or health services. Figure 2 shows the proportion of the NSW population represented by DVA clients.

Figure 2

Age distribution of New South Wales population and Department of Veterans’ Affairs population, July 2004.

The vast majority of these DVA clients (89%) received full healthcare entitlements (Gold Card holders). For these individuals there will be almost complete ascertainment of subsidised healthcare utilisation by combining DVA and NSW databases; it is this subpopulation that we will generally use for analysis of health services utilisation and medicines use. An additional 5% received subsidised pharmaceuticals only (Orange Card holders). Inclusion of this cohort in analyses would be appropriate to answer research questions that require pharmaceutical claims data only. Finally, 6% of DVA clients received subsidies for health services and pharmaceuticals related to the treatment of specific conditions related to service or any malignancy (White Card holders). Inclusion of this cohort would be appropriate for research questions about the medicines and health service use of clients with cancer.

Over half (52%) of Gold Card holders were women, while men were the vast majority of White and Orange Card holders (table 2). The gender disparity at different levels of entitlement reflects the fact that the majority of female DVA clients are war widows,31 who are not eligible for White or Orange Cards. Most clients were aged between 80 and 84 years. Given the older age of this population, attrition was high with approximately 7% dying every year of follow-up. A quarter of clients (26%) had been diagnosed with cancer, with prostate, skin and colorectal cancers being the most common. White and Orange Card holders were less likely to have breast cancer given the fewer number of women with these entitlements. In a 1-year period, 54% of all DVA clients had no hospitalisations. Among those hospitalised, the comorbidity burden as measured by the Charlson index40 was low, with most having an index of 0 (table 2). Since the collection of hospital separation data is provided by the NSW Ministry of Health, it is not dependent on DVA subsidy, so there was little variation by level of DVA healthcare entitlement. In contrast, given the limited subsidy of pharmaceutical items among White Card holders, this group was more likely to have had no dispensing records in a 1-year period. When comorbidities are defined using the RxRisk,41 approximately one quarter of clients had seven or more comorbidities (table 2). Nearly two-thirds of clients lived in major cities (defined by the Accessibility/Remoteness Index of Australia),42 and nearly one-third resided in areas classified as the least disadvantaged (defined by the Index of Relative Socio-Economic Disadvantage43; table 2).

Table 2

Demographics of those alive, living in New South Wales and ≥65 years of age at 1 July 2004 or date of first healthcare entitlement

Statins were the most common class of medicines dispensed, representing between 7% and 8% of all dispensing records in a 1-year period, followed closely by proton pump inhibitors (6–7%; table 3). Among Gold Card holders, 43% of patients were ever dispensed paracetamol and 37% were ever dispensed a statin over a 1-year period. Among White and Orange Card holders, 19% and 34%, respectively, were ever dispensed a statin (table 3). The lower percentage among White Card holders likely reflects their lower rate of subsidy for pharmaceuticals. Cancer medicine use, including chemotherapy and hormone therapy, represented only 0.5% of dispensing records. Over a 1-year period the most commonly dispensed antineoplastics were gonadotropin-releasing hormone analogues, such as goserelin, and pyrimidine analogues, such as capecitabine, reflecting the high rates of prostate and colorectal cancers (table 4).

Table 3

Most common Repatriation Pharmaceutical Benefits Scheme medicines dispensed in a 1-year period

Table 4

Most common Repatriation Pharmaceutical Benefits Scheme cancer medicines (ie, cytotoxic and hormone therapy) dispensed in a 1-year period

DVA clients were high users of health services, with 64 health services per person-year. The majority of these were consultations and visits with healthcare professionals (41%), followed by pathology services (25%) and diagnostic and therapeutic procedures (14%; table 5).

Table 5

Number of health service records by type over a 1-year period

Statistical analysis

We will use a range of pharmacoepidemiological analyses to explore cancer medicines use and outcomes across our programme of work. We will identify patients diagnosed with cancer using cancer registry data, while we will identify users of cancer medicines based on pharmaceutical dispensing data. When determining specific research questions we will account for the availability and quality of the various data sources, the presence of secular trends and geographical variation that may affect the outcomes of the study. Importantly, the most contemporary cancer notification data are only available to Australian researchers until 2009, but outcomes that result in hospitalisation can be ascertained up until 2012. The predominant approaches are outlined below.

Cancer medicines use

We will ascertain use of cancer medicines using the pharmaceutical dispensing data. We will map all medicines to their WHO Anatomical Therapeutic Chemical (ATC) classification; medicines in the ATC categories L01 (‘antineoplastic agents’) and L02 (‘endocrine therapy’) will be the primary medicines of interest. Since the dispensing data do not contain information on the daily dose, we will develop an algorithm for each medicine of interest based on the amount dispensed, the number of repeats, the average time between refills by person and typical dosage, as we have carried out previously.10–12 Where this is not practicable we will use the WHO's ‘defined daily dose’. Using the hospital episode data we will also identify the administration of antineoplastic agents to public hospital inpatients, as medicines dispensed to these patients do not appear in the dispensing data.

Patterns of use

We will examine the trends in cancer medicines use using the following measures: cumulative incidence by medicine type over time, prevalence of use by medicine type by age group and by year, duration of use, average daily dose and the concomitant use of multiple cancer medicines. We will also estimate ‘off-label’ use of cancer medicines by using information about concomitant use of specific cancer medicines. We will report the characteristics of patients dispensed cancer medicines, specifically age, sex, geographical remoteness and area of socieconomic disadvantage, cancer type, disease spread and presence of comorbidities, and whether this changes over time. Age, sex, geographical remoteness and area of socieconomic disadvantage will be ascertained from the DVA client database, while cancer type and spread is available from the cancer registry. We will identify comorbidities using the Charlson index,44 and/or the Elixhauser index45; both indexes use diagnosis information from hospital separation data to identify comorbidities, and therefore clients who are not hospitalised will not be captured by these indices. Therefore, the RxRisk41 will also be used; this index uses dispensed medicines to identify 41 comorbidities.

Patterns of care

After identifying patients with cancer, we will describe the treatment pathway for specific cancer types after diagnosis. We will identify the receipt of cancer-related surgery, radiation therapy and chemotherapy using: (1) hospital separation data to identify in-hospital surgery, radiation therapy and chemotherapy; (2) pharmaceutical dispensing data to identify the dispensing of antineoplastic medicines in-hospital and out-of-hospital and (3) health services data to identify in-hospital and out-of-hospital surgery, radiation therapy and chemotherapy. We will further examine the temporal relationship between date of diagnosis and the receipt of different treatment types; where treatment is identified from multiple data sources, we will compare dates to avoid double counting.

In addition, where relevant, we will ascertain that appropriate monitoring is being performed for known side effects of cancer medicines using procedures and tests identified in health services data.

Health services utilisation

We will quantify the use of health services in the period prior to diagnosis. These include: the number of hospital separations and days spent in hospital, identified using the hospital separation data; the number of general practitioner consultations, specialist consultations and allied health consultations, identified using the health services data and the number of ED visits, identified using the ED data.

Predictors of health outcomes

We will identify predictors of the receipt of any treatment, and type of treatment, using logistic regression models. We will also examine predictors of time to therapy and time to death (both cancer death and all-cause mortality) using Kaplan-Meier curves and Cox proportional hazards models. Date of death will be ascertained using the DVA client database, while cause of death (cancer vs non-cancer) is provided by the cancer registry data. We will identify predictors of the rate of health service utilisation using Poisson regression or negative binomial regression, as appropriate. In all models the predictors we will consider include age at diagnosis, sex, geographical remoteness and area of socieconomic disadvantage, cancer type, disease spread at diagnosis and comorbidities.

We will also investigate the risk of specific adverse outcomes in clients with cancer who are taking a particular medicine, compared to a similar group of patients naive to the medicine of interest. Adverse outcomes will be identified using information such as: the dispensing of a new medicine that indicates the development of a new disease, a hospital separation with a diagnosis of the outcome of interest a new notified cancer and mortality. Care will be taken when selecting the comparison group to avoid confounding by indication; where practicable, we will apply high-dimensional propensity score analysis46 using the following dimensions: dispensed medicines, hospital diagnoses, hospital procedures and physician visit procedures. We will investigate the relationship between the timing of the adverse event and use of the medicine of interest using time-dependent Cox proportional hazard models.47 To best describe the relationship between the medicine of interest and the outcome, we will define medicine use in several different ways, using the daily dose information as described above, that is, any use, total cumulative dose, current dose, cumulative dose in the past 30 days, and recent initiation and/or discontinuation. Using different measures of medicine use will also allow us to perform sensitivity analyses to assess the robustness of our results.

Analyses will be performed using SAS Version 9.3, Stata Version 12 and R Version 3.1.0.

Validation and feasibility

As in any epidemiological study we must consider the potential biases in our research. Some of the concerns raised in relation to administrative database research and the conduct of pharmacoepidemiological research in Australia are described below.

Medicine exposure

Electronic pharmacy dispensing records are considered the gold standard of prescribed medicines exposure information, compared with patient notes and self-reported information.48 ,49 In addition, undertaking studies in the DVA client population have the distinct advantages of having good ascertainment of prescribed medicines use, even relatively cheap medicines that are not always captured in the general population. In Australia, we have comprehensive pharmaceutical claims data collections for prescribed medicines dispensed in community and private hospitals, but not for public hospital inpatients. Given the vast majority of oncology protocols are administered in the outpatient setting or to private hospital inpatients (both of which are captured in the R/PBS data), we believe this issue will not impact significantly on the outcomes of our analyses. However, if in the unlikely event that chemotherapy is administered to public hospital inpatients, it is indicated as a procedure code in the public hospital separation record. While we will not know the nature of the chemotherapy we can identify an episode when chemotherapy was administered in hospital, allowing us to estimate the chemotherapy use that was not identified in the R/PBS dispensing data. While there are clear limitations in terms of complete ascertainment of cancer medicines dispensing claims, our approach remains the best way of defining medicines exposure in whole of population studies. Since it is highly unlikely that we would miss every administration of a cancer medicine over the course of a chemotherapy protocol, we will almost certainly identify most patients who had received some type of cancer drug treatment.

Diagnosis, outcome and covariate misclassification

As the health administrative data sets lack detailed clinical information, one needs to assess the impact of misclassifying diagnoses and outcomes of interest. The accuracy of cancer incidence in DVA clients depends on the completeness and accuracy of the NSW CCR. Routine indicators of data quality and completeness show the Australian State and Territory cancer registries are operating in accordance with international standards.50 In terms of outcomes of interest, poor sensitivity may be an issue as we commonly identify outcomes that result in hospitalisation before we can identify potential outcomes of interest and the health service records contain no diagnostic information. However, poor specificity of the outcome is worse than a lack of sensitivity in most situations.51 If the specificity of the outcome is 100% then relative risk estimates are unbiased. A recent comprehensive study on the misclassification of claims data diagnoses using medical record chart review as the gold standard revealed the sensitivity of claims diagnoses is often less than moderate whereas specificity is usually 95% or greater. A high specificity of diagnostic coding in claims data can be expected because if a diagnosis is coded and recorded it is likely the diagnosis was made.52

The data also lack certain important covariates, including health behaviours such as smoking, drinking habits and obesity. Further, our data set is not linked to DVA military service records. In hospital separation data, comorbidities are only recorded if they are directly related to the reason for admission. As a result, there have been several algorithms developed to identify the presence of important comorbidities using administrative claims data, including in hospitalisation data (such as the Charlson index53) and in dispensing data (such as the RxRisk54).

In the general population, recording of services provided by Medicare Australia (ie, health services and dispensed medicines) is tied to those subsidised (in part or in full) by the government; therefore the record of their healthcare utilisation is incomplete, limiting the sorts of analyses that can be conducted. In contrast, the utilisation of health services by Australian veterans is nearly totally subsidised by the DVA, meaning there is near-complete capture of health services and hospital visits, creating a unique opportunity for study. The availability of data across multiple sources allows for validation of important endpoints such as death, for which we will have date of death information from a minimum of three sources and cause of death from a minimum of two sources (table 6).

Table 6

Data sources containing key analysis variables

Multiple data sources and observations

One of the greatest strengths of the DVA/NSW data linkage is the multiple data sources for the key analysis variables (table 2). For example, we will use hospital separation data from both NSW and DVA data sets. While some of this information is clearly duplicated, each data set brings its own unique benefits. The NSW APDC will contain hospital separations for DVA clients in NSW, and the DVA hospital episodes will record any hospital separation (as a DVA client) regardless of where it occurred in Australia.

External validity of research findings

One potential limitation of this research programme is that the health seeking behaviour and health outcomes of the veterans’ population may not be representative of older Australians. However, when compared with Australians of similar age, DVA clients have been shown to have very similar rates of health service and medicine use.55

Ethics and dissemination

The work outlined in this protocol describes a programme of research examining the real-world use and impacts of cancer medicines in a population of elderly Australians. The first linkages and data access occurred in 2011 and we have ongoing human research ethics approval until 2015.

The original proposals were submitted in two phases, the first of which approved the linkage of the DVA client data with NSW CCR and the RPBS data (approval numbers: 2008/02/060 and, E008/003). The second phase approved the linkage of the remaining data sets (approval numbers: 2010/03/217 and E010/009). The two-phased approach was developed after discussions with the DVA and NSW Ministry of Health data custodians. Given the novelty of the linkage, it was determined that the first linkage should be used to demonstrate the feasibility of the linkage after which time the additional data would be linked. Despite this, there was a 3-year delay between the first ethics approval and delivery of the data. The primary impediments related to a lack of capacity within the various agencies to extract the data and due to challenges in negotiating and signing a data service agreement between the DVA (a Commonwealth agency) and an academic institution.

The data for the research programme were released without individual consent. Therefore, the ethical and privacy issues related to the use and disclosure of information for a secondary purpose, and waiving informed consent. The use and disclosure of Commonwealth and state data are governed under the Privacy Act 1988 and Health Records and Information Privacy (HRIP) Act 2003, respectively. Information Privacy Principle (IPP) 2 under the Privacy Act 1988 (Commonwealth) provides that personal information should not be used or disclosed for any purpose other than the primary purpose of the collection. Further, Health Privacy Principles 10 and 11 of the HRIP Act provide that an organisation can only use and disclose health information for the purpose for which it was collected, or a directly related purpose.

We sought approval to use of data for a secondary purpose, that of research involving data linkage.

  • Under IPP2.1(d) use or disclosure for another purpose is permitted if (1) it is necessary for research and it is impracticable to gain consent and (2) the use is in accordance with the section 95A guidelines (which provides a process to resolve the conflict that may arise between the public interest in privacy and the public interest in medical research).

  • The statutory guidelines under the HRIP Act define the scope of particular exemptions in the health privacy principles. Under this act, data can be disclosed for a secondary purpose, such as research using the ‘research exemption’.

As such, we applied for these exemptions to the current research programme.

We sought to waive individual consent for the release of data because:

  • It is not possible or practical to obtain consent because of the large study population (more than 100 000 DVA clients) and a large proportion of DVA clients were likely to be deceased.

  • Obtaining consent would prejudice the scientific value of the research due to the high participation rates required for unbiased samples (at least 90%)56 and the Australian evidence about the sociodemographic differences between participants who consent to data linkage research and those that do not.57

  • The public interest in the research outweighs the public interest in privacy protection. This is because we know little about the way in which medicines are used in the real-world marketplace. Our research has the potential to address key issues such as the risks and benefits of specific cancer medicines in a subgroup of Australians.

We minimised the risk to personal privacy by:

  • A third party, the CHeReL, undertaking the record linkage, ensuring the smallest number of people have access to the identifiable data.

  • Ensuring data are stored securely.

  • Restricting access to identified data to officers in the CHeReL, whose access is password protected and subject to the signing of a confidentiality agreement.

  • Separating the data linkage process (where access to personally identified data is necessary), from the data analysis, so that personal information and health information are separated.

  • Ensuring researchers will not be in possession of any personal identifying information. The identifiable information (such as name and address) will be removed from the content data by linkage staff after record linkage has taken place.

  • Ensuring the files released to researchers will not contain DVA clients’ actual enrolment number, rather a unique project number generated by CHeReL staff.

  • Ensuring identifiable information will not be published.

Dissemination plan

The outcomes of this research will be submitted to international peer-reviewed journals; in particular oncology, general medical and pharmacoepidemiology journals. Furthermore, results will be presented at national and international oncology and pharmacoepidemiology conferences. We will also develop lay summaries of research findings for communication to consumer groups and policy makers where appropriate. As stipulated in our DVA data agreement we will submit all data that will be communicated in the public domain to the DVA for review and approval. Authorship will be based on the International Committee of Medical Journal Editors guidelines. Outcomes will also be posted on the University of Sydney webpage of the lead investigator and the Centre for Research Excellence in Medicines and Ageing website (under development). Direct access to the data and analytical files to other individuals or authorities is not permitted without the express permission of the approving human research ethics committees and data custodians.


The programme outlined in this protocol will form a solid foundation for cancer pharmacoepidemiology research and will allow Australian researchers to capitalise on the evidence derived and learning gained from this programme of work. Observational studies of the kind described in this protocol are particularly important as the elderly are under-represented in clinical trials but are the subpopulation with the highest rates of medicine use58 and are at the most at risk of adverse events.59 ,60 The DVA have a strong tradition of making data available for research purposes and this programme will establish a long-term programme in cancer pharmacoepidemiology.

Moreover, there have been encouraging developments in Australia in recent years that will increase the opportunities for Australian researchers to undertake whole of population, pharmacoepidemiology research. At the national level, the Department of Human Services (custodians of Medicare and PBS data) now provides timely access to person-level medicines dispensing data. At the state level, major linked data resources incorporating hospital separations, ED visits, cancer registrations, mental health service use and fact and cause of death data have been available for nearly two decades via the Western Australian Data Linkage System (WADLS) and the NSW CHeReL since 2006. A similar service has been recently established in South Australia and the Northern Territory via SA-NT Datalink. Other states will follow in the near future due to the injection of more than $A60 million by the National Collaborative Research Infrastructure Strategy (NCRIS). Moreover, there is a strong commitment from government for linked Commonwealth and State data through Integrating Authorities, which have become operational in 2012. With these advances cancer pharmacoepidemiology research in Australia has a bright future.


The authors would like to thank the Australian Government Department of Veterans’ Affairs (DVA) and the New South Wales (NSW) Ministry of Health for data provision and the Centre for Health Record Linkage for undertaking the data linkage process.


View Abstract


  • Contributors S-AP and AS drafted and approved the final manuscript. AS conducted the data analysis.

  • Funding This data linkage was funded by a Cancer Australia Project Grant (Number 568773). From 2014, the programme will be partly funded by the NHMRC Centre of Research Excellence in Medicines and Ageing (1060407).

  • Competing interests Sallie Pearson is a Cancer Institute NSW Career Development Fellow (12/CDF/2–25). Sallie Pearson was based at the University of NSW at the time she established the research programme. She is now at the University of Sydney.

  • Ethics approval The research programme was approved by the New South Wales Population and Health Services and DVA Human Research Ethics Committees

  • Provenance and peer review Not commissioned; externally peer reviewed.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.