Cohort profile: the eLIXIR Partnership—a maternity–child data linkage for life course research in South London, UK

ABSTRACT
Purpose Linked maternity, neonatal and maternal mental health records were created to support research into the early life origins of physical and mental health, in mothers and children. The Early Life Cross Linkage in Research (eLIXIR) Partnership was developed in 2018, generating a repository of real-time, pseudonymised, structured data derived from the electronic health record systems of two acute and one Mental Health Care National Health Service (NHS) Provider in South London. We present early descriptive data for the linkage database and the robust data security and governance structures, and describe the intended expansion of the database from its original development. Additionally, we report details of the accompanying eLIXIR Research Tissue Bank of maternal and neonatal blood samples.
Participants Descriptive data were generated from the eLIXIR database from 1 October 2018 to 30 June 2019. Over 17 000 electronic patient records were included.
Findings to date 10 207 women accessed antenatal care from the two NHS maternity services, with 8405 deliveries (8772 infants). This diverse, inner-city maternity service population was born in over 170 countries with an ethnic profile of 46.1% white, 19.1% black, 7.0% Asian, 4.1% mixed and 4.1% other. Of the 10 207 women, 11.6% had a clinical record in mental health services with 3.0% being treated during their pregnancy. This first data extract included 947 infants treated in the neonatal intensive care unit, of whom 19.1% were postnatal transfers from external healthcare providers.
Future plans Electronic health records provide potentially transformative information for life course research, integrating physical and mental health disorders and outcomes in routine clinical care. The eLIXIR database will grow by ~14 000 new maternity cases annually, in addition to providing child follow-up data. Additional datasets will supplement the current linkage from other local and national resources, including primary care and hospital inpatient data for mothers and their children.


Strengths and limitations of this study
► The Early Life Cross Linkage in Research (eLIXIR) is a unique population-based database incorporating clinical data from maternity, neonatal and mental health records, enabling life course studies of physical and mental health in a large, diverse, inner-city, UK population.
► Studies undertaken using eLIXIR will not only have implications for local healthcare improvement but also the potential to provide evidence to influence healthcare in similar national/global settings.
► Missingness and inaccuracy, common to all routine/administrative clinical databases, will be a key limitation of this database.
► The representativeness of the cohort to the UK population is limited to mixed, inner-urban catchments.

INTRODUCTION
Investment in health in the earliest stages of life is increasingly recognised as a means to improve health across the life course, beginning in utero and continuing through infancy, childhood and into adulthood. 1 2 Much of the evidence underpinning a 'life course approach' to disease prevention from pregnancy and infancy onwards has been accrued from large birth cohorts [3][4][5][6]; however, directly recruited cohorts are by definition drawn from individuals recruited over a prespecified and limited time period and thereby become rapidly outdated as temporal shifts in population demography, lifestyle and ethnicity occur. Moreover, facility-centred follow-up is expensive and difficult to sustain. 7 8 Sample attrition is common and can introduce significant methodological biases that may affect the validity of investigations into novel risk outcomes. 9 In addition, direct recruitment may result in cohorts with a limited representation of the target population because of selective inclusion. Population-based registries also offer insight into rarer diseases and outcomes, which is not feasible in current research cohorts. Linked administrative data are increasingly used to provide evidence to guide policy and clinical management. [10][11][12] The longitudinal nature of such case registers, their size and coverage of defined populations provide an increasingly attractive alternative to the study of birth cohorts for defining the early life exposures that contribute to the population burden of physical and mental health disorders. These registers can provide longitudinal information on large numbers of women and children, as well as the potential for linkage with a widening portfolio of available local and national datasets to follow health from birth to adulthood (https://digital.nhs.uk/data-and-information/data-collections-and-data-sets/data-sets). Both inclusion and attrition bias of traditional birth cohorts can be overcome through routine comprehensive health records, as these can capture rich clinical data in a given population on all women receiving antenatal care and their infants. 13 Although such linkage is well established in Scandinavian countries, national birth registries in the UK have not been widely used in linkage programmes using infant and childhood data. Population registry data from Scotland have, however, for many years provided information on relationships between maternal and neonatal outcomes that has informed clinical guidelines in the UK and beyond. 14 Several linkages of clinical maternity and infant data have nonetheless shown the feasibility and usefulness of the approach, for example, in aligning hospital maternity data with national birth registration datasets, or birth registration datasets with Hospital Episode Statistics (HES), or using UK primary care pregnancy data to create a pregnancy register. [15][16][17][18][19][20][21][22][23][24][25]
It is well established that maternal physical and mental well-being in pregnancy and the postpartum period can strongly influence the neonatal outcome and the physical and mental health of the child. [26][27][28][29] To our knowledge, no clinical data linkages in maternity or neonatal services have to date incorporated clinical information from maternity, neonatal and mental health services into a single continuum to interrogate these associations at a population level. The Early Life Cross Linkage in Research (eLIXIR) Partnership has been developed to address these relationships from early pregnancy, the perinatal period and beyond into later life. Funded by the Medical Research Council (MRC) in 2017, the partnership is a multidisciplinary academic collaboration that aims to combine maternal, infant and child health data into a single resource to allow information from large numbers of mothers, babies and children to be investigated over an unlimited time period. The intention is to provide a naturally accumulating database to support investigations into associations between physical and mental health in mother and child.
The eLIXIR Partnership provides a mechanism through which research datasets can be linked to clinical records, under appropriate and approved levels of anonymity and data security. An added benefit is the potential to incorporate data from multiple sources, for example, health, environment, social and education. There are, however, important ethical and legal considerations, as well as technical security requirements, if linkages are to be performed between sources of routinely collected clinical data and exemption from individual consent to be permissible. 30 Another aspect of the eLIXIR Partnership is the eLIXIR Research Tissue Bank established to link the routinely collected maternal and neonatal clinical data with biological samples. This has an advantage over static cohort studies by providing a 'dynamic' collection of samples, enabling the identification of population trends and influences of new clinical interventions. The provision of samples from women attending antenatal care will provide a unique biobank to address mechanisms of common and rarer complications in pregnancy and in neonatal life, and their consequences for the longer-term health of the mother and child. Common complications will include gestational diabetes, mental illness, prematurity and preeclampsia. Similarly, by the provision of samples from neonatal intensive care, eLIXIR will contribute to a better understanding of neonatal morbidity and mortality.
With records of over 14 000 individual births per year, eLIXIR has the potential to become one of the largest mother-infant-child datasets in Europe. This manuscript details the technical and procedural elements in place to safeguard the legal and ethical rights of service users during the development and use of the eLIXIR database, and presents the demographic profile of the eLIXIR population. Both technical and procedural elements draw strongly on experience gained in setting up the Clinical Record Interactive Search (CRIS) data resource at the Maudsley National Institute for Health Research (NIHR) Biomedical Research Centre (BRC). [31][32][33]

Benefits of the system
Large data-linkage platforms, such as that created by the eLIXIR Partnership, provide a unique data warehouse through which important epidemiological questions can be asked, in the case of eLIXIR, within a large and diverse inner-city population. The ability to conduct these linkages allows not only the collection of a wide range of longitudinal health and social data, but also the capacity to support life course data analysis. The potential benefits arising from the use of clinical record 'big data' have been widely reported, and research databases such as eLIXIR are likely to increase in number due to the powerful and cost-effective nature of this research method. 34 35 eLIXIR is one of the first longitudinal research databases, from early pregnancy onwards, that uses routinely collected clinical data from maternity, neonatal and mental health services and does not rely on a recruited cohort of participants.

COHORT DESCRIPTION
Data sources
Maternity and neonatal data were obtained from Guy's and St Thomas' NHS Foundation Trust (GSTT) and King's College Hospital NHS Foundation Trust (KCH), and mental health data from South London and Maudsley NHS Foundation Trust (SLaM). GSTT provides a full range of hospital and community services for people in Lambeth, Southwark and Lewisham, as well as specialist care for patients from further afield including referrals for high-risk pregnancies and neonatal complications. Similarly, KCH serves the boroughs of Lambeth, Southwark and Lewisham, but also Bromley, with specialist services to patients across a wider catchment area, including referrals for obstetrics and fetal medicine. SLaM provides comprehensive mental health services to a geographic catchment of over 1.2 million residents in four south London boroughs (Croydon, Lambeth, Lewisham and Southwark), as well as some regional/national specialist mental health services.

Maternity, birth and neonatal intensive care data
The BadgerNet Platform (CleverMed) for routine clinical data is used extensively across the UK to create electronic patient records that capture early pregnancy community-based events and hospital-based events for low-risk and high-risk pathways of care (BadgerNet Maternity), and neonatal intensive care, neonatal transport, paediatric intensive care, neurology referrals and adult intensive care data (BadgerNet Neonatal). Within GSTT and KCH, the BadgerNet platforms are used for recording maternal/infant personal data, demographics, clinical history, clinic data (maternity only) and hospital episode data. The BadgerNet System stores clinical records on a Single Care Record system, which is nationally hosted. 36 Although feasible within the BadgerNet System, linkage between maternity and neonatal data is not routinely conducted.

Mental health data
Clinical records have been fully electronic across all SLaM NHS Trust mental health services since April 2006, using the bespoke electronic Patient Journey System (ePJS) that incorporated legacy data from earlier service-specific electronic health records. The CRIS platform 33 was developed in 2007-2008 and consists of a series of data-processing pipelines that both structure and de-identify ePJS fields, rendering pseudonymised data from the full clinical record available at the researcher interface, with search and database assembly functionality facilitated by a front end designed for non-technical use. The de-identifying process and its effectiveness, including the masking of identifying information in open-text fields and the generation of a pseudonymised identifier (CRIS ID), have been previously described. 32 The wider patient-led oversight and security models for CRIS have not changed significantly since it was established. [31][32][33] Ethical approval was obtained for CRIS as a pseudonymised database for secondary analysis (Oxford C Research Ethics Committee, reference 18/SC/0372). In terms of cohort coverage, all SLaM care (including diagnoses, medication and services provided) is represented on CRIS, including Improving Access to Psychological Therapies data (IAPT; a large primary care service providing short-term psychological therapies).
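As an illustration of what generating a pseudonymised identifier can involve, the following is a minimal sketch only. It is not the CRIS method, which is described in the cited references; the keyed-hash (HMAC-SHA256) approach, the function name `pseudonymise`, the keys and the example identifier are all assumptions introduced here for illustration.

```python
import hashlib
import hmac


def pseudonymise(identifier: str, secret_key: bytes, length: int = 16) -> str:
    """Return a stable pseudonym for `identifier` under `secret_key`.

    Hypothetical helper: a keyed hash, so the pseudonym cannot be
    recomputed by anyone who does not hold the key.
    """
    # Normalise so trivial formatting differences map to the same pseudonym.
    normalised = identifier.strip().upper().encode("utf-8")
    digest = hmac.new(secret_key, normalised, hashlib.sha256).hexdigest()
    return digest[:length]


# Hypothetical key and identifier, for illustration only.
key = b"example-key-held-by-data-processor"
pid_a = pseudonymise("A123456", key)
pid_b = pseudonymise(" a123456 ", key)  # same identifier, different formatting
pid_other_key = pseudonymise("A123456", b"different-project-key")

print(pid_a == pid_b)          # same person, same key: identical pseudonym
print(pid_a == pid_other_key)  # different key: pseudonyms cannot be joined
```

Under these assumptions, the same identifier always maps to the same pseudonym for a given key, while a different key yields an unrelated pseudonym, so extracts prepared under different keys cannot be joined on the pseudonym alone.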

Data-linkage hosting environment
Data for eLIXIR are managed and stored at the Clinical Data-Linkage Service (CDLS) at SLaM: an impartial trusted third-party service that provides researchers access to linked clinical data in accordance with the strict governance conditions and processes agreed with relevant data controllers. The CDLS is managed by a small, dedicated team of informaticians, IT and Information Governance (IG) professionals (https://www.maudsleybrc.nihr.ac.uk/facilities/clinical-record-interactive-search-cris/), and currently hosts a range of datasets already linked with the SLaM CRIS mental health case register (eg, HES, National Cancer Registry, ONS death certification, Lambeth DataNet primary care records and National Pupil Database). The backbone of the eLIXIR database consists of a 'master patient index' (figure 1) allowing data to be robustly linked within an appropriately secure environment according to data specifications.
The CDLS hosts both source data and the master patient index, on behalf of the eLIXIR Partnership, on a secure server within the NHS firewall with role-restricted access. The CDLS additionally provides a data extraction service that meets security requirements, creating bespoke datasets for approved research use. Such derived data are managed by the approved research team and are hosted at all times on a dedicated drive within the NHS for analysis in this domain using hosted software already available at SLaM.
Four distinct services are offered by the CDLS as the data processor for the eLIXIR data. First, CDLS provides advice on permissions, approvals and contracts. These include the consideration of academic, technical, legal and ethical requirements. Second, CDLS facilitates data linkages either within the CDLS safe haven or via a third party, coordinating the secure transfer of data. Third, the CDLS is responsible for the secure storage of linked data in accordance with predefined IG and security standards. Fourth, the CDLS, as the custodian for the linked data, prepares and extracts bespoke and prespecified databases for approved eLIXIR projects and provides these to researchers. Therefore, there is no direct access by researchers to the full linked data files, enhancing data protection and confidentiality.

Data-linkage procedures and resources
The eLIXIR Partnership uses common identifiers (eg, hospital number, NHS number, name and date of birth) to link maternity, neonatal and mental health clinical data. Linkage is undertaken by CDLS staff, not researchers. Matching is deterministic on a given set of identifiers: two records are said to match if all or some of the identifiers are identical, as defined by a hierarchical set of match ranks. This creates a single master patient index including patient anonyms (mother and child), binary variables for presence/absence on each data source and key demographics. The source datasets and master patient index are stored by the CDLS behind the NHS firewall. Linked data are extracted on a project-by-project basis by a CDLS informatician; each extract contains only the variables and samples necessary for the study, fully pseudonymised under a study-specific encrypted anonym. In this way, databases are not stored in a linked format. The clinical data from maternity and neonatal services are extracted from BadgerNet Systems at both GSTT and KCH by information and communication technology (ICT) staff at each site. These data are then sent securely to the staff at the SLaM CDLS and linked, using their common identifiers, with data from the CRIS system to incorporate mental health clinical data, where present, for each patient. This results in a comprehensive data resource to which researchers can apply for extracted data across maternity, neonatal and/or mental health services. Figure 1 details the dataflow for the eLIXIR Partnership database. Match quality is assured as 100% of infants born within the eLIXIR dataset were matched with their mothers' records within the BadgerNet System.
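The deterministic, rank-based matching described above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the CDLS implementation: the three match ranks, the field names, the `Record` class, the anonym format and the example records are hypothetical, since the actual match-rank definitions are not given here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:
    nhs_number: Optional[str]
    hospital_number: Optional[str]
    surname: str
    date_of_birth: str  # ISO date string, e.g. "1990-01-01"


# Hypothetical match ranks, strictest first (rank 1 is the strongest match).
MATCH_RANKS = [
    (1, ("nhs_number", "date_of_birth")),
    (2, ("hospital_number", "date_of_birth")),
    (3, ("surname", "date_of_birth")),
]


def _norm(value: Optional[str]) -> Optional[str]:
    """Normalise case and whitespace; missing or empty values become None."""
    return value.strip().upper() if value else None


def match_rank(a: Record, b: Record) -> Optional[int]:
    """Return the strictest rank at which a and b agree, or None if no match."""
    for rank, fields in MATCH_RANKS:
        values_a = [_norm(getattr(a, f)) for f in fields]
        values_b = [_norm(getattr(b, f)) for f in fields]
        # Every identifier in the rank must be present and identical.
        if all(values_a) and values_a == values_b:
            return rank
    return None


def build_index(maternity: list, mental_health: list) -> list:
    """Build a toy master patient index with presence flags per source."""
    index = []
    for i, m in enumerate(maternity):
        ranks = [r for r in (match_rank(m, mh) for mh in mental_health)
                 if r is not None]
        index.append({
            "anonym": f"M{i:05d}",  # hypothetical anonym format
            "in_maternity": 1,
            "in_mental_health": int(bool(ranks)),
            "match_rank": min(ranks) if ranks else None,
        })
    return index


# Dummy records for illustration only.
maternity = [
    Record("9990001111", "H1", "Smith", "1990-01-01"),
    Record(None, "H2", "Jones", "1985-05-05"),
    Record("9990003333", "H3", "Brown", "1992-02-02"),
]
mental_health = [
    Record("9990001111", None, "SMITH", "1990-01-01"),
    Record(None, None, "jones", "1985-05-05"),
]

index = build_index(maternity, mental_health)
for row in index:
    print(row)
```

In this toy example the first woman matches at rank 1 (NHS number and date of birth), the second only at rank 3 (surname and date of birth, because her NHS number is missing), and the third has no mental health record. Recording the rank alongside the presence flags keeps the strength of each match auditable.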

IG framework
Results from linkages, current and prior, are stored within the CDLS safe haven, and the CDLS plays a key role in wider governance, supplementing the role of eLIXIR-specific oversight and data security, including the secure handling and storage of identifier fields required for data linkage. Section 251 (s251) of the NHS Act 2006 allows the common law duty of confidentiality to be set aside in specific circumstances where anonymised information is not sufficient and where 'opt-in' patient consent is not practical. Opt-out information and details of the project are given to each patient entering maternity and neonatal services and patients have the option of opting out of the programme at any time. Approval under this legal framework was granted by the Health Research Authority (HRA) Confidentiality Advisory Group (CAG) to the eLIXIR team for all the above linkages, which allows data to be available in an identifiable format to a small number of data-processing staff in accordance with data sharing contracts between the data provider institutions (HRA CAG Ref: 18/CAG/0040). Therefore, for current and future data linkages within eLIXIR, ethical (REC) and s251 approval is required either through amendments to our existing agreements or new applications to these regulatory bodies. Activity for projects using linked datasets held by eLIXIR is audited by the eLIXIR Oversight Committee, helping to ensure that the researcher's project requirements (eg, clinical research, surveillance, service improvement or audit) are met and that projects progress within the agreed policy and practice framework. The primary role of the Committee is to provide the operational management of eLIXIR as identified in the eLIXIR Security Model and protocol for the eLIXIR Research Tissue Bank. In so doing, the Committee seeks to promote the scientific and ethical principles that should govern the use of eLIXIR data, and to represent stakeholders and reflect their views and interests. Stakeholders include the KHP Caldicott Guardians (individuals responsible for research governance at each NHS Trust site), service users, clinical professionals, lay persons and academics.

Public and patient involvement
Public and patient involvement (PPI) was incorporated throughout the development of the eLIXIR Partnership. The concept of the eLIXIR Partnership was presented to a variety of PPI groups, including the Maudsley BRC Data Linkage Service User and Carer Advisory Group, 37 Lambeth HealthWatch and the Young Persons Advisory Group at Great Ormond Street Hospital. PPI is ensured in the decision-making process of approving eLIXIR projects through lay member representation on the eLIXIR Oversight Committee. The eLIXIR Oversight Committee reviews and approves all projects using eLIXIR data.

Research Tissue Bank
The eLIXIR Research Tissue Bank is a prospective biobank of samples from pregnant women and infants being treated by GSTT. The Research Tissue Bank is integrated into the KCL Human Tissue Act (HTA) governance structure with active recruitment and collection of samples. All pregnant women aged over 16 years who are willing and able to give informed consent and infants admitted to the neonatal unit that have parental consent are eligible for inclusion in the eLIXIR Research Tissue Bank (Cambridge East Research Ethics Committee, reference 18/EE/0120).

Maternal blood sample collection
Eligible women are recruited at the time of routine antenatal care venepuncture (11-15 weeks' gestation or later transfer of care or later antenatal care attendance) at GSTT. Women who agree to participate give written informed consent and an extra blood sample is collected at the same time as routine venepuncture, maximum volume 12 mL (2×6 mL tubes).

Infant blood sample collection
It is our intention also to recruit samples from infants admitted to the neonatal intensive care unit (NICU), when blood is drawn for routine tests. Following written informed parental consent, residual blood from routine samples, which otherwise would be discarded, will be retained and collected. The samples will be processed and stored in a similar manner to the maternal blood samples.
Following birth in the community setting (home visits), every infant, whose mother has provided consent, is offered new-born blood spot screening to exclude metabolic disorders. A health professional pricks the baby's heel to collect four drops of blood on a card. For the biobank, an extra card to those taken for clinical purposes is used to collect extra bloodspots from the infant, after the routine spots are collected using the same heel prick. This sample is posted back to the eLIXIR team and stored in the research laboratory prior to transfer to a central storage facility.
All samples are transferred for processing in the research laboratory. After processing, all tubes are labelled with a study-specific barcode and entered on to a study-specific database (FreezerPro). The samples are stored in −80°C freezers for short term, before being transported to a central storage facility (NIHR BioResource, Milton Keynes).

FINDINGS TO DATE
Maternal, birth and birth outcomes
From the first data extraction (1 October 2018-30 June 2019), 10 207 women accessed antenatal care through GSTT or KCH maternity services, with 8405 deliveries (8772 infants). This diverse, inner-city antenatal population was born in over 170 countries with a heterogeneous distribution across ethnic groups (table 1). Most were born outside the UK but most reported English as their primary language. Women were booked on average at 11.6 weeks' gestation, which is slightly later than the national guideline of booking before 10 weeks. 38 The most common physical conditions experienced were gynaecological problems (14%), asthma (8%) and pre-existing diabetes (6%). In addition, around one in five reported mental health problems, 3% were recorded as being exposed to female genital mutilation and 4% reported being current smokers at the time of their antenatal appointment (table 1).
With regard to birth episodes, twice as many women gave birth at GSTT as at KCH, with a mean gestational age at delivery of 38.8 weeks, and with 8% born prematurely (<37 completed weeks of gestation) and 3% born very prematurely (<34 weeks' gestation). Of the 8405 births, 8051 were singletons, 341 were twin births and 13 were triplets. Around half of deliveries were spontaneous cephalic, around one in five were emergency (or unspecified) caesarean section, and 15% were elective caesarean section. Rates of stillbirth and neonatal deaths were 0.6% and 0.4%, respectively. The mean birth weight was 3257 g with 17% small for gestational age and 7% large for gestational age (table 2).
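The reported counts by multiplicity can be cross-checked against the totals of 8405 deliveries and 8772 infants. The short check below is a sketch that assumes the 341 twin and 13 triplet figures count deliveries rather than infants; the variable names are introduced here for illustration.

```python
# Deliveries by multiplicity, as reported in the text.
singleton_births = 8051
twin_births = 341
triplet_births = 13

# Each delivery counts once towards deliveries...
deliveries = singleton_births + twin_births + triplet_births
# ...but contributes one, two or three infants.
infants = singleton_births * 1 + twin_births * 2 + triplet_births * 3

print(deliveries)  # 8405
print(infants)     # 8772
```

Both totals agree with the reported figures, which supports reading the twin and triplet numbers as counts of deliveries.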

NICU admissions
Of the 947 infants who had been treated in NICUs across both GSTT and KCH, 19.1% were postnatal transfers from an external trust and 8.7% were in utero transfers. The main reason for admission was respiratory disease (28%), followed by preterm birth (22%). The average length of time spent in the NICU was 15.8 days. Following admission, 46% of infants were readmitted to a postnatal ward, 29% were discharged home and neonatal death occurred in 4%, with the remaining outcomes unknown from the data available (table 3).

Mental health
Of the 10 207 women attending antenatal care as registered in the eLIXIR database, 1184 had a clinical record in secondary mental health services (SLaM) with 307 women actively being treated at the time of their pregnancy (201 under the care of IAPT) (table 4).
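These counts correspond to the percentages quoted in the abstract (11.6% and 3.0%). The sketch below reproduces them; the IAPT share among women in active treatment is derived here from the reported counts and is not a figure stated in the text, and the variable names are illustrative.

```python
antenatal_women = 10207
with_slam_record = 1184        # clinical record in secondary mental health services
treated_in_pregnancy = 307     # actively treated at the time of pregnancy
treated_under_iapt = 201       # of those treated, under the care of IAPT

pct_with_record = round(100 * with_slam_record / antenatal_women, 1)
pct_treated = round(100 * treated_in_pregnancy / antenatal_women, 1)
# Derived value, not reported in the source text.
pct_iapt_of_treated = round(100 * treated_under_iapt / treated_in_pregnancy, 1)

print(pct_with_record)      # 11.6
print(pct_treated)          # 3.0
print(pct_iapt_of_treated)  # 65.5
```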

Research Tissue Bank
Following all necessary governance agreements, samples were collected over a period of 3 months. A total of 1271 aliquoted samples (including EDTA, serum and whole blood) from 123 women were stored in the FreezerPro system. In this period, 63.4% of women approached gave consent to take part.

Strengths and limitations
The eLIXIR Partnership has developed a unique population-based database incorporating clinical data from maternity, neonatal and mental health records. This is supplemented by the eLIXIR Research Tissue Bank. Together this resource will provide the basis for additional linkages to enable life course studies of physical and mental health in a large, diverse, inner-city, UK population. This will have not only implications for local healthcare improvement but also the potential to provide evidence to influence healthcare in similar national/global settings.
A limitation common to all routine/administrative clinical databases is source data missingness and inaccuracy; prospective research cohorts, in contrast, have an advantage in this respect. Nonetheless, eLIXIR provides an opportunity for continuous feedback to the clinical provider on data absence or error. Through regular meetings, we already apprise the Trust IT teams of, for example, duplicate patient data entry and missing data, especially data required for the national Maternity Services Data Set. Thus, eLIXIR and other similar research datasets can contribute directly to improved clinical reporting, and hence to better clinical care.
Another potential limitation is the loss of data from women who move outside the catchment area within the index pregnancy and thereafter, although it is our intention to supplement the linkages with data from national HES to provide information on hospitalised outcomes. As with all administrative clinical datasets, research will be limited to that which can be conducted using information routinely collected in clinical care. The intended incorporation of research datasets offsets this to some extent. Additionally, clinical data entry may involve substantial human error and data may be absent from the clinical records (ie, missing). The representativeness of the cohort to the UK population is limited to mixed, inner-urban catchments. In addition, all three NHS Trusts involved in the eLIXIR Partnership incorporate specialisms or local expertise attracting national-level referrals. As a result, data may be skewed to patients with more severe or complex health issues.
There are several advantages, but also disadvantages, to using pseudonymised electronic cohorts versus more traditional consented cohorts (eg, Avon Longitudinal Study of Parents and Children or Born in Bradford (BiB)). The greatest advantage lies in the contemporary reporting of a population compared with historical cohorts, and we are aware that BiB has embarked on a [...] the advantage of much greater depth of biological and psychological information derived from procedures and validated questionnaires.

Plans for the future
As the eLIXIR database develops, expansions will incorporate health and social care data. The next phase of linkage will comprise a local primary care data resource, Lambeth DataNet (https://selondonccg.nhs.uk/in-your-area/lambeth/our-local-plans/), for which all necessary approvals are in place. Beyond-catchment hospitalisation data could be usefully captured by linkage to national data sources, for example, HES, as mentioned, which has been incorporated in the CRIS platform. 33 Subject to approval, later linkages will incorporate prescribing and education data, in addition to a broader range of local healthcare information as eLIXIR infants enter age ranges covered by other specialties. Finally, the replication of this data-linkage model in other geographical settings in the UK would offer the potential to develop a national data network allowing both larger research cohorts and cross-site replication.

COLLABORATION
We have established a research database of maternity, neonatal and mental health clinical data not only combining maternal physical and mental health clinical data during pregnancy and later neonatal health, but also providing added value through the potential for the addition of biological measurements (ie, omics data) from the eLIXIR Tissue Bank samples. The eLIXIR Programme has the capacity to continue to grow and develop exponentially, through internal and external collaborations, with small levels of attrition to follow-up and the ability to be used for both common and rare research outcomes. Furthermore, unlike comparable research programmes, the population we sample from is diverse on both ethnicity and sociodemographic levels providing richness of data, which has the potential to lead to exciting research findings.
Disclaimer The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, the MRC or the Department of Health.
Competing interests RS declares research support received in the last 5 years from Roche, Janssen, GSK and Takeda.
Patient and public involvement Patients and/or the public were involved in the design, or conduct, or reporting, or dissemination plans of this research.
Patient consent for publication Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data are available upon reasonable request. Researchers can apply for data access and biomaterial by submitting a research application form to the eLIXIR team. The eLIXIR website provides information on the application process (http://www.guysandstthomasbrc.nihr.ac.uk/microsites/elixir). To apply to use data from eLIXIR, researchers must complete a Research Application Form (RAF), available on our website, and submit this, via email, to the eLIXIR Oversight Committee for their consideration and approval. The associated costs with accessing data are study dependent. Basic infrastructure for data storage and CDLS services is provided by the core team. Individual project costs are determined by the length of study and which datasets are required. Costs to the researcher include data access (via VPN), data cleaning and statistical support. The eLIXIR Partnership provides the infrastructure for data linkage, but external funding will be sought for additional linkage to external datasets.
Open access This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.