Purpose In the Netherlands, a great variety of objectively measured geo-data is available, but these data are scattered and measured at varying spatial and temporal scales. The centralisation of these geo-data and the linkage of these data to individual-level data from longitudinal cohort studies enable large-scale epidemiological research on the impact of the environment on public health in the Netherlands. In the Geoscience and Health Cohort Consortium (GECCO), six large-scale and ongoing cohort studies have been enriched with a variety of existing geo-data. Here, we introduce GECCO by describing: (1) the phenotypes of the involved cohort studies, (2) the collected geo-data and their sources, (3) the methodology that was used to link the collected geo-data to individual cohort studies, (4) the similarity of commonly used geo-data between our consortium and the nationwide situation in the Netherlands and (5) the distribution of geo-data within our consortium.
Participants GECCO includes participants from six prospective cohort studies (eg, 44 657 respondents (18–100 years) in 2006) and it covers all municipalities in the Netherlands. Using postal code information of the participants, geo-data on the address-level, postal code-level as well as neighbourhood-level could be linked to individual-level cohort data.
Findings to date The geo-data could be successfully linked to almost all respondents of all cohort studies, with successful data-linkage rates ranging from 97.1% to 100.0% between cohort studies. The results show variability in geo-data within and across cohorts. GECCO increases power of analyses, provides opportunities for cross-checking and replication, ensures sufficient geographical variation in environmental determinants and allows for nuanced analyses on specific subgroups.
Future plans GECCO offers unique opportunities for (longitudinal) studies on the complex relationships between the environment and health outcomes. For example, GECCO will be used for further research on environmental determinants of physical/psychosocial functioning and lifestyle behaviours.
- environmental data
- health geography
- geographic information systems
- longitudinal cohort studies
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Strengths and limitations of this study
The main strengths of the Geoscience and Health Cohort Consortium (GECCO) are the centralisation of a variety of objectively measured geo-data on the address-level, postal code-level and neighbourhood-level and the linkage of these environmental-level data to individual-level data from six longitudinal cohort studies in the Netherlands.
The large number of respondents in GECCO are spread out over all municipalities in the Netherlands and, consequently, the variation in environmental exposures is large.
The geo-data could be successfully linked to almost all respondents of each participating cohort study, with successful data-linkage rates ranging from 97.1% to 100.0%.
Although the collaboration between the cohort studies in GECCO increases power of analyses and enables nuanced analyses on specific subgroups, procedures are required to harmonise variables between cohort studies.
The exposome encompasses the life-course environmental exposures from the prenatal period onwards and receives growing attention in medical research with respect to its relationship with health behaviours and health outcomes.1–3 Multidisciplinary and longitudinal research combining individual-level data with environmental-level data is urgently needed to identify and better understand the environmental determinants of behaviours and health and to optimally inform policymakers. In the Netherlands, a great variety of objectively measured geo-data is available (eg, air pollution, traffic noise and area demographics), but these data are currently scattered and measured at varying spatial and temporal scales. The centralisation of these geo-data and the linkage of these data to individual-level data from longitudinal cohort studies would stimulate large-scale epidemiological research on the impact of the environment on public health in the Netherlands.
In the Geoscience and Health Cohort Consortium (GECCO), a variety of existing geo-data has been brought together and databases of six large-scale, ongoing cohort studies in the Netherlands have been enriched with these geo-data. GECCO stimulates large-scale studies on environmental determinants of health and well-being in the Netherlands. In particular, the collaboration between cohort studies within this consortium increases power of analyses, provides opportunities for cross-checking and replication, ensures sufficient geographical variation in environmental determinants and allows for analyses on specific subgroups.
In this cohort profile, we introduce GECCO and provide an overview of the collected geo-data and their sources. Furthermore, we describe the methodology that was used to link the collected geo-data to the individual cohort studies. We examine the similarity of commonly used geo-data (eg, air pollution, traffic noise and area demographics) between our consortium and the nationwide situation in the Netherlands. We investigate the variability in geo-data within GECCO by examining the distribution of geo-data within our consortium and by assessing differences in geo-data between urban and rural areas of residence.
GECCO involves six cohort studies, affiliated with Vrije Universiteit Amsterdam and/or VU University Medical Center in Amsterdam, the Netherlands. Here we focus on the linkage of geo-data to the cohort studies in 2006, because most cohort studies (five out of six) had a measurement around this year. The geo-data could be successfully linked to almost all respondents of each cohort study. The successful data-linkage rates ranged from 97.1% to 100.0% between cohort studies. Missing geo-data or postal code information (eg, as a result of living abroad) were the main reasons for unsuccessful data-linkage. The six cohort studies are described below.
The Generations2-study (http://www.generaties2.nl) is a longitudinal cohort study following first-time pregnant women during the transition to parenthood. The Generations2-study was started in 2009 and the inclusion of pregnant women ended in 2015. By following women from the first trimester of pregnancy until 6 years after birth, the study aims to obtain more insight in the adaptation of mothers to parenthood and the development of the parent-child relationships.4 Participants were recruited via midwifery practices in the Amsterdam area and via a website. Data were collected by using a variety of questionnaires, interviews and observations. Women were eligible to participate in the Generations2-study if they were pregnant of their first child and if their Dutch or English proficiency was good enough to complete the questionnaires. In total, 2000 women took part in the Generations2-study. In 2009, data were collected for 208 participants in this cohort study. Geo-data could be linked to 202 (97.1%) participants in 2009. For the Generations2-study, we show data for the year 2009 in this cohort profile.
The Longitudinal Aging Study Amsterdam (LASA; http://www.lasa-vu.nl) is a longitudinal, multidisciplinary cohort study that aims to study the determinants, trajectories and consequences of physical, cognitive, emotional and social functioning in relation to ageing.5 6 The study is based on a nationally representative sample of older adults aged 55–84 years at baseline, living in three different geographical areas in the Netherlands, covering protestant, roman-catholic and secularised areas of the country as well as urbanised and rural areas. The baseline measurement included 3107 respondents. The baseline data collection was conducted in 1992/1993, with 3 yearly follow-up waves. Additional respondents from later birth cohorts were recruited from the same sampling frame in 2002/2003 and 2012/2013. Data were collected by trained interviewers, who visited respondents at home. The main data collection was done by means of face-to-face, computer-assisted interviews. Additionally, respondents were asked to fill out a written questionnaire and to participate in a medical interview, entailing a separate visit to administer clinical measurements and ask additional questions. In 2005/2006, data were collected for 2165 participants in this cohort study. Geo-data could be linked to 2150 (99.3%) participants in 2005/2006, of which 2123 persons lived at individual addresses.
The Netherlands Longitudinal Study on Hearing (NL-SH; http://www.hooronderzoek.nl) is an ongoing prospective cohort study and focuses on the relationships between hearing impairment and several aspects of life of adults, including psychosocial functioning, work and use of healthcare. The NL-SH comprises a convenience cohort of Dutch adults between 18 and 70 years with and without hearing loss at the start of the study. The aim of the NL-SH is to compare groups of participants with and without hearing impairment on a range of aspects. This longitudinal cohort study commenced in 2006. The first follow-up wave took place 5 years after the initial measurement. A 10-year follow-up wave is currently underway. The NL-SH is a web-based study, with the entire data collection processed via the internet. The NL-SH website is used to recruit participants and to collect data.7–9 Before enrolment, participants have to perform a Dutch online speech-in-noise test and subscribe themselves to the study by completing an online registration form. Hereafter, they receive the NL-SH questionnaire. In 2006, data were collected for 1015 participants in this cohort study. Geo-data could be linked to 1012 (99.7%) participants in 2006, of which 995 persons lived at individual addresses.
The Netherlands Study of Depression and Anxiety (NESDA; http://www.nesda.nl) is a national ongoing, longitudinal study designed to investigate the long-term course and consequences of depressive and anxiety disorders and to integrate biological and psychosocial research paradigms within an epidemiological approach in order to examine (interaction between) predictors of the long-term course and consequences.10 Briefly, six assessment waves have been completed between 2004–2006 (baseline) and 2014–2016 (9 year follow-up). The study started with 2981 participants aged 18–65 years, including healthy controls and subjects with a past or current depressive and/or anxiety disorder. To represent various settings and stages of psychopathology, participants were recruited in the general population, in general practices and in mental health organisations. In NESDA, geo-data could be linked to 2974 (99.8%) participants in 2006.
The Netherlands Twin Register (NTR; http://www.tweelingenregister.org) is an ongoing cohort study and was established around 1986. The NTR examines the contribution of genes and environment to development, lifestyle, health and personality.11 The cohort study was established by recruiting young twins and multiples at birth and by approaching adolescent and young adult twins through city councils. The Young Netherlands Twin Register (YNTR) collects data from twins and multiples from birth onwards, by parent and teacher report and as of age of 14 by self-report.12 The Adult Netherlands Twin Register (ANTR) started data collection in adolescent and young adult twins and their parents and was extended to include older twins as well as siblings, spouses and adult children of twin participants.13 Survey data are collected every 2–3 years, since 1986 and 1991 for the YNTR and ANTR, respectively. Selected groups of participants are invited for specific research projects and DNA collection. In addition, biological samples and data for adult participants were obtained in a large-scale biobank effort.14 Approximately 98 000 twins and multiples are registered with the NTR. In total, over 200 000 individuals (twins, multiples, parents, siblings, spouses, etc) take part. To ensure an unrelated sample in the current project, one NTR-participant within a family was selected who was 18 years or older in 2006 and for whom a Dutch residence in 2006 was known. In total, 35 574 unrelated participants were selected. Geo-data could be linked to 35 556 (99.9%) individuals in 2006.
The New Hoorn Study (NHS; http://www.emgo.nl/research/lifestyle-overweight-and-diabetes/research-projects) is an ongoing population-based study focusing on the prevalence of impaired glucose regulation and determinants of type 2 diabetes mellitus in the city of Hoorn, the Netherlands.15 The baseline measurement took place in 2006/2007 and included 2807 participants, aged 40–65 years. Participants were invited to visit the Diabetes Research Center in Hoorn and data were collected using questionnaires and physical examinations. A follow-up measurement has been conducted in 2014/2015. In the NHS, geo-data could be linked to all 2807 (100.0%) participants in 2006/2007.
Patient and public involvement
Patients and public were not involved in the design of this study.
The richness of the GECCO database is based on the cohort data and the geo-data that have been linked to these cohorts. The phenotypes of the individual cohort studies within GECCO are summarised in table 1. In most of these cohort studies, data on demographics, socioeconomic status (SES), lifestyle factors, healthcare use, biomaterial measurements and various domains of functioning have been repeatedly measured over time (table 1).
A variety of existing geo-data on address-level, 6-digit postal code-level, 4-digit postal code-level as well as neighbourhood-level from different sources were collected. In the Netherlands, 6-digit postal code areas (average area size: 0.0025 km2), 4-digit postal code areas (average area size: 8.3 km2) and neighbourhoods (average area size: 3.1 km2) are geographically delineated areas within municipalities and include, on average, approximately 15, 1870 and 630 households, respectively.16–21 Data were available for several years, depending on the source database used. For the data-linkage between geo-information and participants of each specific cohort study, 4-digit postal codes or, if possible, 6-digit postal codes were used as identifier. All collected geo-data within GECCO can thus be linked to the postal codes of respondents. In this cohort profile, we highlight geo-data that are commonly used in exposome research (table 2). A complete list of collected geo-data is available online (http://www.emgo.nl/research/international-collaborations/longitudinal-cohort-studies/emgo-cohort-booster-project).
Population and households
Statistics Netherlands provided data regarding population and households in 4-digit postal code areas in the Netherlands (table 2).19 22–37 These data describe the population in terms of sex and age. Furthermore, the data include the proportions of Western and non-Western immigrants and also include household characteristics (eg, average household size). These data are described in more detail elsewhere.19 22–37
The Netherlands Institute of Social Research provided SES scores for each 4-digit postal code area in the Netherlands for specific years (table 2).38 39 The SES score has been modelled using several sources of nationwide survey data on the residents’ educational level, income and position in the labour market. The SES score is based on mean income, percentage of low incomes, percentage of low educated residents and percentage of unemployed residents as determined by a principal component analysis. A higher score represents a better SES.38 More information on the SES-score is available elsewhere.38 39
In the European Study of Cohorts for Air Pollution Effects (ESCAPE-project), the Institute for Risk Assessment Sciences of the Utrecht University has measured the spatial variation of residential concentrations of air pollutants in 2009 in the Netherlands (table 2).40–43 Land Use Regression (LUR) models were developed to predict air pollution concentrations at the address-level.40 42 Residential exposure to air pollutants was assessed as annual average concentrations of particulate matter with diameters≤2.5 µm (PM2.5) and 2.5–10.0 µm (PMcoarse). Furthermore, air pollution was assessed as annual average concentrations of nitrogen dioxide (NO2), nitrogen oxides (NOx) and the reflectance of PM2.5 filters, which is a proxy for elemental carbon (soot). Detailed information on the assessment of exposure to air pollution is described elsewhere.40–43 Although the data on air pollution were obtained in 2009, several studies have documented that the LUR models represent the spatial contrast in long-term average air pollution concentrations over periods of 10 years or more.44–46 In GECCO, the address-level concentrations of air pollutants were aggregated to mean values of 6-digit postal code areas. This aggregation of data facilitated the linkage of these data to individual-level data of the various cohort studies.
Road-traffic, rail-traffic and air-traffic noise
The Netherlands Environmental Assessment Agency modelled daily average noise levels of road-traffic, rail-traffic and air-traffic for specific years in the Netherlands by using the Empara Noisetool with a resolution of 25 by 25 m (table 2).47 Noise is measured in Level day-evening-night (Lden) and is expressed in A-weighted decibels (dB (A)). The measure Lden accounts for the fact that noise in the evening and the night are more annoying than during the day. The average noise levels during the day (7–19 hours), the evening (19–23 hours) and the night (23–7 hours) were calculated first and the levels of noise in the evening and the night are increased with 5 and 10 dB (A), respectively. Subsequently, the daily mean noise was calculated by dividing the noise levels during day, evening and night by 3. More details on the assessment of traffic noise raster data are available elsewhere.47
The noise level of a particular raster cell was linked to the point locations of all addresses that fall within that specific raster cell. The point locations of all addresses in the Netherlands were obtained from the Register of Addresses and Buildings (BAG-register, June 2015) of the Netherlands’ Cadastre, Land Registry and Mapping Agency, and the linkage was performed by using GeoDMS software (Object Vision BV, Amsterdam, the Netherlands). In GECCO, the address-level traffic noise data were aggregated to mean values of 6-digit postal code areas. This aggregation of data facilitated the linkage of these data to individual-level data of the various cohort studies.
The Netherlands Ministry of the Interior and Kingdom Relations has measured liveability, that is the extent to which the living environment is in line with the conditions and needs of residents, in each 4-digit postal code area in the Netherlands using the liveability score(table 2).48–50 The liveability score is based on six dimension scores, which are derived from 49 indicators. The dimensions are: (1) population, (2) social cohesion, (3) public space, (4) safety, (5) level of resources and (6) housing stock. The liveability score ranges from 1 (extremely negative) to 7 (extremely positive).48 49
Statistics Netherlands shares rich data on neighbourhood environments for specific years in the Netherlands (table 2).20 21 51–63 These data are related to several domains, including: (1) population demographics (eg, sex, age, marital status and ethnicity), (2) housing stock (eg, average home value), (3) energy consumption (eg, average electricity/gas consumption), (4) income (eg, proportions of residents/households with a low/high income), (5) social security (eg, proportions of social security beneficiaries), (6) companies (eg, number of industrial companies), (7) motor vehicles (eg, average number of cars per household), (8) area (eg, total area size of land and/or water) and (9) proximity and density of specific resources in the neighbourhood (eg, general practices, supermarkets, educational facilities, public transport facilities and cultural facilities). These data on the neighbourhood environment are extensively described elsewhere.20 21 51–63
Geographic Information Systems (GIS) software, ArcGIS V.10.1 (ESRI, Redlands, California, USA), was used to allocate the centroid of 6-digit postal code areas to a neighbourhood, as delineated by Statistics Netherlands and the Netherlands’ Cadastre, Land Registry and Mapping Agency. The GIS technique of spatial joining was used to link the neighbourhood data of Statistics Netherlands to 6-digit postal codes.
Urbanisation grade is the mean number of addresses per square kilometre within a circle with a radius of one kilometre.64 Data on the level of urbanisation of neighbourhoods are provided by Statistics Netherlands and are defined in five categories.20 21 51–63 These five categories are: (1) extremely urbanised (≥2500 addresses/km2), (2) strongly urbanised (1500–2500 addresses/km2), (3) moderately urbanised (1000–1500 addresses/km2), (4) hardly urbanised (500–1000 addresses/km2) and (5) not urbanised (<500 addresses/km2). For the purpose of this paper, urbanisation grade was dichotomised. The first three categories are defined as ‘urban area’ and the latter two categories are defined as ‘rural area’.
Findings to date
Using the postal code information for our respondents in 2006, we linked geo-data to 44 657 individuals (including 202 Generations2-participants in 2009). The mean age of all GECCO-participants was 43.8 (SD=12.0) years with an age range of 18–100 years. The study sample included 26 528 (59.4%) women and 10 048 (22.5%) participants had a high educational level (table 3).
GECCO includes respondents from national cohort studies which recruit participants from the entire country as well as respondents from regionally oriented cohort studies. To illustrate, the GECCO-participants of the NHS lived in 3 (0.7%) of the 458 Dutch municipalities in 2006, whereas GECCO-participants of the NTR covered all municipalities (table 4).
Table 4 presents the distribution of the geo-data in each individual cohort study, GECCO and the Netherlands. In general, there is variability in geo-data within and across GECCO-cohorts. The environmental exposure measures of participants in GECCO generally appear to be similar to those of residents in the entire country. For example, if we compare the point estimate of road-traffic, rail-traffic and air-traffic noise of GECCO-participants with that of all inhabitants in the Netherlands, the absolute difference is only 0.4 dB (A) (table 4).
Data on urbanisation grade were available for 43 678 participants (97.8%). The majority of these participants (58.8%) lived in urban neighbourhoods (table 4). As expected, the urban and rural neighbourhoods, in which the GECCO-participants were living, differed significantly in terms of population and household characteristics, SES, air pollution, road-traffic, rail-traffic and air-traffic noise, liveability and a range of neighbourhood characteristics (table 5). For example, the levels of road-traffic, rail-traffic and air-traffic noise and air pollution were higher in urban neighbourhoods than in rural neighbourhoods (table 5).
Recent findings in GECCO
So far, one study has been conducted in GECCO.65 Generaal et al 65 conducted a cross-sectional study, including data from 2980 NESDA-participants, to examine whether urbanisation grade and objectively obtained socioeconomic (ie, SES, home value, number of social security beneficiaries per 1000 households and proportion of immigrants), physical (ie, air pollution, traffic noise and availability of green space and water) and social aspects (ie, social cohesion and safety) of the neighbourhood in which persons live are associated with the presence and severity of depressive and anxiety disorders. The findings showed that not urbanisation grade, but rather neighbourhood socioeconomic factors (ie, low SES, more immigrants and more social security benefits), physical factors (ie, high levels of air pollution and high levels of traffic noise) and social factors (ie, lower social cohesion and less safety) were associated with the presence of depressive and anxiety disorders. Furthermore, it was found that most of these neighbourhood characteristics were also associated with increased depressive and anxiety symptoms severity.65 The study by Generaal et al 65 shows that several environmental characteristics are related to mental health of residents and suggest that policymakers should target a wide range of environmental aspects in order to promote mental health.
Currently, a nation-wide meta-analysis is conducted to examine the associations of neighbourhood characteristics with the prevalence and severity of depression, using data from 32 487 individuals from eight Dutch cohort studies (including all six GECCO cohort studies and two additional cohort studies). Furthermore, analyses are conducted in GECCO to examine the associations between the food environment and health outcomes, including obesity and type 2 diabetes mellitus. Currently, efforts are made by the GECCO research team to expand the consortium with other Dutch cohort studies.
Strengths and limitations
Following the recognition that lifestyle behaviours and health outcomes are related to our environment, there is a need for resources and improved tools to quantify environmental contributions to health and disease. Currently, the knowledge of individual-level physiology and disease manifestation as well as the tools to study these relationships are far ahead of those for the more upstream environmental characteristics that may influence behaviours and health. The main strengths of GECCO are the centralisation of a variety of objectively measured geo-data on the address-level, postal code-level and neighbourhood-level and the linkage of these environmental-level data to individual-level data from six longitudinal cohort studies in the Netherlands. GECCO facilitates and enables researchers from various disciplines to address research questions on the relationships between the environment and health outcomes. The collaboration between the cohort studies in GECCO increases the power of analyses and ensures sufficient geographical variation in environmental determinants. Although the collaboration between the cohort studies in GECCO increases power of analyses and enables nuanced analyses on specific subgroups, procedures are required to harmonise variables between cohort studies.
Some geo-data (eg, traffic noise) have been suggested to vary more over time than other geo-data (eg, air pollution).44–46 66 A strength of GECCO is that a variety of geo-data have been collected for different years. This makes it possible to link most geo-data to the exact assessment period of the cohort studies. For this cohort profile, geo-data were linked over periods as closely matched to the assessment period of the various cohort studies, resulting in a temporal mismatch of a maximum of 5 years for some of the participants. This particular mismatch is related to the linkage of 2009-data on air pollution to data from NESDA in 2004 and can still be considered as an accurate match.44–46
For this cohort profile, the linkage of geo-data with individual-level cohort data was done locally without confidential information (eg, residential addresses) leaving the research premises, so that the privacy of respondents is safeguarded at all times. Although some geo-data were collected on the address-level, it should be acknowledged that most collected geo-data in GECCO are related to administrative residential areas and are not related to specific contexts (eg, work environment or exact geographic life environment) that might also impact health and well-being of individuals.67
A consortium with such a high number of individuals using the same geo-data offers unique opportunities for (longitudinal) studies on the complex relationships between the environment and health outcomes. For example, future studies could focus on the associations of environmental exposure measures with physical functioning, psychosocial functioning, biomarkers and lifestyle behaviours.
We would like to thank all respondents who participated in the six cohort studies that are involved in GECCO. We gratefully acknowledge all organisations that collaborate with the six cohort studies in GECCO.
We also would like to thank the Institute for Risk Assessment Sciences of the Utrecht University, the coordinating center of the European Study of Cohorts for Air Pollution Effects (ESCAPE), for providing data on air pollution. Furthermore, we would like to thank the Netherlands Environmental Assessment Agency for providing data on road-traffic, rail-traffic and air-traffic noise.
Contributors EJT, JL, JWJB, DIB, SEK, MO, GW, MS, GN, CS, JHS, DJHD, BWJHP and MH participated in the conceptual design of the study. EJT identified and gathered the existing environmental registry data. BB provided geo-data on air pollution. JECD helped to obtain geo-data on road-traffic, rail-traffic and air-traffic noise. All authors critically revised the manuscript and contributed to interpretation of the data. All authors read and approved the final version of the manuscript.
Funding Financial support for GECCO was obtained from the EMGO Institute for Health and Care Research (EMGO+) of Vrije Universiteit Amsterdam and VU University Medical Center in Amsterdam, the Netherlands, which covered the identification and gathering of existing environmental registry data and linkage of these data to data from the cohort studies of the consortium. The Generations2-study is sponsored by The Netherlands Organisation for Scientific Research (NWO), Stichting tot Steun/Pro Juventute and Fonds NutsOhra Zorgsubsidies. The Longitudinal Aging Study Amsterdam (LASA) is primarily supported by grants from the Netherlands Ministry of Health Welfare and Sports, Directorate of Long-Term Care. Additional funding for LASA was obtained from multiple grants from NWO. The first measurement round of the Netherlands Longitudinal Study on Hearing (2006–2010) was financially supported by the Heinsius Houbolt Foundation, the Netherlands. The second measurement round (2010–2014) was financially supported by Sonova AG, Switzerland. The third measurement round (2016–2020) is financially supported by the EMGO+ Institute and Sonova AG, Switzerland. The infrastructure for the Netherlands Study of Depression and Anxiety is funded through the Geestkracht programme of the Netherlands Organisation for Health Research and Development (ZonMw, grant number 10-000-1002) and matching funds from participating universities and mental healthcare organisations (VU University Medical Center, GGZ Buitenamstel, GGZ Geestgronden, Leiden University Medical Center, GGZ Rivierduinen, University Medical Center Groningen, Lentis, GGZ Friesland and GGZ Drenthe). The data collection in the Netherlands Twin Register was financially supported by multiple grants from NWO and ZonMw; the Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL) (184.021.007); the European Research Council, Genetics of Mental Illness (ERC-230374); the National Institute for Mental Health (NIMH) (1RC2MH089951-01, 1RC2MH089995-01) and the Avera Institute for Human Genetics, Sioux Falls, South Dakota, USA. The New Hoorn Study was financially supported by multiple grants from the VU University Medical Center of Amsterdam, Novartis Pharma B.V., the European Union and the Innovative Medicine Initiative.
Competing interests None declared.
Patient consent Not required.
Ethics approval The six participating cohort studies in GECCO have been approved by the Ethical Review Boards of the respective institutions.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement The geo-data that have been collected within GECCO are available on request, provided that additional requirements of the original geo-data source holder are fulfilled and an agreement is made up with the GECCO Research Team. More information is available online (http://www.emgo.nl/research/international-collaborations/longitudinal-cohort-studies/emgo-cohort-booster-project) or available on request (firstname.lastname@example.org).