Introduction Schizophrenia and bipolar disorder account for a large proportion of the global burden of disease. Despite their enormous impact, little is known about their pathophysiology. Given the high heritability of schizophrenia and bipolar disorder, unbiased genetic studies offer the opportunity to gain insight into their neurobiology. However, advances in understanding the genetic architecture of schizophrenia and bipolar disorder have been based almost exclusively on subjects of Northern European ancestry. The Neuropsychiatric Genetics of African Populations-Psychosis (NeuroGAP-Psychosis) project aims to expand our understanding of the causes of schizophrenia and bipolar disorder through large-scale sample collection and analyses in understudied African populations.
Methods and analysis NeuroGAP-Psychosis is a case-control study of 34 000 participants recruited across multiple sites within Ethiopia, Kenya, South Africa and Uganda. Participants will include individuals who are at least 18 years old with a clinical diagnosis of schizophrenia or bipolar disorder (‘psychosis’) or those with no history of psychosis. Research assistants will collect phenotype data and saliva for DNA extraction. Data on mental disorders, history of physical health problems, substance use and history of past traumatic events will be collected from all participants.
DNA extraction will take place in-country, with genotyping performed at the Broad Institute. The primary analyses will include identifying major groups of participants with similar ancestry using the computation-efficient programme single nucleotide polymorphisms (SNP) weights. This will be followed by a GWAS within and across ancestry groups.
Ethics and dissemination All participants will be assessed for capacity to consent using the University of California, San Diego Brief Assessment of Capacity to Consent. Those demonstrating capacity to consent will be required to provide informed consent. Ethical clearances to conduct this study have been obtained from all participating sites. Findings from this study will be disseminated in publications and shared with controlled access public databases, such as the database of Genotypes and Phenotypes, dbGaP.
- mental Health
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
Strengths and limitations of this study
This will be the first psychiatric genetics study at this scale in Africa and across different African populations.
Significant phenotypic data will be collected with the goal of a looking at a transdiagnostic category, without necessarily being limited to Diagnostic and Statistical Manual of Mental Disorders/International Statistical Classification of Diseases and Related Health Problems diagnoses.
We will extract DNA from saliva samples which are logistically easier to manage than blood. However, there is less DNA in saliva than in blood, and there is a higher risk of contamination.
As we have chosen not to exclude based on race, some of the participants are likely to be of European descent.
Although we obtain permission to follow-up enrolled participants, this is a case-control study, and thus will only have phenotypic data at one point in time.
Neuropsychiatric disorders are the leading cause of years lived with disability in the world.1 Within mental, neurological and substance use disorders, schizophrenia and bipolar disorder account for more than 14% of years of life lost to premature mortality and years lived with disability.2 In the past several years, there have been great strides in our understanding of the genetic architecture of schizophrenia and bipolar disorder. A landmark paper in Nature from 2014 discovered 108 genome-wide significant loci for schizophrenia in ~37 000 cases and ~113 000 controls.3 In the intervening years, the number of genome-wide significant hits for schizophrenia has grown to 145.4 Successes in schizophrenia research have demonstrated that extremely large scale meta-analyses are necessary to identify genetic variants associated with neuropsychiatric disorders.5 The hope is that these breakthroughs in neuropsychiatric genetics will lead to new pharmacological targets, and ultimately treatments to reduce the global burden of psychiatric disorders.
However, for historical, cultural, financial and practical reasons, these genetic findings are based predominantly on subjects of Northern European ancestry, with a growing but still small proportion on populations of East Asian ancestry.6 7 Currently, there are major limitations in our knowledge of the genetic and environmental risk architecture of psychiatric disorders in persons of African descent.8 As a result, we are limited in our ability to understand biological mechanisms, predict genetic risk9 and produce optimal therapy for African populations. Moreover, African genomes are characterised by shorter haplotype blocks and contain almost a million more variants per individual than populations outside Africa.10 Further, genetic studies of under-represented populations afford the opportunity to discover novel loci that are invariant in European populations.11 Thus, including data from African populations in genetic studies of neuropsychiatric disorders may accelerate genetic discovery and could be useful for fine mapping of disease causing alleles.12 Studies of psychiatric genetics are in their infancy in Africa and are not yet at a scale necessary for variant discovery.
Aim and objective
The Neuropsychiatric Genetics of African Populations-Psychosis (NeuroGAP-Psychosis) project aims to expand knowledge of the genetic and environmental risk factors for neuropsychiatric disorders in Africa through large-scale sample collection and analysis, so that future advances in science and therapeutics can account for and be applicable to African populations.
Methods and analysis
The design will be a case-control study. This project is structured around two diagnostic categories: schizophrenia and bipolar I disorder (grouped under the heading ‘psychotic disorders’). The rationale for grouping schizophrenia and bipolar disorder stems from literature showing a high level of genetic correlation between schizophrenia and bipolar disorder, which may indicate that some of the same genetic variants confer risk for both phenotypes.13
Cases will be individuals with a diagnosis of schizophrenia or bipolar disorder, referred to subsequently as psychosis. Controls will be individuals from the same geographical location, without psychosis, who will be matched to cases for age, sex and ancestry.
The study will be conducted over 4 years, starting in September 2018 and ending in 2022, at several sites in Ethiopia, Kenya, South Africa and Uganda (figure 1).14 DNA extraction will be performed on site, and genotyping will performed at the Broad Institute in the USA using the Illumina Global Screening Array. Sites were selected on the basis of the following criteria: (1) proven track record of psychiatric research; (2) availability of research personnel and the necessary research infrastructure to be able to recruit thousands of participants and (3) existing trusted relationships from prior collaborations. Each of the countries where participants will be recruited from has enormous genetic diversity within and between them, which is likely to improve the ability of this research to answer the study objectives. Pilot studies are currently underway in all four countries.
Participants will be recruited from the Amanuel Mental Specialized Hospital (cases) and Black Lion Hospital also known as the Tikur Anbessa Hospital (controls).
In Kenya, participants will be recruited from the Moi Teaching and Referral Hospital and affiliated sites in Webuye, Kapenguria, Kitale, Kapsabet, Iten and Kakamega and the KEMRI-Wellcome Trust Research Programme with recruiting sites in Kilifi County, Malindi sub-County, Port Reitz and Coast General Provincial Hospitals.
Participants will be recruited from the Western Cape and Eastern Cape provinces. In the Western Cape, participants will be recruited from the Valkenberg, Lentegeur, Khayelitsha District and Groote Schuur Hospitals as well as a number of community clinics. In the Eastern Cape, participants (cases) will be recruited from the Fort England Psychiatric, Elizabeth Donkin, Tower Psychiatric and Komani Hospitals while controls will be recruited from the Nelson Mandela Academic and Dora Nginza Hospitals as well as five affiliated health clinics.
Study participants in Uganda will be recruited from the Butabika National Mental Health Referral Hospital (cases and controls), Naguru (controls only), Arua (cases and controls), Mbarara (cases and controls) and Gulu Regional Referral Hospitals (cases and controls).
Individuals with a clinical diagnosis of psychosis (cases) as confirmed by clinician referral and/or medical record review, and those without a clinical diagnosis of psychosis (controls) will be eligible to participate. All participants (cases and controls) will be required to provide written informed consent or a fingerprint in case of illiteracy and must be at least 18 years old. To ensure that participants (cases and controls) have sufficient capacity and autonomy to consent to the study, we will use the University of California, San Diego Brief Assessment of Capacity to Consent (UBACC),15 which has been used as an iterative learning tool in similar populations in South Africa.16
Exclusion criteria for cases
Individuals (cases) will be excluded if the following are present:
Absence of a diagnosis of a psychotic disorder.
Severe, intrusive levels of psychiatric symptoms at the time of consent.
Intoxication or withdrawal from alcohol or substance abuse at the time of consent.
A current psychiatric hospitalisation (inpatients).
Involuntary detention at the time of consent.
Lack of fluency in one of the languages the consent form has been translated into.
Lack of capacity to consent to the study as determined by the UBACC.
Potential cases may be approached at a later date once their symptoms are controlled if they meet other inclusion criteria. There will be no exclusions for cases or controls based on sex, ancestry/ethnicity, religious affiliations or sexual orientation.
Exclusion criteria for controls
Potential controls will be excluded if they:
Have current psychotic symptoms or a past diagnosis of a psychotic disorder.
Are currently taking medication for psychosis.
Have acute levels of alcohol or substance misuse as demonstrated by being a current inpatient or under acute medical care for substance misuse.
Lack of fluency in one of the languages the consent form has been translated into.
Lack of capacity to consent to the study as determined by the UBACC.
Eligible participants (cases) will be identified by clinical staff through review of their medical records. A research assistant (RA), who is a nurse, clinical officer, clinical assistant or bachelor’s level accredited RA and who has received study-specific training and human subjects training, will approach the prospective participant, carefully explaining to them in the local language the purpose and procedures of the study and emphasising that participation is entirely voluntary and will not impact any medical care they receive. Languages include: Acholi, Afrikaans, Amharic, English, Kigiryama, Kiswahili, Luganda, Lugbara, Runyankole and isiXhosa.
After reading the information sheet out loud to the subject, the RA will administer the UBACC, a 10-item questionnaire that evaluates the potential participant’s understanding of different components of the study. Each response will be scored on a range of 0–2, with 0 representing no understanding and 2 representing a clear understanding. In the NeuroGAP-Psychosis study, the RA will administer the UBACC over a maximum of four trials. The RA will re-explain and re-administer any items the subject answered incorrectly. The process will end if the full score of 20 is obtained. After the fourth trial, participants who are unable to achieve a score of 14.5, the cut-off originally developed for screening decisional capacity using the UBACC,15 will be excluded.
The RA will proceed to obtain informed consent from those who express interest in participation, and ask participants to provide saliva into an Oragene tube, from which their DNA will be extracted. To maximise the quality of the collected DNA, participants will be asked not to eat, drink, smoke or chew gum for 30 min prior to providing the sample during the consenting process.
Phenotypic assessments were selected on the basis of the following criteria: (1) the particular domain assessed; (2) cross-cultural validity; (3) non-proprietary measures, when possible; (4) investigators’ prior experience using the tool and (5) the time length of administering the tool. The study will collect the following battery of instruments (see figure 2).
Mini International Neuropsychiatric Interview, Standard 7.0.2 for Diagnostic and Statistical Manual of Mental Disorders-5 (MINI): modules A, C, K and O on major depressive episode, manic and hypomanic episodes and psychotic disorders and mood disorder with psychotic features, respectively.17
Life Events Checklist for Diagnostic and Statistical Manual of Mental Disorders-5: a 17-item scale covering exposures to potentially traumatic events.18
Alcohol, Smoking and Substance Involvement Screening Test, V.3.0 (ASSIST): a subset of the ASSIST on substance type and use over a participant’s lifetime and over the past 3 months.19
Composite International Diagnostic Interview screener (CIDI): a checklist within the CIDI on chronic physical conditions20 including diabetes, HIV/AIDS, epilepsy/seizures and tuberculosis.
Participant phenotypic data will be collected using encrypted tablets and uploaded to a secure cloud-based server. Other sources of data will include chart reviews to ascertain clinical diagnoses and medication use (both psychotropic and non-psychotropic) by the participants; as well as measurements of blood pressure, heart rate, height and weight.
Recruitment of controls will take place from general medical facilities (some of these facilities are located within the premises where cases will be identified). Controls will be approached in the same manner as cases and will likely consist of people who are seeking clinical care for themselves, accompanying a friend or family member to a clinic visit, or picking up a medication refill. In one site, at the KEMRI-Wellcome Trust Research Programme in Kenya, controls may also be recruited from homes through the Kilifi Health and Demographic Surveillance System (KHDSS).21 [In KHDSS, census data is recorded every 4 months through household surveys in the catchment area and linked with healthcare data from Kilifi District Hospital.]
The study visit for controls will match the process for cases except for two phenotypic batteries, the Kessler Psychological Distress Scale (K10) and the Psychosis Screening Questionnaire (PSQ) (see figure 3); instead of using the MINI for controls, the RA will administer:
K10: a 10-item questionnaire to measure anxiety and depression.22
PSQ: a brief screening tool for psychotic conditions.23
All participants will receive compensation to offset the time and inconvenience of participating in the study based on recommendations from the local ethics committees and principal investigators. The estimated length of a study visit is 90 to 120 min.
Patient and public involvement
Patients’ priorities and experiences were taken into account in the design of the study. Prior research and clinical work with people with neuropsychiatric disorders at each of the sites informed the research question and study design, including limiting the length of the study battery, minimising intrusive sample collection (saliva instead of blood) and incorporating a tool as part of the consent process to assess understanding of the research and study visit. In addition, most of the sites have a precedent of genetics research and we engaged with teams from those other genetic studies to incorporate patient concerns emerging from genetic research. Study findings will be shared with local community advisory boards.
Sample size and power calculation
Over a period of 4 years, the study team expects to recruit a total of 17 000 schizophrenia and bipolar disorder cases and 17 000 controls across four countries: Ethiopia, Kenya, South Africa and Uganda. The geographic and demographic sampling distribution of the controls will follow that of the cases, and cases and controls will be matched for ethnic ancestry in all analyses. These numbers, described in more detail below, will give adequate statistical power to independently examine the genetic architecture of schizophrenia and bipolar disorder in African populations, the distribution and influence of rare copy number variation on neuropsychiatric disease risk in African populations and determine the consistency of genetic influences on neuropsychiatric disease risk between populations of European and African descent. Ultimately, these data will be analysed in combination with other studies of psychiatric genetics, most notably from the Psychiatric Genomics Consortium, which will further increase our power to discover new and validate current variants with small effect sizes.
With 17 000 cases and controls, using a threshold of 2.5×10-8 for genome-wide significance, we have >90% power to find common variants (Minor Allele Frequency (MAF) >5%) with an OR of 1.15 or greater.24 For more common variants with MAF=25%, we have 89% power to a see significant OR as low as 1.06. These effect sizes are within the range of what was found in a recent Genome-wide Association Study (GWAS) of schizophrenia in a European population.3
DNA extraction and processing
All study participants will provide a 2.0 mL sample of saliva by spitting into a funnel in an Oragene OR-500 kit. DNA extraction will take place in-country, with one aliquot of the extract sent to the Broad Institute in Cambridge, MA, USA for genotyping. The remaining DNA will be frozen and stored in-country within designated biobanks and will be governed by the rules of its institution. In Ethiopia, DNA will be stored at the Black Lion Hospital. In Kenya, DNA will be stored at Moi University and KEMRI-Wellcome Trust Research Programme Biorepository Laboratories. In South Africa, DNA will be stored in a biobank at the Division of Human Genetics at the University of Cape Town, while in Uganda, DNA will be stored at the Integrated Biorepository Lab of H3Africa in Uganda (IBR-H3A-U).
All samples will be assayed on the same genotyping array at the same facility. To maximise information and allow for comparisons across studies genotyped on different platforms, we will impute SNP genotypes not present on specific arrays. Genotype imputation of untyped markers will be performed using default parameters in IMPUTE2 using appropriate reference panels.
To ensure research staff understand the study and their responsibilities, teams will undergo multiple trainings. Study staff who have contact with research participants will complete the online Human Subject Training offered by the US National Institutes of Health. Site leadership and Harvard staff will provide in-person training to the teams by providing an overview of the project, leading discussions, providing role playing on ethics and confidentiality, on recruiting and consenting participants, on reportable events, and on storing forms and data properly. Additionally, there will be hands-on training using the electronic tablets, collecting saliva using Oragene kits, administering the phenotypic batteries, and taking physiological measures, such as height and weight. A checklist will be created of key skills staff need before they can begin interacting with participants one-on-one (figure 4). This form will need to be signed and dated by the staff member and the staff member’s supervisor before he or she can begin working with participants; it will then be uploaded to the regulatory binder. Harvard teams will reinforce these skills with multiple site visits per year and weekly video calls.
In addition, Broad Institute wet lab staff will work in partnership with the lab teams in Africa to standardise the DNA extraction process across the five sites and to increase the quality of the extractions through visits, video calls and email. Where needed, the Broad staff will train lab members on DNA extraction from saliva and establish a sample chain of custody process in order to decrease the likelihood of sample swaps.
Statistical analysis plans
(a) Ancestry analyses
In contrast to most samples previously analysed by the psychiatric genetics community—which were predominately of European ancestry25—the NeuroGAP-Psychosis samples will include individuals of predominantly African ancestry and will be drawn from multiple populations across East and South Africa where genetic variation will be more structured and demographically complex.26 27 Therefore, before proceeding with any phenotype analyses we will first analyse genetic ancestry in our cohorts to identify large, homogeneous groups as described below to conserve optimal power in the GWAS and ultimately control for additional population stratification within these groups to avoid spurious results.
We will take advantage of the large amount of genotypic information available and control for potential population stratification in a standard and validated two-step process. First, we will compute principal components analysis (PCA) to adjust for population substructure. We will also include principal components (PCs) when computing relatedness using PC-AiR.28 We will define clusters of ancestries using ADMIXTURE29 and choose the optimal number of clusters with fivefold cross-validation to identify major groups of participants with similar ancestry. The inclusion of extensive population reference samples compiled by our group and available from other studies, such as the 1000 Genomes Project30 and the African Genome Variation Project,27 will increase the power of this approach and lead to stable individual ancestry calling across studies. Second, if significant differences are found between populations in our cohorts, as expected, we will define more homogeneous populations within several standard deviations of these reference data, selecting each population by constructing multidimensional ellipses across the PCs. Our initial analyses will then be conducted separately on more homogeneous ancestry groups, and later meta-analysed. Within each group, we will conduct PCA again using PC-AiR, and the PCs that explain the most variation within the cohort will be included as covariates in the regression analyses. To adjust for cryptic relatedness that considers ancestry, we will compute a genetic relatedness matrix using PC-Relate,31 which can be used as a random effect in a linear or logistic mixed model. In addition, as new methods are developed by our group and others to account for population structure, we will modify our analyses when appropriate.
(b) Genome-wide Association Study
The primary GWAS discovery analysis will be conducted for the ‘psychotic disorders’ diagnosis, as well as through subsets stratified by primary disorder (schizophrenia, bipolar disorder) using logistic regression. All analyses will be performed using PLINK32 and R.33 To correct for multiple comparisons and the large number of variants in the African genome, we will apply a more conservative significance threshold of p<2.5×10-8.34 Using these subsets, we will assess the genetic correlation and diagnostic stratification among phenotypes.
(c) Heritability estimates
Estimation of heritability, the fraction of disease risk attributable to inherited genetic differences, is an important initial analytic goal. Heritability estimates speak to the anticipated architecture and power of a GWAS, a primary endpoint of the NeuroGAP-Psychosis project. We will use genome-wide complex trait analysis to estimate SNP-based heritability within Africa.25 35 We will assess genetic correlation with existing GWAS in other globally diverse populations using POPCORN.36
Ethics and dissemination
Ethical and safety considerations will be taken across multiple levels. Since the subjects the study aims to recruit are deemed vulnerable populations, additional measures will be taken to protect them. Potential participants will be excluded if they are presenting with severe, intrusive levels of psychiatric symptoms at the time of consent. In addition, as described previously, the RAs will use the UBACC15 16 during the consent process to make sure participants understand the study, what is required of them, and that they can withdraw at any point. Participants who pass the UBACC and who want to continue will be required to provide written informed consent or a fingerprint in lieu of a signature.
All steps will be taken to mitigate the potential loss of confidentiality for participants. A two-part ID system will be instituted in order to limit the possibility of connecting the participant with their data. There will be one ID for the phenotypic information and one for genetic information. A single encrypted database linking the phenotypic ID and genetic ID will be stored in-country. The participant’s name and contact information will be on the consent form, which will be stored in a locked cabinet. This information will be put in a separate secure database and will never leave the country of origin. No data that is considered identifiable, by Health Insurance Portability and Accountability Act standards, will ever be collected on the electronic tablets.
We aim to share results through a number of mechanisms in addition to conferences and peer-reviewed publications. In order to maximise scientific utility of the samples/data and to minimise data waste, both genomic and phenotypic data from NeuroGAP-Psychosis will be deposited in controlled access public databases, such as the database of Genotypes and Phenotypes (dbGaP), the Psychiatric Genomics Consortium and/or the European Genome-Phenome Archive. By pooling NeuroGAP-Psychosis’ data with that from other studies, it will be possible to maximise statistical power. Sharing the data will also contribute to the ‘genomic revolution’ currently underway in Africa due to the pioneering work of the African Genome Variation Project, MalariaGEN, African Society of Human Genetics and Human Hereditary and Health in Africa (H3Africa).27 37–40
To help ensure all collaborators have the capacity to work with all the data that will be created, a unique training programme has been created, the Global Initiative for Neuropsychiatric Genetics Education in Research (GINGER),41 that will run in parallel to the NeuroGAP-Psychosis study. Over a 2-year period, 17 early-career researchers from Ethiopia, Kenya, South Africa and Uganda will take part in online and in-person trainings on topics including biostatistics, genetic analysis and epidemiology. Ultimately, the goal of GINGER is to build the next generation of neuropsychiatric geneticists who will be able to work on the NeuroGAP-Psychosis data and continue its legacy in the future.
Patient consent for publication Not required.
AS and DA contributed equally.
Contributors KCK, DS: conceptualised and designed the study. AS, DA, RES, LA, EK, SMK, VdM, CRJCN, DS, ST, ZZ, KCK: wrote the protocol, selected the phenotype questionnaires and added site-specific information. LC, ARM: wrote the sample size, power calculations and statistical analysis plans. MMC, GS: contributed to the UBACC process. AS, DA: wrote the draft of the manuscript and incorporated the revisions by the co-authors; they contributed equally to this paper. All authors reviewed the manuscript for intellectual content, contributed to revisions and approved the final version for publication.
Funding This study is funded by the Stanley Center for Psychiatric Research at Broad Institute. The sponsor was involved in the study design, in the writing of the report and in the decision to submit the paper for publication. The sponsor will be involved in the analysis and interpretation of the data. ARM was supported by funding from the National Institutes of Health (K99MH117229).
Competing interests DS has received research grants and/or consultancy honoraria from Biocodex, Lundbeck, Servier and Sun.
Ethics approval Ethical clearances to conduct this study have been obtained from all participating sites, including: Ethiopia: Addis Ababa University College of Health Sciences (#014/17/Psy) and the Ministry of Science and Technology National Research Ethics Review Committee (#3.10/14/2018). Kenya: Moi University School of Medicine Institutional Research and Ethics Committee (#IREC/2016/145, approval number: IREC 1727), Kenya National Council of Science and Technology (#NACOSTI/P/17/56302/19576) KEMRI Centre Scientific Committee (CSC# KEMRI/CGMRC/CSC/070/2016), KEMRI Scientific and Ethics Review Unit (SERU#KEMRI/SERU/CGMR-C/070/3575). South Africa: The University of Cape Town Human Research Ethics Committee (#466/2016) and Walter Sisulu University Research and Ethics Committee (# 051/2016). Uganda: The Makerere University School of Medicine Research and Ethics Committee (SOMREC #REC REF 2016-057) and the Uganda National Council for Science and Technology (UNCST #HS14ES). USA: The Harvard T.H. Chan School of Public Health (#IRB17-0822).
Provenance and peer review Not commissioned; externally peer reviewed.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.