Article Text

Download PDFPDF

Cohort profile
Cohort profile: molecular signature in pregnancy (MSP): longitudinal high-frequency sampling to characterise cross-omic trajectories in pregnancy in a resource-constrained setting
  1. Tobias Brummaier1,2,3,4,
  2. Basirudeen Syed Ahamed Kabeer5,
  3. Pornpimon Wilaisrisak1,
  4. Mupawjay Pimanpanarak1,
  5. Aye Kyi Win1,
  6. Sasithon Pukrittayakamee6,7,
  7. Alexandra K Marr5,
  8. Tomoshige Kino5,
  9. Souhaila Al Khodor5,
  10. Annalisa Terranegra5,
  11. Verena I Carrara1,2,
  12. Francois Nosten1,2,
  13. Jürg Utzinger3,4,
  14. Damien Chaussabel5,
  15. Daniel H Paris3,4,
  16. Rose McGready1,2
  1. 1Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Tak, Thailand
  2. 2Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK
  3. 3Swiss Tropical and Public Health Institute, Basel, Switzerland
  4. 4University of Basel, Basel, Switzerland
  5. 5Sidra Medicine, Doha, Qatar
  6. 6Mahidol Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
  7. 7Department of Clinical Tropical Medicine, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
  1. Correspondence to Dr Tobias Brummaier; tobias.brummaier{at}


Purpose A successful pregnancy relies on the interplay of various biological systems. Deviations from the norm within a system or intersystemic interactions may result in pregnancy-associated complications and adverse pregnancy outcomes. Systems biology approaches provide an avenue of unbiased, in-depth phenotyping in health and disease. The molecular signature in pregnancy (MSP) cohort was established to characterise longitudinal, cross-omic trajectories in pregnant women from a resource constrained setting. Downstream analysis will focus on characterising physiological perturbations in uneventful pregnancies, pregnancy-associated complications and adverse outcomes.

Participants First trimester pregnant women of Karen or Burman ethnicity were followed prospectively throughout pregnancy, at delivery and until 3 months post partum. Serial high-frequency sampling to assess whole blood transcriptomics and microbiome composition of the gut, vagina and oral cavity, in conjunction with assessment of gene expression and microbial colonisation of gestational tissue, was done for all cohort participants.

Findings to date 381 women with live born singletons averaged 16 (IQR 15–18) antenatal visits (13 094 biological samples were collected). At 5% (19/381) the preterm birth rate was low. Other adverse events such as maternal febrile illness 7.1% (27/381), gestational diabetes 13.1% (50/381), maternal anaemia 16.3% (62/381), maternal underweight 19.2% (73/381) and a neonate born small for gestational age 20.2% (77/381) were more often observed than preterm birth.

Future plans Results from the MSP cohort will enable in-depth characterisation of cross-omic molecular trajectories in pregnancies from a population in a resource-constrained setting. Moreover, pregnancy-associated complications and unfavourable pregnancy outcomes will be investigated at the same granular level, with a particular focus on population relevant needs such as effect of tropical infections on pregnancy. More detailed knowledge on multiomic perturbations will ideally result in the development of diagnostic tools and ultimately lead to targeted interventions that may disproportionally benefit pregnant women from this resource-limited population.

Trial registration number NCT02797327.

  • clinical physiology
  • epidemiology
  • reproductive medicine
  • fetal medicine
  • maternal medicine

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

Strengths and limitations of this study

  • The major strength is the prospective nature of the study and frequent follow-up, coupled with high-frequency sampling and thus, availability of detailed clinical information and a considerable number of biological samples.

  • High-throughput analysis, in combination with clinical data, will enable investigation of a number of pregnancy-related physiological and pathological changes.

  • Frequently populations from low-resource settings are disproportionally burdened by adverse birth outcomes that may be based on exposure to different communicable diseases; hence, including them in high-end clinical research, addresses a significant research gap and may result in improvements of limited relevance to high-income countries.

  • Low numbers for some phenotypes (eg, preterm birth) may prove to be detrimental for the validity of observed differences; although the power will depend on the magnitude of observed differences in molecular signatures.

  • In low-resource settings, complete biological sample sets are often difficult to obtain, which may downsize the richness of the data.


A successful pregnancy relies on well-timed adaptations and the interplay of multiple maternal biological systems. These interactions and temporal changes affect various organ systems, such as the cardiovascular, respiratory, endocrine system or metabolic systems, and, more recently, pivotal immunological adaptions and changes in the human microbiome became evident.1–5 Physiological adaptions of the immune system during pregnancy play a central role in implantation and placentation, promotion of fetal growth and initiation of labour and delivery.2 Deviations from the norm of this fine-tuned immune clock may lead to dysregulation in biological networks and cause various pregnancy-associated complications with their immediate consequences for the mother and fetus.6

With a growing body of evidence of the human microbiome’s significance in health and disease, investigating its role in reproductive medicine has opened up an avenue to a deeper understanding of adverse pregnancy outcomes.7 The placenta represents the fetomaternal link. Its crucial role during pregnancy is underlined by the fact that failure in its development and function is associated with a variety of pregnancy-associated complications (eg, pre-eclampsia, intrauterine growth restriction and preterm birth, PTB).8

In recent years, investigation of these biological systems has greatly improved our understanding of their role in healthy pregnancies and in pregnancies resulting in unfavourable outcomes. However, there is a need for longitudinal, multiomics profiling studies that may further contribute to the understanding of physiological adaptions, the interconnections of various biological systems and their significance in pregnancy-associated complications.3

Research, in particular high-end research, is often biased towards high-income countries.9 This imbalance is aggravated by the fact that, due to population based differences, results are often not generalisable,10 populations living in resource-constrained settings are more affected by the consequences of pregnancy-associated complications (eg, inadequate access to safe iatrogenic birth or lack of advanced neonatal care) and have different epidemiological patterns of communicable diseases. Consequently, pregnant women in low-income settings would benefit disproportionately from early identification of pregnancy-associated complications and targeted interventions.

Thus, the aim of establishing this prospective molecular signature in pregnancy (MSP) cohort was to characterise cross-omic trajectories in pregnant women from a resource-limited setting and describe pregnancy-associated complications (eg, PTB, gestational diabetes mellitus (GDM), anaemia, underweight and born too small for gestational age (SGA)). The term ‘molecular signature’ was chosen, as it refers to molecular markers that can be used for in-depth description of a particular phenotype.11

Perturbations related to the immune response will be investigated by measuring the abundance of RNAs in circulating nucleated cells (ie, leucocytes) via capillary blood sampling at multiple time points.12 Microbiome profiling of the intestinal and vaginal niche in pregnancy, at delivery and post partum will be complemented by assessment of the oral microbiome. Lastly, placental tissue will be investigated for the presence of bacterial colonisation as well as for the relationship between the genome structure and changes in global patterns of gene expression. The generated molecular data will then be analysed in the context of clinical and patient related data. As described in the study protocol,13 an immediate focus was the investigation of PTB.

The MSP cohort profile represents the link between development of the study protocol and the results. Accordingly, cohort characteristics, including the recruitment and follow-up process, demographics, pregnancy outcomes, biological samples available for analysis and preliminary results, are presented.

Cohort description

Setting and participants

Women with an unremarkable medical and obstetric history attending the antenatal care (ANC) facilities of Shoklo Malaria Research Unit (SMRU), a field station of the Faculty of Tropical Medicine at Mahidol University (Bangkok, Thailand) that is part of the Mahidol-Oxford Research Unit, were invited to participate. SMRU operates free-of-charge walk-in services for marginalised migrant populations in clinics at the frontier of Thailand with Myanmar (figure 1). This mobile population mostly resides in small, rural villages. The major source of income in this community comes from work in the agricultural sector or as daily labourers, while about one in four women stay at home to look after the household.14 As most of these daily employments provide only minimum wages coupled with the often undocumented legal status, the living conditions of most pregnant women in this community are very basic.15 The official minimal wages in Myanmar are around MMK4800 (US$3) per day, and in Thailand around THB300 (US$9) per day.

Figure 1

Setting and location of recruitment clinics.

Between September 2016 and July 2018, trained counsellors from the community provided general information about the trial to all pregnant women who registered at SMRU ANC. Detailed information, including visual aids, was used to illustrate the purpose, procedures and timing of follow-up. Potential risks and benefits of the study were explained to women meeting the following inclusion criteria: (1) pregnant woman willing and able to give written informed consent; (2) Karen or Burman ethnicity; (3) age 18–49 years; (4) healthy, with viable singleton first trimester (8+0 to <14+0 weeks) pregnancy; (5) planning to deliver at the SMRU birth unit and (6) able and willing to comply with study requirements. If these criteria were met and appropriate consent was provided, women were followed from the first trimester, throughout pregnancy, at delivery and until 3 months post partum.

It was estimated that approximately 400 women could be enrolled to build the cohort from this low-resource environment when considering constraints of clinical capacity and budget availability. With this sample size, events such as PTB (estimated at 8%, that is, 30 cases expected) would be feasible but also provide scope to engage with other pregnancy-associated complications, tropical infections (eg, small liver fluke infections estimated at 5%, that is, 20 cases expected) and febrile illness (estimated at 10%, that is, an expectation of 40 cases) with a sufficient number of uneventful pregnancies remaining to act as controls.

Figure 2 provides an overview of the number of participants recruited and followed. During the active recruitment period, 4292 pregnant women registered to ANC. Seventy per cent (2544/3609) were ineligible due to late presentation to ANC. There were two local conditions that limited recruitment. The first issue pertained to access; pregnant women following SMRU ANC come from villages scattered along the Thailand-Myanmar border (figure 1) with transportation constraints affecting their ability to attend, further aggravated in the monsoon season as roads become impassable. The second issue related to staffing. As only one ultrasound machine was available at each site, there was an upper limit to the number of women who could have study related fetal growth scans done on a single day. Challenges in access and staffing issues led to the non-consideration of 178 and 475 women, respectively.

Figure 2

Screening, enrolment and outcome flow chart. ANC, antenatal care; EGA, estimated gestational age; FHB, fetal heartbeat; SMRU, Shoklo Malaria Research Unit.

Overall, 683 women received full information about the purpose and procedures of the study. Of these, 430 agreed to participate (response rate: 63%) and consented by providing a signature or thumbprint if the event of an illiterate participant. Thirty participants were recruited in the cohort to replace women who were lost before delivery, while the study was still open to recruitment. Two participants were excluded as the fetal heartbeat (FHB) was absent after they had already agreed to the study (figure 2).

Routine ANC procedures at SMRU are described in the study protocol.13 Demographic characteristics of 381 enrolled women with a live birth outcome are summarised (table 1). To demonstrate the scope of the cohort, selected subgroups are summarised alongside the characteristics for all 381 women.

Table 1

Basic characteristics of women with a live birth outcome (n=381) and subgroups of selected pregnancy-associated complications in the molecular signature in pregnancy cohort

Participant follow-up

Women were followed every 2 weeks during pregnancy, at delivery and monthly in the first 3 months after delivery. Trial participants were encouraged to report to the study clinic if they experienced an adverse event, with special consideration for febrile episodes to maximise information on the nature of the experienced morbidity and its effect on the pregnancy.13 Overall, 6443 study visits during pregnancy were recorded, of which 6140 were from women with a live birth outcome. The median number of ANC visits for women with a live birth was 16 (IQR 15–18). In the uncommon event of cohort participants attending other health facilities, diagnosis and treatment was recorded at the next visit. All participants were encouraged to deliver at SMRU under the care of a skilled birth attendant. Women from the cohort achieved a low rate of home birth 6.6% (25/381) in contrast to the estimated 7 in 10 births that take place at home in rural Myanmar.16

As an estimator for adherence to study protocol, the number of expected and actual samples provided by women with a known pregnancy outcome were calculated. No significant difference was noted when comparing the proportion of expected and actual samples between capillary blood, stool and vaginal swab samples (p=0.070, p=0.767 and p=0.919, respectively). Missed follow-ups were the main reason for less than expected samples being provided. Additional samples were provided from participants who experienced a febrile illness, and on a few occasions, samples were erroneously taken as they were not required by the study protocol.

Early termination from the study occurred in 11.0% (47/428) of pregnancies, most of whom left ANC before birth 70.2% (33/47) due to migration. Adverse pregnancy outcomes were recorded in 25.5% (12/47) cases and two (4.3%) participants withdrew consent, as they no longer wanted to provide vaginal swab samples (figure 2).

Data collection

Data covering three major aspects were collected, namely (1) clinical data; (2) genome-wide transcript abundance and (3) analysis of microbial composition in various anatomical sites. Table 2 summarises all study relevant procedures.

Table 2

Study procedures and timing of the molecular signature in pregnancy cohort13

Routine ANC procedures included physical and obstetric examination, recording of concomitant medications and 6-weekly ultrasound scans to monitor fetal growth. Clinical data were collected, so that trajectories of maternal and fetal physical change would be available by the study endpoint. Two-weekly gestational weight, fetal growth and haematocrit were done as malnutrition and anaemia (and formerly malaria) are prevalent pregnancy-associated morbidities.17–19

Ultrasound scans to monitor fetal growth were performed 6-weekly. Throughout the study period, 2850 scans were done; 2693 of these were scans in women with a known, viable birth outcome (157 scans accounted for other pregnancy outcomes).

Additional details describing the study conduct and routine ANC procedures are available elsewhere.13 Briefly, capillary blood, faecal and vaginal swab samples were taken during pregnancy, at delivery and in the postpartum period. To assess the influence of nutrition on the faecal microbiome, each faecal sample was coupled with a 24-hour food recall (table 2). In the event of a febrile episode, an additional set of these samples were taken together with a standardised fever screening battery. Oral microbiome was assessed in the second trimester and at delivery. Placental tissue samples were taken at delivery along with umbilical cord blood serum and maternal serum.

As SMRU birthing units do not have capacity for caesarean sections, participants were referred to the closest public hospitals when indicated, and hence, delivery samples were not available, in common with births at home or in other health facilities. Even though gestational tissue samples were not available for these participants, outcome measures and neonatal anthropometry were mostly available.

Lastly, in an effort to understand the acceptance of the high-frequency sampling in combination with the dense follow-up schedule and the perception of invasiveness of sampling procedures, an acceptability survey at study enrolment and completion was conducted.

Biological samples

Standard operating procedures (SOPs) focused on detailed description of the sample collection procedures were drafted before study commencement and are available on request. To accommodate the high number of samples, a unique barcoded sample sticker was placed on each tube, containing the participants study code that was assigned at enrolment, as well as date and time when samples were collected. All samples were transferred from the study sites to SMRU’s central laboratory daily and stored according to SOPs. Freezerworks (Dataworks Development, Mountlake Terrace, Washington, USA), a biorepository software was used for sample management and tracking.

Including women lost to follow-up and pregnancies that resulted in a miscarriage or stillbirth, 13 536 biological samples were taken between study enrolment and 3-month post partum. Considering multiple aliquots for vaginal swab samples (four aliquots at each sampling time point), saliva samples (n=4) and placenta samples (n=12), the total number of samples available for testing is 25 816.

Table 3 provides an overview of biological samples available for analysis and figure 3 depicts the distribution of capillary blood, faecal and vaginal swab samples in relation to the estimated gestation of pregnancy.

Figure 3

Number of samples over the course of pregnancy by week of gestation. EGA, estimated gestational age.

Table 3

Overview of available biological samples for women with a live birth outcome (n=381) and selected subgroups of the molecular signature in pregnancy cohort

All biological samples will be analysed and interpreted in the context of clinical data. To investigate physiological trajectories and deviations from these in the event of pregnancy-associated complications, a nested case–control approach will be applied, and potential confounding factors will be adjusted for. Results will be published in original research articles alongside more detailed background information pertaining to the respective content knowledge, precise methods of sampling procedures and analytical steps of respective laboratory-based methods to enable the scientific community to draw their conclusions on appropriateness. The MSP cohort profile and the previously published study protocol13 will serve as a cross-reference for all results.

The molecular signatures pertaining to biological samples will be analysed by collaborators at Sidra Medicine (Doha, Qatar). To assure appropriate sample handling, shipment is delegated to World Courier (AmerisourceBergen; Chesterbrook, Pennsylvania, USA), a leading, global specialty and logistics provider. All samples are kept on dry ice and shipped in freezer boxes with thick styrofoam walls that are equipped with temperature monitors. At the time of writing, 16 916 biological samples have been successfully transferred.

Participant involvement

Study participants were not directly involved in the study design, the elaboration of research questions and outcome measures or the recruitment and conduct of the study. However, the protocol was reviewed and approved by a local community advisory board whose members are representatives of the local community and act as a bridge between researchers and the local population.20 Their role was to advise on ethical and operational aspects of the study and they will provide a channel to inform the community of the results of this study.

Findings to date

Demographic data, findings during pregnancy and relevant outcome data for the overall cohort population with a live birth are presented in tables 1 and 4, respectively. According to study objectives, data of commonly observed pregnancy-associated complications (GDM, anaemia and underweight) and adverse pregnancy outcomes (PTB and SGA) are presented alongside the overall cohort.

Table 4

Pregnancy outcome and neonatal data of women with a life birth outcome (n=381) and selected subgroups in the molecular signature in pregnancy cohort

At a rate of 5.0% (19/381), PTB (defined as delivery before 37 weeks of gestation), was considerably less than the worldwide estimate of approximately 10.6% (uncertainty interval 9.0%–12.0%) of live-born infants.21 When considering all pregnancies followed at SMRU during the study period, the observed proportion of PTB was roughly 6.8%, a number still below the worldwide average. The estimated gestational age (EGA) at delivery of PTB cases ranged from 30+2 to 36+6 (weeks+days) with the majority of cases (89.5%) being moderate/late PTB (i.e. ≥32 and<37 weeks of gestation). Of the 19 PTB cases, one was iatrogenic with induction for severe preeclampsia at an EGA of 36+3, while the remainder were spontaneous, without any discernible causative factor. Prioritising healthy women at the outset and frequent visits with healthcare providers might have contributed to earlier identification of health issues and timely intervention. Population-based factors that might be associated with the low PTB rate are the low rate of sexually transmitted diseases, and preliminary data suggest that prevalence of Gardnerella vaginalis, a facultative anaerobic coccobacillus that is associated with PTB, is low in this population. This low PTB rate is unique in itself and will be thoroughly assessed by a holistic investigation of available demographic, clinical and cross-omic data.

There were six spontaneous miscarriages (median EGA 16+3 (IQR 16+1–17+2)) and one pregnancy was terminated due to a congenital abnormality (severe hydrops fetalis) at an EGA of 20+1. There were four stillbirths; all were identified antepartum as intrauterine fetal deaths before completion of 37 weeks of gestation. One was unexplained with an absent FHB at 28+1 weeks, one was associated with pre-eclampsia (absent FHB at 31+1), one had a tight cord around the neck at birth (absent FHB at 35+3), and in one case the mother ingested organophosphate with suicidal intent (absent FHB at 36+3). One unexpected neonatal death was recorded in a term neonate born to a mother with an unremarkable pregnancy and delivery.

Seven of 381 (1.8%) women experienced Plasmodium vivax malaria during pregnancy and one woman was diagnosed with P. vivax malaria at delivery. Soil-transmitted helminth infections are common in this population and were detected in 21.3% (81/381) of pregnant women with a live birth outcome. Hookworm was detected in 15.0% (57/381) cases, Trichuris trichiura in 6.3% (24/381) and Ascaris lumbricoides in 4.2% (16/381). Fourteen women had more than one soil-transmitted helminth infection concurrently. Foodborne trematode infections (eg, small liver flukes) were detected in 6.8% (26/381) of the participants. The immunological fingerprint of helminthic infections and their effect on pregnancy is poorly understood. Hence, effects of helminth infections on the immune system or the intestinal microbiome in pregnancy will be studied with data from this cohort.

Generally, communicable diseases are often associated with adverse pregnancy outcomes. Fever, which is often the cardinal symptom of an infectious disease, was recorded in 7.1% (27/381) of pregnant women. Whether longitudinal, high-frequency sampling has potential for early detection, or detection prior to the onset of symptoms, of communicable diseases in pregnancy, will be determined. Availability of data pertaining to communicable disease will also allow appropriate adjustment for investigation of other objectives (eg, PTB).

Overall, only four cases (1.0%) of preeclampsia and one case (0.3%) of eclampsia were recorded which is also at the lower end of the epidemiological range (0.2–9.2).22 A common pregnancy-associated complication was GDM, which affected 13.1% (50/381) pregnancies with a live birth. GDM is the most common metabolic disorder in pregnancy and is of increasing concern in high-income settings around the globe but also in this resource-limited population.23 24 While the proportion of pregnant women diagnosed with underweight in this population is decreasing, 19.2% (73/381) of pregnancies were still complicated by a diagnosis of underweight.19 Maternal anaemia defined as haematocrit of less than 30% complicated 16.3% (62/381) of pregnancies and SGA (20.2% (77/381)) according to standards as published by the Intergrowth-21st consortium25 was also commonly observed and remains a solicitude in this pregnant population. As shown in figure 3, cohort enrolment in early pregnancy, coupled with the high-frequency sampling provides a plethora of cross-omic data. This will enable to identify signatures preceding the development of these complications, their impact on pregnancy outcome and to characterise how they plateau in the postpartum period.

Ultralow capillary blood samples are safe and a reliable method to monitor longitudinal gene expression trajectories. Samples acquired from this MSP cohort were used to develop a modified SOP for extraction of RNA from small volume whole blood samples preserved in Tempus solution26 and the performance of ultralow blood samples was benchmarked previously.27 Figure 4 shows a comparison between different stages in pregnancy (early pregnancy and late pregnancy), and the non-pregnant control (3-month post partum) by exploratory analysis of gene expression patterns (RNASeq) in 19 MSP women with uneventful, term pregnancies. The preliminary finding of clustering of different timepoints in pregnancy and post partum, respectively, follows patterns reported by Ngo et al who were able to predict EGA by analysing cell free RNA transcripts in maternal blood.28

Figure 4

Principal component (PC) analysis of whole blood gene expression data (RNAseq) of 19 uneventful, term pregnancies of the MSP cohort compared between first trimester (early pregnancy), third trimester (late pregnancy) and 3- month post partum (non-pregnant). MSP, molecular signature in pregnancy.

Strengths and limitations

The major strength of this cohort is the prospective and the frequent follow-up along the trajectory from first trimester to 3 months post partum. Coupled with the availability of a large number of well characterised biological samples, the high-throughput analysis, in combination with clinical data, we anticipate that a number of pregnancy-related physiological and pathological changes can be investigated and described in detail. Systematic approaches enable the researchers to assess the entirety of a biological system and are thought to minimise the bias introduced through selective parameters.

Populations originating from low-resource settings do not often have the same level of access to advanced research when compared with high-income settings. While some data on intestinal microbiome are available,29 data generated from the MSP cohort will characterise composition and perturbations of the vaginal microbiome for the first time in this population.

The small number of PTB was not anticipated and is lower than rates reported internationally. While a larger figure would be desirable from an analytical point of view, comparison of molecular markers and associated population-based factors to populations with a higher risk of PTB will provide novel insight for this low observed proportion. Generally, a selection and exclusion bias at participant enrolment might have contributed to the low number of PTB, as women with an unremarkable medical and obstetric history were given priority to minimise the risk of potential lost to follow-up and subsequent sample loss. Hence, inference on the power of the result cannot be made at this point, as the power will depend on the magnitude of observed differences in molecular signatures between term and preterm pregnancies.

More faecal samples were missed compared with capillary blood or vaginal swab samples. In an effort to reduce sample loss for potential future research, a nested project was conceived to assess whether collection and transfer of fresh faecal samples in the long-term storage tube at home is feasible, sample quality is affected, and acceptability of self-collection is favourable.

Larger cohorts would be preferable to address critical research questions such as PTB. However, with the high-frequency sampling and the overall cost of the study conduct and sample analysis, the researchers enrolled not more participants than deemed to be necessary to address the objectives of this project.


Proposals for collaborations and other ideas to enhance scientific output from data generated in this cohort are welcome. SMRU is part of the Mahidol Oxford Tropical Medicine Research Unit and follows data sharing policies, as published by the Bioethics and Engagement Department. Data from the MSP cohort will be made available through the Data Access Committee at Mahidol Oxford Tropical Medicine Research Unit. Data sharing policies are available at: An application form can be downloaded under this link. Gene expression data will be deposited on the NCBI Gene Expression Omnibus and data from 16S rRNA sequencing will be stored as a bioproject in NCBI for public Sequence Read archives.


We would like to express our gratitude to the cohort participants: without the contribution of the women following ANC visits at SMRU this work would not be possible. Moreover, we do acknowledge the continuous effort of staff in the recruitment centres for the excellent participant management and their pivotal role in data collection.



  • Collaborators The MSP cohort is a collaboration between Shoklo Malaria Research Unit, a field station of the Faculty of Tropical Medicine, Mahidol University (Bangkok, Thailand) that is part of the Mahidol-Oxford Research Unit; Sidra Medicine (Doha, Qatar), which is part of Qatar Foundation for Education, Science and Community Development; and the Swiss Tropical and Public Health Institute, an associated institute of the University of Basel (Basel, Switzerland).

  • Contributors All authors listed contributed substantially to this manuscript in accordance with the ICMJE criteria.

  • Funding This work was supported by Sidra Medicine, a member of the Qatar Foundation for Education, Science and Community Development. The Shoklo Malaria Research Unit is part of the Wellcome Trust Mahidol University Oxford Tropical Medicine Research Programme supported by the Wellcome Trust of Great Britain (Major Overseas Programme).

  • Map disclaimer The depiction of boundaries on the map(s) in this article does not imply the expression of any opinion whatsoever on the part of BMJ (or any member of its group) concerning the legal status of any country, territory, jurisdiction or area or of its authorities. The map(s) are provided without any warranty of any kind, either express or implied.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement Data are available on reasonable request. All data relevant to the study are included in the article. Deidentified data from the MSP cohort will be accessible through the Data Access Committee at Mahidol Oxford Tropical Medicine Research Unit. Gene expression data and data from 16S rRNA sequencing will be deposited on public platforms.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.