Article Text

Download PDFPDF

Nutritionists’ Health Study cohort: a web-based approach of life events, habits and health outcomes
  1. Luciana Gavilan Dias Folchetti1,
  2. Isis Tande da Silva1,
  3. Bianca de Almeida-Pititto2,
  4. Sandra Roberta G Ferreira3
  1. 1Department of Nutrition, School of Public Health, University of Sao Paulo, São Paulo, Brazil
  2. 2Department of Preventive Medicine, Federal University of Sao Paulo, São Paulo, Brazil
  3. 3Department of Epidemiology, School of Public Health, University of Sao Paulo, São Paulo, Brazil
  1. Correspondence to Professor Sandra Roberta G Ferreira; sandrafv{at}


Introduction Non-communicable chronic diseases (NCCDs) represent a burden for public health. Alongside the established cardiometabolic risk factors such as high blood pressure and disorders of glucose and lipid metabolism, living habits and nutritional status at different stages of life are seen as contributors to this scenario. Gut microbiota composition and subclinical inflammation have been pointed out as underlying mechanisms of NCCDs. Studies involving health professionals have brought relevant contributions to the knowledge about risk factors. Technological advances facilitate data collection and analysis for big samples. A web-based survey addressed to collect data from a cohort study, which is able to identify NCCDs risk factors, is highly desirable. The objective of the Brazilian Nutritionists’ Health Study (NutriHS) is to gather online information on early life events, daily habits, emergent cardiometabolic risk factors and health outcomes of a specific subset of the Brazilian population.

Methods and analysis NutriHS, developed at the School of Public Health—University of Sao Paulo, Brazil, is a research initiative that enrols undergraduates of nutrition courses from Brazilian universities and graduated volunteers. A web-based self-administered system was designed to collect health-related data. After fulfilling online questionnaires (socioeconomic, early life events and lifestyle data), participants are invited to a clinical visit for physical examination and laboratory procedures (blood sampling, faeces collection and body composition). At a 3-year interval, they will be invited to repeat similar procedures.

Ethics and dissemination The NutriHS research protocol was approved by the Institutional Ethics Committee and is providing promising data which contribute to the understanding of pathophysiological links between early life events, body composition, gut microbiota, and inflammatory and metabolic risk profile. The combination of a friendly tool with the innovative purposes of NutriHS offers a remarkable resource for testing hypotheses about mechanisms of nutrition-related diseases and further planning of preventive programmes in public health.

  • Nutritionists’ Health Study
  • web-based system
  • early-life events
  • gut microbiota
  • cardiometabolic risk factors

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See:

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • Low cost for development and potential of amplification of the web-based system.

  • Availability of self-reported and measured data which allow validation study.

  • Innovative data to be associated with early outcomes.

  • Population sample has skills to accurate answers self-reported technical questionnaires and potential high compliance in the long-term study.

  • Language-related limitation offered in the web-based system (only in Portuguese).


Numerous factors from intrauterine life and birth determine adult health and well-being. Monitoring of early life events and daily habits has provided tools to protect individual health from major damage and other threats. Health outcomes depend on a combination of genetic and environmental factors.

Delivery conditions, birth weight and early events in infancy have been involved in the genesis of future diseases.1–4 Such a relationship, initially raised by Barker,5 has been further confirmed,6–9 although underlying mechanisms are not yet completely understood. Recently, the types of delivery and child nutrition have been shown to influence the gut microbiota composition, which in turn modulates the immune system and the risk of diseases.10–13

Epidemiological and clinical studies have contributed towards identifying modifiable risk factors and deepening the understanding of the pathophysiology of non-communicable chronic diseases (NCCDs), such as obesity, type 2 diabetes mellitus and cardiovascular disease. These lifestyle-related morbidities represent a heavy burden for public health systems due to complications, increased mortality and high costs.14–16

Knowledge of risk factors for NCCDs, collected prospectively, is essential for enhancing evidence and planning interventions. Epidemiological studies involving health professionals have brought significant contributions to the knowledge about the role of behavioural factors in health outcomes.17 ,18 This kind of study often requires large population samples, followed for long periods, which generate a significant amount of data, high costs and the need for trained professionals.15–18 More commonly, health surveys have been based on paper-and-pencil questionnaires.

Printed questionnaire-based surveys are complex and time-consuming, and their quality depends on accuracy in data collecting, processing and uploading to create a consistent database before analysis. Despite extending study duration and representing a financial burden, these steps are crucial for adequate data analysis to achieve reliable conclusions.19 ,20

Technological advances can minimise errors and duplicated information, reduce costs, as well as provide the almost immediate construction of a database using bioinformatic tools.20 ,21 Multiple and diverse software for data analysis tailored to specific questions are available, but very few are designed specifically to investigate the role of early life events and daily habits.

The internet is a promising tool for overcoming some of the logistic and financial feasibility constraints in research, and is gaining considerable popularity.21 In contrast to other fields, such as in psychological and marketing research, in epidemiology, <1% of the reported studies are internet-assisted.14 Successful examples of large-scale studies in which enrolment and follow-up occur over the internet are the Millennium Cohort Study,22 the Nurses and Midwives e-Cohort Study,23 and the Danish Web-based Pregnancy Planning Study.24 In nutritional epidemiology, there is the NutriNet-Santé Study, a web-based prospective study launched in France in 2009.25

In middle-income countries, similar initiatives using the internet as a tool for data collection are scarce, and when considering the inclusion of health professionals they are inexistent. The development of web-based surveys able to identify environmental risk factors in developing countries with a wide geographic area and limited resources would be highly desirable. As far as we know, the use of self-administered online questionnaires for this kind of population has not yet been reported.

The purpose of the current study was to describe the methodology of the Nutritionists’ Health Study (NutriHS), a prospective web-based study developed at the School of Public Health, University of São Paulo (FSP/USP), Brazil, to gather information on early life events, daily habits, emergent cardiometabolic risk factors and health outcomes of a specific subset of the Brazilian population—undergraduates and graduates in nutrition. The outcome of major interest is excessive body weight and obesity-related morbidities.

Methods and analysis

Implementation of the NutriHS study included the following steps: (1) development of the web-based system; (2) collection of data related to early life events, lifestyle habits, anthropometry and body composition; (3) assessment of the cardiovascular profile based on traditional and emerging risk factors; (4) analysis of the association of early life events and lifestyle habits with the gut microbiota composition and cardiovascular risk factors.

The web-based system

With the support of the FSP/USP Informatics Center, a dedicated web-based online self-administered system ( was developed, named e-NutriHS, to collect health-related data based on previously validated printed questionnaires. Details on its development have been described elsewhere.26 In brief, the web-based system was built using free programming languages: HyperText Markup Language (HTML), Cascading Style Sheets (CSS) and JavaScript used to create a working dynamic website with a friendly user's interface, the Hypertext P reprocessor (PHP), server-side scripting language, used to transfer data to the NutriHS database; and Structured Query Language (SQL) is the database programming language used at the database management system, MySQL.

The colours for the NutriHS website—green and orange—refer to health, nutrition and youth. The logotype and slogan call attention to the need to increase knowledge on the role of nutrition for health.

The development also included skip patterns to hide questions (depending on the person's answer to a prior question), randomly organised questions, and validation checks providing additional information to simplify and help responding, pointing out questions that needed attention. The e-NutriHS data can be transferred to the Excel format, allowing further analyses. Since the database is located at the FSP/USP server, its security was guaranteed.

Prior to releasing e-NutriHS to participants, a prototype was created (26). Twenty volunteers, graduates at the FSP/USP who have been working in the health area, aged 25–60 years, tested the system and checked its ease of use, data entry and website design. On the basis of their opinions, a user-friendly version of e-NutriHS was released.

The website provides details on the study purposes, the team involved and contact information, and also represents an opportunity for learning with recent research reports from international literature. In order to increase awareness of NutriHS, a social network page on the internet was created (https:// with a direct link to the e-NutriHS homepage. This website plays an important role in stimulating dialogue among researchers and participants.

Study design, recruitment and inclusion

Given the growing recognition of the importance of the life course approach for understanding the determinants of NCCDs, NutriHS was designed to be a cohort study in which retrospective and prospective data are being collected. The study was designed to include all undergraduates and graduates from Brazilian Nutrition Colleges, aged ≥18 years. Undergraduates from nutrition courses and nutritionist volunteers were selected based on their potential skill to navigate the internet and on the expected high quality of data provided. Exclusion criteria are pregnancy and no access to the internet.

Volunteer recruitment was carried out using multiple strategies. Invitations were made to the university's staff, lectures for undergraduates or graduates were offered, news was published on the FSP/USP website and on the NutriHS social network page, and banners were placed in strategic places.

After signing the electronic informed consent, users are required to fill out personal data and to create a unique login and password. Then their access to questionnaires is released. Once questionnaires were completed, participants are instructed to schedule, on the e-NutriHS, a face-to-face visit at the Health Center for clinical examination and biological sampling. Recruitment started in March 2014 and this is planned to be completed by December 2016. After a 3-year interval, they will be invited to complete a similar set of questionnaires.

Questionnaires and variables of interests

The baseline set of questionnaires deals with socioeconomic, early life events and lifestyle data, including physical activity and eating habits. The order for filling out these questionnaires is chosen by the participant. Excessive body weight is a major outcome, which is being assessed using anthropometric measurements and densitometry. Other variables will be used as exposures, predictors, potential confounders or effect modifiers.

Demographic, socioeconomic lifestyle and clinical questionnaires

The structured questionnaires obtained data regarding skin colour, educational level, marital status, person(s) they live with, parents or guardians, family income, number of people in the house, work, work hours, use of medications, medical history, familial medical history, a restrictive diet, alcohol consumption, tobacco consumption, height and weight.

Early life events questionnaire

The contents of this questionnaire allow one to obtain retrospective information about the mothers of the participants, their pregnancies and data related to birth and childhood, as well as maternal health and educational level during the pregnancy period. Before filling the questionnaire, instructions are provided, recommending participants to ask their parents (mainly mothers) or to look at their birth registry to get specific information regarding gestational period and early life events.

Data collected are divided into two parts regarding mothers and participants. Mother's age at the participant's birth, educational level, occupation, height, weight before pregnancy, use of medications and pregnancy conditions, such as weight gain and health problems, are reported. Also, participants are asked to answer as to prematurity, twins, birth weight, breast feeding, time of breast feeding, time of introduction of formula, medications used, weight gain during childhood and adolescence, fruit and vegetable consumption, and physical activity during childhood.

Physical activity questionnaire

The short version of the International Physical Activity Questionnaire (IPAQ), validated for the Brazilian population,27 is being employed. Intensity of physical activity (walking, moderate and intense), frequency (per week) and duration are obtained. These data enable estimates of total physical activity as quantitative or categorical variables (low, moderate and high). A sedentary pattern has been assessed by time spent sitting.

Dietary assessment

Diet is assessed using a validated 102-item food frequency questionnaire28 regarding eating habits over the past year. Data inserted into e-NutriHS are immediately transformed into the equivalent food amounts for 1 day. The system is able to provide reports of macronutrients and micronutrients, based on the United States Department of Agriculture (USDA) National Nutrient Database for Standard Reference.

Clinical assessment

Completion of questionnaires is required in order to access the link for scheduling the face-to-face visit to the FSP/USP Public Health Center. A couple of days before the visit, an email with instructions for biological sampling and body composition determination is sent to each participant. This visit represents an opportunity to double-check previously collected selected data.

Anthropometry and blood pressure

Weight is obtained on a digital scale accurate to 100 g and height was measured using a fixed stadiometer to the nearest 0.5 cm. They are used to determine the body mass index, which was calculated as weight in kilograms divided by the square of the height in metres (kg/m2). Waist circumference is taken in centimetres at the midpoint between the lowest rib and the iliac crest. Body composition is being determined by dual-emission X-ray absorptiometry (DXA Lunar GE) that allows accurate estimates of body compartments, including visceral fat. Sitting blood pressure levels are measured using an automatic device, expressed in mm Hg.

Biochemical data

Blood samples are collected for plasma glucose and lipid profile, determined using the hexokinase method and enzymatic colorimetric assay, respectively. Aliquots are frozen at −80°C for further determinations of apolipoproteins, hormones and inflammatory markers. Insulin is determined by ELISA and high-sensitivity C reactive protein by immunochemistry. ELISA kits are also used for the measurements of cytokine concentrations. Insulin resistance will be estimated by the HOMA-IR index.29 Extra aliquots are stored for future analyses of interest to NutriHS.

Metagenomic analysis of gut microbiota

Participants are instructed to collect a faecal sample and refrigerate it before the visit. Aliquots are frozen until the metagenomic analysis. The pyrosequencing of 16S subunit ribosomal amplicons will be performed using next-generation sequencing technology. After obtaining the sequences, they are subjected to phylogenetic analysis to characterise the faecal microbiota. Bioinformatic procedures, comparing the sequences extracted from the faecal samples with standardised databases, allow determination of the abundance of bacteria belonging to phyla and genera.

Data and statistical analysis plan

NutriHS planned to prospectively collect data on emergent lifestyle-related risk factors of a specific subset of the healthy Brazilian population. For its cross-sectional phase, data analyses included the description of the population sample to be followed. Additionally, a validation substudy of the web-collected self-reported data is being conducted. A number of statistical analyses have been planned to test associations of knowledge in nutrition and changes in lifestyle, as well as in body composition and biochemical profile.

Analyses are being performed using IBM SPSS Statistics V.20.0. Collected measures and derived variables will be summarised using means and SD, medians and IQR, and/or frequencies. Student's t-test or one-way analysis of variance when comparing more than two groups will be used for continuous variables with normal distribution. Pearson's correlation coefficient will be employed to test correlations. Correspondent non-parametric tests will be used for non-normal distributed variables. For validation studies, agreement has been analysed by κ statistics and the Bland-Altman plot method. For multivariate analyses, generalised linear regression models or logistic regression for dichotomous variables will be used. For all tests, a p value <0.05 will be considered significant.

Ethics and dissemination


The substantiated opinions were registered by the numbers: Of. COEP 991.542/15 and Of. COEP 257.513/13.

When accessing e-NutriHS, participants must sign an electronic informed consent before the registration process. Individual privacy is respected and data will be stored in an e-NutriHS database following the regulations applied to security. All publications will respect confidentiality.

Discussion and dissemination

Analyses involving the issue cardiovascular risk factors and public health, anchored in population-based studies or specific population subsets, have been conducted in developed countries. Such studies are of great relevance for proposals and formulation of public policies tailored to local situations. Cohorts of health professionals, like the Nurses’ Health Study I and II ( and the Health Professionals Follow-up Study (https://, which included thousands of participants in North America, brought remarkable scientific contributions for identifying and intervening in major risk factors for NCCDs.17 ,18 One strong point of studies involving professionals is the high quality of the specific data collected.

The current Brazilian epidemiological scenario of increasing longevity and nutritional transition highlights the importance of obesity-related chronic diseases for mortality. Along with the traditional cardiometabolic risk factors, such as hypertension and impaired metabolism of glucose and lipids, lifestyle and nutritional status in different stages of life, including the intrauterine period, have been pointed out as relevant determinants of this situation.3 ,8 ,30 Scientific evidence has associated low birth weight, caesarean delivery and formula milk feeding with excessive body adiposity and type 2 diabetes mellitus in adult life.31–37 Recently, Brazil exhibited relatively high rates of undernutrition; therefore, mothers of the NutriHS participants might have come from adverse conditions. Since it has been reported that the second generation is the one that suffers the biggest health impact,38 ,39 our study represents an opportunity for investigating risk factors for chronic diseases or even their early outcomes.

The atherosclerotic process and insulin resistance markers, both directly associated with the immune system, and subclinical inflammation have been considered as emergent risk factors.40–42 NutriHS is providing promising data which may contribute towards deepening the understanding of pathophysiological links of early life events, body composition and inflammatory risk profile. The gut microbiome is meant to have a central role in mediating immune system alterations that predispose to a proinflammatory status and metabolic disorders.10 ,43 ,44 Studies about gut microbiota have fomented the hypothesis of dysbiosis early in the cycle of life, triggering mechanisms that favour body fat accumulation and insulin resistance in adult life.45 Associations of events during intrauterine life or at delivery with gut microbiota composition in childhood have been demonstrated.46–48 Despite the recognition that the microbiota profile results from an interaction between environmental and genetic factors,49 to the best of our knowledge, investigations about its association with early life events and emergent cardiometabolic risk factors in young adults have not yet been reported. The ability of eating habits to change the gut microbiota composition in NutriHS could help to elucidate how dietary factors influence cardiometabolic risk.

Young adulthood, when preventive strategies could be more effective, is a unique moment for identifying early metabolic disturbances. The NutriHS cohort, by following undergraduates and graduates from nutrition colleges prospectively, has a great potential to enhance knowledge regarding the role of behavioural factors, especially related to diet and lifestyle, and underlying mechanisms of NCCDs. High-quality data are expected to be obtained from this stratus of the young Brazilian population. The age range of the NutriHS participants may confer less risk of recall bias, due to proximity to infancy and to more reliable information provided by parents and guardians present in their lives.

Innovative technologies in nutritional epidemiology, such e-NutriHS, that deal with the entire process, from data gathering to analysis, are promising to overcome some of the logistical and financial feasibility constraints that can affect the conduction of such large-scale surveys. Furthermore, the broad reach of geographic areas can enhance participation rates. However, concerns about security, duplicate records and sampling issues (selection bias) of online surveys should be raised.14 ,21 Some of these concerns could be overcome, for example, through cryptography and security management of the database, screening scripts to check implausible answers and to detect and not allow saving duplicate registries.14 ,19 ,22 ,25 Recent studies have reinforced that web-based data collection represents a valid and suitable method.20 ,50 ,51

Therefore, the initiative of building a website dedicated to this study, tailored to our local situation, should be emphasised. In addition to producing reliable data, e-NutriHS proved to be easy to use, less costly and less time-consuming than paper-and-pencil collections. Furthermore, extra questionnaires can be included in the web system once the researchers decide to investigate additional concerns.

Since NutriHS implementation, users’ satisfaction seems to be positive. Participants have found the system easy and practical to use. From 1102 users (983 undergraduates and 119 graduates), the predominance of the female sex (94%) was expected and mean age of the sample is 25.1 (SD 7.4) years (unpublished data). Data on diet, physical activity and others are ready for preliminary analysis. This scenario is in contrast to many epidemiological studies, which require a long period to get results. Another advantage of e-NutriHS is related to the fact that participants receive quick feedback about their health conditions.

Taking into consideration all these advantages, we believe that NutriHS will contribute to elucidate the temporal sequence regarding some causal determinants and pathological effects, scarcely investigated in other cohort studies. Preliminary results (unpublished data) have fulfilled our expectation of establishing associations among events in the early stages of life, diet, gut microbiota and circulating biomarkers, reinforcing the hypothesis of intrauterine programming interacting with environmental exposures to influence the risk profile for certain NCCDs, particularly for cardiometabolic diseases.

In summary, we conclude that the description of the NutriHS methodology, developed at FSP/USP, Brazil, is a relevant initiative to facilitate research in subsets of populations. In addition, e-NutriHS proved to be a powerful and user-friendly tool for producing reliable information, which could also be used in the field of public health. The combination of a friendly tool with the innovative purposes of NutriHS offers a remarkable resource for testing hypotheses, clarifying mechanisms of nutrition-related diseases and further planning of preventive programmes.


This study is being supported by the School of Public Health, University of São Paulo. The authors thank Renata Vidonsky Luthold, Angélica MM Valente, Briana Duarte Teixeira, Bianca Sansão Waltrick, Priscila Koritar, Angélica Almeida Obara and Marle dos Santos Alvarenga for their contribution to this project.



  • Contributors LGDF, ITdS, BdA-P and SRGF designed the study; LGDF, ITdS, BdA-P and SRGF participated in the elaboration of the protocol. LGDF programmed the web-based system. LGDF and ITdS have been participating in the acquisition of data. LGDF, ITdS, BdA-P and SRGF drafted the manuscript. All authors have read and approved the final manuscript.

  • Competing interests None declared.

  • Patient consent Obtained.

  • Ethics approval The NutriHS study protocol and study documentation, including all questionnaires and the informed consent, were approved by the Institutional Ethics Committee, Committee for Ethics in Research, School of Public Health, University of Sao Paulo.

  • Provenance and peer review Not commissioned; externally peer reviewed.