Efficacy of deep learning methods for predicting under-five mortality in 34 low-income and middle-income countries

Adeyinka Emmanuel Adegbosin; Bela Stantic; Jing Sun

doi:10.1136/bmjopen-2019-034524

Article Text

PDF

XML

Public health

Original research

Efficacy of deep learning methods for predicting under-five mortality in 34 low-income and middle-income countries

http://orcid.org/0000-0001-5218-1008Adeyinka Emmanuel Adegbosin1,
Bela Stantic2,
http://orcid.org/0000-0002-0097-2438Jing Sun1

¹School of Medicine, Griffith University, Gold Coast, Queensland, Australia
²School of Information and Communication Technology, Griffith University, Nathan, Queensland, Australia

Correspondence to Dr Jing Sun; j.sun{at}griffith.edu.au

Abstract

Objectives To explore the efficacy of machine learning (ML) techniques in predicting under-five mortality (U5M) in low-income and middle-income countries (LMICs) and to identify significant predictors of U5M.

Design This is a cross-sectional, proof-of-concept study.

Settings and participants We analysed data from the Demographic and Health Survey. The data were drawn from 34 LMICs, comprising a total of n=1 520 018 children drawn from 956 995 unique households.

Primary and secondary outcome measures The primary outcome measure was U5M; secondary outcome was comparing the efficacy of deep learning algorithms: deep neural network (DNN); convolution neural network (CNN); hybrid CNN-DNN with logistic regression (LR) for the prediction of child’s survival.

Results We found that duration of breast feeding, number of antenatal visits, household wealth index, postnatal care and the level of maternal education are some of the most important predictors of U5M. We found that deep learning techniques are superior to LR for the classification of child survival: LR sensitivity=0.47, specificity=0.53; DNN sensitivity=0.69, specificity=0.83; CNN sensitivity=0.68, specificity=0.83; CNN-DNN sensitivity=0.71, specificity=0.83.

Conclusion Our findings provide an understanding of determinants of U5M in LMICs. It also demonstrates that deep learning models are more efficacious than traditional analytical approach.

machine learning
deep learning
random forest
under-five mortality
community child health
maternal medicine

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

https://doi.org/10.1136/bmjopen-2019-034524

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

The models were tested using a very large data sample, drawn from over 1 million households.
The survey used a cluster sampling approach and is representative of each country included.
Socioeconomic, political and cultural differences between the included countries may limit generalisability of the results.
The cross-sectional design of the study means we can only infer association and not causality.
Our study does not reflect subnational trends and patterns.

Introduction

Recent global estimates showed that 5.3 million under-five deaths occurred in 2018; this is equivalent to 15 000 deaths every day and 39 deaths per 1000 live births.1 A majority of the children who die before their fifth birthday live in sub-Saharan Africa and Southeast Asia; most of these deaths result from preventable and treatable causes.1 2 Although these estimates represent a significant improvement in under-five mortality (U5M) levels when compared with the levels in the early 1990s, ‘preventable death of one child is still too many’.1 2

High levels of U5M in low-income and middle-income countries (LMICs) is usually a syndromic feature of a weak health system,3 and U5MR is a key barometer of the state of a nation’s health system and an important impact measure that is reliant on health system input such as health financing, health workforce and infrastructure.3 4 These inputs in turn determine health service access, readiness, quality and safety and consequently influences coverage of interventions such as antenatal care coverage, postnatal care, demand for family planning satisfied, skilled birth attendance, care for childhood illnesses, nutritional supplementation, etc.4 5

Studies have shown that improving child survival requires engaging intricately with a host of child health determinants, including biological, environmental and socioeconomic factors such as level of maternal education, household income, environmental sanitation and hygiene.5–7 The framework of distal and proximate social, environmental and biological determinants was first described by Mosley and Chen.5 Unfortunately, many LMICs are constrained by limited finances and limited health budgets, and are unable to intervene on all of the determinants of child health at the same time.3 It is therefore increasingly important to identify the most important determinants to be prioritised and to determine the most pressing socioeconomic issues that can serve as a starting point for government and policy makers to focus on intervention strategy.

Furthermore, intervention measures need to be equity-oriented in order to be effective.8 Hence, disaggregated household level monitoring of coverage and impact indicators are crucial for informing policies and programmatic interventions in the sustainable development goal (SDG) era.9 It is important to understand the status of every child as against simply exploring global trends, in order to ‘leave no one behind’ and to ‘reach the furthest behind first’.10 In light of the SDG pledge, monitoring changes at household or community level may require new methodological approaches in engaging with the ‘big data’, which continues to be generated through ongoing household surveys such as the Demographic Health Survey (DHS) and Multiple Indicator Cluster Survey.11 12 An expansion of traditional analytical approach may be pertinent and key to effectively monitor health intervention coverage and impact. Machine learning (ML) techniques may represent a novel analytical approach to unravel previously unseen trends; these techniques expand on existing statistical approaches and use methods that are not based on a priori assumptions about the distribution of the data.13

Artificial intelligence (AI) described ‘as a scientific discipline rooted in mathematics, philosophy and computer science attempts to develop systems with properties of intelligence’.13 ML is a subdiscipline of AI, ‘where computer programs learn to solve new problems for which they weren’t explicitly programmed, by learning associations and patterns from example data’.13 ML deploys a broader set of statistical models than those traditionally used in medicine or public health. Example of such being deep learning models.13 AI and ML techniques broaden existing statistical models and offer additional tool sets to achieve public health milestones that may not have been previously feasible. For example, ML has been used for real-time surveillance of disease outbreak through social media data mining,14 AI have been used in large-scale evidence synthesis to guide health promotion and health policy.15 It is important to state that although AI offers new possibilities for targeted and personalised public health practice, its application must still be guided by social and structural determinants of health; this has also been highlighted by other AI researchers.16

In a report recently released by the United States Agency for International Development (USAID) centre for innovation and impact, on the use of AI in global health, AI-enabled population health was identified as one of AI use cases that could have the greatest impact on improving health quality, cost and access in LMICs.17

AI-enabled population health encompasses public health surveillance and prediction, population risk management, population health intervention selection and targeting.17 In this current study, we explored the efficacy of deep learning as a technique for population health surveillance and intervention targeting. Deep learning ‘discovers intricate structure in large data sets by using backpropagation algorithm to indicate how a machine should change its internal parameter used to compute representation in each layer from the representation in the previous layer’.18 Deep learning algorithms have shown excellent performance in genomics, proteomics, drug discovery, speech recognition, visual recognition, object detection and several other domains.18

There have been numerous empirical studies on the various applications of ML in hospital settings for prognostication,19 20 triage21 and prediction of mortality in the hospital setting.22 However, application of ML is yet to be demonstrated in population health studies, where it may represent a potential transformative tool.13 The objective of our study is to fill the gap on application of ML in population health studies, and other previously highlighted gaps. One of the previously highlighted gaps concerns the need to identify the most important determinants of U5MR. To explore these determinants, we employed a data-driven approach by using the random forest algorithm for feature selection, rather than using the traditional hierarchical approach for multivariate analysis, which tends to be highly user-driven and usually involving the development of conceptual frameworks that prejudges the relevance of a limited set of determinants (independent variables).23 Random forest is an efficient classification and regression algorithm that combines several randomised decision trees and aggregates their predictions. It is especially useful when the number of variables is larger than the number of observations.24

The random forest approach allows an unlimited number of variables or determinants to be incorporated into the model. The algorithm automatically tests several hypothesis and selects features that best predicts the outcome, based on information gained from each variable.20

Another gap is the need for new ways to gain insights and to unravel previously unseen trends in the prediction of U5M from disaggregated household level data. To fill this gap, we also compared the efficacy of deep learning algorithms: deep neural network (DNN); convolution neural network (CNN); hybrid CNN-DNN with logistic regression (LR) for classifying child survival, and for predicting age of death. We hypothesise that deep learning methods will outperform traditional methods such as LR in the prediction of U5M.

Finally, in this work, we make recommendations on ML implementation, and the new regulatory and ethical considerations for the use of novel ML techniques in public health.25

Methods

Data source and analytical tools

We conducted an analysis on DHS data from 34 LMICs. The DHS is a nationally representative household survey developed by the USAID in the 1980s.26 The survey provides data on fertility, family planning, maternal and child health, gender, HIV/AIDS, malaria and nutrition.27 In total, over 350 surveys have been carried out in over 90 countries.26 The survey uses a two-stage cluster sampling design, further details about the survey and its design are published elsewhere.27 Combined multicountry data for this study were obtained from the IPUMS-DHS portal.28 Combined DHS data were available for a total of 34 LMICs on the IPUMS-DHS database. We used all available data in these countries from 1987 to 2017 (see online supplementary table 1) . Permission to use data for all included countries was granted by the DHS programme. Analysis was conducted using Python software V.3.7. The programming codes used for the various analysis are accessible on Github using the following link: https://github.com/drulna/u5mr_predict

Supplemental material

[bmjopen-2019-034524supp001.pdf]

Patient and public involvement

There are no patients involved in this study.

Data preprocessing

Any real-world dataset needs preprocessing to convert it into a representation that can be used to train a model. This can heavily affect the model’s performance. This dataset had several irrelevant features, such as IPUMS identifiers created to merge multicountry data. We excluded 14 such features and included 41 features in the final model. Like many census data, the DHS data often contain variables with missing observations. All variables except place of residence (rural/urban) had some level of missingness which range from 5% to 60% of the observation in certain cases, we removed all variables on anthropometric measure due to significant missingness. We performed data preprocessing using the forward-fill approach to replace missing data. There exist multiple strategies that can be deployed to handle missing values,29 we tested other approaches and tested the models accordingly, only the forward-fill approach was found to provide reproducible and plausible outcomes . ‘Forward Fill’ strategy involves replacing every missing value with the next real values for each column. This clean and preprocessed data were used for the rest of the analysis.

Variables

Outcome variable

The outcome variable is the risk of death before the age of 5 years, measured as the duration of survival in months from birth.

Independent variables (model features)

The determinants included in the model can be broadly classified into maternal-level determinants, household socioeconomic characteristics and child-level determinants.

Maternal factors

These encompasses maternal behavioural and determinants within the reproductive care continuum, which includes duration of breast feeding, number of antenatal visits when the child was in utero, highest level of maternal education, administration of tetanus injection during pregnancy, provision of prenatal care by a skilled provider, delivery care provider, postpartum health check, unmet need for family planning, prenatal care, pregnancy wanted or not wanted.

Household socioeconomic factors

The household factors included are the household wealth index, the geographical location of the household (urban or rural) and who has final say of the woman’s health within the household.

Child-level factors

These include child’s postnatal check, sex of the child, oral polio vaccination, measles vaccination, diphtheria, pertussis and tetanus vaccination, BCG vaccination, age of the child and care for childhood illnesses such as diarrhoea and suspected symptoms of pneumonia. Survey-specific definition of all included determinants are published elsewhere.28

Feature selection

We use random forest to check feature importance with respect to its predictive power. figure 1 shows the feature importance (red bar) and variance of each tree in random forest (black vertical line). It can be observed that ‘duration of breast feeding’ has the most importance to predict a child’s death. However, there are some features that are of limited importance. We perform feature selection based on this information. We drop all features whose importance are <0.001, because we found that the accuracy of the classifier does not improve beyond this level, and adding the additional attributes only creates unnecessary additional computational overhead. In total, 29 features fell within our cut-off for feature importance and included in the final model. For comparing the utility of feature selection, we perform two experiments. One without feature selection (on all original 41 features) and one with feature selection (on selected 29 features).

Figure 1

Architecture of the deep neural network (DNN)-convolution neural network (CNN) ensemble model. FC, fully connected.

Model selection

We selected multivariate LR as an example of traditional model.20 Three deep learning techniques (DNN, CNN and DNN-CNN) were selected as modern ML approaches. For all the four models, we pose this problem as a multiclass problem, such that each value in the label is assigned an integer and then we binarize the output (ie, one-hot encoding). All categorical attributes are also converted to numerical, that is, dummy variables, by mapping each unique value to a number. After careful consideration, we concluded that the best ratio for training is 75% of the data, while the remaining 25% of the data are reserved for testing purposes. This choice is in line with literature and close to 80/20, which is quite a commonly used training/testing ratio, often referred to as the Pareto principle. We compare the performance of LR as a representative of traditional model, with three deep learning methods: DNN, CNN and hybrid CNN-DNN.

Model architecture

Deep neural networks

DNNs are a special kind of neural network with multiple hidden layers and usually hundreds of units in hidden layers. Each neuron of one layer is connected to every neuron of subsequent layer, also called fully connected (FC) layers. For each layer in DNN, a weight matrix is learnt. DNNs act as blackbox and can learn the data representation automatically with backpropagation of the error at final layer. A softmax layer is usually used to get final prediction for the class label.

Convolutional neural networks

CNN is a specific deep learning architecture that learns a filter instead of weight matrix. This filter is used to perform convolution with input data to get a feature map. This feature map can then be forwarded to a final softmax layer for prediction. A key advantage of using CNN over DNN is that it requires fewer parameters and less iterations to converge as only last layer is FC.

In our presentation, we give results for DNN, CNN and a hybrid of DNN and CNN. We show that the later gives the most optimal results, leveraging benefits of the two worlds.

Hybrid DNN-CNN ensemble model

In this model, the input is forwarded to two streams, where one represents DNN while the other CNN. As our input is one dimensional (1D), we use 1D CNN. With regard to DNN stream, the input is forwarded to an FC layer with 100 units. For non-linearity, the activation function ReLU is used which is defined as . This is followed by a batch normalisation (BN) and dropout (DO) layer to avoid feature co-adaptation. Then a second FC layer with 50 units is used to squash the information, which is again followed by BN and DO layers. The output of this layer is forwarded for concatenation with the output of CNN stream (figure 1).

Regarding the CNN stream, the input is forwarded to a 1D CNN layer with 128 filters and kernel/filter size of 2, with ReLU non-linear activations. The output is followed by a maxpooling to drop low information activations. This is followed by a BN and DO layer. The information is squashed into an FC layer with 50 units, which is again followed by BN and DO. Finally, the output is forwarded for concatenation with the output of DNN stream.

The combined features of both streams are then forwarded to a single FC layer with softmax activation, which results in class probabilities. The class label is assigned based on maximum probability. The detailed diagram of the architecture is shown as figure 1.

To optimise the hyperparameters such as optimizer, DO rate and learning rate, we used grid search. The available choices for hyperparameters and the selected value are given in table 1. To stop the training, we employed early stopping strategy where the training was stopped if the validation accuracy did not improve for 20 epochs. A checkpoint was created at the epoch where the validation accuracy showed improvement as compared with previous checkpoint. The choice of number of layers, number of neurons in each layer and number of filters in CNN was made empirically.

View this table:

Table 1

Hyperparameters choice and selected values through grid search

Model evaluation

We evaluated the performance of each model using a receiver operating characteristic (ROC) plot, we also derived the weighted precision, sensitivity (also known as recall), specificity, f1-score and area under the curve (AUC) for each model. The formula for calculating the performance metrics are as follows:

Precision = ; F1-score = ; Specificity =

The models were evaluated before and after feature selection. Analysis was initially conducted using all preselected variables. We thereafter optimised the various models based on empirical results from the random forest analysis. As this is a multiclass problem, the ROC plots and performance metrics are all based on micro-averages.

Results

Characteristics of the study population

A total population size of (n=1, 520 018) children drawn from 956 995 unique households were included in the study, the sample was drawn across 34 LMICs. The sample size drawn from each of the included countries is presented in online supplementary table 1. The mean age of the total children population is 1.89 (±1.40). Majority (n=1 100 211; 72.7 %) resides in rural areas. Just under half (n=636 882; 45.2%) were in the lowest two wealth quintile (Q1 and Q2). Majority (n=1 100 262; 73.2%) were uneducated or had only primary education, majority received some form of postnatal check, delivery care tetanus injection before birth and approximately two-third breast fed their children for >6 months (table 2). A total of n=111 907 (7.3%) under-five deaths were recorded survey-wide across all 34 countries. Nearly half, 48.9% (n=54 825) of these deaths were neonatal death.

View this table:

Table 2

Descriptive analysis of the study population

Feature importance

Overall, key determinants of U5MR include maternal factors such as duration of breast feeding, number of antenatal visits when the child was in utero, provision of maternal postnatal care by a skilled provider, highest level of maternal education, administration of tetanus injection during pregnancy, prenatal care provision by a skilled provider. Significant household socioeconomic factors include household wealth index and geographical location of the household. Time to child’s postnatal check was found to be the most significant child level determinant (figure 2).

Figure 2

Feature importance using random forest.

Model comparisons (before feature selection)

Comparison of the performance of the models before feature selection reveals that hybrid of CNN-DNN performs the best in terms of all metrics (sensitivity=0.68, specificity=0.83), while LR performs the worst (sensitivity=0.47, specificity=0.53) (table 3).

View this table:

Table 3

Performance comparison (without feature selection)

Figure 3 shows the ROC curves for all the classifiers. It shows that hybrid CNN-DNN model outperforms all other models.

Figure 3

Micro-average receiver operating characteristic (ROC) curve before feature selection. CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.

Model comparisons (after feature selection)

We found that feature selection does not improve the performance of LR. However, for all deep learning-based models, feature selection results in performance gain. The most performance gain is shown by CNN-DNN (sensitivity=0.71, specificity=0.83). CNN-DNN model performs the best out of all classifiers in both settings, that is, before feature selection and after feature selection (table 4).

View this table:

Table 4

Metrics comparison after feature selection

In figure 4, we present ROC curves for all the classifiers. It shows that hybrid CNN-DNN model remains the top performer of all the models.

Figure 4

Micro-average receiver operating characteristic (ROC) curve after feature selection. CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.

Discussion

A number of maternal-level, child-level and socioeconomic indicators were found to influence U5M. Duration of breast feeding was found to be a significant maternal-level determinant. Previous studies corroborate our findings, it has been shown that children breast fed for a longer duration have lower infectious disease morbidity and mortality, and better chance of survival than those who are breast fed for shorter periods, or not breast fed at all.30 Multiple studies have also shown that early initiation of breast feeding, and exclusive breast feeding reduces both neonatal and early infant mortality.30 31 In addition to breast feeding, several other factors within the continuum of essential obstetric care, such as antenatal care visits, postnatal care, delivery care and maternal tetanus immunisation were found to be significant predictors of U5M. These may partly be explained by our finding, which showed that nearly half of the mortality occurred during early neonatal life, which is in line with other previous studies.32 33 Several previous studies have shown that provision of essential obstetric care is vital for survival during the neonatal period.34 35 In addition, we found that the household wealth index was a slightly more important determinant compared with maternal level of education. This finding however contradicts the work of Fuchs et al, where they argued that mother’s education is the fundamental determinant of child mortality and is relatively more important than income level. They argued that education impacts the child’s health through better maternal health, increased health-specific knowledge, avoidance of traditional, harmful behaviours, greater economic resource as a consequence of education and general female empowerment.36 They however highlighted that other social scientists have often considered education and income as generally highly correlated and tend to be regarded as interchangeable indicators of socioeconomic status.36

The timing of the child’s postnatal check and the gender of the child were also found to be predictive of child’s survival. Postnatal check within 24 hours of birth have been shown to be crucial in identifying, managing or referring complications and ultimately in preventing child mortality.35

Our findings regarding the superiority of ML over traditional approaches such as LR in predictive analysis are also in line with findings elsewhere.20 37

This study however has some limitations. First, this is proof-of-concept cross-sectional study; hence, we can only draw inference on associations, and not on causality. Second, we did not measure change over time. Future studies should consider incorporating temporal data points, to draw inference on changes over time, and possibly causality. Finally, we did not explore individual country, regional and subgroup level variations and cannot conclude that the degree of association is the same across different countries and subgroups, due to differences in socioeconomic, geographical, cultural and political realities. Hence, future studies should consider disaggregating with stratifiers such as income, education and place of residence, to explore subgroup differences.

Recommendations for ML implementation, governance and ethics

Our recommendations regarding the implementation and regulation of ML are fourfold. First, there is a burgeoning risk that the adoption and benefits of ML may be imbalanced.38 High-income countries are beginning to increasingly adopt and benefit from deploying some of these novel technologies; therefore, there is the risk of extending the disparity between LMICs and high-income countries even further. To achieve equity in the implementation of this technology, there is a need for capacity building across board and collaborative use of technological resources between LMICs.

Second, regarding AI research governance and ethics (regulation), the capabilities of AI application in public health are not yet fully understood, and its application is still evolving. This implies that any regulatory attempt will effectively require understanding the capabilities of AI as a tool in public health and medicine. Like other medical research endeavours, the regulatory framework and ethical guidelines will have to evolve, as our understanding of the application of AI evolves. As such, we posit that there is a concordance between regulation, governance, research and development of AI technology. In the light of this, we suggest collaboration between research institutions, academic stakeholders, policy makers and regulatory authorities. There is a need to engage with all stakeholders across the spectrum of AI research, development and ethics.

Third, we believe that existing medical research ethical guidelines are highly applicable and cover several aspects of ML research. However, there is a need to strengthen regulatory aspects pertaining to data security and protection. The growth in the adoption of ML analytical techniques will usher an increase in the level of data transactions and with this, comes the potential risk of breaches to health data privacy. There are existing capabilities to re-identify anonymised data, using a few parameters within the data. Hence, regulatory efforts need to focus on data security, especially reducing the risks of data re-identification.

Fourth, as knowledge and application of AI continues to grow in leaps and bounds, and while regulatory efforts are still rudimentary and trying to catch up, we envisage a vacuum in governance, which will have to be filled. As such, there may be a need for the development and ratification of regulatory framework, which may be possible through the collaboration of multiple stakeholders.

Conclusions

This study demonstrates the superiority of ML as a tool for understanding previously unseen insights in large global health data. We have shown that ML algorithms such as random forest, may be more insightful than the user-dependent traditional hierarchical approach of testing a limited set of determinants for outcome prediction in multivariate analysis. Using random forest, we found that duration of breast feeding, household wealth index and level of maternal education are the most important determinants of U5MR. In addition, we also show that deep learning algorithms are more sensitive and specific for the prediction of U5MR and this finding may be applicable to other multivariate models, for data-rich population studies.

Going forward, the most important implication of this study is that if deep learning algorithms such as the one we describe in this study, are deployed in production in combination with spatial data, it is possible to identify and flag children who are most at risk and not likely to survive until the age of 5, such that necessary interventions can be targeted to communities where those children live. To the best of our knowledge, there are no existing studies that have investigated U5M, using a similar analytical approach.

Acknowledgments

The authors would like to acknowledge the contributions of Professor Hong Zhou and the team at UNICEF Office.

References

↵
1. World Health Organization
. Children: reducing mortality [Internet]. Facts sheet children: reduing mortality, 2018. Available: https://www.who.int/news-room/fact-sheets/detail/children-reducing-mortality [Accessed 5 Apr 2019].
↵
1. UNICEF
. One is too many Ending child deaths from pneumonia and diarrhoea [Internet]. UNICEF, 2016. Available: http://data.unicef.org/topic/child-health/pneumonia/ [Accessed 5 Apr 2019].
↵
1. Maruthappu M,
2. Ng KYB,
3. Williams C, et al
. Government health care spending and child mortality. Pediatrics 2015;135:e887–94.doi:10.1542/peds.2014-1600pmid:http://www.ncbi.nlm.nih.gov/pubmed/25733755
OpenUrl Abstract/FREE Full Text
↵
1. World Health Organization
. Indicator and monitoring framework for the global strategy for women’s, children’s and adolescents’ health (2016-2030), 2016.
↵
1. Mosley WH,
2. Chen LC
. An analytical framework for the study of child survival in developing countries. Popul Dev Rev 1984;10:25. doi:10.2307/2807954
↵
1. Feng XL,
2. Theodoratou E,
3. Liu L, et al
. Social, economic, political and health system and program determinants of child mortality reduction in China between 1990 and 2006: a systematic analysis. J Glob Health 2012;2:010405. doi:10.7189/jogh.02.010405pmid:http://www.ncbi.nlm.nih.gov/pubmed/23198134
OpenUrl PubMed
↵
1. Barros FC,
2. Victora CG,
3. Scherpbier R, et al
. Socioeconomic inequities in the health and nutrition of children in low/middle income countries. Rev Saude Publica 2010;44:1–16.doi:10.1590/s0034-89102010000100001pmid:http://www.ncbi.nlm.nih.gov/pubmed/20140324
OpenUrl CrossRef PubMed Web of Science
↵
1. Adegbosin AE,
2. Zhou H,
3. Wang S, et al
. Systematic review and meta-analysis of the association between dimensions of inequality and a selection of indicators of reproductive, maternal, newborn and child health (RMNCH). J Glob Health 2019;9:010429. doi:10.7189/jogh.09.010429pmid:http://www.ncbi.nlm.nih.gov/pubmed/31131102
OpenUrl PubMed
↵
1. Health Organization W
. Handbook on health inequality monitoring: with a special focus on low- and middle-income countries.
↵
1. UN department of economic and social affairs
. Leaving no one behind | UN DESA | United Nations Department of Economic and Social Affairs [Internet]. Available: https://www.un.org/development/desa/en/news/sustainable/leaving-no-one-behind.html [Accessed 8 Apr 2019].
↵
1. Corsi DJ,
2. Neuman M,
3. Finlay JE, et al
. Demographic and health surveys: a profile. Int J Epidemiol 2012;41:1602–13.doi:10.1093/ije/dys184pmid:http://www.ncbi.nlm.nih.gov/pubmed/23148108
OpenUrl CrossRef PubMed Web of Science
↵
1. Unicef, Bng-, -Noorani
. Monitoring the situation of children and women for 20 years.
↵
1. Panch T,
2. Szolovits P,
3. Atun R
. Artificial intelligence, machine learning and health systems. J Glob Health 2018;8:020303. doi:10.7189/jogh.08.020303pmid:http://www.ncbi.nlm.nih.gov/pubmed/30405904
OpenUrl PubMed
↵
1. Șerban O,
2. Thapen N,
3. Maginnis B, et al
. Real-Time processing of social media with sentinel: a syndromic surveillance system incorporating deep learning for health classification. Inf Process Manag 2019;56:1166–84.doi:10.1016/j.ipm.2018.04.011
OpenUrl
↵
1. Michie S,
2. Thomas J,
3. Johnston M, et al
. The human Behaviour-Change project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation. Implement Sci 2017;12:121.doi:10.1186/s13012-017-0641-5pmid:http://www.ncbi.nlm.nih.gov/pubmed/29047393
OpenUrl CrossRef PubMed
↵
1. Panch T,
2. Pearson-Stuttard J,
3. Greaves F, et al
. Artificial intelligence: opportunities and risks for public health. Lancet Digit Health 2019;1:e13–14.doi:10.1016/S2589-7500(19)30002-0
OpenUrl
↵
1. USAID’s Center for Innovation AND Impact
. Artificial intelligence in global health defining a collective path forward.
↵
1. LeCun Y,
2. Bengio Y,
3. Hinton G
. Deep learning. Nature 2015;521:436–44.doi:10.1038/nature14539pmid:http://www.ncbi.nlm.nih.gov/pubmed/26017442
OpenUrl CrossRef PubMed
↵
1. Weng SF,
2. Reps J,
3. Kai J, et al
. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One 2017;12:e0174944. doi:10.1371/journal.pone.0174944pmid:http://www.ncbi.nlm.nih.gov/pubmed/28376093
OpenUrl CrossRef PubMed
↵
1. Taylor RA,
2. Pare JR,
3. Venkatesh AK, et al
. Prediction of in-hospital mortality in emergency department patients with sepsis: a local big data-driven, machine learning approach. Acad Emerg Med 2016;23:269–78.doi:10.1111/acem.12876pmid:http://www.ncbi.nlm.nih.gov/pubmed/26679719
OpenUrl PubMed
↵
1. Horng S,
2. Sontag D,
3. Halpern Y, et al
. Undefined. creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning, 2017. Available: journals.plos.org
↵
1. Arya R,
2. Wei G,
3. McCoy JV, et al
. Decreasing length of stay in the emergency department with a split emergency severity index 3 patient flow model. Acad Emerg Med 2013;20:1171–9.doi:10.1111/acem.12249pmid:http://www.ncbi.nlm.nih.gov/pubmed/24238321
OpenUrl PubMed
↵
1. Victora CG,
2. Huttly SR,
3. Fuchs SC, et al
. The role of conceptual frameworks in epidemiological analysis: a hierarchical approach. Int J Epidemiol 1997;26:224–7.doi:10.1093/ije/26.1.224pmid:http://www.ncbi.nlm.nih.gov/pubmed/9126524
OpenUrl CrossRef PubMed Web of Science
↵
1. Biau G,
2. Scornet E
. A random forest guided tour. Test 2016;25:197–227.doi:10.1007/s11749-016-0481-7
OpenUrl
↵
1. Zandi D,
2. Reis A,
3. Vayena E, et al
. New ethical challenges of digital technologies, machine learning and artificial intelligence in public health: a call for papers. Bull World Health Organ 2019;97:2. doi:10.2471/BLT.18.227686
↵
1. Croft TN,
2. Marshall AMJ,
3. Allen CK
. Guide to DHS statistics 2018.
↵
1. Burgert CR and DP
. Linking DHS household and spa facility surveys: data considerations and Geospatial methods. DHS spatial analysis report, 2014.
↵
1. Boyle EH,
2. King M, MS
. IPUMS Demographic and Health Surveys: Version 6 [dataset]. IPUMS and ICF, 2018, 2018.
↵
1. Mckinney W
. Pandas: a foundational python library for data analysis and statistics.
↵
1. Victora CG,
2. Bahl R,
3. Barros AJD, et al
. Breastfeeding in the 21st century: epidemiology, mechanisms, and lifelong effect. Lancet 2016;387:475–90.doi:10.1016/S0140-6736(15)01024-7pmid:http://www.ncbi.nlm.nih.gov/pubmed/26869575
OpenUrl CrossRef PubMed
↵
1. Health NSG-TLG
. Undefined. timing of initiation, patterns of breastfeeding, and infant survival: prospective analysis of pooled data from three randomised trials. Elsevier, 2016.
↵
1. Sankar MJ,
2. Natarajan CK,
3. Das RR, et al
. When do newborns die? A systematic review of timing of overall and cause-specific neonatal deaths in developing countries. J Perinatol 2016;36 Suppl 1:S1–11.doi:10.1038/jp.2016.27pmid:http://www.ncbi.nlm.nih.gov/pubmed/27109087
OpenUrl PubMed
↵
1. Hug L,
2. Alexander M,
3. You D, et al
. National, regional, and global levels and trends in neonatal mortality between 1990 and 2017, with scenario-based projections to 2030: a systematic analysis. Lancet Glob Health 2019;7:e710–20.doi:10.1016/S2214-109X(19)30163-9pmid:http://www.ncbi.nlm.nih.gov/pubmed/31097275
OpenUrl CrossRef PubMed
↵
1. de Souza S,
2. Duim E,
3. Nampo FK
. Determinants of neonatal mortality in the largest international border of Brazil: a case-control study. BMC Public Health 2019;19:1304. doi:10.1186/s12889-019-7638-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/31619198
OpenUrl PubMed
↵
1. Langlois Étienne V,
2. Miszkurka M,
3. Zunzunegui MV, et al
. Inequities in postnatal care in low- and middle-income countries: a systematic review and meta-analysis. Bull World Health Organ 2015;93:259–70.doi:10.2471/BLT.14.140996pmid:http://www.ncbi.nlm.nih.gov/pubmed/26229190
OpenUrl CrossRef PubMed
↵
1. Fuchs R,
2. Pamuk E,
3. Lutz W, et al
. Education or wealth: which matters more for reducing child mortality in developing countries? Vienna Yearbook of Population Research. 2010;8:175–99.doi:10.1553/populationyearbook2010s175
OpenUrl
↵
1. Panesar SS,
2. D’Souza RN,
3. Yeh F-C, et al
. Machine learning versus logistic regression methods for 2-year mortality prognostication in a small heterogeneous glioma database. bioRxiv 2018;472555.
↵
1. Schwab K
. The Fourth Industrial Revolution, by Klaus Schwab | World Economic Forum [Internet]. Available: https://www.weforum.org/about/the-fourth-industrial-revolution-by-klaus-schwab [Accessed 5 May 2019].

Footnotes

Contributors AEA conceptualised the study, conducted the data extraction, analysed the data and wrote the first draft of the manuscript. JS contributed to the conceptualisation of the study, critically edited and proofread the document. BS proofread the document.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Patient consent for publication Not required.
Ethics approval Permission to use the data from all included countries was granted by Measure DHS. Ethics approval exemption was granted for use of this secondary data by the Griffith University Human Research Ethics Committee.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data may be obtained from a third party and are not publicly available. The datasets generated and analysed during the current study are available subject to permission from the DHS programme, in the (IPUMS-DHS) repository (https://www.idhsdata.org/idhs/index.shtml).

[1] ↵
World Health Organization
. Children: reducing mortality [Internet]. Facts sheet children: reduing mortality, 2018. Available: https://www.who.int/news-room/fact-sheets/detail/children-reducing-mortality [Accessed 5 Apr 2019].

[2] World Health Organization

[3] ↵
UNICEF
. One is too many Ending child deaths from pneumonia and diarrhoea [Internet]. UNICEF, 2016. Available: http://data.unicef.org/topic/child-health/pneumonia/ [Accessed 5 Apr 2019].

[4] UNICEF

[5] ↵
Maruthappu M,
Ng KYB,
Williams C, et al
. Government health care spending and child mortality. Pediatrics 2015;135:e887–94.doi:10.1542/peds.2014-1600pmid:http://www.ncbi.nlm.nih.gov/pubmed/25733755
OpenUrl Abstract/FREE Full Text

[6] Maruthappu M,

[7] Ng KYB,

[8] Williams C, et al

[9] ↵
World Health Organization
. Indicator and monitoring framework for the global strategy for women’s, children’s and adolescents’ health (2016-2030), 2016.

[10] World Health Organization

[11] ↵
Mosley WH,
Chen LC
. An analytical framework for the study of child survival in developing countries. Popul Dev Rev 1984;10:25. doi:10.2307/2807954

[12] Mosley WH,

[13] Chen LC

[14] ↵
Feng XL,
Theodoratou E,
Liu L, et al
. Social, economic, political and health system and program determinants of child mortality reduction in China between 1990 and 2006: a systematic analysis. J Glob Health 2012;2:010405. doi:10.7189/jogh.02.010405pmid:http://www.ncbi.nlm.nih.gov/pubmed/23198134
OpenUrl PubMed

[15] Feng XL,

[16] Theodoratou E,

[17] Liu L, et al

[18] ↵
Barros FC,
Victora CG,
Scherpbier R, et al
. Socioeconomic inequities in the health and nutrition of children in low/middle income countries. Rev Saude Publica 2010;44:1–16.doi:10.1590/s0034-89102010000100001pmid:http://www.ncbi.nlm.nih.gov/pubmed/20140324
OpenUrl CrossRef PubMed Web of Science

[19] Barros FC,

[20] Victora CG,

[21] Scherpbier R, et al

[22] ↵
Adegbosin AE,
Zhou H,
Wang S, et al
. Systematic review and meta-analysis of the association between dimensions of inequality and a selection of indicators of reproductive, maternal, newborn and child health (RMNCH). J Glob Health 2019;9:010429. doi:10.7189/jogh.09.010429pmid:http://www.ncbi.nlm.nih.gov/pubmed/31131102
OpenUrl PubMed

[23] Adegbosin AE,

[24] Zhou H,

[25] Wang S, et al

[26] ↵
Health Organization W
. Handbook on health inequality monitoring: with a special focus on low- and middle-income countries.

[27] Health Organization W

[28] ↵
UN department of economic and social affairs
. Leaving no one behind | UN DESA | United Nations Department of Economic and Social Affairs [Internet]. Available: https://www.un.org/development/desa/en/news/sustainable/leaving-no-one-behind.html [Accessed 8 Apr 2019].

[29] UN department of economic and social affairs

[30] ↵
Corsi DJ,
Neuman M,
Finlay JE, et al
. Demographic and health surveys: a profile. Int J Epidemiol 2012;41:1602–13.doi:10.1093/ije/dys184pmid:http://www.ncbi.nlm.nih.gov/pubmed/23148108
OpenUrl CrossRef PubMed Web of Science

[31] Corsi DJ,

[32] Neuman M,

[33] Finlay JE, et al

[34] ↵
Unicef, Bng-, -Noorani
. Monitoring the situation of children and women for 20 years.

[35] Unicef, Bng-, -Noorani

[36] ↵
Panch T,
Szolovits P,
Atun R
. Artificial intelligence, machine learning and health systems. J Glob Health 2018;8:020303. doi:10.7189/jogh.08.020303pmid:http://www.ncbi.nlm.nih.gov/pubmed/30405904
OpenUrl PubMed

[37] Panch T,

[38] Szolovits P,

[39] Atun R

[40] ↵
Șerban O,
Thapen N,
Maginnis B, et al
. Real-Time processing of social media with sentinel: a syndromic surveillance system incorporating deep learning for health classification. Inf Process Manag 2019;56:1166–84.doi:10.1016/j.ipm.2018.04.011
OpenUrl

[41] Șerban O,

[42] Thapen N,

[43] Maginnis B, et al

[44] ↵
Michie S,
Thomas J,
Johnston M, et al
. The human Behaviour-Change project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation. Implement Sci 2017;12:121.doi:10.1186/s13012-017-0641-5pmid:http://www.ncbi.nlm.nih.gov/pubmed/29047393
OpenUrl CrossRef PubMed

[45] Michie S,

[46] Thomas J,

[47] Johnston M, et al

[48] ↵
Panch T,
Pearson-Stuttard J,
Greaves F, et al
. Artificial intelligence: opportunities and risks for public health. Lancet Digit Health 2019;1:e13–14.doi:10.1016/S2589-7500(19)30002-0
OpenUrl

[49] Panch T,

[50] Pearson-Stuttard J,

[51] Greaves F, et al

[52] ↵
USAID’s Center for Innovation AND Impact
. Artificial intelligence in global health defining a collective path forward.

[53] USAID’s Center for Innovation AND Impact

[54] ↵
LeCun Y,
Bengio Y,
Hinton G
. Deep learning. Nature 2015;521:436–44.doi:10.1038/nature14539pmid:http://www.ncbi.nlm.nih.gov/pubmed/26017442
OpenUrl CrossRef PubMed

[55] LeCun Y,

[56] Bengio Y,

[57] Hinton G

[58] ↵
Weng SF,
Reps J,
Kai J, et al
. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One 2017;12:e0174944. doi:10.1371/journal.pone.0174944pmid:http://www.ncbi.nlm.nih.gov/pubmed/28376093
OpenUrl CrossRef PubMed

[59] Weng SF,

[60] Reps J,

[61] Kai J, et al

[62] ↵
Taylor RA,
Pare JR,
Venkatesh AK, et al
. Prediction of in-hospital mortality in emergency department patients with sepsis: a local big data-driven, machine learning approach. Acad Emerg Med 2016;23:269–78.doi:10.1111/acem.12876pmid:http://www.ncbi.nlm.nih.gov/pubmed/26679719
OpenUrl PubMed

[63] Taylor RA,

[64] Pare JR,

[65] Venkatesh AK, et al

[66] ↵
Horng S,
Sontag D,
Halpern Y, et al
. Undefined. creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning, 2017. Available: journals.plos.org

[67] Horng S,

[68] Sontag D,

[69] Halpern Y, et al

[70] ↵
Arya R,
Wei G,
McCoy JV, et al
. Decreasing length of stay in the emergency department with a split emergency severity index 3 patient flow model. Acad Emerg Med 2013;20:1171–9.doi:10.1111/acem.12249pmid:http://www.ncbi.nlm.nih.gov/pubmed/24238321
OpenUrl PubMed

[71] Arya R,

[72] Wei G,

[73] McCoy JV, et al

[74] ↵
Victora CG,
Huttly SR,
Fuchs SC, et al
. The role of conceptual frameworks in epidemiological analysis: a hierarchical approach. Int J Epidemiol 1997;26:224–7.doi:10.1093/ije/26.1.224pmid:http://www.ncbi.nlm.nih.gov/pubmed/9126524
OpenUrl CrossRef PubMed Web of Science

[75] Victora CG,

[76] Huttly SR,

[77] Fuchs SC, et al

[78] ↵
Biau G,
Scornet E
. A random forest guided tour. Test 2016;25:197–227.doi:10.1007/s11749-016-0481-7
OpenUrl

[79] Biau G,

[80] Scornet E

[81] ↵
Zandi D,
Reis A,
Vayena E, et al
. New ethical challenges of digital technologies, machine learning and artificial intelligence in public health: a call for papers. Bull World Health Organ 2019;97:2. doi:10.2471/BLT.18.227686

[82] Zandi D,

[83] Reis A,

[84] Vayena E, et al

[85] ↵
Croft TN,
Marshall AMJ,
Allen CK
. Guide to DHS statistics 2018.

[86] Croft TN,

[87] Marshall AMJ,

[88] Allen CK

[89] ↵
Burgert CR and DP
. Linking DHS household and spa facility surveys: data considerations and Geospatial methods. DHS spatial analysis report, 2014.

[90] Burgert CR and DP

[91] ↵
Boyle EH,
King M, MS
. IPUMS Demographic and Health Surveys: Version 6 [dataset]. IPUMS and ICF, 2018, 2018.

[92] Boyle EH,

[93] King M, MS

[94] ↵
Mckinney W
. Pandas: a foundational python library for data analysis and statistics.

[95] Mckinney W

[96] ↵
Victora CG,
Bahl R,
Barros AJD, et al
. Breastfeeding in the 21st century: epidemiology, mechanisms, and lifelong effect. Lancet 2016;387:475–90.doi:10.1016/S0140-6736(15)01024-7pmid:http://www.ncbi.nlm.nih.gov/pubmed/26869575
OpenUrl CrossRef PubMed

[97] Victora CG,

[98] Bahl R,

[99] Barros AJD, et al

[100] ↵
Health NSG-TLG
. Undefined. timing of initiation, patterns of breastfeeding, and infant survival: prospective analysis of pooled data from three randomised trials. Elsevier, 2016.

[101] Health NSG-TLG

[102] ↵
Sankar MJ,
Natarajan CK,
Das RR, et al
. When do newborns die? A systematic review of timing of overall and cause-specific neonatal deaths in developing countries. J Perinatol 2016;36 Suppl 1:S1–11.doi:10.1038/jp.2016.27pmid:http://www.ncbi.nlm.nih.gov/pubmed/27109087
OpenUrl PubMed

[103] Sankar MJ,

[104] Natarajan CK,

[105] Das RR, et al

[106] ↵
Hug L,
Alexander M,
You D, et al
. National, regional, and global levels and trends in neonatal mortality between 1990 and 2017, with scenario-based projections to 2030: a systematic analysis. Lancet Glob Health 2019;7:e710–20.doi:10.1016/S2214-109X(19)30163-9pmid:http://www.ncbi.nlm.nih.gov/pubmed/31097275
OpenUrl CrossRef PubMed

[107] Hug L,

[108] Alexander M,

[109] You D, et al

[110] ↵
de Souza S,
Duim E,
Nampo FK
. Determinants of neonatal mortality in the largest international border of Brazil: a case-control study. BMC Public Health 2019;19:1304. doi:10.1186/s12889-019-7638-8pmid:http://www.ncbi.nlm.nih.gov/pubmed/31619198
OpenUrl PubMed

[111] de Souza S,

[112] Duim E,

[113] Nampo FK

[114] ↵
Langlois Étienne V,
Miszkurka M,
Zunzunegui MV, et al
. Inequities in postnatal care in low- and middle-income countries: a systematic review and meta-analysis. Bull World Health Organ 2015;93:259–70.doi:10.2471/BLT.14.140996pmid:http://www.ncbi.nlm.nih.gov/pubmed/26229190
OpenUrl CrossRef PubMed

[115] Langlois Étienne V,

[116] Miszkurka M,

[117] Zunzunegui MV, et al

[118] ↵
Fuchs R,
Pamuk E,
Lutz W, et al
. Education or wealth: which matters more for reducing child mortality in developing countries? Vienna Yearbook of Population Research. 2010;8:175–99.doi:10.1553/populationyearbook2010s175
OpenUrl

[119] Fuchs R,

[120] Pamuk E,

[121] Lutz W, et al

[122] ↵
Panesar SS,
D’Souza RN,
Yeh F-C, et al
. Machine learning versus logistic regression methods for 2-year mortality prognostication in a small heterogeneous glioma database. bioRxiv 2018;472555.

[123] Panesar SS,

[124] D’Souza RN,

[125] Yeh F-C, et al

[126] ↵
Schwab K
. The Fourth Industrial Revolution, by Klaus Schwab | World Economic Forum [Internet]. Available: https://www.weforum.org/about/the-fourth-industrial-revolution-by-klaus-schwab [Accessed 5 May 2019].

[127] Schwab K

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Strengths and limitations of this study

Introduction

Methods

Data source and analytical tools

Supplemental material

Patient and public involvement

Data preprocessing

Variables

Outcome variable

Independent variables (model features)

Maternal factors

Household socioeconomic factors

Child-level factors

Feature selection

Model selection

Model architecture

Deep neural networks

Convolutional neural networks

Hybrid DNN-CNN ensemble model

Model evaluation

Results

Characteristics of the study population

Feature importance

Model comparisons (before feature selection)

Model comparisons (after feature selection)

Discussion

Recommendations for ML implementation, governance and ethics

Conclusions

Acknowledgments

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password